mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 16:15:57 +08:00

Files

T

wassname 4393cceefd initial: ML debugging folklore skill

Deep research to uplift LLMs for ML debugging, opinionated by source
selection. Distilled from Schulman, Jones, Rahtz, Goodfellow, CS231n,
FSDL, and more. Includes runnable diagnostic scripts and LLM-specific
anti-patterns.

Author: wassname (https://github.com/wassname)

2026-03-06 10:11:30 +08:00

431 B

Raw Permalink Blame History

Source: https://old.reddit.com/r/reinforcementlearning/comments/5hereu/ Title: "The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides) Fetched-via: Reddit JSON API (limit=500, depth=10) Fetch-status: verbatim

"The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides)

Posted by: u/gwern | Score: 5 | 0 comments

Link: http://rll.berkeley.edu/deeprlcourse/docs/nuts-and-bolts.pdf

431 B Raw Permalink Blame History

"The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides)

Comments

431 B

Raw Permalink Blame History