mirror of
https://github.com/wassname/ml-debug.git
synced 2026-06-27 16:15:57 +08:00
4393cceefd
Deep research to uplift LLMs for ML debugging, opinionated by source selection. Distilled from Schulman, Jones, Rahtz, Goodfellow, CS231n, FSDL, and more. Includes runnable diagnostic scripts and LLM-specific anti-patterns. Author: wassname (https://github.com/wassname)
431 B
431 B
Source: https://old.reddit.com/r/reinforcementlearning/comments/5hereu/ Title: "The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides) Fetched-via: Reddit JSON API (limit=500, depth=10) Fetch-status: verbatim
"The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides)
Posted by: u/gwern | Score: 5 | 0 comments
Link: http://rll.berkeley.edu/deeprlcourse/docs/nuts-and-bolts.pdf