mirror of
https://github.com/wassname/ml-debug.git
synced 2026-06-27 17:31:04 +08:00
4393cceefd
Deep research to uplift LLMs for ML debugging, opinionated by source selection. Distilled from Schulman, Jones, Rahtz, Goodfellow, CS231n, FSDL, and more. Includes runnable diagnostic scripts and LLM-specific anti-patterns. Author: wassname (https://github.com/wassname)
13 lines
431 B
Markdown
13 lines
431 B
Markdown
Source: https://old.reddit.com/r/reinforcementlearning/comments/5hereu/
|
|
Title: "The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides)
|
|
Fetched-via: Reddit JSON API (limit=500, depth=10)
|
|
Fetch-status: verbatim
|
|
|
|
# "The Nuts and Bolts of Deep RL Research" (Schulman December 2016 slides)
|
|
|
|
**Posted by:** u/gwern | Score: 5 | 0 comments
|
|
|
|
Link: http://rll.berkeley.edu/deeprlcourse/docs/nuts-and-bolts.pdf
|
|
|
|
## Comments
|