ml-debug

wassname/ml-debug


Mirror of https://github.com/wassname/ml-debug.git, synced 2026-06-27 16:15:57 +08:00

Commit Graph


Author: wassname
SHA1: a602ea5a0e
Date: 2026-06-02 21:04:55 +08:00

rl: quote Spinning Up (Achiam) on silent failure and bug-first debugging

Spinning Up as a Deep RL Researcher was only a bare code link; it's the
canonical RL-researcher guide and its debugging advice is gold. Cache the
rigour/debugging sections verbatim and quote the sharpest lines in the RL
sub-skill: "broken RL code almost always fails silently", "if it doesn't work,
assume there's a bug", "measure everything ... you can't tell it's broken if
you can't see that it's breaking", and test on more than one env. Add to RL
sources.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

1 Commit