ml-debug

mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 16:00:43 +08:00

Files

T

wassname a602ea5a0e rl: quote Spinning Up (Achiam) on silent failure and bug-first debugging

Spinning Up as a Deep RL Researcher was only a bare code link; it's the
canonical RL-researcher guide and its debugging advice is gold. Cache the
rigour/debugging sections verbatim and quote the sharpest lines in the RL
sub-skill: "broken RL code almost always fails silently", "if it doesn't work,
assume there's a bug", "measure everything ... you can't tell it's broken if
you can't see that it's breaking", and test on more than one env. Add to RL
sources.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-02 21:04:55 +08:00

SKILL.md

rl: quote Spinning Up (Achiam) on silent failure and bug-first debugging

2026-06-02 21:04:55 +08:00