mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 15:00:40 +08:00

T

wassname 8ee980d62f diagnostics: add NaN-poisoning leakage tracer + Karpathy backprop-to-input check; README citation

NaN poisoning: inject NaN where info must not come from (future/test/labels), run the real pipeline, assert past outputs stay finite. Documents false negatives (pandas skipna, nanmean) and false positives (softmax rows, batch stats). Backprop-to-input is its gradient dual for inside the model; quote already frozen in docs/evidence/karpathy_recipe_training_nn_2019.md.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-11 10:18:51 +08:00

docs

folklore: promote Spinning Up to main; add a Research-taste section

2026-06-02 21:08:49 +08:00

pinn

name

2026-04-09 05:09:25 +08:00

refs

diagnostics: add NaN-poisoning leakage tracer + Karpathy backprop-to-input check; README citation

2026-06-11 10:18:51 +08:00

rl: quote Spinning Up (Achiam) on silent failure and bug-first debugging

2026-06-02 21:04:55 +08:00

.gitignore

chore: fix .gitignore (dlbooks path, *_log.md pattern)

2026-03-06 12:22:22 +08:00

README.md

diagnostics: add NaN-poisoning leakage tracer + Karpathy backprop-to-input check; README citation

2026-06-11 10:18:51 +08:00

SKILL.md

diagnostics: add NaN-poisoning leakage tracer + Karpathy backprop-to-input check; README citation

2026-06-11 10:18:51 +08:00

README.md

wassname's ML Debugging Folklore

In an attempt to upskill the machine learning debugging on AI coding assistants (and humans), I've collected high quality sources on how to debug machine learning projects, focusing on the mindset and the "taste". When I started ML I went searching for discussions on best practices, and started a few discussions of my own and they helped me a lot, over the years I've collected good ones. I hope they can help others, as well as help in auto research setups. This intro is human written, and the below is AI written with human guidance.

Use as a Claude skill

/skills add https://github.com/wassname/ml_debug

Or paste SKILL.md into your system prompt / context when debugging.

What's here

SKILL.md -- the main artifact. Load into an LLM agent's context as a debugging skill. Leads with the mindset (calibrate, mental models, general debugging tricks, and reading a working implementation when stuck), then a folklore section of sourced quotes, then an LLM-agent playbook (debugging loop, triage menu, anti-patterns). Deeper one-off tricks (loss-surface analysis, stuck-metric diagnosis, sweep reliability) live in refs/.
docs/evidence/ -- frozen local copies of source material (blog posts, talks, papers, reddit threads). Claims in SKILL.md link back to exact quotes here.

Citation

@misc{wassname2026mldebug,
  title = {ML Debugging Folklore: A Practitioner Debugging Skill for LLM Agents},
  author = {Michael J. Clark},
  year = {2026},
  url = {https://github.com/wassname/ml_debug/}
}