mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 15:00:40 +08:00

T

wassname 715164416b loop: add likelihood-ratio test selection, path bisection, falsifiers, pseudocode

- triplet now carries a prior + cheapest falsifier (Check:) per hypothesis
- discriminating-test step: forward-predict each hypothesis, prefer where
  predictions diverge (strong vs weak evidence) instead of just "discriminating"
- new step: bisect the forward/backward path to localize where it breaks
- compact pseudocode summary of the whole loop
- resolve FIXME: drop references to the non-public research-journal skill

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-02 12:06:30 +08:00

docs

docs(ml_debug): annotate EMNLP 2018 NLP code tutorial; note sparse Adam embedding bug

2026-03-10 05:48:36 +08:00

pinn

name

2026-04-09 05:09:25 +08:00

refs

feat(ml_debug): expand nanochat evidence, add lec4 diagnostics file

2026-03-10 05:38:33 +08:00

name

2026-04-09 05:09:25 +08:00

.gitignore

chore: fix .gitignore (dlbooks path, *_log.md pattern)

2026-03-06 12:22:22 +08:00

README.md

feat(ml_debug): add Karpathy recipe + nanochat evidence, update-ratio diagnostic

2026-03-10 05:32:37 +08:00

SKILL.md

loop: add likelihood-ratio test selection, path bisection, falsifiers, pseudocode

2026-06-02 12:06:30 +08:00

README.md

ML Debugging Folklore

Practitioner knowledge for debugging ML systems, curated and synthesized by wassname. Opinionated by source selection -- I picked sources I trust (Schulman, Goodfellow, CS231n, ...) and had an LLM extract the most relevant information for debugging ML systems.

Use as a Claude skill

/skills add https://github.com/wassname/ml_debug

Or paste SKILL.md into your system prompt / context when debugging.

What's here

SKILL.md -- the main artifact. Load into an LLM agent's context as a debugging skill. Parts 1-5 are reference knowledge; Part 6 is a runnable triage protocol (grep patterns, diagnostic snippets, decision tree); Part 7 is debugging mental models and practitioner priors.
docs/evidence/ -- frozen local copies of source material (blog posts, talks, papers, reddit threads). Claims in SKILL.md link back to exact quotes here.