mirror of
https://github.com/wassname/ml-debug.git
synced 2026-06-27 17:16:20 +08:00
ced4edc200
Add 3 new evidence files from modern open-source sources: - karpathy_recipe_training_nn_2019.md: Karpathy's training recipe blog post - nanochat_deepwiki_llm_pretraining_2026.md: 320+ HP sweeps for GPT-2-scale pretraining - sanh_simple_considerations_hf_2021.md: HuggingFace NLP debugging notes Add update-to-data ratio diagnostic to refs/diagnostics.md (target ~1e-3). Add LLM pretraining gap note to SKILL.md intro linking the new sources. Add tanh saturation % to logging checklist.
18 lines
955 B
Markdown
18 lines
955 B
Markdown
# ML Debugging Folklore
|
|
|
|
Practitioner knowledge for debugging ML systems, curated and synthesized by [wassname](https://github.com/wassname). Opinionated by source selection -- I picked sources I trust (Schulman, Goodfellow, CS231n, ...) and had an LLM extract the most relevant information for debugging ML systems.
|
|
|
|
## Use as a Claude skill
|
|
|
|
```
|
|
/skills add https://github.com/wassname/ml_debug
|
|
```
|
|
|
|
Or paste `SKILL.md` into your system prompt / context when debugging.
|
|
|
|
## What's here
|
|
|
|
- **[SKILL.md](SKILL.md)** -- the main artifact. Load into an LLM agent's context as a debugging skill. Parts 1-5 are reference knowledge; Part 6 is a runnable triage protocol (grep patterns, diagnostic snippets, decision tree); Part 7 is debugging mental models and practitioner priors.
|
|
|
|
- **[docs/evidence/](docs/evidence/)** -- frozen local copies of source material (blog posts, talks, papers, reddit threads). Claims in SKILL.md link back to exact quotes here.
|