ml-debug

mirror of https://github.com/wassname/ml-debug.git synced 2026-06-27 18:24:28 +08:00

Author	SHA1	Message	Date
wassname	bb1a6bc61c	feat(ml_debug): lead with judgment gates over the symptom-lookup encyclopedia The skill was thorough but failed to instill debugging taste: an agent would pattern-match a symptom-table row to a fix and ship a guess, because the behaviour-changing material sat 550 lines down. Promote three gates to the top: - "Before you debug: calibrate" -- you're likely OOD in research code; the failure mode is overconfidence/impatience; the tables are a menu to widen the search, never lookup-and-apply. - "The debugging loop (judgment, not a checklist)" -- collect clues, hold a few competing hypotheses scaled to the problem, sanity-check with the likely/subtle/null triplet (shared vocab with research-journal), run the cheapest discriminating observation, then act. - "When stuck, read a working implementation" -- promoted from a buried Part 7.3 one-liner; extract the algorithm-done-right, the engineering tricks the paper omits, and proven hyperparams; rank candidates by trust signal. Collapse duplicated advice to pointers; de-bold Part 6.4 (8 bolded openers -> a plain list). Net +10 lines, bold markers 112 -> 100. Verified by a blind comprehension-by-inference quiz (5 cold-reader models, OLD vs NEW): NEW 9.6/10 vs OLD 6.8/10, with the gap localized to the two added sections (read-working-impl 5/5 vs 0/5; tables-as-menu 2.0 vs 1.6) while untouched sections tied -- ruling out a "reads nicer" halo. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-01 10:11:36 +08:00
wassname	fde5ac62fd	name	2026-04-09 05:09:25 +08:00
wassname	ced4edc200	feat(ml_debug): add Karpathy recipe + nanochat evidence, update-ratio diagnostic Add 3 new evidence files from modern open-source sources: - karpathy_recipe_training_nn_2019.md: Karpathy's training recipe blog post - nanochat_deepwiki_llm_pretraining_2026.md: 320+ HP sweeps for GPT-2-scale pretraining - sanh_simple_considerations_hf_2021.md: HuggingFace NLP debugging notes Add update-to-data ratio diagnostic to refs/diagnostics.md (target ~1e-3). Add LLM pretraining gap note to SKILL.md intro linking the new sources. Add tanh saturation % to logging checklist.	2026-03-10 05:32:37 +08:00
wassname	7ac7aacac7	fix(ml_debug): address review feedback - Fix stale Part 2 cross-references to link to rl/SKILL.md - Add McCandlish + Slavv back to parent Sources (cited in Part 7) - Add back-links from refs/ files to parent SKILL.md	2026-03-06 13:59:48 +08:00
wassname	70c28f06ac	refactor(ml_debug): extract grep patterns and diagnostics to refs/ Moved 6.1 (static analysis grep patterns) and 6.2 (diagnostic code snippets) to refs/static_analysis.md and refs/diagnostics.md. Triage tree (6.3) stays in main with references to the ref files. ml_debug/SKILL.md reduced from 7229w to 5093w (~30% from original).	2026-03-06 13:54:37 +08:00
wassname	7f34f26a5c	refactor(ml_debug): extract RL debugging into rl/ sub-skill Part 2 (RL-Specific Debugging) + RL-specific sources moved to ml_debug/rl/SKILL.md as a sub-skill, following the pinn/ precedent. Parent SKILL.md reduced from 9158w to 7229w (~21%). General sources (Goodfellow, CS231n, Tobin, Ng) kept in parent.	2026-03-06 13:36:29 +08:00
wassname	7a9c667aa7	chore: add wassname attribution to description, gitignore dlbooks	2026-03-06 12:17:50 +08:00
wassname	463c8fdbbc	fix: apply Gemini review fixes (device kwarg, gradcheck requires_grad, torch prefix) Review: Gemini 3.1 Pro approved. 3 fixes applied: - pinn/SKILL.md: PchipFunction torch.tensor missing device=h.device (GPU crash) - SKILL.md: gradcheck needs .requires_grad_(True) on doubled inputs - SKILL.md: loss surface pseudocode now has torch. prefix + indexing='ij'	2026-03-06 12:15:37 +08:00
wassname	4393cceefd	initial: ML debugging folklore skill Deep research to uplift LLMs for ML debugging, opinionated by source selection. Distilled from Schulman, Jones, Rahtz, Goodfellow, CS231n, FSDL, and more. Includes runnable diagnostic scripts and LLM-specific anti-patterns. Author: wassname (https://github.com/wassname)	2026-03-06 10:11:30 +08:00

9 Commits