ml_debug

mirror of https://github.com/wassname/ml_debug.git synced 2026-06-27 01:00:14 +08:00

Author	SHA1	Message	Date
wassname	5fca5ad2b2	Refresh Schulman cache anchors after transcript rewrite Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-25 10:31:39 +08:00
wassname	b8c3ffcf11	gpt5.5/fable	2026-06-12 09:30:25 +08:00
wassname	3e28a950e9	docs: clarify competing-worlds debugging loop	2026-06-12 06:53:36 +08:00
wassname	8b9a1d62ed	docs: resolve ml-debug TODO references	2026-06-12 06:52:38 +08:00
wassname	966f948d36	docs: refine numerical and scheduler debugging guidance	2026-06-12 06:35:29 +08:00
wassname	30ac76053e	chore: drop links to deleted/tombstone gists (repo is canonical now) Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-11 16:55:04 +08:00
wassname	0837f27f08	fix: companion gist link pointed at the wrong gist Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-11 16:41:50 +08:00
wassname	8cd3c61050	folklore: tuning playbook, Domingos, Bekman loss spikes, Ng error analysis; LLM-judge bias appendix - SKILL.md: 3 new entries (exploration-over-exploitation + nuisance HPs, test-set contamination, loss-spikes-mean-bad-data-pocket) and an Ng 100-misclassified-examples quote under inspect-the-data - refs/llm_judges.md: position/verbosity/self-preference biases (Zheng, Wang 66/80 flip, Panickssery) + mitigation checklist from verdict docs - Lones pitfalls linked as the exhaustive 36-item do/don't checklist - 6 new frozen evidence files; Hamel evals link in further reading Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-11 15:30:41 +08:00
wassname	fb753d093e	restructure: quotes-first SKILL.md, synthesized playbook split out SKILL.md is now folklore only: verbatim practitioner quotes ordered most-general-first, transformer/LLM fine-tuning entries in their own section, minimal context, links and footnotes. New sources: unsloth, axolotl (+training stability), HF course ch8.4, Bekman debug_utils (evidence frozen in docs/evidence/). The synthesized material (mental models, priors, symptom tables, agent loop, triage, anti-patterns) moves to PLAYBOOK.md, framed as menus of hypotheses rather than authoritative diagnoses. Made-up symptom tables no longer sit next to sourced quotes. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-11 14:33:32 +08:00

9 Commits