evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-29 21:25:04 +08:00

Author	SHA1	Message	Date
wassname	b616970e42	fix plot integrity: drop n=28 hack_s fallback in train-vs-deploy series A vanilla seed (s43) lacked the held-out deploy eval, so its train series fell back to the noisy n=28 per-step hack_s while other seeds used the n=64 eval. Averaging mixed estimators fabricated a vanilla train-vs-deploy gap that does not exist (lie-factor). Now: train series reuses the knob-off eval only (nan if absent -> seed drops from the mean), and missing eval columns normalise to nan so absent==all-nan. Regenerated all figures from logs. The canonical train_vs_deploy_60 (has hk_on) is unchanged; sub4/longrun byproducts now show train==deploy honestly (no knob-on data to split). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 03:21:48 +00:00
wassname	0645ae2dd2	fig:longrun: rebuild from job84 route2 + job97 fixed vanilla (no collapse) Old figure paired route2 (job 84) with job 85 vanilla, whose step-88 'collapse' was a hot-preset artifact. Job 97 re-ran vanilla-200 gentle and stays coherent. New pairing: route2 holds deploy hack at 0; vanilla rises to ~0.32 (onset ~step 40); route2 solve ends higher (0.61 vs 0.47). Caption now flags the remaining optimizer mismatch (route2 hot / vanilla gentle, both beta=0) and TODOs the matched beta=1e-5 regen (jobs 100/101). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 02:18:34 +00:00
wassname	24fa924c8d	plot: 2x2 train(knob-on) vs deploy(knob-off) x arm figure The A4 framing in one figure: vanilla train==deploy (cheat in the weights), route2 train HACKS while deploy is clean (cheat in the deletable knob). parse_log now keeps the raw train series (hack_train/solve_train) before the deploy substitution. New fig: dyn_longrun_200_train_deploy.png. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-02 23:53:08 +00:00
wassname	e00292860f	results: commit longrun A4 fig + CSV data source (force-add, out/ is gitignored) Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-02 23:19:29 +00:00

4 Commits