evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-07-04 05:51:46 +08:00

Author	SHA1	Message	Date
wassname	8daf58d25e	figs: a5 vanilla->route arrows, equiv0->approx0, skip degenerate train_deploy, prune orphans - a5_generalisation: connectors -> arrows (baseline->ours direction, shows the drop and the stdout solve-cost honestly). - equiv0 -> approx0 everywhere: these are finite-sample estimates, not identically 0. - plot_train_vs_deploy skips when train==deploy for every run (no knob-ON contrast); fixes the 'can't see train' longrun/sub4 figures (they had no hk_on data). - Prune 9 orphan figure sets not referenced in paper or blog (regenerable on demand); keep the 3 referenced + a5 + train_vs_deploy_60_train_deploy. All 4 CSVs committed. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 04:08:58 +00:00
wassname	a9523c9cb8	fix overlay label collisions: common right-gutter anchor + leaders End-labels sat on the line termini (2-arm figs) and piled up bottom-left on ragged-length multi-arm overlays (substrate, where arms end at different steps). Now all labels anchor at one gutter x with a leader fanning back to each line's actual end, y-de-collided. Added right margin so the gutter is clear. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 03:31:26 +00:00
wassname	504922a3d6	fix collision: lift 'deploy hack =0' off the y=0 line in train_vs_deploy The solid-red deploy line ran straight through the annotation text (tufte collision test). Move it into the empty band above the flat line (axes y=0.12). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 03:25:49 +00:00
wassname	b616970e42	fix plot integrity: drop n=28 hack_s fallback in train-vs-deploy series A vanilla seed (s43) lacked the held-out deploy eval, so its train series fell back to the noisy n=28 per-step hack_s while other seeds used the n=64 eval. Averaging mixed estimators fabricated a vanilla train-vs-deploy gap that does not exist (lie-factor). Now: train series reuses the knob-off eval only (nan if absent -> seed drops from the mean), and missing eval columns normalise to nan so absent==all-nan. Regenerated all figures from logs. The canonical train_vs_deploy_60 (has hk_on) is unchanged; sub4/longrun byproducts now show train==deploy honestly (no knob-on data to split). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-05 03:21:48 +00:00
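
Commit b616970e42 describes two rules: a missing eval column normalises to all-nan, and a seed with nan drops out of the mean instead of falling back to a different estimator. The sketch below illustrates that idea in plain Python. It is not the repository's code: the function names, the seed labels s41 and s42, and all numeric values are hypothetical, chosen only for illustration.

```python
import math

def normalise_eval(column, n_steps):
    """Return the eval column, or an all-nan column when it is absent.

    No fallback to another estimator: absent is treated the same as all-nan.
    """
    if column is None:
        return [math.nan] * n_steps
    return list(column)

def mean_over_seeds(series_by_seed):
    """Per-step mean across seeds, skipping nan entries.

    A step where every seed is nan yields nan.
    """
    n_steps = max(len(s) for s in series_by_seed.values())
    out = []
    for t in range(n_steps):
        vals = [s[t] for s in series_by_seed.values()
                if t < len(s) and not math.isnan(s[t])]
        out.append(sum(vals) / len(vals) if vals else math.nan)
    return out

# Hypothetical data: s43 has no held-out eval, so it contributes nothing.
raw = {"s41": [0.5, 0.25], "s42": [0.25, 0.75], "s43": None}
series = {seed: normalise_eval(col, 2) for seed, col in raw.items()}
mean = mean_over_seeds(series)
```

With these made-up values, s43 normalises to all-nan and the mean is taken over s41 and s42 only, so every seed in the average uses the same estimator.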

4 Commits
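
The label layout described in commit a9523c9cb8 (one common gutter x, labels de-collided in y, a leader back to each line's actual end) can be sketched as follows. This is a minimal illustration under assumptions, not the repository's implementation: the function names, the greedy push-up strategy, and the sample coordinates are all hypothetical.

```python
def decollide(ys, min_gap):
    """Greedy y de-collision: walk labels bottom-up and push each one up
    until it sits at least min_gap above the previous label.

    Returns the adjusted y positions in the original input order.
    """
    order = sorted(range(len(ys)), key=lambda i: ys[i])
    placed = list(ys)
    prev = None
    for i in order:
        if prev is not None and placed[i] < prev + min_gap:
            placed[i] = prev + min_gap
        prev = placed[i]
    return placed

def label_layout(line_ends, gutter_x, min_gap):
    """Anchor every label at gutter_x and pair it with its line end.

    line_ends: list of (x_end, y_end), one per arm; arms may end at
    different steps. Returns a list of (line_end, label_anchor) pairs,
    each pair being the two endpoints of a leader.
    """
    label_ys = decollide([y for _, y in line_ends], min_gap)
    return [((x, y), (gutter_x, ly))
            for (x, y), ly in zip(line_ends, label_ys)]

# Hypothetical ragged-length arms: two end close together in y.
ends = [(60, 0.0), (45, 0.5), (30, 4.0)]
leaders = label_layout(ends, gutter_x=66, min_gap=1.0)
```

Here the second label is pushed from y=0.5 to y=1.0 to clear the first, while the third stays at its line's end height. Each pair can then be drawn as a leader with any plotting library, provided the right margin is widened enough to keep the gutter clear.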