mirror of
https://github.com/wassname/evil_MoE.git
synced 2026-06-27 20:52:18 +08:00
ff82fbb940
The routing arms' benefit shows on the DEPLOYED model (quarantine deleted).
Prefer the dense per-step proxy hk_abl/slv_abl (every step,
rollout_ablate_frac>0) over the sparse held-out hk_dep eval for the plotted
hack_s/gt_s curve; fall back to hk_dep for runs that predate the proxy.

- parse hk_abl/slv_abl; routing+routing2 substitute it (else hk_dep) into
  hack_s/gt_s
- classify/ARM_ORDER/ARM_COLORS recognise routing2
- gate cos cols (cin_t/cin_s) by presence: vanilla/routing2 lack them, so
  parse and panels skip them instead of KeyError (also fixes a pre-existing
  vanilla crash)

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>
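
A rough sketch of the substitution and gating described above, not the actual code from this commit. The column names (hk_abl, slv_abl, hk_dep, hack_s, gt_s, cin_t, cin_s) and the arm names come from the message; the function names, the dict-per-step row shape, and the use of None for a missing value are assumptions made for illustration. The message names only hk_dep as the fallback, so the sketch leaves gt_s unchanged when slv_abl is absent.

```python
# Hypothetical helpers; only the column and arm names come from the commit message.
ROUTING_ARMS = {"routing", "routing2"}
COS_COLS = ("cin_t", "cin_s")


def first_present(row, keys):
    """Return the first non-None value among keys, or None if none is set."""
    for key in keys:
        value = row.get(key)
        if value is not None:
            return value
    return None


def deployed_scores(row, arm):
    """Pick the (hack_s, gt_s) pair to plot for one parsed log row."""
    hack, gt = row.get("hack_s"), row.get("gt_s")
    if arm in ROUTING_ARMS:
        # Prefer the dense per-step proxy, else the sparse held-out eval.
        substitute = first_present(row, ("hk_abl", "hk_dep"))
        if substitute is not None:
            hack = substitute
        # Assumption: with no slv_abl, keep the logged gt_s as it is.
        solve = row.get("slv_abl")
        if solve is not None:
            gt = solve
    return hack, gt


def present_cos_cols(rows):
    """List only the cos columns that appear in at least one row."""
    return [col for col in COS_COLS if any(col in row for row in rows)]
```

Looking up optional columns with `.get` and filtering the panel list by presence is what lets runs without cin_t/cin_s (vanilla, routing2) skip those panels rather than raise KeyError.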