evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-27 20:21:41 +08:00

Files

T

wassname 34a2eec704 viz: floor->ceiling as two normalized panels (best vs control vs reference)

Rework per feedback: hack and solve are not opposites, so they get separate
floor->ceiling axes (each 0=floor..1=ceiling) rather than sharing a zero -- this
also stops solve (range ~0.13-0.22) being squished next to hack (0-0.61).
Minimal: routeV per-token (best) vs random-V (direction control) vs the SGTM
gradient-routing paper placed on the same floor->ceiling % axis (approx, LM task).

Reads: hack suppression 93% best / 84% control / ~98% reference (9pp = direction
signal); solve gained +17% / -17% / ~95% (far from ceiling -- model barely learns
to solve in 60 steps). Moved out/plots -> out/figs.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-09 09:55:03 +00:00

figs

viz: floor->ceiling as two normalized panels (best vs control vs reference)

2026-06-09 09:55:03 +00:00

pairsets

fix: rename 4 canonical LeetCode function names in authored/clean pairsets

2026-06-09 09:23:33 +00:00

pools

fix: ship smoke fixtures so the gate runs on a fresh clone

2026-06-05 07:13:33 +00:00

vhack

fix: ship smoke fixtures so the gate runs on a fresh clone

2026-06-05 07:13:33 +00:00