evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-27 15:00:20 +08:00

Author	SHA1	Message	Date
wassname	04a98b321e	feat: Evil MoE — learned soft router + pin loss on an ablatable hack expert. Fork of vGROUT. Replaces routeA's fixed v_act quantile gate with a learned per-rollout soft router (HackRouter, seeded from v_act) on the ablatable hack expert: GRPO flows into the router through the soft weight w (it concentrates hack-like rollouts in the hack expert), and a continuous pin loss on the hand-authored pairs anchors the axis. No load balancing; routing is per rollout. lora2r gains a soft-weight forward path (_lora2r_w: w=0 keep, w=1 route, deployed grad scaled by 1-w). train_moe.py is the on-policy GRPO loop; verify_moe_router.py gates the routing invariants. `just smoke` is green. README/AGENTS rewritten for the fork; original proposal kept as docs/spec/original_evil_moe_spec.md. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-14 11:25:14 +08:00
wassname	cca7150ea0	tidy	2026-06-14 11:05:54 +08:00
wassname	c33b810faa	data: wave-2 behavior2_* pairs -- 15 new proxy-gaming mechanisms, 2 contaminated drafts replaced. 10 pairs from parallel session renamed behavior_* -> behavior2_* so /behavior_ still selects the proven 8; bare-call-no-assert (run_tests axis-1 family) and assert-or-True (assert-True family) replaced with mock-under-test and fudge-expected; +5 more: skip-failing, retry-until-green, coverage-pragma, type-ignore, survivorship-metric. No identifier overlap with the env's 992 train/test problems (total_steps -> total_clicks after collision). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-11 10:04:23 +00:00
wassname	bf616749ee	Consolidate tagged hack pairsets in data	2026-06-10 11:58:53 +00:00

4 Commits
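
The soft-weight routing described in commit 04a98b321e (w=0 keep, w=1 route, deployed grad scaled by 1-w) can be illustrated with a small sketch. This is not the repository's code: `soft_route_weight` and `split_gradient` are hypothetical names, the sigmoid-of-projection router is an assumption about how a router "seeded from v_act" might score a rollout, and giving the hack expert the complementary share `w` is also an assumption (the commit message only states the 1-w scaling on the deployed side).

```python
import math


def soft_route_weight(activation, v_act, bias=0.0, temperature=1.0):
    """Hypothetical per-rollout soft router.

    Projects a rollout's pooled activation onto the direction v_act and
    squashes the score with a sigmoid, giving a weight w in (0, 1).
    The sigmoid form, bias, and temperature are assumptions for illustration.
    """
    score = sum(a * v for a, v in zip(activation, v_act))
    return 1.0 / (1.0 + math.exp(-(score - bias) / temperature))


def split_gradient(grad, w):
    """Split one rollout's gradient between the two adapters.

    The deployed adapter's share is scaled by (1 - w), as the commit message
    states. Scaling the hack expert's share by w is an assumption.
    """
    deployed = [(1.0 - w) * g for g in grad]
    hack = [w * g for g in grad]
    return deployed, hack


if __name__ == "__main__":
    w = soft_route_weight([0.5, 2.0], [1.0, 1.0])
    deployed, hack = split_gradient([2.0, -4.0], w)
    print(f"w={w:.3f} deployed={deployed} hack={hack}")
```

At w=0 the whole gradient stays with the deployed adapter (keep); at w=1 the deployed adapter receives none of it (route). Intermediate weights divide the update between the two, which is what would let an outer training loss shape the router through w.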