evil_MoE/docs/spec at ea4f4ee657959f4ef2e376acaa5cfd8ebdaa1ac2

wassname/evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-28 00:43:53 +08:00

wassname 8158adb543 refactor: route2 quarantine = scale-matched delta_S_hack, rip out 33M LoRA

The distinct-basis A_q/B_q LoRA (~33M params at rank-16) gave the quarantine a
~100x capacity edge over delta_S, so routing-everything-there was the low-
resistance path: qE pinned ~0.97 (energy into the thrown-away knob) while the
deployed delta_S learned nothing (job 54). The cause was capacity imbalance, not
the routing gate (calibrated-tau already separated hack/clean, hkgap>0).

Consolidate to one adapter type: the quarantine is now delta_S_hack, the second
diagonal in the same frozen SVD basis, shape [r], capacity-matched to delta_S,
zeroed at deploy. route2's calibrated-tau gate parks the flagged rollouts' grad
into delta_S_hack.grad (like proj.py's route parks its subspace projection);
delta_S keeps the unflagged. Both diagonals train at one shared lr.
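
The routing described above can be sketched in plain Python. This is a minimal illustration, not the repository's code. It assumes the adapted weight is U diag(S + delta_S + delta_S_hack) V^T in the frozen SVD basis, that a rollout counts as flagged when its suspicion score exceeds tau, and that per-rollout gradients are summed. The helper names (route_grads, sgd_step, effective_diag) and all numbers are hypothetical.

```python
# Hypothetical sketch: only delta_S, delta_S_hack and tau are names from the
# commit message; every helper and value below is illustrative.

def route_grads(per_rollout_grads, suspicion, tau):
    """Hard-route each rollout's gradient to exactly one diagonal.

    per_rollout_grads: one length-r gradient per rollout
    suspicion: one score per rollout (assumed output of a calibrated gate)
    tau: threshold; suspicion > tau means "flagged" (an assumption)
    """
    r = len(per_rollout_grads[0])
    grad_main = [0.0] * r  # would be accumulated into delta_S.grad
    grad_hack = [0.0] * r  # would be accumulated into delta_S_hack.grad
    for g, s in zip(per_rollout_grads, suspicion):
        # Hard gate: the whole rollout goes to one knob, with no soft mixing.
        target = grad_hack if s > tau else grad_main
        for i in range(r):
            target[i] += g[i]
    return grad_main, grad_hack


def sgd_step(param, grad, lr):
    """Plain SGD update; both diagonals use the same shared lr."""
    return [p - lr * g for p, g in zip(param, grad)]


def effective_diag(S, delta_S, delta_S_hack, deploy):
    """Diagonal of the adapted weight; the quarantine is zeroed at deploy."""
    hack = [0.0] * len(S) if deploy else delta_S_hack
    return [s + d + h for s, d, h in zip(S, delta_S, hack)]


S = [3.0, 2.0, 1.0]             # frozen singular values, rank r = 3
delta_S = [0.0, 0.0, 0.0]       # deployed diagonal, shape [r]
delta_S_hack = [0.0, 0.0, 0.0]  # quarantine diagonal, same shape [r]

grads = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 4.0]]
suspicion = [0.1, 0.9, 0.2]     # only the second rollout is flagged
g_main, g_hack = route_grads(grads, suspicion, tau=0.5)

lr = 0.5
delta_S = sgd_step(delta_S, g_main, lr)
delta_S_hack = sgd_step(delta_S_hack, g_hack, lr)

train_diag = effective_diag(S, delta_S, delta_S_hack, deploy=False)
deploy_diag = effective_diag(S, delta_S, delta_S_hack, deploy=True)
```

In this sketch the flagged rollout changes only delta_S_hack, so zeroing the quarantine at deploy removes its effect while the updates from unflagged rollouts stay in delta_S. Both knobs have r entries, which is the capacity matching the message argues for.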

Removed: A_q/B_q params, v_act + extract_v_act, the act-mask arm (a shared
diagonal can't be per-token gated), route2_mask / route2_quarantine_rank /
route2_quar_lr_scale knobs, the separate quar optimizer group. Arm name
routing2_{act,grad} -> routing2. v_grad refresh extracts from delta_S (main)
with the quarantine ablated.

SGTM check: their gradient routing uses a hard detach on capacity-matched
reserved dims, no soft/tanh/sigmoid gate -- balance is the fix, not gating.

Smoked clean: tau/hkgap/qE render, ||delta_S_hack||>0 assert passes, exit 0.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-01 02:52:02 +00:00

20260525_distill_cosine_probe.md

spec(distill_probe): Phase 1 done (UAT 4/4), Phase 2 candidates R5-R7

2026-05-25 10:22:19 +00:00

20260525_review_T5.md

spec: reject T5 mixed-policy design after external review

2026-05-25 10:26:33 +00:00

20260527_code_review.md

v_hack v2: top-k + S magnitudes + runtime suspicion gate + per-source cin

2026-05-27 06:39:05 +00:00

20260528_cross_mechanism_v_hack.md

wip

2026-05-28 12:44:20 +00:00

20260528_g2_g3_checkpoint_selection.md

docs

2026-05-29 05:42:28 +00:00

20260529_gradient_routing_and_env_split.md

feat: T8 run-cell + regen-dynamics recipes; spec T5 done, T8 in progress

2026-05-30 00:52:14 +00:00

20260530_code_review.md

docs: review outputs + figs; drop stale Qwen3.5-0.8B svd cache

2026-05-31 00:00:40 +00:00

20260530_faithful_multi_loophole_env.md

feat: object-attribute sentinel + exhaustive non-overlap matrix

2026-05-30 10:15:36 +00:00

20260530_out_dir_reorg.md

docs+chore: out/ reorg scheme (queue-gated) + archive dead _OLD_step_format dirs

2026-05-30 02:43:10 +00:00

20260530_plan_review.md

docs: review outputs + figs; drop stale Qwen3.5-0.8B svd cache

2026-05-31 00:00:40 +00:00

20260530_refactor_code_review.md

rewards: robust strict oracle (review fixes) — SystemExit guard around test calls + whitelist __strict_eq

2026-05-30 05:48:24 +00:00

20260530_requeue_manifest.md

reorg: out/ sorted by datatype (vhack/ pools/ runs/ vhack_grads/ figs/)

2026-05-30 03:52:24 +00:00

20260530_review_gpt55_v2.md

docs: review outputs + figs; drop stale Qwen3.5-0.8B svd cache

2026-05-31 00:00:40 +00:00

20260530_review_student_deepseek.md

docs: review outputs + figs; drop stale Qwen3.5-0.8B svd cache

2026-05-31 00:00:40 +00:00

20260530_substrate_code_review.md

fix: external-review criticals — os._exit oracle hole + exact even matching + honest teacher gt

2026-05-30 09:15:23 +00:00

20260530_substrate_review_deepseek.md

feat: lean per-step table w/ per-mode hack cols, generic elicit, ship->deploy

2026-05-30 10:35:26 +00:00

20260530_substrate_review_gemini.md

fix: two more oracle holes (gpt-5.5 review) — sentinel forgery + int-subclass eq

2026-05-30 09:57:46 +00:00

20260530_substrate_review_gpt55.md

fix: two more oracle holes (gpt-5.5 review) — sentinel forgery + int-subclass eq

2026-05-30 09:57:46 +00:00

20260530_substrate_review_grok.md

feat: lean per-step table w/ per-mode hack cols, generic elicit, ship->deploy

2026-05-30 10:35:26 +00:00

20260530_substrate_review_qwen.md

feat: lean per-step table w/ per-mode hack cols, generic elicit, ship->deploy

2026-05-30 10:35:26 +00:00

20260531_route2_code_review_v2.md

fix: route2 Arm A flags per-rollout not per-token (external review)

2026-05-31 11:25:13 +00:00

20260531_routing_v2_distinct_basis.md

spec: log execution pass (refresh no-op + bf16 dtype fixes, random-V cancelled, defaults cleanup, T4 split)

2026-05-31 13:39:31 +00:00

20260601_calibrated_tau_route2grad.md

refactor: route2 quarantine = scale-matched delta_S_hack, rip out 33M LoRA

2026-06-01 02:52:02 +00:00

handover.md

tidy

2026-05-29 06:29:43 +00:00

spec2.md

spec2 + base_pool generator + slim replay save (partial mixed-replay TODO)

2026-05-25 11:48:48 +00:00