evil_MoE/scripts at 5714996c564d566cb1383d512ed19cdef8716594 - evil_MoE - Gitea: Git with a cup of tea

wassname/evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-27 17:48:43 +08:00

Files

T

History

wassname 5c97975185 refactor: collapse to lora2r-only (none/routeV/absorb); delete erase/antipasto/lora_frozen_b paths

train.py rewritten straight-line for the single rank-2r Gaussian-init LoRA adapter
and three arms (intervention none|routeV|absorb). Removes the erase grad-surgery,
act_vote/online_stats gates, beta/KL reference path, per-source split harvest, the
v_hack injection block, and all per-mechanism E/C/D/A-B tallies. Folds in:
- T2 Gaussian init (lora2r.py): A0~N(0,1/d_in), B0~N(0,1/2r), net delta 0 at init.
- T3 width-pooled gate labels: single (num/den) fraction across modules, skip
  zero-width modules, raise if none separate (was per-module equal-weight blowup).
- T5 absorb arm: masks pinned (1,0) -> both blocks train, no gate.
- T6 self-contained ckpt: A/B/A0/B0 in one file (no _hack file, no SVD cache),
  adapter:"lora2r" in saved cfg.
- T8 m3: step_flagged logs the hack share (d.mean), not m.mean.

Gates green: verify_lora2r_routing (4 invariants) + smoke none/routeV/absorb
end-to-end on tiny-random Qwen3 (logs in /tmp/claude-1000/smoke_*.log).

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>

2026-06-10 10:58:22 +00:00

..

build_combined_pool.py

reorg: out/ sorted by datatype (vhack/ pools/ runs/ vhack_grads/ figs/)

2026-05-30 03:52:24 +00:00

build_runtests_pool.py

fix: dense run_tests teacher pool (6 -> 215 prompts) so the hack seeds in 60 steps

2026-06-07 11:01:31 +00:00

build_substrate.py

cleanup: consolidate stale loaders and pair scripts

2026-06-09 12:47:32 +00:00

diag_cosine_dist.py

cleanup: consolidate pairs modules into build scripts + add solve_train to table

2026-06-09 09:17:42 +00:00

diag_pairs_compare.py

cleanup: consolidate pairs modules into build scripts + add solve_train to table

2026-06-09 09:17:42 +00:00

eval_checkpoint_curve.py

rename: deployed/as_trained policy views, kill 'knob' (schema paired_final_v2)

2026-06-10 05:26:51 +00:00

make_random_vhack.py

cleanup: trim 2 stale provenance/train-of-thought comments

2026-06-03 00:25:22 +00:00

migrate_deploy_v1_to_v2.py

tool: migrate v1 deploy_test/eval_curve -> v2 field names (for mid-flight runs)

2026-06-10 05:27:38 +00:00

pairs_from_rollouts.py

rename python package projected_grpo -> vgrout

2026-06-05 14:51:48 +08:00

pairset_build_authored.py

fix: rename 4 canonical LeetCode function names in authored/clean pairsets

2026-06-09 09:23:33 +00:00

pairset_build_intent.py

cleanup: consolidate pairs modules into build scripts + add solve_train to table

2026-06-09 09:17:42 +00:00

pairset_build_progsets.py

refactor: named pairset JSONs + explicit --vhack-pairs-path, remove None fallback

2026-06-09 08:09:09 +00:00

plot_deploy_overlay.py

rename python package projected_grpo -> vgrout

2026-06-05 14:51:48 +08:00

plot_dynamics.py

refactor: extract train_config.py + run_artifacts.py from train.py; slim results scripts

2026-06-09 13:34:50 +00:00

plot_emergence.py

rename python package projected_grpo -> vgrout

2026-06-05 14:51:48 +08:00

plot_floor_ceiling.py

rename: deployed/as_trained policy views, kill 'knob' (schema paired_final_v2)

2026-06-10 05:26:51 +00:00

plot_substrate.py

rename python package projected_grpo -> vgrout

2026-06-05 14:51:48 +08:00

probe_distill.py

refactor: extract train_config.py + run_artifacts.py from train.py; slim results scripts

2026-06-09 13:34:50 +00:00

probe_plot_stack.py

refactor: move 5 leaf entrypoints src/ -> scripts/ (src is now library-only)

2026-06-03 00:23:56 +00:00

rescore_deploy.py

rename: deployed/as_trained policy views, kill 'knob' (schema paired_final_v2)

2026-06-10 05:26:51 +00:00

results_deploy.py

rename: deployed/as_trained policy views, kill 'knob' (schema paired_final_v2)

2026-06-10 05:26:51 +00:00

results.py

refactor: extract train_config.py + run_artifacts.py from train.py; slim results scripts

2026-06-09 13:34:50 +00:00

tt_erase_bench.py

eval: final deploy eval records knob-on (deployed-as-trained) for quarantine arms

2026-06-09 13:09:50 +00:00

validate_spoonfeed.py

cleanup: consolidate stale loaders and pair scripts

2026-06-09 12:47:32 +00:00

verify_base_solve.py

fix: eval on paper test set, not contaminated holdout (base solve 0.94->0.094)

2026-06-07 11:01:31 +00:00

verify_eval_gap.py

refactor: extract train_config.py + run_artifacts.py from train.py; slim results scripts

2026-06-09 13:34:50 +00:00

verify_lora2r_routing.py

refactor: collapse to lora2r-only (none/routeV/absorb); delete erase/antipasto/lora_frozen_b paths

2026-06-10 10:58:22 +00:00

verify_partition.py

test: no-cheat partition + teacher-pool composition gate (verify_partition.py)

2026-06-05 04:36:03 +00:00

verify_rewards.py

fix: rotate the unhackable (gt_only) subset per step, not frozen per pid

2026-06-10 06:14:08 +00:00

verify_rotation.py

fix: rotate the unhackable (gt_only) subset per step, not frozen per pid

2026-06-10 06:14:08 +00:00

verify_science_invariants.py

eval: final deploy eval records knob-on (deployed-as-trained) for quarantine arms

2026-06-09 13:09:50 +00:00

verify_vhack_heldout.py

eval: final deploy eval records knob-on (deployed-as-trained) for quarantine arms

2026-06-09 13:09:50 +00:00