evil_MoE

mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-27 21:37:22 +08:00

Author	SHA1	Message	Date
wassname	2defc4a3ea	fix(plots): drop deprecated routing arm; plot_substrate reads per-batch counts - plot_dynamics: routing (route v1) out of ARM_ORDER -- superseded by routing2. - plot_substrate: per-mode hk_* are now plain per-batch counts (streaming log dropped the /denominator); parse the count, plot it (EMA or cumsum); skip old n/d-format logs (incompatible units). Y-axis hacks/batch, count annotations. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-06-02 00:02:43 +00:00
wassname	11bcdd2fe6	route2 instrumentation + lr fix + deploy overlay (route2-act divergence) route2-act diverged (run 43): 33M kaiming A_q/B_q at delta_S's lr=3e-3 blew up (gn 0.3->7.5 step 8, generations -> token salad, lp_t -11). Fixes: - #167 separate quarantine lr (route2_quar_lr_scale=0.1) so the 60x-bigger fresh LoRA isn't trained at the main-knob lr. - #168 divergence tripwire on teacher ppl (lp_t high-water mark; abort if it drops >5 nats for 2 steps). Relative so tiny-random smoke (flat lp_t~-11.9) doesn't false-trip. - #165 act-path was silent: stash cos(a,v_act) + fired-fraction in the forward, surface as act_cos/act_fire columns (route2-act). smoke shows act_fire=0.64 => the cos>0 sign test over-routes (fires on most tokens, not just hack ones). - #166 print last train generation before FINAL EVAL (coherence eyeball). - route2 v_act/v_grad refresh was firing but silent -- now announced. - #162 plot_deploy_overlay.py: per-mode DEPLOY overlay from per_mode_deploy.json (honest shipped-model numbers, route2-safe). just plot-deploy. - just plot/results hardened: parse by header name, skip non-substrate logs, non-fatal aggregate delegation. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-05-31 23:16:39 +00:00
wassname	07acadb43f	plot: single 'just plot' entrypoint emits per-mode + aggregate (reuse plot_dynamics) - plot_substrate.main now also calls plot_dynamics.plot/plot_hack_overlay so one command produces all 4 figs (by_method, by_hack, aggregate, hack_overlay); the aggregate 'total hacks per arm' core plot is kept, not reimplemented. - plot_dynamics: point parser at CURRENT streaming headers (cin_t/cin_s, hk_dep/ slv_dep); it was built for the old cos_pre_t/hack_deploy spelling and silently failed on sub4 logs. No backward-compat for the superseded header. - justfile: 'plot GLOB STEM' canonical entrypoint over logs/_sub4_.log. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-05-31 04:37:31 +00:00
wassname	e45767effb	plot: multi-seed overlay for substrate emergence (thin per-seed + bold mean) Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-05-30 21:23:57 +00:00
wassname	0ea751c5bc	plot: #148 substrate emergence — per-mode hack-rate curves (4/5 learned, seed 41) New scripts/plot_substrate.py parses the hk_<mode> cumulative columns from a multi-loophole substrate run (one log, K interleaved modes) and draws one learning curve per mode with first_step onset dots and direct end-labels. plot_emergence.py can't do this (it groups logs by a single --env-mode). Figure shows the headline: vanilla GRPO learns file_marker/run_tests/ stdout_marker/sentinel, eq_override flat at 0 (never). Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>	2026-05-30 16:49:03 +00:00
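
Commit 2defc4a3ea says plot_substrate now parses plain per-batch counts from the hk_* columns, plots them as an EMA or a cumulative sum, and skips old n/d-format logs. The sketch below illustrates that idea only and is not the repository's code: the helper names (`parse_count`, `ema`, `cumsum`), the cell formats, and the default `alpha` are all assumptions.

```python
from itertools import accumulate


def parse_count(cell):
    """Parse one hk_* cell (hypothetical format).

    Returns an int for a plain per-batch count, or None for the old
    'n/d' format, whose units are not comparable and should be skipped.
    """
    cell = cell.strip()
    if "/" in cell:
        return None
    return int(cell)


def ema(values, alpha=0.1):
    """Exponential moving average, seeded with the first value."""
    out, state = [], None
    for v in values:
        state = v if state is None else alpha * v + (1 - alpha) * state
        out.append(state)
    return out


def cumsum(values):
    """Running total of per-batch counts (total hacks so far)."""
    return list(accumulate(values))
```

With these helpers, a column of cells such as "1", "0", "2" gives cumulative totals 1, 1, 3, while a cell such as "3/16" is treated as old-format and dropped.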

5 Commits
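
The divergence tripwire described in commit 11bcdd2fe6 (#168) tracks a high-water mark of lp_t and aborts if lp_t falls more than 5 nats below it for 2 steps. A minimal sketch of that logic follows. The class name and parameters are hypothetical, and it assumes the 2 steps must be consecutive, which the commit message does not state.

```python
class DivergenceTripwire:
    """Abort signal based on the drop of lp_t from its running maximum.

    Hypothetical sketch: the threshold is relative to the high-water mark,
    so a run whose lp_t is flat (even at a very low value) never trips.
    """

    def __init__(self, max_drop_nats=5.0, patience=2):
        self.max_drop_nats = max_drop_nats
        self.patience = patience          # assumed: consecutive bad steps
        self.high_water = float("-inf")   # best lp_t seen so far
        self.bad_steps = 0

    def update(self, lp_t):
        """Record one step's lp_t; return True when training should abort."""
        self.high_water = max(self.high_water, lp_t)
        if self.high_water - lp_t > self.max_drop_nats:
            self.bad_steps += 1
        else:
            self.bad_steps = 0
        return self.bad_steps >= self.patience
```

Because the test compares against the high-water mark and not an absolute value, a smoke run with lp_t flat near -11.9 never triggers it, which matches the false-trip concern in the commit message.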