mirror of
https://github.com/wassname/evil_MoE.git
synced 2026-06-27 18:23:57 +08:00
80e82f0b29
Finding: v_grad/As barely separate LIVE hack from clean (authored pairs are off-distribution: localized run_tests-block contrast vs full novel-problem rollouts). act-cosine best AUROC 0.69; grad-cosine best confident-tail p@10 0.70; magnitude inverted. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>