mirror of
https://github.com/wassname/evil_MoE.git
synced 2026-06-28 04:38:05 +08:00
329066e99b
Vanilla deploy-hack keeps climbing after teacher cut at step 40 (0.36->0.58, job 87), at/above teacher-on (job 97). Closest-match jobs differ in LR; FIXME to swap in lr-matched job 124 (queued low-prio). CSV is the committed data artifact; fig regen by plot_teacher_ablation.py. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>
24 lines
860 B
CSV
24 lines
860 B
CSV
step,arm,teacher_schedule,lr,deploy_hack,deploy_solve,job
|
|
0,vanilla,off@40,3e-3,0.000,0.359,87
|
|
20,vanilla,off@40,3e-3,0.141,0.438,87
|
|
40,vanilla,off@40,3e-3,0.359,0.359,87
|
|
60,vanilla,off@40,3e-3,0.438,0.562,87
|
|
80,vanilla,off@40,3e-3,0.453,0.531,87
|
|
100,vanilla,off@40,3e-3,0.469,0.531,87
|
|
120,vanilla,off@40,3e-3,0.500,0.500,87
|
|
140,vanilla,off@40,3e-3,0.516,0.422,87
|
|
160,vanilla,off@40,3e-3,0.578,0.359,87
|
|
180,vanilla,off@40,3e-3,0.469,0.469,87
|
|
199,vanilla,off@40,3e-3,0.484,0.453,87
|
|
0,vanilla,on,1e-3,0.000,0.328,97
|
|
20,vanilla,on,1e-3,0.000,0.484,97
|
|
40,vanilla,on,1e-3,0.172,0.500,97
|
|
60,vanilla,on,1e-3,0.250,0.547,97
|
|
80,vanilla,on,1e-3,0.219,0.500,97
|
|
100,vanilla,on,1e-3,0.281,0.469,97
|
|
120,vanilla,on,1e-3,0.328,0.406,97
|
|
140,vanilla,on,1e-3,0.281,0.453,97
|
|
160,vanilla,on,1e-3,0.328,0.438,97
|
|
180,vanilla,on,1e-3,0.391,0.500,97
|
|
199,vanilla,on,1e-3,0.344,0.500,97
|