mirror of
https://github.com/wassname/evil_MoE.git
synced 2026-06-27 18:23:57 +08:00
f1f1c00f41
Paper (longer training, >512 tok/gen) and ours (60-step fast) are not directly comparable -- now shown as separate column pairs in both main.tex tab:anchors and docs/results.md Q14. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>