wassname/evil_MoE
mirror of https://github.com/wassname/evil_MoE.git synced 2026-06-27 17:30:41 +08:00
c1388e5325917eda58bd91bf9ce3cf266a5a4b84
evil_MoE/docs
wassname c1388e5325 paper: title -> question form 'Can We Quarantine Reward Hacking with a Reward-Hacking Representation?'
Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>
2026-06-03 01:42:03 +00:00
| Name | Last commit | Date |
| --- | --- | --- |
| blog | docs: refresh blog+README for route2/deploy-eval; embed key dynamics plot; drop sparse-only dots | 2026-06-02 01:24:29 +00:00 |
| brainstorm | ready | 2026-05-23 14:19:41 +08:00 |
| figs | misc | 2026-06-02 02:06:43 +00:00 |
| papers | wip | 2026-05-30 04:33:33 +00:00 |
| personas | fix smoke. | 2026-05-23 11:26:39 +08:00 |
| reviews | review: 3-model external panel on route2 pseudocode + synthesis | 2026-06-01 01:44:31 +00:00 |
| spec | feat: mix=0 no-teacher ablation path (pure on-policy, pool kept for v_grad+partition) | 2026-06-02 23:26:26 +00:00 |
| vendor | concepts | 2026-05-29 06:29:20 +00:00 |
| writeup | paper: title -> question form 'Can We Quarantine Reward Hacking with a Reward-Hacking Representation?' | 2026-06-03 01:42:03 +00:00 |
| extract_vhack_grad-vec.md | Doc cleanup: mark susp gate as REMOVED in design doc | 2026-05-27 09:08:34 +00:00 |
| grpo_hyperparams.md | fix smoke. | 2026-05-23 11:26:39 +08:00 |
| human_journal.md | init | 2026-05-23 10:22:54 +08:00 |
| results.md | results: fill keynote table/figure at n=3 route2 / n=2 vanilla | 2026-06-02 11:08:41 +00:00 |