Files
evil_MoE/docs
wassname 5d83adbb25 fix: correct the "18 vs 21 pair" basis claim (it was never about pair count)
Read the safetensors shapes/metadata: v_hack_full = 10 pairs / k=5,
v_hack_21pairs = 16 pairs / k=12 (n_heldout=2; neither is 18 or 21). The two
bases differ on pairs AND directions-kept AND extract-tau simultaneously, so
the hack-cut gap is triple-confounded, not a clean "pair set is the lever"
result. Nothing was lost: the strong basis reproduces from current pairs.py
via --top-k=12 --v-hack-drop-bottom-frac=0.0, and refresh already re-extracts
at k=12. Rewrites Q8 + the top confound bullet + the README findings caveat.
A one-knob k-sweep is needed to attribute the gain.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-05-29 10:12:12 +00:00
..
2026-05-23 14:19:41 +08:00
2026-05-29 06:29:43 +00:00
2026-05-23 11:26:39 +08:00
2026-05-29 06:29:43 +00:00
2026-05-29 06:29:20 +00:00
2026-05-23 11:26:39 +08:00
2026-05-23 10:22:54 +08:00
2026-05-23 10:40:02 +08:00