From 00f8cd0872d6a40c5db901b7a8e27ddc66904d6f Mon Sep 17 00:00:00 2001 From: wassname <1103714+wassname@users.noreply.github.com> Date: Thu, 18 Jun 2026 03:31:42 +0800 Subject: [PATCH] Correct rot ablation: V>U>both was seed variance, not a real ordering Seed-1 confirmation (jobs 106-108) flips the seed-0 ranking: seed0 V57.2>U56.5> both55.6, seed1 U57.5>both56.9>V56.2. 2-seed test means (U57.0,V56.7,both56.3) span 0.7pp, inside the ~1pp SE of a 2-seed mean, so the rotation basis is within noise. rot(V) stays the default as a cheap representative, not a measured winner. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com> --- README.md | 11 ++++++----- 1 file changed, 6 insertions(+), 5 deletions(-) diff --git a/README.md b/README.md index 50621e7..45b0090 100644 --- a/README.md +++ b/README.md @@ -84,11 +84,12 @@ off-diagonal orientation is the slowest arm (120 s init vs 2.5 s) and lands slig calibration, so plain top-r SVD is the right default for this bounded-gain adapter here. AntiPaSTO-rot tunes that basis instead of the metric: a block-diagonal Cayley rotation of the -input (V), output (U), or both. The table row is V (the default); the ablation gives V 57.2 > -U 56.5 > both 55.6 (single seed). So rotating which inputs feed each frozen direction helps most, -the output-side rotation is slightly worse, and doing both is worst -- the second rotation is -redundant capacity that hurts. rot(V) is the best small-parameter arm overall (57.2 at 15K params -vs LoRA's 59.8 at 3.54M). +input (V, the table row), output (U), or both. Across two seeds the basis choice is within noise: +seed0 ranks V 57.2 > U 56.5 > both 55.6, but seed1 flips it to U 57.5 > both 56.9 > V 56.2, and +the 2-seed test means (U 57.0, V 56.7, both 56.3) span 0.7pp -- inside the ~1pp standard error of +a 2-seed mean at n=1319. So no rotation basis is reliably best here; the single-seed V>U>both +ordering was seed variance. rot(V) is the default as a fine, cheapest representative (15K params, +~230x under LoRA's 3.54M), not a measured winner. ## Developer docs