diff --git a/RESEARCH_JOURNAL.md b/RESEARCH_JOURNAL.md index 4ef0371..28d20c0 100644 --- a/RESEARCH_JOURNAL.md +++ b/RESEARCH_JOURNAL.md @@ -4,7 +4,7 @@ Append-only. New entries at the top, date-stamped. Never edit old entries. ## 2026-05-29 (j) — WIP projected-vs-vanilla matched-seed table: at the two seeds where I have all three arms, projection drops whole-run HACK_STUDENT by 12-23pp vs vanilla, and refresh-every=2 adds about 5pp on top of frozen V -**When**: 2026-05-29, commit `f70743c`. Pueue tasks already landed: #59 (vanilla s=41), #61 (vanilla s=43), #62 (vanilla s=44), #90/#101/#95 (projected frozen s=41/42/43), #91/#94/#104 (projected refresh-2 s=41/42/43). Queued for the missing matched cells: #137 (vanilla s=42), #138 (projected frozen s=44), #139 (projected refresh-2 s=44). +**When**: 2026-05-29, commit `f70743c`. Pueue tasks already landed: #59 (vanilla s=41), #61 (vanilla s=43), #62 (vanilla s=44), #90/#101/#95 (projected frozen s=41/42/43), #91/#94/#104 (projected refresh-2 s=41/42/43). Queued for the missing matched cells: originally #137 (vanilla s=42), #138 (projected frozen s=44), #139 (projected refresh-2 s=44); AFK reorder via `pueue switch` (2026-05-29 ~04:30 UTC) moved these commands to slots #120/#121/#122 to land before bed. Original G2-screen commands displaced to slots #137/#138/#139. **Why this run.** Entry (h) reported a 30pp drop in last-5 hack_s from widening the v_hack pair set, but lacked a matched-seed vanilla baseline at the 21-pair regime. Entry (i) then noted that refresh-every=2 added only about 2.5pp last-5 on top of widening. Both entries left the projected-vs-vanilla gap unresolved at the seed level. The user pointed out we already have vanilla baselines at this exact preset (mix=0.5 fast, n=3 seeds), so I assembled the cross-arm table.