diff --git a/AGENTS.md b/AGENTS.md index 5cfa666..4504e6c 100644 --- a/AGENTS.md +++ b/AGENTS.md @@ -56,20 +56,15 @@ Inherit global rules from `~/.claude/CLAUDE.md`. ## Files -- Read [docs/spec.md](spec.md) for the preregistered plan. - Read [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md) for design rationale. - New sweep arms get recipes in [justfile](justfile) with `# H:` hypothesis comments. - `just smoke` before any real run (~1-2 min, beartype on, real pipeline on tiny inputs). - Real runs go through `pueue` on the 96GB GPU box. Label each job with `why:` and `resolve:`. -- Head [docs/RESEARCH_JOURNAL.md](docs/RESEARCH_JOURNAL.md) for latest results. +- Head [RESEARCH_JOURNAL.md](RESEARCH_JOURNAL.md) for latest results. - No `tests/` dir; `smoke` is the correctness gate. Unit checks are `scripts/verify_*.py` (assert + exit nonzero), wired into the `smoke` recipe so every smoke runs them. -- Every load-bearing invariant gets a `verify_*.py` gate. The no-cheat leak (held-out - modes received <=1.1% detector labels via the route2 gate, 2026-06-05) shipped because - nothing asserted it: the existing gates were green but none covered the property. If a - claim is load-bearing, write the assertion test in the same commit -- "the tests passed" - means nothing if the property was never tested. This one was found by hand-counting, not - by a gate; now `scripts/verify_gate_anchor.py` covers it. +- Every load-bearing invariant gets a `verify_*.py` gate, written in the same commit as + the claim -- "the tests passed" means nothing if the property was never tested. On persona pairs - ./docs/personas/how_to_rewrite_pairs.md diff --git a/README.md b/README.md index 0f817ba..05c2c69 100644 --- a/README.md +++ b/README.md @@ -11,7 +11,7 @@ LeetCode benchmark. Method differs from concurrent work (Wu & Tang 2026, "Advantage Modification") by intervening at the gradient level rather than the advantage level. -See [docs/spec.md](spec.md), [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md), +See [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md) and [docs/papers/](docs/papers/). ## How it works