From 52619519dcb2bc7d45cbf46a382dc091e90c2046 Mon Sep 17 00:00:00 2001 From: wassname <1103714+wassname@users.noreply.github.com> Date: Sun, 7 Jun 2026 10:20:27 +0000 Subject: [PATCH] docs: drop dead refs (spec.md link, verify_gate_anchor.py paragraph) - spec.md never existed at root or docs/; removed the link from AGENTS.md + README.md (the live plan is in docs/spec/ dated files). - RESEARCH_JOURNAL.md link pointed at docs/; it lives at repo root. Fixed. - Trimmed the no-cheat-leak paragraph citing scripts/verify_gate_anchor.py (that file doesn't exist); kept the general 'gate every load-bearing invariant in the same commit' rule. Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com> --- AGENTS.md | 11 +++-------- README.md | 2 +- 2 files changed, 4 insertions(+), 9 deletions(-) diff --git a/AGENTS.md b/AGENTS.md index 5cfa666..4504e6c 100644 --- a/AGENTS.md +++ b/AGENTS.md @@ -56,20 +56,15 @@ Inherit global rules from `~/.claude/CLAUDE.md`. ## Files -- Read [docs/spec.md](spec.md) for the preregistered plan. - Read [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md) for design rationale. - New sweep arms get recipes in [justfile](justfile) with `# H:` hypothesis comments. - `just smoke` before any real run (~1-2 min, beartype on, real pipeline on tiny inputs). - Real runs go through `pueue` on the 96GB GPU box. Label each job with `why:` and `resolve:`. -- Head [docs/RESEARCH_JOURNAL.md](docs/RESEARCH_JOURNAL.md) for latest results. +- Head [RESEARCH_JOURNAL.md](RESEARCH_JOURNAL.md) for latest results. - No `tests/` dir; `smoke` is the correctness gate. Unit checks are `scripts/verify_*.py` (assert + exit nonzero), wired into the `smoke` recipe so every smoke runs them. -- Every load-bearing invariant gets a `verify_*.py` gate. The no-cheat leak (held-out - modes received <=1.1% detector labels via the route2 gate, 2026-06-05) shipped because - nothing asserted it: the existing gates were green but none covered the property. If a - claim is load-bearing, write the assertion test in the same commit -- "the tests passed" - means nothing if the property was never tested. This one was found by hand-counting, not - by a gate; now `scripts/verify_gate_anchor.py` covers it. +- Every load-bearing invariant gets a `verify_*.py` gate, written in the same commit as + the claim -- "the tests passed" means nothing if the property was never tested. On persona pairs - ./docs/personas/how_to_rewrite_pairs.md diff --git a/README.md b/README.md index 0f817ba..05c2c69 100644 --- a/README.md +++ b/README.md @@ -11,7 +11,7 @@ LeetCode benchmark. Method differs from concurrent work (Wu & Tang 2026, "Advantage Modification") by intervening at the gradient level rather than the advantage level. -See [docs/spec.md](spec.md), [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md), +See [docs/brainstorm/extracted_prefs.md](docs/brainstorm/extracted_prefs.md) and [docs/papers/](docs/papers/). ## How it works