9 Commits

Author SHA1 Message Date
wassname 6442d01ade short judge output: verdict + reason + blind spots only 2026-06-14 21:40:48 +08:00
wassname c0ceb95ea4 simplify proof logs and keep tasks repo-local 2026-06-14 20:09:30 +08:00
wassname 927a482d79 rename to pi-proof-tasks and simplify proof log 2026-06-14 11:59:47 +08:00
wassname d76ed41655 Relax verification hints auto-review gate 2026-06-14 06:35:00 +08:00
wassname 25116e399a add lgtm evidence history and artifact metadata 2026-06-07 19:03:01 +08:00
wassname 023a2c2263 fix lgtm review gating and evidence display 2026-06-07 18:55:05 +08:00
wassname 5b800653a3 feat: two-tier task model — trivial tasks self-complete, lgtm gates significant claims
Reviewer feedback: the LGTM extension's epistemic core is good, but the UX is
too ceremonial: every task was forced through lgtm_ask + /lgtm, even
bookkeeping like "monitor pueue 30". Two-tier split:

- Tasks: agent-managed. TaskUpdate(status=completed) now allowed when no lgtm
  evidence is stored. Trivial subtasks lead up to verification without ceremony.
- LGTMs: significant claims. lgtm_ask still triggers robot review; once evidence
  is stored, completion is locked to /lgtm so the gate can't be bypassed.
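
A minimal sketch of the two-tier rule above, assuming a hypothetical Task
shape. The field name lgtmEvidence and the helper agentMayComplete are
illustrative and are not taken from the extension's code:

```typescript
// Hypothetical task shape; field names are assumptions for illustration only.
type TaskStatus = "pending" | "in_progress" | "pending_approval" | "completed";

interface Task {
  id: number;
  status: TaskStatus;
  // Assumed to be set once lgtm_ask has stored evidence for this task.
  lgtmEvidence?: string;
}

// Tier 1: a task with no stored evidence may be completed by the agent itself.
// Tier 2: once evidence exists, the agent path is locked and only /lgtm
// (not modelled here) is expected to complete the task.
function agentMayComplete(task: Task): boolean {
  return task.lgtmEvidence === undefined;
}

const trivial: Task = { id: 1, status: "in_progress" };
const claimed: Task = {
  id: 2,
  status: "pending_approval",
  lgtmEvidence: "tests pass",
};
```

Under these assumptions agentMayComplete(trivial) is true and
agentMayComplete(claimed) is false, which mirrors the intent that the gate
cannot be bypassed once evidence is stored.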

Other UX:
- TaskList output grouped: Active / Awaiting sign-off / Pending / Completed.
- New getDisplayStatus(task) derives awaiting_signoff from pending_approval.
- Widget header shows N awaiting sign-off count.
- /lgtm accepts multiple ids: /lgtm 1 2 3 (also #1, commas).
- lgtm_ask field descriptions encourage one short sentence per field — keep
  thinking discipline, drop verbosity.
- SYSTEM_REMINDER nudges progress updates and cleanup of completed/irrelevant
  tasks, not just lgtm_ask.
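
Two of the UX items above, sketched under stated assumptions. getDisplayStatus
is named in this commit, but its body here is a guess at the described
behaviour; parseLgtmIds and the type names are hypothetical:

```typescript
// Hypothetical status types; only pending_approval and awaiting_signoff
// are named in the commit message.
type TaskStatus = "pending" | "in_progress" | "pending_approval" | "completed";
type DisplayStatus = "pending" | "in_progress" | "awaiting_signoff" | "completed";

// Assumed behaviour: the stored pending_approval status is shown as
// awaiting_signoff; every other status is displayed unchanged.
function getDisplayStatus(task: { status: TaskStatus }): DisplayStatus {
  return task.status === "pending_approval" ? "awaiting_signoff" : task.status;
}

// Hypothetical parser for the /lgtm argument string. Accepts ids separated
// by whitespace and/or commas, each with an optional leading '#'.
// Tokens that are not ids are skipped; duplicates are dropped.
function parseLgtmIds(args: string): number[] {
  const ids: number[] = [];
  for (const token of args.split(/[\s,]+/)) {
    const match = /^#?(\d+)$/.exec(token);
    if (!match) continue;
    const id = Number(match[1]);
    if (!ids.includes(id)) ids.push(id);
  }
  return ids;
}
```

With this sketch, "1 2 3", "#1, #2, #3" and "1,2,3" all parse to the same
three ids, and an empty argument string yields an empty list.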

Also includes the pending rubric extension on RobotReviewRecord.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-04-25 18:18:48 +08:00
wassname 0ba0173615 feat: gate lgtm on robot review acceptance 2026-04-17 08:31:07 +08:00
wassname 3c7d2f5140 feat: iterate and auto-run robot reviews 2026-04-17 08:20:30 +08:00