pi-plan/docs/reviews/review.md at main

mirror of https://github.com/wassname/pi-plan.git synced 2026-06-27 16:16:14 +08:00

Files

T

wassname 489f9b8c35 Clean pi-plan references, add judge timeout, fix heading format

- Rename spec doc to 2026-06-15_pi-goals.md, update title
- Update review.md spec reference
- Rename piPlanExtension -> piGoalsExtension in src/index.ts
- Add 120s timeout to judge subprocess (was unbounded, caused hang)
- Change planInjection heading from 'Goals (goals.md):' to '.pi/goals.md:'
- Add FIXMEs for tool label, progress visibility, heading format

2026-06-17 18:09:03 +08:00

3.7 KiB

Raw Permalink Blame History

Code review against spec docs/spec/2026-06-15_pi-goals.md.

(A) SPEC MISMATCH — code does not match spec intent

No loop judge (spec §9, §3b). The extension lacks any per‑turn evaluation that would decide continue/pause; the loop‑judge prompt (loopJudgeSystem, loopJudgeUser) is defined but never invoked. No motion.
/goal command missing (spec §7). No handler for /goal (restart loop, pause, resume, clear, status). The only command is /plan.
/subgoal command missing (spec §7). Not implemented.
CancelGoal tool not implemented (spec §5, optional but present in spec). Not a blocker but a gap.
Plan‑phase model selection (D12) not implemented. planDrafting always runs on the default model; there is no sticky per‑phase model choice, no selection menu, and no persisting of a plan‑phase model reference.
Widget does not flag done goals that lack a sign‑off log line (spec §7, §6). The widget hides all done goals unconditionally; the visibility guard is missing.
/plan (no args) does not render the task‑list widget (spec §7). showPlan() dumps raw file content via notify; the widget is only set through updateWidget() on other events, not by the command itself.
Injection message role (spec §11). The before_agent_start hook returns a customType message with display: false. The spec demands a late user‑role message to avoid system‑prompt mutation; the actual message role depends on the pi API and may be system, not user, risking cache breakage.
Missing pre‑compact hook (spec §8). No pre‑compact hook to flush any in‑memory state (even just ensuring plan.md is up‑to‑date) before compaction.
Reminder cadence deviates (spec §8a). The spec calls for firing after N file‑modifying turns since last plan.md update. The code fires if plan.md is byte‑identical between agent starts, which is a coarser proxy.

(B) DEAD/UNUSED CODE

File	Lines	Reason
`src/prompts.ts`	128‑146	`loopJudgeSystem` and `loopJudgeUser` exported but never used.
`src/prompts.ts`	115‑118	`continuation` exported but never used (the loop is not built).

(C) OVERLY LONG OR REDUNDANT COMMENTS

The file‑header comments in index.ts (lines 1‑20) and plan‑file.ts (lines 1‑26) are fairly concise descriptions of the design; they are not excessive. No comment bloat worth flagging.

(D) OVER‑ENGINEERING vs. “super simple” goal

None. The line‑scanner in plan‑file.ts is minimal; the getPiInvocation() helper is a straightforward copy from the oracle extension; no unnecessary abstraction or defensive layers.

(E) REAL BUGS

cmdCtx.newSession cast risk (src/index.ts:272, 201).
reviewLoop casts ctx (type ExtensionContext) to ExtensionCommandContext to pass to startExecution, which calls cmdCtx.newSession(...). If the concrete context does not carry that method, it fails at runtime. (In practice the same object may satisfy it, but the cast hides the truth.)
showPlan raw content instead of widget (src/index.ts:136‑143).
/plan with no arguments shows the file content via ctx.ui.notify, not the structured task‑list widget the spec expects. The widget is rendered separately via updateWidget, but the command does not trigger it, so the output is inconsistent.

No other obvious logic errors; the sign‑off flow, logging, and parsing work as intended.

Verdict: A clean scaffold for the sign‑off path, but missing the autonomous loop, /goal command, and plan‑phase model selection means it’s not yet the “work autonomously” extension the spec describes.

3.7 KiB Raw Permalink Blame History Unescape Escape