Block a user
Updated 2026-06-27 17:49:37 +08:00
Measured persona prompt templates and contrastive persona pairs for steering experiments
Updated 2026-06-25 14:08:19 +08:00
Hypothesis: you can distill a steering vector into LoRA weights and "heal" the incoherency the vector injects by regularising the training (KL to base, or weight decay). Then loop and see what multiple rounds give you.
Updated 2026-06-24 20:50:29 +08:00
pi extension: plan-mode goals with evidence in one plan.md, signed off by a read-only subagent check. Small successor to pi-lgtm.
Updated 2026-06-17 18:21:45 +08:00
pi extension: plan-mode goals with evidence in one plan.md, signed off by a read-only subagent check. Small successor to pi-lgtm.
Updated 2026-06-17 18:21:45 +08:00
UAT-style task tree with verify commands and done criteria for pi coding agent
Updated 2026-06-15 17:29:24 +08:00
Each lora type adapter can tell us something about how to look at transformer internals, and they some with causal evidence
Updated 2026-06-11 10:29:47 +08:00
Updated 2026-06-01 14:30:20 +08:00