Update README.md

This commit is contained in:
wassname (Michael J Clark)
2026-06-07 11:52:15 +08:00
committed by GitHub
parent 2e99f62658
commit 08329ab86d
+2
View File
@@ -25,6 +25,8 @@ Steering is interesting because it's and internal and unsupervised intervention
## Heal ## Heal
Can we heal after steering? This is the key hypothesis:
### Hypothesis ### Hypothesis
Hypothesis: you can distill a steering vector into LoRA weights and "heal" the incoherency the vector injects by regularising the training. Then loop and see what multiple rounds give you. Hypothesis: you can distill a steering vector into LoRA weights and "heal" the incoherency the vector injects by regularising the training. Then loop and see what multiple rounds give you.