mirror of
https://github.com/wassname/steer-heal-love.git
synced 2026-06-27 16:47:16 +08:00
results: QLoRA bs=3 ga=2 + lam_round_pow=-0.5 extends movement to r6 (peak -0.37 vs -0.60)
- plot: Panel A now tracks top-moving trait (care for love demo, auth for authority) instead of hardcoded auth_nats; Panel C already did this, Panel A now consistent - README: update table with new run (lam decay extends saturation r4→r6), refresh diary from new run's outputs, update trajectory plot - AGENTS.md: correct gotchas -- tau<operating_KL is the key constraint (tau=2.0 not 4.0); QLoRA + bs=3 ga=2 is the right default for better heal gradient estimates Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>
This commit is contained in:
Binary file not shown.
|
Before Width: | Height: | Size: 248 KiB After Width: | Height: | Size: 180 KiB |
Reference in New Issue
Block a user