mirror of
https://github.com/wassname/steer-heal-love.git
synced 2026-06-27 16:47:16 +08:00
5cdc0ba16d
The mean-diff vector is L2-normalised, so p95 KL grows roughly as c^2 and reaching the 1-nat target needs c ~ O(100). steering-lite's default bracket upper bound (hi ~16) pinned c_star at the top of the bracket (KL ~0.1 << 1.0) on both tiny-random and real gemma. With bracket=(0.1, 1024), gemma calibrates to c_star=64.03 at p95 KL=1.035.

Also detach div before .item() in heal logging.

See RESEARCH_JOURNAL.

Co-Authored-By: Claudypoo <288921227+claudypoo@users.noreply.github.com>
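
To illustrate the pinning behaviour described in this commit, here is a minimal sketch of a bracketed search for the steering coefficient. It is not steering-lite's actual code: `calibrate_coeff` and `toy_kl` are hypothetical names, and the toy curve KL = (c/64)^2 is an assumption chosen only to mimic the quadratic growth and the reported order of magnitude of c_star.

```python
import math


def calibrate_coeff(kl_at, target=1.0, bracket=(0.1, 1024.0), iters=60):
    """Find c with kl_at(c) ~= target by geometric bisection (hypothetical helper).

    Assumes kl_at is monotonically increasing in c.
    Returns (c, pinned), where pinned is True if the target lies outside the bracket.
    """
    lo, hi = bracket
    if kl_at(hi) < target:
        # Even the largest allowed coefficient undershoots: pinned at the top.
        return hi, True
    if kl_at(lo) > target:
        # Even the smallest allowed coefficient overshoots: pinned at the bottom.
        return lo, True
    for _ in range(iters):
        mid = math.sqrt(lo * hi)  # geometric midpoint suits a bracket spanning decades
        if kl_at(mid) < target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi), False


def toy_kl(c, scale=64.0):
    """Toy stand-in for the measured p95 KL: quadratic in c (illustrative assumption)."""
    return (c / scale) ** 2


# A narrow bracket cannot reach 1 nat, so the search reports the upper bound.
c_narrow, pinned_narrow = calibrate_coeff(toy_kl, bracket=(0.1, 16.0))

# A wide bracket contains the target and the search converges inside it.
c_wide, pinned_wide = calibrate_coeff(toy_kl, bracket=(0.1, 1024.0))
```

Returning a `pinned` flag alongside the coefficient is one way to make this failure mode visible instead of silently accepting the bracket edge as the calibrated value.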