From 10444385aaf483d06fb483f615792b1f202dd0eb Mon Sep 17 00:00:00 2001
From: wassname <1103714+wassname@users.noreply.github.com>
Date: Wed, 20 Aug 2025 14:21:57 +0800
Subject: [PATCH] readme

---
 README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 0e18cf1..5910b6c 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,6 @@
-An experiment to see how rating changed along a chain of thought
+An experiment to see how rating changes along a chain of thought
+
+It turns out it's quite unstable, depending on where the chain of thought goes, at least in 8B parameter sized models.
 
 ![alt text](img/README-1755664824339-image.png)