Commit Graph

16 Commits

Author    SHA1        Message                                         Date
wassname  562c8fd0f0  docs: keep generated stats out of data          2026-06-13 19:12:24 +08:00
wassname  8dbc02066b  eval: rerun dual judges and refresh results     2026-06-13 19:12:24 +08:00
wassname  e2546fe0ab  eval: refine judge rubric and README baselines  2026-06-13 19:12:24 +08:00
wassname  ede354f07a  eval: add dual judges and controls              2026-06-13 19:12:24 +08:00
wassname  d1ee948760  tidy                                            2026-06-13 19:12:24 +08:00
wassname  f55ba7576f  misc                                            2026-06-13 17:36:16 +08:00
wassname  849b1de0b1  clarify persona template scoring                2026-06-13 15:28:53 +08:00
wassname  5b92bdf7a7  expand confound audit docs                      2026-06-13 14:43:03 +08:00
wassname  ae3fc096d7  add source urls and confound audits             2026-06-13 14:39:45 +08:00
wassname  de071e79ca  use normalized score components                 2026-06-13 14:34:02 +08:00
wassname  bce30daee9  make main dataset table human-facing            2026-06-13 14:28:10 +08:00
wassname  1461e930e5  simplify public readme                          2026-06-13 14:23:47 +08:00
wassname  6a19b65e49  add clean score tables                          2026-06-13 14:05:26 +08:00
wassname  9b1a6e7573  simplify public docs and parquet upload         2026-06-13 13:55:43 +08:00
wassname  4e27617821  add v2 candidate persona library                2026-06-13 10:09:32 +08:00
wassname  97ceaf5908  release persona steering template library       2026-06-13 10:05:35 +08:00