82 Commits

Author SHA1 Message Date
wassname 934a2ffb5d wip 2025-06-02 05:26:08 +00:00
wassname 6b882373bb to uv 2025-06-02 05:04:46 +00:00
wassname dda269f58a 3b 2024-10-01 21:39:47 +00:00
wassname dedc6adcd2 3b and fix just 2024-10-01 20:31:48 +00:00
wassname 53af87561f bs 2024-09-30 12:56:06 +00:00
wassname 5a05456c93 misc 2024-09-30 12:05:26 +00:00
wassname 8e423c7b8b wip 2024-09-30 07:46:12 +00:00
wassname 32cbd80650 training llama 3.1 1b 2024-09-30 07:14:38 +00:00
Yu Meng ed54e415be Update README.md 2024-08-22 16:17:43 -04:00
Yu Meng 3bace0069b Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-08-22 16:06:09 -04:00
Yu Meng 2dcc4350f8 fix on-policy data order 2024-08-22 16:06:04 -04:00
Mengzhou Xia 4f80aa5f15 Update README.md 2024-08-06 16:39:29 -04:00
Yu Meng 54545e803b change attn implementation arg 2024-08-05 00:24:47 -04:00
Yu Meng f9f7042105 Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-08-04 23:12:31 -04:00
Yu Meng 1aa921d748 fix alignment-handbook version 2024-08-04 23:12:21 -04:00
Yu Meng dad051a2f6 Update README.md 2024-07-20 00:34:31 -04:00
Yu Meng 481df8ed57 Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-20 00:32:41 -04:00
Yu Meng ba76f2540b add data generation scripts 2024-07-20 00:32:36 -04:00
Yu Meng 44dcd63aaa Update README.md 2024-07-18 15:51:06 -04:00
Yu Meng e6b139711c license 2024-07-18 15:38:41 -04:00
Yu Meng d8e0dff9c2 simplifying generation 2024-07-18 15:19:05 -04:00
xiamengzhou 7f568be951 Update README.md
added AE2 leaderboard link
2024-07-18 09:06:11 -04:00
xiamengzhou fcdca83800 Update README.md 2024-07-17 15:37:03 -04:00
xiamengzhou 3690ad3b2e Update README.md 2024-07-17 13:37:04 -04:00
xiamengzhou 29ae22cc3c Update README.md 2024-07-17 13:35:24 -04:00
Yu Meng 039e23af55 Update README.md 2024-07-17 13:31:08 -04:00
Yu Meng 2f8e445daa Update README.md 2024-07-17 13:30:47 -04:00
Yu Meng 8b3c066509 Update README.md 2024-07-17 13:29:28 -04:00
Yu Meng 05f4f26872 add gemma training script 2024-07-17 11:44:49 -04:00
Yu Meng e6c3d771a2 Update README.md 2024-07-17 11:34:06 -04:00
xiamengzhou aa5c3062fc Added gemma models 2024-07-17 11:17:36 -04:00
xiamengzhou 17f3559c88 Update README.md
add caveat to using v0.2 SimPO models
2024-07-17 10:57:02 -04:00
xiamengzhou 356213df77 Update README.md 2024-07-17 08:44:31 -04:00
xiamengzhou 219e6c2ac9 Update README.md 2024-07-15 09:26:20 -04:00
Yu Meng 3e9c4cc3bd update trainer argument 2024-07-13 16:50:42 -04:00
Yu Meng 6c3757f3f2 Update README.md 2024-07-12 00:47:10 -04:00
Yu Meng da33c05f1b Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-12 00:38:55 -04:00
Yu Meng ad86d8adf9 add env file 2024-07-12 00:38:50 -04:00
Yu Meng e7186a8134 Update README.md 2024-07-10 00:33:48 -04:00
Yu Meng 3e5532e0b2 Update README.md 2024-07-09 18:05:29 -04:00
Yu Meng 26685dd9c6 Update README.md 2024-07-09 16:51:53 -04:00
Yu Meng 65a7ac97d5 Update README.md 2024-07-09 16:40:57 -04:00
Yu Meng 68cc8bc75a Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-09 15:38:05 -04:00
Yu Meng 896942c7d2 script update 2024-07-09 15:37:59 -04:00
Yu Meng fa7d6e3b5b Update README.md 2024-07-09 15:07:40 -04:00
Yu Meng 0c89e67e72 Add files via upload 2024-07-09 12:00:16 -07:00
Yu Meng c19e21140e Add files via upload 2024-07-09 11:58:47 -07:00
Yu Meng 995fbaf260 Update README.md 2024-07-09 14:55:28 -04:00
Yu Meng 9ef5dcbd69 add v0.2 training script 2024-07-09 14:52:58 -04:00
Yu Meng 1da92c59cc Update README.md 2024-07-09 14:43:23 -04:00