Commit Graph

75 Commits

Author SHA1 Message Date
wassname 32cbd80650 training llama 3.1 1b 2024-09-30 07:14:38 +00:00
Yu Meng ed54e415be Update README.md 2024-08-22 16:17:43 -04:00
Yu Meng 3bace0069b Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-08-22 16:06:09 -04:00
Yu Meng 2dcc4350f8 fix on-policy data order 2024-08-22 16:06:04 -04:00
Mengzhou Xia 4f80aa5f15 Update README.md 2024-08-06 16:39:29 -04:00
Yu Meng 54545e803b change attn implementation arg 2024-08-05 00:24:47 -04:00
Yu Meng f9f7042105 Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-08-04 23:12:31 -04:00
Yu Meng 1aa921d748 fix alignment-handbook version 2024-08-04 23:12:21 -04:00
Yu Meng dad051a2f6 Update README.md 2024-07-20 00:34:31 -04:00
Yu Meng 481df8ed57 Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-20 00:32:41 -04:00
Yu Meng ba76f2540b add data generation scripts 2024-07-20 00:32:36 -04:00
Yu Meng 44dcd63aaa Update README.md 2024-07-18 15:51:06 -04:00
Yu Meng e6b139711c license 2024-07-18 15:38:41 -04:00
Yu Meng d8e0dff9c2 simplifying generation 2024-07-18 15:19:05 -04:00
xiamengzhou 7f568be951 Update README.md
added AE2 leaderboard link
2024-07-18 09:06:11 -04:00
xiamengzhou fcdca83800 Update README.md 2024-07-17 15:37:03 -04:00
xiamengzhou 3690ad3b2e Update README.md 2024-07-17 13:37:04 -04:00
xiamengzhou 29ae22cc3c Update README.md 2024-07-17 13:35:24 -04:00
Yu Meng 039e23af55 Update README.md 2024-07-17 13:31:08 -04:00
Yu Meng 2f8e445daa Update README.md 2024-07-17 13:30:47 -04:00
Yu Meng 8b3c066509 Update README.md 2024-07-17 13:29:28 -04:00
Yu Meng 05f4f26872 add gemma training script 2024-07-17 11:44:49 -04:00
Yu Meng e6c3d771a2 Update README.md 2024-07-17 11:34:06 -04:00
xiamengzhou aa5c3062fc Added gemma models 2024-07-17 11:17:36 -04:00
xiamengzhou 17f3559c88 Update README.md
add caveat to using v0.2 SimPO models
2024-07-17 10:57:02 -04:00
xiamengzhou 356213df77 Update README.md 2024-07-17 08:44:31 -04:00
xiamengzhou 219e6c2ac9 Update README.md 2024-07-15 09:26:20 -04:00
Yu Meng 3e9c4cc3bd update trainer argument 2024-07-13 16:50:42 -04:00
Yu Meng 6c3757f3f2 Update README.md 2024-07-12 00:47:10 -04:00
Yu Meng da33c05f1b Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-12 00:38:55 -04:00
Yu Meng ad86d8adf9 add env file 2024-07-12 00:38:50 -04:00
Yu Meng e7186a8134 Update README.md 2024-07-10 00:33:48 -04:00
Yu Meng 3e5532e0b2 Update README.md 2024-07-09 18:05:29 -04:00
Yu Meng 26685dd9c6 Update README.md 2024-07-09 16:51:53 -04:00
Yu Meng 65a7ac97d5 Update README.md 2024-07-09 16:40:57 -04:00
Yu Meng 68cc8bc75a Merge branch 'main' of github.com:princeton-nlp/SimPO 2024-07-09 15:38:05 -04:00
Yu Meng 896942c7d2 script update 2024-07-09 15:37:59 -04:00
Yu Meng fa7d6e3b5b Update README.md 2024-07-09 15:07:40 -04:00
Yu Meng 0c89e67e72 Add files via upload 2024-07-09 12:00:16 -07:00
Yu Meng c19e21140e Add files via upload 2024-07-09 11:58:47 -07:00
Yu Meng 995fbaf260 Update README.md 2024-07-09 14:55:28 -04:00
Yu Meng 9ef5dcbd69 add v0.2 training script 2024-07-09 14:52:58 -04:00
Yu Meng 1da92c59cc Update README.md 2024-07-09 14:43:23 -04:00
Yu Meng b1be711212 Update README.md 2024-07-09 14:31:23 -04:00
Yu Meng 3235adf17a Update README.md 2024-07-09 14:29:01 -04:00
Yu Meng 2d5cbf1dad Update README.md 2024-07-09 14:24:12 -04:00
xiamengzhou 72511a5102 Update README.md 2024-07-09 12:15:37 -04:00
xiamengzhou 005072f88c Update README.md 2024-07-09 09:12:44 -04:00
xiamengzhou efe419e587 Merge branch 'main' of https://github.com/princeton-nlp/SimPO 2024-07-07 10:16:09 -04:00
xiamengzhou 9bd67dc925 update 2024-07-07 10:15:46 -04:00