Yu Meng
|
2dcc4350f8
|
fix on-policy data order
|
2024-08-22 16:06:04 -04:00 |
|
Yu Meng
|
54545e803b
|
change attn implementation arg
|
2024-08-05 00:24:47 -04:00 |
|
Yu Meng
|
f9f7042105
|
Merge branch 'main' of github.com:princeton-nlp/SimPO
|
2024-08-04 23:12:31 -04:00 |
|
Yu Meng
|
1aa921d748
|
fix alignment-handbook version
|
2024-08-04 23:12:21 -04:00 |
|
Yu Meng
|
dad051a2f6
|
Update README.md
|
2024-07-20 00:34:31 -04:00 |
|
Yu Meng
|
481df8ed57
|
Merge branch 'main' of github.com:princeton-nlp/SimPO
|
2024-07-20 00:32:41 -04:00 |
|
Yu Meng
|
ba76f2540b
|
add data generation scripts
|
2024-07-20 00:32:36 -04:00 |
|
Yu Meng
|
44dcd63aaa
|
Update README.md
|
2024-07-18 15:51:06 -04:00 |
|
Yu Meng
|
e6b139711c
|
license
|
2024-07-18 15:38:41 -04:00 |
|
Yu Meng
|
d8e0dff9c2
|
simplifying generation
|
2024-07-18 15:19:05 -04:00 |
|
xiamengzhou
|
7f568be951
|
Update README.md
added AE2 leaderboard link
|
2024-07-18 09:06:11 -04:00 |
|
xiamengzhou
|
fcdca83800
|
Update README.md
|
2024-07-17 15:37:03 -04:00 |
|
xiamengzhou
|
3690ad3b2e
|
Update README.md
|
2024-07-17 13:37:04 -04:00 |
|
xiamengzhou
|
29ae22cc3c
|
Update README.md
|
2024-07-17 13:35:24 -04:00 |
|
Yu Meng
|
039e23af55
|
Update README.md
|
2024-07-17 13:31:08 -04:00 |
|
Yu Meng
|
2f8e445daa
|
Update README.md
|
2024-07-17 13:30:47 -04:00 |
|
Yu Meng
|
8b3c066509
|
Update README.md
|
2024-07-17 13:29:28 -04:00 |
|
Yu Meng
|
05f4f26872
|
add gemma training script
|
2024-07-17 11:44:49 -04:00 |
|
Yu Meng
|
e6c3d771a2
|
Update README.md
|
2024-07-17 11:34:06 -04:00 |
|
xiamengzhou
|
aa5c3062fc
|
Added gemma models
|
2024-07-17 11:17:36 -04:00 |
|
xiamengzhou
|
17f3559c88
|
Update README.md
add caveat to using v0.2 SimPO models
|
2024-07-17 10:57:02 -04:00 |
|
xiamengzhou
|
356213df77
|
Update README.md
|
2024-07-17 08:44:31 -04:00 |
|
xiamengzhou
|
219e6c2ac9
|
Update README.md
|
2024-07-15 09:26:20 -04:00 |
|
Yu Meng
|
3e9c4cc3bd
|
update trainer argument
|
2024-07-13 16:50:42 -04:00 |
|
Yu Meng
|
6c3757f3f2
|
Update README.md
|
2024-07-12 00:47:10 -04:00 |
|
Yu Meng
|
da33c05f1b
|
Merge branch 'main' of github.com:princeton-nlp/SimPO
|
2024-07-12 00:38:55 -04:00 |
|
Yu Meng
|
ad86d8adf9
|
add env file
|
2024-07-12 00:38:50 -04:00 |
|
Yu Meng
|
e7186a8134
|
Update README.md
|
2024-07-10 00:33:48 -04:00 |
|
Yu Meng
|
3e5532e0b2
|
Update README.md
|
2024-07-09 18:05:29 -04:00 |
|
Yu Meng
|
26685dd9c6
|
Update README.md
|
2024-07-09 16:51:53 -04:00 |
|
Yu Meng
|
65a7ac97d5
|
Update README.md
|
2024-07-09 16:40:57 -04:00 |
|
Yu Meng
|
68cc8bc75a
|
Merge branch 'main' of github.com:princeton-nlp/SimPO
|
2024-07-09 15:38:05 -04:00 |
|
Yu Meng
|
896942c7d2
|
script update
|
2024-07-09 15:37:59 -04:00 |
|
Yu Meng
|
fa7d6e3b5b
|
Update README.md
|
2024-07-09 15:07:40 -04:00 |
|
Yu Meng
|
0c89e67e72
|
Add files via upload
|
2024-07-09 12:00:16 -07:00 |
|
Yu Meng
|
c19e21140e
|
Add files via upload
|
2024-07-09 11:58:47 -07:00 |
|
Yu Meng
|
995fbaf260
|
Update README.md
|
2024-07-09 14:55:28 -04:00 |
|
Yu Meng
|
9ef5dcbd69
|
add v0.2 training script
|
2024-07-09 14:52:58 -04:00 |
|
Yu Meng
|
1da92c59cc
|
Update README.md
|
2024-07-09 14:43:23 -04:00 |
|
Yu Meng
|
b1be711212
|
Update README.md
|
2024-07-09 14:31:23 -04:00 |
|
Yu Meng
|
3235adf17a
|
Update README.md
|
2024-07-09 14:29:01 -04:00 |
|
Yu Meng
|
2d5cbf1dad
|
Update README.md
|
2024-07-09 14:24:12 -04:00 |
|
xiamengzhou
|
72511a5102
|
Update README.md
|
2024-07-09 12:15:37 -04:00 |
|
xiamengzhou
|
005072f88c
|
Update README.md
|
2024-07-09 09:12:44 -04:00 |
|
xiamengzhou
|
efe419e587
|
Merge branch 'main' of https://github.com/princeton-nlp/SimPO
|
2024-07-07 10:16:09 -04:00 |
|
xiamengzhou
|
9bd67dc925
|
update
|
2024-07-07 10:15:46 -04:00 |
|
Yu Meng
|
daed53ee55
|
Update README.md
|
2024-07-06 23:44:55 -04:00 |
|
Yu Meng
|
26cbb4a033
|
Update generate.py for demo
|
2024-07-06 23:34:34 -04:00 |
|
Yu Meng
|
15c4ff8918
|
update trainer & hyperparameter gamma
|
2024-07-06 23:29:26 -04:00 |
|
xiamengzhou
|
a1d07195c2
|
Update README.md
|
2024-07-03 17:31:27 -04:00 |
|