Commit Graph

  • 934a2ffb5d wip main wassname 2025-06-02 05:26:08 +00:00
  • 6b882373bb to uv wassname 2025-06-02 05:04:46 +00:00
  • dda269f58a 3b wassname 2024-10-01 21:39:47 +00:00
  • dedc6adcd2 3b and fix just wassname 2024-10-01 20:31:48 +00:00
  • 53af87561f bs wassname 2024-09-30 12:56:06 +00:00
  • 5a05456c93 misc wassname 2024-09-30 12:05:26 +00:00
  • 8e423c7b8b wip wassname 2024-09-30 07:46:12 +00:00
  • 32cbd80650 training llama 3.1 1b wassname 2024-09-30 07:14:38 +00:00
  • ed54e415be Update README.md Yu Meng 2024-08-22 16:17:43 -04:00
  • 3bace0069b Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-08-22 16:06:09 -04:00
  • 2dcc4350f8 fix on-policy data order Yu Meng 2024-08-22 16:06:04 -04:00
  • 4f80aa5f15 Update README.md Mengzhou Xia 2024-08-06 16:39:29 -04:00
  • 54545e803b change attn implementation arg Yu Meng 2024-08-05 00:24:47 -04:00
  • f9f7042105 Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-08-04 23:12:31 -04:00
  • 1aa921d748 fix alignment-handbook version Yu Meng 2024-08-04 23:12:21 -04:00
  • dad051a2f6 Update README.md Yu Meng 2024-07-20 00:34:31 -04:00
  • 481df8ed57 Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-07-20 00:32:41 -04:00
  • ba76f2540b add data generation scripts Yu Meng 2024-07-20 00:32:36 -04:00
  • 44dcd63aaa Update README.md Yu Meng 2024-07-18 15:51:06 -04:00
  • e6b139711c license Yu Meng 2024-07-18 15:38:41 -04:00
  • d8e0dff9c2 simplifying generation Yu Meng 2024-07-18 15:19:05 -04:00
  • 7f568be951 Update README.md xiamengzhou 2024-07-18 09:06:11 -04:00
  • fcdca83800 Update README.md xiamengzhou 2024-07-17 15:37:03 -04:00
  • 3690ad3b2e Update README.md xiamengzhou 2024-07-17 13:37:04 -04:00
  • 29ae22cc3c Update README.md xiamengzhou 2024-07-17 13:35:24 -04:00
  • 039e23af55 Update README.md Yu Meng 2024-07-17 13:31:08 -04:00
  • 2f8e445daa Update README.md Yu Meng 2024-07-17 13:30:47 -04:00
  • 8b3c066509 Update README.md Yu Meng 2024-07-17 13:29:28 -04:00
  • 05f4f26872 add gemma training script Yu Meng 2024-07-17 11:44:49 -04:00
  • e6c3d771a2 Update README.md Yu Meng 2024-07-17 11:34:06 -04:00
  • aa5c3062fc Added gemma models xiamengzhou 2024-07-17 11:17:36 -04:00
  • 17f3559c88 Update README.md xiamengzhou 2024-07-17 10:57:02 -04:00
  • 356213df77 Update README.md xiamengzhou 2024-07-17 08:44:31 -04:00
  • 219e6c2ac9 Update README.md xiamengzhou 2024-07-15 09:26:20 -04:00
  • 3e9c4cc3bd update trainer argument Yu Meng 2024-07-13 16:50:42 -04:00
  • 6c3757f3f2 Update README.md Yu Meng 2024-07-12 00:47:10 -04:00
  • da33c05f1b Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-07-12 00:38:55 -04:00
  • ad86d8adf9 add env file Yu Meng 2024-07-12 00:38:50 -04:00
  • e7186a8134 Update README.md Yu Meng 2024-07-10 00:33:48 -04:00
  • 3e5532e0b2 Update README.md Yu Meng 2024-07-09 18:05:29 -04:00
  • 26685dd9c6 Update README.md Yu Meng 2024-07-09 16:51:53 -04:00
  • 65a7ac97d5 Update README.md Yu Meng 2024-07-09 16:40:57 -04:00
  • 68cc8bc75a Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-07-09 15:38:05 -04:00
  • 896942c7d2 script update Yu Meng 2024-07-09 15:37:59 -04:00
  • fa7d6e3b5b Update README.md Yu Meng 2024-07-09 15:07:40 -04:00
  • 0c89e67e72 Add files via upload Yu Meng 2024-07-09 12:00:16 -07:00
  • c19e21140e Add files via upload Yu Meng 2024-07-09 11:58:47 -07:00
  • 995fbaf260 Update README.md Yu Meng 2024-07-09 14:55:28 -04:00
  • 9ef5dcbd69 add v0.2 training script Yu Meng 2024-07-09 14:52:58 -04:00
  • 1da92c59cc Update README.md Yu Meng 2024-07-09 14:43:23 -04:00
  • b1be711212 Update README.md Yu Meng 2024-07-09 14:31:23 -04:00
  • 3235adf17a Update README.md Yu Meng 2024-07-09 14:29:01 -04:00
  • 2d5cbf1dad Update README.md Yu Meng 2024-07-09 14:24:12 -04:00
  • 72511a5102 Update README.md xiamengzhou 2024-07-09 12:15:37 -04:00
  • 005072f88c Update README.md xiamengzhou 2024-07-09 09:12:44 -04:00
  • efe419e587 Merge branch 'main' of https://github.com/princeton-nlp/SimPO xiamengzhou 2024-07-07 10:16:09 -04:00
  • 9bd67dc925 update xiamengzhou 2024-07-07 10:15:46 -04:00
  • daed53ee55 Update README.md Yu Meng 2024-07-06 23:44:55 -04:00
  • 26cbb4a033 Update generate.py for demo Yu Meng 2024-07-06 23:34:34 -04:00
  • 15c4ff8918 update trainer & hyperparameter gamma Yu Meng 2024-07-06 23:29:26 -04:00
  • a1d07195c2 Update README.md xiamengzhou 2024-07-03 17:31:27 -04:00
  • 4b96360354 update xiamengzhou 2024-07-03 09:44:33 -04:00
  • 4142d762bc Create llama3-nobos.txt xiamengzhou 2024-06-25 04:45:28 -04:00
  • 679676262e Update README.md xiamengzhou 2024-06-25 04:44:37 -04:00
  • c18247e882 Update README.md xiamengzhou 2024-06-25 04:06:37 -04:00
  • baf9b8428d Update README.md Yu Meng 2024-06-02 18:26:27 -04:00
  • 62088ca319 Update README.md Yu Meng 2024-06-02 18:25:09 -04:00
  • e36b6d5946 add mt-bench gpt-4 turbo reference answer Yu Meng 2024-06-02 17:57:32 -04:00
  • b982a84748 Merge branch 'main' of github.com:princeton-nlp/SimPO Yu Meng 2024-06-02 17:55:37 -04:00
  • 6cc70cec47 fix dataset Yu Meng 2024-06-02 17:55:15 -04:00
  • 26c060cbfc update xiamengzhou 2024-05-31 11:06:08 -04:00
  • 57312a6e3b fix double init Yu Meng 2024-05-31 03:03:58 -04:00
  • 1ebb93306d Merge pull request #11 from cameron-chen/resolve-import-error Yu Meng 2024-05-29 23:13:46 -04:00
  • a5d1c138fd resolve the import error and unexpected arugment cameron-chen 2024-05-29 06:31:13 +00:00
  • 4e5d6d7ea1 Merge pull request #1 from CrispStrobe/patch-1 Yu Meng 2024-05-25 10:36:40 -04:00
  • 17603840dc linkfix CrispStrobe 2024-05-25 08:33:03 +02:00
  • e80a0fc4f4 Update README.md Yu Meng 2024-05-24 10:28:52 -04:00
  • a53eb35980 Update README.md xiamengzhou 2024-05-24 00:00:53 -04:00
  • 84b9656ef6 Create generate.py xiamengzhou 2024-05-23 23:59:36 -04:00
  • 18c99b9e3b update with chat templates Yu Meng 2024-05-23 23:53:20 -04:00
  • 7da63e9a6e Update README.md xiamengzhou 2024-05-23 23:04:48 -04:00
  • 586e5e3d0a release Yu Meng 2024-05-23 22:53:13 -04:00