Commit Graph

39 Commits

Author SHA1 Message Date
theblackcat102 a1e1445de9 [fix] evaluation dataset is incorrect in reward-model/trainer.py 2023-01-08 03:42:40 +00:00
theblackcat102 9d05b73efc [fix] syntax error and some typing issue in py38 2023-01-08 02:15:08 +00:00
theblackcat102 116045915e [fix] syntax error and some typing issue in py38 2023-01-08 02:14:49 +00:00
theblackcat102 9eb401c633 [fix] resolve conflict from main 2023-01-08 01:32:39 +00:00
Szymon Ożóg f304921bd5 Added deberta configs 2023-01-07 16:36:55 +01:00
Szymon Ożóg d0942a3256 Added option to use cosine scheduler 2023-01-07 16:36:38 +01:00
theblackcat102 3625f39948 [feature] Add GPTJ synthetic dataset, fix reference removal regex for webgpt 2023-01-07 01:36:27 +00:00
theblackcat102 325c97857c Merge pull request #313 from bth5032/bth5032/78-blackcat-trainer
Bth5032/78 blackcat trainer
2023-01-05 10:50:04 +08:00
Bobak Hashemi 061d621953 removed old precommit pragma requirement 2023-01-04 20:51:19 -05:00
Bobak Hashemi da79aa04a0 Cleaned up default argument logic. 2023-01-03 21:45:16 -05:00
Bobak Hashemi 4569bcf354 fixed linting 2023-01-03 20:47:33 -05:00
theblackcat102 f7bd22246e [fix] rename old summary human feedback dataset to new one 2023-01-04 01:00:37 +00:00
Bobak Hashemi 45c147362e added precommit hooks and cleaned up configs for rankgen 2023-01-03 01:41:45 -05:00
Bobak Hashemi 568a42066a FP32 Training Works 2023-01-03 00:53:07 -05:00
Bobak Hashemi 34ab948ade testing rankgen integration into instructor trainer 2023-01-01 23:30:12 -05:00
Gareth Davidson 7000e10bc0 apply pre-commit rules 2023-01-02 00:01:45 +00:00
Yannic Kilcher 4841550cd4 Merge pull request #212 from bitplane/prettier-markdown
Format markdown with prettier --prose-wrap=always
2023-01-01 22:11:11 +01:00
Gareth Davidson c3c7a1701a run prettier with new params 2023-01-01 20:57:35 +00:00
Alexander Goryunov e871de693c A typo in import 2023-01-01 21:52:38 +02:00
theblackcat102 8f0028bc44 [fix] Fix provider 2023-01-01 13:28:48 +00:00
theblackcat102 28e0b4f770 [fix] Revert deleted vscode 2023-01-01 12:03:34 +00:00
theblackcat102 fe99b46f2e [fix] pre-commit update 2023-01-01 11:43:15 +00:00
theblackcat102 168e9ca6b3 [feature] added summary quality rater 2023-01-01 10:19:25 +00:00
theblackcat102 1197dccf11 [fix] dataset split name 2023-01-01 08:25:09 +00:00
theblackcat102 8b1553642f [feature] remove dependency to download hfsummary manually 2023-01-01 08:22:30 +00:00
theblackcat102 4b7f1f25a1 [fix] Use official split for eval 2023-01-01 03:07:40 +00:00
theblackcat102 a5a2625e2d [merge] most of the bugs should be fixed. #77 2023-01-01 02:55:54 +00:00
theblackcat102 e27a3eb3c7 [fix] Tidy up todo and trainer comments 2023-01-01 02:22:57 +00:00
theblackcat102 0119ee666b [feature] Add support for bloomz 2023-01-01 02:09:21 +00:00
theblackcat102 c5b31d0b9e [feature] update reamde 2022-12-31 18:20:41 +00:00
theblackcat102 ba336fb087 [fix] fix freeze top N layers 2022-12-31 17:43:27 +00:00
theblackcat102 918b7b7ec0 [feature] Add galactica training config 2023-01-01 01:25:53 +08:00
theblackcat102 24e06626f4 [fix] Fix missing configs 2022-12-31 17:04:44 +00:00
theblackcat102 f3c299757d [feature] added configs argument for parameters training and recording 2022-12-31 17:02:46 +00:00
theblackcat102 d2572d0323 [fix] Add drop_token_type to use galactica 2022-12-31 09:42:49 +00:00
theblackcat102 3a10f1024a [fix] Fix truncation in collate fn 2022-12-31 09:27:09 +00:00
theblackcat102 b2ef4695a0 [fix] Fix missing accuracy and eval loss 2022-12-31 03:47:54 +00:00
theblackcat102 bcd5c52b3b [feature] working trainer code 2022-12-31 03:02:10 +00:00
theblackcat102 ad98a28241 [feature] add rank dataset for webgpt and human feedback summary 2022-12-30 17:25:50 +00:00