Commit Graph

78 Commits

Author SHA1 Message Date
theblackcat102 736f46fb00 [fix] prosocial dialogue format error 2023-01-22 14:00:20 +00:00
theblackcat102 f5b2a34857 [feature] add pythia and limit translation pair 2023-01-22 00:56:17 +00:00
theblackcat102 62a203fd8c [feature] move data formatting into dataset, instead of collator 2023-01-21 03:31:35 +00:00
theblackcat102 aca3e9de89 [fix] wait it pass? 2023-01-20 07:26:26 +00:00
theblackcat102 22e3ab1a89 [fix] linter fix 2023-01-20 07:23:02 +00:00
theblackcat102 c255148dc6 [fix] Merge main branch update 2023-01-20 06:18:43 +00:00
theblackcat102 6cd62e3d48 [fix] Fix missing russian and update readme 2023-01-20 06:16:26 +00:00
theblackcat102 74cb9aaa5a [feature] added translation, rallio instruct tuning dataset, prosocial for safety, new summary dataset 2023-01-20 03:02:07 +00:00
Sotirios Anagnostidis 5d9591d82c added soda dialogue dataset 2023-01-19 14:25:19 +01:00
theblackcat102 670be60ca8 [fix] Fix config typo 2023-01-14 12:17:58 +00:00
theblackcat102 6f6c590e57 [fix] Disable task specific evaluation 2023-01-14 06:47:21 +00:00
theblackcat102 1546111094 [feature] added GSM8k and code refactoring 2023-01-14 06:24:47 +00:00
theblackcat102 3966024871 [fix] Fix summarizer bug and QA typo issue 2023-01-14 05:49:22 +00:00
theblackcat102 9451aff6cc [fix] @ekurtulus major logic bug in summarization 2023-01-14 03:49:19 +00:00
Sotirios Anagnostidis c8f47eef9f precommits 2023-01-11 22:58:17 +01:00
Sotirios Anagnostidis d46ff8c4ee better logging with deepspeed 2023-01-11 22:48:02 +01:00
Sotirios Anagnostidis 6438fdbe2c quantization from #582 2023-01-11 22:44:20 +01:00
Sotirios Anagnostidis 4a3ea0b033 refactoring, now running 2023-01-11 22:42:04 +01:00
ekurtulus 5b77dd2e9f better 2023-01-11 11:37:27 +03:00
mrcabbage972 d95c741ea0 Fixing requirements file 2023-01-10 20:16:02 -05:00
mrcabbage972 67aeed2cd7 Adding override of 32-bit optimization for embedding layer 2023-01-09 23:03:29 -05:00
mrcabbage972 08bdadf222 Adding BNB 8-bit Adam 2023-01-09 22:07:06 -05:00
theblackcat102 a1e1445de9 [fix] evaluation dataset is incorrect in reward-model/trainer.py 2023-01-08 03:42:40 +00:00
theblackcat102 9d05b73efc [fix] syntax error and some typing issue in py38 2023-01-08 02:15:08 +00:00
theblackcat102 116045915e [fix] syntax error and some typing issue in py38 2023-01-08 02:14:49 +00:00
theblackcat102 9eb401c633 [fix] resolve conflict from main 2023-01-08 01:32:39 +00:00
Szymon Ożóg f304921bd5 Added deberta configs 2023-01-07 16:36:55 +01:00
Szymon Ożóg d0942a3256 Added option to use cosine scheduler 2023-01-07 16:36:38 +01:00
theblackcat102 3625f39948 [feature] Add GPTJ synthetic dataset, fix reference removal regex for webgpt 2023-01-07 01:36:27 +00:00
Sotirios Anagnostidis d3952354e2 pre commits 2023-01-06 22:09:24 +01:00
Sotirios Anagnostidis 148244455c refactor 2023-01-06 21:29:38 +01:00
Sotirios Anagnostidis 88ee3b3264 merge deepspeed 2023-01-06 21:28:26 +01:00
Sotirios Anagnostidis f2b125cbe3 merge 2023-01-06 21:24:36 +01:00
Sotirios Anagnostidis 91853753a8 conf 2023-01-06 18:25:20 +01:00
Sotirios Anagnostidis ef02693ac9 quantization 2023-01-06 18:24:28 +01:00
theblackcat102 50e7472ae6 [fix] push fix by linter 2023-01-06 17:14:19 +00:00
theblackcat102 577b14a702 [fix] push fix by linter 2023-01-06 17:11:06 +00:00
theblackcat102 fc6eab9edc [fix] new code complete answer and update readme for clarity 2023-01-06 17:05:47 +00:00
theblackcat102 8e30b419bf [fix] new code complete answer 2023-01-06 16:54:46 +00:00
theblackcat102 b67181776a [feature] add deepspeed, rallio dialogue dataset and codegen parameters 2023-01-06 16:47:58 +00:00
theblackcat102 325c97857c Merge pull request #313 from bth5032/bth5032/78-blackcat-trainer
Bth5032/78 blackcat trainer
2023-01-05 10:50:04 +08:00
theblackcat102 93b2be918e Merge pull request #347 from LAION-AI/sft-gptjt-qa-labels
Sft gptjt qa labels
2023-01-05 10:49:51 +08:00
Bobak Hashemi 061d621953 removed old precommit pragma requirement 2023-01-04 20:51:19 -05:00
Sotirios Anagnostidis dfaa00dccc gptj 8bit 2023-01-05 00:33:16 +01:00
Bobak Hashemi da79aa04a0 Cleaned up default argument logic. 2023-01-03 21:45:16 -05:00
Bobak Hashemi 4569bcf354 fixed linting 2023-01-03 20:47:33 -05:00
theblackcat102 f7bd22246e [fix] rename old summary human feedback dataset to new one 2023-01-04 01:00:37 +00:00
Sotirios Anagnostidis c20dfaad5b pre-commits 2023-01-03 22:45:34 +01:00
Sotirios Anagnostidis 525e6964e8 requirements 2023-01-03 22:06:59 +01:00
Sotirios Anagnostidis 3a10e9412d Question-Answer special tokens 2023-01-03 22:02:32 +01:00