jack.butler
|
24b07523aa
|
add test for tokenizer matching behaviour
|
2023-02-10 09:47:20 +00:00 |
|
jack.butler
|
2fbf2fa457
|
add docstring info about tokenizer matching
|
2023-02-10 09:46:58 +00:00 |
|
jack.butler
|
090c5cbcc2
|
fix tokenizer matching and add tests
|
2023-02-09 18:47:38 +00:00 |
|
theblackcat102
|
59dbfea48f
|
Merge pull request #1262 from jackapbutler/create-tokeniser-configs
Add tokenizer config classes
|
2023-02-08 09:34:57 +08:00 |
|
hyunwoongko
|
cb722768f7
|
Apply pre-commit
|
2023-02-08 03:50:27 +09:00 |
|
hyunwoongko
|
44c555cad1
|
Add gelu fusion
|
2023-02-08 03:45:22 +09:00 |
|
jack.butler
|
eb1c4ada2a
|
rename arg name to model_name
|
2023-02-07 09:19:24 +00:00 |
|
jack.butler
|
dc7f255f01
|
add tokenizer config classes
|
2023-02-06 17:58:55 +00:00 |
|
Kian-Meng Ang
|
1e321a6fca
|
Fix typos (#1143)
Found via `codespell -S .mypy_cache,yarn.lock,*.json,*.ipynb -L
rouge,nam,vie`
|
2023-02-05 20:18:03 +01:00 |
|
sanagnos
|
222059b1b2
|
Merge pull request #991 from LAION-AI/rm-anthropic
Add anthropic RLHF dataset & deepspeed support for reward model
|
2023-01-29 13:41:55 +01:00 |
|
theblackcat102
|
0e024e3955
|
[fix] Add working A100 config for deberta-xxlarge (deepspeed stuck during evaluation, deadlock?)
|
2023-01-29 03:17:11 +00:00 |
|
theblackcat102
|
fdcc629678
|
[feature] add reddit eli5, asks, askh; bug fix
|
2023-01-29 03:05:56 +00:00 |
|
theblackcat102
|
def03d75d2
|
[fix] Trim anthropic dataset down to last 2 convo only
|
2023-01-28 03:44:38 +00:00 |
|
theblackcat102
|
f43435efc9
|
[feature] add deepspeed default stage 2 config
|
2023-01-28 00:56:52 +00:00 |
|
theblackcat102
|
2a2f34391a
|
[fix] Added support for deepspeed
|
2023-01-28 00:55:40 +00:00 |
|
theblackcat102
|
3215a7bbf8
|
[feature] add initial version of anthropic dataset
|
2023-01-25 05:03:43 +00:00 |
|
theblackcat102
|
b8990d9078
|
[fix] remove spaces in format_pair
|
2023-01-23 02:48:47 +00:00 |
|
theblackcat102
|
736f46fb00
|
[fix] prosocial dialogue format error
|
2023-01-22 14:00:20 +00:00 |
|
theblackcat102
|
f5b2a34857
|
[feature] add pythia and limit translation pair
|
2023-01-22 00:56:17 +00:00 |
|
theblackcat102
|
62a203fd8c
|
[feature] move data formatting into dataset, instead of collator
|
2023-01-21 03:31:35 +00:00 |
|
theblackcat102
|
aca3e9de89
|
[fix] wait it pass?
|
2023-01-20 07:26:26 +00:00 |
|
theblackcat102
|
22e3ab1a89
|
[fix] linter fix
|
2023-01-20 07:23:02 +00:00 |
|
theblackcat102
|
c255148dc6
|
[fix] Merge main branch update
|
2023-01-20 06:18:43 +00:00 |
|
theblackcat102
|
6cd62e3d48
|
[fix] Fix missing russian and update readme
|
2023-01-20 06:16:26 +00:00 |
|
theblackcat102
|
74cb9aaa5a
|
[feature] added translation, rallio instruct tuning dataset, prosocial for safety, new summary dataset
|
2023-01-20 03:02:07 +00:00 |
|
Sotirios Anagnostidis
|
5d9591d82c
|
added soda dialogue dataset
|
2023-01-19 14:25:19 +01:00 |
|
theblackcat102
|
670be60ca8
|
[fix] Fix config typo
|
2023-01-14 12:17:58 +00:00 |
|
theblackcat102
|
6f6c590e57
|
[fix] Disable task specific evaluation
|
2023-01-14 06:47:21 +00:00 |
|
theblackcat102
|
1546111094
|
[feature] added GSM8k and code refactoring
|
2023-01-14 06:24:47 +00:00 |
|
theblackcat102
|
3966024871
|
[fix] Fix summarizer bug and QA typo issue
|
2023-01-14 05:49:22 +00:00 |
|
theblackcat102
|
9451aff6cc
|
[fix] @ekurtulus major logic bug in summarization
|
2023-01-14 03:49:19 +00:00 |
|
Sotirios Anagnostidis
|
c8f47eef9f
|
precommits
|
2023-01-11 22:58:17 +01:00 |
|
Sotirios Anagnostidis
|
d46ff8c4ee
|
better logging with deepspeed
|
2023-01-11 22:48:02 +01:00 |
|
Sotirios Anagnostidis
|
6438fdbe2c
|
quantization from #582
|
2023-01-11 22:44:20 +01:00 |
|
Sotirios Anagnostidis
|
4a3ea0b033
|
refactoring, now running
|
2023-01-11 22:42:04 +01:00 |
|
ekurtulus
|
5b77dd2e9f
|
better
|
2023-01-11 11:37:27 +03:00 |
|
mrcabbage972
|
d95c741ea0
|
Fixing requirements file
|
2023-01-10 20:16:02 -05:00 |
|
mrcabbage972
|
67aeed2cd7
|
Adding override of 32-bit optimization for embedding layer
|
2023-01-09 23:03:29 -05:00 |
|
mrcabbage972
|
08bdadf222
|
Adding BNB 8-bit Adam
|
2023-01-09 22:07:06 -05:00 |
|
theblackcat102
|
a1e1445de9
|
[fix] evaluation dataset is incorrect in reward-model/trainer.py
|
2023-01-08 03:42:40 +00:00 |
|
theblackcat102
|
9d05b73efc
|
[fix] syntax error and some typing issue in py38
|
2023-01-08 02:15:08 +00:00 |
|
theblackcat102
|
116045915e
|
[fix] syntax error and some typing issue in py38
|
2023-01-08 02:14:49 +00:00 |
|
theblackcat102
|
9eb401c633
|
[fix] resolve conflict from main
|
2023-01-08 01:32:39 +00:00 |
|
Szymon Ożóg
|
f304921bd5
|
Added deberta configs
|
2023-01-07 16:36:55 +01:00 |
|
Szymon Ożóg
|
d0942a3256
|
Added option to use cosine scheduler
|
2023-01-07 16:36:38 +01:00 |
|
theblackcat102
|
3625f39948
|
[feature] Add GPTJ synthetic dataset, fix reference removal regex for webgpt
|
2023-01-07 01:36:27 +00:00 |
|
Sotirios Anagnostidis
|
d3952354e2
|
pre commits
|
2023-01-06 22:09:24 +01:00 |
|
Sotirios Anagnostidis
|
148244455c
|
refactor
|
2023-01-06 21:29:38 +01:00 |
|
Sotirios Anagnostidis
|
88ee3b3264
|
merge deepspeed
|
2023-01-06 21:28:26 +01:00 |
|
Sotirios Anagnostidis
|
f2b125cbe3
|
merge
|
2023-01-06 21:24:36 +01:00 |
|