mirror of https://github.com/wassname/Open-Assistant.git synced 2026-06-27 16:10:30 +08:00

Files

T

theblackcat102 59dbfea48f Merge pull request #1262 from jackapbutler/create-tokeniser-configs

Add tokenizer config classes

2023-02-08 09:34:57 +08:00

configs

Add gelu fusion

2023-02-08 03:45:22 +09:00

custom_datasets

Fix typos (#1143 )

2023-02-05 20:18:03 +01:00

models

better

2023-01-11 11:37:27 +03:00

tests

[feature] move data formatting into dataset, instead of collator

2023-01-21 03:31:35 +00:00

efficiency_utils.py

Apply pre-commit

2023-02-08 03:50:27 +09:00

losses.py

better

2023-01-11 11:37:27 +03:00

README.md

[fix] linter fix

2023-01-20 07:23:02 +00:00

requirements.txt

added soda dialogue dataset

2023-01-19 14:25:19 +01:00

trainer.py

Apply pre-commit

2023-02-08 03:50:27 +09:00

utils.py

rename arg name to model_name

2023-02-07 09:19:24 +00:00

README.md

Train using supervised examples

Requirements

wandb
evaluate
datasets
transformers
torch

Start training reward model

python trainer.py --configs defaults galactica-125

Dataset

For now we only support webgpt and summary dataset from OpenAI. Once open-asisstant dataset are available it will be added here.

Model

Normally you should be able to add new models in configs/config.yml

your-model-name:
  learning_rate: 2e-6
  model_name: <huggingface model name>
  weight_decay: 0.01
  max_length: 812
  warmup_steps: 600
  gradient_checkpointing: false
  gradient_accumulation_steps: 5
  per_device_train_batch_size: 4
  per_device_eval_batch_size: 4

python trainer.py --configs defaults your-model-name

However, if the model of your choice doesn't have pad_token, eos_token, sep_token, you have to update utils.py get_tokenizer to use the right token.

Deepspeed support

You can edit the configs/zero_config.json and use any stage you wish. The current config uses zero-stage 3. For more details on how to setup the config checkout this page

Once you are satisfy with your deepzero config, you can add --deepspeed flag at the end to trigger deepspeed

python trainer.py --configs defaults your-model-name --deepspeed

Dataset choices

To specify which translation pair for WMT and TED Talk translation simply add the supported language pair at the postfix

  datasets:
    - wmt2019_zh-en
    - wmt2019_ru-en
    - wmt2019_de-en
    - ted_trans_nl-en
    - ted_trans_de-ja

Currently only these languages are supported via prompt translation:

ar,de,fr,en,it,nl,tr,ru,ms,ko,ja,zh

Results

Experimental results in wandb here.

TODOS

decide on a model
Merge utils etc with reward model
Casual Modelling for GPT-JT does not leverage the bidirectional mask for the prompt? (https://huggingface.co/togethercomputer/GPT-JT-6B-v1)