wassname/Open-Assistant

Fork 0

mirror of https://github.com/wassname/Open-Assistant.git synced 2026-06-27 16:10:30 +08:00

Files

T

History

theblackcat102 a1e1445de9 [fix] evaluation dataset is incorrect in reward-model/trainer.py

2023-01-08 03:42:40 +00:00

configs

Added deberta configs

2023-01-07 16:36:55 +01:00

tests

[feature] Add GPTJ synthetic dataset, fix reference removal regex for webgpt

2023-01-07 01:36:27 +00:00

cls_dataset.py

apply pre-commit rules

2023-01-02 00:01:45 +00:00

experimental_dataset.py

[fix] rename old summary human feedback dataset to new one

2023-01-04 01:00:37 +00:00

models.py

removed old precommit pragma requirement

2023-01-04 20:51:19 -05:00

rank_datasets.py

[fix] syntax error and some typing issue in py38

2023-01-08 02:15:08 +00:00

README.md

run prettier with new params

2023-01-01 20:57:35 +00:00

requirements.txt

fixed linting

2023-01-03 20:47:33 -05:00

summary_quality_trainer.py

apply pre-commit rules

2023-01-02 00:01:45 +00:00

TODO.md

run prettier with new params

2023-01-01 20:57:35 +00:00

trainer.py

[fix] evaluation dataset is incorrect in reward-model/trainer.py

2023-01-08 03:42:40 +00:00

utils.py

[feature] Add GPTJ synthetic dataset, fix reference removal regex for webgpt

2023-01-07 01:36:27 +00:00

README.md

Sections to train Reward Model (RM)

Trainer code based on huggingface. Compatible with deepspeed or accelerate

Requirements

wandb
evaluate
datasets
transformers
torch==1.12

Start training reward model

python trainer.py configs/electra-base-dis-webgpt.yml

Additional axis labeling, this outputs a 4 summary quality evaluation metrics (score are normalized to 0-1 )

python summary_quality_trainer.py configs/test-bloomz-560m-quality.yml

The four summary are :

overall
accuracy
coverage
coherence

Dataset

For now we only supports webgpt and summary dataset from OpenAI. Once open-asisstant dataset are available it will be added here.

Model

Check out configs

Open-Assistant/model/reward/instructor/configs/
    bloomz-560m.yml
    electra-base-dis-webgpt.yml
    galactica-125m.yml
    galactica-1b.yml

You can add new huggingface model as you want.