Files
Open-Assistant/model/reward/instructor
2023-01-02 00:01:45 +00:00
..
2023-01-01 11:43:15 +00:00
2023-01-02 00:01:45 +00:00
2023-01-02 00:01:45 +00:00
2023-01-02 00:01:45 +00:00
2023-01-02 00:01:45 +00:00
2023-01-01 20:57:35 +00:00
2023-01-01 13:28:48 +00:00
2023-01-01 20:57:35 +00:00
2023-01-02 00:01:45 +00:00
2023-01-02 00:01:45 +00:00

Sections to train Reward Model (RM)

Trainer code based on huggingface. Compatible with deepspeed or accelerate

Requirements

wandb
evaluate
datasets
transformers
torch==1.12

Start training reward model

python trainer.py configs/electra-base-dis-webgpt.yml

Additional axis labeling, this outputs a 4 summary quality evaluation metrics (score are normalized to 0-1 )

python summary_quality_trainer.py configs/test-bloomz-560m-quality.yml

The four summary are :

  • overall

  • accuracy

  • coverage

  • coherence

Dataset

For now we only supports webgpt and summary dataset from OpenAI. Once open-asisstant dataset are available it will be added here.

Model

Check out configs

Open-Assistant/model/reward/instructor/configs/
    bloomz-560m.yml
    electra-base-dis-webgpt.yml
    galactica-125m.yml
    galactica-1b.yml

You can add new huggingface model as you want.