Files
Open-Assistant/model/supervised_finetuning
Sotirios Anagnostidis c20dfaad5b pre-commits
2023-01-03 22:45:34 +01:00
..
2023-01-03 22:45:34 +01:00
2023-01-03 22:45:34 +01:00
2023-01-03 01:31:56 +01:00
2023-01-03 22:45:34 +01:00
2023-01-03 22:06:59 +01:00
2023-01-03 22:02:32 +01:00
2023-01-03 22:45:34 +01:00

Train using supervised examples

Requirements

wandb
evaluate
datasets
transformers
torch

Start training reward model

python trainer.py --configs defaults galactica-125

Dataset

For now we only support webgpt and summary dataset from OpenAI. Once open-asisstant dataset are available it will be added here.

Model

TBD

Results

Experimental results in wandb here.

TODOS