update readme

jack.butler
2023-02-10 15:18:31 +00:00
parent 5de1c02a7a
commit 2ebb336141
+13 -12
@@ -2,20 +2,21 @@
Trainer code based on huggingface. Compatible with deepspeed or accelerate
Requirements
```
wandb
evaluate
datasets
transformers
torch==1.12
```
Start training reward model
Install Python requirements
```bash
python trainer.py configs/electra-base-dis-webgpt.yml
pip install -r requirements.txt
```
Write or inherit a `configs/<config-name>.yml` file to store training
configuration details.
> The configuration file must have _at least_ all the keys present in dummy.yml
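The key requirement above can be checked before launching a run. The following is a minimal sketch, not part of `trainer.py`: `missing_keys` and `load_yaml` are hypothetical helper names, the check only compares top-level keys, and it assumes the config files are YAML mappings and that PyYAML is installed.

```python
from pathlib import Path


def missing_keys(reference: dict, candidate: dict) -> list:
    """Return the top-level keys of `reference` that `candidate` lacks, sorted."""
    return sorted(set(reference) - set(candidate), key=str)


def load_yaml(path) -> dict:
    """Load a YAML mapping from disk (assumes PyYAML; imported lazily)."""
    import yaml  # assumption: PyYAML is available in the environment

    with Path(path).open() as handle:
        return yaml.safe_load(handle) or {}
```

One possible use is to call `missing_keys(load_yaml(<path to dummy.yml>), load_yaml("configs/<config-name>.yml"))` and stop if the returned list is not empty.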
Run training procedure
```bash
python trainer.py configs/<config-name>.yml
```
Additional axis labeling: this outputs 4 summary quality evaluation metrics