mirror of
https://github.com/wassname/Open-Assistant.git
synced 2026-06-27 16:10:30 +08:00
update readme
@@ -2,20 +2,21 @@
Trainer code based on huggingface. Compatible with deepspeed or accelerate

Requirements

```
wandb
evaluate
datasets
transformers
torch==1.12
```

Start training reward model
Install Python requirements

```bash
python trainer.py configs/electra-base-dis-webgpt.yml
pip install -r requirements.txt
```

Write or inherit a `configs/<config-name>.yml` file to store training
configuration details.

> The configuration file must have _at least_ all the keys present in dummy.yml

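The "at least all the keys" rule can be checked before launching a run. The following is a minimal sketch, not the trainer's own validation (the source does not show any): the helper name `missing_keys` and the example key names are hypothetical, and only top-level keys are compared. In practice both mappings would be loaded from the YAML files rather than written inline.

```python
from typing import Any, Mapping, Set


def missing_keys(config: Mapping[str, Any], reference: Mapping[str, Any]) -> Set[str]:
    """Return the top-level keys of `reference` that are absent from `config`."""
    return set(reference) - set(config)


# Hypothetical stand-ins for the parsed contents of dummy.yml and a user config.
# The key names below are illustrative only, not taken from the repository.
reference = {"model_name": "dummy", "learning_rate": 1e-5, "batch_size": 8}
config = {"model_name": "my-model", "learning_rate": 3e-5}

missing = missing_keys(config, reference)
if missing:
    print("Config is missing keys:", sorted(missing))
else:
    print("Config has all required keys.")
```

Extra keys in the user config are allowed by this check, which matches the "at least" wording: only keys present in the reference but absent from the config are reported.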
Run training procedure

```bash
python trainer.py configs/<config-name>.yml
```

Additional axis labeling, this outputs 4 summary quality evaluation metrics
