Nathan Lambert d2fed11ce0 update dir
2023-08-29 16:38:28 -07:00
2023-08-29 16:38:28 -07:00
2023-08-29 16:30:18 -07:00
2023-08-29 09:33:26 +02:00
2023-08-29 12:44:34 -07:00
2023-08-25 13:35:35 +02:00
2023-08-29 16:30:18 -07:00
2023-08-29 16:38:28 -07:00

The RLHF Handbook

Robust recipes for RLHF

Developing

To view this locally, run the following (need to point to a specific language if previewing only one section, preview/build_doc does not use the --language flag):

doc-builder preview rlhf-handbook {docs_dir} --not_python_module

Example docs_dir is ~/Documents/HuggingFace/dev/rlhf-handbook/chapters/en

Installation

Create a new conda environment with:

conda create -n rlhf-handbook python=3.10

Install the limited requirements with

pip install -r requirements.txt
S
Description
Robust recipes to align language models with human and AI preferences
Readme Apache-2.0 691 KiB
Languages
Python 64.1%
Jupyter Notebook 32.5%
Shell 2.7%
Makefile 0.7%