2023-08-25 13:35:35 +02:00

rlhf-handbook

Robust recipes for RLHF

Description
Robust recipes to align language models with human and AI preferences
Readme · Apache-2.0 license · 691 KiB
Languages
Python 64.1%
Jupyter Notebook 32.5%
Shell 2.7%
Makefile 0.7%