2023-04-22 11:17:56 +08:00
2023-04-22 11:17:56 +08:00

This is a list of resources for reinforcement learning from human feedback and other methods to instruct large language models.

Evaluation

There are multiple ways to formally evaluate LLM capabilities. Right now project generally use one of these 3 libraries. Personally I prefer Eleuther's work, but opinions and github stars are divided.

Training

Data

Data can generally be divided along two axis:

  • high quality 🗹 or Lower quality ☐
  • natural 🧑 or unnatural 🤖

Depending on your training objectives you will want lots of low quality instruction data, or a small amount of high quality data. Which should you use? Lets see what Anthropic have to say Askell et all Antrhopic]:

How can we improve the sample efficiency of preference modeling? We find that we can significantly improve sample efficiency using a preference model pre-training (PMP) stage of training, where we first pre-train on large public datasets that encode human preference information, such as Stack Exchange, Reddit, and Wikipedia edits, before finetuning on smaller datasets encoding more specific human preferences.

Natural 🧑 & High quality 🗹

Natural 🧑 & Lower quality ☐

Unnatural 🤖 & High quality 🗹

Unnatural 🤖 & Lower quality ☐

  • unnatural-instructions used above and GPT3 to make 256k examples
  • OIG - Open Instruction Generalist Dataset a compilation of ~43M instructions. "The OIG dataset is almost purely a synthetic data set created using data augmentation.""
    • note there is a higher quality subset OIG-small-chip2

Uncategorized

Finding more data

A great way to find new instruction datasets is to

Similar lists

S
Description
Lists of datasets, training, and evals for RLHF and similar
Readme 182 KiB