Open-Assistant/model/reward/instructor/TODO.md at 581f31203ca94de45ba55e12c0a8c1a944ca3a74

wassname/Open-Assistant

mirror of https://github.com/wassname/Open-Assistant.git synced 2026-07-03 17:10:10 +08:00

Gareth Davidson c3c7a1701a run prettier with new params

2023-01-01 20:57:35 +00:00


Some other reward features we can use

Finish classification feature
Summaries from human feedback

Incorporate the confidence score into reward model (RM) training, and ensure the output rank score correlates with confidence
Each labeling has a labeling note, basically comments by the labeler; not sure what else we can use
~~Use the scores for "overall", "accuracy", "coverage", and "coherence" from axis/evals to train an additional model (to rank additional aspects of the policy model)~~
- this should be placed under experimental_dataset.py
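
One possible way to fold the confidence score into RM training is to scale a pairwise ranking loss by the labeler's confidence, so that low-confidence comparisons pull on the model less. The sketch below is only an illustration under stated assumptions: the function name and the idea that confidence is normalized to [0, 1] are hypothetical, and nothing here is taken from the existing training code.

```python
import math


def confidence_weighted_pairwise_loss(score_preferred, score_rejected, confidence):
    """Pairwise logistic ranking loss scaled by labeler confidence.

    Assumption: `confidence` has already been normalized to [0, 1].
    """
    margin = score_preferred - score_rejected
    # -log(sigmoid(margin)), written in a numerically stable form
    loss = max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
    # Low-confidence labels contribute less to the total loss
    return confidence * loss
```

With this weighting, a comparison labeled with confidence 0 contributes nothing, while a fully confident one contributes the usual pairwise loss. Whether this is enough to make the rank score track confidence would still need to be checked empirically.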

Add support for the Anthropic dataset

The Anthropic dataset is more like a conversation tree, which is much more complex than a simple question-answer schema
- this is basically like the MCTS (Monte Carlo tree search) from AlphaZero.
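
To make the difference from a flat question-answer schema concrete, here is a minimal sketch of walking a conversation tree and listing every root-to-leaf dialogue. The `Node` class and its fields are hypothetical and only stand in for whatever structure the dataset loader ends up using; they are not the dataset's actual schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

Turn = Tuple[str, str]  # (role, text)


@dataclass
class Node:
    """Hypothetical conversation-tree node (illustration only)."""

    role: str
    text: str
    children: List["Node"] = field(default_factory=list)


def root_to_leaf_paths(node: Node, prefix: Optional[List[Turn]] = None) -> List[List[Turn]]:
    """Return every root-to-leaf path as a list of (role, text) turns."""
    path = (prefix or []) + [(node.role, node.text)]
    if not node.children:
        return [path]
    paths: List[List[Turn]] = []
    for child in node.children:
        # Each branch shares the same prefix but continues differently
        paths.extend(root_to_leaf_paths(child, path))
    return paths
```

Each returned path is one complete conversation, and sibling paths share a common prefix, which is the part a plain question-answer pair cannot represent.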