From e7186a81342486b3bf0d4fed8360ec35eb7db02a Mon Sep 17 00:00:00 2001 From: Yu Meng Date: Wed, 10 Jul 2024 00:33:48 -0400 Subject: [PATCH] Update README.md --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 411f0f5..3f19486 100644 --- a/README.md +++ b/README.md @@ -38,7 +38,7 @@ We used the following hyperparameters for training the released models (note tha | Llama3-Instruct | 2.5 | 0.55 | 1e-6 | | Llama3-Instruct v0.2 | 10 | 0.3 | 1e-6 | -For DPO, we use the following hyperparameters for training. +For DPO, the best hyperparameters for each setting are as follows. | Setting | β | Learning Rate | |------------------------|------|---------------| | Mistral-Base | 0.01 | 5e-7 |