From e7186a81342486b3bf0d4fed8360ec35eb7db02a Mon Sep 17 00:00:00 2001
From: Yu Meng <yumeng5@virginia.edu>
Date: Wed, 10 Jul 2024 00:33:48 -0400
Subject: [PATCH] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 411f0f5..3f19486 100644
--- a/README.md
+++ b/README.md
@@ -38,7 +38,7 @@ We used the following hyperparameters for training the released models (note tha
 | Llama3-Instruct   | 2.5 | 0.55 | 1e-6           |
 | Llama3-Instruct v0.2   | 10 | 0.3 | 1e-6           |
 
-For DPO, we use the following hyperparameters for training.
+For DPO, the best hyperparameters for each setting are as follows.
 | Setting                  | β | Learning Rate |
 |------------------------|------|---------------|
 | Mistral-Base           | 0.01 | 5e-7      |