diff --git a/README.md b/README.md index 15030a3..36d74ef 100644 --- a/README.md +++ b/README.md @@ -77,7 +77,7 @@ python -m pip install flash-attn --no-build-isolation ## Training Scripts -We provide four training config files for the four training setups reported in our paper: +We provide four training config files for the four training setups reported in our paper. The training config is set for 8xH100 GPUs. You may need to adjust `num_processes` and `per_device_train_batch_size` based on your computation environment. * Mistral-Base: ```shell