diff --git a/doc/source/rllib-algorithms.rst b/doc/source/rllib-algorithms.rst index 575900776..afc37b120 100644 --- a/doc/source/rllib-algorithms.rst +++ b/doc/source/rllib-algorithms.rst @@ -278,12 +278,12 @@ RLlib's soft-actor critic implementation is ported from the `official SAC repo < Tuned examples: `Pendulum-v0 `__, `HalfCheetah-v3 `__ -**MuJoCo results @500k steps:** `more details `__ +**MuJoCo results @3M steps:** `more details `__ ============= ========== =================== MuJoCo env RLlib SAC Haarnoja et al SAC ============= ========== =================== -HalfCheetah 8752 ~9000 +HalfCheetah 13000 ~15000 ============= ========== =================== **SAC-specific configs** (see also `common configs `__):