mirror of
https://github.com/wassname/pytorch-soft-actor-critic.git
synced 2026-06-27 16:46:28 +08:00
Update README.md
This commit is contained in:
@@ -8,14 +8,12 @@ Learning with a Stochastic Actor](https://arxiv.org/pdf/1801.01290.pdf) -> [SAC_
|
||||
|
||||
### Requirements
|
||||
------------
|
||||
|
||||
- [mujoco-py](https://github.com/openai/mujoco-py)
|
||||
- [TensorboardX](https://github.com/lanpa/tensorboardX)
|
||||
- [PyTorch](http://pytorch.org/)
|
||||
- [mujoco-py](https://github.com/openai/mujoco-py)
|
||||
- [TensorboardX](https://github.com/lanpa/tensorboardX)
|
||||
- [PyTorch](http://pytorch.org/)
|
||||
|
||||
### Default Arguments and Usage
|
||||
-------------
|
||||
|
||||
------------
|
||||
### Usage
|
||||
|
||||
```
|
||||
@@ -29,25 +27,26 @@ usage: main.py [-h] [--env-name ENV_NAME] [--policy POLICY] [--eval EVAL]
|
||||
|
||||
(Note: There is no need for setting Temperature(`--alpha`) if `--automatic_entropy_tuning` is True.)
|
||||
|
||||
##### For SAC
|
||||
#### For SAC
|
||||
|
||||
```
|
||||
python main.py --env-name Humanoid-v2 --alpha 0.05
|
||||
```
|
||||
|
||||
##### For SAC (Hard Update)
|
||||
#### For SAC (Hard Update)
|
||||
|
||||
```
|
||||
python main.py --env-name Humanoid-v2 --alpha 0.05 --tau 1 --target_update_interval 1000
|
||||
```
|
||||
|
||||
##### For SAC (Deterministic, Hard Update)
|
||||
#### For SAC (Deterministic, Hard Update)
|
||||
|
||||
```
|
||||
python main.py --env-name Humanoid-v2 --policy Deterministic --tau 1 --target_update_interval 1000
|
||||
```
|
||||
|
||||
### Arguments
|
||||
------------
|
||||
```
|
||||
PyTorch Soft Actor-Critic Args
|
||||
|
||||
@@ -79,8 +78,6 @@ optional arguments:
|
||||
--cuda run on CUDA (default: False)
|
||||
```
|
||||
|
||||
------------
|
||||
|
||||
| Environment **(`--env-name`)**| Temperature **(`--alpha`)**|
|
||||
| --------------- | ------------- |
|
||||
| HalfCheetah-v2 | 0.2 |
|
||||
|
||||
Reference in New Issue
Block a user