Update README.md

Pranjal Tandon
2020-02-03 14:08:50 +05:30
committed by GitHub
parent 0da86b413f
commit e687d35243
+8 -11
@@ -8,14 +8,12 @@ Learning with a Stochastic Actor](https://arxiv.org/pdf/1801.01290.pdf) -> [SAC_
### Requirements
------------
- [mujoco-py](https://github.com/openai/mujoco-py)
- [TensorboardX](https://github.com/lanpa/tensorboardX)
- [PyTorch](http://pytorch.org/)
- [mujoco-py](https://github.com/openai/mujoco-py)
- [TensorboardX](https://github.com/lanpa/tensorboardX)
- [PyTorch](http://pytorch.org/)
### Default Arguments and Usage
-------------
------------
### Usage
```
@@ -29,25 +27,26 @@ usage: main.py [-h] [--env-name ENV_NAME] [--policy POLICY] [--eval EVAL]
(Note: There is no need to set the temperature (`--alpha`) if `--automatic_entropy_tuning` is True.)
##### For SAC
#### For SAC
```
python main.py --env-name Humanoid-v2 --alpha 0.05
```
##### For SAC (Hard Update)
#### For SAC (Hard Update)
```
python main.py --env-name Humanoid-v2 --alpha 0.05 --tau 1 --target_update_interval 1000
```
##### For SAC (Deterministic, Hard Update)
#### For SAC (Deterministic, Hard Update)
```
python main.py --env-name Humanoid-v2 --policy Deterministic --tau 1 --target_update_interval 1000
```
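The "Hard Update" variants above differ from the default only in `--tau` and `--target_update_interval`. As a rough illustration of what these two flags conventionally mean in SAC-style code, the sketch below uses plain Python lists in place of network parameters. It is an assumption based on the flag names and the usual Polyak-averaged target update, not code taken from this repository; `soft_update` and `train` are hypothetical helper names.

```python
def soft_update(target, source, tau):
    """Blend source parameters into target: new = (1 - tau) * target + tau * source.

    With tau = 1 this reduces to a plain copy, i.e. a "hard" update.
    """
    return [(1.0 - tau) * t + tau * s for t, s in zip(target, source)]


def train(num_updates, tau, target_update_interval):
    """Toy loop: the target is refreshed only every `target_update_interval` steps."""
    source = [0.0, 0.0]  # stand-in for the critic parameters
    target = [0.0, 0.0]  # stand-in for the target-critic parameters
    for step in range(1, num_updates + 1):
        source = [p + 1.0 for p in source]  # placeholder for a gradient step
        if step % target_update_interval == 0:
            target = soft_update(target, source, tau)
    return source, target


if __name__ == "__main__":
    # Small tau, update every step: the target trails the source smoothly.
    print(train(num_updates=3, tau=0.5, target_update_interval=1))
    # tau = 1 with a long interval: the target is an occasional exact copy.
    print(train(num_updates=1000, tau=1.0, target_update_interval=1000))
```

Under this reading, `--tau 1 --target_update_interval 1000` copies the critic into its target once every 1000 updates, while a small `--tau` with an interval of 1 nudges the target slightly toward the critic on every update.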
### Arguments
------------
```
PyTorch Soft Actor-Critic Args
@@ -79,8 +78,6 @@ optional arguments:
--cuda run on CUDA (default: False)
```
------------
| Environment **(`--env-name`)**| Temperature **(`--alpha`)**|
| --------------- | ------------- |
| HalfCheetah-v2 | 0.2 |