pytorch-soft-actor-critic

mirror of https://github.com/wassname/pytorch-soft-actor-critic.git synced 2026-06-27 18:06:10 +08:00

T

pranz24 ab1ac786ac Fix inconsistent seeding & clean up code

2020-07-11 14:18:02 +05:30

.gitignore

2020-07-11 14:18:02 +05:30

LICENSE

Initial commit

2018-08-31 17:23:01 +05:30

main.py

2020-07-11 14:18:02 +05:30

model.py

Add Reg. Loss

2019-04-06 21:51:07 +05:30

README.md

Fix normalized actions

2019-07-09 13:06:51 +05:30

replay_memory.py

2020-07-11 14:18:02 +05:30

sac.py

2020-06-06 09:32:55 +05:30

utils.py

Add files via upload

2018-08-31 17:25:08 +05:30

python main.py --env-name Humanoid-v2 --aplha 0.05

python main.py --env-name Humanoid-v2 --alpha 0.05 --tau 1 --target_update_interval 1000

Parameters	Value
Shared	-
optimizer	Adam
learning rate(`--lr`)	3x10⁻⁴
discount(`--gamma`) (γ)	0.99
replay buffer size(`--replay_size`)	1x10⁶
number of hidden layers (all networks)	2
number of hidden units per layer(`--hidden_size`)	256
number of samples per minibatch(`--batch_size`)	256
nonlinearity	ReLU
SAC	-
target smoothing coefficient(`--tau`) (τ)	0.005
target update interval(`--target_update_interval`)	1
gradient steps(`--updates_per_step`)	1
SAC (Hard Update)	-
target smoothing coefficient(`--tau`) (τ)	1
target update interval(`--target_update_interval`)	1000
gradient steps (except humanoids)(`--updates_per_step`)	4
gradient steps (humanoids)(`--updates_per_step`)	1