pranz24
dbbbacc39d
remove tensorboardX
2020-06-06 08:57:18 +05:30
pranz24
1a3f379b79
small cleanup
2020-06-06 00:38:20 +05:30
pranz24
a1e8d7319e
fix for pytorch-1.5 & cleanup
2020-06-06 00:20:05 +05:30
pranz24
e961172767
fix for pytorch-1.5
2020-06-06 00:19:15 +05:30
Pranjal Tandon
847edf58a5
Merge pull request #28 from ihexx/patch-1
Update main.py
2020-04-02 05:58:25 +05:30
Gershom
b298849694
Update main.py
2020-04-01 01:39:10 -07:00
Pranjal Tandon
a45ed97761
Update README.md
2020-02-03 14:31:51 +05:30
Pranjal Tandon
d25a856304
Update README.md
2020-02-03 14:25:16 +05:30
Pranjal Tandon
1b0087277d
Update sac.py
2020-02-03 14:16:34 +05:30
Pranjal Tandon
f1294bb974
Update README.md
2020-02-03 14:16:05 +05:30
Pranjal Tandon
269478d41d
Update README.md
2020-02-03 14:11:57 +05:30
Pranjal Tandon
15f725e61c
Update sac.py
2020-02-03 14:10:57 +05:30
Pranjal Tandon
e687d35243
Update README.md
2020-02-03 14:08:50 +05:30
Pranjal Tandon
0da86b413f
Update sac.py
2020-02-03 14:04:15 +05:30
Pranjal Tandon
589b56b264
Update sac.py
2020-02-03 14:00:34 +05:30
Pranjal Tandon
5189f44caa
Update README.md
2020-02-03 13:55:23 +05:30
Pranjal Tandon
42d2ff08cb
Update sac.py
2020-02-03 13:48:45 +05:30
Pranjal Tandon
73064f31ea
Update main.py
2020-02-03 13:46:39 +05:30
Pranjal Tandon
d8ba7370e5
Merge pull request #21 from Shmuma/patch-1
Fix error with DeterministicPolicy
2019-11-27 13:30:57 +05:30
Max Lapan
3664ba4e60
Fix error with DeterministicPolicy
A more PyTorch-native way would be to use the `Module.register_buffer()` method. In that case, the buffer won't be returned by `parameters()`, but `to()` will still move it between CUDA and CPU transparently.
2019-11-24 18:00:57 +03:00
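The behaviour described in the commit message above can be illustrated with a minimal sketch. This is not the repository's code: the class name `TinyDeterministicPolicy`, the buffer name `noise`, and the layer sizes are hypothetical, chosen only to show how `register_buffer()` keeps a tensor out of `parameters()` while still letting `to()` convert it.

```python
import torch
import torch.nn as nn


class TinyDeterministicPolicy(nn.Module):
    """Hypothetical stand-in for a policy that carries a non-trainable tensor."""

    def __init__(self, num_inputs: int, num_actions: int):
        super().__init__()
        self.mean = nn.Linear(num_inputs, num_actions)
        # Registered as a buffer: part of the module's state, but not a parameter,
        # so an optimizer built from parameters() will never update it.
        self.register_buffer("noise", torch.zeros(num_actions))

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return torch.tanh(self.mean(state))


policy = TinyDeterministicPolicy(num_inputs=3, num_actions=2)
param_names = [name for name, _ in policy.named_parameters()]
buffer_names = [name for name, _ in policy.named_buffers()]

# to() converts buffers together with parameters. A dtype change is used here
# so the sketch runs without a GPU; a device move is handled the same way.
policy.to(torch.float64)
noise_dtype = policy.noise.dtype
```

Here `param_names` lists only the linear layer's weight and bias, `buffer_names` lists `noise`, and `noise_dtype` follows the module after the `to()` call, so no manual override of `to()` is needed for the extra tensor.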
Pranjal Tandon
cc42a1f31c
Merge pull request #19 from fgolemo/patch-1
Update README.md
2019-09-30 08:05:36 +00:00
Florian Golemo
b86fabc23c
Update README.md
typo
2019-09-29 20:10:11 -04:00
pranz24
5663db7e22
Edit README.md & main.py
2019-09-16 16:42:30 +05:30
pranz24
a1fe838d64
Edit README.md & main.py
2019-09-16 16:40:12 +05:30
pranz24
92486c2498
Edit README.md & main.py
2019-09-16 16:34:56 +05:30
pranz24
6e49320c8c
Edit README.md & main.py
2019-09-16 16:31:31 +05:30
pranz24
c2d50837db
small fix
2019-09-10 22:29:35 +05:30
Pranjal Tandon
6b6f64db37
Merge pull request #15 from ku2482/master
Fix bugs of action re-scaling
2019-08-05 15:25:34 +05:30
Toshiki Watanabe
d4cce3869e
fix bugs
2019-07-23 11:59:59 +09:00
Toshiki Watanabe
d3a6ffda45
Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic
2019-07-23 11:31:56 +09:00
Toshiki Watanabe
ab2c461af0
fix bugs of action rescaling
2019-07-23 11:30:36 +09:00
Pranjal Tandon
a40fe29ac6
Merge pull request #13 from ku2482/fix_normalized_actions
Fix normalized actions
2019-06-27 15:07:38 +05:30
Toshiki Watanabe
3f64157068
add action rescaling
2019-06-27 16:45:51 +09:00
Toshiki Watanabe
97ad6f2ff9
fix typo in README
2019-06-27 16:43:40 +09:00
Pranjal Tandon
56fe9033f9
Update README.md
2019-06-16 20:36:28 +05:30
Pranjal Tandon
b65a61a289
Upgrade
2019-05-29 12:14:40 +05:30
Pranjal Tandon
7556ebab4c
small update
2019-05-29 11:12:18 +05:30
Pranjal Tandon
a83d48d752
Update README.md
2019-05-22 17:18:17 +05:30
pranz24
a7c5822024
Why?
2019-05-22 17:10:52 +05:30
Pranjal Tandon
2340ddfcde
Update main.py
2019-05-22 11:42:27 +05:30
Pranjal Tandon
2cf792007f
Update main.py
2019-05-21 12:39:16 +05:30
pranz24
076f46707d
Add Normalized Actions
2019-05-20 13:26:49 +05:30
Pranjal Tandon
f480391cfd
Update main.py
2019-05-20 13:20:27 +05:30
pranz24
98b2cbfa7f
Add Normalized Actions
2019-05-20 12:37:41 +05:30
pranz24
7ff1e2f4e4
remove reward list
2019-04-07 23:22:14 +05:30
pranz24
82d54f0f6a
Clean Up
2019-04-07 12:39:47 +05:30
pranz24
86412e14e1
Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic
2019-04-06 20:42:05 +05:30
pranz24
8d3fc82d7d
Clean Up
2019-04-06 20:40:48 +05:30
Pranjal Tandon
5b22889d9e
Update README.md
2019-04-06 20:38:55 +05:30
pranz24
0d0be950c1
Clean Up
2019-04-06 18:56:01 +05:30