Commit Graph

163 Commits

Author SHA1 Message Date
pranz24 dbbbacc39d remove tensorboardX 2020-06-06 08:57:18 +05:30
pranz24 1a3f379b79 small cleanup 2020-06-06 00:38:20 +05:30
pranz24 a1e8d7319e fix for pytorch-1.5 & cleanup 2020-06-06 00:20:05 +05:30
pranz24 e961172767 fix for pytorch-1.5 2020-06-06 00:19:15 +05:30
Pranjal Tandon 847edf58a5 Merge pull request #28 from ihexx/patch-1
Update main.py
2020-04-02 05:58:25 +05:30
Gershom b298849694 Update main.py 2020-04-01 01:39:10 -07:00
Pranjal Tandon a45ed97761 Update README.md 2020-02-03 14:31:51 +05:30
Pranjal Tandon d25a856304 Update README.md 2020-02-03 14:25:16 +05:30
Pranjal Tandon 1b0087277d Update sac.py 2020-02-03 14:16:34 +05:30
Pranjal Tandon f1294bb974 Update README.md 2020-02-03 14:16:05 +05:30
Pranjal Tandon 269478d41d Update README.md 2020-02-03 14:11:57 +05:30
Pranjal Tandon 15f725e61c Update sac.py 2020-02-03 14:10:57 +05:30
Pranjal Tandon e687d35243 Update README.md 2020-02-03 14:08:50 +05:30
Pranjal Tandon 0da86b413f Update sac.py 2020-02-03 14:04:15 +05:30
Pranjal Tandon 589b56b264 Update sac.py 2020-02-03 14:00:34 +05:30
Pranjal Tandon 5189f44caa Update README.md 2020-02-03 13:55:23 +05:30
Pranjal Tandon 42d2ff08cb Update sac.py 2020-02-03 13:48:45 +05:30
Pranjal Tandon 73064f31ea Update main.py 2020-02-03 13:46:39 +05:30
Pranjal Tandon d8ba7370e5 Merge pull request #21 from Shmuma/patch-1
Fix error with DeterministicPolicy
2019-11-27 13:30:57 +05:30
Max Lapan 3664ba4e60 Fix error with DeterministicPolicy
More pytorch-native way would be to use `Module.register_buffer()` method. In that case, buffer won't be used in parameters(), but will be converted to CUDA and CPU with `to()` call transparently.
2019-11-24 18:00:57 +03:00
Pranjal Tandon cc42a1f31c Merge pull request #19 from fgolemo/patch-1
Update README.md
2019-09-30 08:05:36 +00:00
Florian Golemo b86fabc23c Update README.md
typo
2019-09-29 20:10:11 -04:00
pranz24 5663db7e22 Edit README.md & main.py 2019-09-16 16:42:30 +05:30
pranz24 a1fe838d64 Edit README.md & main.py 2019-09-16 16:40:12 +05:30
pranz24 92486c2498 Edit README.md & main.py 2019-09-16 16:34:56 +05:30
pranz24 6e49320c8c Edit README.md & main.py 2019-09-16 16:31:31 +05:30
pranz24 c2d50837db small fix 2019-09-10 22:29:35 +05:30
Pranjal Tandon 6b6f64db37 Merge pull request #15 from ku2482/master
Fix bugs of action re-scaling
2019-08-05 15:25:34 +05:30
Toshiki Watanabe d4cce3869e fix bugs 2019-07-23 11:59:59 +09:00
Toshiki Watanabe d3a6ffda45 Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic 2019-07-23 11:31:56 +09:00
Toshiki Watanabe ab2c461af0 fix bugs of action rescaling 2019-07-23 11:30:36 +09:00
Pranjal Tandon a40fe29ac6 Merge pull request #13 from ku2482/fix_normalized_actions
Fix normalized actions
2019-06-27 15:07:38 +05:30
Toshiki Watanabe 3f64157068 add action rescaling 2019-06-27 16:45:51 +09:00
Toshiki Watanabe 97ad6f2ff9 fix typo in README 2019-06-27 16:43:40 +09:00
Pranjal Tandon 56fe9033f9 Update README.md 2019-06-16 20:36:28 +05:30
Pranjal Tandon b65a61a289 Upgrade 2019-05-29 12:14:40 +05:30
Pranjal Tandon 7556ebab4c small update 2019-05-29 11:12:18 +05:30
Pranjal Tandon a83d48d752 Update README.md 2019-05-22 17:18:17 +05:30
pranz24 a7c5822024 Why? 2019-05-22 17:10:52 +05:30
Pranjal Tandon 2340ddfcde Update main.py 2019-05-22 11:42:27 +05:30
Pranjal Tandon 2cf792007f Update main.py 2019-05-21 12:39:16 +05:30
pranz24 076f46707d Add Normalized Actions 2019-05-20 13:26:49 +05:30
Pranjal Tandon f480391cfd Update main.py 2019-05-20 13:20:27 +05:30
pranz24 98b2cbfa7f Add Normalized Actions 2019-05-20 12:37:41 +05:30
pranz24 7ff1e2f4e4 remove reward list 2019-04-07 23:22:14 +05:30
pranz24 82d54f0f6a Clean Up 2019-04-07 12:39:47 +05:30
pranz24 86412e14e1 Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic 2019-04-06 20:42:05 +05:30
pranz24 8d3fc82d7d Clean Up 2019-04-06 20:40:48 +05:30
Pranjal Tandon 5b22889d9e Update README.md 2019-04-06 20:38:55 +05:30
pranz24 0d0be950c1 Clean Up 2019-04-06 18:56:01 +05:30