Commit Graph

172 Commits

Author SHA1 Message Date
wassname 0a71ce15c7 logging 2021-01-10 12:34:34 +08:00
wassname a2c113d754 misc 2021-01-09 20:22:39 +08:00
wassname 4248a88ea4 tune tau etc 2021-01-03 14:54:23 +08:00
wassname 59b845a8a1 play and gitignore 2021-01-03 13:06:01 +08:00
wassname 617ff797ba apple gym runs 2020-12-29 08:53:19 +08:00
wassname 10c6b6e595 load demonstrations use apple_gym 2020-12-29 07:58:53 +08:00
pranz24 1bd1158116 Fix inconsistent seeding & clean up code 2020-07-11 14:15:04 +05:30
pranz24 ec004304a9 I have no idea what I'm doing. 2020-06-06 22:56:10 +05:30
pranz24 e5c349f0b0 tensorboard cleanup 2020-06-06 09:34:03 +05:30
pranz24 dbbbacc39d remove tensorboardX 2020-06-06 08:57:18 +05:30
pranz24 1a3f379b79 small cleanup 2020-06-06 00:38:20 +05:30
pranz24 a1e8d7319e fix for pytorch-1.5 & cleanup 2020-06-06 00:20:05 +05:30
pranz24 e961172767 fix for pytorch-1.5 2020-06-06 00:19:15 +05:30
Pranjal Tandon 847edf58a5 Merge pull request #28 from ihexx/patch-1
Update main.py
2020-04-02 05:58:25 +05:30
Gershom b298849694 Update main.py 2020-04-01 01:39:10 -07:00
Pranjal Tandon a45ed97761 Update README.md 2020-02-03 14:31:51 +05:30
Pranjal Tandon d25a856304 Update README.md 2020-02-03 14:25:16 +05:30
Pranjal Tandon 1b0087277d Update sac.py 2020-02-03 14:16:34 +05:30
Pranjal Tandon f1294bb974 Update README.md 2020-02-03 14:16:05 +05:30
Pranjal Tandon 269478d41d Update README.md 2020-02-03 14:11:57 +05:30
Pranjal Tandon 15f725e61c Update sac.py 2020-02-03 14:10:57 +05:30
Pranjal Tandon e687d35243 Update README.md 2020-02-03 14:08:50 +05:30
Pranjal Tandon 0da86b413f Update sac.py 2020-02-03 14:04:15 +05:30
Pranjal Tandon 589b56b264 Update sac.py 2020-02-03 14:00:34 +05:30
Pranjal Tandon 5189f44caa Update README.md 2020-02-03 13:55:23 +05:30
Pranjal Tandon 42d2ff08cb Update sac.py 2020-02-03 13:48:45 +05:30
Pranjal Tandon 73064f31ea Update main.py 2020-02-03 13:46:39 +05:30
Pranjal Tandon d8ba7370e5 Merge pull request #21 from Shmuma/patch-1
Fix error with DeterministicPolicy
2019-11-27 13:30:57 +05:30
Max Lapan 3664ba4e60 Fix error with DeterministicPolicy
More pytorch-native way would be to use `Module.register_buffer()` method. In that case, buffer won't be used in parameters(), but will be converted to CUDA and CPU with `to()` call transparently.
2019-11-24 18:00:57 +03:00
Pranjal Tandon cc42a1f31c Merge pull request #19 from fgolemo/patch-1
Update README.md
2019-09-30 08:05:36 +00:00
Florian Golemo b86fabc23c Update README.md
typo
2019-09-29 20:10:11 -04:00
pranz24 5663db7e22 Edit README.md & main.py 2019-09-16 16:42:30 +05:30
pranz24 a1fe838d64 Edit README.md & main.py 2019-09-16 16:40:12 +05:30
pranz24 92486c2498 Edit README.md & main.py 2019-09-16 16:34:56 +05:30
pranz24 6e49320c8c Edit README.md & main.py 2019-09-16 16:31:31 +05:30
pranz24 c2d50837db small fix 2019-09-10 22:29:35 +05:30
Pranjal Tandon 6b6f64db37 Merge pull request #15 from ku2482/master
Fix bugs of action re-scaling
2019-08-05 15:25:34 +05:30
Toshiki Watanabe d4cce3869e fix bugs 2019-07-23 11:59:59 +09:00
Toshiki Watanabe d3a6ffda45 Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic 2019-07-23 11:31:56 +09:00
Toshiki Watanabe ab2c461af0 fix bugs of action rescaling 2019-07-23 11:30:36 +09:00
Pranjal Tandon a40fe29ac6 Merge pull request #13 from ku2482/fix_normalized_actions
Fix normalized actions
2019-06-27 15:07:38 +05:30
Toshiki Watanabe 3f64157068 add action rescaling 2019-06-27 16:45:51 +09:00
Toshiki Watanabe 97ad6f2ff9 fix typo in README 2019-06-27 16:43:40 +09:00
Pranjal Tandon 56fe9033f9 Update README.md 2019-06-16 20:36:28 +05:30
Pranjal Tandon b65a61a289 Upgrade 2019-05-29 12:14:40 +05:30
Pranjal Tandon 7556ebab4c small update 2019-05-29 11:12:18 +05:30
Pranjal Tandon a83d48d752 Update README.md 2019-05-22 17:18:17 +05:30
pranz24 a7c5822024 Why? 2019-05-22 17:10:52 +05:30
Pranjal Tandon 2340ddfcde Update main.py 2019-05-22 11:42:27 +05:30
Pranjal Tandon 2cf792007f Update main.py 2019-05-21 12:39:16 +05:30