Commit Graph

  • f28ca774ed smaller camera, detach critic inputs master wassname 2021-01-17 18:27:36 +08:00
  • 16ca1a351b misc wassname 2021-01-17 13:35:48 +08:00
  • 9ffe9fa9a2 tidy wassname 2021-01-17 11:53:18 +08:00
  • e4fd67f3b5 progbar wassname 2021-01-17 11:29:35 +08:00
  • 5534d4b078 prcoess_obs works, with training by both obs and critic wassname 2021-01-16 18:14:57 +08:00
  • 093876e414 process obs part1 wassname 2021-01-16 16:41:40 +08:00
  • 0805bfa98f logging wassname 2021-01-16 16:41:14 +08:00
  • 90d207ca9b logging wassname 2021-01-16 16:40:53 +08:00
  • cc6e0f2035 misc wassname 2021-01-16 08:35:11 +08:00
  • 0a71ce15c7 logging wassname 2021-01-10 12:34:34 +08:00
  • a2c113d754 misc wassname 2021-01-09 20:22:39 +08:00
  • 4248a88ea4 tune tau etc wassname 2021-01-03 14:54:23 +08:00
  • 59b845a8a1 play and gitignore wassname 2021-01-03 13:06:01 +08:00
  • 617ff797ba apple gym runs wassname 2020-12-29 08:53:19 +08:00
  • 10c6b6e595 load demonstrations use apple_gym wassname 2020-12-29 07:58:53 +08:00
  • ab1ac786ac Fix inconsistent seeding & clean up code SAC_V pranz24 2020-07-11 14:18:02 +05:30
  • 1bd1158116 Fix inconsistent seeding & clean up code pranz24 2020-07-11 14:15:04 +05:30
  • ec004304a9 I have no idea what I'm doing. pranz24 2020-06-06 22:56:10 +05:30
  • e5c349f0b0 tensorboard cleanup pranz24 2020-06-06 09:34:03 +05:30
  • 86422617e5 torch-1.5 update + tensorboardX removal pranz24 2020-06-06 09:32:55 +05:30
  • dbbbacc39d remove tensorboardX pranz24 2020-06-06 08:57:18 +05:30
  • 1a3f379b79 small cleanup pranz24 2020-06-06 00:38:20 +05:30
  • a1e8d7319e fix for pytorch-1.5 & cleanup pranz24 2020-06-06 00:20:05 +05:30
  • e961172767 fix for pytorch-1.5 pranz24 2020-06-06 00:19:15 +05:30
  • 847edf58a5 Merge pull request #28 from ihexx/patch-1 Pranjal Tandon 2020-04-02 05:58:25 +05:30
  • b298849694 Update main.py Gershom 2020-04-01 01:39:10 -07:00
  • a45ed97761 Update README.md Pranjal Tandon 2020-02-03 14:31:51 +05:30
  • d25a856304 Update README.md Pranjal Tandon 2020-02-03 14:25:16 +05:30
  • 1b0087277d Update sac.py Pranjal Tandon 2020-02-03 14:16:34 +05:30
  • f1294bb974 Update README.md Pranjal Tandon 2020-02-03 14:16:05 +05:30
  • 269478d41d Update README.md Pranjal Tandon 2020-02-03 14:11:57 +05:30
  • 15f725e61c Update sac.py Pranjal Tandon 2020-02-03 14:10:57 +05:30
  • e687d35243 Update README.md Pranjal Tandon 2020-02-03 14:08:50 +05:30
  • 0da86b413f Update sac.py Pranjal Tandon 2020-02-03 14:04:15 +05:30
  • 589b56b264 Update sac.py Pranjal Tandon 2020-02-03 14:00:34 +05:30
  • 5189f44caa Update README.md Pranjal Tandon 2020-02-03 13:55:23 +05:30
  • 42d2ff08cb Update sac.py Pranjal Tandon 2020-02-03 13:48:45 +05:30
  • 73064f31ea Update main.py Pranjal Tandon 2020-02-03 13:46:39 +05:30
  • d8ba7370e5 Merge pull request #21 from Shmuma/patch-1 Pranjal Tandon 2019-11-27 13:30:57 +05:30
  • 3664ba4e60 Fix error with DeterministicPolicy Max Lapan 2019-11-24 18:00:57 +03:00
  • cc42a1f31c Merge pull request #19 from fgolemo/patch-1 Pranjal Tandon 2019-09-30 08:05:36 +00:00
  • b86fabc23c Update README.md Florian Golemo 2019-09-29 20:10:11 -04:00
  • 5663db7e22 Edit README.md & main.py pranz24 2019-09-16 16:42:30 +05:30
  • a1fe838d64 Edit README.md & main.py pranz24 2019-09-16 16:40:12 +05:30
  • 92486c2498 Edit README.md & main.py pranz24 2019-09-16 16:34:56 +05:30
  • 6e49320c8c Edit README.md & main.py pranz24 2019-09-16 16:31:31 +05:30
  • c2d50837db small fix pranz24 2019-09-10 22:29:35 +05:30
  • 6b6f64db37 Merge pull request #15 from ku2482/master Pranjal Tandon 2019-08-05 15:25:34 +05:30
  • d4cce3869e fix bugs Toshiki Watanabe 2019-07-23 11:59:59 +09:00
  • d3a6ffda45 Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic Toshiki Watanabe 2019-07-23 11:31:56 +09:00
  • ab2c461af0 fix bugs of action rescaling Toshiki Watanabe 2019-07-23 11:30:36 +09:00
  • 2d6931705a why? ;( pranz24 2019-07-09 13:18:02 +05:30
  • 5c18ba7998 Fix normalized actions pranz24 2019-07-09 13:06:51 +05:30
  • a40fe29ac6 Merge pull request #13 from ku2482/fix_normalized_actions Pranjal Tandon 2019-06-27 15:07:38 +05:30
  • 3f64157068 add action rescaling Toshiki Watanabe 2019-06-27 16:45:51 +09:00
  • 97ad6f2ff9 fix typo in README Toshiki Watanabe 2019-06-27 16:43:40 +09:00
  • 56fe9033f9 Update README.md Pranjal Tandon 2019-06-16 20:36:28 +05:30
  • b65a61a289 Upgrade Pranjal Tandon 2019-05-29 12:14:40 +05:30
  • 7556ebab4c small update Pranjal Tandon 2019-05-29 11:12:18 +05:30
  • a83d48d752 Update README.md Pranjal Tandon 2019-05-22 17:18:17 +05:30
  • efe8d5a672 Why? pranz24 2019-05-22 17:14:34 +05:30
  • a7c5822024 Why? pranz24 2019-05-22 17:10:52 +05:30
  • 2340ddfcde Update main.py Pranjal Tandon 2019-05-22 11:42:27 +05:30
  • 2cf792007f Update main.py Pranjal Tandon 2019-05-21 12:39:16 +05:30
  • eabd181a21 small fix pranz24 2019-05-20 13:41:25 +05:30
  • e9cc3fd7e8 Add Normalized Actions pranz24 2019-05-20 13:29:23 +05:30
  • 076f46707d Add Normalized Actions pranz24 2019-05-20 13:26:49 +05:30
  • f480391cfd Update main.py Pranjal Tandon 2019-05-20 13:20:27 +05:30
  • 98b2cbfa7f Add Normalized Actions pranz24 2019-05-20 12:37:41 +05:30
  • 7c5f0cc3b2 Update main.py Pranjal Tandon 2019-05-20 12:29:11 +05:30
  • b6f23f761a Merge branch 'old' of https://github.com/pranz24/pytorch-soft-actor-critic into old pranz24 2019-05-20 12:25:44 +05:30
  • 9fc0b8af78 Add Normalized Actions pranz24 2019-05-20 12:22:14 +05:30
  • 192db2f0a1 Update main.py Pranjal Tandon 2019-05-18 22:22:57 +05:30
  • 7ff1e2f4e4 remove reward list pranz24 2019-04-07 23:22:14 +05:30
  • ed1d0a55d2 No need for list of rewards pranz24 2019-04-07 22:25:55 +05:30
  • 82d54f0f6a Clean Up pranz24 2019-04-07 12:39:47 +05:30
  • 311491796a new pranz24 2019-04-07 11:27:45 +05:30
  • e0ee7fcb83 Add Reg. Loss pranz24 2019-04-06 21:51:07 +05:30
  • 86412e14e1 Merge branch 'master' of https://github.com/pranz24/pytorch-soft-actor-critic pranz24 2019-04-06 20:42:05 +05:30
  • 8d3fc82d7d Clean Up pranz24 2019-04-06 20:40:48 +05:30
  • 5b22889d9e Update README.md Pranjal Tandon 2019-04-06 20:38:55 +05:30
  • 01f1793ca5 Clean Up pranz24 2019-04-06 20:33:13 +05:30
  • 56c367cced Add Value Function pranz24 2019-04-06 20:28:44 +05:30
  • 0d0be950c1 Clean Up pranz24 2019-04-06 18:56:01 +05:30
  • 0a56cba2be Update main.py Pranjal Tandon 2019-04-06 11:54:35 +05:30
  • 878dfe0f10 minor change pranz24 2019-04-06 04:36:00 +05:30
  • 491f4c4643 Remove Value Function pranz24 2019-04-06 04:03:18 +05:30
  • d273b0221c Update main.py Pranjal Tandon 2019-04-05 01:12:31 +05:30
  • ee617f3fed Update model.py Pranjal Tandon 2019-04-05 01:05:36 +05:30
  • 7e1496f87f Update model.py Pranjal Tandon 2019-04-05 00:30:31 +05:30
  • 80d4caec20 Update sac.py Pranjal Tandon 2019-04-05 00:29:54 +05:30
  • ac88237a28 minor changes pranz24 2019-04-04 23:55:53 +05:30
  • 8ffca0a34d Merge pull request #8 from jendelel/cuda_support Pranjal Tandon 2019-03-25 23:35:18 +05:30
  • 9e1eda980f Merge pull request #7 from jendelel/init_eval_state_bug Pranjal Tandon 2019-03-25 23:33:36 +05:30
  • b5501ffd6f Support for running on CUDA. Faster for simple environments. Lukas Jendele 2019-03-25 16:41:04 +01:00
  • 8ab8cd61f9 Fixed bug with double tensoring the initial state. Lukas Jendele 2019-03-25 16:39:13 +01:00
  • 3072d6b727 Update README.md Pranjal Tandon 2019-03-18 22:40:30 +05:30
  • f863895e30 Update README.md Pranjal Tandon 2019-03-18 22:39:30 +05:30
  • 2cebe8921c Update README.md Pranjal Tandon 2019-02-21 21:19:32 +05:30
  • f8b7fe9968 Update normalized_actions.py Pranjal Tandon 2019-02-20 14:44:44 +05:30