23 Commits

Author SHA1 Message Date
wassname 3c7ee12182 wip 2024-06-07 06:28:59 +08:00
wassname 718e92a9a1 fix mem overflow, torchinfo 2024-06-07 06:00:35 +08:00
wassname ff14ca4639 trains 2024-06-03 20:09:54 +08:00
wassname 5a0e8dc5ac wip 2024-06-03 08:29:29 +08:00
NM512 a4fdfad938 bug fix for onehot distribution 2024-01-14 21:55:34 +09:00
NM512 7f66ed5333 erased unused options 2024-01-05 23:23:09 +09:00
NM512 a27711ab96 limit action values in sampling stage 2024-01-05 11:42:45 +09:00
NM512 a9e85e8b7c modified weight initialization 2024-01-05 10:46:54 +09:00
NM512 78e86703f4 modified loss calculation 2024-01-05 10:44:04 +09:00
NM512 e0487f8206 merged action head into MLP and modified configs 2024-01-05 10:26:48 +09:00
NM512 e0f2017e28 unified the place to initialize the latents 2024-01-05 10:09:13 +09:00
NM512 16635df3e4 removed scheduling function 2023-09-26 20:58:55 +09:00
NM512 3f6659d365 changed treatment of obs shape in minecraft 2023-08-03 08:12:44 +09:00
NM512 9c58ab62c0 introduced return used in author's code 2023-06-17 16:59:40 +09:00
NM512 f7c505579c erased unnecessary lines 2023-06-17 15:27:09 +09:00
NM512 02c3d45fcf modification of expl. 2023-05-21 08:17:47 +09:00
NM512 b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512 0eb66997fb learnable initial state options for RSSM 2023-04-29 07:54:03 +09:00
NM512 2a8b44eb0c erased unnecessary code 2023-04-27 07:42:08 +09:00
NM512 628b856c63 changed the discount head to predict terminal 2023-04-22 09:34:23 +09:00
NM512 942eae10a9 updated result, requirements and torch version 2023-03-24 07:51:57 +09:00
NM512 6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512 fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00