37 Commits

Author SHA1 Message Date
wassname 3c7ee12182 wip 2024-06-07 06:28:59 +08:00
wassname 718e92a9a1 fix mem overflow, torchinfo 2024-06-07 06:00:35 +08:00
wassname ff14ca4639 trains 2024-06-03 20:09:54 +08:00
wassname 5a0e8dc5ac wip 2024-06-03 08:29:29 +08:00
wassname f853c03f4b wip 2024-06-02 18:52:57 +08:00
NM512 2cfcaefea2 avoid mutable default argument 2024-03-11 06:21:35 +09:00
NM512 7f66ed5333 erased unused options 2024-01-05 23:23:09 +09:00
NM512 a27711ab96 limit action values in sampling stage 2024-01-05 11:42:45 +09:00
NM512 a9e85e8b7c modified weight initialization 2024-01-05 10:46:54 +09:00
NM512 78e86703f4 modified loss calculation 2024-01-05 10:44:04 +09:00
NM512 1002d8b115 avoid cyclic reference 2023-10-02 07:27:26 +09:00
NM512 f35480f2a6 policy is not given logs 2023-10-01 06:25:23 +09:00
NM512 d3576c5a98 added save and load for optimizers 2023-09-27 09:15:37 +09:00
NM512 16635df3e4 removed scheduling function 2023-09-26 20:58:55 +09:00
NM512 606ec8af8c added the option for a deterministic run 2023-08-16 21:46:06 +09:00
NM512 68096d1f62 added log for inventory items in minecraft 2023-08-16 15:52:33 +09:00
NM512 8c471e12d6 erased unnecessary lines of code 2023-08-05 21:11:34 +09:00
NM512 43e1b2ab88 fix bug when resetting envs at different time 2023-07-24 22:26:21 +09:00
NM512 12ed21e06d applied formatter 2023-07-23 22:02:06 +09:00
NM512 afa5ab988d introduced parallel processing for envs 2023-07-23 21:58:46 +09:00
NM512 106317015d erased unused lines of code 2023-07-22 21:20:55 +09:00
NM512 03d91cb2c1 make sure "is_first" is set 0 at beginning 2023-07-22 21:08:53 +09:00
NM512 f07d843953 erased unnecessary reward input 2023-07-22 20:53:43 +09:00
NM512 9ca5082da3 separated cache management of episode from env 2023-07-22 19:22:41 +09:00
NM512 88514ec022 removed unnecessary imports 2023-07-02 11:52:33 +09:00
NM512 0ae6d2d1e0 step-based counting 2023-07-02 11:51:11 +09:00
NM512 b408067d9a avoid DeprecationWarning 2023-06-18 17:18:24 +09:00
NM512 970d1dc3e9 bug fix of limits for trunc_normal_ 2023-06-17 15:28:26 +09:00
NM512 b984e69b6e added state input capability 2023-05-14 23:38:46 +09:00
NM512 e5e8bcb284 modified a variable name 2023-04-29 07:57:05 +09:00
NM512 1328ff1088 sampling from the replay buffer across episodes 2023-04-29 07:43:02 +09:00
NM512 432a359bcf put running episode into replay buffer 2023-04-24 06:25:17 +09:00
NM512 fba87a33e0 applied formatter to tools 2023-04-15 15:28:09 +09:00
NM512 55ed69bdf7 fix bug when using envs > 1 2023-04-15 15:25:25 +09:00
NM512 57ac1c11d3 replaced all tf function to torch 2023-04-03 08:06:34 +09:00
NM512 6273444394 modified based on author's implementation 2023-03-18 08:38:23 +09:00
NM512 fb5c21557a Initial Commit 2023-02-12 22:35:25 +09:00