Fix error with DeterministicPolicy

More pytorch-native way would be to use `Module.register_buffer()` method. In that case, buffer won't be used in parameters(), but will be converted to CUDA and CPU with `to()` call transparently.
2026-06-27 15:16:26 +08:00 · 2019-11-24 18:00:57 +03:00
parent cc42a1f31c
commit 3664ba4e60
1 changed files with 2 additions and 1 deletions
@@ -148,4 +148,5 @@ class DeterministicPolicy(nn.Module):
    def to(self, device):
        self.action_scale = self.action_scale.to(device)
        self.action_bias = self.action_bias.to(device)
-        return super(GaussianPolicy, self).to(device)
+        self.noise = self.noise.to(device)
+        return super(DeterministicPolicy, self).to(device)