Skip to content
Snippets Groups Projects
Commit 716119c9 authored by Egli Adrian (IT-SCI-API-PFI)'s avatar Egli Adrian (IT-SCI-API-PFI)
Browse files

typo

parent 8d6304b3
No related branches found
No related tags found
No related merge requests found
......@@ -189,7 +189,7 @@ class PPOAgent(Policy):
# The loss function is used to estimate the gardient and use the entropy function based
# heuristic to penalize the gradient function when the policy becomes deterministic this would let
# the gardient to become very flat and so the gradient is no longer useful.
# the gradient becomes very flat and so the gradient is no longer useful.
loss = \
-torch.min(surr1, surr2) \
+ self.weight_loss * self.loss_function(state_values, rewards) \
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment