Commit 43c04453 authored by adrian_egli2's avatar adrian_egli2
Browse files

Fix in documentation - reporter tgeorg-ethz Wrong values for alpha and beta in comment.
The values stated for alpha and beta in flatland/envs/rail_env are stated to be 1 for both in the comment but are 0 in the actual code a few lines below.
parent df578e04
Pipeline #11514 canceled with stages
......@@ -62,8 +62,8 @@ class RailEnv(Environment):
It costs each agent a step_penalty for every time-step taken in the environment. Independent of the movement
of the agent. Currently all other penalties such as penalty for stopping, starting and invalid actions are set to 0.
alpha = 1
beta = 1
alpha = 0
beta = 0
Reward function parameters:
- invalid_action_penalty = 0
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment