@@ -147,7 +147,7 @@ Because the different speeds are implemented as fractions the agents ability to
...
@@ -147,7 +147,7 @@ Because the different speeds are implemented as fractions the agents ability to
- Agent enters switch and choses to deviate left. Agent fractional speed is 1/4 and thus the agent will take for time steps to complete its journey through the cell. On the 4th time step the agent will leave the cell deviating left as chosen at the entry of the cell.
- Agent enters switch and choses to deviate left. Agent fractional speed is 1/4 and thus the agent will take for time steps to complete its journey through the cell. On the 4th time step the agent will leave the cell deviating left as chosen at the entry of the cell.
- All actions chosen by the agent during its travels within a cell are ignored
- All actions chosen by the agent during its travels within a cell are ignored
- Agents can make observations at any time step. Make sure to dscard observations without any information.
- Agents can make observations at any time step. Make sure to dscard observations without any information. See this [example](https://gitlab.aicrowd.com/flatland/baselines/blob/master/torch_training/training_navigation.py) for a simple implementation.
- The environment checks if agent is allowed to move to next cell only at the time of the switch to the next cell
- The environment checks if agent is allowed to move to next cell only at the time of the switch to the next cell