Skip to content
Snippets Groups Projects
Commit ac0c34ec authored by Erik Nygren's avatar Erik Nygren :bullettrain_front:
Browse files

Update Getting_Started_Training.md

parent 9c673c27
No related branches found
No related tags found
No related merge requests found
...@@ -23,4 +23,30 @@ Here is a small example of a railway network with an agent in the top left corne ...@@ -23,4 +23,30 @@ Here is a small example of a railway network with an agent in the top left corne
As we move along the allowed transitions we build up a tree where a new node is created at every cell where the agent has different possibilities (Switch) or the target is reached. It is important to note that the tree observation is always build according to the orientation of the agent at a given node. This means that each node always has 4 branches coming from it in the directions *Left, Forward, Right and Backward*. These are illustrated with different colors in the figure below. The tree is build form the example rail above. Nodes where there are no possibilitis are fill with `-inf` and are not all shown here for simplicity. The tree however, always hase the same number of nodes for a given tree depth. As we move along the allowed transitions we build up a tree where a new node is created at every cell where the agent has different possibilities (Switch) or the target is reached. It is important to note that the tree observation is always build according to the orientation of the agent at a given node. This means that each node always has 4 branches coming from it in the directions *Left, Forward, Right and Backward*. These are illustrated with different colors in the figure below. The tree is build form the example rail above. Nodes where there are no possibilitis are fill with `-inf` and are not all shown here for simplicity. The tree however, always hase the same number of nodes for a given tree depth.
![Tree_Observation](https://i.imgur.com/VsUQOQz.png) ![Tree_Observation](https://i.imgur.com/VsUQOQz.png)
\ No newline at end of file
### Node Information
Each node is filled with information gathered along the path to the node. Currently each node contains 9 features:
- 1: if own target lies on the explored branch the current distance from the agent in number of cells is stored.
- 2: if another agents target is detected the distance in number of cells from the agents current locaiton is stored.
- 3: if another agent is detected the distance in number of cells from current agent position is stored.
- 4: possible conflict detected
tot_dist = Other agent predicts to pass along this cell at the same time as the agent, we store the distance in number of cells from current agent position 0 = No other agent reserve the same cell at similar time.
- 5: if an not usable switch (for agent) is detected we store the distance.
- 6: This feature stores the distance in number of cells to the next branching (current node)
- 7: minimum distance from node to the agent's target given the direction of the agent if this path is chosen
- 8: agent in the same direction
- n = number of agents present same direction (possible future use: number of other agents in the same direction in this branch)
- 0 = no agent present same direction
- 9: agent in the opposite direction
- n = number of agents present other direction than myself
- 0 = no agent present other direction than myself
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment