diff --git a/torch_training/Getting_Started_Training.md b/torch_training/Getting_Started_Training.md index 273404195f5ac490c8c5080497fc6144bb087808..b4a5a6fd085d143aeb8f419b96633d6f73f92cab 100644 --- a/torch_training/Getting_Started_Training.md +++ b/torch_training/Getting_Started_Training.md @@ -50,3 +50,16 @@ Each node is filled with information gathered along the path to the node. Curren - 9: agent in the opposite direction - n = number of agents present other direction than myself - 0 = no agent present other direction than myself + +For training purposes the tree is flattend into a single array. + +## Training + +Let us now train a simle double dueling DQN agent to navigate to its target on flatland. We start by importing flatland +``` +from flatland.envs.generators import complex_rail_generator +from flatland.envs.observations import TreeObsForRailEnv +from flatland.envs.rail_env import RailEnv +from flatland.utils.rendertools import RenderTool +from utils.observation_utils import norm_obs_clip, split_tree +``` \ No newline at end of file