diff --git a/torch_training/Multi_Agent_Training_Intro.md b/torch_training/Multi_Agent_Training_Intro.md index f9aaa215bc7c1fe3bc573926abfaa206a895a63c..569d7f03c1574d18ac5f2739439572c1ba652b53 100644 --- a/torch_training/Multi_Agent_Training_Intro.md +++ b/torch_training/Multi_Agent_Training_Intro.md @@ -245,9 +245,9 @@ We now use the normalized `agent_obs` for our training loop: Running the `multi_agent_training.py` file trains a simple agent to navigate to any random target within the railway network. After running you should see a learning curve similiar to this one: - +*Learning curve provided soon* and the agent behavior should look like this: - +*Gif provided soon*