Training in Flatland 2.0 Documentation
Write document on how to train agents properly in new environment. Here we need to show how to respect multi speed and malfunctions. These two new features mean that agents cannot act at all time steps. Thus during training we need to respect this.