reset() should take initial number of agents and not that of last generated rail
If we change number of agents, then a reset with new rail generation should try to reset the number of agents to the initial number of agents.
If we change number of agents, then a reset with new rail generation should try to reset the number of agents to the initial number of agents.