Slight help to find stations.
Right now an agent does not see the target when the target lies exactly at the edge of its perception range. If the target is encountered on the last step, the value computed for last_isTarget at line 364 of observation.py is the same as the default assigned at line 381, so the two cases cannot be told apart. Setting line 364 to root_observation[3] + num_steps - 1 helps with this edge case.
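
A minimal sketch of the collision, not the actual code in observation.py. It assumes the default at line 381 has the form root_observation[3] + num_steps and that line 364 produced the same number when the target was found on the last step. The function names and the sample numbers are hypothetical.

```python
def default_value(root_distance: int, num_steps: int) -> int:
    # Assumed form of the fallback at line 381 (hypothetical).
    return root_distance + num_steps


def target_value_before(root_distance: int, num_steps: int) -> int:
    # Assumed result of line 364 before the change when the target is
    # encountered on the last step: identical to the default.
    return root_distance + num_steps


def target_value_after(root_distance: int, num_steps: int) -> int:
    # Proposed line 364: one less than the default, so a target at the
    # edge of the perception range yields a distinct value.
    return root_distance + num_steps - 1


# Hypothetical numbers: root_observation[3] == 2, target found on step 5.
root_distance, num_steps = 2, 5

# Before: "target seen on the last step" looks like "nothing seen".
assert target_value_before(root_distance, num_steps) == default_value(root_distance, num_steps)

# After: the two cases are distinguishable.
assert target_value_after(root_distance, num_steps) != default_value(root_distance, num_steps)
```

Under these assumptions, subtracting 1 is enough to give the edge-of-range target a value different from the default.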