Optimize Global Observation
Global Observation was taking about 3s / step for about 500 agents.
Adding a small fix to bring it down to 500ms / step for now, but we will have to rethink how global observation is generated. The ideal way would be to maintain a state of the env, and simply pass a reference to it.