Skip to content
Snippets Groups Projects
Commit 80d33766 authored by Erik Nygren's avatar Erik Nygren
Browse files

bugfix in observation and prediction for tree observation

parent 24cb4f84
No related branches found
No related tags found
No related merge requests found
...@@ -80,7 +80,6 @@ for trials in range(1, n_trials + 1): ...@@ -80,7 +80,6 @@ for trials in range(1, n_trials + 1):
# Environment step which returns the observations for all agents, their corresponding # Environment step which returns the observations for all agents, their corresponding
# reward and whether their are done # reward and whether their are done
next_obs, all_rewards, done, _ = env.step(action_dict) next_obs, all_rewards, done, _ = env.step(action_dict)
# Update replay buffer and train agent # Update replay buffer and train agent
for a in range(env.get_num_agents()): for a in range(env.get_num_agents()):
agent.step((obs[a], action_dict[a], all_rewards[a], next_obs[a], done[a])) agent.step((obs[a], action_dict[a], all_rewards[a], next_obs[a], done[a]))
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment