Showing changed files:
- checkpoints/ppo/README.md: 1 addition, 0 deletions
- checkpoints/ppo/model_checkpoint.meta: 0 additions, 0 deletions
- checkpoints/ppo/model_checkpoint.optimizer: 0 additions, 0 deletions
- checkpoints/ppo/model_checkpoint.policy: 0 additions, 0 deletions
- checkpoints/sample-checkpoint.pth: 0 additions, 0 deletions
- docker_run.sh: 18 additions, 18 deletions
- environment.yml: 9 additions, 82 deletions
- nets/training_5500.pth.local: 0 additions, 0 deletions
- nets/training_5500.pth.target: 0 additions, 0 deletions
- nets/training_best_0.626_agents_5276.pth.local: 0 additions, 0 deletions
- nets/training_best_0.626_agents_5276.pth.target: 0 additions, 0 deletions
- reinforcement_learning/__init__.py: 0 additions, 0 deletions
- reinforcement_learning/dddqn_policy.py: 164 additions, 0 deletions
- reinforcement_learning/deadlockavoidance_with_decision_agent.py: 85 additions, 0 deletions
- reinforcement_learning/evaluate_agent.py: 383 additions, 0 deletions
- reinforcement_learning/model.py: 31 additions, 0 deletions
- reinforcement_learning/multi_agent_training.py: 646 additions, 0 deletions
- reinforcement_learning/multi_decision_agent.py: 90 additions, 0 deletions
- reinforcement_learning/multi_policy.py: 63 additions, 0 deletions
- reinforcement_learning/ordered_policy.py: 34 additions, 0 deletions
checkpoints/ppo/README.md
0 → 100644
checkpoints/ppo/model_checkpoint.meta
0 → 100644
File added
checkpoints/ppo/model_checkpoint.optimizer
0 → 100644
File added
checkpoints/ppo/model_checkpoint.policy
0 → 100644
File added
checkpoints/sample-checkpoint.pth
0 → 100644
File added
nets/training_5500.pth.local
0 → 100644
File added
nets/training_5500.pth.target
0 → 100644
File added
File added
File added
reinforcement_learning/__init__.py
0 → 100644
reinforcement_learning/dddqn_policy.py
0 → 100644
reinforcement_learning/evaluate_agent.py
0 → 100644
reinforcement_learning/model.py
0 → 100644
reinforcement_learning/multi_policy.py
0 → 100644
reinforcement_learning/ordered_policy.py
0 → 100644