Commits on Source (55)
-
MasterScrat authored691d1b64
-
Egli Adrian (IT-SCI-API-PFI) authored924ac3d6
-
Egli Adrian (IT-SCI-API-PFI) authored168d6728
-
Egli Adrian (IT-SCI-API-PFI) authored6c8a30f9
-
Egli Adrian (IT-SCI-API-PFI) authored83acaa40
-
Egli Adrian (IT-SCI-API-PFI) authored21a51891
-
Egli Adrian (IT-SCI-API-PFI) authored66929e4a
-
Egli Adrian (IT-SCI-API-PFI) authored1c60b970
-
Egli Adrian (IT-SCI-API-PFI) authoredcf80f503
-
Egli Adrian (IT-SCI-API-PFI) authored03748921
-
Egli Adrian (IT-SCI-API-PFI) authored52a015e1
-
Egli Adrian (IT-SCI-API-PFI) authored91c77d25
-
Egli Adrian (IT-SCI-API-PFI) authoredcb4df49c
-
Egli Adrian (IT-SCI-API-PFI) authoreded050703
-
Egli Adrian (IT-SCI-API-PFI) authored44fc3248
-
Egli Adrian (IT-SCI-API-PFI) authored3c443618
-
Egli Adrian (IT-SCI-API-PFI) authored155cd804
-
Egli Adrian (IT-SCI-API-PFI) authored0a273a94
-
Egli Adrian (IT-SCI-API-PFI) authored388822a0
-
Egli Adrian (IT-SCI-API-PFI) authorede28c57e5
-
Egli Adrian (IT-SCI-API-PFI) authoredf77cddf3
-
Egli Adrian (IT-SCI-API-PFI) authored6ebb521d
-
Egli Adrian (IT-SCI-API-PFI) authored7387e9a5
-
Egli Adrian (IT-SCI-API-PFI) authored7365be1a
-
Egli Adrian (IT-SCI-API-PFI) authored2ac37596
-
Egli Adrian (IT-SCI-API-PFI) authored6a2fd382
-
Egli Adrian (IT-SCI-API-PFI) authoreddac3eee5
-
Egli Adrian (IT-SCI-API-PFI) authored2a3839db
-
Egli Adrian (IT-SCI-API-PFI) authored409a87d4
-
Egli Adrian (IT-SCI-API-PFI) authoredaae6d260
-
Egli Adrian (IT-SCI-API-PFI) authoredd4efd784
-
Egli Adrian (IT-SCI-API-PFI) authored6155bff8
-
Egli Adrian (IT-SCI-API-PFI) authored
python reinforcement_learning/multi_agent_training.py --use_fast_tree_observation --checkpoint_interval 1000 -n 5000 --policy XXX
b89ffde5 -
Egli Adrian (IT-SCI-API-PFI) authoredd368be4f
-
Egli Adrian (IT-SCI-API-PFI) authoredcc46818b
-
Egli Adrian (IT-SCI-API-PFI) authored25878c72
-
Egli Adrian (IT-SCI-API-PFI) authorede4443c95
-
Egli Adrian (IT-SCI-API-PFI) authored97104dee
-
Egli Adrian (IT-SCI-API-PFI) authored007611b3
-
Egli Adrian (IT-SCI-API-PFI) authored3f52dbd4
-
Egli Adrian (IT-SCI-API-PFI) authoredce45a97b
-
Egli Adrian (IT-SCI-API-PFI) authoredc834f7f3
-
Egli Adrian (IT-SCI-API-PFI) authoredee47292e
-
Egli Adrian (IT-SCI-API-PFI) authoredb99d9b01
-
Egli Adrian (IT-SCI-API-PFI) authored59ff8a13
-
adrian_egli authored
Feature/dead lock avoidance rl post challenge See merge request adrian_egli/neurips2020-flatland-starter-kit!1
b5ebe634 -
Egli Adrian (IT-SCI-API-PFI) authored9818ac72
-
Egli Adrian (IT-SCI-API-PFI) authored9f5d3406
-
Egli Adrian (IT-SCI-API-PFI) authored1b0b5005
-
Egli Adrian (IT-SCI-API-PFI) authored52e6334c
-
adrian_egli2 authored3c721de5
-
adrian_egli2 authoredd387c2e6
-
adrian_egli2 authoreded89570e
-
adrian_egli2 authored0006c040
Showing
- README.md 151 additions, 25 deletionsREADME.md
- checkpoints/201124171810-7800.pth.local 0 additions, 0 deletionscheckpoints/201124171810-7800.pth.local
- checkpoints/201124171810-7800.pth.target 0 additions, 0 deletionscheckpoints/201124171810-7800.pth.target
- checkpoints/210122120236-3000.pth.local 0 additions, 0 deletionscheckpoints/210122120236-3000.pth.local
- checkpoints/210122120236-3000.pth.target 0 additions, 0 deletionscheckpoints/210122120236-3000.pth.target
- checkpoints/210122165109-5000.pth.local 0 additions, 0 deletionscheckpoints/210122165109-5000.pth.local
- checkpoints/210122165109-5000.pth.target 0 additions, 0 deletionscheckpoints/210122165109-5000.pth.target
- checkpoints/210122235754-5000.pth.actor 0 additions, 0 deletionscheckpoints/210122235754-5000.pth.actor
- checkpoints/210122235754-5000.pth.optimizer 0 additions, 0 deletionscheckpoints/210122235754-5000.pth.optimizer
- checkpoints/210122235754-5000.pth.value 0 additions, 0 deletionscheckpoints/210122235754-5000.pth.value
- dump.rdb 0 additions, 0 deletionsdump.rdb
- reinforcement_learning/dddqn_policy.py 10 additions, 59 deletionsreinforcement_learning/dddqn_policy.py
- reinforcement_learning/deadlockavoidance_with_decision_agent.py 85 additions, 0 deletions...rcement_learning/deadlockavoidance_with_decision_agent.py
- reinforcement_learning/evaluate_agent.py 14 additions, 7 deletionsreinforcement_learning/evaluate_agent.py
- reinforcement_learning/multi_agent_training.py 67 additions, 48 deletionsreinforcement_learning/multi_agent_training.py
- reinforcement_learning/multi_decision_agent.py 90 additions, 0 deletionsreinforcement_learning/multi_decision_agent.py
- reinforcement_learning/multi_policy.py 6 additions, 5 deletionsreinforcement_learning/multi_policy.py
- reinforcement_learning/ordered_policy.py 1 addition, 1 deletionreinforcement_learning/ordered_policy.py
- reinforcement_learning/policy.py 32 additions, 8 deletionsreinforcement_learning/policy.py
- reinforcement_learning/ppo_agent.py 106 additions, 45 deletionsreinforcement_learning/ppo_agent.py
File deleted
File deleted
checkpoints/210122120236-3000.pth.local
0 → 100644
File added
checkpoints/210122120236-3000.pth.target
0 → 100644
File added
checkpoints/210122165109-5000.pth.local
0 → 100644
File added
checkpoints/210122165109-5000.pth.target
0 → 100644
File added
checkpoints/210122235754-5000.pth.actor
0 → 100644
File added
checkpoints/210122235754-5000.pth.optimizer
0 → 100644
File added
checkpoints/210122235754-5000.pth.value
0 → 100644
File added
dump.rdb
deleted
100644 → 0
File deleted