Commits on Source (87)
- Egli Adrian (IT-SCI-API-PFI) authored 0d55f042
- Egli Adrian (IT-SCI-API-PFI) authored 57c1973b
- Egli Adrian (IT-SCI-API-PFI) authored a8631db6
- Egli Adrian (IT-SCI-API-PFI) authored c53739a9
- Egli Adrian (IT-SCI-API-PFI) authored b8fc444d
- Egli Adrian (IT-SCI-API-PFI) authored 225bbed9
- Egli Adrian (IT-SCI-API-PFI) authored 3b48de68
- Egli Adrian (IT-SCI-API-PFI) authored f310811b
- Egli Adrian (IT-SCI-API-PFI) authored 316a40d0
- Egli Adrian (IT-SCI-API-PFI) authored d3479be5
- Egli Adrian (IT-SCI-API-PFI) authored 2393654f
- Egli Adrian (IT-SCI-API-PFI) authored 0e7aa90c
- Egli Adrian (IT-SCI-API-PFI) authored 04a942e5
- Egli Adrian (IT-SCI-API-PFI) authored c12f806e
- Egli Adrian (IT-SCI-API-PFI) authored f4fca1d5
- Egli Adrian (IT-SCI-API-PFI) authored 0a0b7389
- Egli Adrian (IT-SCI-API-PFI) authored bfb8373a
- Egli Adrian (IT-SCI-API-PFI) authored 8749b02f
- Egli Adrian (IT-SCI-API-PFI) authored 016b9a58
- Egli Adrian (IT-SCI-API-PFI) authored 8cf48167
- Egli Adrian (IT-SCI-API-PFI) authored a854eed7
- Egli Adrian (IT-SCI-API-PFI) authored f1cb653e
- Egli Adrian (IT-SCI-API-PFI) authored 87273288
- Egli Adrian (IT-SCI-API-PFI) authored 769f25ec
- Egli Adrian (IT-SCI-API-PFI) authored 70393c79
- Egli Adrian (IT-SCI-API-PFI) authored c46cbfd9
- Egli Adrian (IT-SCI-API-PFI) authored 05f62176
- Egli Adrian (IT-SCI-API-PFI) authored 729722e3
- Egli Adrian (IT-SCI-API-PFI) authored 41d4b483
- Egli Adrian (IT-SCI-API-PFI) authored 8d6304b3
- Egli Adrian (IT-SCI-API-PFI) authored 716119c9
- MasterScrat authored 691d1b64
- Egli Adrian (IT-SCI-API-PFI) authored 1e092263
- Egli Adrian (IT-SCI-API-PFI) authored 924ac3d6
- Egli Adrian (IT-SCI-API-PFI) authored 168d6728
- Egli Adrian (IT-SCI-API-PFI) authored 6c8a30f9
- Egli Adrian (IT-SCI-API-PFI) authored 83acaa40
- Egli Adrian (IT-SCI-API-PFI) authored 21a51891
- Egli Adrian (IT-SCI-API-PFI) authored 66929e4a
- Egli Adrian (IT-SCI-API-PFI) authored 1c60b970
- Egli Adrian (IT-SCI-API-PFI) authored cf80f503
- Egli Adrian (IT-SCI-API-PFI) authored 03748921
- Egli Adrian (IT-SCI-API-PFI) authored 52a015e1
- Egli Adrian (IT-SCI-API-PFI) authored 91c77d25
- Egli Adrian (IT-SCI-API-PFI) authored cb4df49c
- Egli Adrian (IT-SCI-API-PFI) authored ed050703
- Egli Adrian (IT-SCI-API-PFI) authored 44fc3248
- Egli Adrian (IT-SCI-API-PFI) authored 3c443618
- Egli Adrian (IT-SCI-API-PFI) authored 155cd804
- Egli Adrian (IT-SCI-API-PFI) authored 0a273a94
- Egli Adrian (IT-SCI-API-PFI) authored 388822a0
- Egli Adrian (IT-SCI-API-PFI) authored e28c57e5
- Egli Adrian (IT-SCI-API-PFI) authored f77cddf3
- Egli Adrian (IT-SCI-API-PFI) authored 6ebb521d
- Egli Adrian (IT-SCI-API-PFI) authored 7387e9a5
- Egli Adrian (IT-SCI-API-PFI) authored 7365be1a
- Egli Adrian (IT-SCI-API-PFI) authored 2ac37596
- Egli Adrian (IT-SCI-API-PFI) authored 6a2fd382
- Egli Adrian (IT-SCI-API-PFI) authored dac3eee5
- Egli Adrian (IT-SCI-API-PFI) authored 2a3839db
- Egli Adrian (IT-SCI-API-PFI) authored 409a87d4
- Egli Adrian (IT-SCI-API-PFI) authored aae6d260
- Egli Adrian (IT-SCI-API-PFI) authored d4efd784
- Egli Adrian (IT-SCI-API-PFI) authored 6155bff8
- Egli Adrian (IT-SCI-API-PFI) authored b89ffde5
  python reinforcement_learning/multi_agent_training.py --use_fast_tree_observation --checkpoint_interval 1000 -n 5000 --policy XXX
- Egli Adrian (IT-SCI-API-PFI) authored d368be4f
- Egli Adrian (IT-SCI-API-PFI) authored cc46818b
- Egli Adrian (IT-SCI-API-PFI) authored 25878c72
- Egli Adrian (IT-SCI-API-PFI) authored e4443c95
- Egli Adrian (IT-SCI-API-PFI) authored 97104dee
- Egli Adrian (IT-SCI-API-PFI) authored 007611b3
- Egli Adrian (IT-SCI-API-PFI) authored 3f52dbd4
- Egli Adrian (IT-SCI-API-PFI) authored ce45a97b
- Egli Adrian (IT-SCI-API-PFI) authored c834f7f3
- Egli Adrian (IT-SCI-API-PFI) authored ee47292e
- Egli Adrian (IT-SCI-API-PFI) authored b99d9b01
- Egli Adrian (IT-SCI-API-PFI) authored 59ff8a13
- adrian_egli authored b5ebe634
  Feature/dead lock avoidance rl post challenge. See merge request adrian_egli/neurips2020-flatland-starter-kit!1
- Egli Adrian (IT-SCI-API-PFI) authored 9818ac72
- Egli Adrian (IT-SCI-API-PFI) authored 9f5d3406
- Egli Adrian (IT-SCI-API-PFI) authored 1b0b5005
- Egli Adrian (IT-SCI-API-PFI) authored 52e6334c
- adrian_egli2 authored 3c721de5
- adrian_egli2 authored d387c2e6
- adrian_egli2 authored ed89570e
- adrian_egli2 authored 0006c040
Showing
- README.md: 151 additions, 25 deletions
- checkpoints/201112143850-5400.pth.local: 0 additions, 0 deletions
- checkpoints/201112143850-5400.pth.target: 0 additions, 0 deletions
- checkpoints/201113211844-6100.pth.local: 0 additions, 0 deletions
- checkpoints/201113211844-6100.pth.target: 0 additions, 0 deletions
- checkpoints/210122120236-3000.pth.local: 0 additions, 0 deletions
- checkpoints/210122120236-3000.pth.target: 0 additions, 0 deletions
- checkpoints/210122165109-5000.pth.local: 0 additions, 0 deletions
- checkpoints/210122165109-5000.pth.target: 0 additions, 0 deletions
- checkpoints/210122235754-5000.pth.actor: 0 additions, 0 deletions
- checkpoints/210122235754-5000.pth.optimizer: 0 additions, 0 deletions
- checkpoints/210122235754-5000.pth.value: 0 additions, 0 deletions
- dump.rdb: 0 additions, 0 deletions
- reinforcement_learning/dddqn_policy.py: 28 additions, 77 deletions
- reinforcement_learning/deadlockavoidance_with_decision_agent.py: 85 additions, 0 deletions
- reinforcement_learning/evaluate_agent.py: 14 additions, 7 deletions
- reinforcement_learning/multi_agent_training.py: 123 additions, 92 deletions
- reinforcement_learning/multi_decision_agent.py: 90 additions, 0 deletions
- reinforcement_learning/multi_policy.py: 20 additions, 21 deletions
- reinforcement_learning/ordered_policy.py: 1 addition, 1 deletion
File deleted
File deleted
File deleted
File deleted
checkpoints/210122120236-3000.pth.local
0 → 100644
File added
checkpoints/210122120236-3000.pth.target
0 → 100644
File added
checkpoints/210122165109-5000.pth.local
0 → 100644
File added
checkpoints/210122165109-5000.pth.target
0 → 100644
File added
checkpoints/210122235754-5000.pth.actor
0 → 100644
File added
checkpoints/210122235754-5000.pth.optimizer
0 → 100644
File added
checkpoints/210122235754-5000.pth.value
0 → 100644
File added
dump.rdb
deleted
100644 → 0
File deleted