Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
Dipam Chakraborty
neurips-2020-procgen-competition
Commits
56d2a488
Commit
56d2a488
authored
Oct 29, 2020
by
Chakraborty
Browse files
Merge branch 'master' of
https://github.com/Dipamc77/procgen-competition
parents
3b8fbdc3
19f6ab29
Changes
2
Hide whitespace changes
Inline
Side-by-side
experiments/custom-ppg.yaml
View file @
56d2a488
...
...
@@ -45,13 +45,13 @@ procgen-ppo:
no_done_at_end
:
False
# Custom switches
skips
:
6
n_pi
:
1
0
num_retunes
:
1
6
retune_epochs
:
6
skips
:
2
n_pi
:
1
6
num_retunes
:
1
4
retune_epochs
:
7
standardize_rewards
:
True
aux_mbsize
:
2
aux_num_accumulates
:
4
aux_mbsize
:
4
aux_num_accumulates
:
2
augment_buffer
:
True
scale_reward
:
1.0
reset_returns
:
False
...
...
@@ -59,9 +59,9 @@ procgen-ppo:
augment_randint_num
:
3
## Hacky name fix later
aux_lr
:
5.0e-4
value_lr
:
1.0e-3
same_lr_everywhere
:
Fals
e
same_lr_everywhere
:
Tru
e
aux_phase_mixed_precision
:
True
single_optimizer
:
Tru
e
single_optimizer
:
Fals
e
max_time
:
7200
pi_phase_mixed_precision
:
False
...
...
@@ -72,7 +72,7 @@ procgen-ppo:
entropy_schedule
:
False
# Memory management, if batch size overflow, batch splitting is done to handle it
max_minibatch_size
:
5
00
max_minibatch_size
:
10
00
updates_per_batch
:
8
normalize_actions
:
False
...
...
@@ -89,10 +89,8 @@ procgen-ppo:
model
:
custom_model
:
impala_torch_ppg
custom_model_config
:
# depths: [32, 64, 64]
# nlatents: 512
depths
:
[
64
,
128
,
128
]
nlatents
:
1024
depths
:
[
32
,
64
,
64
]
nlatents
:
512
init_normed
:
True
use_layernorm
:
False
diff_framestack
:
True
...
...
experiments/custom-torch-ppo.yaml
View file @
56d2a488
...
...
@@ -46,15 +46,15 @@ procgen-ppo:
no_done_at_end
:
False
# Custom switches
retune_skips
:
10
0000
retune_replay_size
:
45
0000
retune_skips
:
35
0000
retune_replay_size
:
20
0000
num_retunes
:
13
retune_epochs
:
3
retune_epochs
:
6
standardize_rewards
:
True
scale_reward
:
1.0
return_reset
:
False
aux_phase_mixed_precision
:
True
max_time
:
10000
00
max_time
:
72
00
adaptive_gamma
:
False
final_lr
:
5.0e-5
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment