Dipam Chakraborty / neurips-2020-procgen-competition
Commit 606189ba
Authored Oct 26, 2020 by Dipam Chakraborty
ppo max time
Parent: 4a9bbaea
Changes: 3
algorithms/custom_torch_agent/custom_torch_policy.py
@@ -77,7 +77,7 @@ class CustomTorchPolicy(TorchPolicy):
         self.exp_replay = np.empty((self.retune_selector.replay_size, *self.observation_space.shape), dtype=np.uint8)
         self.target_timesteps = 8_000_000
         self.buffer_time = 20 # TODO: Could try to do a median or mean time step check instead
-        self.max_time = 10000000000000 # ignore timekeeping because spot instances are messing it up
+        self.max_time = self.config['max_time']
         self.maxrewep_lenbuf = deque(maxlen=100)
         self.gamma = self.config['gamma']
         self.adaptive_discount_tuner = AdaptiveDiscountTuner(self.gamma, momentum=0.98, eplenmult=3)
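The hunk above only shows where `max_time` and `buffer_time` are set, not how the policy consumes them. As a rough sketch of one way a config-driven time limit with a safety margin could be checked: the `TimeBudget` class, its method names, and the rule "stop once elapsed time plus the buffer reaches the limit" are all assumptions for illustration, not code from this repository. The units are assumed to be seconds.

```python
import time
from typing import Callable


class TimeBudget:
    """Hypothetical helper (not part of the repository): tracks elapsed
    wall-clock time against a configured limit."""

    def __init__(self, max_time: float, buffer_time: float = 20.0,
                 clock: Callable[[], float] = time.monotonic):
        self.max_time = max_time        # assumed to be seconds
        self.buffer_time = buffer_time  # assumed safety margin, same units
        self._clock = clock             # injectable so the check is testable
        self._start = clock()

    def elapsed(self) -> float:
        """Time since this budget was created."""
        return self._clock() - self._start

    def exhausted(self) -> bool:
        # Assumption: stop once the remaining time is within the margin.
        return self.elapsed() + self.buffer_time >= self.max_time


# The value now comes from the config instead of a hard-coded constant.
config = {"max_time": 7200}
budget = TimeBudget(config["max_time"])
print(budget.exhausted())
```

Reading the limit from the config means an experiment can tighten or relax the budget without touching the policy code.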
algorithms/custom_torch_agent/ppo.py
@@ -89,6 +89,7 @@ DEFAULT_CONFIG = with_common_config({
     "scale_reward": 1.0,
     "return_reset": True,
     "aux_phase_mixed_precision": False,
+    "max_time": 100000000,
 })
 # __sphinx_doc_end__
 # yapf: enable
experiments/custom-torch-ppo.yaml
@@ -46,14 +46,15 @@ procgen-ppo:
         no_done_at_end: False
         # Custom switches
-        retune_skips: 50000
-        retune_replay_size: 200000
-        num_retunes: 28
+        retune_skips: 100000
+        retune_replay_size: 400000
+        num_retunes: 14
         retune_epochs: 3
         standardize_rewards: True
         scale_reward: 1.0
         return_reset: False
         aux_phase_mixed_precision: True
+        max_time: 7200
         adaptive_gamma: False
         final_lr: 5.0e-5
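Taken together, the three files give `max_time` a large default in `ppo.py` (100000000) and a much smaller value in the experiment YAML (7200), which the policy then reads from `self.config`. The sketch below illustrates that precedence with plain dictionaries. `resolve_config` is a hypothetical helper, and the framework's real merging logic may differ (for example, by merging nested sections).

```python
# Default from algorithms/custom_torch_agent/ppo.py
# (only the key added by this commit is shown).
default_config = {"max_time": 100000000}

# Override from experiments/custom-torch-ppo.yaml.
experiment_config = {"max_time": 7200}


def resolve_config(defaults: dict, overrides: dict) -> dict:
    """Hypothetical shallow merge: experiment values win over defaults."""
    merged = dict(defaults)   # copy so the defaults stay untouched
    merged.update(overrides)  # experiment-level keys take precedence
    return merged


resolved = resolve_config(default_config, experiment_config)
print(resolved["max_time"])  # 7200
```

If the value is in seconds, 7200 corresponds to a two-hour limit, while the default is large enough to act as "no limit" for experiments that do not set the key.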