neurips-2021-the-nethack-challenge issueshttps://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/issues2021-07-13T09:51:40Zhttps://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/issues/17Using more than 2 GPUs with polybeast2021-07-13T09:51:40ZBatomUsing more than 2 GPUs with polybeastHi,
First of all, thanks a lot for the instructions and great competition.
We wanted to use polybeast on a single machine with more than 2 GPUs with the goal of getting as much throughput as possible. We were wondering what the best way ...Hi,
First of all, thanks a lot for the instructions and great competition.
We wanted to use polybeast on a single machine with more than 2 GPUs with the goal of getting as much throughput as possible. We were wondering what the best way to achieve this (e.g. more learner GPUs etc.) and what do we have to add/change.https://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/issues/16Error running polyhydra2021-07-12T18:11:02ZCireNeikualError running polyhydraHi,
I am trying to get the torchbeast agent to run in order to generate some trajectories I can initialize my own agents to. I installed torchbeast as per the README in the torchbeast repository (for polybeast). I was able to compile it...Hi,
I am trying to get the torchbeast agent to run in order to generate some trajectories I can initialize my own agents to. I installed torchbeast as per the README in the torchbeast repository (for polybeast). I was able to compile it, although I did have to make a modification to one of the dependencies (abseil-cpp) to fix an incompatibility with GCC 11 (missing limits header).
However, now when I try to run the baseline (nethack_baselines/torchbeast/polyhydra.py) I get the following:
```
python3 polyhydra.py
Traceback (most recent call last):
File "/home/.../neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polyhydra.py", line 40, in <module>
import polybeast_env
File "/home/.../neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polybeast_env.py", line 23, in <module>
import libtorchbeast
File "/usr/lib/python3.9/site-packages/libtorchbeast-0.0.20-py3.9-linux-x86_64.egg/libtorchbeast/__init__.py", line 18, in <module>
from ._C import (
ImportError: /usr/lib/python3.9/site-packages/libtorchbeast-0.0.20-py3.9-linux-x86_64.egg/libtorchbeast/_C.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN3c106detail14torchCheckFailEPKcS2_jRKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE
```
Any ideas? I am using Python 3.9 on Manjaro Linux. Note that I do not have an Nvidia GPU and intend to run inference on the CPU.https://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/issues/13baseline doesn't support hydra 1.12021-07-05T23:53:37Zmax_reederbaseline doesn't support hydra 1.1Lots of errors here, if you follow the instructions inside [nethack_baselines/torchbeast/polyhydra.py](https://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/blob/master/nethack_baselines/torchbeast/polyhydra.py#L17), th...Lots of errors here, if you follow the instructions inside [nethack_baselines/torchbeast/polyhydra.py](https://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/blob/master/nethack_baselines/torchbeast/polyhydra.py#L17), that says to install hydra with --upgrade, but that installs **hydra-core==1.1** and now i am sad. Please consolidate all your instructions in one place, maybe with two conda(or requirements) files for vanilla vs. baseline for those customers, so i am not chasing my tail.
```
Installation for hydra:
pip install hydra-core hydra_colorlog --upgrade
```
then i get some deal about overrides, so i added that to config.yaml
```
defaults:
- override hydra/job_logging: colorlog
- override hydra/hydra_logging: colorlog
```
Trying again i get the below error. Note that `Be aware that cfg.pretty() is now deprecated and you should use OmegaConf.to_yaml(cfg) instead.` via https://github.com/facebookresearch/hydra/blob/2808e71248ac5a04a1b1a770d3a60f8ec9a38569/NEWS.md#L220
```
(nethack) 88665a14b754:torchbeast maxreede$ python polyhydra.py
[DEBUG:79067 cmd:814 2021-07-05 16:16:49,799] Popen(['git', 'version'], cwd=/Volumes/workplace/ml/aicrowd/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast, universal_newlines=False, shell=None, istream=None)
[DEBUG:79067 cmd:814 2021-07-05 16:16:49,836] Popen(['git', 'version'], cwd=/Volumes/workplace/ml/aicrowd/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast, universal_newlines=False, shell=None, istream=None)
polyhydra.py:108: UserWarning:
config_path is not specified in @hydra.main().
See https://hydra.cc/docs/next/upgrades/1.0_to_1.1/changes_to_hydra_main_config_path for more information.
@hydra.main(config_name="config")
[DEBUG:79067 utils:252 2021-07-05 16:16:50,090] Setting JobRuntime:name=UNKNOWN_NAME
[DEBUG:79067 utils:252 2021-07-05 16:16:50,091] Setting JobRuntime:name=polyhydra
Error executing job with overrides: []
Traceback (most recent call last):
File "polyhydra.py", line 123, in main
logging.info(flags.pretty(resolve=True))
omegaconf.errors.ConfigAttributeError: Key 'pretty' is not in struct
full_key: pretty
object_type=dict
Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
(nethack) 88665a14b754:torchbeast maxreede$
```https://gitlab.aicrowd.com/nethack/neurips-2021-the-nethack-challenge/-/issues/12Crash Missing key max_learner_queue_size2021-06-13T14:42:51Zchristophe_cerisaraCrash Missing key max_learner_queue_sizeRun
```
python polyhydra.py actor_device=cpu
```
and the baseline crashes immediately with error:
```
Traceback (most recent call last):
Server listening on unix:/tmp/poly.outputs.2021-06-13.16-27-24.56
File "/gpfsdswork/projects/re...Run
```
python polyhydra.py actor_device=cpu
```
and the baseline crashes immediately with error:
```
Traceback (most recent call last):
Server listening on unix:/tmp/poly.outputs.2021-06-13.16-27-24.56
File "/gpfsdswork/projects/rech/knb/uyr14tk/home/xtofNLE/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polyhydra.py", line 141, in main
run_learner(lrn_flags)
File "/gpfsdswork/projects/rech/knb/uyr14tk/home/xtofNLE/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polyhydra.py", line 77, in run_learner
polybeast_learner.main(flags)
File "/gpfsdswork/projects/rech/knb/uyr14tk/home/xtofNLE/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polybeast_learner.py", line 515, in main
train(flags)
File "/gpfsdswork/projects/rech/knb/uyr14tk/home/xtofNLE/neurips-2021-the-nethack-challenge/nethack_baselines/torchbeast/polybeast_learner.py", line 254, in train
if flags.max_learner_queue_size is None:
omegaconf.errors.ConfigAttributeError: Missing key max_learner_queue_size
full_key: max_learner_queue_size
object_type=dict
```