Update keep_checkpoints_num to 100 million for all to ensure all checkpoints are saved

1 job for FlatlandPaper_v2 in 2 minutes and 54 seconds (queued for 45 seconds)
Name: test
Stage: Test
Failure:
  Downloading GPUtil-1.4.0.tar.gz (5.5 kB)
Collecting wandb==0.9.2
  Downloading wandb-0.9.2-py2.py3-none-any.whl (1.4 MB)
Collecting ray[rllib]==0.8.5
  Downloading ray-0.8.5-cp37-cp37m-manylinux1_x86_64.whl (21.2 MB)
Collecting tensorflow==2.1.0
  Downloading tensorflow-2.1.0-cp37-cp37m-manylinux2010_x86_64.whl (421.8 MB)

ERROR: Job failed: command terminated with exit code 1