Commit 5fa48bae authored by Eric Hambro's avatar Eric Hambro
Browse files

Fixup typo.

parent 63a8c5b0
......@@ -123,11 +123,11 @@
```python
obs = env.reset() # produces the first observation
done = False # initialize this so we know when episode ends
total_reward = 0 # total reward
while not done:
action = agent.act(obs) # action processes observation and computes an action
action = agent.act(obs) # agent processes observation and computes an action
obs, reward, done, info = env.step(action) # updates the new observation and provides the reward/done
total_reward += reward # keep track of cumulative reward
```
When the episode is over (very likely YASD) the total_reward will be the score of the agent, used for training RL agents, and to get an idea of the current performance for symbolic ones.
......@@ -274,11 +274,11 @@
![Model](./model.png)
As can be seen, the model utilized both an agent centric view and a global view, which are both processed with convolutional neural network (CNN) layers. In addition, the blstats are processed with an MLP. Finally, the embeddings are passed into an LSTM to deal with partial observability.
The baseline is almost identical except wit one key difference - we haven added an CNN encoder for the `message` observation. This architecture may provide a promising starting point for development, but the sky is the limit for new ideas! Check out the [README.md](./nethack_baselines/torchbeast/README.md) to get started!
The baseline is almost identical except with one key difference - we haven't added an CNN encoder for the `message` observation. This architecture may provide a promising starting point for development, but the sky is the limit for new ideas! Check out the [README.md](./nethack_baselines/torchbeast/README.md) to get started!
%% Cell type:markdown id:af86ddfe tags:
And if you want to learn more about NetHack, checkout:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment