浏览代码

Add new flags to ppo best practices

/tag-0.2.0
Arthur Juliani 7 年前
当前提交
c1ccc4e8
共有 1 个文件被更改,包括 12 次插入0 次删除
  1. 12
      docs/best-practices-ppo.md

12
docs/best-practices-ppo.md


Typical Range: `5e5 - 1e7`
### Normalize
`normalize` corresponds to whether normalization is applied to the state inputs. This normalization is based on the running average and variance of the states.
Normalization can be helpful in cases with complex continuous control problems, but may be harmful with simpler discrete control problems.
### Number of Layers
`num_layers` corresponds to how many hidden layers are present after the state input, or after the CNN encoding of the observation. For simple problems,
fewer layers are likely to train faster and more efficiently. More layers may be necessary for more complex control problems.
Typical range: `1` - `3`
## Training Statistics
To view training statistics, use Tensorboard. For information on launching and using Tensorboard, see [here](./Getting-Started-with-Balance-Ball.md#observing-training-progress).

正在加载...
取消
保存