Adding documentation for learning rate schedule

/bug-failed-api-check
Anupam Bhatnagar, 5 years ago
Current commit
9007164c
1 file changed, 1 insertion and 0 deletions
  1. 1
      docs/Training-ML-Agents.md

1
docs/Training-ML-Agents.md


| init_entcoef | How much the agent should explore in the beginning of training. | SAC |
| lambd | The regularization parameter. | PPO |
| learning_rate | The initial learning rate for gradient descent. | PPO, SAC |
| learning_rate_schedule | Determines how learning rate changes over time. Options: linear (default), constant | PPO, SAC |
| max_steps | The maximum number of simulation steps to run during a training session. | PPO, SAC |
| memory_size | The size of the memory an agent must keep. Used for training with a recurrent neural network. See [Using Recurrent Neural Networks](Feature-Memory.md). | PPO, SAC |
| normalize | Whether to automatically normalize observations. | PPO, SAC |
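
The two `learning_rate_schedule` options in the row above can be illustrated with a minimal Python sketch. This is not the ML-Agents implementation: the function name `scheduled_learning_rate`, the `min_lr` floor, and the assumption that `linear` decays the rate toward zero as training approaches `max_steps` are all hypothetical choices made for illustration.

```python
def scheduled_learning_rate(initial_lr, step, max_steps,
                            schedule="linear", min_lr=0.0):
    """Return an illustrative learning rate for a given training step.

    Hypothetical helper, not part of ML-Agents. Assumes "linear" means
    a straight-line decay from initial_lr at step 0 down to min_lr at
    max_steps, and "constant" means the rate never changes.
    """
    if max_steps <= 0:
        raise ValueError("max_steps must be positive")
    if schedule == "constant":
        return initial_lr
    if schedule == "linear":
        # Fraction of training completed, clamped to [0, 1].
        progress = min(max(step / max_steps, 0.0), 1.0)
        return max(initial_lr * (1.0 - progress), min_lr)
    raise ValueError(f"unknown schedule: {schedule!r}")


if __name__ == "__main__":
    # Compare both schedules at a few points in a 1000-step run.
    for step in (0, 250, 500, 1000):
        linear = scheduled_learning_rate(3e-4, step, 1000, "linear")
        constant = scheduled_learning_rate(3e-4, step, 1000, "constant")
        print(step, linear, constant)
```

Under these assumptions, a decaying schedule suits runs where late-stage updates should be small, while `constant` keeps the step size fixed for the whole session.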