浏览代码

Fix doc which refers to nonexistent agents. (#923)

/hotfix-v0.9.2a
Vincent(Yuan) Gao 6 年前
当前提交
f6e45db2
共有 1 个文件被更改,包括 2 次插入2 次删除
  1. 4
      docs/Training-Curriculum-Learning.md

4
docs/Training-Curriculum-Learning.md


To see this in action, observe the two learning curves below. Each displays the reward
over time for an agent trained using PPO with the same set of training hyperparameters.
The difference is that the agent on the left was trained using the full-height wall
version of the task, and the right agent was trained using the curriculum version of
The difference is that one agent was trained using the full-height wall
version of the task, and the other agent was trained using the curriculum version of
the task. As you can see, without using curriculum learning the agent has a lot of
difficulty. We think that by using well-crafted curricula, agents trained using
reinforcement learning will be able to accomplish tasks otherwise much more difficult.

正在加载...
取消
保存