浏览代码

Fixed typo in Training-Imitation-Learning.md (#2485)

/develop-gpu-test
Jeffrey Shih 5 年前
当前提交
df64b64a
共有 1 个文件被更改,包括 1 次插入1 次删除
  1. 2
      docs/Training-Imitation-Learning.md

2
docs/Training-Imitation-Learning.md


of a reward function, we can give the medic real world examples of observations
from the game and actions from a game controller to guide the medic's behavior.
Imitation Learning uses pairs of observations and actions from
from a demonstration to learn a policy. [Video Link](https://youtu.be/kpb8ZkMBFYs).
a demonstration to learn a policy. [Video Link](https://youtu.be/kpb8ZkMBFYs).
Imitation learning can also be used to help reinforcement learning. Especially in
environments with sparse (i.e., infrequent or rare) rewards, the agent may never see

正在加载...
取消
保存