浏览代码

Merge pull request #1278 from eltronix/patch-1

Fix typos
/develop-generalizationTraining-TrainerController
GitHub 6 年前
当前提交
93d64333
共有 1 个文件被更改,包括 5 次插入5 次删除
  1. 10
      docs/ML-Agents-Overview.md

10
docs/ML-Agents-Overview.md


_Example of a mathematics curriculum. Lessons progress from simpler topics to
more complex ones, with each building on the last._
When we think about how reinforcement learning actually works, the learning
signal is reward received occasionally throughout training. The starting point
When we think about how reinforcement learning actually works, the learning reward
signal is received occasionally throughout training. The starting point
when training an agent to accomplish this task will be a random policy. That
starting policy will have the agent running in circles, and will likely never,
or very rarely achieve the reward for complex environments. Thus by simplifying

done alone. Examples include environments where each agent only has access to
partial information, which needs to be shared in order to accomplish the task
or collaboratively solve a puzzle.
- Competitive Multi-Agent. Multiple interacting s with inverse reward
- Competitive Multi-Agent. Multiple interacting agents with inverse reward
scenario, s must compete with one another to either win a competition, or
scenario, agents must compete with one another to either win a competition, or
- Ecosystem. Multiple interacting s with independent reward signals linked
- Ecosystem. Multiple interacting agents with independent reward signals linked
to either a single or multiple different Brains. This scenario can be thought
of as creating a small world in which animals with different goals all
interact, such as a savanna in which there might be zebras, elephants and

正在加载...
取消
保存