浏览代码

Removed lingering link to old page.

/develop-generalizationTraining-TrainerController
Marwan Mattar 7 年前
当前提交
1ecc8cf9
共有 1 个文件被更改,包括 0 次插入2 次删除
  1. 2
      docs/Learning-Environment-Design-Agents.md

2
docs/Learning-Environment-Design-Agents.md


When you turn on **On Demand Decisions** for an agent, your agent code must call the `Agent.RequestDecision()` function. This function call starts one iteration of the observation-decision-action-reward cycle. The Brain invokes the agent's `CollectObservations()` method, makes a decision and returns it by calling the `AgentAction()` method. The Brain waits for the agent to request the next decision before starting another iteration.
See [On Demand Decision Making](Feature-On-Demand-Decision.md).
## Observations
To make decisions, an agent must observe its environment to determine its current state. A state observation can take the following forms:

正在加载...
取消
保存