浏览代码

Update the basic example documentation (#3281)

/asymm-envs
GitHub 4 年前
当前提交
a200309a
共有 1 个文件被更改,包括 2 次插入1 次删除
  1. 3
      docs/Learning-Environment-Examples.md

3
docs/Learning-Environment-Examples.md


* Goal: Move to the most reward state.
* Agents: The environment contains one agent.
* Agent Reward Function:
* -0.01 at each step
* +0.1 for arriving at suboptimal state.
* +1.0 for arriving at optimal state.
* Behavior Parameters:

* Visual Observations: None
* Float Properties: None
* Benchmark Mean Reward: 0.94
* Benchmark Mean Reward: 0.93
## [3DBall: 3D Balance Ball](https://youtu.be/dheeCO29-EI)

正在加载...
取消
保存