
Added benchmark mean rewards for both Crawler scenes

/develop-generalizationTraining-TrainerController
eshvk, 6 years ago
Commit 6bf54ed8
1 file changed, with 2 insertions and 1 deletion
  1. docs/Learning-Environment-Examples.md (3 lines changed)



  rotations for joints.
  * Visual Observations: None.
  * Reset Parameters: None
- * Benchmark Mean Reward: 2000
+ * Benchmark Mean Reward for `CrawlerStaticTarget`: 2000
+ * Benchmark Mean Reward for `CrawlerDynamicTarget`: 400
  ## [Banana Collector](https://youtu.be/heVMs3t9qSk)
