浏览代码

Corrected state space of tennis environment (#160)

/develop-generalizationTraining-TrainerController
Arthur Juliani 7 年前
当前提交
e25aa997
共有 1 个文件被更改,包括 1 次插入1 次删除
  1. 2
      docs/Example-Environments.md

2
docs/Example-Environments.md


* +0.1 To agent when hitting ball after ball was hit by the other agent.
* +0.1 To agent who didn't hit ball last when ball hits ground.
* Brains: One brain with the following state/action space.
* State space: (Continuous) 6 variables corresponding to position of agent and nearest goal and obstacle.
* State space: (Continuous) 8 variables corresponding to position and velocity of ball and racket.
* Action space: (Discrete) Size of 4, corresponding to movement toward net, away from net, jumping, and no-movement.
* Observations: None
* Reset Parameters: One, corresponding to size of ball.
正在加载...
取消
保存