
[Documentation] Added information for the environments the trainer cannot train with the default configurations (#1995)

/develop-generalizationTraining-TrainerController
GitHub, 6 years ago
Current commit 2f2a7334
1 file changed, 13 insertions and 5 deletions
docs/Learning-Environment-Examples.md


* Vector Action space: (Discrete) Size of 6, corresponding to turn clockwise
and counterclockwise and move along four different face directions.
* Visual Observations (Optional): One first-person camera. Use
`VisualPushBlock` scene. __The visual observation version of
this environment does not train with the provided default
training parameters__ (see the configuration sketch after this list).
* Reset Parameters: None.
* Benchmark Mean Reward: 4.5
* Optional Imitation Learning scene: `PushBlockIL`.
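
The "default training parameters" mentioned above are the per-brain entries in
`config/trainer_config.yaml`. The sketch below only shows where such parameters
would be adjusted for the visual variant; the `VisualPushBlockLearning` key name
and every value are illustrative assumptions, not a configuration known to train
this scene.

```yaml
# Hypothetical entry in config/trainer_config.yaml for the visual variant.
# The key name and all values are assumptions for illustration only.
VisualPushBlockLearning:
    batch_size: 64       # smaller batches are typical with visual observations
    buffer_size: 1024
    hidden_units: 256
    beta: 1.0e-2
    max_steps: 5.0e5
    summary_freq: 2000
```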

* Rotation (3 possible actions: Rotate Left, Rotate Right, No Action)
* Laser (2 possible actions: Laser, No Action)
* Visual Observations (Optional): First-person camera per-agent. Use
`VisualBanana` scene. __The visual observation version of
this environment does not train with the provided default
training parameters.__
* Reset Parameters: None.
* Benchmark Mean Reward: 10
* Optional Imitation Learning scene: `BananaIL`.

* Vector Action space: (Discrete) 1 Branch, 4 actions corresponding to agent
rotation and forward/backward movement.
* Visual Observations (Optional): First-person view for the agent. Use
`VisualHallway` scene. __The visual observation version of
this environment does not train with the provided default
training parameters.__
* Reset Parameters: None.
* Benchmark Mean Reward: 0.7
* To speed up training, you can enable curiosity by adding `use_curiosity: true` in `config/trainer_config.yaml`, as sketched below.
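
A minimal sketch of that change, assuming the Hallway brain is configured under
a `HallwayLearning` key; only `use_curiosity: true` comes from the bullet above,
while the key name and the two auxiliary curiosity settings are assumptions.

```yaml
# config/trainer_config.yaml -- enable the curiosity intrinsic reward for Hallway.
# The brain key and the auxiliary values are assumptions, not verified defaults.
HallwayLearning:
    use_curiosity: true        # the setting named in the bullet above
    curiosity_strength: 0.01   # assumed weight of the intrinsic reward
    curiosity_enc_size: 256    # assumed size of the curiosity encoding layers
```

The curiosity module adds an intrinsic reward for observations the agent cannot
yet predict, which is why it can speed up training in a sparse-reward task like
this one.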

* Visual Observations: None.
* Reset Parameters: None.
* Benchmark Mean Reward (Striker & Goalie Brain): 0 (the means will be inverse
of each other and criss-cross during training). __Note that our trainer is currently unable to consistently train this environment.__

## Walker

## Pyramids

* Vector Action space: (Discrete) 4 corresponding to agent rotation and
forward/backward movement.
* Visual Observations (Optional): First-person camera per-agent. Use
`VisualPyramids` scene. __The visual observation version of
this environment does not train with the provided default
training parameters.__
* Reset Parameters: None.
* Optional Imitation Learning scene: `PyramidsIL`.
* Benchmark Mean Reward: 1.75