### Bug Fixes
- Fixed an issue which caused self-play training sessions to consume a lot of memory. (#3451)
- Fixed an IndexError when using GAIL or behavioral cloning with demonstrations recorded with 0.14.0 or later (#3464)
- Updated the `gail_config.yaml` to work with per-Agent steps (#3475)
## [0.14.0-preview] - 2020-02-13
num_layers: 2
time_horizon: 64
sequence_length: 64
summary_freq: 1000
summary_freq: 10000
use_recurrent: false
reward_signals:
extrinsic:
Pyramids:
summary_freq: 2000
summary_freq: 30000
time_horizon: 128
batch_size: 128
buffer_size: 2048
max_steps: 5.0e5
max_steps: 1.0e7
steps: 10000
steps: 150000
strength: 1.0
time_horizon: 1000
batch_size: 2024
buffer_size: 20240
max_steps: 1e6
summary_freq: 3000
max_steps: 1e7
steps: 5000
steps: 50000
gail:
PushBlock:
max_steps: 5.0e4
max_steps: 1.5e7
summary_freq: 60000
encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/PushBlock/Demos/ExpertPush.demo
demo_path: Project/Assets/Demonstrations/PushblockDemo.demo
Hallway:
use_recurrent: true
num_epoch: 3
buffer_size: 1024
FoodCollector:
batch_size: 64
max_steps: 2.0e6
hidden_units: 128
learning_rate: 3.0e-4