浏览代码

[bug-fix] Update the gail config for the new steps in 0.14.0 (#3475)

/release-0.14.1
Anupam Bhatnagar 5 年前
当前提交
d5617834
共有 2 个文件被更改,包括 14 次插入14 次删除
  1. 1
      com.unity.ml-agents/CHANGELOG.md
  2. 27
      config/gail_config.yaml

1
com.unity.ml-agents/CHANGELOG.md


### Bug Fixes
- Fixed an issue which caused self-play training sessions to consume a lot of memory. (#3451)
- Fixed an IndexError when using GAIL or behavioral cloning with demonstrations recorded with 0.14.0 or later (#3464)
- Updated the `gail_config.yaml` to work with per-Agent steps (#3475)
## [0.14.0-preview] - 2020-02-13

27
config/gail_config.yaml


num_layers: 2
time_horizon: 64
sequence_length: 64
summary_freq: 1000
summary_freq: 10000
use_recurrent: false
reward_signals:
extrinsic:

Pyramids:
summary_freq: 2000
summary_freq: 30000
time_horizon: 128
batch_size: 128
buffer_size: 2048

max_steps: 5.0e5
max_steps: 1.0e7
steps: 10000
steps: 150000
reward_signals:
extrinsic:
strength: 1.0

time_horizon: 1000
batch_size: 2024
buffer_size: 20240
max_steps: 1e6
summary_freq: 3000
max_steps: 1e7
summary_freq: 30000
steps: 5000
steps: 50000
reward_signals:
gail:
strength: 1.0

PushBlock:
max_steps: 5.0e4
max_steps: 1.5e7
summary_freq: 2000
summary_freq: 60000
time_horizon: 64
num_layers: 2
reward_signals:

encoding_size: 128
demo_path: Project/Assets/ML-Agents/Examples/PushBlock/Demos/ExpertPush.demo
demo_path: Project/Assets/Demonstrations/PushblockDemo.demo
Hallway:
use_recurrent: true

num_epoch: 3
buffer_size: 1024
batch_size: 128
max_steps: 5.0e5
summary_freq: 1000
max_steps: 1.0e7
summary_freq: 10000
time_horizon: 64
reward_signals:
extrinsic:

FoodCollector:
batch_size: 64
summary_freq: 1000
max_steps: 5.0e4
max_steps: 2.0e6
use_recurrent: false
hidden_units: 128
learning_rate: 3.0e-4

正在加载...
取消
保存