Develop environment bc fix and doc update (#1317)
* split the config into two files
* fixed the Training-ML-Agents.md doc
* added the configs for all of the IL scenes
/develop-generalizationTraining-TrainerController
GitHub
6 years ago
Current commit
bcd487a1
5 files changed, with 156 insertions and 77 deletions
- docs/Training-Imitation-Learning.md (8 lines changed)
- docs/Training-ML-Agents.md (32 lines changed)
- config/offline_bc_config.yaml (27 lines changed)
- config/online_bc_config.yaml (110 lines changed)
- config/bc_config.yaml (56 lines changed)

config/offline_bc_config.yaml

default:
    trainer: offline_bc
    batch_size: 64
    summary_freq: 1000
    max_steps: 5.0e4
    batches_per_epoch: 10
    use_recurrent: false
    hidden_units: 128
    learning_rate: 3.0e-4
    num_layers: 2
    sequence_length: 32
    memory_size: 256
    demo_path: ./UnitySDK/Assets/Demonstrations/<Your_Demon_File>.demo

HallwayBrain:
    trainer: offline_bc
    max_steps: 5.0e5
    num_epoch: 5
    batch_size: 64
    batches_per_epoch: 5
    num_layers: 2
    hidden_units: 128
    sequence_length: 16
    use_recurrent: true
    memory_size: 256
    sequence_length: 32
    demo_path: ./UnitySDK/Assets/Demonstrations/Hallway.demo
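
The HallwayBrain section above sets sequence_length twice (16, then 32). Many YAML loaders, PyYAML among them, keep the last value for a repeated key without warning, so the earlier value would be silently dropped. A minimal sketch of a duplicate-key check follows, using only the Python standard library and assuming the simple two-level "section: / key: value" layout shown here. `find_duplicate_keys` is a hypothetical helper written for illustration, not part of ML-Agents.

```python
from collections import defaultdict


def find_duplicate_keys(text):
    """Return {section: [keys listed more than once]} for a two-level config."""
    seen = defaultdict(list)
    section = None
    for raw in text.splitlines():
        line = raw.rstrip()
        # Skip blank lines and comments.
        if not line.strip() or line.lstrip().startswith("#"):
            continue
        key = line.split(":", 1)[0].strip()
        if not line.startswith((" ", "\t")):
            section = key  # an unindented line starts a new section
        elif section is not None:
            seen[section].append(key)
    return {
        name: sorted({k for k in keys if keys.count(k) > 1})
        for name, keys in seen.items()
        if len(set(keys)) != len(keys)
    }


# Excerpt copied from the HallwayBrain section of the listing.
SAMPLE = """\
HallwayBrain:
    trainer: offline_bc
    sequence_length: 16
    use_recurrent: true
    memory_size: 256
    sequence_length: 32
"""

print(find_duplicate_keys(SAMPLE))
```

Running a check like this before training makes it clear which of the two values is intended, instead of leaving the choice to the loader.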

config/online_bc_config.yaml

default:
    trainer: online_bc
    brain_to_imitate: <Your_Brain_Asset_Name>
    batch_size: 64
    time_horizon: 64
    summary_freq: 1000
    max_steps: 5.0e4
    batches_per_epoch: 10
    use_recurrent: false
    hidden_units: 128
    learning_rate: 3.0e-4
    num_layers: 2
    sequence_length: 32
    memory_size: 256

BananaLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: BananaPlayer
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

BouncerLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 10
    brain_to_imitate: BouncerPlayer
    batch_size: 16
    batches_per_epoch: 1
    num_layers: 1
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

HallwayLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: HallwayPlayer
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

PushBlockLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: PushBlockPlayer
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

PyramidsLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: PyramidsPlayer
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

TennisLearning:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: TennisPlayer
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

StudentBrain:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: TeacherBrain
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: false
    sequence_length: 16

StudentRecurrentBrain:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: TeacherBrain
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: true
    sequence_length: 32
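
In ML-Agents trainer configs, a brain's section is read on top of the default section, so a brain only needs to list the values that differ. A minimal sketch of that lookup with plain dictionaries follows, assuming the file has already been parsed. `resolve_settings` is a hypothetical helper for illustration, not the library's own loader, and the values are copied from the default and BouncerLearning sections above (only a subset of the default keys is shown).

```python
def resolve_settings(config, brain_name):
    """Overlay a brain's section on the default section (brain values win)."""
    resolved = dict(config.get("default", {}))
    resolved.update(config.get(brain_name, {}))
    return resolved


config = {
    "default": {
        "trainer": "online_bc",
        "batch_size": 64,
        "summary_freq": 1000,
        "max_steps": 5.0e4,
        "batches_per_epoch": 10,
        "hidden_units": 128,
        "learning_rate": 3.0e-4,
        "num_layers": 2,
    },
    "BouncerLearning": {
        "trainer": "online_bc",
        "max_steps": 10000,
        "summary_freq": 10,
        "brain_to_imitate": "BouncerPlayer",
        "batch_size": 16,
        "batches_per_epoch": 1,
        "num_layers": 1,
        "hidden_units": 64,
    },
}

settings = resolve_settings(config, "BouncerLearning")
# summary_freq is overridden by the brain section; learning_rate is not
# listed there, so it falls back to the default section.
print(settings["summary_freq"], settings["learning_rate"])
```

Under this reading, BouncerLearning trains with its own summary_freq of 10 while inheriting learning_rate: 3.0e-4 from default.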

config/bc_config.yaml

default:
    trainer: offline_bc
    batch_size: 64
    beta: 5.0e-3
    hidden_units: 128
    learning_rate: 3.0e-4
    max_steps: 5.0e4
    memory_size: 256
    batches_per_epoch: 10
    time_horizon: 64
    num_epoch: 5
    num_layers: 2
    summary_freq: 1000
    use_recurrent: false
    sequence_length: 32
    demo_path: ./UnitySDK/Assets/Demonstrations/Crawler_test.demo

HallwayBrain:
    trainer: offline_bc
    max_steps: 5.0e5
    num_epoch: 5
    batch_size: 64
    batches_per_epoch: 5
    num_layers: 2
    hidden_units: 128
    sequence_length: 16
    buffer_size: 512
    use_recurrent: true
    memory_size: 256
    sequence_length: 32
    demo_path: ./UnitySDK/Assets/Demonstrations/Hallway.demo

StudentBrain:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: TeacherBrain
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    sequence_length: 16
    buffer_size: 128

StudentRecurrentBrain:
    trainer: online_bc
    max_steps: 10000
    summary_freq: 1000
    brain_to_imitate: TeacherBrain
    batch_size: 16
    batches_per_epoch: 5
    num_layers: 4
    hidden_units: 64
    use_recurrent: true
    sequence_length: 32
    buffer_size: 128
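
The commit message describes splitting one config into two files, and the listing above mixes offline_bc and online_bc sections. One way to sketch such a split follows, assuming the config is already parsed into a dictionary. `split_by_trainer` is a hypothetical helper for illustration. It simply groups each section by its own trainer value, so the default section lands in whichever group its trainer names; a real split would also need a default section for the other group.

```python
def split_by_trainer(config):
    """Group sections by their 'trainer' value, keeping section order."""
    groups = {}
    for name, settings in config.items():
        trainer = settings.get("trainer", "unspecified")
        groups.setdefault(trainer, {})[name] = settings
    return groups


# Abbreviated sections taken from the combined listing.
combined = {
    "default": {"trainer": "offline_bc", "batch_size": 64},
    "HallwayBrain": {"trainer": "offline_bc", "use_recurrent": True},
    "StudentBrain": {"trainer": "online_bc", "brain_to_imitate": "TeacherBrain"},
    "StudentRecurrentBrain": {"trainer": "online_bc", "use_recurrent": True},
}

for trainer, sections in split_by_trainer(combined).items():
    print(trainer, list(sections))
```

Each group could then be written to its own file, which mirrors the offline/online separation seen in the two new config files.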