浏览代码

Rename brains to new names (#1321)

/develop-generalizationTraining-TrainerController
GitHub 6 年前
当前提交
f99dc261
共有 1 个文件被更改,包括 20 次插入20 次删除
  1. 40
      config/trainer_config.yaml

40
config/trainer_config.yaml


curiosity_strength: 0.01
curiosity_enc_size: 128
BananaBrain:
BananaLearning:
normalize: false
batch_size: 1024
beta: 5.0e-3

BouncerBrain:
BouncerLearning:
PushBlockBrain:
PushBlockLearning:
max_steps: 5.0e4
batch_size: 128
buffer_size: 2048

time_horizon: 64
num_layers: 2
SmallWallBrain:
SmallWallLearning:
max_steps: 1.0e6
batch_size: 128
buffer_size: 2048

num_layers: 2
normalize: false
BigWallBrain:
BigWallLearning:
max_steps: 1.0e6
batch_size: 128
buffer_size: 2048

num_layers: 2
normalize: false
StrikerBrain:
StrikerLearning:
max_steps: 5.0e5
learning_rate: 1e-3
batch_size: 128

num_layers: 2
normalize: false
GoalieBrain:
GoalieLearning:
max_steps: 5.0e5
learning_rate: 1e-3
batch_size: 320

num_layers: 2
normalize: false
PyramidBrain:
PyramidLearning:
use_curiosity: true
summary_freq: 2000
curiosity_strength: 0.01

max_steps: 5.0e5
num_epoch: 3
VisualPyramidBrain:
VisualPyramidLearning:
use_curiosity: true
curiosity_strength: 0.01
curiosity_enc_size: 256

max_steps: 5.0e5
num_epoch: 3
Ball3DBrain:
Ball3DLearning:
normalize: true
batch_size: 64
buffer_size: 12000

gamma: 0.995
beta: 0.001
Ball3DHardBrain:
Ball3DHardLearning:
normalize: true
batch_size: 1200
buffer_size: 12000

beta: 0.001
TennisBrain:
TennisLearning:
CrawlerBrain:
CrawlerLearning:
normalize: true
num_epoch: 3
time_horizon: 1000

num_layers: 3
hidden_units: 512
WalkerBrain:
WalkerLearning:
normalize: true
num_epoch: 3
time_horizon: 1000

num_layers: 3
hidden_units: 512
ReacherBrain:
ReacherLearning:
normalize: true
num_epoch: 3
time_horizon: 1000

max_steps: 1e6
summary_freq: 3000
HallwayBrain:
HallwayLearning:
use_recurrent: true
sequence_length: 64
num_layers: 2

summary_freq: 1000
time_horizon: 64
VisualHallwayBrain:
VisualHallwayLearning:
use_recurrent: true
sequence_length: 64
num_layers: 1

summary_freq: 1000
time_horizon: 64
VisualPushBlockBrain:
VisualPushBlockLearning:
use_recurrent: true
sequence_length: 32
num_layers: 1

summary_freq: 1000
time_horizon: 64
GridWorldBrain:
GridWorldLearning:
batch_size: 32
normalize: false
num_layers: 1

summary_freq: 2000
time_horizon: 5
BasicBrain:
BasicLearning:
batch_size: 32
normalize: false
num_layers: 1

正在加载...
取消
保存