configs

/develop/bisim-sac-transfer
yanchaosun 4 years ago
Current commit
a9c6105d
5 files changed, with 11 insertions and 11 deletions
  1. config/ppo_transfer/CrawlerStatic.yaml (4 lines changed)
  2. config/ppo_transfer/TransferCrawlerStatic.yaml (2 lines changed)
  3. config/sac_transfer/3DBall.yaml (2 lines changed)
  4. config/sac_transfer/3DBallHard.yaml (6 lines changed)
  5. config/sac_transfer/3DBallHardTransfer.yaml (8 lines changed)

4
config/ppo_transfer/CrawlerStatic.yaml


lambd: 0.95
num_epoch: 3
learning_rate_schedule: linear
-model_schedule: linear
+model_schedule: constant
encoder_layers: 2
policy_layers: 2
forward_layers: 2

in_epoch_alter: false
-in_batch_alter: false
+in_batch_alter: true
use_op_buffer: false
use_var_predict: true
with_prior: false

2
config/ppo_transfer/TransferCrawlerStatic.yaml


use_transfer: true
load_model: true
train_model: false
transfer_path: "results/csold/CrawlerStatic"
transfer_path: "results/csold-const/CrawlerStatic"
network_settings:
normalize: true
hidden_units: 512

2
config/sac_transfer/3DBall.yaml


policy_layers: 1
forward_layers: 1
value_layers: 2
-feature_size: 16
+feature_size: 32
reuse_encoder: false
in_epoch_alter: false
in_batch_alter: true

6
config/sac_transfer/3DBallHard.yaml


learning_rate: 0.0003
learning_rate_schedule: linear
batch_size: 256
-buffer_size: 50000
+buffer_size: 24000
buffer_init_steps: 0
tau: 0.005
steps_per_update: 10.0

policy_layers: 1
forward_layers: 1
value_layers: 2
-feature_size: 16
+feature_size: 32
reuse_encoder: false
in_epoch_alter: false
in_batch_alter: true

gamma: 0.99
strength: 1.0
keep_checkpoints: 5
-max_steps: 500000
+max_steps: 1000000
time_horizon: 1000
summary_freq: 12000
threaded: true

8
config/sac_transfer/3DBallHardTransfer.yaml


learning_rate: 0.0003
learning_rate_schedule: linear
batch_size: 256
-buffer_size: 50000
+buffer_size: 24000
buffer_init_steps: 0
tau: 0.005
steps_per_update: 10.0

policy_layers: 1
forward_layers: 1
value_layers: 2
-feature_size: 16
+feature_size: 32
reuse_encoder: false
in_epoch_alter: false
in_batch_alter: true

use_transfer: true
load_model: true
train_model: false
-transfer_path: "results/sac_model_ball_sep_bisim/3DBall"
+transfer_path: "results/sac_model_ball_sep_linear_f32/3DBall"
network_settings:
normalize: true
hidden_units: 128

gamma: 0.99
strength: 1.0
keep_checkpoints: 5
-max_steps: 500000
+max_steps: 1000000
time_horizon: 1000
summary_freq: 12000
threaded: true