separate value train and model schedule to const

/develop/bisim-review
Andrew Cohen 5 years ago
Current commit
e6066ffd
2 files changed, 4 insertions and 0 deletions
  1. config/ppo_transfer/CrawlerStatic.yaml (2 changes)
  2. config/ppo_transfer/OldCrawlerStatic.yaml (2 changes)

config/ppo_transfer/CrawlerStatic.yaml (2 changes)


lambd: 0.95
num_epoch: 3
learning_rate_schedule: constant
model_schedule: constant
encoder_layers: 2
action_layers: 2
policy_layers: 1

predict_return: true
use_bisim: false
separate_value_train: true
separate_value_net: true
in_batch_alter: true
network_settings:
  normalize: true

config/ppo_transfer/OldCrawlerStatic.yaml (2 changes)


lambd: 0.95
num_epoch: 3
learning_rate_schedule: constant
model_schedule: constant
encoder_layers: 2
action_layers: 2
policy_layers: 1

predict_return: true
use_bisim: false
separate_value_train: true
separate_value_net: true
train_model: false
load_model: true
train_action: false
