浏览代码

fig bug

/develop/bisim-sac-transfer
yanchaosun 4 年前
当前提交
42c0c333
共有 4 个文件被更改,包括 12 次插入9 次删除
  1. 5
      config/sac_transfer/3DBall.yaml
  2. 5
      config/sac_transfer/3DBallHard.yaml
  3. 7
      config/sac_transfer/3DBallHardTransfer.yaml
  4. 4
      ml-agents/mlagents/trainers/sac_transfer/network.py

5
config/sac_transfer/3DBall.yaml


forward_layers: 1
value_layers: 1
feature_size: 16
separate_value_net: true
# separate_value_net: true
separate_policy_train: true
reuse_encoder: true
in_epoch_alter: false
in_batch_alter: true

predict_return: true
use_bisim: false
use_bisim: true
network_settings:
normalize: true
hidden_units: 64

5
config/sac_transfer/3DBallHard.yaml


forward_layers: 1
value_layers: 1
feature_size: 16
separate_value_net: true
# separate_value_net: true
separate_policy_train: true
reuse_encoder: false
in_epoch_alter: false
in_batch_alter: true

predict_return: true
use_bisim: false
use_bisim: true
network_settings:
normalize: true
hidden_units: 64

7
config/sac_transfer/3DBallHardTransfer.yaml


forward_layers: 1
value_layers: 1
feature_size: 16
separate_value_net: true
# separate_value_net: true
separate_policy_train: true
reuse_encoder: false
in_epoch_alter: false
in_batch_alter: false

predict_return: true
use_bisim: false
use_bisim: true
transfer_path: "results/sac_model_ball/3DBall"
transfer_path: "results/sac_model_ball_bisim/3DBall"
network_settings:
normalize: true
hidden_units: 64

4
ml-agents/mlagents/trainers/sac_transfer/network.py


self.sequence_length_ph = self.policy.sequence_length_ph
hidden_critic = self._create_encoder(
self.visual_in,
self.processed_vector_in,
self.policy.visual_in,
self.policy.processed_vector_in,
vis_encode_type,
encoder_layers=encoder_layers,
scope="encoding",

正在加载...
取消
保存