浏览代码

op buffer

/develop/bisim-review
Andrew Cohen 4 年前
当前提交
9d7ed6cc
共有 1 个文件被更改,包括 6 次插入6 次删除
  1. 12
      ml-agents/mlagents/trainers/policy/transfer_policy.py

12
ml-agents/mlagents/trainers/policy/transfer_policy.py


reuse_encoder,
)
self.action_encoder = self.current_action # self._create_action_encoder(
# self.current_action,
# self.h_size,
# self.action_feature_size,
# action_layers,
# )
self.action_encoder = self._create_action_encoder(
self.current_action,
self.h_size,
self.action_feature_size,
action_layers,
)
if not reuse_encoder:
self.targ_encoder = tf.stop_gradient(self.targ_encoder)

正在加载...
取消
保存