浏览代码

fix action stop gradient

/develop/bisim-sac-transfer
yanchaosun 4 年前
当前提交
3762358d
共有 1 个文件被更改,包括 8 次插入5 次删除
  1. 13
      ml-agents/mlagents/trainers/policy/transfer_policy.py

13
ml-agents/mlagents/trainers/policy/transfer_policy.py


:param encoded_state: Tensor corresponding to encoded current state.
:param encoded_next_state: Tensor corresponding to encoded next state.
"""
if separate_train:
encoded_state = tf.stop_gradient(encoded_state)
if separate_train:
hidden = tf.stop_gradient(hidden)
for i in range(forward_layers):
hidden = tf.layers.dense(

separate_train: bool = False
):
if separate_train:
encoded_state = tf.stop_gradient(encoded_state)
if separate_train:
hidden = tf.stop_gradient(hidden)
for i in range(forward_layers):
hidden = tf.layers.dense(
hidden,

正在加载...
取消
保存