backprop enc single task

/develop/bisim-review
Andrew Cohen, 4 years ago
Commit 463db9e8
1 file changed, with 4 insertions and 4 deletions
  1. ml-agents/mlagents/trainers/policy/transfer_policy.py (8 changes)

ml-agents/mlagents/trainers/policy/transfer_policy.py (8 changes)


"""
  combined_input = tf.concat([encoded_state, encoded_action], axis=1)
  hidden = combined_input
- if not self.transfer:
-     hidden = tf.stop_gradient(hidden)
+ #if not self.transfer:
+ #    hidden = tf.stop_gradient(hidden)
  for i in range(forward_layers):
      hidden = tf.layers.dense(

  combined_input = tf.concat([encoded_state, encoded_action], axis=1)
  hidden = combined_input
- if not self.transfer:
-     hidden = tf.stop_gradient(hidden)
+ #if not self.transfer:
+ #    hidden = tf.stop_gradient(hidden)
  for i in range(forward_layers):
      hidden = tf.layers.dense(
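
Going by the commit message, disabling the `tf.stop_gradient` call appears intended to let the loss of these dense layers backpropagate into the encoders in the single-task (non-transfer) case as well. The effect can be sketched in plain Python with a scalar stand-in; this is not the ml-agents code, and the function and the names `w_enc` and `w_fwd` are hypothetical, chosen only for illustration:

```python
def forward_model_grads(x, target, w_enc, w_fwd, stop_gradient):
    """Return (dL/dw_enc, dL/dw_fwd) for L = (w_fwd * (w_enc * x) - target) ** 2.

    Hypothetical scalar stand-in: w_enc plays the role of the encoder,
    w_fwd the role of the dense layers that consume the encoded input.
    """
    encoded = w_enc * x             # stand-in for the encoder output
    prediction = w_fwd * encoded    # stand-in for the dense layers
    d_prediction = 2.0 * (prediction - target)
    grad_w_fwd = d_prediction * encoded
    # With stop_gradient, the encoded value is treated as a constant,
    # so no gradient flows back into the encoder weight.
    d_encoded = 0.0 if stop_gradient else d_prediction * w_fwd
    grad_w_enc = d_encoded * x
    return grad_w_enc, grad_w_fwd


# Before the change (non-transfer case): the encoder receives no gradient.
before = forward_model_grads(2.0, 1.0, 0.5, 3.0, stop_gradient=True)
# After the change: the encoder is trained by the same loss.
after = forward_model_grads(2.0, 1.0, 0.5, 3.0, stop_gradient=False)
```

In both cases the gradient for the downstream weight is identical; only the encoder's gradient changes, which is the behavioral difference the two commented-out lines introduce.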
