Fix discrete scoping

/develop/nopreviousactions
Ervin Teng, 4 years ago
Commit 2eda5575
1 file changed, 10 insertions and 9 deletions
  1. ml-agents/mlagents/trainers/common/nn_policy.py (19 lines changed)


 hidden_policy = hidden_stream
 policy_branches = []
-for size in self.act_size:
-    policy_branches.append(
-        tf.layers.dense(
-            hidden_policy,
-            size,
-            activation=None,
-            use_bias=False,
-            kernel_initializer=LearningModel.scaled_init(0.01),
+with tf.variable_scope("policy"):
+    for size in self.act_size:
+        policy_branches.append(
+            tf.layers.dense(
+                hidden_policy,
+                size,
+                activation=None,
+                use_bias=False,
+                kernel_initializer=LearningModel.scaled_init(0.01),
+            )
         )
-    )
 raw_log_probs = tf.concat(policy_branches, axis=1, name="action_probs")
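
The diff wraps the per-branch dense layers in `tf.variable_scope("policy")`. The commit does not state the motivation, but the visible effect of such a scope is on variable names. The following is a toy sketch in plain Python of how a scope prefixes the names of variables created inside it. It does not call TensorFlow: `NameRegistry` and `build_branches` are hypothetical helpers written for illustration, and the `dense`, `dense_1` numbering is an assumption that mimics the default TF 1.x layer naming.

```python
from contextlib import contextmanager


class NameRegistry:
    """Toy stand-in for a graph's variable namespace (not TensorFlow itself)."""

    def __init__(self):
        self._scopes = []    # stack of currently active scope names
        self._counts = {}    # number of layers already created per full prefix
        self.variables = []  # every variable name created so far, in order

    @contextmanager
    def variable_scope(self, name):
        # Push the scope on entry and always pop it on exit.
        self._scopes.append(name)
        try:
            yield
        finally:
            self._scopes.pop()

    def dense(self, use_bias=False):
        # The layer name is the active scope path plus a de-duplicated "dense".
        prefix = "/".join(self._scopes + ["dense"])
        n = self._counts.get(prefix, 0)
        self._counts[prefix] = n + 1
        layer = prefix if n == 0 else f"{prefix}_{n}"
        names = [f"{layer}/kernel"]
        if use_bias:
            names.append(f"{layer}/bias")
        self.variables.extend(names)
        return names


def build_branches(registry, act_size, scoped):
    """Create one bias-free dense layer per action branch, as in the diff."""
    if scoped:
        with registry.variable_scope("policy"):
            for _ in act_size:
                registry.dense(use_bias=False)
    else:
        for _ in act_size:
            registry.dense(use_bias=False)
    return list(registry.variables)


# Two discrete action branches, sizes chosen arbitrarily for the example.
unscoped = build_branches(NameRegistry(), [3, 2], scoped=False)
scoped = build_branches(NameRegistry(), [3, 2], scoped=True)
print(unscoped)  # ['dense/kernel', 'dense_1/kernel']
print(scoped)    # ['policy/dense/kernel', 'policy/dense_1/kernel']
```

With the scope in place, every branch variable carries the `policy/` prefix, so code that selects variables by that prefix would also pick up the discrete action branches. Whether the rest of the trainer relies on this prefix is not shown in this commit.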
