浏览代码

constant beta

/asymm-envs
Andrew Cohen 5 年前
当前提交
d794964f
共有 1 个文件被更改,包括 4 次插入3 次删除
  1. 7
      ml-agents/mlagents/trainers/ppo/optimizer.py

7
ml-agents/mlagents/trainers/ppo/optimizer.py


decay_epsilon = tf.train.polynomial_decay(
epsilon, self.policy.global_step, max_step, 0.1, power=1.0
)
decay_beta = tf.train.polynomial_decay(
beta, self.policy.global_step, max_step, 1e-5, power=1.0
)
# decay_beta = tf.train.polynomial_decay(
# beta, self.policy.global_step, max_step, 1e-5, power=1.0
# )
decay_beta = tf.Variable(beta)
value_losses = []
for name, head in value_heads.items():

正在加载...
取消
保存