浏览代码

fix problem

/develop/bisim-review
yanchaosun 4 年前
当前提交
f74af710
共有 1 个文件被更改,包括 10 次插入2 次删除
  1. 12
      ml-agents/mlagents/trainers/ppo_transfer/optimizer.py

12
ml-agents/mlagents/trainers/ppo_transfer/optimizer.py


"learning_rate": self.learning_rate,
"decay_epsilon": self.decay_epsilon,
"decay_beta": self.decay_beta,
"reward_loss": self.policy.reward_loss,
self.model_update_dict.update(
{

"decay_epsilon": self.decay_epsilon,
"decay_beta": self.decay_beta,
"reward_loss": self.policy.reward_loss,
if self.predict_return:
self.ppo_update_dict.update({
"reward_loss": self.policy.reward_loss,
})
self.model_update_dict.update({
"reward_loss": self.policy.reward_loss,
})
@timed
def update(self, batch: AgentBuffer, num_sequences: int) -> Dict[str, float]:

正在加载...
取消
保存