浏览代码

fix

/develop/bisim-review
yanchaosun 5 年前
当前提交
3f0cc587
共有 1 个文件被更改,包括 6 次插入1 次删除
  1. 7
      ml-agents/mlagents/trainers/ppo_transfer/optimizer.py

7
ml-agents/mlagents/trainers/ppo_transfer/optimizer.py


"value_loss": self.value_loss,
"policy_loss": self.abs_policy_loss,
"model_loss": self.model_loss,
"reward_loss": self.policy.reward_loss,
}
)
if self.predict_return:
self.update_dict.update(
{
"reward_loss": self.policy.reward_loss,
}
)

正在加载...
取消
保存