浏览代码

Swap 0 set and reward buffer append (#2273)

Fix bug with reward_buffer always 0
/develop-generalizationTraining-TrainerController
GitHub 5 年前
当前提交
1c18bd18
共有 1 个文件被更改,包括 1 次插入1 次删除
  1. 2
      ml-agents/mlagents/trainers/ppo/trainer.py

2
ml-agents/mlagents/trainers/ppo/trainer.py


self.stats["Environment/Cumulative Reward"].append(
rewards.get(agent_id, 0)
)
rewards[agent_id] = 0
rewards[agent_id] = 0
else:
self.stats[
self.policy.reward_signals[name].stat_name

正在加载...
取消
保存