浏览代码

move from average to sum of rewards

/develop/coma2
Andrew Cohen 3 年前
当前提交
45dd7401
共有 1 个文件被更改,包括 3 次插入1 次删除
  1. 4
      ml-agents/mlagents/trainers/trajectory.py

4
ml-agents/mlagents/trainers/trajectory.py


agent_buffer_trajectory["team_rewards"].append(teammate_rewards)
team_reward = teammate_rewards + [exp.reward]
agent_buffer_trajectory["average_team_reward"].append(
sum(team_reward) / len(team_reward)
sum(team_reward)
# sum(team_reward) / len(team_reward)
#)
# Next actions
teammate_cont_next_actions = []

正在加载...
取消
保存