浏览代码

Use reward sum

/develop/coma2/samenet/sum
Ervin Teng 4 年前
当前提交
b21094f1
共有 1 个文件被更改,包括 1 次插入3 次删除
  1. 4
      ml-agents/mlagents/trainers/trajectory.py

4
ml-agents/mlagents/trainers/trajectory.py


)
agent_buffer_trajectory["team_rewards"].append(teammate_rewards)
team_reward = teammate_rewards + [exp.reward]
agent_buffer_trajectory["average_team_reward"].append(
sum(team_reward) / len(team_reward)
)
agent_buffer_trajectory["average_team_reward"].append(sum(team_reward))
# Next actions
teammate_cont_next_actions = []

正在加载...
取消
保存