Fix team ELOs

/develop/coma2/samenet/sum
Ervin Teng 4 years ago
Current commit
4893f4b2
1 file changed, with 1 insertion and 1 deletion
  ml-agents/mlagents/trainers/ghost/trainer.py | 2



     i.e. in asymmetric games. We assume the last reward determines the winner.
     :param trajectory: Trajectory.
     """
-    if trajectory.done_reached:
+    if trajectory.done_reached and trajectory.teammate_dones_reached:
         # Assumption is that final reward is >0/0/<0 for win/draw/loss
         final_reward = trajectory.steps[-1].reward
         result = 0.5
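
The hunk above stops right after `result = 0.5`, so the rest of the method is not visible here. As a rough, self-contained sketch of the idea, the block below maps the final reward to a win/draw/loss score only once both `done_reached` and `teammate_dones_reached` are set, then feeds that score into a standard Elo update. The `Step` and `Trajectory` classes are hypothetical stand-ins carrying just the fields seen in the diff. `episode_result`, `elo_change`, and the K-factor of 16 are assumptions for illustration, not the ML-Agents implementation.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Step:
    # Hypothetical stand-in: only the field used in the diff.
    reward: float


@dataclass
class Trajectory:
    # Hypothetical stand-in: only the fields referenced in the diff.
    steps: List[Step]
    done_reached: bool
    teammate_dones_reached: bool


def episode_result(trajectory: Trajectory) -> Optional[float]:
    """Return 1.0 / 0.5 / 0.0 for win / draw / loss.

    Returns None unless the agent and its teammates are all done, which
    mirrors the condition introduced by the commit.
    """
    if not (trajectory.done_reached and trajectory.teammate_dones_reached):
        return None
    # Assumption from the source comment: final reward is >0/0/<0
    # for win/draw/loss.
    final_reward = trajectory.steps[-1].reward
    result = 0.5
    if final_reward > 0:
        result = 1.0
    elif final_reward < 0:
        result = 0.0
    return result


def elo_change(rating: float, opponent_rating: float, result: float,
               k: float = 16.0) -> float:
    """Standard Elo rating delta; the K-factor value is an assumption."""
    expected = 1.0 / (1.0 + 10 ** ((opponent_rating - rating) / 400.0))
    return k * (result - expected)


if __name__ == "__main__":
    finished = Trajectory([Step(0.0), Step(1.0)], True, True)
    partial = Trajectory([Step(0.0), Step(1.0)], True, False)
    print(episode_result(finished))  # team episode over, positive reward
    print(episode_result(partial))   # teammates still running: no result
    print(elo_change(1200.0, 1200.0, episode_result(finished)))
```

In this sketch, a trajectory whose teammates are still active yields no result, so an episode that has ended for only one agent contributes no rating update.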
