浏览代码

cast time penalty to float (#5424)

/main
GitHub 3 年前
当前提交
9c891c97
共有 1 个文件被更改,包括 2 次插入2 次删除
  1. 4
      Project/Assets/ML-Agents/Examples/Soccer/Scripts/SoccerEnvController.cs

4
Project/Assets/ML-Agents/Examples/Soccer/Scripts/SoccerEnvController.cs


{
if (scoredTeam == Team.Blue)
{
m_BlueAgentGroup.AddGroupReward(1 - m_ResetTimer / MaxEnvironmentSteps);
m_BlueAgentGroup.AddGroupReward(1 - (float)m_ResetTimer / MaxEnvironmentSteps);
m_PurpleAgentGroup.AddGroupReward(1 - m_ResetTimer / MaxEnvironmentSteps);
m_PurpleAgentGroup.AddGroupReward(1 - (float)m_ResetTimer / MaxEnvironmentSteps);
m_BlueAgentGroup.AddGroupReward(-1);
}
m_PurpleAgentGroup.EndGroupEpisode();

正在加载...
取消
保存