25 次代码提交 (082609be-7c77-44e9-a427-d1d0ef0f2b20)

作者 SHA1 备注 提交日期
GitHub 3b866e9f Use Clipped Gaussian (#649) 6 年前
sankalp04 c6fba86a tennis reset parameter implementation ported over 5 年前
GitHub 88b917b3 [format] Format code whitespace with Unity Formatter. (#2550) 5 年前
GitHub f01dd1c1 [coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555) 5 年前
GitHub 5d2e466f Fix Code convention warnings in Rider. (#2801) 5 年前
GitHub 14193ada Self-play for symmetric games (#3194) 5 年前
GitHub 411bb64a Renaming Agent's methods (#3557) 5 年前
Andrew Cohen d9f1a2f5 more experiments for self-play 5 年前
Andrew Cohen 8431ecb5 tennis reward fix 5 年前
Andrew Cohen 5d659946 update tennis reward function 5 年前
Andrew Cohen 9f36cd36 added floorhit obs tennis 5 年前
Andrew Cohen e5b883db added bounce obs to agent/more downward force on ball 5 年前
Andrew Cohen 1c4ba1a5 add timestep bonus to loss 5 年前
Andrew Cohen a6e6e63e timestep penalty on loss only 5 年前
Andrew Cohen 251dcc76 remove timepenalty from tennis 4 年前
Andrew Cohen b7bd4c2c reduce winning reward 4 年前
Andrew Cohen 1c2e1d79 increase beta 4 年前
Andrew Cohen d77f2566 energy usage penalty to prevent superstition on serve 4 年前
Andrew Cohen a8f2f613 no energy penalty 4 年前
Andrew Cohen 69acdeec fixed reset tennis 4 年前
Andrew Cohen 84f231ce time penalty 4 年前
Andrew Cohen 43d5ef17 fixed opponent setting 4 年前
Andrew Cohen 0c17dc1b cannot hit scenery tennis 4 年前
Andrew Cohen 7475ad11 tunneling is a loss 4 年前
GitHub e7916b08 add pre-commit hook for dotnet-format (#4362) 4 年前