17 commits (8c4966be-c125-4d81-af77-f3a8b35dc10d)

Author SHA1 Message Commit date
GitHub 8f35bdd3 POCA trainer (#5005) 4 years ago
GitHub 62314056 Fix ghost curriculum and make steps private (#5098) 4 years ago
Ervin Teng 54ffbed6 [cherry-pick] Fix ghost curriculum and make steps private (#5098) 4 years ago
Andrew Cohen 9176247c Merge branch 'main' into develop-soccer-groupman-mod 4 years ago
GitHub e81e038b Fix end episode for POCA, add warning for group reward if not POCA (#5113) 4 years ago
GitHub e79d8a9d [bug-fix] Move POCA critic to default device (#5124) 4 years ago
GitHub 63169e2c [cherry-pick] Fix group rewards for POCA, add warning for non-POCA trainers (#5120) 4 years ago
GitHub e6143a83 [bug-fix] Move POCA critic to default device (#5124) (#5131) 4 years ago
Ervin Teng d1c24251 [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 years ago
Ervin Teng c108da4a [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 4 years ago
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 4 years ago
Ervin Teng 81b74634 Fix additional bugs and POCA 4 years ago
Ervin Teng c05ec9af Fix groupmate obs, add tests 4 years ago
Ervin Teng 9fd4a81e Address comments 4 years ago
GitHub ff21216d [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 years ago
GitHub c5589b59 [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 4 years ago
vincentpierre 983982ee Removing misleading learning rate 4 years ago