16 commits (9060da06-e64b-47ce-b792-5619dd26774c)

Author SHA1 Message Commit date
Andrew Cohen 418cc778 coma trainer and optimizer 4 years ago
Andrew Cohen 98d647de MultiInputNetBody 4 years ago
Ervin Teng 9bc88c41 Running COMA (not sure if learning) 4 years ago
Ervin Teng a4fcbb63 Right loss function for stability, fix some pypi 4 years ago
Andrew Cohen 5d517c5e clean ups 4 years ago
Andrew Cohen 131fa328 inital evaluate_by_seq, does not run 4 years ago
Andrew Cohen 67beef88 finished evaluate_by_seq, does not run 4 years ago
Andrew Cohen 8f799687 ignoring precommit, grabbing baseline/critic mems from buffer in trainer 4 years ago
Andrew Cohen 81524ee8 lstm almost runs 4 years ago
Andrew Cohen 4c56e6ad lstm runs with coma 4 years ago
GitHub ba2af269 [coma2] Make group extrinsic reward part of extrinsic (#5033) 4 years ago
GitHub 6ae8ea1e [coma2] Add support for variable length obs in COMA2 (#5038) 4 years ago
Andrew Cohen 0afe5f24 add slice function to agent action 4 years ago
Andrew Cohen 21d7ab85 add torch no_grad to coma LSTM value computation 4 years ago
Ervin Teng 252c1f36 Fix warning message format 4 years ago
Ervin Teng 58122103 Fix warning message formatting again 4 years ago