17 次代码提交 (2677d314-546f-4cae-8cef-d6e1f2dd7f5a)

作者 SHA1 备注 提交日期
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
GitHub b2cc1c25 [bug-fix] Fix continuous LSTMs and add test (#3521) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Anupam Bhatnagar 07b15ae7 [skip-ci] small refactors 5 年前
Anupam Bhatnagar 06a54ae8 step increment moved to _update_policy, fixed exit status issue 4 年前
Anupam Bhatnagar 5d180caf [skip ci] modify learning rate in horovod optimizer 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 年前
Anupam Bhatnagar 4afd8f92 first commit 4 年前
Anupam Bhatnagar 24d5f881 first commit 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Andrew Cohen 02df39ab ignore precommit 4 年前
Andrew Cohen fa35292c write hist to tb 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
GitHub 3f44a0bc cleanup around AdamOptimizer (#4333) 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
Andrew Cohen 3c65b964 fixed recurrent prev_action issue 4 年前