43 次代码提交 (beb5aca5-8748-4913-b270-f746f2f8b1dd)

作者 SHA1 备注 提交日期
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
GitHub 97a1d4b1 [change] Remove the action_holder placeholder from the policy. (#3492) 5 年前
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub 870338b4 [bug-fix] Fix issue with more than one continuous actions (#3547) 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
Anupam Bhatnagar e8e0078e first commit 5 年前
GitHub 94de596b [change] Remove concatenate in discrete action probabilities to improve inference performance (#3598) 5 年前
Anupam Bhatnagar 001fce2a first commit 5 年前
Arthur Juliani 8c6f4696 Fix a couple additional bugs 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
Andrew Cohen 4a3ad193 Add constant decay to beta and epsilon 5 年前
Arthur Juliani 2b3a6347 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
Andrew Cohen 1e50c76e calculating gradient norms 5 年前
Andrew Cohen 0e965a4d sensitivity 5 年前
Andrew Cohen c1f91b5a slightly nicer output 5 年前
Andrew Cohen 23b84dea ignoring commit checks but write to csv 5 年前
Andrew Cohen 61aa9915 write to csv 5 年前
Andrew Cohen d794964f constant beta 5 年前
Andrew Cohen 13c2a209 added opp, decay eps removed 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
Andrew Cohen 50e4585f fixed beta 5 年前
Arthur Juliani 9724c9ac Merge master 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Ruo-Ping Dong 6feec58a add Saver class (only TF working) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 年前
Andrew Cohen 02df39ab ignore precommit 4 年前
Andrew Cohen fa35292c write hist to tb 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
GitHub 3f44a0bc cleanup around AdamOptimizer (#4333) 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
GitHub d2d46103 Remove print from ppo tf opti 4 年前