15 commits (4e05233f-1184-4c57-bee7-435b2b78bdec)

Author SHA1 Message Commit date
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 years ago
GitHub 97a1d4b1 [change] Remove the action_holder placeholder from the policy. (#3492) 5 years ago
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 years ago
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 years ago
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
Arthur Juliani 9724c9ac Merge master 4 years ago
GitHub cde8bd29 Convert List[np.ndarray] to np.ndarray before using torch.as_tensor (#4183) 4 years ago
GitHub 05a11c96 Develop add fire exp framework (#4213) 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
Andrew Cohen 02df39ab ignore precommit 4 years ago
Andrew Cohen fa35292c write hist to tb 4 years ago