6 commits (da6d25c9-4e76-454f-81d0-1258ba68390b)

Author SHA1 Message Commit date
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 4 years ago
GitHub ed2eb6ef [bug-fix] Fix entropy computation in MultiCategorialDistribution (#3607) 4 years ago
GitHub 94de596b [change] Remove concatenate in discrete action probabilities to improve inference performance (#3598) 4 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 4 years ago
GitHub 29f82921 [bug-fix] Improve performance for PPO with continuous actions (#3662) 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago