6 次代码提交 (2677d314-546f-4cae-8cef-d6e1f2dd7f5a)

作者 SHA1 备注 提交日期
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 年前
GitHub ed2eb6ef [bug-fix] Fix entropy computation in MultiCategorialDistribution (#3607) 5 年前
GitHub 94de596b [change] Remove concatenate in discrete action probabilities to improve inference performance (#3598) 5 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 年前
GitHub 29f82921 [bug-fix] Improve performance for PPO with continuous actions (#3662) 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前