6 次代码提交 (36528481-64cf-4929-9a13-116fbfc78c7d)

作者 SHA1 备注 提交日期
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 6 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 6 年前
GitHub d80d5852 add some types to the reward signals (#2215) 6 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 6 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 6 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 6 年前