17 次代码提交 (2cd2048b-df50-4b26-9563-c0cc6a6bfaae)

作者 SHA1 备注 提交日期
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 年前
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub 36ed3c16 Fix issue exporting graph with multi-GPU (#2573) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
GitHub c6c01a03 Enable pylint and fix a few things (#2767) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub bf68edcf ingore attribute-defined-outside-init in multi_gpu_policy (#2876) 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 年前
Ervin Teng 3a4fa244 Switch to tanh squash in PPO 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
Chris Elion fdc810ff move (first pass) 5 年前
Ervin Teng e912fa47 Simplify creation of optimizer, breaks multi-GPU 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前