32 次代码提交 (develop-horovod)

作者 SHA1 备注 提交日期
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
GitHub d80d5852 add some types to the reward signals (#2215) 5 年前
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
Chris Elion dfdf7b83 fix whitespace and line breaks 5 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 5 年前
GitHub dd0d2a10 Remove unnecessary feed_dicts for GAIL and Curiosity (#2348) 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 年前
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng abc98c23 Change reward signal creation 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前
Ervin Teng b61d2fa1 Fix some typing issues with curiosity 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前