34 次代码提交 (a54aef02-cc1d-4cb5-8ebb-9edf93d55212)

作者 SHA1 备注 提交日期
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
Chris Elion dfdf7b83 fix whitespace and line breaks 5 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 5 年前
GitHub dd0d2a10 Remove unnecessary feed_dicts for GAIL and Curiosity (#2348) 5 年前
Jeffrey Shih 728afebf Release 0.9.0 docs checklist and cleanup - v2 (#2372) 5 年前
GitHub c7f0ed04 Merge pull request #2381 from Unity-Technologies/release-0.9.0 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
GitHub 4da157fe more pylint fixes (#2842) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng abc98c23 Change reward signal creation 5 年前
Ervin Teng cb2d2526 Reformat using black 5 年前
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 年前
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前