106 commits (4671cf17-7b01-4722-935e-99b514a6ebbe)

Author SHA1 Message Commit date
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 years ago
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 years ago
GitHub d80d5852 add some types to the reward signals (#2215) 5 years ago
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 years ago
Chris Elion 5d07ca1f Merge remote-tracking branch 'origin/develop' into enable-flake8 5 years ago
Chris Elion dfdf7b83 fix whitespace and line breaks 5 years ago
GitHub f8041534 Merge pull request #2236 from Unity-Technologies/enable-flake8 5 years ago
GitHub 6a212f73 Improvements for GAIL (#2296) 5 years ago
GitHub dd0d2a10 Remove unnecessary feed_dicts for GAIL and Curiosity (#2348) 5 years ago
Jeffrey Shih 728afebf Release 0.9.0 docs checklist and cleanup - v2 (#2372) 5 years ago
GitHub c7f0ed04 Merge pull request #2381 from Unity-Technologies/release-0.9.0 5 years ago
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 years ago
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 years ago
GitHub ab690b93 Fix naming conflict between Curiosity and GAIL (#2406) 5 years ago
GitHub afb6ede5 Merge pull request #2393 from Unity-Technologies/hotfix-v0.9.0a 5 years ago
Ervin Teng 072d2ef8 Merge latest develop 5 years ago
GitHub 4472838e Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1 5 years ago
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 years ago
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 years ago
GitHub 0a163871 Merge pull request #2469 from Unity-Technologies/release-0.9.2 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
GitHub 149ebd67 Fix crash with VAIL + GAIL (#2598) 5 years ago
GitHub b2fa2268 Merge pull request #2648 from Unity-Technologies/release-0.10.0 5 years ago
Anupam Bhatnagar cc208c00 resolving conflicts 5 years ago
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 years ago
Chris Elion 806c77e4 centralize tensorflow imports 5 years ago
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 years ago
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 years ago
GitHub 495873e5 Merge pull request #2833 from Unity-Technologies/release-0.11.0 5 years ago
GitHub c6c01a03 Enable pylint and fix a few things (#2767) 5 years ago
Jonathan Harper 8550679d Merge branch 'develop' into release-0.11.0 5 years ago
GitHub 4da157fe more pylint fixes (#2842) 5 years ago
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 years ago
Chris Elion 73a346cb cleanup 5 years ago
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 years ago
Ervin Teng 987e0e3a Merge tf2 branch 5 years ago
GitHub 69d1a033 Develop remove past action communication (#2913) 5 years ago
Ervin Teng 54644477 Merge branch 'develop' of github.com:Unity-Technologies/ml-agents into develop-nomaxstep-test 5 years ago
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 years ago
GitHub d4780a55 Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master 5 years ago
GitHub 652488d9 check for numpy float64 (#2948) 5 years ago
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 years ago
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 years ago
GitHub 35c995e9 Merge pull request #3038 from Unity-Technologies/develop 5 years ago
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 years ago
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 years ago
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 years ago
Ervin Teng 9d1eff12 Fix one more np float32 issue 5 years ago
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 years ago
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 years ago
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 5 years ago
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 years ago
Ervin Teng 3348bcef Commit init file 5 years ago
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 years ago
Ervin Teng abc98c23 Change reward signal creation 5 years ago
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 years ago
Ervin Teng b61d2fa1 Fix some typing issues with curiosity 5 years ago
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 years ago
Ervin Teng cb2d2526 Reformat using black 5 years ago
Ervin Teng 5bfc0b87 Update docstring 5 years ago
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 years ago
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 years ago
Andrew Cohen 5b0aca29 Merge branch 'master' into soccer-fives 5 years ago
Ervin Teng 14f2a7f2 Rename LearningModel to ModelUtils 5 years ago
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 5 years ago
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 years ago
Anupam Bhatnagar e04fcd71 Merge branch 'master' into master-into-release-0.14.1 5 years ago
GitHub 97a1d4b1 [change] Remove the action_holder placeholder from the policy. (#3492) 5 years ago
Andrew Cohen de73baa9 Merge branch 'master' into soccer-fives 5 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 years ago
Andrew Cohen 573b1f6d Merge branch 'master' into soccer-fives 5 years ago
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 years ago
GitHub 86141eee Merge pull request #3560 from Unity-Technologies/new-logger 5 years ago
GitHub e3af96ca Merge branch 'master' into develop-demo-load-seek 5 years ago
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 years ago
Chris Elion 7f2e815a Merge remote-tracking branch 'origin/master' into develop-sidechannel-usability 5 years ago
Chris Elion fa5e7e6d Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public 5 years ago
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 5 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 years ago
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 5 years ago
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 5 years ago
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 5 years ago
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 years ago
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 5 years ago
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 years ago
Andrew Cohen 930d6fa3 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen b0c506a6 Merge branch 'soccer-2v1' into asymm-envs 5 years ago
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 5 years ago
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 5 years ago
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 5 years ago
Andrew Cohen 0ec2a890 Merge branch 'master' into asymm-envs 5 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
GitHub f7373172 Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 4 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 years ago
Arthur Juliani 9724c9ac Merge master 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
vincentpierre 599d7e9f Merging master 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 years ago
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 years ago
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 years ago
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 years ago
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 years ago
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 years ago
Arthur Juliani 1a123641 Merge remote-tracking branch 'origin/master' into r5-master 4 years ago
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 years ago