152 次代码提交 (7cd80378-a0c9-4e55-9305-49b50be655c2)

作者 SHA1 备注 提交日期
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 年前
GitHub ab5c49e8 Release v0.5 delete unityagents (#1151) 6 年前
GitHub 2d4b4209 Use single scope declaration for models (#1160) 6 年前
GitHub d2c320dd Remove graph scope (#1205) 6 年前
GitHub 6c354d16 New Learning Brain (#1303) 6 年前
vincentpierre 47de43f6 reverted .tf to .bytes into the policy.py script 6 年前
GitHub cc083fd8 fixed the windows ctrl-c bug (#1558) 6 年前
Vincent-Pierre BERGES 4a6ae4e0 Barracuda integration into ML-Agents (#1557) 6 年前
GitHub c258b1c3 Move 'take_action' into Policy class (#1669) 6 年前
GitHub cfb8f208 Release v0.7 minor fixes (#1759) 6 年前
GitHub 20ff1436 Merge pull request #1765 from Unity-Technologies/release-v0.7 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub 2671e1a0 Enable mypy in precommit checks (#2177) 6 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 年前
GitHub 832e4a47 Normalize observations when adding experiences (#2556) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub 36ed3c16 Fix issue exporting graph with multi-GPU (#2573) 5 年前
Jonathan Harper 3fc14963 EXPERIMENTAL horovod support 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
Chris Elion 691d21e6 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub c6c01a03 Enable pylint and fix a few things (#2767) 5 年前
GitHub ccb7eab4 Remove {text,custom} {action,observations} (#2839) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 681093cf cherry pick PR#3032 (#3066) 5 年前
GitHub ef2514ba Develop cold fix recurrent (#3032) 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub 90db165f Add --namespace-packages to mypy for mlagents (#3075) 5 年前
Chris Elion fdc810ff move (first pass) 5 年前
Ervin Teng 2b811fc8 Properly report value estimates and episode length 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 年前
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 年前
Ervin Teng 1bd791e5 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub d798b1cb Prevent tf.Session() from eating up all the GPU memory (#3219) 5 年前
GitHub 4c241a80 Only send previous action and current BrainInfo (#3187) 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 76ad64d7 Some more bugfixes 5 年前
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 5 年前
Ervin Teng 2b63415e Clean up policy files 5 年前
GitHub ca96b293 Move advance() logic for environment manager out of trainer_controller (#3234) 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng 0ef40c08 SAC CC working 5 年前
GitHub 14193ada Self-play for symmetric games (#3194) 5 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng edeceefd Zeroed version of LSTM working for PPO 5 年前
Ervin Teng 4de71b84 0 out value estimates as well 5 年前
Ervin Teng 4871f49c Fix comments for PPO 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
GitHub dd86e879 Separate out optimizer creation and policy graph creation (#3355) 5 年前
Ervin Teng cdd57468 Re-fix scoping and add method to get all variables 5 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 年前
Ervin Teng c350c6d8 Added enforcement of m_size to be divisible by 2 5 年前
Ervin Teng 441e6a0c Add typing to optimizer, rename self.tf_optimizer 5 年前
Ervin Teng 7004604d Used NamedTuple for create normalization tensors 5 年前
Ervin Teng 8abd4129 Clean up nn_policy 5 年前
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 年前
GitHub 587dd165 Support for ONNX export (#3101) 5 年前
GitHub 3641293f Change checkpoint suffix to "ckpt" (#3470) 5 年前
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng 1cfc461a Remove and rename tf_optimizer 5 年前
Ervin Teng 63463bd1 Make TF graph seed deterministic 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
Anupam Bhatnagar e8e0078e first commit 5 年前
Anupam Bhatnagar 07b15ae7 [skip-ci] small refactors 5 年前
GitHub 873ba7fd [bug-fix] Fix stats reporting for reward signals in SAC (#3606) 5 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 年前
GitHub de3fc4e8 Hotfix memory leak on Python (#3664) 5 年前
Andrew Cohen 93d344ff simple rl asymm ghost tests 5 年前
GitHub bc1fdf07 [refactor] CLI changes (#3705) 5 年前
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 年前
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 5 年前
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 年前
GitHub d7ca6b8d [feature] Add --initialize-from option (#3710) 5 年前
Anupam Bhatnagar 001fce2a first commit 5 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 年前
Anupam Bhatnagar 9341f7a2 [skip-ci] small refactors 5 年前
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 5 年前
Arthur Juliani b997f214 Share more code between tf and torch policies 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
Arthur Juliani 1736559f Combine actor and critic classes. Initial export. 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
GitHub d2bc86c8 Release 2 cherry pick (#3971) 5 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub abbc6424 [bug-fix] Fix issue with initialize not resetting step count (#3962) 5 年前
Arthur Juliani 28e095e0 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub 335cff3e [versioning] Save ML-Agents version in checkpoints and check on load (#4035) 5 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 5 年前
Anupam Bhatnagar 4afd8f92 first commit 4 年前
Arthur Juliani 9724c9ac Merge master 5 年前
Ervin Teng 510583d2 Move memory validation to settings 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Ruo-Ping Dong 6feec58a add Saver class (only TF working) 4 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
Ruo-Ping Dong b4713baa small improvements 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
yanchaosun 5a778ca3 fix normalization 4 年前
yanchaosun a212fef9 new bisim implementation 4 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 年前
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
Andrew Cohen 18ff42a6 use mean of first trajectory to initialize the normalizer 4 年前
Andrew Cohen 5878b952 remove blank line 4 年前
Andrew Cohen ce9bcefe cleaned up initialization of variance/mean 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
Anupam Bhatnagar 87bdf353 [skip ci] save model on worker zero only 4 年前
Anupam Bhatnagar d3e8f124 removing horovod from tf policy 4 年前
Anupam Bhatnagar abc1220f Merge branch 'master' into global-variables 4 年前
Chris Elion d2133d83 comments and cleanup 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Andrew Cohen a65d08c7 ghost trainer tests 4 年前
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 年前
Andrew Cohen 71f9c241 fix tf policy for ghosts 4 年前
Anupam Bhatnagar 5e8aa485 renaming file from globals.py to global_values.py 4 年前
Anupam Bhatnagar 71c301bc minor fixes 4 年前
Anupam Bhatnagar 1f60979f [skip ci] change self.rank to global_values.get_rank() 4 年前
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 年前
Andrew Cohen fc3027ac tf tests except gail pass 4 年前
Andrew Cohen f654df34 fixing tensorflow tests 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
GitHub 87a7ccf8 use int64 steps, check for NaN actions (#4607) 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
GitHub a0d1c829 Action Docs part2 (#4739) 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
Andrew Cohen 97d94a83 fix test_tf_policy 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub 9d8a7d6f Update ml-agents/mlagents/trainers/policy/tf_policy.py 4 年前
Andrew Cohen 7ba10239 remove action spec attribute from policy 4 年前
Andrew Cohen 662fd6b1 added docstrings to action flattener 4 年前
GitHub cc948a41 Policy output actiontuple (#4651) 4 年前