99 次代码提交 (c912d140-bcba-4efa-95e8-1f61d2eb9d7c)

作者 SHA1 备注 提交日期
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 年前
Deric Pang cdb41480 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
GitHub d2c320dd Remove graph scope (#1205) 6 年前
GitHub 2b6b4570 Fix the Python Tests (#1327) 6 年前
GitHub cd087609 added the pypiwin32 package (#1668) 6 年前
Ervin T b30f4c90 Split `mlagents` into two packages (#1812) 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub e916dc48 use yaml.safe_load instead of yaml.load (#2124) 5 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
GitHub a5b7cf95 Fix get_value_estimate and buffer append (#2276) 5 年前
GitHub be4292fb Add different types of visual encoder (nature cnn/resnet) 5 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 5 年前
GitHub 6225317d refactor vis_encoder_type and add to doc 5 年前
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 年前
GitHub 43696d60 Fix bug in add_rewards_output and add test (#2442) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
Ervin Teng 094cbe4d Fix bug when batch size is a non-multiple of sequence length (#2661) 5 年前
Ervin Teng df44ee8d Fix crash in trainer tests (trainer_metrics) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
GitHub 24ba9d58 Develop deprecate broadcasting (#2669) 5 年前
Chris Elion a1967c19 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 年前
GitHub 0892ef2c [WIP] ISensor interface and use for visual observations (#2731) 5 年前
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub e6f549dc [MLA-12] update protobuf for vector observations (#2862) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
Ervin Teng 29cdf77a Fix RL tests 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
GitHub a6df9f43 Develop new ll api (#3022) 5 年前
Andrew Cohen ef2dfd4c adjusting tests to expect trainer.add_policy to be called 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
Chris Elion fdc810ff move (first pass) 5 年前
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 年前
Ervin Teng 27c2a55b Lots of test fixes 5 年前
Ervin Teng 97d66e71 Remove BootstrapExperience 5 年前
Ervin Teng 83126bb2 Fix PPO value tests 5 年前
Andrew Cohen 70357569 adjusting tests to expect trainer.add_policy to be called 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub 3de3c1f1 check min size for visual encoders (#3112) 5 年前
Ervin Teng 47f8fa7a Fix some import errors 5 年前
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 年前
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 年前
Ervin Teng 3d25f9d2 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub 29c91b14 update flake8 plugin version and fix warnings (#3180) 5 年前
Ervin Teng ce75b378 update flake8 plugin version and fix warnings (#3180) 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng e83276f6 Fix PPO test 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
GitHub 329b23e0 Fix extra summary being written when loading from checkpoint (#3272) 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng d02bfbd4 Remove PPO policy tests 5 年前
Ervin Teng 1c4f60d4 remove more PPO tests 5 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 年前
Ervin Teng 00017bab Temporarily remove multi-GPU 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
GitHub b2cc1c25 [bug-fix] Fix continuous LSTMs and add test (#3521) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 年前
GitHub c42a11c3 [change] Throw a proper error when sequence length is greater than batch size. (#3583) 5 年前
Andrew Cohen 3013774b alternative to internal-policy fix 5 年前
Andrew Cohen d1bee64b fixed test_ghost and test_ppo 5 年前
Andrew Cohen 0af2a651 fixed test_sac 5 年前
Andrew Cohen 89db8428 Merge branch 'internal-policy-ghost-alternate' into soccer-2v1 5 年前
Arthur Juliani 3c82bf59 Training runs, but doesn’t actually work 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
Ruo-Ping Dong e06812aa fix tests 4 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
Ruo-Ping Dong 523248be update 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ruo-Ping Dong c47ffc20 Rename saver 4 年前
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 年前
GitHub e471bd8b Refactoring of the tests folder for the trainers (#4510) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 590adc01 make_fake_trajectory/step take ActionSpec arg 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
Andrew Cohen 0e28dd8f add static method to create continuous/discrete 4 年前