133 commits (1db18bd6-65c7-4859-bf35-891bb0856880)

Author SHA1 Message Date
Arthur Juliani de700c3a Multi Brain Training and Recurrent state encoder (#166) 7 years ago
GitHub 51621334 State Stacking & Banan Environment (#262) 7 years ago
Arthur Juliani 4418421a Rename variables in imitation trainer 7 years ago
GitHub 8317a659 Behavioral Cloning & Trainers Reorg (#328) 7 years ago
eshvk 030ac5c5 [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups. 7 years ago
GitHub 237b41f9 Hotfix 0.3.0c (#618) 7 years ago
GitHub 702d98c6 [Fix] The summary writer is now implemented in the abtract trainer class. (#806) 6 years ago
Arthur Juliani d7338050 Enable concurrent sessions 6 years ago
Arthur Juliani 5d402be9 Minor Optimizations (#836) 6 years ago
Arthur Juliani 195ac934 Merge branch 'develop' into develop-runs 6 years ago
Arthur Juliani fad0da30 Log run-id in console 6 years ago
unityjeffrey 0d67f311 changed ml agents to ml-agents 6 years ago
unityjeffrey 19fb437a changed to Unity ML-Agents Toolkit (english) 6 years ago
Arthur Juliani f52d5a92 Merge remote-tracking branch 'origin/develop' into develop-runs 6 years ago
Deric Pang c88c7e42 Fixing bugs, updating tests. 6 years ago
Deric Pang db6fa4ba Removing commented line. 6 years ago
Deric Pang ff4ce695 Updated logging in trainer. 6 years ago
Deric Pang 9d9c91e4 Fixed TensorBoard lesson logging. 6 years ago
Deric Pang 822d329a Fixing bug when no curriculum folder is passed. 6 years ago
Arthur Juliani 9e8049f0 Will now print summaries even when not training or when training is over (#1020) 6 years ago
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 years ago
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 years ago
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 years ago
GitHub 29084e77 Curriculum learning reward thresholding bug fix (#1141) 6 years ago
GitHub d2c320dd Remove graph scope (#1205) 6 years ago
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 years ago
GitHub 840417ff Use organized tags for tensorboard stats (#1248) 6 years ago
GitHub 78374601 vince's fix for model step (#1329) 6 years ago
GitHub c258b1c3 Move 'take_action' into Policy class (#1669) 6 years ago
Ervin T b30f4c90 Split `mlagents` into two packages (#1812) 6 years ago
eshvk cc9bdf17 Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return 6 years ago
eshvk fb04c40c Reorganize to make metrics collection more accurate 6 years ago
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 5 years ago
GitHub 2671e1a0 Enable mypy in precommit checks (#2177) 5 years ago
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 years ago
Jonathan Harper 177ee5b8 Remove unused "last reward" logic, TF nodes 5 years ago
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 years ago
Chris Elion bb7773c1 add flake8 to precommit 5 years ago
Chris Elion 5d07ca1f Merge remote-tracking branch 'origin/develop' into enable-flake8 5 years ago
GitHub 19283bfa Very simple environment for testing (#2266) 5 years ago
GitHub 9eb3f049 Cleanup unused code in TrainerController (#2315) 5 years ago
GitHub 83875376 Add "gauges" to timer system (#2329) 5 years ago
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 years ago
GitHub f628d18b initialize trainer step count (#2498) (#2505) 5 years ago
GitHub 7720db33 Fix run_id typing in trainer.py (#2537) 5 years ago
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 years ago
Chris Elion 806c77e4 centralize tensorflow imports 5 years ago
GitHub 4da157fe more pylint fixes (#2842) 5 years ago
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 years ago
Chris Elion 73a346cb cleanup 5 years ago
Ervin Teng 748c250e Somewhat running 5 years ago
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 years ago
Andrew Cohen e96b80db recieves brain_name and identifier on python side 5 years ago
Andrew Cohen 8578b0b7 add_policy and create_policy separated 5 years ago
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 years ago
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 years ago
Andrew Cohen 614d276f recieves brain_name and identifier on python side 5 years ago
Chris Elion fdc810ff move (first pass) 5 years ago
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 years ago
Andrew Cohen d1edbf43 add_policy and create_policy separated 5 years ago
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 years ago
GitHub 2ac242f7 Remove TrainerMetrics and add CSVWriter using new StatsWriter API (#3108) 5 years ago
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 years ago
Andrew Cohen 082789ea Merge branch 'master' into develop-magic-string 5 years ago
Andrew Cohen b14680f1 fixing ci tests 5 years ago
Ervin Teng e577d5ea Fix some mypy issues and remove unused code 5 years ago
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 years ago
Andrew Cohen fc485077 fixed more ci problems/removed self.policies 5 years ago
Ervin Teng db743971 Move private methods out of trainer, simplify interface 5 years ago
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 years ago
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 years ago
Ervin Teng 3d25f9d2 Merge branch 'master' into develop-agentprocessor 5 years ago
GitHub d798b1cb Prevent tf.Session() from eating up all the GPU memory (#3219) 5 years ago
GitHub 56a67403 Fix lost trajectories when they are produced faster than they are consumed (#3233) 5 years ago
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 5 years ago
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 years ago
GitHub 329b23e0 Fix extra summary being written when loading from checkpoint (#3272) 5 years ago
GitHub 14193ada Self-play for symmetric games (#3194) 5 years ago
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 years ago
GitHub 587dd165 Support for ONNX export (#3101) 5 years ago
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 5 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 years ago
Anupam Bhatnagar abc369a6 Adding a logging utility for improved logs 5 years ago
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 years ago
Anupam Bhatnagar e8e0078e first commit 5 years ago
Anupam Bhatnagar 07b15ae7 [skip-ci] small refactors 5 years ago
Anupam Bhatnagar 455adc60 [skip ci] continue training until worker-0 is done 5 years ago
Anupam Bhatnagar e49f186b removing logging statements 5 years ago
Ervin Teng ce6ab0de Make progress bar class and add to trainer 5 years ago
Ervin Teng bcf073bf Move console logging to ConsoleWriter 5 years ago
Ervin Teng 6b578de4 Merge branch 'develop-refactorprint' into develop-progress-bar 5 years ago
Ervin Teng 49df4038 Make progress bar a statswriter 5 years ago
GitHub 25cc9f15 [change] Move hyperparameter printing entirely into StatsWriters (#3630) 5 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 years ago
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 years ago
Andrew Cohen 9f09a65d team id centric ghost trainer 5 years ago
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 5 years ago
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 years ago
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 years ago
Andrew Cohen 3de78baa wrapped trainer has internal policy ghost 5 years ago
Andrew Cohen 3013774b alternative to internal-policy fix 5 years ago
Ervin Teng ed06f37c Ability to disable threading 5 years ago
Anupam Bhatnagar 001fce2a first commit 5 years ago
Anupam Bhatnagar 9341f7a2 [skip-ci] small refactors 5 years ago
Anupam Bhatnagar f36108a9 [skip ci] continue training until worker-0 is done 5 years ago
Anupam Bhatnagar c49cc069 removing logging statements 5 years ago
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 4 years ago
Ervin Teng 9fe104d6 Make threading disable-able per trainer 4 years ago
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 4 years ago
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 4 years ago
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 4 years ago
Arthur Juliani 1736559f Combine actor and critic classes. Initial export. 4 years ago
Arthur Juliani ca887743 Support tf and pytorch alongside one another 4 years ago
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 years ago
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 years ago
PSankalp Patro 45c4ea36 Save checkpoint files as .nn files in checkpoint directory 4 years ago
GitHub 7229214c [cleanup] Remove unused param keys (#4067) 4 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
Arthur Juliani 9724c9ac Merge master 4 years ago
Jonathan Harper 80127232 Convert checkpoints to .nn format 4 years ago
Ervin Teng 510583d2 Move memory validation to settings 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
Ruo-Ping Dong e06812aa fix tests 4 years ago
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 years ago
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 years ago
Andrew Cohen a65d08c7 ghost trainer tests 4 years ago
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 years ago