186 commits (da6d25c9-4e76-454f-81d0-1258ba68390b)

Author SHA1 Message Commit date
GitHub 14193ada Self-play for symmetric games (#3194) 5 years ago
Ervin Teng 48b39b80 Fix ghost trainer and all tests 4 years ago
Ervin Teng dcbb90e1 Fix graph init in ghost trainer 4 years ago
GitHub 34792205 adding init to ghost trainer directory (#3381) 4 years ago
GitHub 25c41f83 adding init to ghost trainer directory (#3381) (#3382) 4 years ago
Anupam Bhatnagar c70d0243 [bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451) 4 years ago
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 4 years ago
GitHub 6876a1d6 [bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451) 4 years ago
Andrew Cohen e4d776c3 Merge branch 'master' into soccer-fives 4 years ago
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 4 years ago
GitHub 472f9f0e Merge branch 'master' into develop-badEnvReturnCode 4 years ago
Ervin Teng 88998fc9 Add add_policy docstrings 4 years ago
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 4 years ago
Andrew Cohen 5b0aca29 Merge branch 'master' into soccer-fives 4 years ago
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 4 years ago
Anupam Bhatnagar e04fcd71 Merge branch 'master' into master-into-release-0.14.1 4 years ago
Andrew Cohen bd78ec40 self-play assym hacked branch 4 years ago
Andrew Cohen 8fe1a27d fixed save_snapshot 4 years ago
Andrew Cohen 30725c27 2v1 soccer config and env 4 years ago
Andrew Cohen 94654de4 ghost controller 4 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 4 years ago
Andrew Cohen 573b1f6d Merge branch 'master' into soccer-fives 4 years ago
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 4 years ago
GitHub 86141eee Merge pull request #3560 from Unity-Technologies/new-logger 4 years ago
GitHub e3af96ca Merge branch 'master' into develop-demo-load-seek 4 years ago
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 4 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 4 years ago
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 4 years ago
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 4 years ago
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 4 years ago
Andrew Cohen eefc4811 Merge branch 'master' into self-play-mutex 4 years ago
Andrew Cohen 9f09a65d team id centric ghost trainer 4 years ago
Andrew Cohen 79076b70 ELO calculation done in ghost controller 4 years ago
Andrew Cohen 03b40795 removed opponent elo from stat collection 4 years ago
Andrew Cohen 579bbd88 passing all tests locally 4 years ago
Andrew Cohen 66b505c3 fixed controller behavior when first team discovered isnt 0 4 years ago
Andrew Cohen 1a6e99bb save step on trainer step count/swap on ghost 4 years ago
Andrew Cohen 072b4135 soccer 2v1 on the cloud 4 years ago
Andrew Cohen 1269b555 docstrings/ghost_swap -> team_change 4 years ago
Andrew Cohen b42c9482 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen b15a8b75 docstrings for all ghost trainer functions 4 years ago
Andrew Cohen cbba8f52 SELF-PLAY NOW SUPPORTS MULTIAGENT TRAINERS 4 years ago
Andrew Cohen 31ef5a84 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 81a141c0 next learning team from get step 4 years ago
Andrew Cohen 15770bec comment for self.ghost_step 4 years ago
Andrew Cohen 80fd858a ghost->get_step 4 years ago
Andrew Cohen 052a24a0 fixed export so both teams have current model 4 years ago
Andrew Cohen a13f107f updated self-play doc for asymmetric games/changed current_self->current_best 4 years ago
Andrew Cohen 88b8a922 count trainer steps in controller by team id 4 years ago
Andrew Cohen c05d6c49 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 60ea278d added team_change as a yaml config 4 years ago
Andrew Cohen bc611906 removed team-change CLI 4 years ago
Andrew Cohen 42518d84 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 19552661 added team_change as a yaml config 4 years ago
Andrew Cohen 650ec121 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 74d37a11 removed not max step reached as condition for ELO 4 years ago
Andrew Cohen 0d460514 warning for team change hyperparam 4 years ago
Andrew Cohen aa18bef6 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 93d344ff simple rl asymm ghost tests 4 years ago
Andrew Cohen c60d0c5a renamed controller methods/doc fixes 4 years ago
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 4 years ago
Andrew Cohen 345fa382 current_best_ratio -> latest_model_ratio 4 years ago
Andrew Cohen c7a34413 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 4 years ago
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 4 years ago
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 4 years ago
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 4 years ago
Andrew Cohen 335e70ea using mlagents_env.logging instead of logging 4 years ago
GitHub 9c8142c2 Fix save snapshot bug in ghost trainer (#3722) 4 years ago
Andrew Cohen 3de78baa wrapped trainer has internal policy ghost 4 years ago
Andrew Cohen b9179f0f fixed order of load weight/create tf graph in add_policy 4 years ago
Andrew Cohen 3013774b alternative to internal-policy fix 4 years ago
Ervin Teng ed06f37c Ability to disable threading 4 years ago
Ervin Teng 971e4b2d Don't block when disabling threading 4 years ago
Andrew Cohen 573f80cd added to mig doc/address comments 4 years ago
Andrew Cohen 189b4765 remove incorrect docstring 4 years ago
Ervin Teng d1895272 Fix ghost trainer locking up 4 years ago
Andrew Cohen 3a1912c1 raise warning when latest_model_ratio not btwn 0, 1 4 years ago
GitHub b841c9ab Wrapped trainer has internal policy in GhostTrainer 4 years ago
Andrew Cohen 930d6fa3 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen fc732b29 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
GitHub aae58330 Merge branch 'master' into develop-add-inference-examples 4 years ago
Andrew Cohen b0c506a6 Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Andrew Cohen c07e0fce Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 4 years ago
Andrew Cohen c79f9f02 Merge branch 'self-play-mutex' into soccer-2v1 4 years ago
Andrew Cohen ed1bda98 Merge branch 'master' into soccer-2v1 4 years ago
Ervin Teng d1fed8ae Remove empty_queue interface 4 years ago
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 4 years ago
Andrew Cohen 413633dc Merge branch 'master' into soccer-2v1 4 years ago
Andrew Cohen 02d26c3f Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Andrew Cohen de0656b6 Merge branch 'internal-policy-ghost' into soccer-2v1 4 years ago
Andrew Cohen 4bc36520 Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Andrew Cohen a3383ee9 Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Andrew Cohen 85304aff Merge branch 'soccer-2v1' into asymm-envs 4 years ago
Andrew Cohen 89db8428 Merge branch 'internal-policy-ghost-alternate' into soccer-2v1 4 years ago
Andrew Cohen 26c0033c Merge branch 'soccer-2v1' into asymm-envs 4 years ago
GitHub 2e939d50 Clean up and fix save and load in ghost (#3797) 4 years ago
Arthur Juliani 3769d943 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 years ago
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 4 years ago
Ervin Teng 9cd2c034 Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-sac-apex 4 years ago
Andrew Cohen ddb6787c hard reset when team changes 4 years ago
GitHub 4092d937 [Bug fix] Hard reset when team changes (#3870) 4 years ago
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 4 years ago
GitHub d8b93f8f [Bug fix] Hard reset when team changes (#3870) (#3899) 4 years ago
Andrew Cohen 9d5d6fa7 Merge branch 'master' into asymm-envs 4 years ago
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 4 years ago
Andrew Cohen a2f8319a Merge branch 'master' into asymm-envs 4 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
GitHub f7373172 Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 4 years ago
Ruo-Ping Dong 2ca79207 [bug-fix] Don't load non-wrapped policy (#4593) 4 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 years ago
GitHub 5cce69ae add "the the" to precommit spell check (#4059) 4 years ago
Andrew Cohen e7750fc9 Merge branch 'master' into develop-sampler-refactor 4 years ago
Andrew Cohen e0aa5cee Merge branch 'develop-team-change-reset' into asymm-envs 4 years ago
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 years ago
Andrew Cohen 22786526 Merge branch 'master' into asymm-envs 4 years ago
Andrew Cohen c0f7052b Merge branch 'master' into develop-sampler-refactor 4 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
GitHub 8a49e8e0 [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) 4 years ago
Arthur Juliani 9724c9ac Merge master 4 years ago
Jonathan Harper 80127232 Convert checkpoints to .nn format 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
vincentpierre 599d7e9f Merging master 4 years ago
HH 7afa1761 Merge branch 'master' into hh/develop/ragdoll-updates 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
Ruo-Ping Dong e06812aa fix tests 4 years ago
HH 0fdac847 Merge branch 'master' into hh/develop/crawler-ragdoll-updates 4 years ago
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 years ago
Arthur Juliani 6bee0fd1 Merge master 4 years ago
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 years ago
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 years ago
GitHub 2c64d623 don't try/except for control flow (#4251) 4 years ago
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 years ago
GitHub beb5aca5 [refactor] Make classes except Optimizer framework agnostic (#4268) 4 years ago
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 years ago
Arthur Juliani 1a123641 Merge remote-tracking branch 'origin/master' into r5-master 4 years ago
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 years ago
Ruo-Ping Dong 523248be update 4 years ago
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 years ago
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 years ago
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 years ago
GitHub bd6bcd2f Merge master and add Saver class for save/load checkpoints 4 years ago
Ervin Teng 42e25b25 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 years ago
Christopher Goy 5a233353 Merge remote-tracking branch 'origin/master' into release_6-to-master 4 years ago
Andrew Cohen a65d08c7 ghost trainer tests 4 years ago
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 years ago
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 years ago
Ruo-Ping Dong c47ffc20 Rename saver 4 years ago
GitHub 48f217b9 Rename Saver to ModelSaver (#4402) 4 years ago
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 years ago
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 years ago
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 years ago
GitHub b3bc7896 Cherrypick bug fixes to release_9_branch (#4617) 4 years ago
GitHub b5dd43f2 [bug-fix] Don't load non-wrapped policy (#4593) 4 years ago
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 years ago
GitHub 23800f33 Merge branch 'master' into develop-action-spec 4 years ago
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 years ago
Ervin Teng 7087b7b3 Add cc to ghost trainer 4 years ago
Ervin Teng 80598c48 Actually add comment to ghosttrainer 4 years ago
Andrew Cohen c72e00c9 fix multiple policy issue 4 years ago
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 years ago
Ervin Teng 4893f4b2 Fix team ELOs 3 years ago
Ervin Teng 05db051e Remove some unneeded changes 3 years ago
GitHub 5022d710 Add additional logic to avoid load being called on every advance (#4934) 3 years ago
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 3 years ago
Ervin Teng 219e773b Merge branch 'develop-fix-lstms' into develop-critic-op-lstm 3 years ago
vincentpierre e1b94b8b Merge branch 'master' into develop-var-len-obs-feature 3 years ago
Andrew Cohen dc8e8494 Merge branch 'master' into develop-critic-optimizer 3 years ago
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 3 years ago
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 3 years ago
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 3 years ago
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 3 years ago
Ervin Teng 08db7c2f Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm 3 years ago
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 3 years ago
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 3 years ago
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 3 years ago
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 3 years ago
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 3 years ago
GitHub 8f35bdd3 POCA trainer (#5005) 3 years ago
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 3 years ago
GitHub 39f8b6ac add group done to ELO computation (#5150) 3 years ago
GitHub 88ef8f25 R15 fix elo (#5151) 3 years ago
Ervin Teng e1c23ad7 [🐛 🔨 ]Adding the ELO to the GlobalTrainingStatus (#5202) 3 years ago
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 3 years ago
GitHub 640b2e00 [🐛 🔨 ]Adding the ELO to the GlobalTrainingStatus (#5202) 3 years ago