90 commits (9695b89a-39ea-4843-bafc-7b6ba4027929)

Author SHA1 Message Date
GitHub 14193ada Self-play for symmetric games (#3194) 5 years ago
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 years ago
Ervin Teng dcbb90e1 Fix graph init in ghost trainer 5 years ago
Anupam Bhatnagar c70d0243 [bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451) 5 years ago
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 5 years ago
Ervin Teng 88998fc9 Add add_policy docstrings 5 years ago
Andrew Cohen bd78ec40 self-play assym hacked branch 5 years ago
Andrew Cohen 8fe1a27d fixed save_snapshot 5 years ago
Andrew Cohen 30725c27 2v1 soccer config and env 5 years ago
Andrew Cohen 94654de4 ghost controller 5 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 years ago
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 years ago
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 5 years ago
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 years ago
Andrew Cohen eefc4811 Merge branch 'master' into self-play-mutex 5 years ago
Andrew Cohen 9f09a65d team id centric ghost trainer 5 years ago
Andrew Cohen 79076b70 ELO calculation done in ghost controller 5 years ago
Andrew Cohen 03b40795 removed opponent elo from stat collection 5 years ago
Andrew Cohen 579bbd88 passing all tests locally 5 years ago
Andrew Cohen 66b505c3 fixed controller behavior when first team discovered isnt 0 5 years ago
Andrew Cohen 1a6e99bb save step on trainer step count/swap on ghost 5 years ago
Andrew Cohen 072b4135 soccer 2v1 on the cloud 5 years ago
Andrew Cohen b15a8b75 docstrings for all ghost trainer functions 5 years ago
Andrew Cohen cbba8f52 SELF-PLAY NOW SUPPORTS MULTIAGENT TRAINERS 5 years ago
Andrew Cohen 31ef5a84 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 81a141c0 next learning team from get step 5 years ago
Andrew Cohen 15770bec comment for self.ghost_step 5 years ago
Andrew Cohen 80fd858a ghost->get_step 5 years ago
Andrew Cohen 052a24a0 fixed export so both teams have current model 5 years ago
Andrew Cohen a13f107f updated self-play doc for asymmetric games/changed current_self->current_best 5 years ago
Andrew Cohen c05d6c49 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 60ea278d added team_change as a yaml config 5 years ago
Andrew Cohen bc611906 removed team-change CLI 5 years ago
Andrew Cohen 42518d84 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 19552661 added team_change as a yaml config 5 years ago
Andrew Cohen 650ec121 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 74d37a11 removed not max step reached as condition for ELO 5 years ago
Andrew Cohen 0d460514 warning for team change hyperparam 5 years ago
Andrew Cohen aa18bef6 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 93d344ff simple rl asymm ghost tests 5 years ago
Andrew Cohen c60d0c5a renamed controller methods/doc fixes 5 years ago
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 5 years ago
Andrew Cohen 345fa382 current_best_ratio -> latest_model_ratio 5 years ago
Andrew Cohen c7a34413 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 years ago
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 5 years ago
GitHub 9c8142c2 Fix save snapshot bug in ghost trainer (#3722) 5 years ago
Andrew Cohen 3de78baa wrapped trainer has internal policy ghost 5 years ago
Andrew Cohen b9179f0f fixed order of load weight/create tf graph in add_policy 5 years ago
Andrew Cohen 3013774b alternative to internal-policy fix 5 years ago
Ervin Teng ed06f37c Ability to disable threading 5 years ago
Ervin Teng 971e4b2d Don't block when disabling threading 5 years ago
Andrew Cohen 189b4765 remove incorrect docstring 5 years ago
Ervin Teng d1895272 Fix ghost trainer locking up 5 years ago
Andrew Cohen 930d6fa3 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 years ago
Andrew Cohen c79f9f02 Merge branch 'self-play-mutex' into soccer-2v1 5 years ago
Ervin Teng d1fed8ae Remove empty_queue interface 5 years ago
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 5 years ago
GitHub 2e939d50 Clean up and fix save and load in ghost (#3797) 5 years ago
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 5 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
Ruo-Ping Dong 2ca79207 [bug-fix] Don't load non-wrapped policy (#4593) 4 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 years ago
GitHub 5cce69ae add "the the" to precommit spell check (#4059) 4 years ago
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
Arthur Juliani 9724c9ac Merge master 4 years ago
Jonathan Harper 80127232 Convert checkpoints to .nn format 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
Ruo-Ping Dong e06812aa fix tests 4 years ago
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 years ago
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 years ago
GitHub 2c64d623 don't try/except for control flow (#4251) 4 years ago
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 years ago
Ruo-Ping Dong 523248be update 4 years ago
Andrew Cohen a65d08c7 ghost trainer tests 4 years ago
Ruo-Ping Dong c47ffc20 Rename saver 4 years ago
GitHub b3bc7896 Cherrypick bug fixes to release_9_branch (#4617) 4 years ago
Ervin Teng 7087b7b3 Add cc to ghost trainer 4 years ago
Ervin Teng 80598c48 Actually add comment to ghosttrainer 4 years ago
Andrew Cohen c72e00c9 fix multiple policy issue 4 years ago
Ervin Teng 4893f4b2 Fix team ELOs 4 years ago
GitHub 5022d710 Add additional logic to avoid load being called on every advance (#4934) 4 years ago
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 years ago
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 years ago
GitHub 39f8b6ac add group done to ELO computation (#5150) 4 years ago
Ervin Teng e1c23ad7 [🐛 🔨 ]Adding the ELO to the GlobalTrainingStatus (#5202) 4 years ago