439 次代码提交 (7006b5ff-59cf-4ebd-bc26-d7f40cfc3791)

作者 SHA1 备注 提交日期
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 年前
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 年前
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 年前
GitHub 832e4a47 Normalize observations when adding experiences (#2556) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub cb144f20 small mypy cleanup (#2637) 5 年前
Jonathan Harper 3fc14963 EXPERIMENTAL horovod support 5 年前
Jonathan Harper 47893e9c minor tweaks 5 年前
GitHub 8e931d8d Merge branch 'develop' into release-0.10.0 5 年前
Anupam Bhatnagar cc208c00 resolving conflicts 5 年前
Ervin Teng 35669d27 Fix SAC + LSTM Barracuda inference (#2698) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Ervin Teng 024e3677 small mypy cleanup (#2637) 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
Chris Elion 8da16bdb move compat functions 5 年前
GitHub f22c41db Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 5 年前
GitHub 9bac2771 Fix SAC + LSTM Barracuda inference (#2698) 5 年前
Chris Elion 254c7d86 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub b95c4d1d check for unecessary list comprehensions (#2707) 5 年前
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 年前
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
GitHub 495873e5 Merge pull request #2833 from Unity-Technologies/release-0.11.0 5 年前
Chris Elion 691d21e6 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub c6c01a03 Enable pylint and fix a few things (#2767) 5 年前
Jonathan Harper 8550679d Merge branch 'develop' into release-0.11.0 5 年前
GitHub 4da157fe more pylint fixes (#2842) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 年前
Ervin Teng 987e0e3a Merge tf2 branch 5 年前
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 年前
GitHub c0453ae1 Merge pull request #2912 from Unity-Technologies/develop-allbraininfo 5 年前
GitHub 99981937 fix errors from new flake8-comprehensions (#2917) 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Andrew Cohen e96b80db recieves brain_name and identifier on python side 5 年前
Ervin Teng 54644477 Merge branch 'develop' of github.com:Unity-Technologies/ml-agents into develop-nomaxstep-test 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
Ervin Teng e5459c49 buffer split for SAC 5 年前
Ervin Teng 3a4fa244 Switch to tanh squash in PPO 5 年前
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 年前
Andrew Cohen bd056007 recieves brain_name and identifier on python side 5 年前
GitHub d4780a55 Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 年前
GitHub 35c995e9 Merge pull request #3038 from Unity-Technologies/develop 5 年前
Ervin Teng 9c5fdd31 Stats reporting is working 5 年前
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 年前
Andrew Cohen 5097bcc0 recieves brain_name and identifier on python side 5 年前
Ervin Teng 76abf968 Add back max_step logic 5 年前
Ervin Teng 28eba789 Migrate SAC 5 年前
Andrew Cohen 8578b0b7 add_policy and create_policy separated 5 年前
Ervin Teng f2b3cd7f Remove dead code 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
Ervin Teng c9116ed2 Move some common logic to buffer class 5 年前
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 年前
GitHub 90db165f Add --namespace-packages to mypy for mlagents (#3075) 5 年前
GitHub 1fa07edb Remove Standalone Offline BC Training (#2969) 5 年前
Andrew Cohen 614d276f recieves brain_name and identifier on python side 5 年前
Andrew Cohen 96922f84 recieves brain_name and identifier on python side 5 年前
Chris Elion fdc810ff move (first pass) 5 年前
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 年前
Ervin Teng 27c2a55b Lots of test fixes 5 年前
Ervin Teng 97d66e71 Remove BootstrapExperience 5 年前
Ervin Teng 324d217b Move agent_id to Trajectory 5 年前
Ervin Teng 77ff4822 Add back next_obs 5 年前
Andrew Cohen d1edbf43 add_policy and create_policy separated 5 年前
Ervin Teng 2b811fc8 Properly report value estimates and episode length 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 年前
Andrew Cohen de902fbb passes all pytest and C# tests 5 年前
GitHub 2ac242f7 Remove TrainerMetrics and add CSVWriter using new StatsWriter API (#3108) 5 年前
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 年前
Ervin Teng 6242b67d Add way to check if trajectory is done or max_reached 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
GitHub c7da0139 Fix mypy errors in trainer code. (#3135) 5 年前
Andrew Cohen 082789ea Merge branch 'master' into develop-magic-string 5 年前
Andrew Cohen 6a4e7cf9 added ppo/sac_policy attributes to keep up with master 5 年前
GitHub e536c09c Remove unused tf.placeholder (#3138) 5 年前
Ervin Teng 1bd791e5 Merge branch 'master' into develop-agentprocessor 5 年前
Andrew Cohen 3e76adbd fixing more ci tests 5 年前
GitHub 7fbf6b1d add flake8-bugbear (#3137) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Ervin Teng db743971 Move private methods out of trainer, simplify interface 5 年前
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 年前
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 年前
Ervin Teng b3a4e641 Remove some vestigial code 5 年前
Ervin Teng 48793ec1 Fix test 5 年前
Ervin Teng 3d25f9d2 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub 5bc7531b Get step from policy (#3223) 5 年前
GitHub d985dded Merge branch 'master' into merge-release-0.13.0 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 5 年前
GitHub d52fb483 Merge pull request #3264 from Unity-Technologies/hotfix-0.13.1 5 年前
GitHub 329b23e0 Fix extra summary being written when loading from checkpoint (#3272) 5 年前
Ervin Teng 0ef40c08 SAC CC working 5 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng 28f7608f Clean up value head creation 5 年前
Ervin Teng b21b3d5c Use resamp policy for SAC 5 年前
Ervin Teng 1b6e175c Fix discrete SAC and clean up policy 5 年前
Ervin Teng a5caf4d6 Remove epsilon from everywhere 5 年前
Ervin Teng 8e300036 Add some typing to optimizer 5 年前
Ervin Teng edeceefd Zeroed version of LSTM working for PPO 5 年前
Ervin Teng 5ec49542 SAC LSTM isn't broken 5 年前
Ervin Teng cfc2f455 Fix BC and tests 5 年前
Ervin Teng 78671383 Move initialization call around 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
Ervin Teng 85249afc Fix SAC scoping 5 年前
GitHub dd86e879 Separate out optimizer creation and policy graph creation (#3355) 5 年前
Ervin Teng cdd57468 Re-fix scoping and add method to get all variables 5 年前
Ervin Teng dcbb90e1 Fix graph init in ghost trainer 5 年前
Ervin Teng 5f00782b Clean up some SAC LSTM 5 年前
Ervin Teng 328476d8 Move check for creation into nn_policy 5 年前
Ervin Teng ce110201 Add optional burn-in for SAC as well 5 年前
Ervin Teng cbfbff2c Split optimizer and TFOptimizer 5 年前
Ervin Teng 4d94e180 Move optimizer to common folder 5 年前
Ervin Teng ffdc41bb Removed floating constants 5 年前
Ervin Teng 7004604d Used NamedTuple for create normalization tensors 5 年前
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 年前
Ervin Teng 1cfc461a Remove and rename tf_optimizer 5 年前
Ervin Teng ff607162 Move learning rate reporting 5 年前
Ervin Teng c735e722 Make create critic methods private 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
Ervin Teng da6daebd Make create losses private 5 年前
Andrew Cohen 5b0aca29 Merge branch 'master' into soccer-fives 5 年前
Ervin Teng 14f2a7f2 Rename LearningModel to ModelUtils 5 年前
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 5 年前
Ervin Teng d57124b4 Merge 'master' into develop-removeactionholder 5 年前
Ervin Teng d6eb262c Rename resample to reparameterize 5 年前
Ervin Teng 23088088 Remove outdated comment 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
Anupam Bhatnagar e04fcd71 Merge branch 'master' into master-into-release-0.14.1 5 年前
GitHub 97a1d4b1 [change] Remove the action_holder placeholder from the policy. (#3492) 5 年前
Andrew Cohen de73baa9 Merge branch 'master' into soccer-fives 5 年前
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Andrew Cohen 573b1f6d Merge branch 'master' into soccer-fives 5 年前
GitHub cb153a0f [change] Change warning language when adversarial scene is used without self-play (#3561) 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
GitHub 86141eee Merge pull request #3560 from Unity-Technologies/new-logger 5 年前
GitHub e3af96ca Merge branch 'master' into develop-demo-load-seek 5 年前
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 年前
Chris Elion 7f2e815a Merge remote-tracking branch 'origin/master' into develop-sidechannel-usability 5 年前
Chris Elion fa5e7e6d Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public 5 年前
GitHub 873ba7fd [bug-fix] Fix stats reporting for reward signals in SAC (#3606) 5 年前
GitHub c42a11c3 [change] Throw a proper error when sequence length is greater than batch size. (#3583) 5 年前
GitHub 94de596b [change] Remove concatenate in discrete action probabilities to improve inference performance (#3598) 5 年前
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 5 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 年前
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 5 年前
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 5 年前
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 年前
Andrew Cohen eefc4811 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 9f09a65d team id centric ghost trainer 5 年前
Ervin Teng 293579dd Use steps_per_update to determine SAC train interval 5 年前
Ervin Teng 0fa2f4f7 Don't count buffer_init_steps 5 年前
Ervin Teng dbf8f7a5 Fix comment 5 年前
GitHub ff32035d Remove vis_encode_type from list of required (#3677) 5 年前
GitHub 141831da [bug-fix] Fix entropy computation for GaussianDistribution (#3684) 5 年前
Andrew Cohen 4c9ac553 Merge branch 'master' into self-play-mutex 5 年前
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 5 年前
Andrew Cohen cd677346 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Andrew Cohen 62c87031 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 年前
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 5 年前
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 5 年前
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 年前
Andrew Cohen 3de78baa wrapped trainer has internal policy ghost 5 年前
Ervin Teng b7151b51 Remove num_update as param 5 年前
Andrew Cohen 3013774b alternative to internal-policy fix 5 年前
Andrew Cohen a870d453 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub b841c9ab Wrapped trainer has internal policy in GhostTrainer 5 年前
Ervin Teng 8b52a2d0 Address comments in docs 5 年前
Ervin Teng 817aab95 Update steps_per_update documentation 5 年前
Andrew Cohen 930d6fa3 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Ervin Teng f29b17a9 Don't block one policy queue 5 年前
GitHub aae58330 Merge branch 'master' into develop-add-inference-examples 5 年前
Andrew Cohen b0c506a6 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 年前
Anupam Bhatnagar 9d7dd3b6 [skip ci] moving step increment to trainer from environment for sac 5 年前
Andrew Cohen de0656b6 Merge branch 'internal-policy-ghost' into soccer-2v1 5 年前
Andrew Cohen 85304aff Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 89db8428 Merge branch 'internal-policy-ghost-alternate' into soccer-2v1 5 年前
Andrew Cohen 26c0033c Merge branch 'soccer-2v1' into asymm-envs 5 年前
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 5 年前
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
Chris Elion 68b68396 Merge remote-tracking branch 'origin/master' into release_1_to_master 5 年前
GitHub 4641038e Renaming max_step to interrupted in TermialStep(s) (#3908) 5 年前
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 5 年前
Andrew Cohen a2f8319a Merge branch 'master' into asymm-envs 5 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Andrew Cohen 4a3ad193 Add constant decay to beta and epsilon 5 年前
GitHub c5b94ca6 Use LR schedule for beta and epsilon (#3940) 5 年前
Arthur Juliani 2b3a6347 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Andrew Cohen 704d0d11 add mede optimizer 4 年前
Andrew Cohen 88153b61 add mede opt with format 4 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub f7373172 Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 4 年前
vincentpierre 6ddfe74f Merge branch 'master' into develop-gym-wrapper 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub a7323393 [bug-fix] Fix issue with SAC updating too much on resume (#4038) 5 年前
GitHub 5cce69ae add "the the" to precommit spell check (#4059) 5 年前
Andrew Cohen e7750fc9 Merge branch 'master' into develop-sampler-refactor 4 年前
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 年前
Andrew Cohen c0f7052b Merge branch 'master' into develop-sampler-refactor 4 年前
Andrew Cohen 34ecc7e6 Merge branch 'master' into asymm-envs 5 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 年前
GitHub 8a49e8e0 [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) 4 年前
Anupam Bhatnagar 4afd8f92 first commit 4 年前
Anupam Bhatnagar f7a3c06e [skip ci] updating sac 4 年前
Anupam Bhatnagar a4567f27 [skip ci] restore process trajectory super calls 4 年前
Andrew Cohen 21f871db Merge branch 'develop-constant-decay' into asymm-envs 5 年前
Anupam Bhatnagar 26dc42e5 [skip ci] 4 年前
Anupam Bhatnagar 0aedad7c fixing should_still_train call in rl_trainer.py 4 年前
Anupam Bhatnagar 392a84f1 [skip ci] fixing property decorator in sac 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
Anupam Bhatnagar 24d5f881 first commit 4 年前
GitHub 45154f52 Pytorch port of SAC (#4219) 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 74c99ec8 [refactor] Refactor normalizers and encoders (#4275) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
Andrew Cohen f74d301a Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
Ruo-Ping Dong 01e60921 add sac checkpoint 4 年前
vincentpierre 599d7e9f Merging master 4 年前
GitHub 3a982317 [add-fire] Add learning rate and beta/epsilon decay to PyTorch (#4318) 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
Andrew Cohen bf8b2328 Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
Ervin Teng 37f986c8 Running LSTM for SAC 4 年前
HH 7afa1761 Merge branch 'master' into hh/develop/ragdoll-updates 4 年前
Ervin Teng 8ead82e2 Use correct half of memories 4 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
Ruo-Ping Dong 09a741c8 small improvement 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
Ruo-Ping Dong e06812aa fix tests 4 年前
HH 0fdac847 Merge branch 'master' into hh/develop/crawler-ragdoll-updates 4 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 年前
Arthur Juliani 6bee0fd1 Merge master 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 年前
GitHub 202db853 Remove unnecessary line (#4260) 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
GitHub beb5aca5 [refactor] Make classes except Optimizer framework agnostic (#4268) 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
Arthur Juliani 1a123641 Merge remote-tracking branch 'origin/master' into r5-master 4 年前
GitHub 3f44a0bc cleanup around AdamOptimizer (#4333) 4 年前
Andrew Cohen 598826fe Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
Anupam Bhatnagar a5cc4d03 Merge branch 'master' into global-variables 4 年前
Ruo-Ping Dong 523248be update 4 年前
GitHub f374f87a [add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324) 4 年前
Ervin Teng eeae6d97 Proper initialization and SAC masking 4 年前
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Ruo-Ping Dong 59cc1a9f Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 409a161c fix bc tests 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng 13f15086 Merge branch 'develop-add-fire' into develop-add-fire-amrl 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 年前
Ervin Teng 02d86902 Use zeros_like 4 年前
GitHub bd6bcd2f Merge master and add Saver class for save/load checkpoints 4 年前
GitHub 6de31a03 [add-fire] Fix masked mean for 2d tensors (#4364) 4 年前
Ervin Teng 5c1717d1 Bugfixes for continuous case 4 年前
Ervin Teng 42e25b25 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
Christopher Goy 5a233353 Merge remote-tracking branch 'origin/master' into release_6-to-master 4 年前
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 年前
Andrew Cohen 0053713a fix sac precommit 4 年前
Andrew Cohen e7c9ff35 clean up docstrings create policies 4 年前
Andrew Cohen 039ae17f capitalize Tensorflow 4 年前
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 年前
vincentpierre 9f51ab14 Saving the reward providers 4 年前
Ruo-Ping Dong c47ffc20 Rename saver 4 年前
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 年前
HH d9962254 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
GitHub ec8c24d8 add fire clean up docstrings in create policies (#4391) 4 年前
GitHub 328353bc Torch : Saving/Loading of the reward providers (#4405) 4 年前
vincentpierre 31750e97 Using item() in place of to_numpy() 4 年前
vincentpierre fdd343b2 more use of item() and additional tests 4 年前
Ruo-Ping Dong 88eff042 Merge branch 'master' into develop-saver-name 4 年前
GitHub 48f217b9 Rename Saver to ModelSaver (#4402) 4 年前
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 年前
GitHub 498934f9 Replace torch.detach().cpu().numpy() with a utils method (#4406) 4 年前
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 年前
GitHub bfda9576 Replace brain_name with behavior_name (#4419) 4 年前
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 年前
Ruo-Ping Dong ef3be79e sac 4 年前
GitHub 4e93cb6e [torch] Restructure PyTorch encoders (#4421) 4 年前
GitHub beb5eb30 [bug-fix] Fixes for Torch SAC and tests (#4408) 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
Ervin Teng 916eec4b Run backwards() of losses in threads 4 年前
Ervin Teng 9b797d61 Thread inference and not backprop 4 年前
Ervin Teng a305a41b Try futures in Optimizer 4 年前
Ervin Teng 228ea059 Try futures in Optimizer 4 年前
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 年前
Ervin Teng 3e771cbb Permute visual obs outside of network 4 年前
Ervin Teng 77c810fb Fix SAC and make utility method 4 年前
Ervin Teng 9088c07a Optimized SAC soft update 4 年前
Ervin Teng 7754ad7b Don't run value during inference 4 年前
Ervin Teng 5495b2b6 Works with continuous 4 年前
Ervin Teng 52efe509 Discrete and entrop coeff 4 年前
Ervin Teng d67b9f95 Remove comment 4 年前
GitHub 1f179527 Do not keep gradients on the q for the v backup (#4504) 4 年前
vincentpierre 181bdec0 - 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Ervin Teng f9ff3efe Merge branch 'develop-policyonly' into develop-sac-targetq 4 年前
GitHub 05fc088d [refactor] Don't compute grad for q2_p in SAC Optimizer (#4509) 4 年前
HH a3bf96fd Merge branch 'master' into hh/develop/gridsensor-tests 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
Ervin Teng 8dec4771 Add hybrid actions to SAC 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
Ervin Teng 81342148 Revert "Add hybrid actions to SAC" 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
GitHub dde34423 [bug-fix] Use proper masking for entropy and policy losses (#4572) 4 年前
GitHub a690af74 [refactor] Make PyTorch the default and TensorFlow optional (#4517) 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
Andrew Cohen dc89318d remove ActionType 4 年前
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 年前
Ruo-Ping Dong 9e08be87 Merge branch 'master' into release_9_branch_merge 4 年前
Andrew Cohen 97dfa142 fix action_spec refs 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
Ervin Teng 2fc23737 Manchausen RL 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
GitHub 23800f33 Merge branch 'master' into develop-action-spec 4 年前
GitHub 8175d558 [bug-fix] Fix BC module + action clipping (#4667) 4 年前
Ruo-Ping Dong ee5313e4 Merge branch 'master' into develop-windows-delay 4 年前
GitHub f0ed3a38 Cherry-pick BC fixes to Release 10 (#4668) 4 年前
vincentpierre b863af57 Removing TensorFlow Trainers 4 年前
Ervin Teng 6c77ac7a Update SAC, fix PPO batching 4 年前
GitHub 278911a5 Fix staging tests (#4708) 4 年前
Ervin Teng 1db21cbb Fix SAC interrupted condition and typing 4 年前
Ervin Teng 6e6a6b2b Fix SAC interrupted again 4 年前
vincentpierre 713e65fb removing tensorflow testing for pytest and yamato 4 年前
vincentpierre 2dd34aa5 Formatting 4 年前
vincentpierre 8f9634c2 Fxing test 4 年前
Ervin Teng fdaa8c3d Merge branch 'develop-unified-obs' into develop-centralizedcritic 4 年前
Andrew Cohen 056630d7 sac continuous and discrete train 4 年前
GitHub 990f801a Develop hybrid action staging (#4702) 4 年前
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
vincentpierre 7a5cc9ec Merge master into develop-rm-tf 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 年前
Andrew Cohen 73b778cc rename extract to from_dict 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
GitHub 22658a40 use sensor types to differentiate obs (#4749) 4 年前
Andrew Cohen 3c65b964 fixed recurrent prev_action issue 4 年前
GitHub 903d3afe Merge pull request #4707 from Unity-Technologies/develop-rm-tf 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub 29d94c7c Merge pull request #4734 from Unity-Technologies/develop-obs-as-list 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen c0d01baf Merge branch 'master' into merge-release11-master 4 年前
vincentpierre 44ed3258 Merging master 4 年前
Andrew Cohen 3457cd3c save only discrete actions as prev 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 17496265 move AgentAction, ActionLogProbs, and ActionFlattener to separate files 4 年前
Chris Elion 76ebc20c Merge remote-tracking branch 'origin/master' into r12-to-master 4 年前
GitHub 458fee17 Merge pull request #4763 from Unity-Technologies/develop-att 4 年前
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 年前
vincentpierre 519c5f47 merging master 4 年前
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
GitHub 7387a77f remove pylint (#4836) 4 年前
Andrew Cohen 1bc2ff96 add weight decay to trainers 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Ervin Teng aba633b2 Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 年前
Arthur Juliani 0a876b9c Fix typos 4 年前
Andrew Cohen ff324d0c fixed sac recurrent tf simple rl 4 年前
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 年前
HH 0024a286 merge ervin's new stuff 4 年前
GitHub 12e1fc28 [feature] Hybrid SAC (#4574) 4 年前
Andrew Cohen 7af25330 fixed torch test sac 4 年前
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 年前
vincentpierre 8660b1c2 merging master 4 年前
brccabral 457fb612 Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents 4 年前
vincentpierre 115e944b adding weight decay for experimentation 4 年前
vincentpierre 9fbc2e0e _ 4 年前
vincentpierre bf16bad6 _ 4 年前
GitHub 64fc7f43 Buffer key enums (#4907) 4 年前
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng 1831044a Update SAC to use separate policy 4 年前
Andrew Cohen 8efdeeb0 make critic a property 4 年前
Ervin Teng c675393c Move value network for SAC to device 4 年前
Andrew Cohen c74dca9f add SharedActorCritic 4 年前
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 4 年前
Andrew Cohen 00b891df fix sac shared 4 年前
Ervin Teng ae7643b8 Proper critic memories for PPO 4 年前
vincentpierre e1b94b8b Merge branch 'master' into develop-var-len-obs-feature 4 年前
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng fd3f05b9 Enable GAIL to decay 4 年前
Ervin Teng bb452ffd Fix SAC 4 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 年前
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 4 年前
GitHub 338af2ec Move the Critic into the Optimizer (#4939) 4 年前
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 4 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 4 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 4 年前
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 4 年前
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 4 年前
GitHub fc5d0a3f [bug-fix] Fix save/restore critic, add test (#5062) 4 年前
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 4 年前
Ervin Teng 1f026c70 Merge branch 'main' into develop-superpush-branch-cleanup 4 年前
Ervin Teng ce872033 Revert "Merge branch 'main' into develop-superpush-branch-cleanup" 4 年前
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 4 年前
GitHub 62314056 Fix ghost curriculum and make steps private (#5098) 4 年前
Ervin Teng 54ffbed6 [cherry-pick] Fix ghost curriculum and make steps private (#5098) 4 年前
Andrew Cohen 9176247c Merge branch 'main' into develop-soccer-groupman-mod 4 年前
GitHub e81e038b Fix end episode for POCA, add warning for group reward if not POCA (#5113) 4 年前
GitHub 63169e2c [cherry-pick] Fix group rewards for POCA, add warning for non-POCA trainers (#5120) 4 年前
Ervin Teng d1c24251 [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 年前
Ervin Teng 9e2e2626 [bug-fix] Use correct memories for LSTM SAC (#5228) 4 年前
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 4 年前
GitHub ff21216d [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 年前
GitHub 2e19759c Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) 4 年前
GitHub 6d1b3a64 [bug-fix] Use correct memories for LSTM SAC (#5228) 4 年前
vincentpierre bab3ecb7 First version of MEDE, crawler does not seem to work properly, I suspect the actions make it distinguishable to the discriminator but not to the human eye 4 年前
Andrew Cohen d813bfd5 continuous, crawler integrated, new cube 4 年前
vincentpierre 8da21669 Adding some changes 4 年前
Andrew Cohen 3e642140 use discrete div 4 年前
Andrew Cohen bcee3bf5 no entropy loss 4 年前
vincentpierre 7c74c967 _ 4 年前
vincentpierre b4f30613 Adding a variational version 4 年前
vincentpierre 8450b154 - 4 年前
vincentpierre 5985959d Got 2 modes on Wlker I think 4 年前
vincentpierre 4bde393e Got the walker to walk different based on diversity setting 4 年前
GitHub fc6e8c35 [🐛🔨 ] Fix sac target for continuous actions (#5372) 4 年前
vincentpierre 8cdbc17f modifying SAC to make \alpha converge faster 4 年前
vincentpierre 983982ee Removing misleading learning rate 3 年前