507 commits (e2e62cb9-9803-4bd9-886a-b237e504173f)

Author SHA1 Message Commit date
Ervin Teng fe4472cb Add decoders, distributions, encoders, layers, networks, and utils 4 years ago
Ervin Teng 89890bf2 Update with newest changes 4 years ago
GitHub e3bc3352 [pytorch] Add decoders, distributions, encoders, layers, networks, and utils (#4349) 4 years ago
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 years ago
vincentpierre 9f51ab14 Saving the reward providers 4 years ago
vincentpierre 25454a48 adding tests 4 years ago
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 years ago
GitHub 328353bc Torch : Saving/Loading of the reward providers (#4405) 4 years ago
vincentpierre 31750e97 Using item() in place of to_numpy() 4 years ago
Andrew Cohen 6587c911 hybrid bheavior spec 4 years ago
Ruo-Ping Dong 88eff042 Merge branch 'master' into develop-saver-name 4 years ago
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 years ago
GitHub 12e15e29 Fix on GAIL Torch when using actions (#4407) 4 years ago
GitHub 498934f9 Replace torch.detach().cpu().numpy() with a utils method (#4406) 4 years ago
Andrew Cohen 5b1620e9 moved encoders to processors 4 years ago
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 years ago
Ervin Teng 1dca75d8 Move linear encoding to NetworkBody 4 years ago
GitHub bfda9576 Replace brain_name with behavior_name (#4419) 4 years ago
GitHub 9364048e Merge branch 'develop-pytorch-encoding' into develop-restructure-torch-networks 4 years ago
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 years ago
Ruo-Ping Dong f5dee9d1 jit for continuous control 4 years ago
Ruo-Ping Dong ef3be79e sac 4 years ago
GitHub 7b4d0865 [Bug fix] Fix bug in GAIL gradient penalty (#4425) 4 years ago
GitHub 4e93cb6e [torch] Restructure PyTorch encoders (#4421) 4 years ago
GitHub beb5eb30 [bug-fix] Fixes for Torch SAC and tests (#4408) 4 years ago
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 years ago
GitHub 676f5f7c [refactor] Refactor GAIL to use new encoder structure (#4433) 4 years ago
Ervin Teng 60eacc0d Merge branch 'master' into develop-adjust-cpu-settings 4 years ago
GitHub 6986fb10 use LinearEncoder in curiosity and clean up (#4444) 4 years ago
Ervin Teng fdc887a1 Some experimental stuff 4 years ago
GitHub bf6506fc [feature] Add small CNN for grids 5x5 and up (#4434) 4 years ago
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 years ago
GitHub 069f10b9 [Bug fix] Export all branches for discrete control torch (#4491) 4 years ago
Ervin Teng 14a7e29b Add AMRL layer 4 years ago
Andrew Cohen 8294bc59 ignore commit 4 years ago
GitHub fc557903 Develop fix visual torchh export (#4494) 4 years ago
Ervin Teng 3e771cbb Permute visual obs outside of network 4 years ago
Ervin Teng 9f96a495 Use built-in cumulative max 4 years ago
Andrew Cohen b0b2e22e prepare to merge action_out 4 years ago
Ervin Teng 77c810fb Fix SAC and make utility method 4 years ago
Ervin Teng 43c41d66 Fix BC and Reward Signals 4 years ago
Ervin Teng 7754ad7b Don't run value during inference 4 years ago
Ervin Teng 3f7d815c detach masks 4 years ago
GitHub 10f3e1c7 Cherry picked #4491 (#4493) 4 years ago
GitHub 37645aa2 Develop fix visual torchh export (#4494) (#4497) 4 years ago
Andrew Cohen 85602279 add action_out to dist 4 years ago
Ervin Teng b6095151 Execute critic with LSTM 4 years ago
vincentpierre 181bdec0 - 4 years ago
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 years ago
Andrew Cohen f0a2c6f2 add action_out abstract method and docstring 4 years ago
Ervin Teng f9ff3efe Merge branch 'develop-policyonly' into develop-sac-targetq 4 years ago
Andrew Cohen 7c0aa77b Merge branch 'develop-actions-out' into develop-hybrid-actions 4 years ago
GitHub 60b76790 Random Network Distillation for Torch (#4473) 4 years ago
Andrew Cohen b6b453a0 rename to exported model output 4 years ago
Andrew Cohen 9cbf6c3f Merge branch 'develop-actions-out' into develop-hybrid-actions 4 years ago
Andrew Cohen 72cd0d39 separate tensors for disc/cont 4 years ago
Andrew Cohen 643c8e58 ppo extended 4 years ago
Ervin Teng ba29b6b4 Change dimensions of recurrent in to BxN for ONNX 4 years ago
GitHub 400e14cb [Bug-fix] RND would not be saved correctly. Added tests (#4514) 4 years ago
Andrew Cohen db37db34 fixing errors 4 years ago
Andrew Cohen 44c9879e action models 4 years ago
HH a3bf96fd Merge branch 'master' into hh/develop/gridsensor-tests 4 years ago
Andrew Cohen c494bfcc trains successfully 4 years ago
Ervin Teng e8431a6d Proper dimensions for entropy, sum before bonus in PPO 4 years ago
Andrew Cohen 190d8e4d action model as a singleton 4 years ago
Ervin Teng 8dec4771 Add hybrid actions to SAC 4 years ago
Andrew Cohen d6544353 using lists for distributions 4 years ago
Andrew Cohen e686a785 removed abstract class 4 years ago
Ervin Teng 81342148 Revert "Add hybrid actions to SAC" 4 years ago
Andrew Cohen 74553853 use ModuleList 4 years ago
Andrew Cohen aa5cf8ab rename ActionModel 4 years ago
Andrew Cohen 4b9a7db6 remove old behaviorspec 4 years ago
Andrew Cohen 5a3f5b4a remove comments 4 years ago
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 years ago
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 years ago
Andrew Cohen eaecb59e torch utils to and from buffer 4 years ago
Andrew Cohen 6e23bafd ActionFlattener Refactor 4 years ago
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 years ago
GitHub e0ef30a5 [bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538) 4 years ago
GitHub cb8e4d25 Add ActionSpec (#4586) 4 years ago
Andrew Cohen b40e7793 fix mlagents-envs tests 4 years ago
GitHub a4ba90ec [bug-fix] Fix ONNX export/Barracuda import for continuous actions (#4608) 4 years ago
Andrew Cohen 9689cf2c remove *_action_* from function names 4 years ago
GitHub 64e998a2 [bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610) 4 years ago
vincentpierre 96452986 Initial commit for multi head attention 4 years ago
Andrew Cohen dc89318d remove ActionType 4 years ago
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 years ago
Ruo-Ping Dong 9e08be87 Merge branch 'master' into release_9_branch_merge 4 years ago
Ervin Teng 99ec16e6 Hard Swish 4 years ago
Andrew Cohen 6cf54bf2 remove self.action_spec from policy/bc 4 years ago
vincentpierre d3d4eb90 Trainer with attention 4 years ago
Ervin Teng 5d3ad161 Leaky ReLU 4 years ago
Andrew Cohen 97dfa142 fix action_spec refs 4 years ago
vincentpierre 7ef3c9a1 Trainer with attention 4 years ago
Ervin Teng f00ff21d Add clipping to Torch 4 years ago
GitHub b853e5ba Action buffer (#4612) 4 years ago
vincentpierre b09dfd0a Still not working 4 years ago
Ervin Teng 7e6cc222 Take mean of continuous entropy 4 years ago
Ervin Teng e80d418b Use lower scaling value 4 years ago
Ervin Teng 89489ae0 Invert divide by 3 in log prob 4 years ago
GitHub 3c96a3a2 Action Model (#4580) 4 years ago
Andrew Cohen 0e28dd8f add static method to create continuous/discrete 4 years ago
vincentpierre c264b4da additional changes 4 years ago
Ervin Teng 7502a383 Proper dimensions 4 years ago
Ervin Teng c3cec801 Use linear gain for KaimingHe 4 years ago
Ervin Teng 1f305f24 Test commit 4 years ago
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 years ago
Ervin Teng 48a11a0e Undo test commit 4 years ago
Ervin Teng 66f3d2ad Merge branch 'develop-contentropy' into develop-torch-clip 4 years ago
Andrew Cohen ae920478 resolve conflicts 4 years ago
vincentpierre 9b3d4ade Fixing some issues 4 years ago
GitHub 87a7ccf8 use int64 steps, check for NaN actions (#4607) 4 years ago
Ervin Teng db6ebff6 Remove clip 4 years ago
GitHub 23800f33 Merge branch 'master' into develop-action-spec 4 years ago
GitHub 85a7c0f7 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) 4 years ago
Ervin Teng 08c8862e Add epsilon to log 4 years ago
Ervin Teng 0548057d Use real clipping (as in TF) 4 years ago
Ervin Teng 0cdb2040 Use tanh squash 4 years ago
Andrew Cohen 719d94e9 fix entropy_sum after merge 4 years ago
vincentpierre 6fcbba53 Refactoring the code to make it more flexible. Still a hack 4 years ago
Ervin Teng eb4f3065 Add another epsilon 4 years ago
GitHub 733bffbf use int64 steps, check for NaN actions (#4607) (#4654) 4 years ago
Ervin Teng 23b42dcd Merge branch 'develop-torchcrawlerdebug' into develop-torch-clip 4 years ago
vincentpierre 58f38662 Bigger network, needed to solve 4 years ago
GitHub efdb1042 Add global lock for torch.onnx.export() (#4659) 4 years ago
Ervin Teng adc039a5 Use linear gain for KaimingHe 4 years ago
vincentpierre 0b6c2ed3 Fixing some bugs 4 years ago
GitHub 8175d558 [bug-fix] Fix BC module + action clipping (#4667) 4 years ago
GitHub 2a8c6800 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) (#4662) 4 years ago
Ervin Teng 3b3b53e2 Improve comment 4 years ago
Andrew Cohen c74adb81 fix specs in torch util 4 years ago
Andrew Cohen 84cc2b84 concat x self before attention 4 years ago
vincentpierre e14e1c4d Improvements and new tests 4 years ago
GitHub de27d7a6 [bug-fix] Fix issue where NaNs are outputted by the policy when training Match3 (#4664) 4 years ago
Ruo-Ping Dong 953cb6bb Merge branch 'master' into develop-windows-delay 4 years ago
GitHub 8df04e83 Add global lock for torch.onnx.export() (#4665) 4 years ago
Andrew Cohen d62f6b0a modify bullet/attn 4 years ago
GitHub 1cac78e2 [Bug-fix] Remove extra directory created at checkpoint (#4675) 4 years ago
Ervin Teng bc746839 Normalize GAIL observations 4 years ago
Ruo-Ping Dong ee5313e4 Merge branch 'master' into develop-windows-delay 4 years ago
Ervin Teng 78f88c15 Add clip to export and make optional in policy 4 years ago
vincentpierre f283cb60 different architecture 4 years ago
Ervin Teng 362f2ec0 Use correct dimensions of gradient 4 years ago
GitHub f0ed3a38 Cherry-pick BC fixes to Release 10 (#4668) 4 years ago
GitHub b6c903c8 Cherry-pick NaN fix for Match 3 (#4664) 4 years ago
Ervin Teng 3eba7423 Increase initialization 4 years ago
GitHub fd31da37 [Bug-fix] Remove extra directory created at checkpoint (#4675) (#4678) 4 years ago
Ervin Teng 4f8f21e4 Bigger scale 4 years ago
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 years ago
Ervin Teng 7a0ebfbd Pretty broken 4 years ago
Ervin Teng 95bdbba3 Less broken PPO 4 years ago
Ervin Teng 6c77ac7a Update SAC, fix PPO batching 4 years ago
GitHub 278911a5 Fix staging tests (#4708) 4 years ago
Ervin Teng 1db21cbb Fix SAC interrupted condition and typing 4 years ago
GitHub 94c59e31 C# changes for hybrid action spaces (#4587) 4 years ago
Ervin Teng 4158629e Properly feed in None rather than empty arrays 4 years ago
Andrew Cohen bd917c9c action buffer passes continuous 4 years ago
vincentpierre 90da7426 Fix masking for torch 4 years ago
Ervin Teng 5a5bd515 Fix multiple obs 4 years ago
Andrew Cohen b36fcf16 discrete runs/cont passes 4 years ago
vincentpierre e85d8e35 Added a comment and included the change of #4715 for simplicity 4 years ago
Ervin Teng 8d29114d Update curiosity reward provider 4 years ago
Andrew Cohen ad951493 debugging discrete 4 years ago
Ervin Teng 79a3051e Update GAIL and BC 4 years ago
Andrew Cohen fcf6471e 2d discrete passes 4 years ago
GitHub 2c744129 Merge pull request #4714 from Unity-Technologies/develop-fix-mask 4 years ago
Ervin Teng fdaa8c3d Merge branch 'develop-unified-obs' into develop-centralizedcritic 4 years ago
Andrew Cohen 056630d7 sac continuous and discrete train 4 years ago
GitHub 990f801a Develop hybrid action staging (#4702) 4 years ago
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 years ago
Ervin Teng 6846af21 Multi-input network 4 years ago
Andrew Cohen 85e4db33 bc tests pass 4 years ago
vincentpierre 93ca1409 fixing the tests 4 years ago
vincentpierre 7a5cc9ec Merge master into develop-rm-tf 4 years ago
Ervin Teng cb4b7ed3 Some minor tweaks but still broken 4 years ago
Andrew Cohen 24fd9b3c torch reward providers all pass 4 years ago
vincentpierre 12619155 added some docstrings 4 years ago
Andrew Cohen dee6b805 fixed bug in discrete 4 years ago
vincentpierre c1587bce Solving merge conflicts 4 years ago
Ervin Teng 2b8ab09d Use Vince's ONNX export code 4 years ago
Ervin Teng 558bfb70 Cleanup 4 years ago
Andrew Cohen b5d1c071 Merge branch 'master' into develop-action-buffer 4 years ago
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 years ago
Andrew Cohen 73b778cc rename extract to from_dict 4 years ago
GitHub cc6b4564 Multi Directional Walker and Initial Hypernetwork (#4740) 4 years ago
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 years ago
GitHub ad5f878c [refactor] Remove critic pass during inference (#4743) 4 years ago
GitHub 11687f8d [cherry-pick] Cherry-pick #4743 into Release 11 (#4756) 4 years ago
GitHub 22658a40 use sensor types to differentiate obs (#4749) 4 years ago
vincentpierre 14378aa5 Merging master 4 years ago
vincentpierre 0c81006d addressing comments 4 years ago
Ervin Teng ecac7ca1 Set num teammates to 2 4 years ago
Arthur Juliani 95441b75 Use separate hypernetwork module 4 years ago
Arthur Juliani 03ee5833 black formatting 4 years ago
Andrew Cohen eef14922 discrete/contionuous unity envs train 4 years ago
Arthur Juliani 79898e06 Use hypernetwork in both places 4 years ago
Arthur Juliani f36b10d8 Fix typo 4 years ago
Andrew Cohen a545859e fix torch test policy 4 years ago
Andrew Cohen 43387daa remove unused import 4 years ago
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 years ago
GitHub 6f4f6044 Fix model inference issue with Barracuda v1.2.1 (#4766) (#4768) 4 years ago
Ervin Teng 170c269e Remove prints 4 years ago
Andrew Cohen e168bb4f add docstrings to AgentAction and ActionLogProbs 4 years ago
GitHub 9fbf56e2 Fix model inference issue with Barracuda v1.2.1 (#4766) 4 years ago
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 years ago
GitHub a73f7d73 Turn down gain on GAIL discriminator output (#4762) 4 years ago
Andrew Cohen 06f1f254 1:1 and continuous/discrete train 4 years ago
Andrew Cohen 7750bccd all hybrid simple rl tests pass 4 years ago
Ervin Teng 2462db69 Add comment to ghost trainer 4 years ago
GitHub 559609a2 Update ml-agents/mlagents/trainers/torch/utils.py 4 years ago
Andrew Cohen 6174c428 move action model to explicit distributions 4 years ago
GitHub b6bb01b9 Turn down gain on GAIL discriminator output (#4762) (#4772) 4 years ago
vincentpierre c3699de8 merging master and addressing comments 4 years ago
Ervin Teng f207a1a6 Revert "Add comment to ghost trainer" 4 years ago
vincentpierre e0cf88e6 Made a nicer comment 4 years ago
Andrew Cohen f2c9d184 fix set_actions_for_agent 4 years ago
GitHub 29d94c7c Merge pull request #4734 from Unity-Technologies/develop-obs-as-list 4 years ago
Ervin Teng e4e0e2bd Scale size of CC network 4 years ago
Andrew Cohen 1d234d1d bc works 4 years ago
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 years ago
Ervin Teng a3885120 Scale value network based on num agents 4 years ago
Andrew Cohen 8d7e449f torch curiosity tests pass 4 years ago
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 years ago
Andrew Cohen 7973b46c remove print bc 4 years ago
Andrew Cohen c0d01baf Merge branch 'master' into merge-release11-master 4 years ago
Andrew Cohen 95566e44 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 years ago
vincentpierre c5a057d2 renaming obs_spec variables 4 years ago
HH 0cc4eed4 Merge branch 'develop-centralizedcritic' into hh/develop-pushblockcollab 4 years ago
Andrew Cohen e81e68de comms agent and fixed hallway 4 years ago
vincentpierre 44ed3258 Merging master 4 years ago
Andrew Cohen ca5a5194 soccer comms on the cloud 4 years ago
Andrew Cohen 12828bdc remove tau from diff for 4 years ago
Andrew Cohen 8543b917 add kickpower bonus 4 years ago
Andrew Cohen 708ac9bf use hard=true gbsm 4 years ago
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 years ago
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 years ago
Andrew Cohen c843e3d4 hallway collab exps on cloud 4 years ago
Andrew Cohen 272affe0 preliminary aciton model tests 4 years ago
Andrew Cohen a20287f7 continuous comms 4 years ago
Andrew Cohen 17496265 move AgentAction, ActionLogProbs, and ActionFlattener to separate files 4 years ago
vincentpierre f7a4a31f [Experiment] Bullet hell 4 years ago
Chris Elion 76ebc20c Merge remote-tracking branch 'origin/master' into r12-to-master 4 years ago
Andrew Cohen 2dc2ffe3 add action util files 4 years ago
Andrew Cohen f57875e0 layer norm 4 years ago
GitHub 458fee17 Merge pull request #4763 from Unity-Technologies/develop-att 4 years ago
vincentpierre 36cc4665 Removing some vis and vec fields from policy.py 4 years ago
Andrew Cohen bc77c990 layer norm and weight decay with fixed architecture 4 years ago
GitHub e344fe79 Make memory contiguous (#4804) 4 years ago
vincentpierre ab0dd150 formatting 4 years ago
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 years ago
vincentpierre 519c5f47 merging master 4 years ago
Andrew Cohen fad37dc5 add default args to LinearEncoder 4 years ago
Ervin Teng ad439fb6 Additional changes 4 years ago
vincentpierre a482a47c fixing tests 4 years ago
Andrew Cohen 21365c04 formatting 4 years ago
Ervin Teng d02a1033 Some more fixes 4 years ago
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 years ago
Andrew Cohen 7ba10239 remove action spec attribute from policy 4 years ago
Andrew Cohen 96c01a63 custom layer norm 4 years ago
Ervin Teng efa67290 Self-attention Centralized Critic 4 years ago
Andrew Cohen 662fd6b1 added docstrings to action flattener 4 years ago
GitHub a272bff0 Merge pull request #4803 from Unity-Technologies/develop-remove-vec-vis-fields 4 years ago
GitHub d7c3022d Export separate nodes for continuous/discrete actions (#4655) 4 years ago
Andrew Cohen d0c8b5f1 added docstrings to action model 4 years ago
Andrew Cohen 90e0e8b2 concat self 4 years ago
HH 5c5539af add zomb scene 4 years ago
GitHub cc948a41 Policy output actiontuple (#4651) 4 years ago
Andrew Cohen 89ed777e increase bullets to 40 4 years ago
Andrew Cohen 886883b3 Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 years ago
Andrew Cohen 60816204 revert changes to 20 bullets 4 years ago
Andrew Cohen 231328ea remove warning prints 4 years ago
GitHub 9689449f Refactor of attention (#4840) 4 years ago
Andrew Cohen 010803ea separate entity encoder and RSA 4 years ago
Arthur Juliani 0b4b0992 Rename more files 4 years ago
Andrew Cohen 3ca65063 clean up args in mha 4 years ago
Arthur Juliani 7c37c759 Fix some mis-renamings 4 years ago
Andrew Cohen 701c1a3f fix test torch distributions 4 years ago
GitHub b7e6efa3 Allow setting maximum number of elements in self-attention to None (#4841) 4 years ago
vincentpierre 56972f56 WIP integrate attention to networkbody 4 years ago
Andrew Cohen 540b930b add defaults to linear encoder, initialize ent encoders 4 years ago
Andrew Cohen 9ae8a720 more cleanups 4 years ago
Ervin Teng aba633b2 Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 years ago
vincentpierre d27459bd Fix error in embedding 4 years ago
Andrew Cohen 631ac7f4 fixed tests 4 years ago
Ervin Teng 30a09c6f Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 years ago
GitHub eb78a477 Add default init/gain to LinearEncoder (#4846) 4 years ago
Ervin Teng 135914fd entity embeddings work with no max 4 years ago
GitHub 0ac990e0 add LayerNorm (#4847) 4 years ago
vincentpierre af58753b adding encoding to self and var len input 4 years ago
Ervin Teng e05e897c One layer for entity embed 4 years ago
vincentpierre 8c1a33a4 Fixing .contiguous missing in resnet 4 years ago
vincentpierre edbac259 Fixing merge conflicts 4 years ago
Ervin Teng 9a3598eb Use 4 heads 4 years ago
Ruo-Ping Dong a7d04be6 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 years ago
GitHub d30e9f28 Merge pull request #4851 from Unity-Technologies/fix-contiguous-resnet 4 years ago
vincentpierre 5039b65a Merge branch 'master' into develop-att-network-integration 4 years ago
Arthur Juliani 0a876b9c Fix typos 4 years ago
Ervin Teng 2085e17c Merge branch 'master' into develop-centralizedcritic-mm 4 years ago
Ruo-Ping Dong d3348074 fix policy action_spec typo 4 years ago
vincentpierre e7024786 Removing hacky layer norm 4 years ago
Arthur Juliani e3de0406 Plurals 4 years ago
Ervin Teng dfe07e1d Merge branch 'develop-lin-enc-def' into develop-centralizedcritic-mm 4 years ago
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 years ago
HH 0024a286 merge ervin's new stuff 4 years ago
GitHub 12e1fc28 [feature] Hybrid SAC (#4574) 4 years ago
Andrew Cohen b0c02ee0 Merge branch 'develop-hybrid-actions-csharp' into develop-actionmodel-csharp 4 years ago
Ervin Teng 9c3da1b6 New buffer layout, TeamObsUtil, pad dead agents 4 years ago
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 years ago
vincentpierre 8660b1c2 merging master 4 years ago
Ervin Teng eab7e42a Use NaNs to get masks for attention 4 years ago
Ervin Teng 3daa17a9 Merge branch 'develop-centralizedcritic-mm' into develop-zombieteammanager 4 years ago
Arthur Juliani da0c8b9d Add hypernetwork 4 years ago
Ervin Teng bf7195f1 Refactor multi input network slightly 3 years ago
Ervin Teng 1c557bd3 Don't concat self 4 years ago
vincentpierre 38fc2536 addresing some comments 4 years ago
Ervin Teng 6b8b3db3 Try subtract marginalized value 4 years ago
Ervin Teng 92fc78a5 Use new trajectory 3 years ago
Ervin Teng b2c498de Use NaNs to get masks for attention 4 years ago
GitHub e9ff7705 Fix torch tensor non-contiguous issue (#4855) 4 years ago
Andrew Cohen ad807327 refactor entityembedding/network body 4 years ago
GitHub 01e0ee00 refactor entityembedding/network body (#4857) 4 years ago
Ervin Teng 5d7345a6 Add Q function with attention 4 years ago
Andrew Cohen 6e6f3246 clean up 4 years ago
Ervin Teng 092ea232 Some more progress - still broken 4 years ago
Andrew Cohen 4e37974c moved embedding layer norm into network body 4 years ago
GitHub 89b6c949 use singular entity embedding (#4873) 4 years ago
Ervin Teng 457b2630 I think it's running 4 years ago
Andrew Cohen 5caaef52 move entity max computation to network body 4 years ago
Ervin Teng 65b866b0 Actions added but untested 4 years ago
Andrew Cohen 86d4c5c5 move embedding norm into RSA block 4 years ago
Ervin Teng 3e481f7d Fix issue with team_actions 4 years ago
brccabral 457fb612 Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents 4 years ago
Ervin Teng 0919a32d Add next action and next team obs 4 years ago
Andrew Cohen 44088167 separate forward into q_net and baseline 4 years ago
Andrew Cohen 07e92563 Merge branch 'develop-centralizedcritic-counterfact' into develop-coma2 4 years ago
Andrew Cohen 6e1826f8 might be right 4 years ago
vincentpierre 52b011d6 _ 4 years ago
vincentpierre 03c905b2 Fix equation for entropy 4 years ago
Andrew Cohen 1511588d forcing this to work 4 years ago
vincentpierre 5f9ea5ea _ 4 years ago
vincentpierre 396bc43c Merging master 4 years ago
GitHub d4455936 Merge pull request #4869 from Unity-Technologies/fix-normal-entropy 4 years ago
vincentpierre a4b78d53 _ 4 years ago
vincentpierre b7c7d773 Adding some tests 4 years ago
vincentpierre 6f3ea7b8 _ 4 years ago
vincentpierre 92005ea6 Better placeholder for encoded_self than None 4 years ago
Andrew Cohen feb38012 add lambda return and target network 4 years ago
Arthur Juliani e8d54b6f Use hypernetwork if there is a goal 4 years ago
vincentpierre 099a51ef removing extra line 4 years ago
vincentpierre 65c2fa7f Making EntityEmbedding encode self and entities separately 4 years ago
Arthur Juliani 512482c8 Check that goal is a goal 4 years ago
GitHub bd4bc66b Merge branch 'master' into fix-numti-env-delayed-spawn 4 years ago
vincentpierre f101393c Better bounds for log probabilities 4 years ago
vincentpierre 77eecc6b Merge branch 'master' into develop-att-network-integration 4 years ago
Arthur Juliani 1cf97635 Additional conditional experiments 4 years ago
vincentpierre 7e47f94b addressing comments 4 years ago
Andrew Cohen a4c336c2 value estimator 4 years ago
GitHub d7f549f9 Run pytest on GPU (#4865) 4 years ago
Arthur Juliani d2526ce2 Modify CrawlerDynamic 4 years ago
GitHub 3c1e98ca Update ml-agents/mlagents/trainers/torch/distributions.py 4 years ago
vincentpierre c27a95f0 Make a self encoder before EntityEmbedding 4 years ago
Arthur Juliani ff70c5c4 Merge branch 'master' into goal-conditioning-new 3 years ago
vincentpierre 2bf6737f addressing comments 3 years ago
Arthur Juliani b8e81b00 Make lists modulelists 3 years ago
vincentpierre fd007f53 Attempting to use EntityEmbedding directly as processor 3 years ago
vincentpierre efa5a164 addressing comments and adding the changes to rpc_utils 3 years ago
Arthur Juliani 759fd2b5 PushJump modifications 3 years ago
vincentpierre 19a8244b [skip ci] Removing no-longer-useful comment 3 years ago
Arthur Juliani 21aaa5fe Add goal to hyper input 3 years ago
Arthur Juliani a180dbf7 Add visual version of task and simply encoders 3 years ago
Andrew Cohen 7f491ae7 cloud run with coma2 of held out zombie test env 3 years ago
vincentpierre 9fbc2e0e _ 3 years ago
Arthur Juliani 7165e9cf Make conditiontype a setting 3 years ago
Andrew Cohen 9af22d30 use only value funcs 3 years ago
vincentpierre bf16bad6 _ 3 years ago
Andrew Cohen 511a9a7e no baseline 3 years ago
vincentpierre 6a61eb05 - 3 years ago
vincentpierre 1acdc155 Changes to hypernet 3 years ago
vincentpierre 82d80d72 better error message 3 years ago
GitHub 2fb87e4f Merge branch 'master' into reward-dist 3 years ago
vincentpierre 04bdb40c Reorder operations 3 years ago
GitHub 212ebfb9 Merge pull request #4844 from Unity-Technologies/develop-att-network-integration 3 years ago
GitHub ca8a4ae2 Merge pull request #4990 from Unity-Technologies/goal-conditioning-new-vince 3 years ago
Andrew Cohen 687f411b try again on cloud 3 years ago
Chris Elion 9d70220e Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 3 years ago
Arthur Juliani 1d106816 Support multiple goals in networkbody 3 years ago
Arthur Juliani 7ba4ca17 Reformat line 3 years ago
Arthur Juliani 2552cf18 Address comments 3 years ago
GitHub 001990af Merge pull request #5008 from Unity-Technologies/multi-goal-conditioning 3 years ago
Andrew Cohen f9ff3fef shared baseline and v 3 years ago
Ervin Teng bca6c92c Add clipping, use same network for value 3 years ago
Arthur Juliani 4413203d Sensor cleanup 3 years ago
Ervin Teng 3283b6a1 Remove Q-net for perf 3 years ago
GitHub 64fc7f43 Buffer key enums (#4907) 3 years ago
Ervin Teng 2be83146 Use same network 3 years ago
Ervin Teng 6094613d try reduce bias more 3 years ago
vincentpierre 8baaaf4d Changing model export to be compatible with Barracuda 3 years ago
Andrew Cohen c08fefbc reduce initialization weights 3 years ago
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 3 years ago
Andrew Cohen eeabb974 Separate Actor/Critic, remove ActorCritics 3 years ago
Andrew Cohen 6bd396ee add critic to optimizer, ppo runs 3 years ago
Ervin Teng c7054d76 Use attention tests from master 3 years ago
Ervin Teng 0bde7598 Back out trainer changes 3 years ago
Ervin Teng da6a55a0 Revert "Use attention tests from master" 3 years ago
Ervin Teng 7c826fb1 Working GRU 3 years ago
Ervin Teng 514873bf Use correct memories (t-1 instead of t) for training 3 years ago
Ervin Teng bac2fb68 Use attention from master 3 years ago
Ervin Teng e9025079 Properly use MemoryModule abstraction 3 years ago
Andrew Cohen c74dca9f add SharedActorCritic 3 years ago
Ervin Teng f3a2a81f Merge branch 'develop-fix-lstms' into develop-gru 3 years ago
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 3 years ago
Ervin Teng a9116382 Bug fixes 3 years ago
Andrew Cohen 98d647de MultiInputNetBody 3 years ago
vincentpierre bd6ab0f7 Silencing a Torch warning that raises when exporting the attention module to ONNX 3 years ago
Christopher Goy 9cadfa7a Merge master -> release_13_branch-to-master 3 years ago
vincentpierre e1b94b8b Merge branch 'master' into develop-var-len-obs-feature 3 years ago
GitHub 332e9b8b Merge pull request #4909 from Unity-Technologies/develop-var-len-obs-feature 3 years ago
Ruo-Ping Dong b5da488d Merge branch 'master' into develop-base-teammanager 3 years ago
Andrew Cohen dc8e8494 Merge branch 'master' into develop-critic-optimizer 3 years ago
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 3 years ago
Ervin Teng 93a59971 Merge branch 'develop-critic-optimizer' into develop-critic-op-lstm 3 years ago
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 3 years ago
Ervin Teng fd3f05b9 Enable GAIL to decay 3 years ago
vincentpierre 811319c0 Use clipped gaussian distribution for entropy calculation 3 years ago
Ervin Teng 7b41e5d6 Add GAIL learning rate to TB 3 years ago
GitHub 4d5545c8 Set ignore done=False in GAIL (#4971) 3 years ago
Chris Elion c3bc8991 cleanup, don't store mask 3 years ago
Ervin Teng f409c40c Merge branch 'master' into develop-agentprocessor-teammanager 3 years ago
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 3 years ago
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 3 years ago
Ervin Teng 08db7c2f Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm 3 years ago
Ervin Teng 3d2171c4 Add team methods to AgentAction 3 years ago
Ervin Teng 9bc88c41 Running COMA (not sure if learning) 3 years ago
Ervin Teng 12cef7af Add test for GroupObs 3 years ago
Ervin Teng 1fc3640e Change AgentAction back to 0 pad and add tests 3 years ago
Ervin Teng c6904f86 Group reward function 3 years ago
Ervin Teng ffdfd8ff Address some comments 3 years ago
GitHub 338af2ec Move the Critic into the Optimizer (#4939) 3 years ago
Ervin Teng 13fca55f Add more comments 3 years ago
Ervin Teng 60f2b72e Rename internal function 3 years ago
Ervin Teng be45d8c0 Move padding method to AgentBufferField 3 years ago
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 3 years ago
vincentpierre 22db0335 Modified the model_serialization to have correct inputs and outputs 3 years ago
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 3 years ago
vincentpierre 5b5171f8 switching from CamelCase to snake_case 3 years ago
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 3 years ago
vincentpierre 3068ae1f Adding a helper method for creating observation placeholder names and removed the _h and _c placeholders 3 years ago
Ervin Teng 56d4c1f9 Fix to-flat and add tests 3 years ago
Andrew Cohen 5d517c5e clean ups 3 years ago
Andrew Cohen e2d46ca0 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 3 years ago
Andrew Cohen 8562471e add inital coma optimizer tests 3 years ago
GitHub c9c7e3d0 Faster NaN masking, fix masking for visual obs (#5015) 3 years ago
GitHub c3c34267 Merge pull request #5018 from Unity-Technologies/develop-use-correct-names-for-recurrent-inputs-and-outputs 3 years ago
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 3 years ago
Arthur Juliani f49dcf8e Update network (partially) 3 years ago
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 3 years ago
Ervin Teng fd0dd35c Merge branch 'main' into develop-coma2-trainer 3 years ago
Arthur Juliani ce1d3d88 Resolve conflicts in networkbody 3 years ago
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 3 years ago
Arthur Juliani fc1ee931 Fix hypernetwork 3 years ago
GitHub 0ddfbfee V2 staging new model version (#5080) 3 years ago
Christopher Goy d1acc658 V2 staging new model version (#5080) 3 years ago
Andrew Cohen 81524ee8 lstm almost runs 3 years ago
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 3 years ago
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 3 years ago
GitHub ba2af269 [coma2] Make group extrinsic reward part of extrinsic (#5033) 3 years ago
Chris Elion a362b3d9 Merge remote-tracking branch 'origin/v2-staging' into csharp-obs-spec 3 years ago
GitHub 6ae8ea1e [coma2] Add support for variable length obs in COMA2 (#5038) 3 years ago
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 3 years ago
Andrew Cohen 0afe5f24 add slice function to agent action 3 years ago
GitHub d2635e58 Action slice (#5047) 3 years ago
Andrew Cohen cb13a8ca add type/docstring to slice 3 years ago
Andrew Cohen f165bfb5 update comment 3 years ago
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 3 years ago
GitHub 8f35bdd3 POCA trainer (#5005) 3 years ago
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 3 years ago
GitHub e79d8a9d [bug-fix] Move POCA critic to default device (#5124) 3 years ago
GitHub e6143a83 [bug-fix] Move POCA critic to default device (#5124) (#5131) 3 years ago
GitHub ef3d6e0d Adding Hypernetwork modules and unit tests (#5141) 3 years ago
Ervin Teng 41dd16e8 Merge branch 'main' into release_15_mm 3 years ago
Christopher Goy 90872d1d V2 staging new model version (#5080) 3 years ago
vincentpierre e83c350a Fixing networks.py for the merge 3 years ago
vincentpierre 9609c79c Fixing networks.py for the merge 3 years ago
GitHub ea2b2f20 Merge pull request #5158 from Unity-Technologies/v2-staging-rebase-2-fix-networks 3 years ago
GitHub 78851829 Merge pull request #5157 from Unity-Technologies/v2-staging-rebase-2 3 years ago
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 3 years ago
GitHub 81705d6d Goal conditioning integration (#5142) 3 years ago
vincentpierre d4716caa Merge branch 'main' into goal-conditioning-sensors-3 3 years ago
GitHub c37cfac1 Adding the goal conditioning sensors with the new observation specs (#5159) 3 years ago
vincentpierre 1b4fd8fb Renaming GOAL to GOAL_SIGNAL 3 years ago
GitHub b9cab453 [perf] Optimizations for performance (#5192) 3 years ago
GitHub 2e19759c Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) 3 years ago
GitHub dffc37bf Update to barracuda 1.3.3 and changes to the model inputs and outputs for LSTM (#5236) 3 years ago
vincentpierre 51adab1c Fix the attention module embedding size 3 years ago
GitHub 353b1566 Fix the attention module embedding size (#5272) 3 years ago
GitHub 1678be1c POCA Attention will use h_size for embedding size and not 128 (#5281) 3 years ago
GitHub 5e25015f POCA Attention will use h_size for embedding size and not 128 (#5281) (#5287) 3 years ago
vincentpierre 4e14879d Updating the barracuda 1.4.0 (#5291) 3 years ago
vincentpierre bf8acbb0 - 3 years ago
Miguel Alonso Jr 4846cf0f Merge branch 'main' into develop-api-documentation-update 3 years ago
GitHub 806f04bd Readding the validation of the minimal cnn input size (#5345) (#5346) 3 years ago
GitHub 15440c24 Readding the validation of the minimal cnn input size (#5345) 3 years ago
GitHub bb07eb45 Adding a fully connected visual encoder for super small visual input + tests (#5351) 3 years ago
GitHub ce1808f2 [bug-fix] Make resnet barracuda-compatible (#5358) (#5364) 3 years ago
Miguel Alonso Jr 97b7d5c6 Merge branch 'main' into develop-api-documentation-update 3 years ago
GitHub 20c0c728 [bug-fix] Make resnet barracuda-compatible (#5358) 3 years ago
vincentpierre 983982ee Removing misleading learning rate 3 years ago