144 次代码提交 (fe569079-e24e-4245-9940-f011ee69a7fb)

作者 SHA1 备注 提交日期
Andrew Cohen afe9861b add components directory and init 4 年前
GitHub 38ce37c9 Add components directory and init (#4320) 4 年前
Andrew Cohen 22a0cabc changed path to torch bc module 4 年前
Andrew Cohen 8ced43ee clean up types/comments 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
GitHub 6b193d03 Develop add fire layers (#4321) 4 年前
Ervin Teng 4ebccf97 Merge branch 'develop-add-fire' into develop-add-fire-sac-lst 4 年前
Andrew Cohen 598826fe Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
GitHub 3b43972d Fixed the reporting of the discriminator loss (#4348) 4 年前
Andrew Cohen ae2c83e2 added torch bc tests 4 年前
GitHub 6b255790 Behavioral Cloning Pytorch (#4293) 4 年前
Andrew Cohen 742940a3 all bc tests 4 年前
Andrew Cohen 5f3a94cf address comments 4 年前
Andrew Cohen 0a7444f9 revert bc default batch/epoch 4 年前
Ruo-Ping Dong 59cc1a9f Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ervin Teng 13f15086 Merge branch 'develop-add-fire' into develop-add-fire-amrl 4 年前
Ervin Teng d218bf4d Merge branch 'develop-add-fire' into develop-add-fire-sac-lst 4 年前
Ervin Teng 3387f56d Fix BC module 4 年前
GitHub 6a1d993f [add-fire] Memory class abstraction (#4375) 4 年前
Ervin Teng 8ff8c401 Merge branch 'develop-add-fire' into develop-add-fire-export 4 年前
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 年前
vincentpierre 9f51ab14 Saving the reward providers 4 年前
vincentpierre 25454a48 adding tests 4 年前
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 年前
GitHub 328353bc Torch : Saving/Loading of the reward providers (#4405) 4 年前
vincentpierre 31750e97 Using item() in place of to_numpy() 4 年前
Ruo-Ping Dong 88eff042 Merge branch 'master' into develop-saver-name 4 年前
GitHub 12e15e29 Fix on GAIL Torch when using actions (#4407) 4 年前
GitHub 498934f9 Replace torch.detach().cpu().numpy() with a utils method (#4406) 4 年前
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 年前
GitHub 7b4d0865 [Bug fix] Fix bug in GAIL gradient penalty (#4425) 4 年前
GitHub 4e93cb6e [torch] Restructure PyTorch encoders (#4421) 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
GitHub 676f5f7c [refactor] Refactor GAIL to use new encoder structure (#4433) 4 年前
Ervin Teng 60eacc0d Merge branch 'master' into develop-adjust-cpu-settings 4 年前
GitHub 6986fb10 use LinearEncoder in curiosity and clean up (#4444) 4 年前
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 年前
Ervin Teng 43c41d66 Fix BC and Reward Signals 4 年前
Ervin Teng 7754ad7b Don't run value during inference 4 年前
vincentpierre 181bdec0 - 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Ervin Teng f9ff3efe Merge branch 'develop-policyonly' into develop-sac-targetq 4 年前
GitHub 60b76790 Random Network Distillation for Torch (#4473) 4 年前
GitHub 400e14cb [Bug-fix] RND would not be saved correctly. Added tests (#4514) 4 年前
HH a3bf96fd Merge branch 'master' into hh/develop/gridsensor-tests 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 6e23bafd ActionFlattener Refactor 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 年前
Ruo-Ping Dong 9e08be87 Merge branch 'master' into release_9_branch_merge 4 年前
Andrew Cohen 6cf54bf2 remove self.action_spec from policy/bc 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
GitHub 8175d558 [bug-fix] Fix BC module + action clipping (#4667) 4 年前
Ervin Teng bc746839 Normalize GAIL observations 4 年前
Ruo-Ping Dong ee5313e4 Merge branch 'master' into develop-windows-delay 4 年前
Ervin Teng 362f2ec0 Use correct dimensions of gradient 4 年前
GitHub f0ed3a38 Cherry-pick BC fixes to Release 10 (#4668) 4 年前
Ervin Teng 4158629e Properly feed in None rather than empty arrays 4 年前
Ervin Teng 8d29114d Update curiosity reward provider 4 年前
Ervin Teng 79a3051e Update GAIL and BC 4 年前
Ervin Teng fdaa8c3d Merge branch 'develop-unified-obs' into develop-centralizedcritic 4 年前
GitHub 990f801a Develop hybrid action staging (#4702) 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
vincentpierre 93ca1409 fixing the tests 4 年前
vincentpierre 7a5cc9ec Merge master into develop-rm-tf 4 年前
Andrew Cohen 24fd9b3c torch reward providers all pass 4 年前
vincentpierre 12619155 added some docstrings 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 年前
Andrew Cohen 73b778cc rename extract to from_dict 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
vincentpierre 0c81006d addressing comments 4 年前
Andrew Cohen a545859e fix torch test policy 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub a73f7d73 Turn down gain on GAIL discriminator output (#4762) 4 年前
GitHub b6bb01b9 Turn down gain on GAIL discriminator output (#4762) (#4772) 4 年前
vincentpierre c3699de8 merging master and addressing comments 4 年前
GitHub 29d94c7c Merge pull request #4734 from Unity-Technologies/develop-obs-as-list 4 年前
Andrew Cohen 1d234d1d bc works 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
Andrew Cohen 8d7e449f torch curiosity tests pass 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen 7973b46c remove print bc 4 年前
Andrew Cohen c0d01baf Merge branch 'master' into merge-release11-master 4 年前
vincentpierre 44ed3258 Merging master 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 17496265 move AgentAction, ActionLogProbs, and ActionFlattener to separate files 4 年前
Chris Elion 76ebc20c Merge remote-tracking branch 'origin/master' into r12-to-master 4 年前
GitHub 458fee17 Merge pull request #4763 from Unity-Technologies/develop-att 4 年前
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 年前
vincentpierre 519c5f47 merging master 4 年前
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Andrew Cohen 7ba10239 remove action spec attribute from policy 4 年前
Andrew Cohen 886883b3 Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Arthur Juliani 7c37c759 Fix some mis-renamings 4 年前
Ruo-Ping Dong a7d04be6 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Arthur Juliani 0a876b9c Fix typos 4 年前
Arthur Juliani e3de0406 Plurals 4 年前
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 年前
HH 0024a286 merge ervin's new stuff 4 年前
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 年前
vincentpierre 8660b1c2 merging master 4 年前
brccabral 457fb612 Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents 4 年前
vincentpierre 6f3ea7b8 _ 4 年前
Andrew Cohen feb38012 add lambda return and target network 4 年前
GitHub 64fc7f43 Buffer key enums (#4907) 4 年前
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 6bd396ee add critic to optimizer, ppo runs 4 年前
Ervin Teng 0bde7598 Back out trainer changes 4 年前
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 4 年前
Christopher Goy 9cadfa7a Merge master -> release_13_branch-to-master 4 年前
vincentpierre e1b94b8b Merge branch 'master' into develop-var-len-obs-feature 4 年前
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng fd3f05b9 Enable GAIL to decay 4 年前
Ervin Teng 7b41e5d6 Add GAIL learning rate to TB 4 年前
GitHub 4d5545c8 Set ignore done=False in GAIL (#4971) 4 年前
Chris Elion c3bc8991 cleanup, don't store mask 4 年前
Ervin Teng f409c40c Merge branch 'master' into develop-agentprocessor-teammanager 4 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 年前
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 4 年前
Ervin Teng 08db7c2f Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm 4 年前
Ervin Teng c6904f86 Group reward function 4 年前
GitHub 338af2ec Move the Critic into the Optimizer (#4939) 4 年前
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 4 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 4 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 4 年前
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 4 年前
GitHub ba2af269 [coma2] Make group extrinsic reward part of extrinsic (#5033) 4 年前
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 4 年前
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 4 年前
GitHub 8f35bdd3 POCA trainer (#5005) 4 年前
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 4 年前
vincentpierre 4e14879d Updating the barracuda 1.4.0 (#5291) 4 年前
vincentpierre bf8acbb0 - 4 年前
vincentpierre 983982ee Removing misleading learning rate 4 年前