95 次代码提交 (54c4eb43-8bfc-4e88-8cad-1b01aab4cd7a)

作者 SHA1 备注 提交日期
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 74c99ec8 [refactor] Refactor normalizers and encoders (#4275) 4 年前
GitHub 69579611 [refactor] Refactor Actor and Critic classes (#4287) 4 年前
GitHub d8db1477 [bug-fix] Fix error with discrete probs (#4309) 4 年前
GitHub 5bcbef8d [tests] Add tests for core PyTorch files (#4292) 4 年前
GitHub 3a982317 [add-fire] Add learning rate and beta/epsilon decay to PyTorch (#4318) 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
GitHub 6b193d03 Develop add fire layers (#4321) 4 年前
GitHub f374f87a [add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324) 4 年前
Ervin Teng 1d4bc99e Proper mask mean for PPO 4 年前
GitHub 6de31a03 [add-fire] Fix masked mean for 2d tensors (#4364) 4 年前
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 年前
Andrew Cohen 5b1620e9 moved encoders to processors 4 年前
Ruo-Ping Dong ef3be79e sac 4 年前
GitHub 4e93cb6e [torch] Restructure PyTorch encoders (#4421) 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
Ervin Teng fdc887a1 Some experimental stuff 4 年前
GitHub bf6506fc [feature] Add small CNN for grids 5x5 and up (#4434) 4 年前
Ervin Teng 77c810fb Fix SAC and make utility method 4 年前
Ervin Teng 3f7d815c detach masks 4 年前
vincentpierre 181bdec0 - 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Andrew Cohen 190d8e4d action model as a singleton 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 年前
Andrew Cohen eaecb59e torch utils to and from buffer 4 年前
Andrew Cohen 6e23bafd ActionFlattener Refactor 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
GitHub 64e998a2 [bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610) 4 年前
vincentpierre d3d4eb90 Trainer with attention 4 年前
vincentpierre 7ef3c9a1 Trainer with attention 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
Andrew Cohen ae920478 resolve conflicts 4 年前
vincentpierre 6fcbba53 Refactoring the code to make it more flexible. Still a hack 4 年前
vincentpierre f283cb60 different architecture 4 年前
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 年前
Ervin Teng 7a0ebfbd Pretty broken 4 年前
Ervin Teng 95bdbba3 Less broken PPO 4 年前
Andrew Cohen bd917c9c action buffer passes continuous 4 年前
Andrew Cohen b36fcf16 discrete runs/cont passes 4 年前
Andrew Cohen ad951493 debugging discrete 4 年前
Andrew Cohen fcf6471e 2d discrete passes 4 年前
Andrew Cohen 056630d7 sac continuous and discrete train 4 年前
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
vincentpierre 93ca1409 fixing the tests 4 年前
Andrew Cohen 24fd9b3c torch reward providers all pass 4 年前
vincentpierre 12619155 added some docstrings 4 年前
Andrew Cohen dee6b805 fixed bug in discrete 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Andrew Cohen b5d1c071 Merge branch 'master' into develop-action-buffer 4 年前
Andrew Cohen 73b778cc rename extract to from_dict 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
Andrew Cohen eef14922 discrete/contionuous unity envs train 4 年前
Andrew Cohen e168bb4f add docstrings to AgentAction and ActionLogProbs 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 06f1f254 1:1 and continuous/discrete train 4 年前
Andrew Cohen 7750bccd all hybrid simple rl tests pass 4 年前
GitHub 559609a2 Update ml-agents/mlagents/trainers/torch/utils.py 4 年前
Andrew Cohen 6174c428 move action model to explicit distributions 4 年前
Andrew Cohen 1d234d1d bc works 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
Andrew Cohen 8d7e449f torch curiosity tests pass 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen 95566e44 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
vincentpierre 44ed3258 Merging master 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre f7a4a31f [Experiment] Bullet hell 4 年前
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
vincentpierre 56972f56 WIP integrate attention to networkbody 4 年前
vincentpierre 8660b1c2 merging master 4 年前
vincentpierre 38fc2536 addresing some comments 4 年前
Andrew Cohen ad807327 refactor entityembedding/network body 4 年前
GitHub 01e0ee00 refactor entityembedding/network body (#4857) 4 年前
vincentpierre 396bc43c Merging master 4 年前
vincentpierre b7c7d773 Adding some tests 4 年前
Arthur Juliani e8d54b6f Use hypernetwork if there is a goal 4 年前
vincentpierre 7e47f94b addressing comments 4 年前
vincentpierre c27a95f0 Make a self encoder before EntityEmbedding 4 年前
vincentpierre 2bf6737f addressing comments 4 年前
vincentpierre fd007f53 Attempting to use EntityEmbedding directly as processor 4 年前
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
GitHub 6ae8ea1e [coma2] Add support for variable length obs in COMA2 (#5038) 4 年前
vincentpierre 51adab1c Fix the attention module embedding size 3 年前
GitHub 806f04bd Readding the validation of the minimal cnn input size (#5345) (#5346) 3 年前
GitHub bb07eb45 Adding a fully connected visual encoder for super small visual input + tests (#5351) 3 年前