78 次代码提交 (024bb104-c278-45a6-afc3-552ac446c9a9)

作者 SHA1 备注 提交日期
Arthur Juliani dc50162d Add ResNet and distributions 5 年前
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 5 年前
Arthur Juliani 8c6f4696 Fix a couple additional bugs 5 年前
Arthur Juliani 61d671d8 Add conditional sigma for distribution 5 年前
Arthur Juliani 4a50444f Support discrete actions as well 5 年前
Arthur Juliani a11a79e4 Continuous and discrete now train 5 年前
Arthur Juliani 5f936990 Visual observations now train as well 5 年前
Ervin Teng 2fae31e6 Remove another if statement 5 年前
Ervin Teng 72180f9b Experiment with JIT compiler 5 年前
Arthur Juliani 46874cc7 ONNX exporting 4 年前
Arthur Juliani e14eb72b Fix some issues with pdf 4 年前
Arthur Juliani 6408fd4e Fix bug in pdf function 4 年前
GitHub 0d80d87a Fix for discrete actions (#4181) 4 年前
GitHub 45154f52 Pytorch port of SAC (#4219) 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 69579611 [refactor] Refactor Actor and Critic classes (#4287) 4 年前
GitHub 5bcbef8d [tests] Add tests for core PyTorch files (#4292) 4 年前
Ervin Teng a172fb46 Halve entropy 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
GitHub dba529ff Fix discrete export (#4322) 4 年前
GitHub 36613cad [add-fire] Fix CategoricalDistInstance test and replace `range` with `arange` (#4327) 4 年前
Ervin Teng 6b29a4c9 Fix test and replace range with arange 4 年前
GitHub 6b193d03 Develop add fire layers (#4321) 4 年前
Ruo-Ping Dong f5dee9d1 jit for continuous control 4 年前
Ruo-Ping Dong ef3be79e sac 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
Andrew Cohen 8294bc59 ignore commit 4 年前
Andrew Cohen b0b2e22e prepare to merge action_out 4 年前
Andrew Cohen 85602279 add action_out to dist 4 年前
Andrew Cohen f0a2c6f2 add action_out abstract method and docstring 4 年前
Andrew Cohen 7c0aa77b Merge branch 'develop-actions-out' into develop-hybrid-actions 4 年前
Andrew Cohen b6b453a0 rename to exported model output 4 年前
Andrew Cohen 9cbf6c3f Merge branch 'develop-actions-out' into develop-hybrid-actions 4 年前
Andrew Cohen 72cd0d39 separate tensors for disc/cont 4 年前
Andrew Cohen 643c8e58 ppo extended 4 年前
Andrew Cohen db37db34 fixing errors 4 年前
Andrew Cohen 44c9879e action models 4 年前
Ervin Teng e8431a6d Proper dimensions for entropy, sum before bonus in PPO 4 年前
Andrew Cohen 190d8e4d action model as a singleton 4 年前
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 年前
GitHub a4ba90ec [bug-fix] Fix ONNX export/Barracuda import for continuous actions (#4608) 4 年前
Ervin Teng f00ff21d Add clipping to Torch 4 年前
Ervin Teng 7e6cc222 Take mean of continuous entropy 4 年前
Ervin Teng 89489ae0 Invert divide by 3 in log prob 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
Ervin Teng 7502a383 Proper dimensions 4 年前
Ervin Teng 1f305f24 Test commit 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
Ervin Teng 66f3d2ad Merge branch 'develop-contentropy' into develop-torch-clip 4 年前
Ervin Teng db6ebff6 Remove clip 4 年前
GitHub 85a7c0f7 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) 4 年前
Ervin Teng 08c8862e Add epsilon to log 4 年前
Ervin Teng eb4f3065 Add another epsilon 4 年前
GitHub de27d7a6 [bug-fix] Fix issue where NaNs are outputted by the policy when training Match3 (#4664) 4 年前
Ervin Teng 4f8f21e4 Bigger scale 4 年前
vincentpierre 90da7426 Fix masking for torch 4 年前
vincentpierre e85d8e35 Added a comment and included the change of #4715 for simplicity 4 年前
GitHub 990f801a Develop hybrid action staging (#4702) 4 年前
GitHub cc6b4564 Multi Directional Walker and Initial Hypernetwork (#4740) 4 年前
Arthur Juliani 79898e06 Use hypernetwork in both places 4 年前
Arthur Juliani f36b10d8 Fix typo 4 年前
GitHub 6f4f6044 Fix model inference issue with Barracuda v1.2.1 (#4766) (#4768) 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 06f1f254 1:1 and continuous/discrete train 4 年前
Andrew Cohen 6174c428 move action model to explicit distributions 4 年前
Andrew Cohen e81e68de comms agent and fixed hallway 4 年前
Andrew Cohen ca5a5194 soccer comms on the cloud 4 年前
Andrew Cohen 12828bdc remove tau from diff for 4 年前
Andrew Cohen 8543b917 add kickpower bonus 4 年前
vincentpierre 52b011d6 _ 4 年前
vincentpierre 03c905b2 Fix equation for entropy 4 年前
vincentpierre 5f9ea5ea _ 4 年前
vincentpierre a4b78d53 _ 4 年前
vincentpierre 6f3ea7b8 _ 4 年前
vincentpierre f101393c Better bounds for log probabilities 4 年前
GitHub 3c1e98ca Update ml-agents/mlagents/trainers/torch/distributions.py 4 年前
vincentpierre 811319c0 Use clipped gaussian distribution for entropy calculation 4 年前
GitHub 0ddfbfee V2 staging new model version (#5080) 4 年前