445 次代码提交 (880d390b-9fec-4cd6-a140-8015f0868741)

作者 SHA1 备注 提交日期
vincentpierre 96452986 Initial commit for multi head attention 4 年前
GitHub 95efe03b [MLA-1519] Don't mark action_probs as an output node. (#4613) 4 年前
GitHub 64e998a2 [bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610) 4 年前
GitHub 6ce2434d Fix typo in URL (#4604) 4 年前
GitHub a4ba90ec [bug-fix] Fix ONNX export/Barracuda import for continuous actions (#4608) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
GitHub 0e91ff08 Reword readme (#4603) 4 年前
GitHub e0ef30a5 [bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538) 4 年前
GitHub b5dd43f2 [bug-fix] Don't load non-wrapped policy (#4593) 4 年前
GitHub d91d4a19 ActionBuffer with hybrid actions for RemotePolicy (#4592) 4 年前
GitHub a7f10f8a Update Training-on-Amazon-Web-Service.md (#4591) 4 年前
GitHub ca1aacc1 Assign new BrainParametersProto fields based on capabilities (#4581) 4 年前
GitHub a690af74 [refactor] Make PyTorch the default and TensorFlow optional (#4517) 4 年前
GitHub be723c66 Change BrainParametersProto to support ActionSpec (#4579) 4 年前
GitHub 7a33b67a update barracuda version, enable tests for 2020.2 (#4571) 4 年前
GitHub 4cfae52e Add hybrid action capability flag (#4576) 4 年前
GitHub dde34423 [bug-fix] Use proper masking for entropy and policy losses (#4572) 4 年前
Ruo-Ping Dong 91700916 add hybrid action capability flag 4 年前
GitHub 024bb104 [MLA-1474] detect recursion on Agent methods and throw (#4573) 4 年前
GitHub dc40c926 Merge release 8 back to master (#4570) 4 年前
GitHub 82debf3b clean up EndEpisode demo code (#4563) 4 年前
GitHub 86d8ee4a use v2 action and pin python version (#4568) 4 年前
GitHub 12bd0063 Add updated Dockerfile and CI build (#4543) 4 年前
GitHub eff5a484 [CI] use downloader, update versions (#4562) 4 年前
GitHub ec307cb3 Update Learning-Environment-Create-New.md (#4554) 4 年前
GitHub 69336624 remove demonstration recorder from crawler agent (#4556) 4 年前
GitHub 30c8caeb Moving back the template to the Examples folder rather than shared assets (#4559) 4 年前
GitHub 1f9f1798 [Test] Make tests to compare compressed and non-compressed obs (#4547) 4 年前
GitHub 0bc31929 [Release 8] update versions on master (#4549) 4 年前
GitHub 1cbbbe86 Update changelog for release 8 (#4548) 4 年前
GitHub 9cd822e3 RU - documentation localization (#4529) 4 年前
GitHub efa2a704 add to_string for samplers (#4484) 4 年前
GitHub b33e310f Add Visual3DBall scene (#4513) 4 年前
GitHub 5bc169b4 Clean up AcademyFixedUpdateStepper when playmode changed (#4532) 4 年前
GitHub 9e1a28c2 Add vector flag of agent's frozen state to VisualFoodCollector (#4511) 4 年前
GitHub c3d2b902 Support multi-dimensional and compressed observations stacking (#4476) 4 年前
GitHub 6aed3c75 Convert pypi publish to github actions (#4539) 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
GitHub 28950f55 Fixing CPU resource allocation for new CI (#4525) 4 年前
GitHub 88e45ae9 Extensions package documentation (#4450) 4 年前
GitHub fdf73ee5 update grid sensor docs (#4516) 4 年前
GitHub 05fc088d [refactor] Don't compute grad for q2_p in SAC Optimizer (#4509) 4 年前
GitHub 400e14cb [Bug-fix] RND would not be saved correctly. Added tests (#4514) 4 年前
GitHub e471bd8b Refactoring of the tests folder for the trainers (#4510) 4 年前
GitHub f5bb26d2 WallJump - handle Agent starting before ModelOverrider (#4502) 4 年前
GitHub 215b35c6 [refactor] Optimize buffer sample_minibatch (#4508) 4 年前
GitHub 60b76790 Random Network Distillation for Torch (#4473) 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Ervin Teng b6095151 Execute critic with LSTM 4 年前