43 次代码提交 (5bf72236-0d33-48f0-b1b1-598f1ddd2109)

作者 SHA1 备注 提交日期
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
GitHub be4292fb Add different types of visual encoder (nature cnn/resnet) 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 年前
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub 149ebd67 Fix crash with VAIL + GAIL (#2598) 5 年前
GitHub 24ba9d58 Develop deprecate broadcasting (#2669) 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng a80b47d1 Fix demo loader and remaining tests 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
GitHub 1fa07edb Remove Standalone Offline BC Training (#2969) 5 年前
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 年前
GitHub 29c91b14 update flake8 plugin version and fix warnings (#3180) 5 年前
Yuan Gao 0817c44b Moved the demo files 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前
GitHub 0ff8f9af Create ML-Agents Package (#3267) 5 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Ervin Teng ee27e2cc Fix tests 5 年前
Arthur Juliani 3c82bf59 Training runs, but doesn’t actually work 5 年前
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 5 年前
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 69579611 [refactor] Refactor Actor and Critic classes (#4287) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 年前