63 commits (47893e9c-4269-4b8d-9c76-911f580807fe)

Author SHA1 Message Commit date
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 years ago
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
GitHub 473a8758 Develop yaml json loading errors (#2601) 5 years ago
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 years ago
Chris Elion 806c77e4 centralize tensorflow imports 5 years ago
GitHub 24ba9d58 Develop deprecate broadcasting (#2669) 5 years ago
Chris Elion a1967c19 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 years ago
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 years ago
GitHub 0892ef2c [WIP] ISensor interface and use for visual observations (#2731) 5 years ago
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 years ago
Chris Elion 73a346cb cleanup 5 years ago
Ervin Teng 29cdf77a Fix RL tests 5 years ago
Ervin Teng 3a4fa244 Switch to tanh squash in PPO 5 years ago
GitHub 652488d9 check for numpy float64 (#2948) 5 years ago
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 years ago
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 years ago
GitHub 3b4b0d55 Remove random normal epsilon (#3039) 5 years ago
Andrew Cohen ef2dfd4c adjusting tests to expect trainer.add_policy to be called 5 years ago
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 years ago
Ervin Teng 27c2a55b Lots of test fixes 5 years ago
Jonathan Harper 9f166f9e Update tests to support pytest 5.x 5 years ago
Andrew Cohen 70357569 adjusting tests to expect trainer.add_policy to be called 5 years ago
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 years ago
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 years ago
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 years ago
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 years ago
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 years ago
Ervin Teng 3d25f9d2 Merge branch 'master' into develop-agentprocessor 5 years ago
GitHub 29c91b14 update flake8 plugin version and fix warnings (#3180) 5 years ago
Ervin Teng ce75b378 update flake8 plugin version and fix warnings (#3180) 5 years ago
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 years ago
GitHub 329b23e0 Fix extra summary being written when loading from checkpoint (#3272) 5 years ago
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 years ago
Ervin Teng f64bdc4b Fix SAC RNN test 5 years ago
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 years ago
GitHub b6e3fd67 [tests] Add additional unit tests (#3581) 5 years ago
GitHub 873ba7fd [bug-fix] Fix stats reporting for reward signals in SAC (#3606) 5 years ago
GitHub c42a11c3 [change] Throw a proper error when sequence length is greater than batch size. (#3583) 5 years ago
Ervin Teng ee27e2cc Fix tests 5 years ago
Andrew Cohen 3013774b alternative to internal-policy fix 5 years ago
Andrew Cohen 0af2a651 fixed test_sac 5 years ago
Ervin Teng 99ce4b59 Improve tests 5 years ago
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 years ago
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 5 years ago
Andrew Cohen 89db8428 Merge branch 'internal-policy-ghost-alternate' into soccer-2v1 5 years ago
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 years ago
GitHub a7323393 [bug-fix] Fix issue with SAC updating too much on resume (#4038) 4 years ago
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 years ago
Ruo-Ping Dong e06812aa fix tests 4 years ago
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 years ago
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 years ago
Ruo-Ping Dong 523248be update 4 years ago
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 years ago
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 years ago
Ruo-Ping Dong c47ffc20 Rename saver 4 years ago
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 years ago
GitHub e471bd8b Refactoring of the tests folder for the trainers (#4510) 4 years ago
GitHub cb8e4d25 Add ActionSpec (#4586) 4 years ago