92 commits (930d6fa3-8c27-4a26-bce1-48235ad7b78b)

Author SHA1 Message Commit date
GitHub 9178b5d2 Improve test_simple.py and check discrete actions (#2345) 5 years ago
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 years ago
GitHub 30930383 Move trainer initialization into a utility function (#2412) 5 years ago
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 years ago
GitHub 6f67cf40 unit test - don't use global random generator (#2521) 5 years ago
GitHub 9e2c30ee Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520) 5 years ago
GitHub 0390c78b Fix determinism in unit test (#2530) 5 years ago
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 years ago
GitHub babe9e2f Develop remove academy done (#2519) 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
GitHub 2f74b3cc Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646) 5 years ago
GitHub 39f280d6 Develop spawn brains (#2676) 5 years ago
GitHub d39b1881 speed up unit test (#2847) 5 years ago
GitHub ccb7eab4 Remove {text,custom} {action,observations} (#2839) 5 years ago
GitHub e6f549dc [MLA-12] update protobuf for vector observations (#2862) 5 years ago
GitHub 69d1a033 Develop remove past action communication (#2913) 5 years ago
GitHub 8ec5ab62 Develop side channels: migrate reset parameters (#2990) 5 years ago
GitHub a6df9f43 Develop new ll api (#3022) 5 years ago
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 years ago
GitHub 8ca0d810 Better error handling if trainer config doesn't contain "default" section (#3063) 5 years ago
GitHub 2c3794a6 handle mismatch between brain and metacurriculum (#3034) 5 years ago
Chris Elion fdc810ff move (first pass) 5 years ago
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 years ago
GitHub 7fbf6b1d add flake8-bugbear (#3137) 5 years ago
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 years ago
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 years ago
GitHub 14193ada Self-play for symmetric games (#3194) 5 years ago
Ervin Teng d680ed32 Fix metacurriculum test (for good) 5 years ago
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 years ago
GitHub f469cbb0 Simple1DEnv refactor and additional ghost trainer tests (#3537) 5 years ago
GitHub 323f104c [tests] LSTM end-to-end tests (#3544) 5 years ago
Andrew Cohen 0cc2956d write to proto 5 years ago
GitHub bcce774f [tests] Visual observation tests (#3549) 5 years ago
GitHub 213d2466 [bug-fix] Change Simple1DEnvironment to spawn new agent IDs on reset (#3558) 5 years ago
GitHub b6e3fd67 [tests] Add additional unit tests (#3581) 5 years ago
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 5 years ago
Andrew Cohen e7836fb5 record demos 1d env 5 years ago
Andrew Cohen 7aaf1fb6 gail and bc tests 5 years ago
Andrew Cohen f1eeed9c success threshold to .9 for imitation 5 years ago
Andrew Cohen f6d6e3d0 reccurent gail tests 5 years ago
GitHub 25cc9f15 [change] Move hyperparameter printing entirely into StatsWriters (#3630) 5 years ago
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 years ago
GitHub 2912c883 Basic and visual GAIL and BC integration tests (#3626) 5 years ago
Andrew Cohen 79076b70 ELO calculation done in ghost controller 5 years ago
GitHub 29f82921 [bug-fix] Improve performance for PPO with continuous actions (#3662) 5 years ago
Ervin Teng ee27e2cc Fix tests 5 years ago
Andrew Cohen c4e54218 replaced ghost_swap with team_change in tests 5 years ago
GitHub 104f2c46 [tests] Add tests for multiple actions/action branches (#3672) 5 years ago
GitHub 56b75555 [tests] Make end-to-end tests more stable (#3697) 5 years ago
GitHub 141831da [bug-fix] Fix entropy computation for GaussianDistribution (#3684) 5 years ago
Andrew Cohen 93d344ff simple rl asymm ghost tests 5 years ago
Andrew Cohen 345fa382 current_best_ratio -> latest_model_ratio 5 years ago
Andrew Cohen 62c87031 Merge branch 'master' into self-play-mutex 5 years ago
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 5 years ago
Andrew Cohen 7006b5ff asymm ghost test consistent 5 years ago
Ervin Teng 971e4b2d Don't block when disabling threading 5 years ago
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 years ago
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 years ago
Ervin Teng 51e76f00 Adjust SAC recurrent 5 years ago
Ervin Teng 9fe104d6 Make threading disable-able per trainer 5 years ago
Ervin Teng 92158d54 Remove threaded from trainer_controller 5 years ago
Ervin Teng 23039746 Disable threading for all simple_rl tests 5 years ago
GitHub 1536b9f2 Increasing steps on asymmetric ghost test (#3802) 5 years ago
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 5 years ago
GitHub 7e5513a4 [bug-fix] Increase buffer size for SAC tests (#3813) 5 years ago
vincentpierre cad57a00 [skip ci] Added some tests but they do not pass (too hard) 5 years ago
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 5 years ago
GitHub ea0c6fa0 [WIP] Side Channel Design Changes (#3807) 5 years ago
GitHub 7b78ffeb support newer versions of tensorflow (2.1+) (#3830) 5 years ago
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 years ago
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 5 years ago
vincentpierre 67027af3 Removed the failing gym tests 5 years ago
GitHub c5b94ca6 Use LR schedule for beta and epsilon (#3940) 5 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
vincentpierre 6ddfe74f Merge branch 'master' into develop-gym-wrapper 5 years ago
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 years ago
GitHub 335cff3e [versioning] Save ML-Agents version in checkpoints and check on load (#4035) 5 years ago
GitHub 21fe203e [tests] Increase buffer_init_steps for recurrent sac test (#4051) 5 years ago
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 5 years ago
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 years ago
Andrew Cohen f76780f1 fix tests 4 years ago
GitHub 5b0a5b9b Moving domain randomization to C# (#4065) 4 years ago
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 years ago
vincentpierre 599d7e9f Merging master 4 years ago
vincentpierre d031c7a9 Merging master 4 years ago
GitHub 8eefdcd3 Refactor of Curriculum and parameter sampling (#4160) 4 years ago
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 years ago
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 years ago
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 years ago
Andrew Cohen af7d3800 add test_simple_rl tests to torch 4 years ago
GitHub 2332bc32 Add fire to test_simple_rl.py (#4378) 4 years ago
GitHub bf6506fc [feature] Add small CNN for grids 5x5 and up (#4434) 4 years ago