976 次代码提交 (19ee6e5e-2a16-4754-b9af-e4faeb2818da)

作者 SHA1 备注 提交日期
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 年前
Deric Pang cdb41480 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
GitHub 3900ed66 Merge pull request #1083 from Unity-Technologies/develop-flat-code-restructure 6 年前
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 年前
GitHub f8df71a0 Revert "Release v0.5 (Develop) (#1203)" (#1222) 6 年前
GitHub 29084e77 Curriculum learning reward thresholding bug fix (#1141) 6 年前
GitHub a54714f8 Update API to version 5 (#1179) 6 年前
GitHub 560f1bd7 Merge pull request #1224 from Unity-Technologies/release-v0.5 6 年前
GitHub d2c320dd Remove graph scope (#1205) 6 年前
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 年前
GitHub 2b6b4570 Fix the Python Tests (#1327) 6 年前
GitHub 547f0e98 Merge pull request #1361 from Unity-Technologies/release-v0.6 6 年前
vincentpierre 99aaa15e made the pytest directory agnostic 6 年前
GitHub b946047a Merge pull request #1470 from Unity-Technologies/release-v0.6-make-test-directory-agnostic 6 年前
GitHub c8cc5a29 Merge pull request #1495 from Unity-Technologies/release-v0.6 6 年前
GitHub a196dde2 Merge pull request #1494 from Unity-Technologies/release-v0.6 6 年前
GitHub 517e3a0a Remove env creation logic from TrainerController (#1562) 6 年前
Vincent-Pierre BERGES 4a6ae4e0 Barracuda integration into ML-Agents (#1557) 6 年前
GitHub cd087609 added the pypiwin32 package (#1668) 6 年前
GitHub c258b1c3 Move 'take_action' into Policy class (#1669) 6 年前
GitHub 275ff5d6 Merge pull request #1764 from Unity-Technologies/release-v0.7 6 年前
Ervin T b30f4c90 Split `mlagents` into two packages (#1812) 6 年前
eshvk cc9bdf17 Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return 6 年前
eshvk fb04c40c Reorganize to make metrics collection more accurate 6 年前
GitHub a0b44f1b Merge pull request #1858 from Unity-Technologies/develop-esh-metrics 6 年前
GitHub 93760bc4 Adds SubprocessUnityEnvironment for parallel envs (#1751) 6 年前
Jonathan Harper 7a0d1531 Fix subprocess model saving on Windows 6 年前
Jonathan Harper e91e847c Fix '--slow' flag after environment updates 6 年前
GitHub c613df3a Merge pull request #1922 from Unity-Technologies/release-v08-slowflag 6 年前
GitHub 2d1bda57 Merge pull request #1931 from Unity-Technologies/release-v0.8 6 年前
GitHub ba57eaad Merge pull request #1932 from Unity-Technologies/release-v0.8 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub 70d14910 Merge pull request #1934 from Unity-Technologies/develop-black 6 年前
Jonathan Harper d9a7e5b6 Fix failure on Academy Done() with parallel envs 5 年前
GitHub e916dc48 use yaml.safe_load instead of yaml.load (#2124) 5 年前
GitHub d5f6b7f8 Merge pull request #2157 from Unity-Technologies/release-v0.8.2 5 年前
GitHub 2671e1a0 Enable mypy in precommit checks (#2177) 5 年前
GitHub 40c7fc48 Merge branch 'develop' into protobuf_update 5 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
Jonathan Harper 177ee5b8 Remove unused "last reward" logic, TF nodes 5 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
GitHub 966d8efb Remove "external_brains" arg for TrainerController (#2213) 5 年前
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
GitHub a5b7cf95 Fix get_value_estimate and buffer append (#2276) 5 年前
Chris Elion 5d07ca1f Merge remote-tracking branch 'origin/develop' into enable-flake8 5 年前
GitHub be4292fb Add different types of visual encoder (nature cnn/resnet) 5 年前
GitHub 19283bfa Very simple environment for testing (#2266) 5 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 5 年前
GitHub 9eb3f049 Cleanup unused code in TrainerController (#2315) 5 年前
GitHub 6225317d refactor vis_encoder_type and add to doc 5 年前
Ervin T a46f3faa Enable generalization training (#2232) 5 年前
GitHub 9178b5d2 Improve test_simple.py and check discrete actions (#2345) 5 年前
GitHub 78c0c202 fix mock_brain (#2377) 5 年前
GitHub 53475207 Merge pull request #2380 from Unity-Technologies/release-0.9.0 5 年前
GitHub b498c19d Fix BCTrainer increment_steps (#2384) 5 年前
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub 30930383 Move trainer initialization into a utility function (#2412) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
GitHub afb6ede5 Merge pull request #2393 from Unity-Technologies/hotfix-v0.9.0a 5 年前
Ervin Teng 072d2ef8 Merge latest develop 5 年前
GitHub 4472838e Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1 5 年前
GitHub bd7eb286 Update reward signals in parallel with policy (#2362) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub 43696d60 Fix bug in add_rewards_output and add test (#2442) 5 年前
GitHub 0a163871 Merge pull request #2469 from Unity-Technologies/release-0.9.2 5 年前
GitHub b73fa378 Add more extensive tests for BC trainer (#2506) 5 年前
GitHub dc3ab81a Merge pull request #2514 from Unity-Technologies/hotfix-0.9.3 5 年前
Ervin Teng e0da93d1 Fix bug with construct_curr_info and test 5 年前
Ervin Teng aca81efb Add more tests 5 年前
Ervin Teng 28ef8983 Add 2 visual obs test 5 年前
GitHub 4bb97e25 Fix bug with construct_curr_info (#2490) 5 年前
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 年前
Jonathan Harper 2f083c8a Renamed "StepInfo" to "EnvironmentStep" 5 年前
GitHub 7ec3d7ad Merge pull request #2516 from Unity-Technologies/master 5 年前
GitHub 6f67cf40 unit test - don't use global random generator (#2521) 5 年前
GitHub 9e2c30ee Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520) 5 年前
GitHub 0390c78b Fix determinism in unit test (#2530) 5 年前
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 年前
GitHub babe9e2f Develop remove academy done (#2519) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub 0d48a352 Use argparse for arg parsing (#2586) 5 年前
GitHub d64a01e1 Added option to use environment arguments in learn (#2594) 5 年前
GitHub 149ebd67 Fix crash with VAIL + GAIL (#2598) 5 年前
GitHub 473a8758 Develop yaml json loading errors (#2601) 5 年前
GitHub 2f74b3cc Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646) 5 年前
GitHub b2fa2268 Merge pull request #2648 from Unity-Technologies/release-0.10.0 5 年前
GitHub 8e931d8d Merge branch 'develop' into release-0.10.0 5 年前
Ervin Teng 094cbe4d Fix bug when batch size is a non-multiple of sequence length (#2661) 5 年前
Anupam Bhatnagar cc208c00 resolving conflicts 5 年前
Ervin Teng e826f4bb Bugfix for LSTM+BC (#2679) 5 年前
GitHub 68965c7b Use a class for camera res, not dict (#2656) 5 年前
Ervin Teng df44ee8d Fix crash in trainer tests (trainer_metrics) 5 年前
GitHub 5f5ccfa0 Feature Deprecation : Online Behavioral Cloning (#2659) 5 年前
GitHub b2a2047e Fix bug when batch size is a non-multiple of sequence length (#2661) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
Ervin Teng bd5b3c7d Revert "Fix crash in trainer tests (trainer_metrics)" 5 年前
GitHub 24ba9d58 Develop deprecate broadcasting (#2669) 5 年前
GitHub f22c41db Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 5 年前
GitHub e6240c7a Bugfix for LSTM+BC (#2679) 5 年前
Anupam Bhatnagar b733b34c resolving conflicts 5 年前
Chris Elion a1967c19 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 39f280d6 Develop spawn brains (#2676) 5 年前
Chris Elion 254c7d86 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 5d3e05d1 Fix "memory leak" during inference (#2722) 5 年前
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 年前
GitHub 0892ef2c [WIP] ISensor interface and use for visual observations (#2731) 5 年前
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
GitHub 495873e5 Merge pull request #2833 from Unity-Technologies/release-0.11.0 5 年前
Chris Elion 691d21e6 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Jonathan Harper 8550679d Merge branch 'develop' into release-0.11.0 5 年前
GitHub d39b1881 speed up unit test (#2847) 5 年前
GitHub ccb7eab4 Remove {text,custom} {action,observations} (#2839) 5 年前
GitHub e6f549dc [MLA-12] update protobuf for vector observations (#2862) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 年前
Ervin Teng 987e0e3a Merge tf2 branch 5 年前
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 年前
Andrew Cohen b11f04ea Fixed test code by creating brain_name variable instead of hardcoding 5 年前
GitHub c0453ae1 Merge pull request #2912 from Unity-Technologies/develop-allbraininfo 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng 54644477 Merge branch 'develop' of github.com:Unity-Technologies/ml-agents into develop-nomaxstep-test 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
GitHub a2194ea7 Fix batch size issue with BC (#2965) 5 年前
GitHub 2c7e6d51 Fix bug where constant LR in pretraining will throw TF error (#2977) 5 年前
Ervin Teng 9053610f Fix buffer tests and truncate 5 年前
GitHub b5eb34dc Fix batch size issue with BC (#2965) (#2966) 5 年前
Ervin Teng 29cdf77a Fix RL tests 5 年前
Ervin Teng a80b47d1 Fix demo loader and remaining tests 5 年前
Ervin Teng 3a4fa244 Switch to tanh squash in PPO 5 年前
GitHub b1dc1015 Fix bug where constant LR in pretraining will throw TF error (#2978) 5 年前
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 年前
Ervin Teng 73000a6b Merge branch 'develop' into develop-splitbuffer 5 年前
GitHub d4780a55 Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 年前
GitHub 8ec5ab62 Develop side channels: migrate reset parameters (#2990) 5 年前
GitHub 35c995e9 Merge pull request #3038 from Unity-Technologies/develop 5 年前
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 年前
GitHub 3b4b0d55 Remove random normal epsilon (#3039) 5 年前
GitHub e7bf6fff Close environment if step raises an exception. (#3043) 5 年前
GitHub a6df9f43 Develop new ll api (#3022) 5 年前
Ervin Teng 88b1123a Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor 5 年前
Andrew Cohen ef2dfd4c adjusting tests to expect trainer.add_policy to be called 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 年前
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 年前
GitHub 90db165f Add --namespace-packages to mypy for mlagents (#3075) 5 年前
GitHub 1fa07edb Remove Standalone Offline BC Training (#2969) 5 年前
GitHub 8ca0d810 Better error handling if trainer config doesn't contain "default" section (#3063) 5 年前
Ervin Teng 62d609f8 Fix some of the tests 5 年前
GitHub 2c3794a6 handle mismatch between brain and metacurriculum (#3034) 5 年前
Ervin Teng 3449b551 Add test for trajectory 5 年前
Chris Elion fdc810ff move (first pass) 5 年前
Ervin Teng 38ff674e Fix BC and tests 5 年前
GitHub 58b6c7c2 Rename mlagents.envs to mlagents_envs (#3083) 5 年前
Chris Elion f5e6b0ed more cleanup 5 年前
Ervin Teng 27c2a55b Lots of test fixes 5 年前
Ervin Teng 97d66e71 Remove BootstrapExperience 5 年前
Ervin Teng 324d217b Move agent_id to Trajectory 5 年前
Jonathan Harper 9f166f9e Update tests to support pytest 5.x 5 年前
Ervin Teng 43c0acfb Fix test again 5 年前
Ervin Teng 83126bb2 Fix PPO value tests 5 年前
GitHub 9f522176 Merge pull request #3097 from Unity-Technologies/develop-pytest5 5 年前
Andrew Cohen 70357569 adjusting tests to expect trainer.add_policy to be called 5 年前
Ervin Teng 77aea4cd Fix np float32 errors 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub 3de3c1f1 check min size for visual encoders (#3112) 5 年前
Andrew Cohen de902fbb passes all pytest and C# tests 5 年前
Ervin Teng 47f8fa7a Fix some import errors 5 年前
GitHub 2ac242f7 Remove TrainerMetrics and add CSVWriter using new StatsWriter API (#3108) 5 年前
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
Andrew Cohen 082789ea Merge branch 'master' into develop-magic-string 5 年前
Ervin Teng 1bd791e5 Merge branch 'master' into develop-agentprocessor 5 年前
Ervin Teng abc8ca9a Fix tests 5 年前
GitHub 7fbf6b1d add flake8-bugbear (#3137) 5 年前
GitHub 0d56f6ba Merge branch 'master' into develop-magic-string 5 年前
Andrew Cohen b28b3835 fixed default trainer_util test to expect brain_name 5 年前
Andrew Cohen 654b0c79 Merge branch 'master' into develop-magic-string 5 年前
GitHub c6152459 Allow curricula to be created without files (#3145) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Ervin Teng db743971 Move private methods out of trainer, simplify interface 5 年前
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 年前
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 年前
Ervin Teng 48793ec1 Fix test 5 年前
Ervin Teng 3d25f9d2 Merge branch 'master' into develop-agentprocessor 5 年前
Jonathan Harper 481e0842 Remove the --num-runs option 5 年前
GitHub 29c91b14 update flake8 plugin version and fix warnings (#3180) 5 年前
Ervin Teng ce75b378 update flake8 plugin version and fix warnings (#3180) 5 年前
GitHub d985dded Merge branch 'master' into merge-release-0.13.0 5 年前
Ervin Teng c48ddcf2 Fix pre-commit error 5 年前
GitHub b0a2a54f Add 'run-experiment' script, simpler curriculum config (#3186) 5 年前
Yuan Gao 0817c44b Moved the demo files 5 年前
GitHub b3d3a9d6 Merge pull request #3202 from Unity-Technologies/develop-move-demo 5 年前
Ervin Teng 98ed88b1 Merge branch 'master' into develop-separatevalue 5 年前
GitHub 4c241a80 Only send previous action and current BrainInfo (#3187) 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
GitHub a64e7850 Fix issue with BatchedStepResult with no agents (#3240) 5 年前
Chris Elion 45e6e53c Refactor file logic in demo_loader and add unit tests. (#3241) 5 年前
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 5 年前
Ervin Teng e83276f6 Fix PPO test 5 年前
GitHub d52fb483 Merge pull request #3264 from Unity-Technologies/hotfix-0.13.1 5 年前
GitHub ca96b293 Move advance() logic for environment manager out of trainer_controller (#3234) 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
GitHub 329b23e0 Fix extra summary being written when loading from checkpoint (#3272) 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前
GitHub 14193ada Self-play for symmetric games (#3194) 5 年前
GitHub 0ff8f9af Create ML-Agents Package (#3267) 5 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng cfc2f455 Fix BC and tests 5 年前
Ervin Teng 78671383 Move initialization call around 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
Ervin Teng aec5fcc0 Fix policy tests 5 年前
Ervin Teng dc43b0c6 Add test for NN policy 5 年前
Ervin Teng d02bfbd4 Remove PPO policy tests 5 年前
Ervin Teng 1c4f60d4 remove more PPO tests 5 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 年前
Ervin Teng 7b0f700b Add test for deletion calls 5 年前
Ervin Teng f64bdc4b Fix SAC RNN test 5 年前
GitHub 3939ca52 Change AgentProcessor logic to fix memory leak (#3383) 5 年前
Ervin Teng 00017bab Temporarily remove multi-GPU 5 年前
Ervin Teng faa9c702 Fix one more test for multi_gpu 5 年前
Ervin Teng c68b5643 Remove multi_gpu from learn test 5 年前
GitHub 1f9d04f2 Fix clear update buffer when trainer stops training, add test (#3422) 5 年前
GitHub f20a27e0 Clear agent processor properly on episode reset (#3437) 5 年前
Anupam Bhatnagar c70d0243 [bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451) 5 年前
GitHub 9a4b151c Merge pull request #3441 from Unity-Technologies/master-into-release-0.14.0 5 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
GitHub 6876a1d6 [bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451) 5 年前
GitHub 423e8d80 Update the test demo (#3466) 5 年前
GitHub be14dd42 Make the timer output format consistent (#3472) 5 年前
Andrew Cohen e4d776c3 Merge branch 'master' into soccer-fives 5 年前
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
GitHub 472f9f0e Merge branch 'master' into develop-badEnvReturnCode 5 年前
Alphonso Crawford 802593a2 Adding test for bad env_path on create_environment_factory 5 年前
Alphonso Crawford 26d44958 Update test_bad_env_path 5 年前
Alphonso Crawford 1a7f9ad0 change test_learn.py 5 年前
Ervin Teng 847725f1 extend meta curriculum test steps 5 年前
GitHub 24145c22 Merge pull request #3438 from Unity-Technologies/develop-badEnvReturnCode 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
Andrew Cohen 5b0aca29 Merge branch 'master' into soccer-fives 5 年前
Ervin Teng 14f2a7f2 Rename LearningModel to ModelUtils 5 年前
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 5 年前
Ervin Teng d57124b4 Merge 'master' into develop-removeactionholder 5 年前
Ervin Teng d680ed32 Fix metacurriculum test (for good) 5 年前
Anupam Bhatnagar e04fcd71 Merge branch 'master' into master-into-release-0.14.1 5 年前
Ervin Teng d10d27e2 Merge commit '9450d3fc0dda4547a14c5ed1b7e13fc6e3a15413' into develop-nopreviousactions 5 年前
GitHub 30a196eb Fix metacurriculum test (for good) (#3511) 5 年前
Andrew Cohen de73baa9 Merge branch 'master' into soccer-fives 5 年前
GitHub b2cc1c25 [bug-fix] Fix continuous LSTMs and add test (#3521) 5 年前
GitHub 7d954797 [change] Separate action outputs into OutputDistributions object (#3514) 5 年前
GitHub f469cbb0 Simple1DEnv refactor and additional ghost trainer tests (#3537) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub a6bf50db Revert obs to goal in simple 1d test (#3540) 5 年前
GitHub 323f104c [tests] LSTM end-to-end tests (#3544) 5 年前
GitHub 870338b4 [bug-fix] Fix issue with more than one continuous actions (#3547) 5 年前
Andrew Cohen 573b1f6d Merge branch 'master' into soccer-fives 5 年前
Andrew Cohen 0cc2956d write to proto 5 年前
GitHub bcce774f [tests] Visual observation tests (#3549) 5 年前
GitHub 213d2466 [bug-fix] Change Simple1DEnvironment to spawn new agent IDs on reset (#3558) 5 年前
Jason Bowman c3b15492 Modify demo loader to support gzip comression and reduce memory usage by seeking for individual reads 5 年前
GitHub b6e3fd67 [tests] Add additional unit tests (#3581) 5 年前
GitHub ffd8f855 [bug-fix] Fix crash when demo size is smaller than batch size (#3591) 5 年前
Chris Elion 7f2e815a Merge remote-tracking branch 'origin/master' into develop-sidechannel-usability 5 年前
Chris Elion fa5e7e6d Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public 5 年前
GitHub ed2eb6ef [bug-fix] Fix entropy computation in MultiCategorialDistribution (#3607) 5 年前
GitHub 873ba7fd [bug-fix] Fix stats reporting for reward signals in SAC (#3606) 5 年前
GitHub c42a11c3 [change] Throw a proper error when sequence length is greater than batch size. (#3583) 5 年前
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 5 年前
Andrew Cohen e7836fb5 record demos 1d env 5 年前
Ervin Teng 98d5b8e3 Add test 5 年前
Andrew Cohen 7aaf1fb6 gail and bc tests 5 年前
Ervin Teng 6b578de4 Merge branch 'develop-refactorprint' into develop-progress-bar 5 年前
Andrew Cohen f1eeed9c success threshold to .9 for imitation 5 年前
Andrew Cohen f6d6e3d0 reccurent gail tests 5 年前
GitHub 320175d5 [change] Move console printing to StatsWriter class (#3616) 5 年前
GitHub 25cc9f15 [change] Move hyperparameter printing entirely into StatsWriters (#3630) 5 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 年前
GitHub 2912c883 Basic and visual GAIL and BC integration tests (#3626) 5 年前
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 5 年前
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 5 年前
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 年前
Andrew Cohen eefc4811 Merge branch 'master' into self-play-mutex 5 年前
GitHub ceaac645 [tests] Make subprocess manager test easier (#3651) 5 年前
Andrew Cohen 79076b70 ELO calculation done in ghost controller 5 年前
Andrew Cohen 579bbd88 passing all tests locally 5 年前
GitHub 29f82921 [bug-fix] Improve performance for PPO with continuous actions (#3662) 5 年前
Andrew Cohen fb993986 Merge branch 'master' into self-play-mutex 5 年前
Ervin Teng ee27e2cc Fix tests 5 年前
Andrew Cohen b42c9482 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Andrew Cohen c4e54218 replaced ghost_swap with team_change in tests 5 年前
Andrew Cohen d9cdb582 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub 104f2c46 [tests] Add tests for multiple actions/action branches (#3672) 5 年前
Ervin Teng e4d1df01 Fix TC test 5 年前
GitHub de3fc4e8 Hotfix memory leak on Python (#3664) 5 年前
GitHub 11c518a3 Stats SideChannel (for custom TensorBoard metrics) (#3660) 5 年前
GitHub 458e68f1 Remove "docker target" feature (#3687) 5 年前
GitHub 807a1441 Raise exceptions from environment subprocesses (#3680) 5 年前
Andrew Cohen 7219f60b fixed tests that expected old hyperparam team-change 5 年前
GitHub 56b75555 [tests] Make end-to-end tests more stable (#3697) 5 年前
Andrew Cohen 650ec121 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub 141831da [bug-fix] Fix entropy computation for GaussianDistribution (#3684) 5 年前
Andrew Cohen 4c9ac553 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 93d344ff simple rl asymm ghost tests 5 年前
Andrew Cohen a7a372b9 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen cd677346 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Andrew Cohen 345fa382 current_best_ratio -> latest_model_ratio 5 年前
Andrew Cohen c7a34413 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub bc1fdf07 [refactor] CLI changes (#3705) 5 年前
Andrew Cohen 837886e1 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 6ade2ddc Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Andrew Cohen 62c87031 Merge branch 'master' into self-play-mutex 5 年前
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 5 年前
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 5 年前
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 年前
GitHub d7ca6b8d [feature] Add --initialize-from option (#3710) 5 年前
Andrew Cohen 3013774b alternative to internal-policy fix 5 年前
Andrew Cohen d1bee64b fixed test_ghost and test_ppo 5 年前
Andrew Cohen 1b9c643b Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 7006b5ff asymm ghost test consistent 5 年前
Andrew Cohen 0af2a651 fixed test_sac 5 年前
Ervin Teng 971e4b2d Don't block when disabling threading 5 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 年前
Andrew Cohen 09a53bb8 make reward threshold consistent across ghosts tests 5 年前
Andrew Cohen a870d453 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub b841c9ab Wrapped trainer has internal policy in GhostTrainer 5 年前
Andrew Cohen 7a7eb324 Merge branch 'master' into internal-policy-ghost 5 年前
Ervin Teng 441fbb91 Fix subprocess test 5 年前
GitHub 55b26417 check demonstration version before loading (#3745) 5 年前
Ervin Teng f29b17a9 Don't block one policy queue 5 年前
Ervin Teng 99ce4b59 Improve tests 5 年前
GitHub aae58330 Merge branch 'master' into develop-add-inference-examples 5 年前
Ervin Teng d2d88b6a Fix env_manager test 5 年前
Andrew Cohen 933d7b32 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 年前
Ervin Teng 51e76f00 Adjust SAC recurrent 5 年前
Ervin Teng 46d83839 Adjust subprocessor test 5 年前
Andrew Cohen f41695b9 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 4 年前
Andrew Cohen ad6ba833 Merge branch 'internal-policy-ghost' into soccer-2v1 5 年前
Ervin Teng 370b3c40 Fix subprocess env manager test 4 年前
Andrew Cohen 1c2005a8 Merge branch 'internal-policy-ghost' into soccer-2v1 5 年前
Andrew Cohen 80469267 Merge branch 'internal-policy-ghost' into soccer-2v1 5 年前
Ervin Teng 9fe104d6 Make threading disable-able per trainer 4 年前
Andrew Cohen 9ae19e9d Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 89db8428 Merge branch 'internal-policy-ghost-alternate' into soccer-2v1 5 年前
Ervin Teng 92158d54 Remove threaded from trainer_controller 4 年前
Andrew Cohen 4468280a Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 26c0033c Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen cde8360e update tests 5 年前
Ervin Teng 23039746 Disable threading for all simple_rl tests 4 年前
Andrew Cohen cb83a467 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 9bec75ee Merge branch 'master' into soccer-2v1 5 年前
GitHub 1536b9f2 Increasing steps on asymmetric ghost test (#3802) 4 年前
Arthur Juliani 3769d943 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 4 年前
Ervin Teng 9cd2c034 Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-sac-apex 4 年前
GitHub 7e5513a4 [bug-fix] Increase buffer size for SAC tests (#3813) 4 年前
vincentpierre cad57a00 [skip ci] Added some tests but they do not pass (too hard) 4 年前
Andrew Cohen 185d4b35 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Arthur Juliani 3c82bf59 Training runs, but doesn’t actually work 4 年前
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 4 年前
Andrew Cohen b217f8bf Merge branch 'master' into soccer-2v1 5 年前
Andrew Cohen b4f52c88 Merge branch 'soccer-2v1' into asymm-envs 5 年前
GitHub ea0c6fa0 [WIP] Side Channel Design Changes (#3807) 4 年前
GitHub 7b78ffeb support newer versions of tensorflow (2.1+) (#3830) 4 年前
GitHub 4092d937 [Bug fix] Hard reset when team changes (#3870) 4 年前
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 4 年前
GitHub f86fc81d [refactor] Move configuration files to single YAML file (#3791) 4 年前
GitHub 7e0032f5 [refactor] Allow full RunOptions to be specified in trainer configuration YAML (#3815) 4 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 4 年前
GitHub d8b93f8f [Bug fix] Hard reset when team changes (#3870) (#3899) 4 年前
Chris Elion 68b68396 Merge remote-tracking branch 'origin/master' into release_1_to_master 4 年前
GitHub d2bc86c8 Release 2 cherry pick (#3971) 4 年前
GitHub 4641038e Renaming max_step to interrupted in TermialStep(s) (#3908) 4 年前
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 4 年前
vincentpierre 67027af3 Removed the failing gym tests 4 年前
Andrew Cohen a2f8319a Merge branch 'master' into asymm-envs 4 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
Andrew Cohen 0ec2a890 Merge branch 'master' into asymm-envs 4 年前
GitHub c5b94ca6 Use LR schedule for beta and epsilon (#3940) 4 年前
Arthur Juliani 2b3a6347 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
GitHub 812983c0 Some improvements to the UnityEnvironment class (#3939) 4 年前
GitHub c6ed3789 Replaced get_behavior_names and get_behavior_spec with behavior_specs property (#3946) 4 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub f7373172 Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 4 年前
GitHub abbc6424 [bug-fix] Fix issue with initialize not resetting step count (#3962) 4 年前
vincentpierre 6ddfe74f Merge branch 'master' into develop-gym-wrapper 4 年前
Arthur Juliani 28e095e0 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
Ruo-Ping Dong 2ca79207 [bug-fix] Don't load non-wrapped policy (#4593) 4 年前
Andrew Cohen 59a60c1e Merge branch 'master' into asymm-envs 4 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 年前
GitHub 335cff3e [versioning] Save ML-Agents version in checkpoints and check on load (#4035) 4 年前
GitHub a7323393 [bug-fix] Fix issue with SAC updating too much on resume (#4038) 4 年前
GitHub 21fe203e [tests] Increase buffer_init_steps for recurrent sac test (#4051) 4 年前
GitHub f5435876 [refactor] Store and restore state along with checkpoints (#4025) 4 年前
Andrew Cohen e7750fc9 Merge branch 'master' into develop-sampler-refactor 4 年前
GitHub ee1098d1 [refactor] Improve config upgrade script and add test (#4056) 4 年前
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 年前
Andrew Cohen fa5dae1a tests for settings 4 年前
Andrew Cohen 22786526 Merge branch 'master' into asymm-envs 4 年前
Andrew Cohen c0f7052b Merge branch 'master' into develop-sampler-refactor 4 年前
GitHub 09c7787c [bug-fix] Fix regression in --initialize-from feature (#4086) 4 年前
Andrew Cohen 34ecc7e6 Merge branch 'master' into asymm-envs 4 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 年前
GitHub 8a49e8e0 [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) 4 年前
GitHub 8fb66c2d [bug-fix] Fix issue where curriculum was advancing too early (#4107) 4 年前
Andrew Cohen f76780f1 fix tests 4 年前
GitHub fefbc038 Merge pull request #4109 from Unity-Technologies/release_3_merge_master 4 年前
Andrew Cohen 6554ccb7 Merge branch 'master' into asymm-envs 4 年前
GitHub 5b0a5b9b Moving domain randomization to C# (#4065) 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
Jonathan Harper 80127232 Convert checkpoints to .nn format 4 年前
Ervin Teng 2b0c0163 Add settings test 4 年前
GitHub bb675bf4 Merge pull request #4134 from Unity-Technologies/develop-removebrainnamepolicy 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 69579611 [refactor] Refactor Actor and Critic classes (#4287) 4 年前
GitHub 17f03980 [bug-fix] Fix non-LSTM SeparateActorCritic (#4306) 4 年前
GitHub 5bcbef8d [tests] Add tests for core PyTorch files (#4292) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
Andrew Cohen f74d301a Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
GitHub b4749b31 Test fixes on add-fire (#4317) 4 年前
Ervin Teng a172fb46 Halve entropy 4 年前
Ervin Teng b2872adf Merge branch 'develop-add-fire' into develop-add-fire-halfentropy 4 年前
Andrew Cohen 6df8d32c Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
GitHub 69d29b86 [add-fire] Halve Gaussian entropy (#4319) 4 年前
Ervin Teng 50a7e952 Fix utils test 4 年前
vincentpierre 599d7e9f Merging master 4 年前
GitHub 3a982317 [add-fire] Add learning rate and beta/epsilon decay to PyTorch (#4318) 4 年前
vincentpierre d031c7a9 Merging master 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
Andrew Cohen bf8b2328 Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
HH 7afa1761 Merge branch 'master' into hh/develop/ragdoll-updates 4 年前
GitHub 36613cad [add-fire] Fix CategoricalDistInstance test and replace `range` with `arange` (#4327) 4 年前
Ervin Teng 6b29a4c9 Fix test and replace range with arange 4 年前
GitHub 6b193d03 Develop add fire layers (#4321) 4 年前
GitHub 3de1e660 [bug-fix] Initialize-from being incorrectly loaded as "None" rather than None (#4175) 4 年前
Ervin Teng 5bf72236 Fix util test 4 年前
Ervin Teng cded4c6c Fix SeparateActorCritic and add test 4 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
GitHub 0e0daf47 [add-fire] Merge post-0.19.0 master into add-fire (#4328) 4 年前
GitHub 9d2e4268 Revert "[add-fire] Merge post-0.19.0 master into add-fire (#4328)" (#4330) 4 年前
Ruo-Ping Dong 79d89158 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
Ruo-Ping Dong e06812aa fix tests 4 年前
HH a1f2748e Merge branch 'master' into hh/develop/crawler-ragdoll-updates 4 年前
GitHub 839eb2cb Develop model transfer test (#4214) 4 年前
yanchaosun 7e3216ae simple env test 4 年前
yanchaosun cdaaa318 bisim 4 年前
yanchaosun 3d0d359c bisimulation draft 4 年前
yanchaosun 1fdbfe65 no normalization 4 年前
yanchaosun 5a778ca3 fix normalization 4 年前
GitHub 8eefdcd3 Refactor of Curriculum and parameter sampling (#4160) 4 年前
yanchaosun a212fef9 new bisim implementation 4 年前
GitHub e318f96a don't allow --num-envs >1 with no --env (#4203) 4 年前
GitHub 09c63636 [MLA-1145] don't allow --num-envs >1 with no --env (#4209) 4 年前
yanchaosun aca8cd58 update with new alternating 4 年前
HH 0fdac847 Merge branch 'master' into hh/develop/crawler-ragdoll-updates 4 年前
yanchaosun 0e2f6e19 small fix 4 年前
yanchaosun ec929746 minor update 4 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 年前
Andrew Cohen d0133066 working 4 年前
yanchaosun 9bc90956 fix bug with bisimulation 4 年前
Andrew Cohen b6bf1860 fix bisim metric 4 年前
Andrew Cohen 617aefc9 resolve conflict 4 年前
yanchaosun ce36349b some changes 4 年前
Andrew Cohen 1b17ae56 add tanh activ 4 年前
Arthur Juliani 6bee0fd1 Merge master 4 年前
yanchaosun caeffa3e add two envs 4 年前
yanchaosun 447124f1 new test 4 年前
Andrew Cohen 5fa28f5f merge YC changes 4 年前
yanchaosun 28355444 bisim fix, disable stop gradient 4 年前
Arthur Juliani c63b3d09 Fix lesson incrementing (#4279) 4 年前
Andrew Cohen dad084ee old crawler config 4 年前
yanchaosun 8fc18e5d plotting 4 年前
yanchaosun 3246570c added action encoder, and flags related with action training/transferring; set model_schedule as a changable hyperparameter 4 年前
GitHub 20f1386a Don't drop multiple stats from the same step (#4236) 4 年前
GitHub 9f041970 Develop bisim action encoder, incorporate related hyperparameter settings (#4253) 4 年前
yanchaosun fb5c33c1 test code 4 年前
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 年前
yanchaosun 696ec0cc new plots 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
yanchaosun 80bad241 init sac transfer, and added action encoder to bisim; configs for crawler 4 年前
yanchaosun f81feec4 config fix; basic sac 4 年前
yanchaosun a505cb16 new config 4 年前
yanchaosun 9a19f6e5 disable bisim 4 年前
yanchaosun b991096b update target encoder soft copy 4 年前
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 年前
GitHub ac36b31f [MLA-1172] Reduce calls to training_behaviors (#4259) 4 年前
yanchaosun b74294bf target encoders and new forward loss 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
GitHub d1bf56e9 Fix lesson incrementing (#4279) 4 年前
Arthur Juliani 1a123641 Merge remote-tracking branch 'origin/master' into r5-master 4 年前
GitHub 493793a6 [MLA-1233] Remove stats.CSVWriter (#4300) 4 年前
Andrew Cohen 4b094d25 large normalization obs unit test 4 年前
Ervin Teng dc937d5c Merge branch 'master' into develop-add-fire-mm 4 年前
GitHub 1e76f8d0 Merge pull request #4331 from Unity-Technologies/develop-add-fire-mm2 4 年前
Ervin Teng 4ebccf97 Merge branch 'develop-add-fire' into develop-add-fire-sac-lst 4 年前
Andrew Cohen 598826fe Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Andrew Cohen ae2c83e2 added torch bc tests 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
GitHub 6b255790 Behavioral Cloning Pytorch (#4293) 4 年前
Ruo-Ping Dong 523248be update 4 年前
GitHub 9dc1d99e Initialize normalizer with mean/variance from first trajectory (#4299) 4 年前
GitHub ab8e5afa Release 6 fix nan (#4343) 4 年前
GitHub f374f87a [add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324) 4 年前
Andrew Cohen c25ddc5d fix tests 4 年前
Andrew Cohen 0a7444f9 revert bc default batch/epoch 4 年前
Anupam Bhatnagar abc1220f Merge branch 'master' into global-variables 4 年前
Andrew Cohen 9f25f53b fix default bc test 4 年前
GitHub 705a0e0e Curriculum: If no behavior specified, do magic (#4346) 4 年前
Ervin Teng fe4472cb Add decoders, distributions, encoders, layers, networks, and utils 4 年前
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Ervin Teng 69bae3cc Add test for lstm layer 4 年前
Andrew Cohen 53185b7e fix tf bc default test 4 年前
Ruo-Ping Dong 59cc1a9f Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 409a161c fix bc tests 4 年前
Ervin Teng 89890bf2 Update with newest changes 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng 13f15086 Merge branch 'develop-add-fire' into develop-add-fire-amrl 4 年前
Ervin Teng d56e53bb Fix LSTM tests 4 年前
GitHub e3bc3352 [pytorch] Add decoders, distributions, encoders, layers, networks, and utils (#4349) 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
Ervin Teng a88d3581 Fix and test for masked_mean 4 年前
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 年前
GitHub bd6bcd2f Merge master and add Saver class for save/load checkpoints 4 年前
Ervin Teng d218bf4d Merge branch 'develop-add-fire' into develop-add-fire-sac-lst 4 年前
GitHub 6de31a03 [add-fire] Fix masked mean for 2d tensors (#4364) 4 年前
Ervin Teng 5c1717d1 Bugfixes for continuous case 4 年前
Ervin Teng 42e25b25 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
Ervin Teng 6e946dba Policy bugfixes and policy tests 4 年前
Christopher Goy 5a233353 Merge remote-tracking branch 'origin/master' into release_6-to-master 4 年前
GitHub 03eac72c [add-fire] Add tests and fix issues with Policy (#4372) 4 年前
Andrew Cohen a65d08c7 ghost trainer tests 4 年前
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 年前
Ervin Teng 020ce8ad Remove some unneeded stuff 4 年前
GitHub 6a1d993f [add-fire] Memory class abstraction (#4375) 4 年前
Andrew Cohen af7d3800 add test_simple_rl tests to torch 4 年前
Andrew Cohen 39bca7d2 fix tf ghost tests 4 年前
Ervin Teng 554ca0b9 Fix test typing 4 年前
GitHub 2332bc32 Add fire to test_simple_rl.py (#4378) 4 年前
Andrew Cohen d1c0f217 revert tests 4 年前
Ervin Teng b107a8d5 Fix network tests 4 年前
HH 2080c287 Merge branch 'master' into hh/develop/loco-crawler-variable-speed 4 年前
HH d4bd7fe6 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Ervin Teng a04e68a4 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
Ervin Teng d63aacd0 Cleanup, add test 4 年前
GitHub 0d0d2ead [add-fire] Revert unneeded changes back to master (#4389) 4 年前
Ervin Teng 987ea2d0 Revert unneeded changes back to master 4 年前
Ervin Teng 8ff8c401 Merge branch 'develop-add-fire' into develop-add-fire-export 4 年前
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 年前
Ruo-Ping Dong c47ffc20 Rename saver 4 年前
Ruo-Ping Dong 09c22679 fix NNCheckpointManager for Torch 4 年前
Ruo-Ping Dong f2a8c421 add torch saver test 4 年前
GitHub 70197342 Add torch saver test 4 年前
vincentpierre ba7eb360 Merge branch 'master' into develop-torch-save-rp 4 年前
Ruo-Ping Dong 6ae17cd0 fix test 4 年前
Ruo-Ping Dong a74c904a Merge branch 'master' into develop-saver-name 4 年前
vincentpierre 25454a48 adding tests 4 年前
GitHub 347bde3d Fix export 4 年前
GitHub 38e9387b Fix NNCheckpointManager for Torch 4 年前
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 年前
Ruo-Ping Dong 07e82899 update torch saver test 4 年前
vincentpierre 44fa3a65 Moved the tests around 4 年前
HH d9962254 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
GitHub 328353bc Torch : Saving/Loading of the reward providers (#4405) 4 年前
Ruo-Ping Dong e60c7038 Merge branch 'master' into develop-saver-name 4 年前
GitHub 80b7a6d3 Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py 4 年前
vincentpierre fdd343b2 more use of item() and additional tests 4 年前
Ruo-Ping Dong 88eff042 Merge branch 'master' into develop-saver-name 4 年前
GitHub 82bd7fd0 Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py 4 年前
Ruo-Ping Dong 56feb8af update test_saver_reward_providers.py 4 年前
GitHub 4dda2983 Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py 4 年前
GitHub 48f217b9 Rename Saver to ModelSaver (#4402) 4 年前
GitHub 83e21972 Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py 4 年前
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 年前
GitHub 12e15e29 Fix on GAIL Torch when using actions (#4407) 4 年前
GitHub 498934f9 Replace torch.detach().cpu().numpy() with a utils method (#4406) 4 年前
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 年前
GitHub bfda9576 Replace brain_name with behavior_name (#4419) 4 年前
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 年前
GitHub 7b4d0865 [Bug fix] Fix bug in GAIL gradient penalty (#4425) 4 年前
GitHub 4e93cb6e [torch] Restructure PyTorch encoders (#4421) 4 年前
GitHub 4e6d46cc [tests] Add tests for Torch PPO (#4429) 4 年前
GitHub beb5eb30 [bug-fix] Fixes for Torch SAC and tests (#4408) 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
GitHub 676f5f7c [refactor] Refactor GAIL to use new encoder structure (#4433) 4 年前
Ervin Teng 60eacc0d Merge branch 'master' into develop-adjust-cpu-settings 4 年前
GitHub bf6506fc [feature] Add small CNN for grids 5x5 and up (#4434) 4 年前
yanchaosun 1a9aaaf6 model weights and large transfer learning weight 4 年前
GitHub 94c7111e [feature] Enable default settings for TrainerSettings (#4448) 4 年前
GitHub 2dc34612 Prevent init normalize on --resume (#4463) 4 年前
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 年前
Ervin Teng 7754ad7b Don't run value during inference 4 年前
Andrew Cohen 85602279 add action_out to dist 4 年前
vincentpierre 181bdec0 - 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Ervin Teng f9ff3efe Merge branch 'develop-policyonly' into develop-sac-targetq 4 年前
Andrew Cohen 7c0aa77b Merge branch 'develop-actions-out' into develop-hybrid-actions 4 年前
GitHub 60b76790 Random Network Distillation for Torch (#4473) 4 年前
GitHub e471bd8b Refactoring of the tests folder for the trainers (#4510) 4 年前
GitHub 827525f9 Add test env for hybrid actions, clean up BehaviorSpec (#4522) 4 年前
GitHub 400e14cb [Bug-fix] RND would not be saved correctly. Added tests (#4514) 4 年前
Andrew Cohen db37db34 fixing errors 4 年前
GitHub 2b300088 Better hybrid actions test env (#4523) 4 年前
Andrew Cohen 53176dc0 Merge branch 'develop-hybrid-actions' of https://github.com/Unity-Technologies/ml-agents into develop-hybrid-actions 4 年前
Andrew Cohen 44c9879e action models 4 年前
HH a3bf96fd Merge branch 'master' into hh/develop/gridsensor-tests 4 年前
Andrew Cohen c494bfcc trains successfully 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
Ervin Teng 8dec4771 Add hybrid actions to SAC 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
Andrew Cohen e686a785 removed abstract class 4 年前
Ervin Teng 81342148 Revert "Add hybrid actions to SAC" 4 年前
Andrew Cohen 63757004 experiment with 1/1 test 4 年前
Andrew Cohen 35b88994 simple rl tests pass 4 年前
Andrew Cohen 4b9a7db6 remove old behaviorspec 4 年前
GitHub efa2a704 add to_string for samplers (#4484) 4 年前
Andrew Cohen fc3027ac tf tests except gail pass 4 年前
GitHub b3bc7896 Cherrypick bug fixes to release_9_branch (#4617) 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 601f02a8 update simple rl tests 4 年前
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 年前
Andrew Cohen 7827ca06 add ActionSpec; test_simple_rl torch passes 4 年前
GitHub be723c66 Change BrainParametersProto to support ActionSpec (#4579) 4 年前
Andrew Cohen da978fc6 add separate hybrid test file 4 年前
GitHub a690af74 [refactor] Make PyTorch the default and TensorFlow optional (#4517) 4 年前
Andrew Cohen eaecb59e torch utils to and from buffer 4 年前
Andrew Cohen 6e23bafd ActionFlattener Refactor 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub b5dd43f2 [bug-fix] Don't load non-wrapped policy (#4593) 4 年前
Andrew Cohen f654df34 fixing tensorflow tests 4 年前
GitHub e0ef30a5 [bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen b40e7793 fix mlagents-envs tests 4 年前
GitHub 60b173df [bug-fix] Fix Gym and some Policy tests for ActionSpec (#4590) 4 年前
Ervin Teng ceeea719 Fix TF policy test 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
GitHub 64e998a2 [bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610) 4 年前
Andrew Cohen 590adc01 make_fake_trajectory/step take ActionSpec arg 4 年前
vincentpierre 96452986 Initial commit for multi head attention 4 年前
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 年前
Ruo-Ping Dong 9e08be87 Merge branch 'master' into release_9_branch_merge 4 年前
Andrew Cohen 97dfa142 fix action_spec refs 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
Andrew Cohen 0e28dd8f add static method to create continuous/discrete 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
Andrew Cohen ccd7cc4c fix recurrent sac test 4 年前
Andrew Cohen ae920478 resolve conflicts 4 年前
GitHub 87a7ccf8 use int64 steps, check for NaN actions (#4607) 4 年前
GitHub 23800f33 Merge branch 'master' into develop-action-spec 4 年前
GitHub 85a7c0f7 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) 4 年前
Ervin Teng 184f27c6 Make buffer type-agnostic 4 年前
GitHub 733bffbf use int64 steps, check for NaN actions (#4607) (#4654) 4 年前
Andrew Cohen b6d10456 removed action_spec.size 4 年前
GitHub 8175d558 [bug-fix] Fix BC module + action clipping (#4667) 4 年前
GitHub 2a8c6800 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) (#4662) 4 年前
vincentpierre e14e1c4d Improvements and new tests 4 年前
Ruo-Ping Dong 953cb6bb Merge branch 'master' into develop-windows-delay 4 年前
Ruo-Ping Dong ee5313e4 Merge branch 'master' into develop-windows-delay 4 年前
GitHub f0ed3a38 Cherry-pick BC fixes to Release 10 (#4668) 4 年前
Andrew Cohen dca09bd9 add docstrings 4 年前
Andrew Cohen afd16cc9 rename make_x to creat_x/remove redundant properties 4 年前
Andrew Cohen 5b9aab58 fix advanced vis encoder simple rl 4 年前
Andrew Cohen 505dcf80 fix recurrent/advanced ppo tests 4 年前
Andrew Cohen 4f66ebc2 fix recurrent sac 4 年前
Andrew Cohen 8df63dab reduce visual advanced steps 4 年前
Andrew Cohen 95892058 reduce recurrent step/increase batch size 4 年前
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 年前
Andrew Cohen b70e6078 reduce steps_per_update recurrent sac 4 年前
Ervin Teng 3765c15a Merge branch 'develop-multitype-buffer' into develop-unified-obs 4 年前
Andrew Cohen 667d295c recurrent sac passes locally but fails on CI for inexplicable reasons 4 年前
Andrew Cohen a343f4e1 increase seq length 4 年前
Andrew Cohen e5cc57f9 rename create random to random action 4 年前
vincentpierre b863af57 Removing TensorFlow Trainers 4 年前
Ervin Teng 3b614302 Merge branch 'develop-multitype-buffer' into develop-centralizedcritic 4 年前
GitHub 278911a5 Fix staging tests (#4708) 4 年前
GitHub 94c59e31 C# changes for hybrid action spaces (#4587) 4 年前
Andrew Cohen f6355ba9 Merge branch 'develop-action-spec' into develop-action-buffer 4 年前
GitHub a4c9f58e Fix SubprocessEnvManager hanging on unexpected exceptions. (#4699) 4 年前
vincentpierre 713e65fb removing tensorflow testing for pytest and yamato 4 年前
Andrew Cohen d624b54b Merge branch 'master' into fix-conflict-base-env 4 年前
Andrew Cohen bd917c9c action buffer passes continuous 4 年前
Andrew Cohen b36fcf16 discrete runs/cont passes 4 年前
Andrew Cohen ad951493 debugging discrete 4 年前
Andrew Cohen fcf6471e 2d discrete passes 4 年前
Andrew Cohen 056630d7 sac continuous and discrete train 4 年前
GitHub 990f801a Develop hybrid action staging (#4702) 4 年前
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
Arthur Juliani b8f22fd7 Update second half of tests 4 年前
vincentpierre 93ca1409 fixing the tests 4 年前
vincentpierre 7a5cc9ec Merge master into develop-rm-tf 4 年前
Andrew Cohen 24fd9b3c torch reward providers all pass 4 年前
Arthur Juliani b074c252 Fix remaining tests 4 年前
Andrew Cohen dee6b805 fixed bug in discrete 4 年前
Arthur Juliani ba495418 Resolve pre-commit issues 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
Andrew Cohen 4ebc6c44 ml-agents-envs pass 4 年前
GitHub ded1f79b Merge pull request #4732 from Unity-Technologies/goal-sensors 4 年前
Andrew Cohen b5d1c071 Merge branch 'master' into develop-action-buffer 4 年前
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 年前
GitHub a0d1c829 Action Docs part2 (#4739) 4 年前
Andrew Cohen 762274d9 agent processor tests 4 年前
Arthur Juliani 2be6af80 Fix black 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
Andrew Cohen 94179947 fix demo loader tests 4 年前
GitHub ba21e419 Merge pull request #4737 from Unity-Technologies/goal-gridworld-sensor 4 年前
vincentpierre bc9d3975 merge master 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
GitHub ad5f878c [refactor] Remove critic pass during inference (#4743) 4 年前
GitHub 11687f8d [cherry-pick] Cherry-pick #4743 into Release 11 (#4756) 4 年前
Andrew Cohen 3c65b964 fixed recurrent prev_action issue 4 年前
GitHub 903d3afe Merge pull request #4707 from Unity-Technologies/develop-rm-tf 4 年前
vincentpierre 14378aa5 Merging master 4 年前
Andrew Cohen 97d94a83 fix test_tf_policy 4 年前
Andrew Cohen 293bd20b fix torch test_ppo 4 年前
vincentpierre 1a1070b1 forgot a file 4 年前
Andrew Cohen 230497f5 fix torch utils test 4 年前
Andrew Cohen eef14922 discrete/contionuous unity envs train 4 年前
Andrew Cohen e9cb1066 agent processor tests 4 年前
Andrew Cohen a545859e fix torch test policy 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub a73f7d73 Turn down gain on GAIL discriminator output (#4762) 4 年前
Andrew Cohen 157f9e77 rename to ActionTuple 4 年前
Andrew Cohen 06f1f254 1:1 and continuous/discrete train 4 年前
GitHub b6bb01b9 Turn down gain on GAIL discriminator output (#4762) (#4772) 4 年前
vincentpierre c3699de8 merging master and addressing comments 4 年前
Andrew Cohen 453a2bba ActionTuple default is now np.array, not None 4 年前
Andrew Cohen 60466287 fix simple test env 4 年前
GitHub 29d94c7c Merge pull request #4734 from Unity-Technologies/develop-obs-as-list 4 年前
Andrew Cohen 1d234d1d bc works 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen c0d01baf Merge branch 'master' into merge-release11-master 4 年前
Andrew Cohen 95566e44 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre 8dee7970 Fixing the tests 4 年前
Andrew Cohen 5f0f7e3a fix reward provider tests 4 年前
Andrew Cohen 88b8f4b4 replace use_discrete with action_sizes in simple_rl 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
vincentpierre 44ed3258 Merging master 4 年前
Andrew Cohen 3457cd3c save only discrete actions as prev 4 年前
Andrew Cohen 9c3e4bab fix mock brain prev action 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 272affe0 preliminary aciton model tests 4 年前
Andrew Cohen 17496265 move AgentAction, ActionLogProbs, and ActionFlattener to separate files 4 年前
Chris Elion 76ebc20c Merge remote-tracking branch 'origin/master' into r12-to-master 4 年前
Andrew Cohen d984af1f action model and network tests 4 年前
GitHub 458fee17 Merge pull request #4763 from Unity-Technologies/develop-att 4 年前
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 年前
Andrew Cohen 60309d8f fix torch policy tests 4 年前
vincentpierre 519c5f47 merging master 4 年前
Andrew Cohen 89bb11d3 remove actionspec logic simple test env 4 年前
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Arthur Juliani 0a22af55 Add SensorType field to SensorSpec 4 年前
Andrew Cohen 11e2f5e4 remove unused imports test_hybrid 4 年前
Andrew Cohen 6ffbf209 fix imports in test utils 4 年前
GitHub 8a40c58a Added SUM as aggregation type for custom statistics (#4816) 4 年前
GitHub 7387a77f remove pylint (#4836) 4 年前
Andrew Cohen 886883b3 Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 年前
Arthur Juliani e4b8e7e2 Rename to ObservationType 4 年前
GitHub 14129a08 [MLA-470] Barracuda + TF cleanup (#4837) 4 年前
Arthur Juliani 986717d0 More renaming 4 年前
Andrew Cohen 0c5934ec fix test agent processor 4 年前
GitHub 9689449f Refactor of attention (#4840) 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Andrew Cohen 1812f08b fix test trajectory 4 年前
GitHub af5f6ad0 make sure DefaultTrainerDict is pickle-able (#4842) 4 年前
Arthur Juliani 7c37c759 Fix some mis-renamings 4 年前
Andrew Cohen 701c1a3f fix test torch distributions 4 年前
GitHub b7e6efa3 Allow setting maximum number of elements in self-attention to None (#4841) 4 年前
vincentpierre 56972f56 WIP integrate attention to networkbody 4 年前
Arthur Juliani 5342f426 One more rename 4 年前
Ervin Teng aba633b2 Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 年前
Andrew Cohen e88558c3 fix torch test policy 4 年前
Andrew Cohen 631ac7f4 fixed tests 4 年前
Ervin Teng 30a09c6f Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 年前
Andrew Cohen 22f42f5b fix torch test ppo 4 年前
GitHub eb78a477 Add default init/gain to LinearEncoder (#4846) 4 年前
vincentpierre 7f8e6a0d fix tests 4 年前
Andrew Cohen 85b18389 fix test tf policy 4 年前
GitHub 0ac990e0 add LayerNorm (#4847) 4 年前
Andrew Cohen 4bf182aa fix tensorflow test simple rl 4 年前
Ruo-Ping Dong a7d04be6 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Andrew Cohen 8c42dcc7 fix tensorflow test ppo 4 年前
vincentpierre 5039b65a Merge branch 'master' into develop-att-network-integration 4 年前
Arthur Juliani 0a876b9c Fix typos 4 年前
Ervin Teng 2085e17c Merge branch 'master' into develop-centralizedcritic-mm 4 年前
Andrew Cohen ff324d0c fixed sac recurrent tf simple rl 4 年前
Arthur Juliani e3de0406 Plurals 4 年前
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 年前
HH 0024a286 merge ervin's new stuff 4 年前
GitHub 12e1fc28 [feature] Hybrid SAC (#4574) 4 年前
Andrew Cohen 7af25330 fixed torch test sac 4 年前
Andrew Cohen 9bcd3c39 fix 2d sac 4 年前
Arthur Juliani 7b230bdf Change seed for two offending tests 4 年前
Andrew Cohen b0c02ee0 Merge branch 'develop-hybrid-actions-csharp' into develop-actionmodel-csharp 4 年前
Arthur Juliani fc756e5a Formatting 4 年前
Arthur Juliani a0876939 Extend test time 4 年前
Arthur Juliani 880d390b Change seed 4 年前
Arthur Juliani b4d8cf54 Change learning rate 4 年前
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 年前
vincentpierre 8660b1c2 merging master 4 年前
GitHub a02cf933 Add predict minimum attention test (#4853) 4 年前
vincentpierre 24d2f335 fixing test 4 年前
vincentpierre 38fc2536 addresing some comments 4 年前
GitHub 01e0ee00 refactor entityembedding/network body (#4857) 4 年前
GitHub 89b6c949 use singular entity embedding (#4873) 4 年前
Andrew Cohen 6dafe05c fix tests 4 年前
brccabral 457fb612 Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents 4 年前
brccabral f21a1f85 Increase sleep time to assert given exception as UnityEnvironmentException 4 年前
GitHub 67594fa5 Merge pull request #4868 from brccabral/PytestWSL 4 年前
vincentpierre 52b011d6 _ 4 年前
vincentpierre 03c905b2 Fix equation for entropy 4 年前
vincentpierre 396bc43c Merging master 4 年前
GitHub d4455936 Merge pull request #4869 from Unity-Technologies/fix-normal-entropy 4 年前
vincentpierre b7c7d773 Adding some tests 4 年前
vincentpierre 6f3ea7b8 _ 4 年前
Arthur Juliani 372c784c Fix tests 4 年前
vincentpierre ff826bd2 added a test 4 年前
vincentpierre aaec009a Formatting 4 年前
vincentpierre 2f48cb82 Fixing a test 4 年前
vincentpierre 52e4069f fixing formatting 4 年前
Arthur Juliani 987800f2 Change StatsSummary to use properties 4 年前
GitHub bd4bc66b Merge branch 'master' into fix-numti-env-delayed-spawn 4 年前
vincentpierre 77eecc6b Merge branch 'master' into develop-att-network-integration 4 年前
GitHub db4436e9 Merge pull request #4872 from Unity-Technologies/fix-numti-env-delayed-spawn 4 年前
vincentpierre 7e47f94b addressing comments 4 年前
GitHub d7f549f9 Run pytest on GPU (#4865) 4 年前
vincentpierre c27a95f0 Make a self encoder before EntityEmbedding 4 年前
Arthur Juliani ff70c5c4 Merge branch 'master' into goal-conditioning-new 4 年前
vincentpierre fd007f53 Attempting to use EntityEmbedding directly as processor 4 年前
vincentpierre f5ec393b added a test to make sure that a mask of all zeros or all ones would not break backpropagation 4 年前
vincentpierre 1cff7848 no need for large number of steps in test 4 年前
GitHub 457ed0b8 Set torch device from commandline (#4888) 4 年前
GitHub d8835857 [MLA-1540] Training Analytics (#4780) 4 年前
GitHub 2fb87e4f Merge branch 'master' into reward-dist 4 年前
GitHub 212ebfb9 Merge pull request #4844 from Unity-Technologies/develop-att-network-integration 4 年前
Chris Elion 9d70220e Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 4 年前
GitHub f027e12d Merge pull request #4878 from Unity-Technologies/reward-dist 4 年前
GitHub 64fc7f43 Buffer key enums (#4907) 4 年前
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 543f22bc fix test_networks 4 年前
Ervin Teng 1831044a Update SAC to use separate policy 4 年前
GitHub 4d32857d Merge branch 'master' into develop-var-len-obs-feature 4 年前
Ruo-Ping Dong 471a2e82 fix tests 4 年前
GitHub 5022d710 Add additional logic to avoid load being called on every advance (#4934) 4 年前
Ruo-Ping Dong d1107648 fix tests 4 年前
Ervin Teng c7054d76 Use attention tests from master 4 年前
Andrew Cohen 6828713c fix saver test 4 年前
Ervin Teng da6a55a0 Revert "Use attention tests from master" 4 年前
Ervin Teng 281fcdbe Merge remote-tracking branch 'origin/develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng 24ee4bd5 Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer 4 年前
Ervin Teng bac2fb68 Use attention from master 4 年前
Andrew Cohen 66742dc8 test for SharedActorCritic 4 年前
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 4 年前
Andrew Cohen d81d0be3 fix agent processor test 4 年前
Ervin Teng e112ede0 Fix mock brain 4 年前
Andrew Cohen 3f7d68b8 fix test policy 4 年前
Ervin Teng aa6d4de2 np float32 fixes 4 年前
Andrew Cohen 531695fb adjust step size gail visual ppo 4 年前
Ervin Teng 219e773b Merge branch 'develop-fix-lstms' into develop-critic-op-lstm 4 年前
Ervin Teng 44073593 Test for team obs in agentprocessor 4 年前
Ervin Teng a81512c9 Test for group and add team reward 4 年前
Christopher Goy 9cadfa7a Merge master -> release_13_branch-to-master 4 年前
vincentpierre e1b94b8b Merge branch 'master' into develop-var-len-obs-feature 4 年前
Andrew Cohen dc8e8494 Merge branch 'master' into develop-critic-optimizer 4 年前
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng 566efa52 Fix Trajectory test 4 年前
Ervin Teng 4a33be31 Tweak SAC tests 4 年前
Ervin Teng 7471a2fd Fix AgentProcessor tests 4 年前
Ervin Teng 40f51774 Fix PPO tests 4 年前
Ervin Teng 180f7d03 Fix SAC test 4 年前
Chris Elion c3bc8991 cleanup, don't store mask 4 年前
GitHub ddb01eb2 MultiAgentGroup Interface (#4923) 4 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 年前
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 4 年前
Ervin Teng 08db7c2f Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm 4 年前
Ervin Teng 2f209c12 Buffer fixes 4 年前
Ervin Teng 12cef7af Add test for GroupObs 4 年前
Ervin Teng 1fc3640e Change AgentAction back to 0 pad and add tests 4 年前
GitHub 338af2ec Move the Critic into the Optimizer (#4939) 4 年前
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 4 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 4 年前
Ervin Teng 50ab983e Fix slicing typing and string printing in AgentBufferField 4 年前
Ervin Teng bc3d3a95 Fix slicing typing and string printing in AgentBufferField 4 年前
Ervin Teng 56d4c1f9 Fix to-flat and add tests 4 年前
Andrew Cohen 5d517c5e clean ups 4 年前
Andrew Cohen e2d46ca0 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 4 年前
Andrew Cohen 8562471e add inital coma optimizer tests 4 年前
Andrew Cohen 43955c5b get value estimate test 4 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 4 年前
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 4 年前
Ervin Teng fd0dd35c Merge branch 'main' into develop-coma2-trainer 4 年前
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 4 年前
GitHub af36ef3b [bug-fix] Fix typo (#5035) 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
GitHub 47db8ce1 [bug-fix] Fix padding for List entries in buffer (#5046) 4 年前
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 4 年前
GitHub ba2af269 [coma2] Make group extrinsic reward part of extrinsic (#5033) 4 年前
GitHub d24b0966 [bug-fix] Fix memory leak when using LSTMs (#5048) 4 年前
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 4 年前
GitHub d2635e58 Action slice (#5047) 4 年前
Ervin Teng 8902c058 Merge branch 'main' into develop-coma2-trainer 4 年前
Andrew Cohen 95f62362 add test 4 年前
Andrew Cohen 853b44d5 torch coma tests: lstm, cur, gail 4 年前
GitHub 46461986 pass sensor name through to ObservationSpec (#5036) 4 年前
GitHub fc5d0a3f [bug-fix] Fix save/restore critic, add test (#5062) 4 年前
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 4 年前
Andrew Cohen cd349985 add negative constant extrinsic to gail 4 年前
GitHub ffca08c4 Upgrade PyTorch version for python 3.9 (#5028) 4 年前
Ervin Teng 1f026c70 Merge branch 'main' into develop-superpush-branch-cleanup 4 年前
Andrew Cohen e547f26c adjust step size 4 年前
Ervin Teng ce872033 Revert "Merge branch 'main' into develop-superpush-branch-cleanup" 4 年前
GitHub 8f35bdd3 POCA trainer (#5005) 4 年前
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 4 年前
GitHub e81e038b Fix end episode for POCA, add warning for group reward if not POCA (#5113) 4 年前
GitHub 63169e2c [cherry-pick] Fix group rewards for POCA, add warning for non-POCA trainers (#5120) 4 年前
GitHub ef3d6e0d Adding Hypernetwork modules and unit tests (#5141) 4 年前
GitHub 8387e252 [release] Fix rl trainer warning (#5144) 4 年前
Ervin Teng 41dd16e8 Merge branch 'main' into release_15_mm 4 年前
Ervin Teng d1c24251 [bug-fix] When agent isn't training, don't clear update buffer (#5205) 3 年前
GitHub 3607f062 Merge release 15 into Main 4 年前
Ervin Teng c108da4a [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 3 年前
Ervin Teng bed4bf36 Load individual elements if state dict load fails (#5213) 3 年前
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 4 年前
GitHub 81705d6d Goal conditioning integration (#5142) 4 年前
vincentpierre d4716caa Merge branch 'main' into goal-conditioning-sensors-3 4 年前
GitHub c37cfac1 Adding the goal conditioning sensors with the new observation specs (#5159) 4 年前
vincentpierre 1b4fd8fb Renaming GOAL to GOAL_SIGNAL 4 年前
Ervin Teng c05ec9af Fix groupmate obs, add tests 4 年前
Ervin Teng b3499848 Improve tests 4 年前
Ervin Teng 6e04aaf3 Fix poca test 4 年前
GitHub ff21216d [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 年前
Andrew Cohen 42105f23 add load different reward tests 4 年前
Andrew Cohen 98dcb548 test convolutions can be loaded properly 4 年前
Andrew Cohen 2e5b1352 add check that layers still have different dimensions 4 年前
GitHub cb1f5462 Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py 4 年前
GitHub f3d586bc Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py 4 年前
GitHub c5589b59 [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 4 年前
GitHub 9dfe6c7f Load individual elements if state dict load fails (#5213) 4 年前
GitHub fd79d92c Extend StatsWriter to allow handling of individual stat updates (#5249) 3 年前
vincentpierre 51adab1c Fix the attention module embedding size 3 年前
GitHub 4c776283 Fix --results-dir (#5269) 3 年前
GitHub 353b1566 Fix the attention module embedding size (#5272) 3 年前
GitHub 28eb43dd [bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271) 3 年前
GitHub ed69fd2b collecting latest step as a stat (#5264) 3 年前
GitHub 4995a765 [debug] Require all behavior names to have a matching YAML entry (#5210) 3 年前
GitHub ae01cfc9 collecting latest step as a stat (#5264) (#5295) 3 年前
GitHub bff0a5d2 [debug] Require all behavior names to have a matching YAML entry (#5210) (#5296) 3 年前
Miguel Alonso Jr 4846cf0f Merge branch 'main' into develop-api-documentation-update 3 年前
GitHub 806f04bd Readding the validation of the minimal cnn input size (#5345) (#5346) 3 年前
GitHub 15440c24 Readding the validation of the minimal cnn input size (#5345) 3 年前
GitHub bb07eb45 Adding a fully connected visual encoder for super small visual input + tests (#5351) 3 年前
Miguel Alonso Jr 97b7d5c6 Merge branch 'main' into develop-api-documentation-update 3 年前
GitHub b767b66b Exclude test_visual_encoder_trains from GPU test (#5367) 3 年前
GitHub fc6e8c35 [🐛🔨 ] Fix sac target for continuous actions (#5372) 3 年前
GitHub 2933f235 Fix the reporting of histogram stats and adding a test (#5410) 3 年前