136 次代码提交 (255063d7-af89-496d-b610-2a461cbadb9c)

作者 SHA1 备注 提交日期
GitHub 8317a659 Behavioral Cloning & Trainers Reorg (#328) 7 年前
GitHub e11dae1d Python Testing & Image Inference Improvements (#353) 7 年前
eshvk 030ac5c5 [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups. 7 年前
GitHub 9ad4182e Merge pull request #366 from Unity-Technologies/feature/cleanup 7 年前
GitHub f8d27dc5 Merge branch 'development-0.3' into feature/LSTM2 7 年前
GitHub f134016b On Demand Decision (#308) 7 年前
GitHub 69481d2d Imitation Learning Helper (#371) 7 年前
GitHub dcf58f75 Feature/previous text action (#375) 7 年前
GitHub e0d5b1b0 Fix for when not using teacher helper (#379) 7 年前
GitHub a7c9096f [Semantics] Modified the placeholder names (#381) 7 年前
Vincent Gao 02df3b34 resolved conflicts 7 年前
GitHub 6dd3c284 Hotfix 0.3.0b (#519) 7 年前
GitHub a6385cbf Merge pull request #536 from Unity-Technologies/master 7 年前
GitHub 237b41f9 Hotfix 0.3.0c (#618) 7 年前
GitHub 78d411f6 Merge pull request #619 from Unity-Technologies/develop 7 年前
GitHub 1a449e98 Hotfix 0.3.1b (#637) 7 年前
GitHub b2675216 Hotfix 0.3.1b (#656) 7 年前
GitHub 755be43e [Cold Fix] Making the episode length and mean reward more accurate for the first episode (#657) 7 年前
GitHub 3b866e9f Use Clipped Gaussian (#649) 7 年前
Arthur Juliani 9477eaa9 Develop fix cumulative reward (#725) 7 年前
GitHub 38098a12 [Fixed BC with LSTM] (#766) 7 年前
GitHub 702d98c6 [Fix] The summary writer is now implemented in the abtract trainer class. (#806) 7 年前
GitHub c17937ef Curiosity Driven Exploration & Pyramids Environments (#739) 7 年前
Arthur Juliani d7338050 Enable concurrent sessions 7 年前
eshvk 680b0767 [Imitation Learning] Minor fix to make sure that step increment loads from the last saved global step if the model is being trained after loading 7 年前
GitHub e195b495 Merge pull request #838 from Unity-Technologies/develop-bc 7 年前
Arthur Juliani 5d402be9 Minor Optimizations (#836) 7 年前
GitHub 282d5bd4 Fix Pytests (#843) 7 年前
GitHub 0f65e272 [Addresses #842] (#849) 7 年前
GitHub bf858cd6 Merge pull request #884 from Unity-Technologies/release-v0.4 7 年前
Arthur Juliani 5e48766d Remove discrete observations 7 年前
GitHub b6fe0bca Merge pull request #906 from Unity-Technologies/develop-no-discrete-obs 7 年前
Arthur Juliani 195ac934 Merge branch 'develop' into develop-runs 7 年前
vincentpierre e47cec56 [Initial Commit] 7 年前
unityjeffrey 0d67f311 changed ml agents to ml-agents 7 年前
unityjeffrey 19fb437a changed to Unity ML-Agents Toolkit (english) 7 年前
GitHub 7b9a2905 Merge pull request #916 from Unity-Technologies/hotfix-trademarkupdate 7 年前
Arthur Juliani 9701c3db Merge branch 'hotfix-0' into release-v0.4-fix-curiosity-odd 7 年前
Arthur Juliani 6b359062 Fix for visual-only imitation learning 7 年前
GitHub 7b497341 Merge pull request #936 from Unity-Technologies/hotfix-visual-imitation 7 年前
GitHub f155d661 Merge pull request #908 from Unity-Technologies/hotfix-0 7 年前
GitHub e50ac7ae Merge branch 'develop' into hotfix-0 7 年前
GitHub b36e6a2e Merge pull request #946 from Unity-Technologies/hotfix-0 7 年前
GitHub 4e73f770 Merge branch 'develop' into hotfix-0.4b 6 年前
Arthur Juliani 1eb701af Merge remote-tracking branch 'origin/develop' into develop-value-estimates-ppo 6 年前
Arthur Juliani f52d5a92 Merge remote-tracking branch 'origin/develop' into develop-runs 6 年前
GitHub 1e21c143 Merge pull request #934 from Unity-Technologies/develop-value-estimates-ppo 6 年前
GitHub ef3025e6 Merge pull request #1004 from Unity-Technologies/develop-runs 6 年前
Arthur Juliani 3659bbcd Develop multi discrete (#1022) 6 年前
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 年前
GitHub ded0d8c7 Develop action masking (#1080) 6 年前
GitHub 2e489abc Normalization of the probabilities after masking (#1123) 6 年前
Deric Pang cdb41480 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
Deric Pang d4ca94a1 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
GitHub 3900ed66 Merge pull request #1083 from Unity-Technologies/develop-flat-code-restructure 6 年前
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 年前
GitHub f8df71a0 Revert "Release v0.5 (Develop) (#1203)" (#1222) 6 年前
GitHub ab5c49e8 Release v0.5 delete unityagents (#1151) 6 年前
GitHub 2d4b4209 Use single scope declaration for models (#1160) 6 年前
GitHub 4a881354 fix the training doc (#1193) 6 年前
GitHub 25495874 Merge pull request #1223 from Unity-Technologies/release-v0.5 6 年前
GitHub 560f1bd7 Merge pull request #1224 from Unity-Technologies/release-v0.5 6 年前
GitHub d2c320dd Remove graph scope (#1205) 6 年前
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 年前
GitHub 840417ff Use organized tags for tensorboard stats (#1248) 6 年前
GitHub 6c354d16 New Learning Brain (#1303) 6 年前
GitHub 48578199 Fix brain name bug in offline bc (#1395) 6 年前
vincentpierre d1cb6ce0 Fix on the bc_offline_training using deep copies 6 年前
GitHub 13d38179 Merge pull request #1490 from Unity-Technologies/release-v0.6-fix-bc-offline 6 年前
GitHub c8cc5a29 Merge pull request #1495 from Unity-Technologies/release-v0.6 6 年前
GitHub a196dde2 Merge pull request #1494 from Unity-Technologies/release-v0.6 6 年前
GitHub c258b1c3 Move 'take_action' into Policy class (#1669) 6 年前
eshvk cc9bdf17 Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return 6 年前
eshvk fb04c40c Reorganize to make metrics collection more accurate 6 年前
GitHub a0b44f1b Merge pull request #1858 from Unity-Technologies/develop-esh-metrics 6 年前
GitHub 2d1bda57 Merge pull request #1931 from Unity-Technologies/release-v0.8 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub 70d14910 Merge pull request #1934 from Unity-Technologies/develop-black 6 年前
GitHub a4d5b2d3 Doc/comment cleanup - Fix some occurrences of 'the the' (#2119) 6 年前
GitHub d5f6b7f8 Merge pull request #2157 from Unity-Technologies/release-v0.8.2 6 年前
GitHub 2671e1a0 Enable mypy in precommit checks (#2177) 6 年前
GitHub 40c7fc48 Merge branch 'develop' into protobuf_update 6 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
Jonathan Harper 177ee5b8 Remove unused "last reward" logic, TF nodes 5 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
Chris Elion 5d07ca1f Merge remote-tracking branch 'origin/develop' into enable-flake8 5 年前
GitHub 9eb3f049 Cleanup unused code in TrainerController (#2315) 5 年前
GitHub 53475207 Merge pull request #2380 from Unity-Technologies/release-0.9.0 5 年前
GitHub b498c19d Fix BCTrainer increment_steps (#2384) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
GitHub afb6ede5 Merge pull request #2393 from Unity-Technologies/hotfix-v0.9.0a 5 年前
Ervin Teng 072d2ef8 Merge latest develop 5 年前
Ervin Teng c912d140 Make sure all tests pass on BC 5 年前
GitHub 4472838e Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1 5 年前
GitHub 0a163871 Merge pull request #2469 from Unity-Technologies/release-0.9.2 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
Chris Elion 3cb1755e When checking for the compatibility of the expert brain with the policy brain, we will remove the action descriptions from the dictionary of things we need to compare. This is to prevent the case where a user has different descriptions for his actions but still wants to train a brain using expert demonstrations. (#2517) 5 年前
GitHub b2fa2268 Merge pull request #2648 from Unity-Technologies/release-0.10.0 5 年前
Anupam Bhatnagar cc208c00 resolving conflicts 5 年前
Ervin Teng e826f4bb Bugfix for LSTM+BC (#2679) 5 年前
GitHub 5f5ccfa0 Feature Deprecation : Online Behavioral Cloning (#2659) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
Chris Elion 8da16bdb move compat functions 5 年前
GitHub f22c41db Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 5 年前
GitHub e6240c7a Bugfix for LSTM+BC (#2679) 5 年前
Anupam Bhatnagar b733b34c resolving conflicts 5 年前
Chris Elion a1967c19 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
GitHub 495873e5 Merge pull request #2833 from Unity-Technologies/release-0.11.0 5 年前
Chris Elion 691d21e6 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Jonathan Harper 8550679d Merge branch 'develop' into release-0.11.0 5 年前
GitHub 4da157fe more pylint fixes (#2842) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 年前
Ervin Teng 987e0e3a Merge tf2 branch 5 年前
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 年前
GitHub c0453ae1 Merge pull request #2912 from Unity-Technologies/develop-allbraininfo 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
GitHub a2194ea7 Fix batch size issue with BC (#2965) 5 年前
GitHub b5eb34dc Fix batch size issue with BC (#2965) (#2966) 5 年前
Ervin Teng 73000a6b Merge branch 'develop' into develop-splitbuffer 5 年前
GitHub d4780a55 Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 年前
GitHub 35c995e9 Merge pull request #3038 from Unity-Technologies/develop 5 年前
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
Ervin Teng 3697e616 Convert BC (warning) might be broken 5 年前
Ervin Teng 38ff674e Fix BC and tests 5 年前
Ervin Teng 324d217b Move agent_id to Trajectory 5 年前
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 年前
Ervin Teng 6242b67d Add way to check if trajectory is done or max_reached 5 年前