63 commits (44a293c6-1ae2-49ad-a873-a63957ca7a10)

Author SHA1 Message Commit date
GitHub 8317a659 Behavioral Cloning & Trainers Reorg (#328) 7 years ago
GitHub e11dae1d Python Testing & Image Inference Improvements (#353) 7 years ago
eshvk 030ac5c5 [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups. 7 years ago
GitHub f134016b On Demand Decision (#308) 7 years ago
GitHub 69481d2d Imitation Learning Helper (#371) 7 years ago
GitHub dcf58f75 Feature/previous text action (#375) 7 years ago
GitHub e0d5b1b0 Fix for when not using teacher helper (#379) 7 years ago
GitHub a7c9096f [Semantics] Modified the placeholder names (#381) 7 years ago
GitHub 6dd3c284 Hotfix 0.3.0b (#519) 7 years ago
GitHub 237b41f9 Hotfix 0.3.0c (#618) 7 years ago
GitHub 1a449e98 Hotfix 0.3.1b (#637) 7 years ago
GitHub 755be43e [Cold Fix] Making the episode length and mean reward more accurate for the first episode (#657) 7 years ago
Arthur Juliani 9477eaa9 Develop fix cumulative reward (#725) 7 years ago
GitHub 702d98c6 [Fix] The summary writer is now implemented in the abtract trainer class. (#806) 7 years ago
Arthur Juliani d7338050 Enable concurrent sessions 7 years ago
eshvk 680b0767 [Imitation Learning] Minor fix to make sure that step increment loads from the last saved global step if the model is being trained after loading 7 years ago
Arthur Juliani 5d402be9 Minor Optimizations (#836) 7 years ago
GitHub 0f65e272 [Addresses #842] (#849) 7 years ago
Arthur Juliani 5e48766d Remove discrete observations 7 years ago
Arthur Juliani 195ac934 Merge branch 'develop' into develop-runs 6 years ago
vincentpierre e47cec56 [Initial Commit] 6 years ago
unityjeffrey 0d67f311 changed ml agents to ml-agents 6 years ago
unityjeffrey 19fb437a changed to Unity ML-Agents Toolkit (english) 6 years ago
Arthur Juliani 6b359062 Fix for visual-only imitation learning 6 years ago
GitHub e50ac7ae Merge branch 'develop' into hotfix-0 6 years ago
Arthur Juliani 1eb701af Merge remote-tracking branch 'origin/develop' into develop-value-estimates-ppo 6 years ago
Arthur Juliani f52d5a92 Merge remote-tracking branch 'origin/develop' into develop-runs 6 years ago
GitHub ef3025e6 Merge pull request #1004 from Unity-Technologies/develop-runs 6 years ago
Arthur Juliani 3659bbcd Develop multi discrete (#1022) 6 years ago
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 years ago
GitHub ded0d8c7 Develop action masking (#1080) 6 years ago
Deric Pang cdb41480 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 years ago
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 years ago
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 years ago
GitHub 4a881354 fix the training doc (#1193) 6 years ago
GitHub d2c320dd Remove graph scope (#1205) 6 years ago
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 years ago
GitHub 840417ff Use organized tags for tensorboard stats (#1248) 6 years ago
GitHub c258b1c3 Move 'take_action' into Policy class (#1669) 6 years ago
eshvk cc9bdf17 Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return 6 years ago
eshvk fb04c40c Reorganize to make metrics collection more accurate 6 years ago
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 years ago
GitHub a4d5b2d3 Doc/comment cleanup - Fix some occurrences of 'the the' (#2119) 6 years ago
GitHub 2671e1a0 Enable mypy in precommit checks (#2177) 5 years ago
Jonathan Harper 177ee5b8 Remove unused "last reward" logic, TF nodes 5 years ago
GitHub 9eb3f049 Cleanup unused code in TrainerController (#2315) 5 years ago
GitHub b498c19d Fix BCTrainer increment_steps (#2384) 5 years ago
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 years ago
Ervin Teng c912d140 Make sure all tests pass on BC 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
Ervin Teng e826f4bb Bugfix for LSTM+BC (#2679) 5 years ago
GitHub 4da157fe more pylint fixes (#2842) 5 years ago
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 years ago
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 years ago
GitHub a2194ea7 Fix batch size issue with BC (#2965) 5 years ago
Ervin Teng 73000a6b Merge branch 'develop' into develop-splitbuffer 5 years ago
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 years ago
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 years ago
Ervin Teng 3697e616 Convert BC (warning) might be broken 5 years ago
Ervin Teng 38ff674e Fix BC and tests 5 years ago
Ervin Teng 324d217b Move agent_id to Trajectory 5 years ago
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 years ago
Ervin Teng 6242b67d Add way to check if trajectory is done or max_reached 5 years ago