26 次代码提交 (36e95a95-0970-40df-9366-7263f51c7a71)

作者 SHA1 备注 提交日期
Arthur Juliani 982fab41 Initial commit 7 年前
vincentpierre cde3c8f7 formating and added documentation 7 年前
Arthur Juliani 71591043 PPO additions and warnings 7 年前
GitHub aee5d336 Fix discrete state (#33) 7 年前
vincentpierre 3f85bb56 Merge branch 'master' into dev-broadcast 7 年前
Arthur Juliani adac2683 Fix for multi-agent with observations 7 年前
Arthur Juliani c190eb22 Randomize ppo training batch 7 年前
vincentpierre 431fc43c Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents into dev-broadcast 7 年前
Arthur Juliani b6ce30bf Add curriculum support to PPO 7 年前
Arthur Juliani 06d9bbec Log lesson in TensorBoard 7 年前
vincentpierre 3b00302a merging dev-broadcast-curriculum 7 年前
vincentpierre 22db3d64 added the modified files from dev-cooperative-env 7 年前
Arthur Juliani 51f23cd2 0.2 Update 7 年前
Arthur Juliani b56259f6 Fix cumulative reward (Unity) and Nan reward (python) bugs 7 年前
GitHub 00534390 Refactored GridWorld (#225) 7 年前
Arthur Juliani 75ea16ff Add comments and alphabetize flags 7 年前
Arthur Juliani adedd491 Initial support for multiple observations (#256) 7 年前
Arthur Juliani 5b8822a0 Bug fix multiple observations 7 年前
vincentpierre db3cb9df Merge branch 'development' into dev-logfile 7 年前
Arthur Juliani 54652c69 dev-logParam (#135) 7 年前
GitHub faa53e35 Fix observations on PPO trainer (#340) 7 年前
Arthur Juliani c21a391d Various bug fixed and changes 7 年前
Arthur Juliani 9d26767d Instantiate training buffer with trainer 7 年前
Arthur Juliani 827dca28 Fix typo in model vars 7 年前
Arthur Juliani 1bf46a85 Add flags for normalization and variable layers 7 年前
Arthur Juliani 6c1c8220 Python2 fix 7 年前