ml-agents

目录树: 3a154c62

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	cde3c8f7	formating and added documentation	7 年前
Arthur Juliani	cfceb9f4	Fix timestep for PPO.ipynb	7 年前
Arthur Juliani	71591043	PPO additions and warnings * Add linear decay to learning rate for PPO * Add warning/exception for unsupported brain configurations w/ PPO	7 年前
GitHub	aee5d336	Fix discrete state (#33 ) * made BrainParameters a class to set default values Modified the error message if the state is discrete * Add discrete state support to PPO and provide discrete state example environment * Add flexibility to continuous control as well * Finish PPO flexible model generation implementation * Fix formatting * Support color observations * Add best practices document * bug fix for non square observations * Update Readme.md * Remove scipy dependency * Add installation doc	7 年前
Arthur Juliani	4a11c005	Add curriculum code to notebook and simplify	7 年前
Arthur Juliani	06d9bbec	Log lesson in TensorBoard	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
Arthur Juliani	216888ee	Fixed to give lesson index parameter when start up (#179 ) * fixed to give lesson parameter when start up * applied to PPO.ipynb and modified ppo.py a bit	7 年前
vincentpierre	cd1feef6	minor fix to the Notebook	7 年前
Arthur Juliani	98cebd82	Fix typo "leaning_rate" (#324 )	7 年前
Arthur Juliani	54652c69	dev-logParam (#135 ) * added the method write text to trainer so it is easy to write log the hyperparameters as a dictionary. Note: needs tensorflow version r1.2 or above * added message if impossible to write text summary in Tensorboard	7 年前

12 次代码提交 (3a154c62-0c67-4d9b-98b6-1ec4389cf8c5)