ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276 )	7 年前
Arthur Juliani	f2d30f07	The internal Brain now can effectively modify the value field of the agents (#275 ) * Requires training to have been made with ppo * The name of the tensor must be value_estimate	7 年前
Arthur Juliani	3b8755d2	fixes on imitation trainer, now works with demo (#274 )	7 年前
Arthur Juliani	7bf0c888	trainer will raise an error if the memory of the brain is set wrong (#273 )	7 年前
vincentpierre	b7f787f6	bug fix on range of observations	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
vincentpierre	a54e459c	partial fix on the lstm The recurrent encoding now happens at the end	7 年前
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166 ) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 年前
Arthur Juliani	36e95a95	Add additional explanation for time horizon	7 年前
Arthur Juliani	8d6c57b9	Gridworld should have an inference wait time	7 年前
GitHub	00534390	Refactored GridWorld (#225 ) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 年前
vincentpierre	89019f26	Merge branch 'development-0.3' of https://github.com/Unity-Technologies/ml-agents into development-0.3	7 年前
GitHub	09714460	Update Readme.md Include the How to Instantiate and destroy agents	7 年前
vincentpierre	b29aac1f	typo fix	7 年前
vincentpierre	b37cf8b9	added a document on how to instanciate and destroy agents	7 年前
vincentpierre	56c8914f	An agent can now spawn an agent in AgentStep()	7 年前
vincentpierre	4fcc6fbc	fix so we can now destroy the agent in AgentOnDone()	7 年前
vincentpierre	15f29084	fix on the SetCumulativeReward() method in Agent.cs	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
Arthur Juliani	216888ee	Fixed to give lesson index parameter when start up (#179 ) * fixed to give lesson parameter when start up * applied to PPO.ipynb and modified ppo.py a bit	7 年前
vincentpierre	a7de9336	revert previous commit	7 年前
vincentpierre	d77cfc6d	Fix Cumulative reward reset	7 年前
GitHub	d9831a99	Add additional features to list	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
Arthur Juliani	e25aa997	Corrected state space of tennis environment (#160 )	7 年前
GitHub	98765705	Update Training-on-Amazon-Web-Service.md	7 年前
GitHub	e70707ae	Change worker num to worker id	7 年前
GitHub	bb61aef9	Merge pull request #128 from dcsilver/patch-1 Fixed object name discrepancy	7 年前
GitHub	52d18cb8	Fixed object name discrepancy	7 年前
GitHub	adf836d5	Update Unity-Agents---Python-API.md	7 年前
GitHub	848ff2ca	New Windows 10 installation guide	7 年前
Arthur Juliani	06d9bbec	Log lesson in TensorBoard	7 年前
Arthur Juliani	43ac4148	clarify introduction doc; fix broken links. (#123 )	7 年前
Arthur Juliani	4a11c005	Add curriculum code to notebook and simplify	7 年前
Arthur Juliani	ed26e974	Update Making-a-new-Unity-Environment.md (#114 ) Fixed some typos	7 年前
Arthur Juliani	e6696ed3	Don't print	7 年前
GitHub	3561f003	Update Making-a-new-Unity-Environment.md	7 年前
Arthur Juliani	b6ce30bf	Add curriculum support to PPO	7 年前
Arthur Juliani	9d76583e	Fixed typo (#105 )	7 年前
vincentpierre	c16e0ac3	modified the socket to receive states and images of any size	7 年前
GitHub	9a7445f7	Update best-practices-ppo.md	7 年前
vincentpierre	447da485	fix on the CoreBrains so that if one Corebrain gets eraised, it will be reinstanciated	7 年前
Arthur Juliani	332061aa	Rename	7 年前
vincentpierre	d421a300	updated the tests of unityagents	7 年前
GitHub	0d7f6726	Add ppo to table of contents	7 年前
vincentpierre	250eb8e1	better checking of the format of the curriculum file	7 年前
Arthur Juliani	3bd12269	Add best practices for PPO	7 年前
vincentpierre	e8429059	bug fix for python3	7 年前
GitHub	d14d88e2	Clarify text	7 年前
vincentpierre	360984c4	curriculum.json params must have 4 entries	7 年前

1 2 3

140 次代码提交 (15f10de0-f6ae-43f2-90d3-f12ea0e0c587)