ml-agents

Author	SHA1	Message	Commit date
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 years ago
GitHub	d1cf3030	Merge pull request #309 from Unity-Technologies/dev-imitation Miscellaneous Fixes	7 years ago
Arthur Juliani	c42eff57	Misc fixes	7 years ago
Arthur Juliani	4418421a	Rename variables in imitation trainer	7 years ago
Arthur Juliani	6ad7f010	Fix for discrete control image observations	7 years ago
Arthur Juliani	3fca9b66	Set maxStepReached to false on reset	7 years ago
GitHub	e676017b	Reorganize learn.py (#302) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 years ago
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change the way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 years ago
GitHub	517197bc	Update Instantiating-Destroying-Agents.md	7 years ago
GitHub	f8a8b112	Move epsilon generation into graph (#283)	7 years ago
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276)	7 years ago
Arthur Juliani	f2d30f07	The internal Brain can now effectively modify the value field of the agents (#275) * Requires training to have been done with PPO * The name of the tensor must be `value_estimate`	7 years ago
Arthur Juliani	3b8755d2	Fixes on imitation trainer, now works with demo (#274)	7 years ago
Arthur Juliani	7bf0c888	Trainer will raise an error if the memory of the brain is set incorrectly (#273)	7 years ago
vincentpierre	b7f787f6	Bug fix on range of observations	7 years ago
GitHub	51621334	State Stacking & Banana Environment (#262) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 years ago
vincentpierre	a54e459c	Partial fix on the LSTM. The recurrent encoding now happens at the end	7 years ago
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 years ago
Arthur Juliani	36e95a95	Add additional explanation for time horizon	7 years ago
Arthur Juliani	8d6c57b9	Gridworld should have an inference wait time	7 years ago
GitHub	00534390	Refactored GridWorld (#225) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 years ago
vincentpierre	89019f26	Merge branch 'development-0.3' of https://github.com/Unity-Technologies/ml-agents into development-0.3	7 years ago
GitHub	09714460	Update Readme.md Include the How to Instantiate and destroy agents	7 years ago
vincentpierre	b29aac1f	Typo fix	7 years ago
vincentpierre	b37cf8b9	Added a document on how to instantiate and destroy agents	7 years ago
vincentpierre	56c8914f	An agent can now spawn an agent in AgentStep()	7 years ago
vincentpierre	4fcc6fbc	Fix so we can now destroy the agent in AgentOnDone()	7 years ago
vincentpierre	15f29084	Fix on the SetCumulativeReward() method in Agent.cs	7 years ago
GitHub	59a2bbe0	Improve memory management (#180) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 years ago
Arthur Juliani	216888ee	Fixed to give lesson index parameter at startup (#179) * fixed to give lesson parameter at startup * applied to PPO.ipynb and modified ppo.py a bit	7 years ago
vincentpierre	a7de9336	Revert previous commit	7 years ago
vincentpierre	d77cfc6d	Fix Cumulative reward reset	7 years ago
GitHub	d9831a99	Add additional features to list	7 years ago
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox. Added a Handshake method for the communicator. The academy will try to handshake regardless of the brains present. Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix: The environment only requests actions from external brains when unique * added warning in case no brains are set to external * fix on the instantiation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0. Bug fix for discrete broadcast action (the action size should be one in Agents.cs). Modified Tennis so that the default action is no action. Modified the TemplateDecision.cs to ensure non-null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 years ago
Arthur Juliani	e25aa997	Corrected state space of tennis environment (#160)	7 years ago
GitHub	98765705	Update Training-on-Amazon-Web-Service.md	7 years ago
GitHub	e70707ae	Change worker num to worker id	7 years ago
GitHub	bb61aef9	Merge pull request #128 from dcsilver/patch-1 Fixed object name discrepancy	7 years ago
GitHub	52d18cb8	Fixed object name discrepancy	7 years ago
GitHub	adf836d5	Update Unity-Agents---Python-API.md	7 years ago
GitHub	848ff2ca	New Windows 10 installation guide	7 years ago
Arthur Juliani	06d9bbec	Log lesson in TensorBoard	7 years ago
Arthur Juliani	43ac4148	Clarify introduction doc; fix broken links. (#123)	7 years ago
Arthur Juliani	4a11c005	Add curriculum code to notebook and simplify	7 years ago
Arthur Juliani	ed26e974	Update Making-a-new-Unity-Environment.md (#114) Fixed some typos	7 years ago
Arthur Juliani	e6696ed3	Don't print	7 years ago
GitHub	3561f003	Update Making-a-new-Unity-Environment.md	7 years ago
Arthur Juliani	b6ce30bf	Add curriculum support to PPO	7 years ago
Arthur Juliani	9d76583e	Fixed typo (#105)	7 years ago
vincentpierre	c16e0ac3	Modified the socket to receive states and images of any size	7 years ago


150 commits (8317a659-4f50-4460-acbb-0518536227d5)