ml-agents

Author	SHA1	Message	Commit date
eshvk	64b6abf6	[Containerization] Docs clean up	7 years ago
eshvk	fbb1a3d2	[containerization] Added screenshot of Docker Build Settings	7 years ago
eshvk	2d85a873	[containerization] Use image name rather than tag name.	7 years ago
eshvk	44a16f6b	Merge branch 'feature/containerization' of https://github.com/Unity-Technologies/ml-agents into feature/containerization	7 years ago
eshvk	218887c6	[Containerization] Minor fixes	7 years ago
eshvk	75a14ac8	[Hotfix] Upgrade Tensorflow to 1.4.0	7 years ago
eshvk	6c1b6fe5	[Containerization] Minor fixes	7 years ago
eshvk	e4ef7ea3	[containerization] updated docs per Vince and Yuan's comments	7 years ago
eshvk	b4bad6bb	[Hotfix] Upgrade Tensorflow to 1.4.0	7 years ago
eshvk	9345614c	[cleanup] Use debug mode for some log messages	7 years ago
eshvk	6a19ae80	[containerization] updated docs per Vince and Yuan's comments	7 years ago
Arthur Juliani	cbe42506	More text changes	7 years ago
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 years ago
Arthur Juliani	9b2f85c5	Changes to documentation	7 years ago
eshvk	5796da0e	[Cleanup] Remove unnecessary epsilon placeholder from crawler scene	7 years ago
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 years ago
GitHub	0277039d	Fix Basic Environment & Discrete States (#356) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 years ago
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 years ago
GitHub	5e8ba256	Use Time.captureFramerate to ensure synchrony between update and fixed update (#341)	7 years ago
Arthur Juliani	2b8ad888	[Docs] Update Balance Ball experiment eliminating graph placeholders (#338)	7 years ago
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 years ago
GitHub	d1cf3030	Merge pull request #309 from Unity-Technologies/dev-imitation Miscellaneous Fixes	7 years ago
Arthur Juliani	c42eff57	Misc fixes	7 years ago
Arthur Juliani	4418421a	Rename variables in imitation trainer	7 years ago
Arthur Juliani	6ad7f010	Fix for discrete control image observations	7 years ago
Arthur Juliani	3fca9b66	Set maxStepReached to false on reset	7 years ago
GitHub	e676017b	Reorganize learn.py (#302) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 years ago
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 years ago
GitHub	517197bc	Update Instantiating-Destroying-Agents.md	7 years ago
GitHub	f8a8b112	Move epsilon generation into graph (#283)	7 years ago
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276)	7 years ago
Arthur Juliani	f2d30f07	The internal Brain now can effectively modify the value field of the agents (#275) * Requires training to have been made with ppo * The name of the tensor must be value_estimate	7 years ago
Arthur Juliani	3b8755d2	fixes on imitation trainer, now works with demo (#274)	7 years ago
Arthur Juliani	7bf0c888	trainer will raise an error if the memory of the brain is set wrong (#273)	7 years ago
vincentpierre	b7f787f6	bug fix on range of observations	7 years ago
GitHub	51621334	State Stacking & Banan Environment (#262) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 years ago
vincentpierre	a54e459c	partial fix on the lstm The recurrent encoding now happens at the end	7 years ago
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 years ago
Arthur Juliani	36e95a95	Add additional explanation for time horizon	7 years ago
Arthur Juliani	8d6c57b9	Gridworld should have an inference wait time	7 years ago
GitHub	00534390	Refactored GridWorld (#225) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 years ago
vincentpierre	89019f26	Merge branch 'development-0.3' of https://github.com/Unity-Technologies/ml-agents into development-0.3	7 years ago
GitHub	09714460	Update Readme.md Include the How to Instantiate and destroy agents	7 years ago
vincentpierre	b29aac1f	typo fix	7 years ago
vincentpierre	b37cf8b9	added a document on how to instanciate and destroy agents	7 years ago
vincentpierre	56c8914f	An agent can now spawn an agent in AgentStep()	7 years ago
vincentpierre	4fcc6fbc	fix so we can now destroy the agent in AgentOnDone()	7 years ago
vincentpierre	15f29084	fix on the SetCumulativeReward() method in Agent.cs	7 years ago
GitHub	59a2bbe0	Improve memory management (#180) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 years ago
Arthur Juliani	216888ee	Fixed to give lesson index parameter when start up (#179) * fixed to give lesson parameter when start up * applied to PPO.ipynb and modified ppo.py a bit	7 years ago


1220 commits (5465c2e0-8173-4b91-841f-26147d018cd9)