ml-agents

作者	SHA1	备注	提交日期
GitHub	06fa6616	Docs/new semantics (#370 ) * [Semantics] Modified the semantics for the documentation * [Semantics] Updated the images * [Semantics] Made further changes to the docs based of the comments received	7 年前
GitHub	704aab24	[AcademyFirstReset] Changed the first reset logic of the academy to be consistent between training and inference (#369 )	7 年前
GitHub	69481d2d	Imitation Learning Helper (#371 ) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 年前
GitHub	2b66e6fb	Merge pull request #372 from Unity-Technologies/feature/unityfilesasbinaries [git] Use .gitattributes to treat all Unity assets as binaries	7 年前
vincentpierre	edb5ccdb	[git] Use .gitattributes to treat all Unity assets as binaries	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	99103b29	Use `curr_brain_info`	7 年前
GitHub	2bba53b8	Merge pull request #367 from Unity-Technologies/feature/LSTM2 Hallway & LSTM Improvements	7 年前
GitHub	f8d27dc5	Merge branch 'development-0.3' into feature/LSTM2	7 年前
Arthur Juliani	c3644f56	Buffer fix for properly masking gradients	7 年前
GitHub	9ad4182e	Merge pull request #366 from Unity-Technologies/feature/cleanup [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
Arthur Juliani	b8a4f5f1	Add Hallway envronment to validate LSTM models	7 年前
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
GitHub	a3c7b426	Merge pull request #357 from Unity-Technologies/feature/containerization Feature/containerization	7 年前
eshvk	64b6abf6	[Containerization] Docs clean up	7 年前
eshvk	fbb1a3d2	[containerization] Added screenshot of Docker Build Settings	7 年前
eshvk	2d85a873	[containerization] Use image name rather than tag name.	7 年前
eshvk	44a16f6b	Merge branch 'feature/containerization' of https://github.com/Unity-Technologies/ml-agents into feature/containerization	7 年前
eshvk	218887c6	[Containerization] Minor fixes	7 年前
eshvk	75a14ac8	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
eshvk	6c1b6fe5	[Containerization] Minor fixes	7 年前
eshvk	e4ef7ea3	[containerization] updated docs per Vince and Yuan's comments	7 年前
eshvk	b4bad6bb	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
eshvk	9345614c	[cleanup] Use debug mode for some log messages	7 年前
eshvk	6a19ae80	[containerization] updated docs per Vince and Yuan's comments	7 年前
Arthur Juliani	cbe42506	More text changes	7 年前
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 年前
Arthur Juliani	9b2f85c5	Changes to documentation	7 年前
eshvk	5796da0e	[Cleanup] Remove unnecessary epsilon placeholder from crawler scene	7 年前
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 年前
GitHub	0277039d	Fix Basic Environment & Discrete States (#356 ) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 年前
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353 ) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 年前
GitHub	5e8ba256	Use Time.captureFramerate to ensure synchrony between update and fixed update (#341 )	7 年前
Arthur Juliani	2b8ad888	[Docs] Update Balance Ball experiment eliminating graph placeholders (#338 )	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	d1cf3030	Merge pull request #309 from Unity-Technologies/dev-imitation Miscellaneous Fixes	7 年前
Arthur Juliani	c42eff57	Misc fixes	7 年前
Arthur Juliani	4418421a	Rename variables in imitation trainer	7 年前
Arthur Juliani	6ad7f010	Fix for discrete control image observations	7 年前
Arthur Juliani	3fca9b66	Set maxStepReached to false on reset	7 年前
GitHub	e676017b	Reorganize learn.py (#302 ) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	517197bc	Update Instantiating-Destroying-Agents.md	7 年前
GitHub	f8a8b112	Move epsilon generation into graph (#283 )	7 年前
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276 )	7 年前
Arthur Juliani	f2d30f07	The internal Brain now can effectively modify the value field of the agents (#275 ) * Requires training to have been made with ppo * The name of the tensor must be value_estimate	7 年前
Arthur Juliani	3b8755d2	fixes on imitation trainer, now works with demo (#274 )	7 年前
Arthur Juliani	7bf0c888	trainer will raise an error if the memory of the brain is set wrong (#273 )	7 年前
vincentpierre	b7f787f6	bug fix on range of observations	7 年前

... 2 3 4 5 6 ...

335 次代码提交 (9163a54a-2950-4e7c-bef1-95b62c96b95f)