ml-agents

作者	SHA1	备注	提交日期
GitHub	69481d2d	Imitation Learning Helper (#371 ) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 年前
GitHub	2b66e6fb	Merge pull request #372 from Unity-Technologies/feature/unityfilesasbinaries [git] Use .gitattributes to treat all Unity assets as binaries	7 年前
vincentpierre	edb5ccdb	[git] Use .gitattributes to treat all Unity assets as binaries	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	99103b29	Use `curr_brain_info`	7 年前
GitHub	2bba53b8	Merge pull request #367 from Unity-Technologies/feature/LSTM2 Hallway & LSTM Improvements	7 年前
GitHub	f8d27dc5	Merge branch 'development-0.3' into feature/LSTM2	7 年前
Arthur Juliani	c3644f56	Buffer fix for properly masking gradients	7 年前
GitHub	9ad4182e	Merge pull request #366 from Unity-Technologies/feature/cleanup [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
Arthur Juliani	b8a4f5f1	Add Hallway envronment to validate LSTM models	7 年前
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
GitHub	a3c7b426	Merge pull request #357 from Unity-Technologies/feature/containerization Feature/containerization	7 年前
eshvk	64b6abf6	[Containerization] Docs clean up	7 年前
eshvk	fbb1a3d2	[containerization] Added screenshot of Docker Build Settings	7 年前
eshvk	2d85a873	[containerization] Use image name rather than tag name.	7 年前
eshvk	44a16f6b	Merge branch 'feature/containerization' of https://github.com/Unity-Technologies/ml-agents into feature/containerization	7 年前
eshvk	218887c6	[Containerization] Minor fixes	7 年前
eshvk	75a14ac8	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
eshvk	6c1b6fe5	[Containerization] Minor fixes	7 年前
eshvk	e4ef7ea3	[containerization] updated docs per Vince and Yuan's comments	7 年前
eshvk	b4bad6bb	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
eshvk	9345614c	[cleanup] Use debug mode for some log messages	7 年前
eshvk	6a19ae80	[containerization] updated docs per Vince and Yuan's comments	7 年前
Arthur Juliani	cbe42506	More text changes	7 年前
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 年前
Arthur Juliani	9b2f85c5	Changes to documentation	7 年前
eshvk	5796da0e	[Cleanup] Remove unnecessary epsilon placeholder from crawler scene	7 年前
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 年前
GitHub	0277039d	Fix Basic Environment & Discrete States (#356 ) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 年前
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353 ) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 年前
GitHub	5e8ba256	Use Time.captureFramerate to ensure synchrony between update and fixed update (#341 )	7 年前
Arthur Juliani	2b8ad888	[Docs] Update Balance Ball experiment eliminating graph placeholders (#338 )	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	d1cf3030	Merge pull request #309 from Unity-Technologies/dev-imitation Miscellaneous Fixes	7 年前
Arthur Juliani	c42eff57	Misc fixes	7 年前
Arthur Juliani	4418421a	Rename variables in imitation trainer	7 年前
Arthur Juliani	6ad7f010	Fix for discrete control image observations	7 年前
Arthur Juliani	3fca9b66	Set maxStepReached to false on reset	7 年前
GitHub	e676017b	Reorganize learn.py (#302 ) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	517197bc	Update Instantiating-Destroying-Agents.md	7 年前
GitHub	f8a8b112	Move epsilon generation into graph (#283 )	7 年前
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276 )	7 年前
Arthur Juliani	f2d30f07	The internal Brain now can effectively modify the value field of the agents (#275 ) * Requires training to have been made with ppo * The name of the tensor must be value_estimate	7 年前
Arthur Juliani	3b8755d2	fixes on imitation trainer, now works with demo (#274 )	7 年前
Arthur Juliani	7bf0c888	trainer will raise an error if the memory of the brain is set wrong (#273 )	7 年前
vincentpierre	b7f787f6	bug fix on range of observations	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
vincentpierre	a54e459c	partial fix on the lstm The recurrent encoding now happens at the end	7 年前

... 6 7 8 9 10 ...

533 次代码提交 (b1a30f84-1e7c-4a55-97fe-fc7ff0b42bed)