ml-agents

Author	SHA1	Message	Commit date
eshvk	e33a083f	[docs] Update PPO hyperparameter wordings	7 years ago
GitHub	430a5486	[Semantics] renaming StateType to SpaceType (#382)	7 years ago
GitHub	26a1ed87	Merge pull request #380 from Unity-Technologies/dev-reacher-cleanup Add comments to Reacher and re-train model w/o epsilon needed	7 years ago
Vincent Gao	e0b60e56	removed the tab	7 years ago
GitHub	0838c2bc	Merge pull request #378 from Unity-Technologies/docs/semantics-internal-brain Docs/semantics internal brain	7 years ago
vincentpierre	0d2a5031	[Documentation] cleaned repo and additional semantic changes	7 years ago
Vincent Gao	4a23c5cf	clean up the code in Ball3DDecision	7 years ago
vincentpierre	6c55017e	[FixingPytests] Added the new Semantic and modified the pytest	7 years ago
Arthur Juliani	22d931c0	Add comments to Reacher and re-train model w/ epsilon needed	7 years ago
Vincent Gao	933317be	modified comments	7 years ago
GitHub	d4cd72d8	[timeBetweenDecisions] Reimplementation of waitTime for GridWorld and… (#368) * [timeBetweenDecisions] Reimplementation of waitTime for GridWorld and Basic * [EnvironmentModification] Changed the gridworld TimeBetweenDecisionAtInference	7 years ago
GitHub	a809630f	Add config for crawler, and change crawler scene (#376) * Add config for crawler, and change crawler scene * Changed number of crawlers in scene to 12 * Changed Max-steps for crawlers to 5000 * Newer hyperparameters and newly trained crawler model * Clean up crawler code, and improve efficency	7 years ago
GitHub	171e551b	[Semantics] Modified some fields of the CoreInternalBrain (#373)	7 years ago
GitHub	dcf58f75	Feature/previous text action (#375) * [Previous Text Actions] Renamed previous_action to previous_vector_action added previous_text_action to the BrainInfo * [Semantics] Carried the modifications to the semantics of previous_vector_action to the trainers	7 years ago
GitHub	06fa6616	Docs/new semantics (#370) * [Semantics] Modified the semantics for the documentation * [Semantics] Updated the images * [Semantics] Made further changes to the docs based of the comments received	7 years ago
GitHub	704aab24	[AcademyFirstReset] Changed the first reset logic of the academy to be consistent between training and inference (#369)	7 years ago
GitHub	69481d2d	Imitation Learning Helper (#371) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 years ago
GitHub	2b66e6fb	Merge pull request #372 from Unity-Technologies/feature/unityfilesasbinaries [git] Use .gitattributes to treat all Unity assets as binaries	7 years ago
vincentpierre	edb5ccdb	[git] Use .gitattributes to treat all Unity assets as binaries	7 years ago
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 years ago
GitHub	99103b29	Use `curr_brain_info`	7 years ago
GitHub	2bba53b8	Merge pull request #367 from Unity-Technologies/feature/LSTM2 Hallway & LSTM Improvements	7 years ago
GitHub	f8d27dc5	Merge branch 'development-0.3' into feature/LSTM2	7 years ago
Arthur Juliani	c3644f56	Buffer fix for properly masking gradients	7 years ago
GitHub	9ad4182e	Merge pull request #366 from Unity-Technologies/feature/cleanup [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 years ago
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 years ago
Arthur Juliani	b8a4f5f1	Add Hallway envronment to validate LSTM models	7 years ago
Arthur Juliani	85ae912d	Dev docs (#361) New documentation structure and content.	7 years ago
GitHub	a3c7b426	Merge pull request #357 from Unity-Technologies/feature/containerization Feature/containerization	7 years ago
eshvk	64b6abf6	[Containerization] Docs clean up	7 years ago
eshvk	fbb1a3d2	[containerization] Added screenshot of Docker Build Settings	7 years ago
eshvk	2d85a873	[containerization] Use image name rather than tag name.	7 years ago
eshvk	44a16f6b	Merge branch 'feature/containerization' of https://github.com/Unity-Technologies/ml-agents into feature/containerization	7 years ago
eshvk	218887c6	[Containerization] Minor fixes	7 years ago
eshvk	75a14ac8	[Hotfix] Upgrade Tensorflow to 1.4.0	7 years ago
eshvk	6c1b6fe5	[Containerization] Minor fixes	7 years ago
eshvk	e4ef7ea3	[containerization] updated docs per Vince and Yuan's comments	7 years ago
eshvk	b4bad6bb	[Hotfix] Upgrade Tensorflow to 1.4.0	7 years ago
eshvk	9345614c	[cleanup] Use debug mode for some log messages	7 years ago
eshvk	6a19ae80	[containerization] updated docs per Vince and Yuan's comments	7 years ago
Arthur Juliani	cbe42506	More text changes	7 years ago
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 years ago
Arthur Juliani	9b2f85c5	Changes to documentation	7 years ago
eshvk	5796da0e	[Cleanup] Remove unnecessary epsilon placeholder from crawler scene	7 years ago
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 years ago
GitHub	0277039d	Fix Basic Environment & Discrete States (#356) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 years ago
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 years ago
GitHub	5e8ba256	Use Time.captureFramerate to ensure synchrony between update and fixed update (#341)	7 years ago
Arthur Juliani	2b8ad888	[Docs] Update Balance Ball experiment eliminating graph placeholders (#338)	7 years ago
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 years ago


399 commits (c1e930b5-dddd-42a5-b766-cf6568a9f1cd)