ml-agents

作者	SHA1	备注	提交日期
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	bb82e25d	Revamped Push Block (#404 ) * Adds new revamped Push Block environment. * Adds "Shared Assets" folder to Examples sub-directory.	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
GitHub	976c56c5	Environment Aesthetic Unification (#459 ) * Aesthetic unification * Add new environment images	7 年前
GitHub	c1e930b5	Minor Visual Changes for Environments (#470 ) * Minor changes to ensure a common visual language. * Agents are blue (or additionally red in competitive scenarios). * Interactable objects are orange. * Goals are green when objects, and checkerboards when places. * Not everything perfectly follows this, but things are mostly consistent now. * Renamed "Banana" folder to "BananaCollectors" * Ensured all brains were set to "Player" * Moved non-shared assets out of the "SharedAssets" folder.	7 年前
GitHub	237b41f9	Hotfix 0.3.0c (#618 ) Fixes the following issues: * Missing component reference in BananaRL environment. * Neural Network for multiple visual observations was not properly generated. * Episode time-out value estimate bootstrapping used incorrect observation as input.	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	9ab98584	Additional Environment Variations (#791 ) * Add Visual (Camera) and Imitation Learning variations to example environments	7 年前
GitHub	696c13d2	Fix BananaIL frozen agent material (#865 ) * Fix BananaIL frozen agent material * [Fix] Added texture on the imitation learning scene and linked the models to the internal brains	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
Arthur Juliani	3409ea3a	Remove dependency of prefab on external gameobject	6 年前
Vincent(Yuan) Gao	e8a226a0	Added 3DBall, Banana, Basic, Walker, Hallway and PushBlock (#1312 )	6 年前
Ervin T	5465c2e0	Implemented the reset parameters for Banana Collectors and Bouncer (#2258 ) Banana Collectors: Length of laser and agent scale Bouncer: Size of the banana	5 年前

15 次代码提交 (d993c549-0b16-4f42-8e8e-5bea39334e27)