ml-agents

Author	SHA1	Message	Commit date
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Decision	7 years ago
GitHub	41d32aca	[Bouncer Environment] Now in 3D (#408) * [New Bouncer] Revamped the Bouncer to be in 3D * [Bouncer Configuration file] Added the BouncerBrain configuration * [Documentation] Added the Bouncer to the documentation page * [Fixes] Fixed lines too long and the documentation typo * Slight adjustments to bouncer environment * Don't default to internal brain on bouncer	7 years ago
GitHub	976c56c5	Environment Aesthetic Unification (#459) * Aesthetic unification * Add new environment images	7 years ago
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
GitHub	0c417c55	Release v0.5 (#1202)	6 years ago
Vincent(Yuan) Gao	7ce0b834	Add Brains for Pyramids, Reacher, SoccerTwos, Tennis, Bouncer, and CrawlerDynamic (#1313) * New brains for Pyramid scene * Add reacher brains * New brains for Soccer agents * New Tennis Brains * Set prefabs correctly * New brains for bouncer * New Dynamic Crawler Brains	6 years ago
GitHub	cfb8f208	Release v0.7 minor fixes (#1759) * Fix typo * Updated some of the scenes	6 years ago
Ervin T	5465c2e0	Implemented the reset parameters for Banana Collectors and Bouncer (#2258) Banana Collectors: Length of laser and agent scale Bouncer: Size of the banana	5 years ago
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 years ago
GitHub	24ba9d58	Develop deprecate broadcasting (#2669) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functionality. * All unit tests are running * Modifie...	5 years ago
GitHub	39f1f310	Don't inherit from Academy, remove virtual methods (#3184)	5 years ago
GitHub	4269447e	Convert Academy to a singleton (#3210)	5 years ago

12 commits (48340021-9a5c-4f02-a3db-72091ab63d8e)