ml-agents

Author	SHA1	Message	Commit date
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 years ago
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Decision	7 years ago
Arthur Juliani	22d931c0	Add comments to Reacher and re-train model w/ epsilon needed	7 years ago
GitHub	b1d6172f	[Retrained models] Of GridWorld and Tennis (#410)	7 years ago
Vincent Gao	153f723d	Updated the Reacher's Vector Observation's space size from 24 to 26, also in Internal brain mode, Vector observation node name from "state" to "vector_observation"	7 years ago
Vincent Gao	46ee2708	small fix	7 years ago
Vincent Gao	4f2ea42a	Changed the Reacher's brain type to Player and set the control to up, down, left, right arrows.	7 years ago
GitHub	976c56c5	Environment Aesthetic Unification (#459) * Aesthetic unification * Add new environment images	7 years ago
GitHub	c1e930b5	Minor Visual Changes for Environments (#470) * Minor changes to ensure a common visual language. * Agents are blue (or additionally red in competitive scenarios). * Interactable objects are orange. * Goals are green when objects, and checkerboards when places. * Not everything perfectly follows this, but things are mostly consistent now. * Renamed "Banana" folder to "BananaCollectors" * Ensured all brains were set to "Player" * Moved non-shared assets out of the "SharedAssets" folder.	7 years ago
GitHub	c7a4ae7e	[Fixed] Reacher internal brain now takes vector_observation (#484)	7 years ago
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the PDF of the Gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within Python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
GitHub	0c417c55	Release v0.5 (#1202)	6 years ago
Vincent(Yuan) Gao	7ce0b834	Add Brains for Pyramids, Reacher, SoccerTwos, Tennis, Bouncer, and CrawlerDynamic (#1313) * New brains for Pyramid scene * Add reacher brains * New brains for Soccer agents * New Tennis Brains * Set prefabs correctly * New brains for bouncer * New Dynamic Crawler Brains	6 years ago
vincentpierre	6843dac6	Release v0.6 marwan tf (#1351) * Adding model for 3D Balance Ball. * Adding LearningBrain to BroadCast Hub. * Removed CrawlerPlayer Brain * Renamed CrawlerLearning —> CrawlerStaticLearning * Update Hallway models * Attaching model to brain for Hallway * Attaching model to 3DBall Brain. * Updated CrawlerLearning —> CrawlerStaticLearning on trainer config. * Adding Reacher model * Remove model specification in Hallway Brain asset * Removing model specification from 3Dball scene * Adding crawler model file * Specifying learning brain as default for crawler	6 years ago
Ervin T	dba466e3	Reset Parameters implemented for Pushblock, Reacher and Walker (#2322) Pushblock: dynamic_friction, static_friction, block_drag, block_scale Reacher: Gravity, non-linear goal movement Walker: Gravity, torso mass	5 years ago
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 years ago
Vilmantas Balasevicius	78389481	Somewhat working Articulated Reacher, however still WIP: training performance is way different from standard RB version.	5 years ago
GitHub	24ba9d58	Develop deprecate broadcasting (#2669) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functionality. * All unit tests are running * Modifie...	5 years ago
GitHub	39f1f310	Don't inherit from Academy, remove virtual methods (#3184)	5 years ago
GitHub	4269447e	Convert Academy to a singleton (#3210)	5 years ago

20 commits (d993c549-0b16-4f42-8e8e-5bea39334e27)