ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
GitHub	06fa6616	Docs/new semantics (#370 ) * [Semantics] Modified the semantics for the documentation * [Semantics] Updated the images * [Semantics] Made further changes to the docs based of the comments received	7 年前
Joe Ward	90c451c1	Workd imitaion learning into a few corners + light edit.	7 年前
Joe Ward	36a95b8e	Review fixes; added decision section to Agents doc.	7 年前
Marwan Mattar	3ceaa337	Folded ODD Feature Into Agents	7 年前
Marwan Mattar	1ecc8cf9	Removed lingering link to old page.	7 年前
Marwan Mattar	7152afa1	Added a reference to Monitor in Agents / Readme - Also added references to Brain subpages	7 年前
GitHub	529fa311	Feature/docs visual obs (#456 ) * [Documentation] Added description on how to add visual observations * [Documentation] Forgot a paragraph * [Documentation] Addressed comments * [Documentation] Addressed comments, again	7 年前
Marwan Mattar	c462b16f	Removed documentation comments. - Added to an internal Trello card to address.	7 年前
Marwan Mattar	0416aed5	Spell check on github docs Also changed pip to pip3 in Training on AWS page.	7 年前
Marwan Mattar	c471ceca	Fixed code formating and links.	7 年前
Vincent Gao	0e7c88ee	refactored the quick start and installation guide, added faq	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	fe2d3fdc	Update document (#875 )	6 年前
Arthur Juliani	ebd9dab4	Update documentation appropriately	6 年前
Arthur Juliani	bee52bce	Additional documentation changes	6 年前
unityjeffrey	6ed6b8d6	updated ml-agents to ml-agents toolkit where appropriate	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
GitHub	ded0d8c7	Develop action masking (#1080 ) * [Initial Commit] Modified the model.py file and the ppo/trainer.py file to use masked actions * Preliminary modifications to the python side of the code to enable action masking * Preliminary modifications to the C# side of the code to enable action masking * Preliminary modifications to the communication side of the code to enable action masking * Implemented action masking for BC Note : The actions of the teacher are not masked * More error messages for the action masking * fix pytests * Added Documentation * Address comment * Addressed Comments on docs * Addressed second comment on docs * Addressed comments for the python side of the code * Created the action masker and associated unit tests * Addressed comments on the C# side * Addressed the comment regarding action_masking_name * Addressed the comments	6 年前
Deric Pang	40f4eb3e	Cleaning up documentation.	6 年前
GitHub	d7224351	Brains as Scriptable Objects (#1250 ) * Initial Commit Ported most functionalities, still need to : - Documentation - Add Comments - Custom drawer for BrainParameters - Fix the UnitTests - Review Functionalities * Added Custom Drawer for the Brain Parameters * Improvements to the HubDrawer * Modified the Brain Editors * Minor bug fixes and UI changes * Modified the Help Boxes of the Drawers * Modified Brain class, renamed Initialize and made DecideAction virtual * Fix the UnityTests * Simpler Brain creation menu * Renamed Internal Brain to Learning Brain * modified the parameters to remove reference to External or Internal in the Protobuf objects * Updated the protobuf generated files * Fix the Pytests * Removed the graph scope from the Learning Brain * cleaner logic than try catch * Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub * Modified how the Brain finds the A...	6 年前
GitHub	bd4a8db2	Documentation Update (#1339 ) * Documentation Update * addressed comments * new images for the recorder * Improvements to the docs * Address the comments * Core_ML typo * Updated the links to inference repo * Put back Inference-Engine.md * fix typos : brain * Readd deleted file * fix typos * Addressed comments	6 年前
GitHub	5a29fd25	Documentation tweaks and updates (#1479 ) * Add blurb about using the --load flag in the intro guide, and typo fix. * Add section in tutorial to create multiple area learning environment. * Add mention of Done() method in agent design	6 年前
Vincent(Yuan) Gao	981602ad	Update Learning-Environment-Design-Agents.md (#1659 ) * Update Learning-Environment-Design-Agents.md * Space typo * Word change	6 年前
Vincent-Pierre BERGES	cb05a860	Rename decision frequency to interval (#1697 )	6 年前
Arthur Juliani	bf10ed81	Add line to describe what Branch Descriptions is (#1772 )	6 年前
GitHub	6f8fc130	External Contribution: Use RenderTexture instead of Camera for Visual Observation (#1824 ) * Added RenderTexture support for visual observations * Cleaned up new ObservationToTexture function * Added check for to width/height of RenderTexture * Added check to hide HelpBox unless both cameras and RenderTextures are used * Added documentation for Visual Observations using RenderTextures * Added GridWorldRenderTexture Example scene * Adjusted image size of doc images * Added GridWorld example reference * Fixed missing reference in the GridWorldRenderTexture scene and resaved the agent prefab * Fix prefab instantiation and render timing in GridWorldRenderTexture * Added screenshot and reworded documentation * Unchecked control box * Rename renderTexture * Make RenderTexture scene default for GridWorld Co-authored-by: Mads Johansen <pyjamads@gmail.com>	6 年前
GitHub	0d6a24c5	[Documentation] SetReward method (#1996 ) Added a paragraph in the docs/Learning-Environment-Design-Agents.md document regarding the use of SetReward and how it is different from AddReward	6 年前
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583 ) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 年前
Chris Elion	7a178f12	Fixed various typos (#2652 ) * Add console log section to Bug Report form (#2566) * Fixed typos	5 年前
GitHub	5f5ccfa0	Feature Deprecation : Online Behavioral Cloning (#2659 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature.	5 年前
Chris Elion	3879c474	Update Learning-Environment-Design-Agents.md (#2764 ) Small wording fix.	5 年前
GitHub	4254cf36	Update Learning-Environment-Design-Agents.md (#2764 ) (#2770 ) Small wording fix.	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
GitHub	d009511a	fix trailing whitespace in markdown (#2786 )	5 年前
GitHub	eab4f702	update visual observations docs (#2787 ) * update visual obs docs in Design Agents * update screenshots with personal editor skin	5 年前
GitHub	05a54c3b	Ray Perception Sensor docs (#2911 ) * docs, migration, timers * add screenshot * remove added whitespace	5 年前
GitHub	0366af0b	Always reset when agent is done (#3222 ) * Removing the AgentOnDone call * removing editor inspector field for ResetOnDone * Documentation changes * addressing comments * addressing comments * adding comments * Migrating steps * inference - fill 0s for done Agents (#3232) * fill 0s for done agents * docstrings * Simplifying the code * Removing GenerateSensorData * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	3f3916da	Fix italics (#3327 )	5 年前
GitHub	26d416ae	Fixing the Docs on On Demand Decision (#3378 )	5 年前
GitHub	5e4a15d1	Clarify curriculum and behavior name (#3380 ) * clarify curriculum and behavior name * doc some other missing fields too	5 年前
GitHub	417329d8	Fix typo in Design Agents docs (#3384 )	5 年前
GitHub	386ba66c	Develop observation collector (#3352 ) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	92a8aed2	Pass action masker as input to CollectObservations (#3389 ) * Sentencing Action masking the same as observations I am rather unsure about the doubling of the CollectObservation methods (and the copy pasta that comes along) Need to edit the documentation and the migrating doc once we agree we want to do this * Addressing the comments * Improvements to the documentation * Editing the documentation	5 年前
Anupam Bhatnagar	d8c79f48	resolving merge conflicts	5 年前
GitHub	9a371b17	[Renaming] SetActionMask -> SetDiscreteActionMask + added the virtual method CollectDiscreteActionMasks (#3525 ) * Code edits * Modified the markdowns * Update com.unity.ml-agents/CHANGELOG.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Renaming files and methods * Addressing comments * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	35df5d95	remove Use Heuristic from docs (#3568 )	5 年前
GitHub	411bb64a	Renaming Agent's methods (#3557 ) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message :Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 年前
GitHub	a6ade9b2	Combine "Best Practices" and "Agents" documentation (#3643 ) * Merge agent & best practices doc. Plus other fixes * Fix overly long lines * Address typos and comments * Address feedback	5 年前
GitHub	08b5a645	Clarify normalization for vectors (#3654 )	5 年前
GitHub	dd6aa7e2	Agent.Heuristic takes an float[] (#3765 )	5 年前
GitHub	a09850fa	Improvements to Getting Started guide (#3774 ) * Improvements to Getting Started guide - Changed the ordered list to use "1." - Trimmed down text - Removed references to Agent APIs * Incorporating feedback * Prettier formatting	5 年前
GitHub	0dff739b	Release mm GitHub docs (#3864 ) * Improvements to Key Components section of ML-Agents Overview - Moved some documentation from Learning-Environment-Design. - Added the trainers vs LL-API separation. - Made a note about gym-unity. - Some update to the Agent/Behavior sections - Updated diagrams to reflect new side channels. Made Behavior type a consistent color. * Reorganizing the overview file and creating new (empty) sections This change defines the new structure for the overview doc. Subsequent commits will fill in the sections and rewrite existing sections. * Reorganizing the main Training ML-Agents page Re-organizes into feature-specific sections that somewhat mirror the previous commit of reorganizing the overview doc. Subsequent commits will populate these empty sections. * Adding Deep RL - Update ML-Agents-Overview with description of DeepRL training algorithms - Decribe the common and trainer-specific hyperparams in Training-ML-Agents. - Removed ...	5 年前
GitHub	759e222e	Several, small documentation improvements (#3903 ) * Several, small documentation improvements - Re-organize main repo README - Minor clean-ups to Python package-specific readme files - Clean-up to Unity Inference Engine page - Update to the docs README - Added a specific cross-platform section in ML-Agents Overview to amplify Barracuda - Updated the links in Limitations.md to point to the specific subsections - Cleaned up the Designing a Learning Environment page. Added an intro paragraph. - Updated the installation guide to specifically call out local installation - A few minor formatting, spelling errors fixed.	5 年前
GitHub	d761a1cc	[MLA-920] add RayLayer mask documentation (#3929 )	5 年前
GitHub	431a4f41	[MLA-1010] ObservableAttribute docs, update Sensor docs (#4058 ) * update observation docs * stacking * update TOC * fix TOC * yo * PR feedback	4 年前
GitHub	b66f5ca8	AddReward before EpisodeEnd (#4064 ) Changed order of AddReward() and EpisodeEnd() in the Rewards example.	4 年前
GitHub	b7eb8b6d	Clarification in the Heuristic() documentation (#4100 ) * Clarification in the Heuristic() documentation The `Heuristic()` method will not be able to write to the action array if the action array passed as argument is reassigned in the method. For example, doing : ```csharp public override void Heuristic(float[] actionsOut) { actionOut = new float[2]; actionOut[0] = 1.0f; } ``` Will not create the action [1, 0] but [0, 0] as the `actionOut` variable was reassigned. * adding to the Agent xml doc	4 年前
GitHub	4eb47e2f	[docs] Update 'Record Demonstrations' documentation (#4432 ) * [docs] Update 'Record Demonstrations' documentation Updates a screenshot and documentation to include the newer `Num Steps To Record` field.	4 年前
Andrew Cohen	10b57d7b	clean up EndEpisode demo code	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	a0d1c829	Action Docs part2 (#4739 ) * reduce usage of "vector action" and "action space" * more cleanup * undo GettingStarted change for now * batch size description * Apply suggestions from code review Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	d4132d51	Add actuator docs to github. (#4824 ) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	2a990e17	Convert 3DBallHard to use Observables (#4913 )	4 年前
vincentpierre	3ae1675c	adding documentation	4 年前
GitHub	b621d087	Update docs/Learning-Environment-Design-Agents.md Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	ec093d45	Update docs/Learning-Environment-Design-Agents.md Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	b745b786	Update docs/Learning-Environment-Design-Agents.md Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	e40ec8d5	Update docs/Learning-Environment-Design-Agents.md Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
vincentpierre	fdf21dbd	addressing some of the comments	4 年前
GitHub	a5e1bf3f	Masking Discrete Actions typos (#4961 ) (#4964 ) Co-authored-by: Philipp Siedler <p.d.siedler@gmail.com>	4 年前
vincentpierre	1d8f57b2	adding the example of enemies and projectiles	4 年前
vincentpierre	e8a08fab	addressing comments	4 年前
Christopher Goy	9cadfa7a	Merge master -> release_13_branch-to-master	4 年前
vincentpierre	e1b94b8b	Merge branch 'master' into develop-var-len-obs-feature	4 年前
GitHub	c9153aa7	Removing Obsolete methods from the package (#5024 ) * Removing Obsolete methods from the package * Missing depecration and modified changelog * Readding the obsolete BrainParameter methods, will need a larger discussion on these * Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters * removing documentation and some calls to deprecated methods in the extensions package * Editing the Changelog to put the unreleased on top	4 年前
GitHub	85f8b40b	Removing some scenes (#4997 ) * Removing some scenes, All the Static and all the non variable speed environments. Also removed Bouncer, PushBlock, WallJump and reacher. Removed a bunch of visual environements as well. Removed 3DBallHard and FoodCollector (kept Visual and Grid FoodCollector) * readding 3DBallHard * readding pushblock and walljump * Removing tennis * removing mentions of removed environments * removing unused images * Renaming Crawler demos * renaming some demo files * removing and modifying some config files * new examples image? * removing Bouncer from build list * replacing the Bouncer environment with Match3 for llapi tests * Typo in yamato test	4 年前
GitHub	4863475c	non-IEnumerable interface for action masking (#5060 )	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	14489fd8	[docs] Documentation for POCA and cooperative behaviors (#5056 ) Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>	4 年前
GitHub	6895ba50	Integrate Group Manager to soccer/retrain with POCA (#5115 )	4 年前
GitHub	15d39af4	[docs] Fix link and add teams/groups to table of contents (#5126 )	4 年前
GitHub	d2ee2e6f	[cherry-pick] Integrate Group Manager to soccer/retrain with POCA (#5115 ) (#5121 ) * Integrate Group Manager to soccer/retrain with POCA (#5115) * Add Soccer env to changelog Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	f7ab0cb0	[cherry-pick][docs] Add Dungeon Escape Environment (#5133 ) * Add DungeonEscape POCA Environment (#5128) * Add DungeonEscape assets from working branch * Add Dungeon Escape docs * Create dungeon_escape.png * Add to docs Co-authored-by: Hunter-Unity <hunter@unity3d.com>	4 年前
Christopher Goy	c9be2433	Removing Obsolete methods from the package (#5024 ) * Removing Obsolete methods from the package * Missing depecration and modified changelog * Readding the obsolete BrainParameter methods, will need a larger discussion on these * Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters * removing documentation and some calls to deprecated methods in the extensions package * Editing the Changelog to put the unreleased on top	4 年前
Christopher Goy	092c2718	non-IEnumerable interface for action masking (#5060 )	4 年前
GitHub	2fcf8425	Documentation for Goal conditioning (#5149 ) * Documentation for Goal conditioning * hyper is the default * Update docs/Training-Configuration-File.md Co-authored-by: Arthur Juliani <awjuliani@gmail.com> * Update docs/Learning-Environment-Design-Agents.md Co-authored-by: Arthur Juliani <awjuliani@gmail.com> * addressing comments: Renaming goal observation to goal signal in docs * addressing comments * Update docs/Learning-Environment-Design-Agents.md Co-authored-by: Ervin T. <ervin@unity3d.com> * Update docs/Learning-Environment-Design-Agents.md * Update docs/Learning-Environment-Design-Agents.md * Update docs/Learning-Environment-Design-Agents.md * Update docs/Learning-Environment-Design-Agents.md * Update docs/Learning-Environment-Design-Agents.md Co-authored-by: Arthur Juliani <awjuliani@gmail.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
GitHub	2980ade0	Goal conditioning grid world : Example of goal conditioning (#5193 ) * Aded the Goal conditioned GridWorld to replace regular gridworld * adding missing files * Code improvements * Documentation change on gridworld * resolving conflicts * new model * Addressing comments * comments and renames * Update docs/Learning-Environment-Examples.md Co-authored-by: Ervin T. <ervin@unity3d.com> * adding reference to gridworld in docs about goal signal Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
GitHub	adfb71c0	doc cleanup (#5309 )	4 年前
GitHub	521facd9	Add GridSensors to the documentation (#5333 ) * add grid sensor to design agent doc	4 年前
GitHub	8bb1fe6a	[WIP] [Fix] Fixing collect observation called on done (#5375 ) * [WIP] [Fix] Fixing collect observation called on done * Update com.unity.ml-agents/Runtime/Agent.cs * ⚠️ Modifying the test of stacking sensor when the agent is done * modifying the documentation for BufferSensor to specify to call AddObservation in the CollectObservations method	4 年前
GitHub	ae5a9836	Update visual stacking doc (#5391 )	3 年前
GitHub	e72c0d48	visual stacking doc typo (#5392 )	3 年前
GitHub	7f86df7c	Editing the GridSensor documentation for 2D use case (#5396 ) * Editing the GridSensor documentation for 2D use case * changing chagelog	3 年前
GitHub	9354ca64	[Release 18] Update versions and links (#5414 )	3 年前

1 2

96 次代码提交 (cd46c9c2-6692-44ed-ba47-4373c2963f36)