* [Semantics] Modified the semantics for the documentation
* [Semantics] Updated the images
* [Semantics] Made further changes to the docs based on the comments received
This PR makes the following changes:
* Moves clipping of the continuous-control model output into the model itself. Output is now always in [-1, 1].
* Internal model values are now clipped to [-3, 3] before being rescaled to [-1, 1] for output. This improves training performance by providing a wider range of values within which the pdf of the Gaussian can fall; output in [-1, 1] is used to be more environment-creator friendly.
* Fixes an issue where epsilon was erroneously used to reconstruct old probabilities during the PPO update, leading to reduced learning performance.
* Introduces a ScaleAction() function in Python to easily rescale values from [-1, 1] to an arbitrary range (see the sketch after this list).
* Re-trains all CC models using the improved algorithm. All performance levels are equal or improved; in the case of Crawler, the improvement is drastic.
* Updates documentation appropriately.
* Makes miscellaneous minor code-style and optimization improvements within the environments.
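A minimal sketch of the two mappings described above, with hypothetical names; the Python ScaleAction() helper performs the same linear map shown in ScaleAction below:

```csharp
using UnityEngine;

public static class ActionScaling
{
    // Clamp the raw network output to [-3, 3], then rescale to [-1, 1]
    // for the environment-facing action.
    public static float ClipAndRescale(float rawAction)
    {
        return Mathf.Clamp(rawAction, -3f, 3f) / 3f;
    }

    // Linear map from [-1, 1] onto an arbitrary [min, max] range,
    // mirroring what the Python ScaleAction() helper does.
    public static float ScaleAction(float action, float min, float max)
    {
        return min + (action + 1f) * 0.5f * (max - min);
    }
}
```

For example, ScaleAction(0f, 0f, 10f) returns 5: the midpoint of [-1, 1] maps to the midpoint of the target range.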
The calculation of the observation vector is faulty. The old calculation does not reflect the distances to the edges, and it does not only yield results between -1 and 1. Since the distance calculation would have been difficult to do in one line, I replaced it with the relative position of the ball (using only two vectors instead of four). I ran 500K-step reinforcement-learning training runs before and after the change and got enormously improved results. Contact me for screenshots of the TensorBoard, or just use the debugger and do the math.
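A hedged sketch of the kind of observation described (the agent class, field names, and the exact pair of vectors are hypothetical): observing the ball's state relative to the agent keeps values in a small, consistent range, unlike hand-computed per-edge distances:

```csharp
using UnityEngine;
using MLAgents;

public class BallObservationAgent : Agent
{
    public Transform ball;     // assigned in the Inspector (hypothetical)
    public Rigidbody ballBody; // assigned in the Inspector (hypothetical)

    public override void CollectObservations()
    {
        // Two vectors instead of four: relative ball position and velocity.
        AddVectorObs(ball.position - transform.position);
        AddVectorObs(ballBody.velocity);
    }
}
```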
* Initial Commit
Ported most functionality; still need to:
- Documentation
- Add Comments
- Custom drawer for BrainParameters
- Fix the UnitTests
- Review Functionalities
* Added Custom Drawer for the Brain Parameters
* Improvements to the HubDrawer
* Modified the Brain Editors
* Minor bug fixes and UI changes
* Modified the Help Boxes of the Drawers
* Modified Brain class, renamed Initialize and made DecideAction virtual
* Fix the UnityTests
* Simpler Brain creation menu
* Renamed Internal Brain to Learning Brain
* modified the parameters to remove reference to External or Internal in the Protobuf objects
* Updated the protobuf generated files
* Fix the Pytests
* Removed the graph scope from the Learning Brain
* Cleaner logic than try/catch
* Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub
* Modified how the Brain finds the A...
The check for whether an agent has fallen off the platform used the wrong value of 1 instead of 0.
This meant that the agent immediately started in a falling state and entered a thrashing cycle of resetting itself.
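A minimal sketch of the corrected check, assuming the platform surface sits at y = 0 and using the Agent API of that era (the agent class is hypothetical):

```csharp
using UnityEngine;
using MLAgents;

public class PlatformAgent : Agent
{
    public override void AgentAction(float[] vectorAction, string textAction)
    {
        // ... apply movement from vectorAction ...

        // The threshold must be 0, not 1: with 1, an agent spawned below
        // y = 1 is immediately treated as fallen and resets in a loop.
        if (transform.position.y < 0f)
        {
            SetReward(-1f);
            Done();
        }
    }
}
```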
* Documentation Update
* addressed comments
* new images for the recorder
* Improvements to the docs
* Address the comments
* Core_ML typo
* Updated the links to inference repo
* Put back Inference-Engine.md
* Fix typos: brain
* Re-add deleted file
* Fix typos
* Addressed comments
* Simplified rewards and observations; Determined better settings for training within a reasonable amount of time.
* Simplified Agent rewards; Added training section that discusses hyperparameters.
* Added note about DecisionFrequency.
* Updated screenshots and a small clarification in the text.
* Tested and updated using v0.6.
* Update a couple of images, minor text edit.
* Replace with more recent training stats.
* Resolve a couple of minor review comments.
* Increased the recommended batch and buffer size hyperparameter values.
* Fix 2 typos.
* Wording and filepath changes to tutorials
* Retake editor images to match v0.6
Retake editor images so that the filepaths and Brain names match what they actually are.
* Add blurb about using the --load flag in the intro guide, and typo fix.
* Add section in tutorial to create multiple area learning environment.
* Add mention of Done() method in agent design
* Update Learning-Environment-Create-New.md
Section: Final Editor Setup, Step 3. It says:
Drag the Brain RollerBallPlayer from the Project window to the RollerAgent Brain field.
Should say:
Drag the Brain RollerBallBrain from the Project window to the RollerAgent Brain field.
* Develop black format fix (#1998)
* fixed the format
* changed the circleci config
* [Gym] Added no_graphics argument (#1997)
> Added the no_graphics argument to the gym interface. #1413
* [Documentation] SetReward method (#1996)
Added a paragraph to the docs/Learning-Environment-Design-Agents.md document regarding the use of SetReward and how it differs from AddReward.
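A short illustration of the difference (hypothetical agent and values): AddReward() accumulates onto the reward for the current step, while SetReward() overwrites whatever has accumulated:

```csharp
using MLAgents;

public class RewardExampleAgent : Agent
{
    void ScoreStep()
    {
        AddReward(0.1f);   // accumulated reward this step: 0.1
        AddReward(0.05f);  // accumulated reward this step: 0.15
        SetReward(-1.0f);  // overwrites the accumulated value: now -1.0
    }
}
```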
* [Documentation] Added information for the environments the trainer cannot train with the default configurations (#1995)
* Format gym_unity using black
* Update Learning-Environment-Create-New.md
- Clarify that training is done in the original ml-agents project folder
- Remove a typo
- In the future it could help to show the user that they can copy the config folder and run training in a new project folder so they don't have to mix project settings in the original config folder
* Update Learning-Environment-Create-New.md
Add file paths
* Feature Deprecation : Online Behavioral Cloning
In this PR :
- Delete the online_bc_trainer
- Delete the tests for online bc
- delete the configuration file for online bc training
* Deleting the BCTeacherHelper.cs Script
TODO :
- Remove usages in the scene
- Documentation Edits
*DO NOT MERGE*
* IMPORTANT : REMOVED ALL IL SCENES
- Removed all the IL scenes from the Examples folder
* Removed all mentions of online BC training in the Documentation
* Made a note in the Migrating.md doc about the removal of the Online BC feature.
* Modified the Academy UI to remove the control checkbox and replace it with a "train in the editor" checkbox
* Removed the Broadcast functionality from the non-Learning brains
* Bug fix
* Note that the scenes are broken since the BroadcastHub has changed
* Modified the LL-API for Python to remove the broadcasting functionality.
* All unit tests are running
* Modified the scen...
* 1 to 1 Brain to Agent
This is a work in progress.
In this PR :
- Deleted all Brain Objects
- Moved the BrainParameters into the Agent
- Gave the Agent a Heuristic method (see Balance Ball for an example, and the sketch after this entry)
- Modified the Communicator and ModelRunner: Put() can only take one agent at a time
- Made the IBrain interface with RequestDecision and DecideAction methods
No changes made to Python
[Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#)
* Removing editorconfig
* Updating BalanceBall scene
* grammar mistake
* Clearing the Agents from the ModelRunner
* Added Documentation on IBrain
* Modified comments on GiveModel
* Introduced a factory
* Split Learning Brain in two
* Changes to walljump
* Fixing the Unit tests
* Renaming the Brain to Policy
* Heuristic now has priority over training
* Edited code comments
* Fixing bugs
* Develop one to one scene edits...
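A sketch of what the Agent Heuristic method introduced above might look like, Balance Ball-style (class name, action axes, and input bindings are hypothetical):

```csharp
using UnityEngine;
using MLAgents;

public class BalanceBallAgent : Agent
{
    // Heuristic fills the action vector from player input so the agent can
    // be driven by hand; per this PR, it takes priority over training.
    public override float[] Heuristic()
    {
        var action = new float[2];
        action[0] = -Input.GetAxis("Horizontal"); // tilt around z (hypothetical)
        action[1] = Input.GetAxis("Vertical");    // tilt around x (hypothetical)
        return action;
    }
}
```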
Convert the UnitySDK to a Packman Package.
- Separate Examples into a sample project.
- Move core UnitySDK Code into com.unity.ml-agents.
- Create asmdefs for the ml-agents package.
- Add package validation tests for win/linux/mac.
- Update protobuf generation scripts.
- Add Barracuda as a package dependency for ML-Agents. (users no longer have to install it themselves).
* Add the VectorSensor to the CollectObservations call (see the sketch at the end of this entry)
* Example of API change for BalanceBall
* Modified the Examples
* Changes to the migrating doc
* Editing the docs
* Update docs/Learning-Environment-Design-Agents.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* Update docs/Migrating.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* Update docs/Migrating.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* Update docs/Getting-Started-with-Balance-Ball.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* addressing comments
* Removed the MLAgents.Sensor namespace
* Removing the MLAgents.Sensor namespace from the tests
* Editing the migrating docs
Co-authored-by: Chris Elion <celion@gmail.com>
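The sketch referenced above: a hedged before/after of the CollectObservations change for a BalanceBall-style agent (the observed value is illustrative):

```csharp
using MLAgents;

public class BallAgent : Agent
{
    // Before the change (no longer compiles against the new API):
    //   public override void CollectObservations()
    //   {
    //       AddVectorObs(transform.rotation.z);
    //   }

    // After: a VectorSensor is passed in and collects the observations.
    public override void CollectObservations(VectorSensor sensor)
    {
        sensor.AddObservation(transform.rotation.z);
    }
}
```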
* Update Learning-Environment-Create-New.md (#3356)
* Update Learning-Environment-Create-New.md
In the "Final Editor Setup" , I think their should be a Step to add Decision Parameters Script and it says Decision Period from 1 to 20.
Without this their was no action taken by the RolerAgent. After adding this step it worked for me.
* Update docs/Learning-Environment-Create-New.md
Co-Authored-By: Chris Elion <celion@gmail.com>
* Update docs/Learning-Environment-Create-New.md
Co-Authored-By: Chris Elion <celion@gmail.com>
Co-authored-by: Chris Elion <celion@gmail.com>
* migration fixes
Co-authored-by: Medhavi Monish <39962268+MedhaviMonish@users.noreply.github.com>
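A hedged note on why that step matters: without a Decision Requester (or an explicit RequestDecision() call), the agent never asks for actions. Adding the component from code might look like this (attach to the Agent's GameObject; values illustrative):

```csharp
using UnityEngine;
using MLAgents;

public class DecisionRequesterSetup : MonoBehaviour
{
    void Awake()
    {
        // Equivalent to adding the Decision Requester in the Inspector;
        // Decision Period ranges from 1 to 20.
        var requester = gameObject.AddComponent<DecisionRequester>();
        requester.DecisionPeriod = 5;
    }
}
```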
* [skip ci] Renamed methods in the Agent class
WARNING: a user who overrides one of the obsolete methods will see the message: Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. The compiler will not suggest the new method to override instead.
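A self-contained illustration of the compiler behavior described (the base class and method name are hypothetical stand-ins, not the actual ML-Agents API):

```csharp
using System;

public class BaseAgent
{
    [Obsolete("Use the renamed method instead.")]
    public virtual void AgentReset() { }
}

public class MyAgent : BaseAgent
{
    // Compiling this override produces warning CS0672:
    // "Member 'MyAgent.AgentReset()' overrides obsolete member
    // 'BaseAgent.AgentReset()'. Add the Obsolete attribute to
    // 'MyAgent.AgentReset()'." -- the compiler does not point the
    // user at the new method to override instead.
    public override void AgentReset() { }
}
```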
* [skip ci] Updated the example environment
* [skip ci] Updated migrating and changelog
* [skip ci] Editing the docs
* [skip ci] Missing docs
* :+1:
* Update docs/Getting-Started-with-Balance-Ball.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* Update docs/Learning-Environment-Create-New.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* Update docs/Learning-Environment-Create-New.md
Co-Authored-By: Chris Elion <chris.elion@unity3d.com>
* [skip ci] documentation changes
* [skip ci] Update docs/Getting-Started-with-Balance-Ball.md
* [skip ci] Update docs/Getting-Started-with-Balance-Ball.md
* [skip ci] Update docs/Gett...
* Merge agent & best practices doc. Plus other fixes
* Fix overly long lines
* Merge Getting Started and Basic Guides
* Rename guide and update links appropriately
* Fix broken link
* Improvements to Learning-Environment-Create-New.md
- Changed the ordered list to use "1."
- Trimmed down text
- Removed reference to materials as those are in the Example Envs project
* Incorporated PR feedback + new images.
* factor in feedback
removed unnecessary configs
updated the agent image
* Formatting fix
* Improvements to Key Components section of ML-Agents Overview
- Moved some documentation from Learning-Environment-Design.
- Added the trainers vs LL-API separation.
- Made a note about gym-unity.
- Some update to the Agent/Behavior sections
- Updated diagrams to reflect new side channels. Made Behavior type a consistent color.
* Reorganizing the overview file and creating new (empty) sections
This change defines the new structure for the overview doc. Subsequent commits will fill in the sections and rewrite existing sections.
* Reorganizing the main Training ML-Agents page
Re-organizes into feature-specific sections that somewhat mirror the previous commit of reorganizing the overview doc.
Subsequent commits will populate these empty sections.
* Adding Deep RL
- Update ML-Agents-Overview with description of DeepRL training algorithms
- Describe the common and trainer-specific hyperparameters in Training-ML-Agents.
- Removed ...
mlagents.trainers.exception.UnityTrainerException: The hyper-parameter memory_size could not be found for the <class 'mlagents.trainers.ppo.trainer.PPOTrainer'> trainer of brain RollerBall.
* Update Dockerfile
* Separate send environment data from reset (#4128)
* Fixed a typo on ML-Agents-Overview.md (#4130)
Removed a redundant "to" from the sentence; it was probably a typo in the document.
* Updated the badge’s link to point to the newest doc version
* Updated all of the docs to point to release_3_doc
* Fix 3DBall and 3DBallHard SAC regressions (#4132)
* Move memory validation to settings
* Update docs
* Add settings test
* Update to release_3 in installation.md (#4144)
* rename to SideChannelManager +backcompat (#4137)
* Remove comment about logo with --help (#4148)
* [bugfix] Make FoodCollector heuristic playable (#4147)
* Make FoodCollector heuristic playable
* Update changelog
* script to check for old release links and references (#4153)
* Remove package validation suite from Project (#4146)
* RayPerceptionSensor: handle empty and invalid tags (#4155...
* doc updates
getting started page now uses a consistent run-id
re-ordered the create-new docs to reduce back-and-forth between Unity and the text editor
* add a link explaining decisions at the point where we tell the reader to modify the decision parameter
* Updated Learning-Environment-Create-New.md with a section on parallel unity instances.
* Added trailing whitespace to Learning Environment Create New md file.
* Added trailing whitespace to Learning Environment Create New md file after fixes.
* Minor updates.
* Minor updates.
* Whitespace fixes.