ml-agents

作者	SHA1	备注	提交日期
GitHub	aee5d336	Fix discrete state (#33 ) * made BrainParameters a class to set default values Modified the error message if the state is discrete * Add discrete state support to PPO and provide discrete state example environment * Add flexibility to continuous control as well * Finish PPO flexible model generation implementation * Fix formatting * Support color observations * Add best practices document * bug fix for non square observations * Update Readme.md * Remove scipy dependency * Add installation doc	7 年前
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	d4cd72d8	[timeBetweenDecisions] Reimplementation of waitTime for GridWorld and… (#368 ) * [timeBetweenDecisions] Reimplementation of waitTime for GridWorld and Basic * [EnvironmentModification] Changed the gridworld TimeBetweenDecisionAtInference	7 年前
Vincent Gao	38bd3e40	replaced all the tabs to 4 spaces in the project	7 年前
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397 )	7 年前
Vincent Gao	1bc43933	Merge branch 'development-0.3' into hotfix/issue#333	7 年前
Marwan Mattar	fa638000	Comment improvements & refactoring to Academy.cs Added several class and method-level comments that are compatibale with Doxygen for auto-generation of documentation. In addition to some stylistic and minor code changes (summarized below). Stylistic changes: - Modified comments to /// style instead of /** */ - Removed unnecessary imports - Removed unnecessary “private” declarations - Limited code to 80 characters per line - Re-organized variables to group those that are visible in Inspector (they are now at the top) Code changes: - Renamed ScreenConfiguration to EnvironmentConfiguration (variable only used within Academy.cs, thus no other files needed modification) - Renamed ConfigureEngine to ConfigureEnvironment and created a ConfigureEnvironmentHelper method - Renamed _isCurrentlyInference to modeSwitched to signify when the engine config needs to be changed - Added isCommunicatorOn flag to be explicit about the existence of a communicator - Made isInference private which requ...	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801 )	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	39f1f310	Don't inherit from Academy, remove virtual methods (#3184 )	5 年前
GitHub	4269447e	Convert Academy to a singleton (#3210 )	5 年前
GitHub	0366af0b	Always reset when agent is done (#3222 ) * Removing the AgentOnDone call * removing editor inspector field for ResetOnDone * Documentation changes * addressing comments * addressing comments * adding comments * Migrating steps * inference - fill 0s for done Agents (#3232) * fill 0s for done agents * docstrings * Simplifying the code * Removing GenerateSensorData * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	fed3efdc	Done After Set Reward (#3311 )	5 年前
GitHub	386ba66c	Develop observation collector (#3352 ) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	f25bf7d3	Reintroduce MLAgents.Sensors namespace (#3509 ) * Reintroduced the namespace MLAgents.Sensors * Documentation changes * updated the changelog	5 年前

25 次代码提交 (d993c549-0b16-4f42-8e8e-5bea39334e27)