ml-agents

Author	SHA1	Message	Commit date
GitHub	51621334	State Stacking & Banana Environment (#262) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 years ago
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision: Use RequestDecision and RequestAction * New Agent Inspector: Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in Python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Decision	7 years ago
GitHub	430a5486	[Semantics] renaming StateType to SpaceType (#382)	7 years ago
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397)	7 years ago
GitHub	addadada	[AddVectorObs] Modified the Examples (#409) * [AddVectorObs] Converted the Examples to use the new AddVectorObs * [AddVectorObs] Converted the Reacher to use the new AddVectorObs * [Improvement] One liner for adding the rotation	7 years ago
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
Arthur Juliani	d4a2df66	Namespacification (#814) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 years ago
sankalp04	2c8bdda0	3D ball reset parameter implementation ported over	5 years ago
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550)	5 years ago
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555)	5 years ago
GitHub	99146e97	1 to 1 Brain to Agent (#2729) * 1 to 1 Brain to Agent This is a work in progress In this PR: - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner: Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 years ago
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801)	5 years ago
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 years ago
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files: - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like this: ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 years ago
GitHub	4269447e	Convert Academy to a singleton (#3210)	5 years ago
GitHub	fed3efdc	Done After Set Reward (#3311)	5 years ago
GitHub	386ba66c	Develop observation collector (#3352) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 years ago
GitHub	f25bf7d3	Reintroduce MLAgents.Sensors namespace (#3509) * Reintroduced the namespace MLAgents.Sensors * Documentation changes * updated the changelog	5 years ago
GitHub	b9bd4df2	Modified some namespaces (#3533) * Added the MLAgents.Demonstrations namespace * Added the MLAgents.Editor namespace * Overrode the .demo.meta files due to the change in namespace * More namespace changes * Added the sidechannels namespace * Modified changelog and migrating docs	5 years ago
GitHub	251aa7b3	Removed the IFloatProperties interface (#3570) * [skip ci] Remove IFloatProperties interface * [skip ci] Update the examples * Updating the documentation	5 years ago
GitHub	411bb64a	Renaming Agent's methods (#3557) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message: Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 years ago
GitHub	6612b496	Deprecating Academy.Instance.FloatProperties (#3696) * Deprecating Academy.Instance.FloatProperties * Made the registered side channels a static property and created the sideChannelUtils class to handle side channel stuff * Clearing the sending message queue in the Academy when the communicator is not on * addressing comments	5 years ago
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807) * Make EnvironmentParameters a first-class citizen in the API Missing: Python counterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 years ago
GitHub	1e0b022f	[MLA-850] rename namespaces to Unity.MLAgents (#3843) * rename in protos * rename in C# * doc changes, migration, changelog * PR numbers * fix standalone test path	5 years ago
GitHub	3d15d51f	apply rider suggestions in Examples (#3846)	5 years ago
Andrew Cohen	c6416bef	3dball hard rew	4 years ago
Andrew Cohen	1c4d9554	fix action dims	4 years ago
yanchaosun	42c9ba43	reuse encoder and linear	4 years ago
GitHub	bb9417f7	Update example environments to use the Actuator API (#4363)	4 years ago
GitHub	2a990e17	Convert 3DBallHard to use Observables (#4913)	4 years ago
GitHub	d047802f	Rider suggested cleanup, part 1 (#5265)	4 years ago

31 commits (bc1fdf07-41d4-4db9-b0d2-b6cf2261a5da)