ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	cde3c8f7	formating and added documentation	7 年前
vincentpierre	4d3716fe	default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory()	7 年前
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
Arthur Juliani	5e75f5b7	New Tennis env and model	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397 )	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
vincentpierre	83158caf	Fix for the tennis Agent. Works with prefab now	6 年前
vincentpierre	77d62622	Addressed comment	6 年前
GitHub	7703355e	Edited the Tennis code and retrained the model (#1746 ) Addressing #1739	6 年前
sankalp04	c6fba86a	tennis reset parameter implementation ported over	5 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583 ) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801 )	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	4269447e	Convert Academy to a singleton (#3210 )	5 年前
GitHub	14193ada	Self-play for symmetric games (#3194 )	5 年前
GitHub	386ba66c	Develop observation collector (#3352 ) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	f25bf7d3	Reintroduce MLAgents.Sensors namespace (#3509 ) * Reintroduced the namespace MLAgents.Sensors * Documentation changes * updated the changelog	5 年前
GitHub	b9bd4df2	Modified some namespaces (#3533 ) * Added the MLAgents.Demonstrations namespace * Added the MLAgents.Editor namespace * Overrided the .demo.meta files due to the change in namespace * More namespace changes * Added the sidechannels namespace * Modified changelog and migrating docs	5 年前
GitHub	251aa7b3	Removed the IFloatProperties interface (#3570 ) * [skip ci] Remove IFloatProperties interface * [skip ci] Update the examples * Updating the documentation	5 年前
GitHub	411bb64a	Renaming Agent's methods (#3557 ) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message :Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 年前
GitHub	6612b496	Deprecating Academy.Instance.FloatProperties (#3696 ) * Deprecating Academy.Instance.FloatProperties * Made the registered side channels a static property and created the sideChannelUtils class to handle side channel stuff * Clearing the sending message queue in the Academy when the communicaor is not on * addressing comments	5 年前
GitHub	92870d43	Fix missing action axis in Tennis (#3732 ) * Fix missing action axis in Tennis * Update Project/Assets/ML-Agents/Examples/Tennis/Scripts/TennisAgent.cs Fix typo Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
Andrew Cohen	9fed4985	tennis curriculum	5 年前
Andrew Cohen	e4f7f2a6	removed curriculum tennis	5 年前
Andrew Cohen	930d6fa3	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Andrew Cohen	44e6fa7b	soccer 1e8 timesteps/Tennis existential penalty	5 年前
GitHub	dd6aa7e2	Agent.Heuristic takes an float[] (#3765 )	5 年前
Andrew Cohen	80469267	Merge branch 'internal-policy-ghost' into soccer-2v1	5 年前
Andrew Cohen	8431ecb5	tennis reward fix	5 年前
Andrew Cohen	5d659946	update tennis reward function	5 年前
Andrew Cohen	95f3ffab	fix max step tennis	5 年前
Andrew Cohen	47548ee4	tennis curriculum	5 年前
Andrew Cohen	9f36cd36	added floorhit obs tennis	5 年前
Andrew Cohen	8ad1323e	increased scale tennis env	5 年前
Andrew Cohen	4d5b1b33	fixed agent action constraint tennis	5 年前
Andrew Cohen	c4946d31	increase curr to .2	5 年前
Andrew Cohen	13fd97de	small per timestep reward	5 年前
Andrew Cohen	45d35fa4	downward force tennis agent	5 年前
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807 ) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 年前
Andrew Cohen	f60df1c9	lower agent start	5 年前
Andrew Cohen	29627181	more downward force/constrain y	5 年前
Andrew Cohen	e5b883db	added bounce obs to agent/more downward force on ball	5 年前
GitHub	1e0b022f	[MLA-850] rename namespaces to Unity.MLAgents (#3843 ) * rename in protos * rename in C# * doc changes, migration, changelog * PR numbers * fix standalone test path	5 年前
Andrew Cohen	428b1dfa	Hit bonus for whole exp	5 年前
GitHub	3d15d51f	apply rider suggestions in Examples (#3846 )	5 年前
Andrew Cohen	5d22b819	added timepenalty to obs	5 年前
Andrew Cohen	4769cb1e	proximity bonus	5 年前
Andrew Cohen	39e0bbe9	remove debug log	5 年前
Andrew Cohen	376af981	lower agent height	5 年前
Andrew Cohen	b80d6228	randomize ball/agent spawn	5 年前
Andrew Cohen	2c42f577	Merge branch 'master' into asymm-envs	5 年前
Andrew Cohen	d5428487	addforce and static walls	5 年前
Andrew Cohen	cde22a14	fix clipping	5 年前
Andrew Cohen	d77f2566	energy usage penalty to prevent superstition on serve	5 年前
Andrew Cohen	0d943676	inc energy pen	5 年前
Andrew Cohen	1f2453ef	slower tennis	5 年前
Andrew Cohen	8ef0b3a8	opponent observations	5 年前
Andrew Cohen	862a3c02	faster now that it works?	5 年前
Andrew Cohen	1819fbad	0f max height	5 年前
Andrew Cohen	acb85908	hitbonus	5 年前
Andrew Cohen	4907a9cd	slower tennis	5 年前
Andrew Cohen	69acdeec	fixed reset tennis	5 年前
Andrew Cohen	98c878ba	fixed rotate action	5 年前
Andrew Cohen	6c42c221	remove debug print	5 年前
Andrew Cohen	db1a77e7	rotate over tiemsteps	5 年前
Andrew Cohen	0deacd92	much slower agent	5 年前
Andrew Cohen	f74ac6ae	remove rotation hindrance	5 年前
Andrew Cohen	8376b862	lower tennis height	5 年前
Andrew Cohen	c702bb27	faster ball	5 年前
Andrew Cohen	d7c2c163	please no more	5 年前
Andrew Cohen	765dfb4d	faster x axis/less rotation	5 年前
Andrew Cohen	9beb2a15	serve spawns forward	5 年前
Andrew Cohen	0b45d365	ball spawns forward	5 年前
Andrew Cohen	c5ce18c7	remove x/y vel, smaller network	5 年前
Andrew Cohen	fd7ee405	normalize by hand	5 年前
Andrew Cohen	7e3c3b45	remove hit bonus	5 年前
Andrew Cohen	84f231ce	time penalty	5 年前
Andrew Cohen	0ae5e040	rename some vars	5 年前
Andrew Cohen	cc79fa0e	no opp obs	5 年前
Andrew Cohen	f5aa9fd6	no time penalty no opp	5 年前
Andrew Cohen	13c2a209	added opp, decay eps removed	5 年前
Andrew Cohen	59a60c1e	Merge branch 'master' into asymm-envs	5 年前
Andrew Cohen	efdf8cdf	fixes due to change in master	5 年前
Andrew Cohen	9005ad3e	normalize	5 年前
Andrew Cohen	d7e8d25c	reset params broken	5 年前
Andrew Cohen	43d5ef17	fixed opponent setting	5 年前
Andrew Cohen	4ba0d98c	cubewar and tennis stability test	5 年前
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 年前
Andrew Cohen	99b18d2e	revert tennis timepenalty	5 年前
Andrew Cohen	03eef40b	constrain x tennis	5 年前
Andrew Cohen	0c17dc1b	cannot hit scenery tennis	5 年前
Andrew Cohen	31a5b2ee	4096 batch	5 年前
Andrew Cohen	346a90ba	move agent back	5 年前
Andrew Cohen	c8ecf7ea	remove opp obs	5 年前
Andrew Cohen	e76bd1ac	remove tp	5 年前
Andrew Cohen	3d228cb5	slower x	5 年前
Andrew Cohen	a5383d7e	move agent service forward	5 年前
Andrew Cohen	6852707a	better	5 年前
Andrew Cohen	01af358f	tp	5 年前
GitHub	bb9417f7	Update example environments to use the Actuator API (#4363 )	4 年前

1 2 3

110 次代码提交 (47893e9c-4269-4b8d-9c76-911f580807fe)