ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
Arthur Juliani	22d931c0	Add comments to Reacher and re-train model w/ epsilon needed	7 年前
GitHub	26a1ed87	Merge pull request #380 from Unity-Technologies/dev-reacher-cleanup Add comments to Reacher and re-train model w/o epsilon needed	7 年前
Vincent Gao	38bd3e40	replaced all the tabs to 4 spaces in the project	7 年前
Vincent Gao	ba0ecf24	fixed other tabs and spaces	7 年前
Vincent Gao	02df3b34	resolved conflicts	7 年前
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397 )	7 年前
Vincent Gao	1bc43933	Merge branch 'development-0.3' into hotfix/issue#333	7 年前
GitHub	8d6bf190	Merge pull request #384 from Unity-Technologies/hotfix/issue#333 hotfix on issue#333, and a few comments fixes	7 年前
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 年前
GitHub	addadada	[AddVectorObs] Modified the Examples (#409 ) * [AddVectorObs] Converted the Examples to use the new AddVectorObs * [AddVectorObs] Converted the Reacher to use the new AddVectorObs * [Improvement] One liner for adding the rotation	7 年前
Marwan Mattar	7b99ccfe	Merge branch 'development-0.3' into dev-api-doc-academy	7 年前
GitHub	bb82e25d	Revamped Push Block (#404 ) * Adds new revamped Push Block environment. * Adds "Shared Assets" folder to Examples sub-directory.	7 年前
Marwan Mattar	bab02a21	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Scripts/Brain.cs	7 年前
Vincent Gao	3a9f500b	Updated the Reacher's model file, also updated the Reacher's agent code from eulerAngle to quaternion Updated the Reacher's model file, also updated the Reacher's agent code from eulerAngle to quaternion	7 年前
GitHub	94d435e7	Merge pull request #458 from Unity-Technologies/feature/Reacher-model-update Updated the Reacher's model file, also updated the Reacher's agent co…	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
GitHub	d7224351	Brains as Scriptable Objects (#1250 ) * Initial Commit Ported most functionalities, still need to : - Documentation - Add Comments - Custom drawer for BrainParameters - Fix the UnitTests - Review Functionalities * Added Custom Drawer for the Brain Parameters * Improvements to the HubDrawer * Modified the Brain Editors * Minor bug fixes and UI changes * Modified the Help Boxes of the Drawers * Modified Brain class, renamed Initialize and made DecideAction virtual * Fix the UnityTests * Simpler Brain creation menu * Renamed Internal Brain to Learning Brain * modified the parameters to remove reference to External or Internal in the Protobuf objects * Updated the protobuf generated files * Fix the Pytests * Removed the graph scope from the Learning Brain * cleaner logic than try catch * Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub * Modified how the Brain finds the A...	6 年前
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 年前
Ervin T	dba466e3	Reset Parameters implemented for Pushblock, Reacher and Walker (#2322 ) Pushblock: dynamic_friction, static_friction, block_drag, block_scale Reacher: Gravity, non-linear goal movement Walker: Gravity, torso mass	5 年前
GitHub	53475207	Merge pull request #2380 from Unity-Technologies/release-0.9.0 Release v0.9.0	5 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	b2fa2268	Merge pull request #2648 from Unity-Technologies/release-0.10.0 Release 0.10.0	5 年前
Vilmantas Balasevicius	a65f9b42	Initial changes/experiment scenes for articulated reacher.	5 年前
Vilmantas Balasevicius	78389481	Somewhat working Articulated Reacher, however still WIP: training performance is way different from standard RB version.	5 年前
Vilmantas Balasevicius	0e8ad96f	Experiments with torque applied to articulations.	5 年前
Vilmantas Balasevicius	eff3308a	Further experimentation with reacher articulations setup, manual control scene added.	5 年前
Vilmantas Balasevicius	2d032594	Further modifications to make PPO work	5 年前
Vilmantas Balasevicius	1e4e02d3	Reducing torque to 150. Makes it learn faster.	5 年前
Anupam Bhatnagar	cc208c00	resolving conflicts	5 年前
Vilmantas Balasevicius	6dc72657	Merge branch 'PhysXArticulations20201' of https://github.com/Unity-Technologies/ml-agents into PhysXArticulations20201	5 年前
Vilmantas Balasevicius	86a627d2	Implemented agent reset via prefab respawning for ArticulatedReacher and ArticulatedReacherManualControl scenes.	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
Chris Elion	3d8a70fb	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801 )	5 年前
GitHub	495873e5	Merge pull request #2833 from Unity-Technologies/release-0.11.0 Release 0.11.0	5 年前
Chris Elion	691d21e6	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
Jonathan Harper	8550679d	Merge branch 'develop' into release-0.11.0	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
Chris Elion	fca51de8	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
Ervin Teng	987e0e3a	Merge tf2 branch	5 年前
GitHub	d4780a55	Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master Merge Release 0.12.0 to master	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	35c995e9	Merge pull request #3038 from Unity-Technologies/develop Merge develop to master	5 年前
Ervin Teng	eb4a04a5	Merge branch 'master' into develop-tanhsquash	5 年前
Ervin Teng	88b1123a	Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor	5 年前
GitHub	39f1f310	Don't inherit from Academy, remove virtual methods (#3184 )	5 年前
Ervin Teng	98ed88b1	Merge branch 'master' into develop-separatevalue	5 年前
GitHub	4269447e	Convert Academy to a singleton (#3210 )	5 年前
Ervin Teng	29f3330f	Merge master into hotfix-0.13.1	5 年前
GitHub	386ba66c	Develop observation collector (#3352 ) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
Anupam Bhatnagar	d8c79f48	resolving merge conflicts	5 年前
Ervin Teng	5ef902bf	Merge branch 'master' into develop-splitpolicyoptimizer	5 年前
Ervin Teng	1156b9b3	Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder	5 年前
Anupam Bhatnagar	e04fcd71	Merge branch 'master' into master-into-release-0.14.1	5 年前
GitHub	f25bf7d3	Reintroduce MLAgents.Sensors namespace (#3509 ) * Reintroduced the namespace MLAgents.Sensors * Documentation changes * updated the changelog	5 年前
Andrew Cohen	573b1f6d	Merge branch 'master' into soccer-fives	5 年前
GitHub	411bb64a	Renaming Agent's methods (#3557 ) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message :Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 年前
Chris Elion	fa5e7e6d	Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public	5 年前
Andrew Cohen	b1cfa74d	Merge branch 'master' into develop-test-imitation	5 年前
Andrew Cohen	53bea15c	Merge branch 'master' into soccer-fives	5 年前
Andrew Cohen	ac261e36	Merge branch 'master' into self-play-mutex	5 年前
GitHub	6612b496	Deprecating Academy.Instance.FloatProperties (#3696 ) * Deprecating Academy.Instance.FloatProperties * Made the registered side channels a static property and created the sideChannelUtils class to handle side channel stuff * Clearing the sending message queue in the Academy when the communicaor is not on * addressing comments	5 年前
Andrew Cohen	59b88be6	Merge branch 'master' into self-play-mutex	5 年前
Ervin Teng	06fa3d39	Merge branch 'master' into develop-sac-apex	5 年前
Anupam Bhatnagar	50e52d9c	Merge branch 'master' into distributed-training	5 年前
Andrew Cohen	930d6fa3	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Andrew Cohen	b0c506a6	Merge branch 'soccer-2v1' into asymm-envs	5 年前
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807 ) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 年前
GitHub	1e0b022f	[MLA-850] rename namespaces to Unity.MLAgents (#3843 ) * rename in protos * rename in C# * doc changes, migration, changelog * PR numbers * fix standalone test path	5 年前
GitHub	3d15d51f	apply rider suggestions in Examples (#3846 )	5 年前
Arthur Juliani	212e2d1d	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
GitHub	d8956399	Merge pull request #3914 from Unity-Technologies/release_1_to_master Merge Release 1 to master	5 年前
vincentpierre	c34dd5b6	Merge branch 'master' into develop-gym-wrapper	5 年前
Arthur Juliani	89ad3020	Merge remote-tracking branch 'origin/master' into develop-add-fire # Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py	5 年前
Andrew Cohen	59a60c1e	Merge branch 'master' into asymm-envs	5 年前
Chris Elion	2f7174f0	migrate from old branch	4 年前
Chris Elion	20b5a157	update scenes and get them training	4 年前
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 年前
GitHub	bb9417f7	Update example environments to use the Actuator API (#4363 )	4 年前
Christopher Goy	5a233353	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 年前
HH	2080c287	Merge branch 'master' into hh/develop/loco-crawler-variable-speed	4 年前
Ervin Teng	d52443a5	Merge branch 'master' into develop-shortenstrikervsgoalie	4 年前
yanchaosun	4be4f1d1	new reacher env	4 年前
yanchaosun	e39986ed	block larger feature size; reacher fix and new reward	4 年前
yanchaosun	7dac3284	push block more steps	4 年前
yanchaosun	27dffa4d	new reacher reward	4 年前
yanchaosun	883361ee	reacher new reward: action penalty and constant not-reaching-goal penalty	4 年前
yanchaosun	7bc457f8	change reward function: stress less on action	4 年前
yanchaosun	85549b2b	reacher: stack observation. with the original reward function	4 年前
yanchaosun	92c3facf	distance based penalty	4 年前
yanchaosun	68544337	lift pink balls to another level	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
vincentpierre	7a5cc9ec	Merge master into develop-rm-tf	4 年前
vincentpierre	c1587bce	Solving merge conflicts	4 年前
Arthur Juliani	0d2f8887	Merge remote-tracking branch 'origin/master' into goal-conditioning # Conflicts: # ml-agents-envs/mlagents_envs/base_env.py # ml-agents-envs/mlagents_envs/rpc_utils.py # ml-agents/mlagents/trainers/tests/mock_brain.py # ml-agents/mlagents/trainers/tests/simple_test_envs.py	4 年前
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 年前
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 年前

1 2 3

104 次代码提交 (7e7743d1-03a2-4a84-a127-380dea067341)