ml-agents

Author	SHA1	Message	Date
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brains are set to external * fix on the instantiation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 years ago
GitHub	51621334	State Stacking & Banana Environment (#262) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 years ago
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Decision	7 years ago
GitHub	b1d6172f	[Retrained models] Of GridWorld and Tennis (#410)	7 years ago
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 years ago
GitHub	c1e930b5	Minor Visual Changes for Environments (#470) * Minor changes to ensure a common visual language. * Agents are blue (or additionally red in competitive scenarios). * Interactable objects are orange. * Goals are green when objects, and checkerboards when places. * Not everything perfectly follows this, but things are mostly consistent now. * Renamed "Banana" folder to "BananaCollectors" * Ensured all brains were set to "Player" * Moved non-shared assets out of the "SharedAssets" folder.	7 years ago
Marwan Mattar	4d1b3ae3	Merge branch 'development-0.3' into docs/doxygen # Conflicts: # docs/doxygen/Readme.md	7 years ago
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 years ago
Vincent(Yuan) Gao	7ce0b834	Add Brains for Pyramids, Reacher, SoccerTwos, Tennis, Bouncer, and CrawlerDynamic (#1313) * New brains for Pyramid scene * Add reacher brains * New brains for Soccer agents * New Tennis Brains * Set prefabs correctly * New brains for bouncer * New Dynamic Crawler Brains	6 years ago
Arthur Juliani	59126c8c	Release v0.6 tennis (#1350) * Modified the scene, missing the model * modified the hyperparameters * Updated the model	6 years ago
GitHub	547f0e98	Merge pull request #1361 from Unity-Technologies/release-v0.6 Merge Release v0.6 into develop	6 years ago
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 years ago
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 years ago
GitHub	275ff5d6	Merge pull request #1764 from Unity-Technologies/release-v0.7 Release v0.7 into master	6 years ago
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 years ago
GitHub	b2fa2268	Merge pull request #2648 from Unity-Technologies/release-0.10.0 Release 0.10.0	5 years ago
Anupam Bhatnagar	cc208c00	resolving conflicts	5 years ago
GitHub	99146e97	1 to 1 Brain to Agent (#2729) * 1 to 1 Brain to Agent This is a work in progress In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 years ago
Chris Elion	3d8a70fb	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 years ago
Jonathan Harper	c561dfbf	Update NN models for all example scenes (v0.11.0) This change updates all of the .nn models, and uses the new output filenames (e.g. 3DBallLearning.nn becomes 3dBall.nn).	5 years ago
Jonathan Harper	7de4046c	Merge remote-tracking branch 'origin/release-0.11.0' into develop	5 years ago
GitHub	495873e5	Merge pull request #2833 from Unity-Technologies/release-0.11.0 Release 0.11.0	5 years ago
GitHub	35892405	Merge pull request #2832 from Unity-Technologies/develop-merge-release-0.11.0 Merge release-0.11.0 into develop	5 years ago
Chris Elion	691d21e6	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 years ago
Ervin Teng	987e0e3a	Merge tf2 branch	5 years ago
Andrew Cohen	184af227	splitting brain params into brain name and identifiers	5 years ago
Andrew Cohen	c257e053	set team id in prefab	5 years ago
Andrew Cohen	94366bfe	splitting brain params into brain name and identifiers	5 years ago
Andrew Cohen	19cb893e	set team id in prefab	5 years ago
Andrew Cohen	e648cbc8	splitting brain params into brain name and identifiers	5 years ago
Andrew Cohen	9c4ec4c4	set team id in prefab	5 years ago
Andrew Cohen	8f62c69e	splitting brain params into brain name and identifiers	5 years ago
Andrew Cohen	12756131	set team id in prefab	5 years ago
Andrew Cohen	76f8a515	fixed assets that got messed up	5 years ago
GitHub	03664e75	Make On Demand Decision the default (#3243) * Added a simple Decision Requester * Modified the prefabs * Fixing the tests and removing fields from Agent parameters * Migrating.md * addressing comments * addressing comments	5 years ago
Ervin Teng	29f3330f	Merge master into hotfix-0.13.1	5 years ago
GitHub	a1a1126d	Trim some public fields on the Agent (#3269) * Trimming some of the methods of the agent but left SetReward * Fixing bugs * modifying the environments * Reintroducing IsDone and IsMaxStepReached * Updating the Migrating doc * more details on the Migration	5 years ago
GitHub	14193ada	Self-play for symmetric games (#3194)	5 years ago
Ervin Teng	db249ceb	Merge branch 'master' into develop-splitpolicyoptimizer	5 years ago
Yuan Gao	24a681bf	Updated the prefabs to enable inference	5 years ago
Andrew Cohen	8e271ee8	new tennis brain	5 years ago
GitHub	6ef56c83	Merge pull request #3749 from Unity-Technologies/develop-add-inference-examples Add ModelOverrider to all of the Agent prefabs to enable Barracuda Inference with specified .nn model file	5 years ago
Andrew Cohen	c07e0fce	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 years ago
Andrew Cohen	8431ecb5	tennis reward fix	5 years ago
Andrew Cohen	1ac4dfb3	update Tennis max step	5 years ago
Andrew Cohen	95f3ffab	fix max step tennis	5 years ago
Andrew Cohen	47548ee4	tennis curriculum	5 years ago
Andrew Cohen	1807f698	increase drag	5 years ago
Andrew Cohen	c8cae0dd	reduced bounciness/added downward force tennis	5 years ago
Andrew Cohen	3bd33889	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	9f36cd36	added floorhit obs tennis	5 years ago
Andrew Cohen	f5c551f2	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	fdcf3f43	Revert "added floorhit obs tennis" This reverts commit 1d3e1cffda2e49da9cf1944eca4f250d9e551c39.	5 years ago
Andrew Cohen	e1660b49	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	8ad1323e	increased scale tennis env	5 years ago
Andrew Cohen	29c23484	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	e2d68e4f	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	e5b883db	added bounce obs to agent/more downward force on ball	5 years ago
Andrew Cohen	028a8d59	larger network/6 stacked obs	5 years ago
Andrew Cohen	ca6cdff3	fixed broken prefab...	5 years ago
Andrew Cohen	5d22b819	added timepenalty to obs	5 years ago
GitHub	4092d937	[Bug fix] Hard reset when team changes (#3870)	5 years ago
Arthur Juliani	212e2d1d	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 years ago
Andrew Cohen	fb7aa862	remove timepenalty obs	5 years ago
Andrew Cohen	b0243014	broken prefab..	5 years ago
Andrew Cohen	e0723381	new tennis brain	5 years ago
Andrew Cohen	7da1869a	new tennis brain	5 years ago
Andrew Cohen	a56643bb	tennis brain to prefab	5 years ago
GitHub	d8b93f8f	[Bug fix] Hard reset when team changes (#3870) (#3899)	5 years ago
Andrew Cohen	d5428487	addforce and static walls	5 years ago
Andrew Cohen	8ef0b3a8	opponent observations	5 years ago
vincentpierre	c34dd5b6	Merge branch 'master' into develop-gym-wrapper	5 years ago
Andrew Cohen	c5ce18c7	remove x/y vel, smaller network	5 years ago
Andrew Cohen	97119431	fix prefab	5 years ago
Andrew Cohen	fd7ee405	normalize by hand	5 years ago
Andrew Cohen	cc79fa0e	no opp obs	5 years ago
Andrew Cohen	13c2a209	added opp, decay eps removed	5 years ago
Andrew Cohen	8e69cd88	fix tennis prefab	5 years ago
Andrew Cohen	7475ad11	tunneling is a loss	5 years ago
Andrew Cohen	31a5b2ee	4096 batch	5 years ago
Andrew Cohen	c8ecf7ea	remove opp obs	5 years ago
Andrew Cohen	30161f21	reduce max step	5 years ago
Andrew Cohen	3d228cb5	slower x	5 years ago
Andrew Cohen	e7e25c16	correct max step...	5 years ago
Andrew Cohen	acd219ce	faster ball real max step	5 years ago
vincentpierre	89605d02	Replaced .nn models with .onnx models Missing so far : - Visual3DBall - CrawlerDynamicVariableSpeed - CrawlerStaticVariableSpeed - GridFoodCollector - VisualFoodCollector - GridWorld - Match3	4 years ago
GitHub	bc0ba098	add option for Burst inference (#4925)	4 years ago
Ruo-Ping Dong	c87bce9e	Merge branch 'master' into develop-base-teammanager	4 years ago
vincentpierre	e1b94b8b	Merge branch 'master' into develop-var-len-obs-feature	4 years ago
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 years ago
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 years ago
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 years ago
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 years ago
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 years ago

96 commits (2c42f577-2067-45be-964c-a1476d616ff9)