ml-agents

Author	SHA1	Message	Commit date
GitHub	c17937ef	Curiosity Driven Exploration & Pyramids Environments (#739) * Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer. * To enable, set use_curiosity flag to true in hyperparameter file. * Includes refactor of unitytrainers model code to accommodate new feature. * Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.	7 years ago
GitHub	9ab98584	Additional Environment Variations (#791) * Add Visual (Camera) and Imitation Learning variations to example environments	7 years ago
GitHub	7eebb5f6	Fixes for broken materials (#855) * Fix materials for wall and checker * Make the ball orange again	6 years ago
GitHub	3990eceb	Fix wall material (#874)	6 years ago
GitHub	4b3c6c9f	Merge pull request #885 from Unity-Technologies/release-v0.4 Release v0.4	6 years ago
Arthur Juliani	195ac934	Merge branch 'develop' into develop-runs # Conflicts: # python/learn.py # python/unitytrainers/trainer.py	6 years ago
Vincent(Yuan) Gao	7ce0b834	Add Brains for Pyramids, Reacher, SoccerTwos, Tennis, Bouncer, and CrawlerDynamic (#1313) * New brains for Pyramid scene * Add reacher brains * New brains for Soccer agents * New Tennis Brains * Set prefabs correctly * New brains for bouncer * New Dynamic Crawler Brains	6 years ago
GitHub	e7048eca	Changed the scene to use scriptable object for Crawler static, gridworld, walljump, visualhallway, visualpushblock, visualpyramid (#1314)	6 years ago
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 years ago
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 years ago
GitHub	275ff5d6	Merge pull request #1764 from Unity-Technologies/release-v0.7 Release v0.7 into master	6 years ago
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 years ago
GitHub	b2fa2268	Merge pull request #2648 from Unity-Technologies/release-0.10.0 Release 0.10.0	5 years ago
Ervin Teng	be6354ca	Fix visual Pyramids and Pushblock scenes (#2672) * Change to learning brain and uncheck control * Fix Push Block learning brain	5 years ago
Anupam Bhatnagar	cc208c00	resolving conflicts	5 years ago
GitHub	d05818b9	Fix visual Pyramids and Pushblock scenes (#2672) * Change to learning brain and uncheck control * Fix Push Block learning brain	5 years ago
GitHub	f22c41db	Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 Merge Hotfix 0.10.1	5 years ago
Anupam Bhatnagar	b733b34c	resolving conflicts	5 years ago
Chris Elion	a1967c19	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 years ago
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 years ago
GitHub	99146e97	1 to 1 Brain to Agent (#2729) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 years ago
Chris Elion	3d8a70fb	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 years ago
Jonathan Harper	c561dfbf	Update NN models for all example scenes (v0.11.0) This change updates all of the .nn models, and uses the new output filenames (e.g. 3DBallLearning.nn becomes 3dBall.nn).	5 years ago
Jonathan Harper	7de4046c	Merge remote-tracking branch 'origin/release-0.11.0' into develop	5 years ago
GitHub	495873e5	Merge pull request #2833 from Unity-Technologies/release-0.11.0 Release 0.11.0	5 years ago
GitHub	35892405	Merge pull request #2832 from Unity-Technologies/develop-merge-release-0.11.0 Merge release-0.11.0 into develop	5 years ago
Chris Elion	691d21e6	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 years ago
Ervin Teng	987e0e3a	Merge tf2 branch	5 years ago
GitHub	03664e75	Make On Demand Decision the default (#3243) * Added a simple Decision Requester * Modified the prefabs * Fixing the tests and removing fields from Agent parameters * Migrating.md * addressing comments * addressing comments	5 years ago
Ervin Teng	29f3330f	Merge master into hotfix-0.13.1	5 years ago
GitHub	a1a1126d	Trim some public fields on the Agent (#3269) * Triming some of the methods of the agent but left SetReward * Fixing bugs * modifying the environments * Reintroducing IsDone and IsMaxStepReached * Updating the Migrating doc * more details on the Migration	5 years ago
GitHub	ceebacd6	Convert pyramids to Raycast sensor (#3299)	5 years ago
Ervin Teng	db249ceb	Merge branch 'master' into develop-splitpolicyoptimizer	5 years ago
Yuan Gao	24a681bf	Updated the prefebs to enable inference	5 years ago
GitHub	6ef56c83	Merge pull request #3749 from Unity-Technologies/develop-add-inference-examples Add ModelOverrider to all of the Agent prefabs to enable Barracuda Inference with specified .nn model file	5 years ago
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 years ago
Andrew Cohen	2c42f577	Merge branch 'master' into asymm-envs	5 years ago
vincentpierre	89605d02	Replaced .nn models with .onnx models Missing so far : - Visual3DBall - CrawlerDynamicVariableSpeed - CrawlerStaticVariableSpeed - GridFoodCollector - VisualFoodCollector - GridWorld - Match3	4 years ago
GitHub	bc0ba098	add option for Burst inference (#4925)	4 years ago
Ruo-Ping Dong	c87bce9e	Merge branch 'master' into develop-base-teammanager	4 years ago
vincentpierre	e1b94b8b	Merge branch 'master' into develop-var-len-obs-feature	4 years ago
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 years ago
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 years ago
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 years ago
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 years ago
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 years ago
GitHub	85f8b40b	Removing some scenes (#4997) * Removing some scenes, All the Static and all the non variable speed environments. Also removed Bouncer, PushBlock, WallJump and reacher. Removed a bunch of visual environements as well. Removed 3DBallHard and FoodCollector (kept Visual and Grid FoodCollector) * readding 3DBallHard * readding pushblock and walljump * Removing tennis * removing mentions of removed environments * removing unused images * Renaming Crawler demos * renaming some demo files * removing and modifying some config files * new examples image? * removing Bouncer from build list * replacing the Bouncer environment with Match3 for llapi tests * Typo in yamato test	4 years ago
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123)	4 years ago
Christopher Goy	921ba4f0	Update v2-staging from main (March 15) (#5123)	4 years ago
Christopher Goy	ebe45056	Merge branch 'main' into release_14_branch-to-main	4 years ago
Ervin Teng	8902c058	Merge branch 'main' into develop-coma2-trainer	4 years ago
Chris Elion	970f1d40	Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec	4 years ago
Ervin Teng	1f026c70	Merge branch 'main' into develop-superpush-branch-cleanup	4 years ago
Ervin Teng	ce872033	Revert "Merge branch 'main' into develop-superpush-branch-cleanup" This reverts commit 5bea802525381f931a5e0f8b8778fe27a12f03af, reversing changes made to cee3524e85161e13689d95f66bc6bff994d2cdfd.	4 years ago
Andrew Cohen	9e77d7e1	Merge branch 'main' into develop-soccer-groupman	4 years ago
vincentpierre	4e14879d	Updating the barracuda 1.4.0 (#5291) Initial commit second commit. The no-extrinsic was trained without the log reward (reward = prob) while the new one is (reward = log_prob - log_prior) A few results, it looks like Walker-diverse-r05-bigger.onnx is doing something Modified pushblock using next state and action. Did not help Fixing bug that had 9 diversity settings instead of 8 removing results	4 years ago


56 commits (39a76867-58c3-47e1-8cab-99681ab89eb5)