ml-agents

作者	SHA1	备注	提交日期
GitHub	c17937ef	Curiosity Driven Exploration & Pyramids Environments (#739 ) * Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer. * To enable, set use_curiosity flag to true in hyperparameter file. * Includes refactor of unitytrainers model code to accommodate new feature. * Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.	7 年前
GitHub	7eebb5f6	Fixes for broken materials (#855 ) * Fix materials for wall and checker * Make the ball orange again	7 年前
Vincent(Yuan) Gao	7ce0b834	Add Brains for Pyramids, Reacher, SoccerTwos, Tennis, Bouncer, and CrawlerDynamic (#1313 ) * New brains for Pyramid scene * Add reacher brains * New brains for Soccer agents * New Tennis Brains * Set prefabs correctly * New brains for bouncer * New Dynamic Crawler Brains	6 年前
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557 ) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 年前
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583 ) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
Jonathan Harper	c561dfbf	Update NN models for all example scenes (v0.11.0) This change updates all of the .nn models, and uses the new output filenames (e.g. 3DBallLearning.nn becomes 3dBall.nn).	5 年前
GitHub	03664e75	Make On Demand Decision the default (#3243 ) * Added a simple Decision Requester * Modified the prefabs * Fixing the tests and removing fields from Agent parameters * Migrating.md * addressing comments * addressing comments	5 年前
GitHub	a1a1126d	Trim some public fields on the Agent (#3269 ) * Triming some of the methods of the agent but left SetReward * Fixing bugs * modifying the environments * Reintroducing IsDone and IsMaxStepReached * Updating the Migrating doc * more details on the Migration	5 年前
GitHub	ceebacd6	Convert pyramids to Raycast sensor (#3299 )	5 年前
Yuan Gao	24a681bf	Updated the prefebs to enable inference	5 年前
vincentpierre	89605d02	Replaced .nn models with .onnx models Missing so far : - Visual3DBall - CrawlerDynamicVariableSpeed - CrawlerStaticVariableSpeed - GridFoodCollector - VisualFoodCollector - GridWorld - Match3	4 年前
GitHub	bc0ba098	add option for Burst inference (#4925 )	4 年前
vincentpierre	4e14879d	Updating the barracuda 1.4.0 (#5291 ) Initial commit second commit. The no-extrinsic was trained without the log reward (reward = prob) while the new one is (reward = log_prob - log_prior) A few results, it looks like Walker-diverse-r05-bigger.onnx is doing something Modified pushblock using next state and action. Did not help Fixing bug that had 9 diversity settings instead of 8 removing results	4 年前

14 次代码提交 (11a85d59-5728-48e2-a158-0fe0f5f1fee2)