ml-agents

Author	SHA1	Message	Commit date
Arthur Juliani	982fab41	Initial commit	7 years ago
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 years ago
GitHub	00534390	Refactored GridWorld (#225) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 years ago
Arthur Juliani	8d6c57b9	Gridworld should have an inference wait time	7 years ago
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 years ago
GitHub	e676017b	Reorganize learn.py (#302) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 years ago
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 years ago
GitHub	f134016b	On Demand Decision (#308) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 years ago
GitHub	d4cd72d8	[timeBetweenDecisions] Reimplementation of waitTime for GridWorld and… (#368) * [timeBetweenDecisions] Reimplementation of waitTime for GridWorld and Basic * [EnvironmentModification] Changed the gridworld TimeBetweenDecisionAtInference	7 years ago
GitHub	b1d6172f	[Retrained models] Of GridWorld and Tennis (#410)	7 years ago
GitHub	976c56c5	Environment Aesthetic Unification (#459) * Aesthetic unification * Add new environment images	7 years ago
GitHub	0c417c55	Release v0.5 (#1202)	6 years ago
vincentpierre	27c3c9be	Fixed Action Space for GridWorld (#1085) Added the appropriate branch information.	6 years ago
Marwan Mattar	2d767f3f	GridWorld now uses action masking (#1125) * GridWorld now uses action masking * Addressed the comments * addressed comments * Added checkbox to turn action masking on/off (#1146) * Added checkbox to turn action masking on/off * Fix to handle the no-action option * Added comment to GridWorld mentioning the use of action masking. (#1153)	6 years ago
GitHub	e7048eca	Changed the scene to use scriptable object for Crawler static, gridworld, walljump, visualhallway, visualpushblock, visualpyramid (#1314)	6 years ago
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 years ago
GitHub	cfb8f208	Release v0.7 minor fixes (#1759) * Fix typo * Updated some of the scenes	6 years ago
GitHub	6f8fc130	External Contribution: Use RenderTexture instead of Camera for Visual Observation (#1824) * Added RenderTexture support for visual observations * Cleaned up new ObservationToTexture function * Added check for to width/height of RenderTexture * Added check to hide HelpBox unless both cameras and RenderTextures are used * Added documentation for Visual Observations using RenderTextures * Added GridWorldRenderTexture Example scene * Adjusted image size of doc images * Added GridWorld example reference * Fixed missing reference in the GridWorldRenderTexture scene and resaved the agent prefab * Fix prefab instantiation and render timing in GridWorldRenderTexture * Added screenshot and reworded documentation * Unchecked control box * Rename renderTexture * Make RenderTexture scene default for GridWorld Co-authored-by: Mads Johansen <pyjamads@gmail.com>	6 years ago
GitHub	bebdb293	ML-Agents Branding & Color Updates (#2583) * new env styles rebased on develop * added new trained models * renamed food collector platforms * reduce training timescale on WallJump from 100 to 10 * uncheck academy control on walljump * new banner image * rename banner file * new example env images * add foodCollector image * change Banana to FoodCollector and update image * change bouncer description to include green cube * update image * update gridworld image * cleanup prefab names and tags * updated soccer env to reference purple agent instead of red * remove unused mats * rename files * remove more unused tags * update image * change platform to agent cube * update text. change platform to agents head * cleanup * cleaned up weird unused meta files * add new wall jump nn files and rename a prefab * walker change stacked states from 5 to 1 walker collects physics observations so stacked states are not need...	5 years ago
GitHub	24ba9d58	Develop deprecate broadcasting (#2669) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modifie...	5 years ago
Hunter	9461c4f1	moved UI to corner and made text easier to read	5 years ago
Ervin Teng	b9193bcc	Scale rendertexture with screen	5 years ago
GitHub	3485dfb9	Made gridworld a prefab so we can have more of them at once (#2721)	5 years ago
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 years ago
GitHub	99146e97	1 to 1 Brain to Agent (#2729) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 years ago
GitHub	04b456eb	Add model for render texture agent (#2863)	5 years ago
GitHub	8426501b	Fix width and height in visual observations (#2919) * swap h/w in sensor * change texture to non-square, retrain model * get dimensions from RenderTexture	5 years ago
GitHub	a488299f	[MLA-345] float visual observations (#3148) * pass shape to WriteAdapter * handle floats on python side * cleanup * whitespace * rename GetFloatObservationShape, support uncompressed in RenderTexture sensor * numpy float32 * remove unused using * Float sensor and unit test * replace asserts with exceptions, docstrings	5 years ago
GitHub	39f1f310	Don't inherit from Academy, remove virtual methods (#3184)	5 years ago
GitHub	4269447e	Convert Academy to a singleton (#3210)	5 years ago
GitHub	2ba1e9d4	Remove dead components from the examples scenes (#3619)	5 years ago
GitHub	f623b722	Anchor GridWorld UI to top-left (#3706)	5 years ago
HH	1912e47a	Dynamic Sensor Benchmarks In	4 years ago
brccabral	bbdbe0ec	set multiple obstacles from area added a parameter to GridArea to control how many obstacles will be created in the scene	4 years ago
brccabral	349e98a6	set number of obstacles to 1 in scene	4 years ago
Arthur Juliani	b3638ad6	Update prefabs	4 years ago
Arthur Juliani	4060202d	Use GoalSensor in GridWorld	4 years ago
Arthur Juliani	e6a973cd	Add OneHot util to goal sensor	4 years ago
Arthur Juliani	95fd8040	Make GridWorld a goal-based environment	4 years ago
Arthur Juliani	e8d54b6f	Use hypernetwork if there is a goal	4 years ago
GitHub	bc0ba098	add option for Burst inference (#4925)	4 years ago
vincentpierre	ceba7bcc	Aded the Goal conditioned GridWorld to replace regular gridworld	4 years ago
vincentpierre	87242300	adding missing files	4 years ago
vincentpierre	42a3732c	Code improvements	4 years ago
vincentpierre	9be72429	Fixing conflicts with main	4 years ago
GitHub	2980ade0	Goal conditioning grid world : Example of goal conditioning (#5193) * Aded the Goal conditioned GridWorld to replace regular gridworld * adding missing files * Code improvements * Documentation change on gridworld * resolving conflicts * new model * Addressing comments * comments and renames * Update docs/Learning-Environment-Examples.md Co-authored-by: Ervin T. <ervin@unity3d.com> * adding reference to gridworld in docs about goal signal Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 years ago

46 commits (7e7743d1-03a2-4a84-a127-380dea067341)