ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	4d3716fe	default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory()	7 年前
vincentpierre	0df8326e	minor fixes	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
vincentpierre	d77cfc6d	Fix Cumulative reward reset	7 年前
vincentpierre	a7de9336	revert previous commit	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
vincentpierre	15f29084	fix on the SetCumulativeReward() method in Agent.cs	7 年前
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166 ) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276 )	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	0277039d	Fix Basic Environment & Discrete States (#356 ) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 年前
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	69481d2d	Imitation Learning Helper (#371 ) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 年前
Vincent Gao	933317be	modified comments	7 年前
Vincent Gao	4a23c5cf	clean up the code in Ball3DDecision	7 年前
Vincent Gao	38bd3e40	replaced all the tabs to 4 spaces in the project	7 年前
GitHub	430a5486	[Semantics] renaming StateType to SpaceType (#382 )	7 年前
Vincent Gao	ba0ecf24	fixed other tabs and spaces	7 年前
Vincent Gao	02df3b34	resolved conflicts	7 年前
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397 )	7 年前
GitHub	cc0b046d	[AddVectorObs] Made it possible to call AddVectorObs with non floats (#398 ) * [AddVectorObs] Made it possible to call AddVectorObs with int, Vector2, Vector3, List<float> and float[]. * [Comments] Made the comment clearer after overloading * [Fix] Use AddRange instead of Add when adding lists or floatarrays	7 年前
Vincent Gao	1bc43933	Merge branch 'development-0.3' into hotfix/issue#333	7 年前
Marwan Mattar	fa638000	Comment improvements & refactoring to Academy.cs Added several class and method-level comments that are compatibale with Doxygen for auto-generation of documentation. In addition to some stylistic and minor code changes (summarized below). Stylistic changes: - Modified comments to /// style instead of /** */ - Removed unnecessary imports - Removed unnecessary “private” declarations - Limited code to 80 characters per line - Re-organized variables to group those that are visible in Inspector (they are now at the top) Code changes: - Renamed ScreenConfiguration to EnvironmentConfiguration (variable only used within Academy.cs, thus no other files needed modification) - Renamed ConfigureEngine to ConfigureEnvironment and created a ConfigureEnvironmentHelper method - Renamed _isCurrentlyInference to modeSwitched to signify when the engine config needs to be changed - Added isCommunicatorOn flag to be explicit about the existence of a communicator - Made isInference private which requ...	7 年前
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 年前
GitHub	addadada	[AddVectorObs] Modified the Examples (#409 ) * [AddVectorObs] Converted the Examples to use the new AddVectorObs * [AddVectorObs] Converted the Reacher to use the new AddVectorObs * [Improvement] One liner for adding the rotation	7 年前
Marwan Mattar	7b99ccfe	Merge branch 'development-0.3' into dev-api-doc-academy	7 年前
Marwan Mattar	f1966275	Comment improvements to Agent.cs. Added several class and method-level comments that are compatibale with Doxygen for auto-generation of documentation. In addition to some stylistic and minor code changes (summarized below). Stylistic changes: - Modified comments to /// style instead of /** */ - Removed unnecessary imports - Limited code to 80 characters per line Code changes: - Change SetTextObs to accept a string, not an object - Renamed all methods that have “state” semantics to “info” semantics - Renamed _InitializeAgent as OnEnableHelper - Removed _DisableAgent, foldered into OnDisable - Renamed StoredVectorActions to storedVectorActions, similarly for StoredTextActions - Changed internal methods to protected since thats the desired behavior - Renamed _info to info and _action to action since they’re already private These refactorings had impacts on CoreBrainInternal, ExternalCommunicator, MLAgentsEditModeTest. Performed minor improvemens to Ball3DAgent and AgentEditor (re...	7 年前
Marwan Mattar	d8a6e730	Fixed OpenURL urls - Ensured consistency of how (Experimental) appears in docs.	7 年前
Marwan Mattar	6d29c6ed	Updated c# docs to avoid confusing Decision in ODD with Decision.cs	7 年前
Marwan Mattar	4d1b3ae3	Merge branch 'development-0.3' into docs/doxygen # Conflicts: # docs/doxygen/Readme.md	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	7914387f	Develop communicator redesign (#638 ) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 年前
GitHub	c17937ef	Curiosity Driven Exploration & Pyramids Environments (#739 ) * Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer. * To enable, set use_curiosity flag to true in hyperparameter file. * Includes refactor of unitytrainers model code to accommodate new feature. * Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
vincentpierre	d993c549	[Added the unity side code]	6 年前
Arthur Juliani	9dd4a81b	Fix for memory leak	6 年前
Arthur Juliani	5b52e610	Replace return w/ reference	6 年前
Arthur Juliani	993b7a1a	Use resize instead of new	6 年前
GitHub	e50ac7ae	Merge branch 'develop' into hotfix-0	6 年前
vincentpierre	60eed8f3	[Hotfix] Removed the reference to the brain in the OnEnable method of the agent to avoid errors when the agent is initialized without a brain	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	4e73f770	Merge branch 'develop' into hotfix-0.4b	6 年前
Arthur Juliani	1eb701af	Merge remote-tracking branch 'origin/develop' into develop-value-estimates-ppo	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
GitHub	3c9603d6	Demonstration Recorder (#1240 )	6 年前
GitHub	d7224351	Brains as Scriptable Objects (#1250 ) * Initial Commit Ported most functionalities, still need to : - Documentation - Add Comments - Custom drawer for BrainParameters - Fix the UnitTests - Review Functionalities * Added Custom Drawer for the Brain Parameters * Improvements to the HubDrawer * Modified the Brain Editors * Minor bug fixes and UI changes * Modified the Help Boxes of the Drawers * Modified Brain class, renamed Initialize and made DecideAction virtual * Fix the UnityTests * Simpler Brain creation menu * Renamed Internal Brain to Learning Brain * modified the parameters to remove reference to External or Internal in the Protobuf objects * Updated the protobuf generated files * Fix the Pytests * Removed the graph scope from the Learning Brain * cleaner logic than try catch * Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub * Modified how the Brain finds the A...	6 年前
Arthur Juliani	9d2a8c53	Replace AddVectorObs(float[]) and AddVectorObs(List<float>) with a more generic AddVectorObs(IEnumerable<float>) (#1540 )	6 年前
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557 ) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 年前
Vincent-Pierre BERGES	bc636075	API for sending custom protobuf messages to and from Unity. (#1595 ) * API for sending custom protobuf messages to and from Unity. * Rename custom_output to custom_outputs. * Move custom protos to their own files. * Add SetCustomOutput method. * Add docstrings. * Various adjustments. * Rename CustomParameters -> CustomResetParameters * Rename CustomOutput -> CUstomObservation * Add CustomAction * Add CustomActionResult * Remove custom action result. * Remove custom action result from Python API * Start new documentation. * Add some docstrings * Expand documentation. * Typos * Tweak doc. Also eliminate GetCustomObservation. * Fix typo. * Clarify docs. * Remove trailing whitspace	6 年前
GitHub	6f8fc130	External Contribution: Use RenderTexture instead of Camera for Visual Observation (#1824 ) * Added RenderTexture support for visual observations * Cleaned up new ObservationToTexture function * Added check for to width/height of RenderTexture * Added check to hide HelpBox unless both cameras and RenderTextures are used * Added documentation for Visual Observations using RenderTextures * Added GridWorldRenderTexture Example scene * Adjusted image size of doc images * Added GridWorld example reference * Fixed missing reference in the GridWorldRenderTexture scene and resaved the agent prefab * Fix prefab instantiation and render timing in GridWorldRenderTexture * Added screenshot and reworded documentation * Unchecked control box * Rename renderTexture * Make RenderTexture scene default for GridWorld Co-authored-by: Mads Johansen <pyjamads@gmail.com>	6 年前
GitHub	9c6dcb1b	A couple fixes for recording demonstrations (#1999 ) * Sanitize demo filenames so that they can't be too long, overflow the header, and corrupt demo files * Fix issue where 1st demo of each episode is always recorded as 0 action	6 年前
Mantas Puida	1862b6be	Multiple LSTM cell handling added to Barracuda code path	5 年前
GitHub	f13d0f11	Merge pull request #2049 from Unity-Technologies/develop-barracuda-0.2.0 Barracuda 0.2.1 -> develop	5 年前
GitHub	49f20394	Fix for vis obs memory leak in docker (#2274 ) * Fix for vis obs memory leak in docker * Remove reversions from code	5 年前
Ervin T	9ea7fea8	Use Barracuda tensors and Barracuda 0.2.4 (#2308 ) Bringing bucket of temp memory allocation optimizations: * switched to Barracuda backed tensor across the board, helps to leverage allocators and reuse of the internal buffers * added Barracuda 0.2.4 release, which bring another set of temp memory allocation fixes	5 年前
GitHub	d7bdb3a3	Fix issue with visual obs destroyed too early (#2400 )	5 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	babe9e2f	Develop remove academy done (#2519 ) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 年前
GitHub	24250c90	Move gRPC code to its own special place (#2621 ) - exclude .meta files from mixed line ending checks. - update CommunicatorObjects directory in CI scripts and Proto generation scripts.	5 年前
GitHub	2f74b3cc	Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646 )	5 年前
GitHub	57a5a717	C# hierarchical timers (#2198 ) * proof of concept - simple C# hierachical timers * fix compile error, add CustomSampler placeholder * use CustomSampler and Recorder per node * singleton, add to Batcher * output timers * raw counts and times * curly braces * timer cleaup * json serialize timers * more timer cleanup * dont accumulate from Recorders * move Timers to own file * meta file * Wait for env process to exit before killing it * timer cleanup * docstrings * undo some accidental changes * make timers closer to python * Timer unit test * getters * no => for properties * singleton * property one-liner * scientific notation, cleanup TODOs * reasonable values for root timer	5 年前
GitHub	2d92a49b	Refactor ICommunicator API (#2675 ) - Push (almost) all references to protobuf objects into the RpcCommunicator. - Simplify the passing around of Agents and Agent Infos. - Delete all references to the Batcher. - Simplify the Environment Step by removing all of the reset and message counting logic. - Finishes MLA-27 and MLA-28	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
GitHub	0fe5adc2	Develop remove memories (#2795 ) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 年前
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801 )	5 年前
GitHub	2431f184	build fixes for 2018+ (#2808 ) * rename CompressionType enum * fix standalone build test for 2018+	5 年前
GitHub	6ba6f08c	Merge 0.11.0 to develop (#2825 ) * Update package and communicator versions to 0.11 * Remove pip cache fallback for CircleCI This change removes the caching fallback in the case where dependencies change, since it can cause CI failures when we have incompatible dependencies in the cache. * Limit Tensorflow version for tests to <2.0 * Use stable bokken image. (#2815) * build fixes for 2018+ (#2808) * rename CompressionType enum * fix standalone build test for 2018+ * Add more editor versions for testing. (#2809) * class variable for API verison, fix env tests (#2817) * fixed area prefab agents were pointing to the wrong laser gameObject.	5 年前
GitHub	1934bb75	VectorSensor and StackedSensor (#2813 ) * WIP VectorSensor and StackedSensor * fix a few dumb mistakes * more VectorSensor * remove Update(), add util methods, hook into TensorGenerator * WriteApdater to write to tensors and arrays * write float observations * used circular buffer for stacked obs * cleanup * fix unit tests * docstrings * undo accidental checkins * rider suggestions, add range check * bounds check before writing * undo ProjectVersion.txt change * fix unit tests * unit test for VectorSensor * StackingSensor tests * missing meta file * missing meta file * WriteAdapter tests	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	7f77b7d7	Add ISensor.Update() (#2852 )	5 年前
GitHub	e6f549dc	[MLA-12] update protobuf for vector observations (#2862 )	5 年前
GitHub	2e6bab0d	RayPerception sensor (#2874 )	5 年前
GitHub	281626f6	deregister the same method we registered (#3072 )	5 年前
Chris Elion	59317314	WIP 2d scene	5 年前
GitHub	a074c501	Add option to search agent children for SensorComponents (#3095 )	5 年前
GitHub	a488299f	[MLA-345] float visual observations (#3148 ) * pass shape to WriteAdapter * handle floats on python side * cleanup * whitespace * rename GetFloatObservationShape, support uncompressed in RenderTexture sensor * numpy float32 * remove unused using * Float sensor and unit test * replace asserts with exceptions, docstrings	5 年前
Christopher Goy	3a355570	[rewardProviders] First stab a reward provider implementation.	5 年前
Christopher Goy	db578832	[rewardProvider] Reset the reward after calls to GetIncrementalReward, and remove the calls from Agent.	5 年前
Christopher Goy	bd2a492b	Rename LegacyRewardProvider to LowLevelRewardProvider.	5 年前
Christopher Goy	fbc37fe7	Instantiate reward provider earlier.	5 年前
Christopher Goy	969161ae	Add checks for null canvas in monitor. Instantiate reward provider at construction time.	5 年前
Christopher Goy	bbeb952e	Remove unused variable.	5 年前
GitHub	f97bcf1c	Decoupling IPolicy from Agent (#3203 ) * initial commit * Fixed the compilation errors * fixing the tests * Addressing the comment about the brain parameters * Fixing typo * Made timers more accurate * addressing comments * Better memory allocation * Added some docstrings * Adding better sensor validation * Wrapped in #if DEBUG and also wrapped GenerateSensorData in a timer * Timer changes	5 年前
GitHub	4269447e	Convert Academy to a singleton (#3210 )	5 年前
Christopher Goy	310c94ba	Reintroduce a base RewardProviderComponent. Make changes based on PR feedback.	5 年前
Christopher Goy	0d9511d4	Remove extra lines.	5 年前
GitHub	fbb5022a	add NaN checks to reward and observation in C# (#3221 )	5 年前
Christopher Goy	1b618b49	Rename private property. Assert that the component isn't null.	5 年前
Christopher Goy	fa2614e6	Add an update for the Reward Provider in Agent. Rename some variable. Update docs.	5 年前
Christopher Goy	0e5b4975	Rename function to properly describe its behavior.	5 年前
GitHub	6451f564	write observations directly to protobuf (#3229 ) * write observations directly to protobuf * docstring and comment about Capacity	5 年前
Christopher Goy	718650c0	Modifications to reward providers.	5 年前
GitHub	0366af0b	Always reset when agent is done (#3222 ) * Removing the AgentOnDone call * removing editor inspector field for ResetOnDone * Documentation changes * addressing comments * addressing comments * adding comments * Migrating steps * inference - fill 0s for done Agents (#3232) * fill 0s for done agents * docstrings * Simplifying the code * Removing GenerateSensorData * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	33f09a49	Simplifying the Agent reset logic (#3242 ) * Simplifying the Agent reset logic - Agents will reset in ResetIfDone immediately after being marked Done - Agents will always request a decision right after reset - This change implies that additional messages might be sent to Python * Fixing the Unit Tests * Added a note in the Migrating.md document	5 年前
GitHub	03664e75	Make On Demand Decision the default (#3243 ) * Added a simple Decision Requester * Modified the prefabs * Fixing the tests and removing fields from Agent parameters * Migrating.md * addressing comments * addressing comments	5 年前
GitHub	a1a1126d	Trim some public fields on the Agent (#3269 ) * Triming some of the methods of the agent but left SetReward * Fixing bugs * modifying the environments * Reintroducing IsDone and IsMaxStepReached * Updating the Migrating doc * more details on the Migration	5 年前
GitHub	590559e7	Make the Agent reset immediately after Done (#3291 ) * Made the Agent reset immediately * fixing the C# tests * Fixing the tests still * Trying with incremental episode ids * deleting buffer rather than using an empty list * Addressing the comments * Forgot to edit the comment on AgentInfo * Updating the migrating doc * Fixed an obvious bug * cleaning after an agent is done in agent processor * Fixing the pytest errors	5 年前
GitHub	18fc5131	Format code and add .editorconfig to our package. (#3305 )	5 年前
GitHub	2db09cef	Model override from commandline (#3265 ) * WIP model override from commandline * Agent lazy init, multiple overrides * MLAgentsExamples namespace * add model override to 3dball	5 年前
GitHub	c6e5b23e	Develop return float array (#3319 ) * Decide Action to return float array * Removing Debug statement * Fixing the tests * Fixing the format * Renaming some variables * Better memory allocation	5 年前
GitHub	620fa24a	Track reward for inference (#3320 ) * WIP * add reward stats * const, dont write timers on mobile	5 年前
GitHub	9b72aab2	Making some fields and properties internal (#3342 ) * Making some fields and properties internal * Fixing the formating * Making more things internal * Adressing the comments * reverting the changes made to the recorder * WriteAdapter public * Have to make AgentInfo and TensorProxy public because of changes to write adapter and the demorecorder	5 年前
GitHub	0c4d68d1	Exposing the last action in the Agent API (#3351 ) * Exposing the last action in the Agent API * INDENTATION makes me \t\t\t mad * Update Agent.cs	5 年前
GitHub	d1644496	Remove UpdateAgentAction (#3373 )	5 年前
GitHub	386ba66c	Develop observation collector (#3352 ) * Add the VectorSensor to the CollectObservation call * Example of API change for BalanceBall * Modified the Examples * Changes to the migrating doc * Editing the docs * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing comments * Removed the MLAgents.Sensor namespace * Removing the MLAgents.Sensor namespace from the tests * Editing the migrating docs Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	84161e7a	csharp cleanup (#3392 ) * clean up examples and tests * more cleanup * m_UseChildSensors	5 年前
GitHub	51f7690d	Fix off-by-one error on AgentReset and maxSteps (#3394 ) * Fix ballance ball 100 reward * Re-test * Add test for maxSteps and number of AgentActions Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	d20bda06	csharp cleanup (#3392 ) (#3395 ) * clean up examples and tests * more cleanup * m_UseChildSensors	5 年前
GitHub	92a8aed2	Pass action masker as input to CollectObservations (#3389 ) * Sentencing Action masking the same as observations I am rather unsure about the doubling of the CollectObservation methods (and the copy pasta that comes along) Need to edit the documentation and the migrating doc once we agree we want to do this * Addressing the comments * Improvements to the documentation * Editing the documentation	5 年前
GitHub	44f88933	Make a serialization upgrade path for maxStep. (#3424 ) - Test comment out as we need a package that is only verified for 2019.2.	5 年前
GitHub	85d6d9dd	Remove unity.editor namespace from Agent.cs. (#3433 )	5 年前
GitHub	6f5bb92a	Make a serialization upgrade path for maxStep. (#3424 ) (#3434 ) - Test comment out as we need a package that is only verified for 2019.2.	5 年前
Anupam Bhatnagar	d8c79f48	resolving merge conflicts	5 年前
Anupam Bhatnagar	1c924d6a	Make the demoRecorder write the experience on reset (#3463 ) * Make the demoRecorder write the experience on reset * do nothing if demostore is null * Calling reset data if the action is null	5 年前
GitHub	d072e091	[upkeep] Add a dev project to take advantage of package that only work with 2019.x or newer. (#3452 )	5 年前
GitHub	413de82e	Make the demoRecorder write the experience on reset (#3463 ) * Make the demoRecorder write the experience on reset * do nothing if demostore is null * Calling reset data if the action is null	5 年前
GitHub	47649555	C# and Python checks for infinity and NaN. (#3418 )	5 年前
GitHub	764d8948	Develop modify stepping logic (#3448 ) * Moving the max step logic - Created a new Academy Event called AgentIncrementStep to be called before SetStatus - Implemented the AgentSteping logic * second commit : Moving the step counting at the begining. I had to edit the tests but I think they are now closer to what we want * addressing comments * Update com.unity.ml-agents/Runtime/Agent.cs Co-Authored-By: Chris Goy <goyenator@gmail.com> * Update com.unity.ml-agents/Runtime/Agent.cs Co-Authored-By: Chris Goy <goyenator@gmail.com> * Made the tests not be broken * Update com.unity.ml-agents/Runtime/Agent.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * step logic changes: unit test (#3467) * Added a line in the changelog Co-authored-by: Chris Goy <christopherg@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	c55cb4df	Replace Agent.GetStepCount with Agent.StepCount` (#3476 )	5 年前
GitHub	a5d0cf3c	Refactor DemonstrationStore/Recorder (#3354 )	5 年前
GitHub	cd0a38c3	Develop mm validation fixes (#3487 ) * Doc fixes for Agent and Academy to remove all validation errors. * Made `ScaleAction()` in `Agent.cs` static.	5 年前
GitHub	ecd13c8a	Move Demonstration code to sub-folder (#3488 )	5 年前
GitHub	f25bf7d3	Reintroduce MLAgents.Sensors namespace (#3509 ) * Reintroduced the namespace MLAgents.Sensors * Documentation changes * updated the changelog	5 年前
GitHub	8ce9dcfd	add DoneReason enum to Agent (#3517 )	5 年前
GitHub	9a371b17	[Renaming] SetActionMask -> SetDiscreteActionMask + added the virtual method CollectDiscreteActionMasks (#3525 ) * Code edits * Modified the markdowns * Update com.unity.ml-agents/CHANGELOG.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Renaming files and methods * Addressing comments * Update docs/Learning-Environment-Design-Agents.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <celion@gmail.com>	5 年前
GitHub	e1a0f41b	Added the MLAgents.Demonstrations namespace (#3532 ) * Added the MLAgents.Demonstrations namespace * Added the MLAgents.Editor namespace * Overrided the .demo.meta files due to the change in namespace	5 年前
GitHub	b9bd4df2	Modified some namespaces (#3533 ) * Added the MLAgents.Demonstrations namespace * Added the MLAgents.Editor namespace * Overrided the .demo.meta files due to the change in namespace * More namespace changes * Added the sidechannels namespace * Modified changelog and migrating docs	5 年前
GitHub	e7ec5007	[change] Make Agent non-abstract, update Basic scene. (#3528 )	5 年前
GitHub	4e747130	Renaming AgentInfo.actionMasks to AgentInfo.DiscreteActionMasks (#3539 )	5 年前
GitHub	a27117a4	Step sensor for Heuristic policy (#3542 )	5 年前
GitHub	47755a62	Made BehaviorParameters internal (#3546 ) * Made the BrainParameters internal * Editing the docs * [skip-ci] A lot more controversial * [skip ci] Added formerly serialized as * Use cached BehaviorParameters	5 年前
GitHub	91bbcabb	Added a numberOfActions get property on the BrainParameters (#3571 ) * Added a numberOfActions get property on the BrainParameters * forgot one place to replace * [skip ci] numberOfActions --> numActions	5 年前
GitHub	c0e88aa8	Made BehaviorType public and added the SetBehaviorType method (#3572 ) * Made BehaviorType public and added the SetBehaviorType method * Moving the comments around and making Setting the behaviorType to the same value do nothing	5 年前
GitHub	7697492d	Added missing docstrings (#3577 )	5 年前
GitHub	0f381ab9	Remove ForceReset from Agent. (#3575 )	5 年前
Chris Elion	a2ad53be	apply auto-formatting	5 年前
GitHub	5b7975ad	[bugfix] Fix MLA-793 Make Unity lifecycle methods protected. Added tests for changes (#3590 )	5 年前
Chris Elion	841b0937	SideChannel helper messages	5 年前
Chris Elion	b4ce35a2	Behavior and BrainParameters back to public	5 年前
Chris Elion	ddec91cd	formatting	5 年前
GitHub	411bb64a	Renaming Agent's methods (#3557 ) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message :Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 年前
Chris Elion	fcfd0f7f	add and use Agent.ReloadPolicy()	5 年前
Chris Elion	fa5e7e6d	Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public	5 年前
GitHub	119141fb	Make the agent begin episode at initialization (#3605 ) * Make the agent begin episode at initialization * Renaming and adding a comment * [skip ci] Update com.unity.ml-agents/Tests/Editor/MLAgentsEditModeTest.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] Update com.unity.ml-agents/Tests/Editor/MLAgentsEditModeTest.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * renamed test variables and modified some test statements * Use TotalStepCount rather than HadFirstReset * [skip ci] Renamed HadFirstReset to m_HadFirstReset Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	eeeb09b3	Make most property setters public. (#3602 )	5 年前
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698 ) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 年前
GitHub	eeb0c74d	Add CompletedEpisodes counter (#3724 )	5 年前
GitHub	89237f96	Reset StackingSensor when the Agent resets (#3727 ) * sensors.Reset() WIP * fix test implementations * call reset from Agent	5 年前
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681 ) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 年前
GitHub	e44004d2	Fix walljump warning. (#3746 ) * Initial commit, need more work on the test * Fixing the tests * [skip ci] Update com.unity.ml-agents/Runtime/Agent.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Bug fixing : Do nothing if here are no VectorObs Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	aae58330	Merge branch 'master' into develop-add-inference-examples	5 年前
GitHub	dd6aa7e2	Agent.Heuristic takes an float[] (#3765 )	5 年前
GitHub	8b5587cc	Remove obsolete methods from Agent class (#3770 ) * Removed the obsolete methods from the Agent class * Documentation changes * [skip ci] Update com.unity.ml-agents/CHANGELOG.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] Update docs/Migrating.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	3fb490f6	Better memory allocation for Heuristic (#3785 ) * Fixing some issues with previous action and better memory allocation in the Heuristic Policy * Copying the data rather than using references * [skip ci]Update com.unity.ml-agents/Runtime/Agent.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Addressing comments Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	bc45453b	AcademyStepper to DontDestroyOnLoad (#3789 ) * AcademyStepper to DontDestroyOnLoad * Adding a try-catch around DontDestroyOnLoad because it cannot be used in Editor Tests	5 年前
GitHub	576ebc67	Fixing package validation errors. (#3808 ) * Fixing package validation errors. This impacts our API as two public variables have been made private. TODO: fix our CI to catch these automatically, per commit. * Changelog changes.	5 年前
GitHub	3a4a6792	access to observations in Heuristic (#3825 ) * access to observations in Heuristic * changelog	5 年前
GitHub	256431f7	Doc review (#3803 ) * Edit and review package docs. * Filter out testa and internal namespaces. * remove offsetStep field that was accidentally revivified * Resolving review comments * Update com.unity.ml-agents/Runtime/Agent.cs * fix trailing whitespace * Revised Agent class intro and step description * Fixed a few missed comments. * removed prerelease warning Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	0a7e53be	Code Style - apply PascalCase (#3828 ) * cleanup a few classes * cleanup raycast code * more capitalization * more renames * changelog and migration * fix MaxStep in docs * doc string	5 年前
GitHub	1e0b022f	[MLA-850] rename namespaces to Unity.MLAgents (#3843 ) * rename in protos * rename in C# * doc changes, migration, changelog * PR numbers * fix standalone test path	5 年前
GitHub	d4bbecc1	apply Rider suggestions to API code (#3847 )	5 年前
GitHub	c7722f73	[barracuda] Update Barracuda to 0.7.0-preview. (#3873 )	5 年前
GitHub	1e582745	Doc link fix (#3865 ) * Make all doc links point to release_1_docs tag * fix 0.15.1 link * relative links in readme * fix link in env warnings * more link fixes	5 年前
GitHub	731fb88b	[barracuda] Update Barracuda to 0.7.0-preview (#3875 )	5 年前
GitHub	6a98f07f	replace some crefs with direct links (#3878 )	5 年前
GitHub	0dff739b	Release mm GitHub docs (#3864 ) * Improvements to Key Components section of ML-Agents Overview - Moved some documentation from Learning-Environment-Design. - Added the trainers vs LL-API separation. - Made a note about gym-unity. - Some update to the Agent/Behavior sections - Updated diagrams to reflect new side channels. Made Behavior type a consistent color. * Reorganizing the overview file and creating new (empty) sections This change defines the new structure for the overview doc. Subsequent commits will fill in the sections and rewrite existing sections. * Reorganizing the main Training ML-Agents page Re-organizes into feature-specific sections that somewhat mirror the previous commit of reorganizing the overview doc. Subsequent commits will populate these empty sections. * Adding Deep RL - Update ML-Agents-Overview with description of DeepRL training algorithms - Decribe the common and trainer-specific hyperparams in Training-ML-Agents. - Removed ...	5 年前
Chris Elion	68b68396	Merge remote-tracking branch 'origin/master' into release_1_to_master	5 年前
GitHub	4eeb7f55	Release 2 docs (#3976 ) * Add v1.0 blog post and update reference paper. (#3947) * Develop mm fix readme releases (#3966) * Fix broken link and clean-up Releases section. * Updated link to be consistent with the table. * Update one of the bullets for consistency. * update table, add Versioning doc * release_2_docs Co-authored-by: Marwan Mattar <marwan@unity3d.com>	5 年前
GitHub	a54aef02	[MLA-1223] Backport Heuristic fixes (#4176 ) * [bugfix] Make FoodCollector heuristic playable (#4147) * Modified the documentation of the Heuristic method (default action = previous action) (#4174) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	197cf3e7	ObservableAttribute (#3925 ) * ObservableAttribute proof-of-concept * restructue sensors, add int impl, unit test * add vector3 sensor, cleanup constructors * add more types * account for observables in barracuda checks * iterators for observable fields/props * stacking, fix obs size in prefab * use DeclaredOnly to filter members * ignore write-only properties * fix error message * docstrings * agent enum (WIP) * agent enum and unit tests * fix comment * cleanup TODO * ignore by default, rename declaredOnly param, docstrings * fix tests * rename, cleanup, revert FoodCollector * warning for write-only, no exception for invalid type * move observableAttributeHandling to BehaviorParameters * autoformatting * changelog * fix up sensor creation logic	5 年前
GitHub	75689a87	Merge release 2 to master (#4000 ) * update versions for patch release (#3970) * update versions for patch releae * Update precommit flake8 (#3961) * fix changelog * Release 2 cherry pick (#3971) * [bug-fix] Fix issue with initialize not resetting step count (#3962) * Develop better error message for #3953 (#3963) * Making the error for wrong number of agents raise consistently * Better error message for inputs of wrong dimensions * Fix #3932, stop the editor from going into a loop when a prefab is selected. (#3949) * Minor doc updates to release * add unit tests and fix exceptions (#3930) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com> * update changelog (#3975) * [docs] Add memory_size hyperparameter (#3973) * Release 2 docs (#3976) * Add v1.0 blog post and update reference paper. (#3947) * Develop mm fix readme rel...	5 年前
GitHub	56d07c4b	Release 2 verified update docs (#4535 )	4 年前
GitHub	5066c28e	backport fix for recursion in user code (#4638 ) * backport fix for recursion in user code	4 年前
GitHub	b7eb8b6d	Clarification in the Heuristic() documentation (#4100 ) * Clarification in the Heuristic() documentation The `Heuristic()` method will not be able to write to the action array if the action array passed as argument is reassigned in the method. For example, doing : ```csharp public override void Heuristic(float[] actionsOut) { actionOut = new float[2]; actionOut[0] = 1.0f; } ``` Will not create the action [1, 0] but [0, 0] as the `actionOut` variable was reassigned. * adding to the Agent xml doc	4 年前
GitHub	306c2f8c	[docs] Update doc links (#4145 ) * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Update to release_3 in installation.md (#4144) Co-authored-by: Yuan Gao <xiaomaogy88@gmail.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
Yuan Gao	2e30fdcb	Replaced all of the doc to release_3_doc	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	af9abb6c	[MLA-1009] observable performance tests (#4031 ) * WIP perf tests * WIP perf test * add marker tests too * move to devproject * yamato first pass * chmod * fix trigger, fix meta files * fix utr command * fix artifact paths * Update com.unity.ml-agents-performance.yml * test properties, reduce some noise * timer around RequestDecision * actually set ObservableAttributeHandling * undo asmdef changes	4 年前
GitHub	6ee553d8	Modified the documentation of the Heuristic method (default action = previous action) (#4174 ) * Modifying the documentation to explain that Heuristic method default action will be the previous action decided by the heuristic. Changing this behavior would be a breking change. * Rephrase the working of the documentation of the default action of the Heuristic method * Forgot an import	4 年前
Ervin Teng	2fc4fe16	Add AgentParametersChannel	4 年前
GitHub	0e0daf47	[add-fire] Merge post-0.19.0 master into add-fire (#4328 )	4 年前
Christopher Goy	ab57d838	Update links in code, and relevant markdown files.	4 年前
GitHub	3a7572b4	Integrate IActuators into ML-Agents core code. (#4315 ) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	bb9417f7	Update example environments to use the Actuator API (#4363 )	4 年前
Christopher Goy	5a233353	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 年前
GitHub	e7916b08	add pre-commit hook for dotnet-format (#4362 )	4 年前
Christopher Goy	061a6c43	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 年前
GitHub	31919e08	[MLA-1267] Account for actuators in training and inference. (#4371 )	4 年前
Scott Jordan	52ec9230	Merge branch 'develop-taggedobservations' into active-variablespeed	4 年前
GitHub	7a012c5b	allow ending the episode for MaxStepsReached (#4453 ) * allow ending the episode for MaxStepsReached * changelog * rename and update docs	4 年前
GitHub	53c13a29	docstrings and cleanup around actuators (#4467 ) * docstrings and cleanup around actuators * move ActionSpec property from IActionReceiver to IActuator	4 年前
GitHub	847e6638	[release 7] update versions on release branch (#4470 ) * update release versions * update changelog * add PR number * changelog	4 年前
GitHub	3893adcc	Misc doc fixes (#4483 ) * Fix inheritdoc usage * undo barracuda changes * fix \'the the\' * reword * tweaks to env build instructions * changelog * Update docs/Learning-Environment-Executable.md Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	0cfaddc4	Don't call NotifyDone in Agent.OnDisable if Academy is shut down (#4489 )	4 年前
GitHub	9c7aa728	[Release 8] update versions on release branch (#4550 )	4 年前
GitHub	a4ed3660	Update release_9 versions (#4621 )	4 年前
GitHub	024bb104	[MLA-1474] detect recursion on Agent methods and throw (#4573 ) * recursion checker proof-of-concept * checkers on agent * cleanup and unit tests * changelog * extra test * update comment	4 年前
Ruo-Ping Dong	9e08be87	Merge branch 'master' into release_9_branch_merge	4 年前
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704 )	4 年前
GitHub	f5747db5	Changing the versions for the release_10 (#4643 )	4 年前
GitHub	94c59e31	C# changes for hybrid action spaces (#4587 ) * Add hybrid action capability flag (#4576) * Change BrainParametersProto to support ActionSpec (#4579) * Assign new BrainParametersProto fields based on capabilities (#4581) * ActionBuffer with hybrid actions for RemotePolicy (#4592) * Barracuda inference for hybrid actions (#4611) * Refactor BarracudaModel loader checks (#4629) * Export separate nodes for continuous/discrete actions (#4655) * Separate continuous/discrete actions in AgentActionProto (#4698) * Force different nodes for new and deprecated action output (#4705)	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	ddfacd86	Integrate BrainParameters with ActionSpec, update BrainParametersDrawer (#4718 ) * make actionSpec not read-only * add actionSpec in BrainParameters and update BrainParameters Drawer * add serialization callbacks * enable hybrid ParameterLoaderTest	4 年前
GitHub	5fbffd3a	update lib versions and references to release 10 (#4755 )	4 年前
GitHub	4fd0c8fe	Uncomment obsolete attributes, fix warnings (#4771 ) (#4774 ) * uncomment obsolete attributes, fix warnings * Apply suggestions from code review Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	c6000214	Uncomment obsolete attributes, fix warnings (#4771 ) * uncomment obsolete attributes, fix warnings * Apply suggestions from code review Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	8f389445	More misc hybrid action followup (#4777 )	4 年前
GitHub	19daa8f8	R12 update docs tags (#4795 ) * update package version and release tag, update doc links * changelog * table	4 年前
Andrew Cohen	c0d01baf	Merge branch 'master' into merge-release11-master	4 年前
Ruo-Ping Dong	4bad484b	very rough sketch for TeamManager interface	4 年前
Ruo-Ping Dong	ef054af0	team manager for hallway	4 年前
Chris Elion	76ebc20c	Merge remote-tracking branch 'origin/master' into r12-to-master	4 年前
Ruo-Ping Dong	180d3e20	Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager	4 年前
GitHub	70220f95	Team manager prototype (#4850 ) * remove group id * very rough sketch for TeamManager interface * add team manager id to proto * team manager for hallway * add manager to hallway * send and process team manager id * remove print * small cleanup Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Ruo-Ping Dong	7303985e	add team reward	4 年前
Ruo-Ping Dong	224d2087	add team reward	4 年前
Ruo-Ping Dong	910da750	change teammanager id from string to int	4 年前
Ruo-Ping Dong	0a2b5c5f	add option. fix EndEpisode bug. Fix reset m_reward bug in agent	4 年前
Ruo-Ping Dong	3f2aff32	small fix	4 年前
Ruo-Ping Dong	6d1dcb15	change manager id from string to int	4 年前
Ruo-Ping Dong	0bacf564	disable copy reward	4 年前
Ruo-Ping Dong	6f0bb2a4	add base team manager	4 年前
Ruo-Ping Dong	e2451ce5	add base team manager	4 年前
Ruo-Ping Dong	a79d484d	refactor PushBlockTeamManager	4 年前
Ruo-Ping Dong	40766a36	add team reward field to agent and proto	4 年前
Ruo-Ping Dong	90c9280e	add team reward field to agent and proto	4 年前
Ruo-Ping Dong	c826f52c	set team reward	4 年前
Ruo-Ping Dong	9ee89544	set team reward	4 年前
GitHub	399f99e7	Initial implementation using IHeuristicProvider. (#4849 ) - Actuators can now optionally implement IHeuristicProvider to generate heuristic actions for agents. Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Ruo-Ping Dong	ab4ec610	add maxstep to teammanager and hook to academy	4 年前
Ruo-Ping Dong	438f1d25	add maxstep to teammanager and hook to academy	4 年前
Ruo-Ping Dong	5d10c019	remove manager from academy when dispose	4 年前
Ruo-Ping Dong	8748f561	use 0 as default manager id	4 年前
Ruo-Ping Dong	ff4e57f2	fix setTeamReward Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Ruo-Ping Dong	0006cd7f	address comments	4 年前
Ruo-Ping Dong	596a540c	use delegate to avoid agent-manager cyclic reference	4 年前
Ruo-Ping Dong	bef5ae8e	fix unregister agents	4 年前
Ruo-Ping Dong	a487d0a2	unregister on disabled	4 年前
Ruo-Ping Dong	aad7d342	add base team manager	4 年前
Ruo-Ping Dong	918c2dcd	change name TeamManager to MultiAgentGroup	4 年前
Ruo-Ping Dong	f547f201	add team reward field to agent and proto	4 年前
Ruo-Ping Dong	38621840	set team reward	4 年前
GitHub	2bc19b68	Add CreateActuators API, obsolete old method. (#4899 ) * Add CreateActuators method to the ActuatorComponent class which wraps the original method. The original method will be removed in the future. Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Ruo-Ping Dong	b99c6f8b	add maxstep to teammanager and hook to academy	4 年前
Ruo-Ping Dong	33b11ab2	add some doc	4 年前
Ruo-Ping Dong	0ed78a36	remove manager from academy when dispose	4 年前
Ruo-Ping Dong	c22ed805	use 0 as default manager id	4 年前
GitHub	790f2d32	fix setTeamReward Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Ruo-Ping Dong	83344b9c	address comments	4 年前
Ruo-Ping Dong	63cf2d12	use delegate to avoid agent-manager cyclic reference	4 年前
Ervin Teng	b6f88d6d	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Ruo-Ping Dong	55265294	fix unregister agents	4 年前
Ervin Teng	3fbed6dc	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Ruo-Ping Dong	3e560168	unregister on disabled	4 年前
Ruo-Ping Dong	7271556f	change name TeamManager to MultiAgentGroup	4 年前
Ruo-Ping Dong	e8e91bc0	add some doc	4 年前
Ruo-Ping Dong	c87bce9e	Merge branch 'master' into develop-base-teammanager	4 年前
GitHub	f83fc474	Release 13 versions. (#4946 ) - updated release tag validation script to automate the updating of files with release tags that need to be changed as part of the pre-commit operation.	4 年前
vincentpierre	7298e889	Refactor of ModelParmLoaderChecks	4 年前
GitHub	d1f0fc4c	[MLA-1783] built-in actuator type (#4950 )	4 年前
Christopher Goy	9cadfa7a	Merge master -> release_13_branch-to-master	4 年前
Ruo-Ping Dong	b5da488d	Merge branch 'master' into develop-base-teammanager	4 年前
GitHub	ff146cbe	Update versions for release 14 hotfix. (#5040 )	4 年前
GitHub	ddb01eb2	MultiAgentGroup Interface (#4923 ) * add SimpleMultiAgentGroup * add group reward field to agent and proto	4 年前
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 年前
GitHub	c9153aa7	Removing Obsolete methods from the package (#5024 ) * Removing Obsolete methods from the package * Missing depecration and modified changelog * Readding the obsolete BrainParameter methods, will need a larger discussion on these * Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters * removing documentation and some calls to deprecated methods in the extensions package * Editing the Changelog to put the unreleased on top	4 年前
Ervin Teng	c8137dcd	Merge branch 'main' into develop-superpush-int	4 年前
GitHub	4863475c	non-IEnumerable interface for action masking (#5060 )	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
Christopher Goy	ebe45056	Merge branch 'main' into release_14_branch-to-main	4 年前
Ervin Teng	1f026c70	Merge branch 'main' into develop-superpush-branch-cleanup	4 年前
GitHub	7606cd68	[release_15] Release 15 update versions (#5101 ) * Update versions * Fix for validate release links * Update release tag and docs	4 年前
Ervin Teng	4b95fa34	[bug fix] Fix warning using demo recorder (#5216 ) (cherry picked from commit 272899a8a100b4ff1bbcaa575d2bc46965fc5938)	4 年前
Christopher Goy	c9be2433	Removing Obsolete methods from the package (#5024 ) * Removing Obsolete methods from the package * Missing depecration and modified changelog * Readding the obsolete BrainParameter methods, will need a larger discussion on these * Removing Action Masker, readding the warining when using a non-implemented Heuristic, Removing NumAction from Brain Parameters * removing documentation and some calls to deprecated methods in the extensions package * Editing the Changelog to put the unreleased on top	4 年前
Christopher Goy	092c2718	non-IEnumerable interface for action masking (#5060 )	4 年前
GitHub	9b1b17c6	[Release 16] Update Python and release versions (#5234 ) * Update Python and release versions * Tick C# versions	4 年前
Andrew Cohen	18be47e8	Merge branch 'main' into develop-soccer-groupman-mod	4 年前
GitHub	3d53ec5a	Turns physics modules into optional dependencies. (#5112 )	4 年前
GitHub	734baf16	change default barracuda behavior (#5175 )	4 年前
GitHub	65bbb10b	[MLA-1824] make SensorComponent return ISensor[] (#5181 ) * Make SensorComponent return an array * split match3 sensors, partial retrain * docstrings, migration, changelog, cleanup	4 年前
GitHub	5415b004	[MLA-1879] culture-invariant sorting for sensors and actuators (#5194 )	4 年前
GitHub	acc9ba45	[bug fix] Fix warning using demo recorder (#5216 )	4 年前
vincentpierre	7cece532	BASIC WORKS	4 年前
GitHub	41f38daa	[MLA-1909] Match3 and Camera/RenderTexture sensor GC improvements (#5233 )	4 年前
GitHub	03af9322	avoid empty set iteration, avoid Debug.AssertFormat (#5246 ) * avoid empty set iteration, avoid Debug.AssertFormat * changelog	4 年前
GitHub	76077fa8	[Release 16] Release 16 Merge Back to Main (#5255 ) Update versions and documentation for Release 16. Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	fabc492e	fix all PVS and doc generation warnings (#5262 )	4 年前
GitHub	ce4ad782	Release 17 version bumps and docs version bumps (#5280 )	4 年前
GitHub	8bb1fe6a	[WIP] [Fix] Fixing collect observation called on done (#5375 ) * [WIP] [Fix] Fixing collect observation called on done * Update com.unity.ml-agents/Runtime/Agent.cs * ⚠️ Modifying the test of stacking sensor when the agent is done * modifying the documentation for BufferSensor to specify to call AddObservation in the CollectObservations method	4 年前
GitHub	9354ca64	[Release 18] Update versions and links (#5414 )	3 年前

1 2 3 4 5 ...

296 次代码提交 (674540de-b9d2-4df8-9dec-5cee3e226711)