ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	65df8ae9	fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step	7 年前
GitHub	aee5d336	Fix discrete state (#33 ) * made BrainParameters a class to set default values Modified the error message if the state is discrete * Add discrete state support to PPO and provide discrete state example environment * Add flexibility to continuous control as well * Finish PPO flexible model generation implementation * Fix formatting * Support color observations * Add best practices document * bug fix for non square observations * Update Readme.md * Remove scipy dependency * Add installation doc	7 年前
vincentpierre	3f85bb56	Merge branch 'master' into dev-broadcast	7 年前
vincentpierre	447da485	fix on the CoreBrains so that if one Corebrain gets eraised, it will be reinstanciated	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
vincentpierre	4fcc6fbc	fix so we can now destroy the agent in AgentOnDone()	7 年前
vincentpierre	56c8914f	An agent can now spawn an agent in AgentStep()	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
Arthur Juliani	15f10de0	Added tooltip and helpURL to ML-Agents scripts (#276 )	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
Arthur Juliani	c42eff57	Misc fixes	7 年前
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353 ) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	430a5486	[Semantics] renaming StateType to SpaceType (#382 )	7 年前
Vincent Gao	ba0ecf24	fixed other tabs and spaces	7 年前
Vincent Gao	02df3b34	resolved conflicts	7 年前
Marwan Mattar	fa638000	Comment improvements & refactoring to Academy.cs Added several class and method-level comments that are compatibale with Doxygen for auto-generation of documentation. In addition to some stylistic and minor code changes (summarized below). Stylistic changes: - Modified comments to /// style instead of /** */ - Removed unnecessary imports - Removed unnecessary “private” declarations - Limited code to 80 characters per line - Re-organized variables to group those that are visible in Inspector (they are now at the top) Code changes: - Renamed ScreenConfiguration to EnvironmentConfiguration (variable only used within Academy.cs, thus no other files needed modification) - Renamed ConfigureEngine to ConfigureEnvironment and created a ConfigureEnvironmentHelper method - Renamed _isCurrentlyInference to modeSwitched to signify when the engine config needs to be changed - Added isCommunicatorOn flag to be explicit about the existence of a communicator - Made isInference private which requ...	7 年前
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 年前
GitHub	6b5fa5ee	[Fix] If the brain is inactive, an error will be thrown if an agent t… (#412 ) * [Fix] If the brain is inactive, an error will be thrown if an agent tries to request a decision from it * [Typo] Included feedback from comments	7 年前
Marwan Mattar	bab02a21	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Scripts/Brain.cs	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
Marwan Mattar	d8a6e730	Fixed OpenURL urls - Ensured consistency of how (Experimental) appears in docs.	7 年前
GitHub	7914387f	Develop communicator redesign (#638 ) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	3c9603d6	Demonstration Recorder (#1240 )	6 年前
GitHub	d7224351	Brains as Scriptable Objects (#1250 ) * Initial Commit Ported most functionalities, still need to : - Documentation - Add Comments - Custom drawer for BrainParameters - Fix the UnitTests - Review Functionalities * Added Custom Drawer for the Brain Parameters * Improvements to the HubDrawer * Modified the Brain Editors * Minor bug fixes and UI changes * Modified the Help Boxes of the Drawers * Modified Brain class, renamed Initialize and made DecideAction virtual * Fix the UnityTests * Simpler Brain creation menu * Renamed Internal Brain to Learning Brain * modified the parameters to remove reference to External or Internal in the Protobuf objects * Updated the protobuf generated files * Fix the Pytests * Removed the graph scope from the Learning Brain * cleaner logic than try catch * Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub * Modified how the Brain finds the A...	6 年前
GitHub	6c354d16	New Learning Brain (#1303 ) * Initial Commit * attempt at refactor * Put all static methods into the CoreInternalBrain * improvements * more testing * modifications * renamed epsilon * misc * Now supports discrete actions * added discrete support and RNN and visual. Left to do is refactor and save variables into models * code cleaning * made a tensor generator and applier * fix on the models.py file * Moved the Checks to a different Class * Added some unit tests * BugFix * Need to generate the output tensors as well as inputs before executing the graph * Made NodeNames static and created a new namespace * Added comments to the TensorAppliers * Started adding comments on the TensorGenerators code * Added comments for the Tensor Generator * Moving the helper classes into a separate folder * Added initial comments to the TensorChecks * Renamed NodeNames -> TensorNames * Removing warnings in tests * Now using Aut...	6 年前
Vincent-Pierre BERGES	9b00c012	Fix for Brains not reinitialising when the scene is reloaded. (#1758 ) * Fix for Brains not reinitialising when the scene is reloaded. This was a bug caused by the conversion of Brains over to ScriptableObjects. ScriptableObjects persist in memory between scene changes, which means that after a scene change the Brains would still be initialised and the agentInfos list would contain invalid references to the Agents from the previous scene. The fix is to have the Academy notify the Brains when it is destroyed. This allows the Brains to clean themselves up and transition back to an uninitialised state. After the new scene is loaded, the Brain's LazyInitialise will reconnect the Brain to the new Academy as expected. * Fix for Brains not reinitialising when the scene is reloaded. This was a bug caused by the conversion of Brains over to ScriptableObjects. ScriptableObjects persist in memory between scene changes, which means that after a scene change the Brains would still be...	6 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	20bf7c42	use [] instead of Dictionary.Add to avoid key exception (#2629 )	5 年前
GitHub	24ba9d58	Develop deprecate broadcasting (#2669 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modifie...	5 年前
GitHub	2d92a49b	Refactor ICommunicator API (#2675 ) - Push (almost) all references to protobuf objects into the RpcCommunicator. - Simplify the passing around of Agents and Agent Infos. - Delete all references to the Batcher. - Simplify the Environment Step by removing all of the reset and message counting logic. - Finishes MLA-27 and MLA-28	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前

39 次代码提交 (3fca9b66-52f3-42bf-ba9e-035c10cc22a6)