* More efficiently allocate memory when sending states
* Code clean-up
* Additional changes
* More GC reduction
* Remove state list initialization from example environments
* Use the built-in JSON tool to serialize the state message
* Remove commented code
* Use the more efficient CompareTag instead of string tag comparison (see the sketch after this list)
* Place comments before the code they describe
* Use type inference where appropriate
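For context, the CompareTag item refers to Unity's allocation-free tag check; a minimal sketch of the pattern, with a hypothetical class and tag name:

```csharp
using UnityEngine;

// Hypothetical handler illustrating the CompareTag pattern.
public class GoalDetector : MonoBehaviour
{
    void OnTriggerEnter(Collider other)
    {
        // `other.tag == "goal"` allocates a managed string on every call;
        // CompareTag performs the same check without that allocation,
        // which reduces GC pressure in per-frame physics callbacks.
        if (other.CompareTag("goal"))
        {
            Debug.Log("Agent reached the goal.");
        }
    }
}
```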
* `learn.py` is now the main script for training brains.
* Simultaneous multi-brain training is now possible.
* `ghost-trainer` allows for proper training in adversarial scenarios.
* `imitation-trainer` provides a basic implementation of real-time behavioral cloning.
* All trainer hyperparameters now exist in `.yaml` files (see the example config after this list).
* `PPO.ipynb` removed.
* LSTM model added.
* More dynamic buffer class to handle a greater variety of scenarios.
* Add support for stacking the past n states so the network can learn temporal dependencies (see the stacking sketch after this list).
* Add Banana Collector environment for demonstrating partially observable multi-agent environments.
* Add 3DBall Hard, which lacks velocity information in its state representation; used as a test for the LSTM and state-stacking features.
* Rework Tennis environment to be continuous control and trainable in 100k steps.
* Fix Basic environment to properly reflect the number of states.
* Fix discrete states when using stacked states.
* Add trained model for Basic environment.
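To illustrate the `.yaml` hyperparameter change, here is a rough sketch of what a trainer configuration might look like; the section and key names below are illustrative, not the canonical schema:

```yaml
# Hypothetical trainer config; keys and values are illustrative.
default:
  trainer: ppo
  batch_size: 1024
  buffer_size: 10240
  learning_rate: 3.0e-4
  max_steps: 5.0e4

Ball3DBrain:            # per-brain overrides enable multi-brain training
  max_steps: 1.0e5
  use_recurrent: true   # opt in to the LSTM model
  sequence_length: 32
```

Per-brain sections like this are what make simultaneous multi-brain training configurable from a single file.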
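The state-stacking item can be pictured as a fixed-length history buffer whose contents are concatenated into one input vector. A self-contained sketch of the idea; the class name and flat layout are assumptions, not the actual implementation:

```csharp
using System.Collections.Generic;
using System.Linq;

// Hypothetical stacker: keeps the past n observation vectors and
// concatenates them so the network can learn temporal dependencies.
public class ObservationStacker
{
    readonly Queue<float[]> history = new Queue<float[]>();

    public ObservationStacker(int numStacked, int obsSize)
    {
        // Pre-fill with zero vectors so the stacked size is constant
        // from the very first step onward.
        for (int i = 0; i < numStacked; i++)
            history.Enqueue(new float[obsSize]);
    }

    public float[] Push(float[] observation)
    {
        history.Dequeue();               // drop the oldest state
        history.Enqueue(observation);    // append the newest state
        // Flatten oldest-to-newest into a single input vector.
        return history.SelectMany(v => v).ToArray();
    }
}
```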
* On Demand Decision: use RequestDecision and RequestAction (see the sketch after this list)
* New Agent Inspector: use it to set On Demand Decision
* New BrainParameters interface
* LSTM memory size is now set in Python
* New C# API
* Semantic Changes
* Replaced RunMDP
* New Bouncer Environment to test On Demand Decision
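A rough sketch of how on-demand decisions are driven from agent code; the event logic is hypothetical, and the class assumes the ML-Agents Agent base class:

```csharp
using UnityEngine;

// Hypothetical agent using On Demand Decision (enabled in the Agent Inspector).
public class BouncerAgent : Agent
{
    bool touchedGround;   // hypothetical game-state flag

    void FixedUpdate()
    {
        if (touchedGround)
        {
            // Ask the brain for a fresh decision only at this event,
            // instead of on every academy step.
            RequestDecision();
            touchedGround = false;
        }
        else
        {
            // Between decisions, repeat the last received action.
            RequestAction();
        }
    }

    void OnCollisionEnter(Collision collision)
    {
        touchedGround = true;   // hypothetical trigger for a new decision
    }
}
```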
* Add config for crawler, and change crawler scene
* Changed number of crawlers in scene to 12
* Changed Max-steps for crawlers to 5000
* Newer hyperparameters and newly trained crawler model
* Clean up crawler code, and improve efficiency
Added several class- and method-level comments that are compatible with Doxygen for auto-generation of documentation, along with some stylistic and minor code changes (summarized below).
Stylistic changes:
- Modified comments to /// style instead of /** */ (see the sketch after this section)
- Removed unnecessary imports
- Limited code to 80 characters per line
Code changes:
- Change SetTextObs to accept a string, not an object
- Renamed all methods that have "state" semantics to "info" semantics
- Renamed _InitializeAgent to OnEnableHelper
- Removed _DisableAgent, folding it into OnDisable
- Renamed StoredVectorActions to storedVectorActions, and similarly for StoredTextActions
- Changed internal methods to protected, since that's the desired behavior
- Renamed _info to info and _action to action, since they're already private
These refactorings affected CoreBrainInternal, ExternalCommunicator, and MLAgentsEditModeTest.
Performed minor improvements to Ball3DAgent and AgentEditor (re...
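For reference, the /// style above is the XML-documentation form that Doxygen also parses. A small sketch using the SetTextObs change from the list; the enclosing class and method body are illustrative:

```csharp
public partial class Agent
{
    string textObservation;   // illustrative backing field

    /// <summary>
    /// Sets the text observation sent with the agent's info.
    /// </summary>
    /// <param name="textObservation">The observation, now passed as a
    /// string rather than an object (per the signature change above).</param>
    public void SetTextObs(string textObservation)
    {
        this.textObservation = textObservation;
    }
}
```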
* Fixes internal brain for Banana Imitation.
* Fixes Discrete Control training for Imitation Learning.
* Fixes Visual Observations in internal brain with non-square inputs.
* [CoreBrain] Bug fix in the internal brain: discrete vector observations did not have the right size
* [Docs] Removed all references to the unitypackages other than the TensorFlowSharp.unitypackage
* [Basic] Updated the bytes file of Basic
* [Docs] Addressed comments
* [Docs] Re-addressed the comments
* [Bug Fix] Scaling the visual input between 0 and 1 (see the sketch after this list)
* [Comments] Added comments to the BatchVisualObservations method of the CoreInternalBrain.
* [Renaming] Renamed BlackAndWhite to blackAndWhite
* [containers] Enables container support for scenes that use visual observations
* [Initial Commit] Works only with simple balance ball
* [Optimization] Store the academy in the brainBatcher as a temporary measure
* [Modifications] Made it work from the editor as a prototype
* Made the socket communicator and reimplemented all functionality
* [Meta files] Removed stray .meta files
* [Comments] Removed dead code
* [Comments] Added some descriptions
* [Bug Fix] Multi brain scenario
* Improved the AgentInfo converter
* [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object
* [Timeout] Implemented a timeout for the rpc communicator in Unity
* [Libraries] Added the C# Protobuf and Grpc libraries
* [Requirements] Added protobuf 3.5.2 to the requirements
* [Code Formatting] Removed dead code and split some lines
...
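The visual-input scaling fix boils down to normalizing 8-bit pixel channels into floats in [0, 1]. A minimal sketch of the idea; the helper name and flat RGB layout are assumptions, not the actual BatchVisualObservations code:

```csharp
using UnityEngine;

public static class VisualObservationUtil
{
    /// Converts a texture to a flat RGB float array, scaling each
    /// 8-bit channel from [0, 255] down to [0, 1].
    public static float[] ToNormalizedFloats(Texture2D texture)
    {
        Color32[] pixels = texture.GetPixels32();
        var result = new float[pixels.Length * 3];
        for (int i = 0; i < pixels.Length; i++)
        {
            result[3 * i]     = pixels[i].r / 255.0f;
            result[3 * i + 1] = pixels[i].g / 255.0f;
            result[3 * i + 2] = pixels[i].b / 255.0f;
        }
        return result;
    }
}
```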
* [Initial Commit] Modified the model.py file and the ppo/trainer.py file to use masked actions
* Preliminary modifications to the python side of the code to enable action masking
* Preliminary modifications to the C# side of the code to enable action masking
* Preliminary modifications to the communication side of the code to enable action masking
* Implemented action masking for BC (note: the teacher's actions are not masked)
* Added more error messages for action masking
* Fixed pytests
* Added Documentation
* Addressed comment
* Addressed comments on docs
* Addressed second comment on docs
* Addressed comments for the python side of the code
* Created the action masker and associated unit tests (see the sketch after this list)
* Addressed comments on the C# side
* Addressed the comment regarding action_masking_name
* Addressed the comments
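To make the masking entries concrete, here is a hypothetical, much-simplified action masker for discrete branches; the class, method names, and storage below are illustrative, not the actual API:

```csharp
using System;
using System.Collections.Generic;

/// Hypothetical, simplified action masker for discrete action branches.
public class ActionMasker
{
    readonly int[] branchSizes;   // number of actions per branch
    readonly bool[][] masked;     // true = action is unavailable

    public ActionMasker(int[] branchSizes)
    {
        this.branchSizes = branchSizes;
        masked = new bool[branchSizes.Length][];
        for (int b = 0; b < branchSizes.Length; b++)
            masked[b] = new bool[branchSizes[b]];
    }

    /// Marks the given action indices of a branch as unavailable,
    /// raising an error for out-of-range indices (one of the places
    /// where the extra error messages mentioned above would live).
    public void SetActionMask(int branch, IEnumerable<int> actionIndices)
    {
        foreach (int a in actionIndices)
        {
            if (a < 0 || a >= branchSizes[branch])
                throw new ArgumentOutOfRangeException(nameof(actionIndices));
            masked[branch][a] = true;
        }
    }

    /// Returns the mask for one branch; a trainer would use it to zero
    /// the probability of masked actions before sampling.
    public bool[] GetMask(int branch) => masked[branch];
}
```

As noted in the list, teacher actions in BC bypass this mask; only the student's sampled actions are constrained.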