Added a Handshake method for the communicator
The academy will try to handshake regardless of the brains present
Player and Heuristic brains will send their information through the communicator but will not receive commands
* added broadcast to the player and heuristic brains.
This allows the Python API to record the actions taken along with the states and rewards
* removed the broadcast checkbox
* bug fix: the environment only requests actions from external brains when they are unique
* added a warning in case no brains are set to external
* fixed the instantiation of coreBrains
* fixed the conversion of actions to arrays in the BrainInfo received from step
* default discrete action is now 0
* bug fix for the discrete broadcast action (the action size should be one in Agents.cs)
* modified Tennis so that the default action is no action
* modified TemplateDecision.cs to ensure non-null values are returned from Decide() and MakeMemory() (see the sketch below)
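A minimal sketch of the non-null rule, assuming the Decision interface of this era; the exact Decide/MakeMemory signatures here are an assumption and may differ between releases:

```csharp
using System.Collections.Generic;
using UnityEngine;

public class TemplateDecision : MonoBehaviour, Decision
{
    public float[] Decide(List<float> state, List<Camera> observation,
                          float reward, bool done, float[] memory)
    {
        // Return an empty array rather than null so the conversion of
        // actions to arrays in BrainInfo cannot fail downstream.
        return new float[0];
    }

    public float[] MakeMemory(List<float> state, List<Camera> observation,
                              float reward, bool done, float[] memory)
    {
        // Same rule for memories: non-null, possibly empty.
        return new float[0];
    }
}
```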
* minor fixes
* need to convert the s...
* More efficiently allocate memory when sending states
* Code clean-up
* Additional changes
* More GC reduction
* Remove state list initialization from example environments
* Use built-in json tool to serialize state message
* Remove commented code
* Use the more efficient CompareTag (see the example after this list)
* Moved comments before the code they describe
* Use type inference where appropriate
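The CompareTag change above is a standard Unity allocation fix: reading `gameObject.tag` allocates a new managed string on every access, while `CompareTag` does the comparison natively. A small illustrative example (the tag name and class are made up):

```csharp
using UnityEngine;

public class GoalDetector : MonoBehaviour
{
    void OnTriggerEnter(Collider other)
    {
        // Allocation-free, and logs an error if the tag is misspelled:
        if (other.CompareTag("goal"))
        {
            Debug.Log("Reached the goal");
        }
        // Avoided: `other.tag == "goal"` allocates a string every call,
        // which adds GC pressure in per-step code paths.
    }
}
```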
This should help developers figure out faster whether errors are happening on the Unity side.
Digging through Player.log or using a development build could be replaced by this feature.
* `learn.py` is now main script for training brains.
* Simultaneous multi-brain training is now possible.
* `ghost-trainer` allows for proper training in adversarial scenarios.
* `imitation-trainer` provides a basic implementation of real-time behavioral cloning.
* All trainer hyperparameters now exist in `.yaml` files.
* `PPO.ipynb` removed.
* LSTM model added.
* More dynamic buffer class to handle greater variety of scenarios.
* Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag.
* Add `maxStepReached` flag to Agents and Academy.
* Change the way value bootstrapping works in PPO to take advantage of timeouts (see the sketch after this list).
* Default size of GridWorld changed to 5x5 in order to validate the bootstrapping changes.
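The bootstrapping change itself lives in the Python PPO trainer, but the rule is simple enough to sketch; this C# rendering is purely illustrative (all names are made up): an episode cut off by `maxStepReached` is a timeout, not a true terminal state, so the return target still bootstraps from the next state's value estimate.

```csharp
// Illustrative only; the real logic is in the Python trainer.
public static class ValueTarget
{
    public static float Compute(float reward, float gamma, bool done,
                                bool maxStepReached, float nextValueEstimate)
    {
        // A true terminal ends the episode for environment reasons;
        // a timeout merely stops collecting, so future value still exists.
        bool trueTerminal = done && !maxStepReached;
        return trueTerminal
            ? reward                               // no value beyond a real terminal
            : reward + gamma * nextValueEstimate;  // bootstrap across timeouts
    }
}
```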
* On Demand Decision: use RequestDecision and RequestAction (see the example after this list)
* New Agent Inspector: use it to set On Demand Decision
* New BrainParameters interface
* LSTM memory size is now set in Python
* New C# API
* Semantic Changes
* Replaced RunMDP
* New Bouncer Environment to test On Demand Decision
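A minimal sketch of On Demand Decision in an Agent subclass; RequestDecision and RequestAction are named in the notes above, while the class name and the choice of callbacks here are illustrative:

```csharp
using UnityEngine;

public class BouncerLikeAgent : Agent
{
    void OnCollisionEnter(Collision collision)
    {
        // Ask the brain for a fresh decision only when something
        // meaningful happens, instead of on every academy step.
        RequestDecision();
    }

    void FixedUpdate()
    {
        // Between decisions, keep re-applying the last decided action.
        RequestAction();
    }
}
```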
* [Previous Text Actions] Renamed `previous_action` to `previous_vector_action`
Added `previous_text_action` to the BrainInfo
* [Semantics] Carried the changes to the semantics of `previous_vector_action` over to the trainers
Added several class- and method-level comments that are compatible with Doxygen for auto-generation of documentation, along with some stylistic and minor code changes (summarized below).
Stylistic changes:
- Modified comments to the /// style instead of /** */ (see the example after this list)
- Removed unnecessary imports
- Removed unnecessary “private” declarations
- Limited code to 80 characters per line
- Re-organized variables to group those that are visible in Inspector (they are now at the top)
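For instance, a documentation block in the /// style is picked up both by Doxygen and by the IDE; the class and method below are illustrative, not the actual Academy.cs code:

```csharp
public class EnvironmentConfigurationExample
{
    /// <summary>
    /// Applies the given quality level to the engine.
    /// </summary>
    /// <param name="qualityLevel">Index into Unity's quality settings.</param>
    public void ConfigureEnvironment(int qualityLevel)
    {
        UnityEngine.QualitySettings.SetQualityLevel(qualityLevel, true);
    }
}
```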
Code changes:
- Renamed ScreenConfiguration to EnvironmentConfiguration (variable only used within Academy.cs, thus no other files needed modification)
- Renamed ConfigureEngine to ConfigureEnvironment and created a ConfigureEnvironmentHelper method
- Renamed _isCurrentlyInference to modeSwitched to signify when the engine config needs to be changed
- Added isCommunicatorOn flag to be explicit about the existence of a communicator
- Made isInference private which requ...
Added several class- and method-level comments that are compatible with Doxygen for auto-generation of documentation, along with some stylistic and minor code changes (summarized below).
Stylistic changes:
- Modified comments to /// style instead of /** */
- Removed unnecessary imports
- Limited code to 80 characters per line
Code changes:
- Changed SetTextObs to accept a string, not an object (see the sketch after this list)
- Renamed all methods that have “state” semantics to “info” semantics
- Renamed _InitializeAgent as OnEnableHelper
- Removed _DisableAgent, foldered into OnDisable
- Renamed StoredVectorActions to storedVectorActions, similarly for StoredTextActions
- Changed internal methods to protected since that's the desired behavior
- Renamed _info to info and _action to action since they’re already private
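A sketch of the string-based SetTextObs call, assuming an Agent subclass; the name of the observation-collection callback changed across releases, so the override below is an assumption:

```csharp
public class TextObservingAgent : Agent
{
    public override void CollectObservations()
    {
        // SetTextObs now takes a string rather than a generic object.
        SetTextObs("agent at " + transform.position);
    }
}
```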
These refactorings impacted CoreBrainInternal, ExternalCommunicator, and MLAgentsEditModeTest.
Performed minor improvements to Ball3DAgent and AgentEditor (re...