ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	e36b8bf0	added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	dcf58f75	Feature/previous text action (#375 ) * [Previous Text Actions] Renamed previous_action to previous_vector_action added previous_text_action to the BrainInfo * [Semantics] Carried the modifications to the semantics of previous_vector_action to the trainers	7 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
Arthur Juliani	1bfbf67a	Simplify approach	6 年前
GitHub	ded0d8c7	Develop action masking (#1080 ) * [Initial Commit] Modified the model.py file and the ppo/trainer.py file to use masked actions * Preliminary modifications to the python side of the code to enable action masking * Preliminary modifications to the C# side of the code to enable action masking * Preliminary modifications to the communication side of the code to enable action masking * Implemented action masking for BC Note : The actions of the teacher are not masked * More error messages for the action masking * fix pytests * Added Documentation * Address comment * Addressed Comments on docs * Addressed second comment on docs * Addressed comments for the python side of the code * Created the action masker and associated unit tests * Addressed comments on the C# side * Addressed the comment regarding action_masking_name * Addressed the comments	6 年前
Deric Pang	cdb41480	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
GitHub	c4fa3893	Add file check & reuse protobuf conversion functions (#1316 )	6 年前
Vincent-Pierre BERGES	bc636075	API for sending custom protobuf messages to and from Unity. (#1595 ) * API for sending custom protobuf messages to and from Unity. * Rename custom_output to custom_outputs. * Move custom protos to their own files. * Add SetCustomOutput method. * Add docstrings. * Various adjustments. * Rename CustomParameters -> CustomResetParameters * Rename CustomOutput -> CUstomObservation * Add CustomAction * Add CustomActionResult * Remove custom action result. * Remove custom action result from Python API * Start new documentation. * Add some docstrings * Expand documentation. * Typos * Tweak doc. Also eliminate GetCustomObservation. * Fix typo. * Clarify docs. * Remove trailing whitspace	6 年前
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751 ) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 年前
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black ). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 年前
GitHub	2671e1a0	Enable mypy in precommit checks (#2177 ) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md	5 年前
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209 ) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 年前
GitHub	e1d93a0e	Allow mypy to reject incomplete defs for mlagents-envs (#2585 ) This wasn't working before because of several remaining partially defined function definitions.	5 年前
GitHub	43c13a35	image decompress timer (#2596 ) * image decompress timer * dont lazy load the image	5 年前
GitHub	2f74b3cc	Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646 )	5 年前
GitHub	68965c7b	Use a class for camera res, not dict (#2656 )	5 年前
GitHub	b95c4d1d	check for unecessary list comprehensions (#2707 )	5 年前
GitHub	af3ed33f	Speed up processing large vector observations (#2717 ) * faster NaN checks for large float obs * add unit test, remove is_nan check on obs * add comment * rename var	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前
GitHub	0fe5adc2	Develop remove memories (#2795 ) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 年前
GitHub	c6c01a03	Enable pylint and fix a few things (#2767 ) * enable pylint, disable some messages and fix a few * SAC memories in init	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	e6f549dc	[MLA-12] update protobuf for vector observations (#2862 )	5 年前
Ervin Teng	6420a03b	Make max step reached always false	5 年前
GitHub	69d1a033	Develop remove past action communication (#2913 ) * Modifying the .proto files * attempt 1 at refactoring Python * works for ppo hallway * changing the documentation * now works with both sac and ppo both training and inference * Ned to fix the tests * TODOs : - Fix the demonstration recorder - Fix the demonstration loader - verify the intrinsic reward signals work - Fix the tests on Python - Fix the C# tests * Regenerating the protos * fix proto typo * protos and modifying the C# demo recorder * modified the demo loader * Demos are loading * IMPORTANT : THESE ARE THE FILES USED FOR CONVERSION FROM OLD TO NEW FORMAT * Modified all the demo files * Fixing all the tests * fixing ci * addressing comments * removing reference to memories in the ll-api	5 年前
Ervin Teng	54644477	Merge branch 'develop' of github.com:Unity-Technologies/ml-agents into develop-nomaxstep-test	5 年前
GitHub	ece9733c	add timers and fix filehandle leak (#2989 )	5 年前
GitHub	652488d9	check for numpy float64 (#2948 )	5 年前
GitHub	a6df9f43	Develop new ll api (#3022 ) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 年前
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049 )	5 年前
Chris Elion	fdc810ff	move (first pass)	5 年前
GitHub	58b6c7c2	Rename mlagents.envs to mlagents_envs (#3083 )	5 年前
GitHub	a488299f	[MLA-345] float visual observations (#3148 ) * pass shape to WriteAdapter * handle floats on python side * cleanup * whitespace * rename GetFloatObservationShape, support uncompressed in RenderTexture sensor * numpy float32 * remove unused using * Float sensor and unit test * replace asserts with exceptions, docstrings	5 年前
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207 )	5 年前
Anupam Bhatnagar	f4dbedcf	removed extraneous logging imports and loggers	5 年前

41 次代码提交 (41d32aca-f829-4abe-a765-c997eed537b2)