ml-agents

Author	SHA1	Message	Commit date
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 years ago
Jonathan Harper	c2cd5a87	Add custom reset parameters to subprocess env manager This mirrors functionality already found in UnityEnvironment	5 years ago
GitHub	84d9d622	python timers (#2180) * Timer proof-of-concept * micro optimizations * add some timers * cleanup, add asserts * Cleanup (no start/end methods) and handle exceptions * unit test and decorator * move output code, add a decorator * cleanup * module docstring * actually write the timings when done with training * use __qualname__ instead * add a few more timers * fix mock import * fix unit test * don't need fwd reference * cleanup root * always write timers, add comments * undo accidental change	5 years ago
GitHub	d415528a	fix subprocess test and style checks on develop (#2248) * fix tests that broke with new arg * fix black	5 years ago
GitHub	a802d0d7	Make SubprocessEnvManager take asynchronous steps (#2265) SubprocessEnvManager takes steps synchronously to reproduce old behavior, meaning all parallel environments will need to wait for the slowest environment to take a step. If some steps take much longer than others, this can lead to a substantial overall slowdown in practice. We've seen extreme cases where we see almost a 2x speedup from using asynchronous stepping, with no downside for our faster environments. (Bouncer 16% improvement, Walker 14% improvement in tests). This PR changes the SubprocessEnvManager to use async stepping. This means on the "step" call the environment manager will enqueue step requests to workers, and then only wait until at least one step has been completed before returning.	5 years ago
GitHub	f82f0f37	Get timers from subprocess (#2268) * Timer proof-of-concept * micro optimizations * add some timers * cleanup, add asserts * Cleanup (no start/end methods) and handle exceptions * unit test and decorator * move output code, add a decorator * cleanup * module docstring * actually write the timings when done with training * use __qualname__ instead * add a few more timers * fix mock import * fix unit test * get timers from worker process (WIP) * clean up timer merging * typo * WIP * cleanup merging code * bad merge * undo accidental change * remove reset command * fix style * fix unit tests * fix unit tests (they got overwrote in merge) * get timer root though a function * timer around communicate	5 years ago
GitHub	83875376	Add "gauges" to timer system (#2329) * WIP still needs tests and merging from multiprocess * cleanup gauges * add TODO for subprocesses	5 years ago
Jonathan Harper	98297be9	Fix training not quitting when play button is unchecked (#2376) This fixes an issue where stopping the game when training in the Editor won't end training, due to the new asynchronous SubprocessEnvManager changes. Another minor change was made to move the `env_manager.close()` in TrainerController to the end of `start_learning` so that we are more likely to save the model if something goes wrong during the environment shutdown (this occurs sometimes on Windows machines).	5 years ago
GitHub	c7f0ed04	Merge pull request #2381 from Unity-Technologies/release-0.9.0	5 years ago
GitHub	4abe89bc	Only call get_action on brains with policies (#2437)	5 years ago
Jonathan Harper	2f083c8a	Renamed "StepInfo" to "EnvironmentStep" This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.	5 years ago
GitHub	babe9e2f	Develop remove academy done (#2519) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 years ago
GitHub	67d754c5	Fix flake8 import warnings (#2584) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 years ago
GitHub	e1d93a0e	Allow mypy to reject incomplete defs for mlagents-envs (#2585) This wasn't working before because of several remaining partially defined function definitions.	5 years ago
GitHub	30042ef7	fix hang with multiple envs (#2600)	5 years ago
GitHub	89b1c7a8	Better environment shutdown (#2620) * Wait for env process to exit before killing it * don't propagate signals, better error logging * set proc1 to None when done * comments	5 years ago
GitHub	b95c4d1d	check for unecessary list comprehensions (#2707)	5 years ago
GitHub	0fe5adc2	Develop remove memories (#2795) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 years ago
Jonathan Harper	bae94a76	Add timeout for communicator exchange When we initially connect to the environment using RPCCommunicator, the connection is polled so we don't hang forever on `.recv()` when the environment wasn't launched or failed. However we don't currently have any similar check for the exchanges mid-training-run. This change applies the same timeout from initialization to each exchange, and extends the default `timeout_wait` to 60 seconds to generally improve the chances we won't have a mismatch between environment launch time and the trainer timeout. Tested on: single-env and multi-env cases. Killed 1 environment process manually and saw that the model was saved appropriately and all processes closed.	5 years ago
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 years ago
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 years ago
GitHub	a6df9f43	Develop new ll api (#3022) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 years ago
GitHub	36048cb6	Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. * Moving Env Manager to Trainers * fix pylint madness	5 years ago
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049)	5 years ago
GitHub	90db165f	Add --namespace-packages to mypy for mlagents (#3075)	5 years ago
Chris Elion	fdc810ff	move (first pass)	5 years ago
GitHub	58b6c7c2	Rename mlagents.envs to mlagents_envs (#3083)	5 years ago
GitHub	4c241a80	Only send previous action and current BrainInfo (#3187) This PR makes it so that the env_manager only sends one current BrainInfo and the previous actions (if any) to the AgentManager. The list of agents was added to the ActionInfo and used appropriately.	5 years ago
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207)	5 years ago
GitHub	ca96b293	Move advance() logic for environment manager out of trainer_controller (#3234) This PR moves the AgentManagers from the TrainerController into the env_manager. This way, the TrainerController only needs to create the components (Trainers, AgentManagers) and call advance() on the EnvManager and the Trainers.	5 years ago
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 years ago
GitHub	11c518a3	Stats SideChannel (for custom TensorBoard metrics) (#3660)	5 years ago
GitHub	807a1441	Raise exceptions from environment subprocesses (#3680) This commit surfaces exceptions from environment worker subprocesses, and changes the SubprocessEnvManager to raise those exceptions when caught. Additionally TrainerController was changed to treat environment exceptions differently than KeyboardInterrupts. We now raise the environment exceptions after exporting the model, so that ML-Agents will correctly exit with a non-zero return code.	5 years ago
GitHub	4ecd6ad3	Fix how we set logging levels (#3703) * cleanup logging * comments and cleanup * pylint, gym	5 years ago
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 years ago
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 years ago
GitHub	e57144f9	[bug-fix] Set logging level in subprocesses (#3874)	5 years ago
GitHub	ebe12502	[bug-fix] Fix exception thrown when quitting in-editor training from editor (#3885)	5 years ago
GitHub	422247a0	update versions for patch release (#3970) * update versions for patch releae * Update precommit flake8 (#3961) * fix changelog	5 years ago
GitHub	c6ed3789	Replaced get_behavior_names and get_behavior_spec with behavior_specs property (#3946) * Replaced get_behavior_names and get_behavior_spec with behavior_specs property * Fixing the test * [ci] * addressing some comments * use typing.Mapping (#3948) * Update ml-agents-envs/mlagents_envs/base_env.py Co-authored-by: Chris Elion <chris.elion@unity3d.com> * Adding the documentation Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 years ago
GitHub	e274bcf6	Update precommit flake8 (#3961) * fix flake8 errors * update flake8 hook * update flake8 plugins	5 years ago
Andrew Cohen	fe0a077e	passing sampler configs to c#	4 years ago
Andrew Cohen	4464ca46	ignoring commit checks	4 years ago
Andrew Cohen	5ffd9761	type checks for parameter randomization settings/enforces float encoding	4 years ago
Andrew Cohen	e5c07272	using to_float for encoding	4 years ago
Andrew Cohen	953f4e09	from set_sampler_params => set_{samplertype}_params	4 years ago
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065)	4 years ago
GitHub	8c2ade77	Separate send environment data from reset (#4128)	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138)	4 years ago
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 years ago
Scott Jordan	d695c044	initial addition of active learning (incomplete)	4 years ago
Scott Jordan	56745026	Initial commit of running active learning code Active learning code is running on walker variable speed. Needs to be tested to see if it is working.	4 years ago
GitHub	b853e5ba	Action buffer (#4612) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	3c96a3a2	Action Model (#4580) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704)	4 years ago
vincentpierre	b863af57	Removing TensorFlow Trainers	4 years ago
GitHub	a4c9f58e	Fix SubprocessEnvManager hanging on unexpected exceptions. (#4699) * Add shutdown sentinel value to subprocess_env_manager. * Add Sanity Check for Zombie Workers	4 years ago
Andrew Cohen	d624b54b	Merge branch 'master' into fix-conflict-base-env	4 years ago
Andrew Cohen	bd917c9c	action buffer passes continuous	4 years ago
Andrew Cohen	4ebc6c44	ml-agents-envs pass	4 years ago
Andrew Cohen	6ffbf209	fix imports in test utils	4 years ago
vincentpierre	3bbd61e4	[Bug Fix] Fix crash if spawn is delayed in multi-env	4 years ago
GitHub	2af86534	[MLA-1712] Make UnityEnvironment fail fast if the env crashes (#4880)	4 years ago
GitHub	d8835857	[MLA-1540] Training Analytics (#4780)	4 years ago
GitHub	2e19759c	Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) * turning some logger.info into logger.debug and remove some logging overhead when not using debug * Addressing comments * Adding to changelog	4 years ago
vincentpierre	e3b67e9f	Remove some dependencies of the trainers on UnityEnvironment (and use BaseEnv instead)	3 years ago


67 commits (d20bda06-1db5-4fb7-8ae7-9dffd80204cf)