ml-agents

作者	SHA1	备注	提交日期
GitHub	517e3a0a	Remove env creation logic from TrainerController (#1562 ) * Remove env creation logic from TrainerController Currently TrainerController includes logic related to creating the UnityEnvironment, which causes poor separation of concerns between the learn.py application script, TrainerController and UnityEnvironment: * TrainerController must know about the proper way to instantiate the UnityEnvironment, which may differ from application to application. This also makes mocking or subclassing UnityEnvironment more difficult. * Many arguments are passed by learn.py to TrainerController and passed along to UnityEnvironment. This change moves environment construction logic into learn.py, as part of the greater refactor to separate trainer logic from actor / environment.	6 年前
eshvk	cc9bdf17	Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return	6 年前
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751 ) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 年前
Jonathan Harper	7a0d1531	Fix subprocess model saving on Windows On Windows the interrupt for subprocesses works in a different way from OSX/Linux. The result is that child subprocesses and their pipes may close while the parent process is still running during a keyboard (ctrl+C) interrupt. To handle this, this change adds handling for EOFError and BrokenPipeError exceptions when interacting with subprocess environments. Additional management is also added to be sure when using parallel runs using the "num-runs" option that the threads for each run are joined and KeyboardInterrupts are handled. These changes made the "_win_handler" we used to specially manage interrupts on Windows unnecessary, so they have been removed.	6 年前
Jonathan Harper	e91e847c	Fix '--slow' flag after environment updates A change was made to the way the "train_mode" flag was used by environments when SubprocessUnityEnvironment was added which was intended to be part of a separate change set. This broke the CLI '--slow' flag. This change undoes those changes, so that the slow / fast simulation option works correctly. As a minor additional change, the remaining tests from top level 'tests' folders have been moved into the new test folders.	6 年前
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black ). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 年前
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209 ) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 年前
GitHub	966d8efb	Remove "external_brains" arg for TrainerController (#2213 ) TrainerController depended on an external_brains dictionary with brain params in its constructor but only used it in a single function call. The same function call (start_learning) takes the environment as an argument, which is the source of the external_brains. This change removes the dependency of TrainerController on external brains and removes the two class members related to external_brains and retrieves the brains directly from the environment.	5 年前
Ervin T	a46f3faa	Enable generalization training (#2232 ) * Add Sampler and SamplerManager * Enable resampling of reset parameters during training * Documentation for Sampler and example YAML configuration file	5 年前
GitHub	a9fe719c	Add Multi-GPU implementation for PPO (#2288 ) Add MultiGpuPPOPolicy class and command line options to run multi-GPU training	5 年前
GitHub	30930383	Move trainer initialization into a utility function (#2412 ) This change moves trainer initialization outside of TrainerController, reducing some of the constructor arguments of TrainerController and setting up the ability for trainers to be initialized in the case where a TrainerController isn't needed.	5 年前
GitHub	67d754c5	Fix flake8 import warnings (#2584 ) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 年前
GitHub	0d48a352	Use argparse for arg parsing (#2586 ) * encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow	5 年前
GitHub	d64a01e1	Added option to use environment arguments in learn (#2594 ) * Added option to use environment arguments in learn * hook into argparse * add example to readme	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
Jonathan Harper	9f166f9e	Update tests to support pytest 5.x Our tests were using pytest fixtures by actually calling the fixture methods, but in newer 5.x versions of pytest this causes test failures. The recommended method for using fixtures is dependency injection. This change updates the relevant test fixtures to either not use `pytest.fixture` or to use dependency injection to pass the fixture. The version range requirements in `test_requirements.txt` were also updated accordingly.	5 年前
Jonathan Harper	481e0842	Remove the --num-runs option The "num-runs" command-line option provides the ability to run multiple identically-configured training runs in separate processes by running mlagents-learn only once. This is a rarely used ML-Agents feature, but it adds complexity to other parts of the system by adding the need to support multiprocessing and managing of ports for the parallel training runs. It also doesn't provide truly reproducible experiments, since there is no guarantee of resource isolation between the trials. This commit removes the --num-runs option, with the idea that users will manage parallel or sequential runs of the same experiment themselves in the future.	5 年前
GitHub	b0a2a54f	Add 'run-experiment' script, simpler curriculum config (#3186 ) This change adds a new 'mlagents-run-experiment' endpoint which accepts a single YAML/JSON file providing all of the information that mlagents-learn accepts via command-line arguments and file inputs. As part of this change the curriculum configuration is simplified to accept only a single file for all the curricula in an environment rather than a file for each behavior.	5 年前
Ervin Teng	00017bab	Temporarily remove multi-GPU	5 年前
Ervin Teng	c68b5643	Remove multi_gpu from learn test	5 年前
Alphonso Crawford	802593a2	Adding test for bad env_path on create_environment_factory	5 年前
Alphonso Crawford	26d44958	Update test_bad_env_path	5 年前
Alphonso Crawford	1a7f9ad0	change test_learn.py	5 年前
GitHub	c145e75b	Split Policy and Optimizer, common Policy for PPO and SAC (#3345 )	5 年前
GitHub	6709a9bf	[change] Clean up trainer interface, clean up GhostTrainer stats (#3634 )	5 年前
GitHub	458e68f1	Remove "docker target" feature (#3687 ) The "docker target" feature and associated command-line flag --docker-target-name were created for use with the now-deprecated Docker setup. This feature redirects the paths used by learn.py for the environment and config files to be based from a directory other than the current working directory. Additionally it wrapped the environment execution with xvfb-run. This commit removes the "docker target" feature because: * Renaming the paths doesn't fix any problem. Absolute paths can already be passed for configs and environment executables. * Use of xserver, Xvfb, or xvfb-run are independent of mlagents-learn and can be used outside of the mlagents-learn call. Further, xvfb-run is not the only solution for software rendering.	5 年前
GitHub	bc1fdf07	[refactor] CLI changes (#3705 )	5 年前
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710 )	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前
GitHub	f86fc81d	[refactor] Move configuration files to single YAML file (#3791 )	5 年前
GitHub	7e0032f5	[refactor] Allow full RunOptions to be specified in trainer configuration YAML (#3815 )	5 年前
GitHub	812983c0	Some improvements to the UnityEnvironment class (#3939 ) * Fix typo * Made a side channel utils to reduce the complexity of UnityEnvironment * Added a get_side_channel_dict utils method * Better executable launcher (unarguably) * Fixing the broken test * Addressing comments * [skip ci] Update ml-agents-envs/mlagents_envs/side_channel/side_channel_manager.py Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com> * No catch all Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	5 年前
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936 )	4 年前
GitHub	f5435876	[refactor] Store and restore state along with checkpoints (#4025 )	4 年前
GitHub	09853e13	[refactor] Move checkpoint saving into trainer (#4034 )	4 年前
GitHub	09c7787c	[bug-fix] Fix regression in --initialize-from feature (#4086 )	4 年前
Andrew Cohen	f76780f1	fix tests	4 年前
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065 )	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	c188781b	[life improvement] Moving Python files around (#4531 ) * Moved components to the tf folder and moved the TrainerFactory to the `trainer` folder * Addressing comments * Editing the migrating doc * fixing test	4 年前
GitHub	8a40c58a	Added SUM as aggregation type for custom statistics (#4816 )	4 年前
GitHub	4c776283	Fix --results-dir (#5269 )	4 年前

43 次代码提交 (cd46c9c2-6692-44ed-ba47-4373c2963f36)