ml-agents

Author	SHA1	Message	Commit date
GitHub	0c417c55	Release v0.5 (#1202)	6 years ago
Arthur Juliani	4dce8f6a	Sleep three seconds between session launches	6 years ago
Arthur Juliani	fa3bee21	Use queue to check for environment launch	6 years ago
Arthur Juliani	e07bfab2	Address comments	6 years ago
GitHub	cc083fd8	fixed the windows ctrl-c bug (#1558) * Documentation tweaks and updates (#1479) * Add blurb about using the --load flag in the intro guide, and typo fix. * Add section in tutorial to create multiple area learning environment. * Add mention of Done() method in agent design * fixed the windows ctrl-c bug * fixed typo * removed some uncessary printing * nothing * make the import of the win api conditional * removved the duplicate code * added the ability to use python debugger on ml-agents * added newline at the end, changed the import to be complete path * changed the info.log into policy.export_model, changed the sys.platform to use startswith * fixed a bug * remove the printing of the path * tweaked the info message to notify the user about the expected error message * removed some logging according to comments * removed the sys import * Revert "Documentation tweaks and updates (#1479)" This reverts commit 84ef07a4525fa8a89f4...	6 years ago
GitHub	3523f9be	Only using multiprocess when --num-runs>1 (#1583) Fixes the bug of the models not being saved in docker	6 years ago
GitHub	517e3a0a	Remove env creation logic from TrainerController (#1562) * Remove env creation logic from TrainerController Currently TrainerController includes logic related to creating the UnityEnvironment, which causes poor separation of concerns between the learn.py application script, TrainerController and UnityEnvironment: * TrainerController must know about the proper way to instantiate the UnityEnvironment, which may differ from application to application. This also makes mocking or subclassing UnityEnvironment more difficult. * Many arguments are passed by learn.py to TrainerController and passed along to UnityEnvironment. This change moves environment construction logic into learn.py, as part of the greater refactor to separate trainer logic from actor / environment.	6 years ago
eshvk	cc9bdf17	Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return	6 years ago
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 years ago
Jonathan Harper	7a0d1531	Fix subprocess model saving on Windows On Windows the interrupt for subprocesses works in a different way from OSX/Linux. The result is that child subprocesses and their pipes may close while the parent process is still running during a keyboard (ctrl+C) interrupt. To handle this, this change adds handling for EOFError and BrokenPipeError exceptions when interacting with subprocess environments. Additional management is also added to be sure when using parallel runs using the "num-runs" option that the threads for each run are joined and KeyboardInterrupts are handled. These changes made the "_win_handler" we used to specially manage interrupts on Windows unnecessary, so they have been removed.	6 years ago
Jonathan Harper	e91e847c	Fix '--slow' flag after environment updates A change was made to the way the "train_mode" flag was used by environments when SubprocessUnityEnvironment was added which was intended to be part of a separate change set. This broke the CLI '--slow' flag. This change undoes those changes, so that the slow / fast simulation option works correctly. As a minor additional change, the remaining tests from top level 'tests' folders have been moved into the new test folders.	6 years ago
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 years ago
GitHub	e916dc48	use yaml.safe_load instead of yaml.load (#2124)	5 years ago
GitHub	2671e1a0	Enable mypy in precommit checks (#2177) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md	5 years ago
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 years ago
Chris Elion	bb7773c1	add flake8 to precommit	5 years ago
GitHub	966d8efb	Remove "external_brains" arg for TrainerController (#2213) TrainerController depended on an external_brains dictionary with brain params in its constructor but only used it in a single function call. The same function call (start_learning) takes the environment as an argument, which is the source of the external_brains. This change removes the dependency of TrainerController on external brains and removes the two class members related to external_brains and retrieves the brains directly from the environment.	5 years ago
Chris Elion	5d07ca1f	Merge remote-tracking branch 'origin/develop' into enable-flake8	5 years ago
Ervin T	a46f3faa	Enable generalization training (#2232) * Add Sampler and SamplerManager * Enable resampling of reset parameters during training * Documentation for Sampler and example YAML configuration file	5 years ago
GitHub	a9fe719c	Add Multi-GPU implementation for PPO (#2288) Add MultiGpuPPOPolicy class and command line options to run multi-GPU training	5 years ago
GitHub	30930383	Move trainer initialization into a utility function (#2412) This change moves trainer initialization outside of TrainerController, reducing some of the constructor arguments of TrainerController and setting up the ability for trainers to be initialized in the case where a TrainerController isn't needed.	5 years ago
Ervin T	184b5d5a	Change samplers to use random state to allow consistency in reset par… (#2398) * Change samplers to use random state to allow consistency in reset parameter draws for a specified seed	5 years ago
sankalp04	121221f2	Adding new command line arguments	5 years ago
sankalp04	dfc8885d	Allow generalization training with specified arguments of min_reward and min_lesson_length	5 years ago
Ervin Teng	072d2ef8	Merge latest develop	5 years ago
sankalp04	8cbfee43	Get rid of dead code and clean up code	5 years ago
sankalp04	dacb420b	Instantiate SamplerManager in learn.py instead of trainer_controller	5 years ago
sankalp04	f331e5b7	Rebase develop	5 years ago
Yuan Gao	b9210f4c	Updated the comment for --multi-gpu option.	5 years ago
Yuan Gao	33404e1b	Fixed the flake8	5 years ago
GitHub	67d754c5	Fix flake8 import warnings (#2584) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 years ago
Ervin Teng	02c8507b	Add tensorboard startup on training	5 years ago
Ervin Teng	3162606f	Freeze support for multiprocessing	5 years ago
GitHub	0d48a352	Use argparse for arg parsing (#2586) * encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow	5 years ago
Ervin Teng	209c71c0	Move freeze support	5 years ago
Ervin Teng	dc47efbe	Import webfiles.zip for Tensorboard	5 years ago
GitHub	d64a01e1	Added option to use environment arguments in learn (#2594) * Added option to use environment arguments in learn * hook into argparse * add example to readme	5 years ago
GitHub	473a8758	Develop yaml json loading errors (#2601) * WIP cleanup loading * better exceptions for parser errors - refer to online lint tools * feedback - rename variable	5 years ago
Jonathan Harper	3fc14963	EXPERIMENTAL horovod support	5 years ago
GitHub	39f280d6	Develop spawn brains (#2676) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 years ago
Chris Elion	c531f87d	Added --cpu flag to train using CPU only (#2755)	5 years ago
GitHub	5ee487e9	Fixing unecerrary error with curriculum (#2772)	5 years ago
GitHub	c6c01a03	Enable pylint and fix a few things (#2767) * enable pylint, disable some messages and fix a few * SAC memories in init	5 years ago
GitHub	38d39e38	disable tensorflow warnings by default (#2931)	5 years ago
GitHub	28dbf4c5	Allow --version argument in mlagents-learn (#2942) * allow --version argument in mlagents-learn * Develop version print add strings (#2945) * add __version__ to libs * more version info * use actual version	5 years ago
GitHub	a71c67d9	better logging for ports and versions (#3048) (#3069)	5 years ago
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 years ago
GitHub	e7bf6fff	Close environment if step raises an exception. (#3043) * close env manager in finally * rename to env_manager * remove obsolete mock checks	5 years ago
GitHub	a6df9f43	Develop new ll api (#3022) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 years ago
GitHub	15050bc4	better logging for ports and versions (#3048)	5 years ago
GitHub	36048cb6	Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. * Moving Env Manager to Trainers * fix pylint madness	5 years ago
Chris Elion	fdc810ff	move (first pass)	5 years ago
GitHub	58b6c7c2	Rename mlagents.envs to mlagents_envs (#3083)	5 years ago
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067)	5 years ago
GitHub	2ac242f7	Remove TrainerMetrics and add CSVWriter using new StatsWriter API (#3108)	5 years ago
GitHub	c6152459	Allow curricula to be created without files (#3145) Previously the Curriculum and MetaCurriculum classes required file / folder paths for initialization. These methods loaded the configuration for the curricula from the filesystem. Requiring files for configuring curricula makes testing and updating our config format more difficult. This change moves the file loading into static methods, so that Curricula / MetaCurricula can be initialized from dictionaries only.	5 years ago
GitHub	45010af3	Add stats reporter class and re-enable missing stats (#3076)	5 years ago
Jonathan Harper	481e0842	Remove the --num-runs option The "num-runs" command-line option provides the ability to run multiple identically-configured training runs in separate processes by running mlagents-learn only once. This is a rarely used ML-Agents feature, but it adds complexity to other parts of the system by adding the need to support multiprocessing and managing of ports for the parallel training runs. It also doesn't provide truly reproducible experiments, since there is no guarantee of resource isolation between the trials. This commit removes the --num-runs option, with the idea that users will manage parallel or sequential runs of the same experiment themselves in the future.	5 years ago
GitHub	b0a2a54f	Add 'run-experiment' script, simpler curriculum config (#3186) This change adds a new 'mlagents-run-experiment' endpoint which accepts a single YAML/JSON file providing all of the information that mlagents-learn accepts via command-line arguments and file inputs. As part of this change the curriculum configuration is simplified to accept only a single file for all the curricula in an environment rather than a file for each behavior.	5 years ago
Ervin Teng	9ad99eb6	Combined model and policy for PPO	5 years ago
GitHub	f62af526	Set logging level to INFO, was overridden by newer TF (#3358)	5 years ago
GitHub	2ac92182	constant for editor port (#3396) * constant for editor port * undo stupid pycharm * cleanup	5 years ago
Ervin Teng	00017bab	Temporarily remove multi-GPU	5 years ago
Alphonso Crawford	d106d497	Raise exception if path does not exist	5 years ago
Alphonso Crawford	615de041	Check if environment is launchable in learn.py	5 years ago
Alphonso Crawford	b891a38b	properly formatting within environment_launch_check	5 years ago
Alphonso Crawford	2c14779c	moving launch check to static method	5 years ago
Anupam Bhatnagar	d8c79f48	resolving merge conflicts	5 years ago
Alphonso Crawford	cff1a003	pylint error resolution	5 years ago
Alphonso Crawford	51e947fe	extra space aboe create environment factory	5 years ago
Alphonso Crawford	40f1f6ed	validate_environment_path	5 years ago
Alphonso Crawford	2a154bf3	Moving env_strip to validate_environment_path	5 years ago
Ervin Teng	5ef902bf	Merge branch 'master' into develop-splitpolicyoptimizer	5 years ago
GitHub	be14dd42	Make the timer output format consistent (#3472)	5 years ago
Ervin Teng	bcc25d59	Merge branch 'master' into develop-splitpolicyoptimizer	5 years ago
GitHub	472f9f0e	Merge branch 'master' into develop-badEnvReturnCode	5 years ago
Alphonso Crawford	35e49f5d	Using f-strings for exception strings	5 years ago
GitHub	c145e75b	Split Policy and Optimizer, common Policy for PPO and SAC (#3345)	5 years ago
Andrew Cohen	bd78ec40	self-play assym hacked branch	5 years ago
Andrew Cohen	94654de4	ghost controller	5 years ago
Anupam Bhatnagar	abc369a6	Adding a logging utility for improved logs	5 years ago
Anupam Bhatnagar	ee67c628	add log level as an argument to create logger	5 years ago
Anupam Bhatnagar	c2611126	uniformize log level for all loggers	5 years ago
Anupam Bhatnagar	e8e0078e	first commit	5 years ago
Chris Elion	0d65c600	top-level timers to see where time is going	5 years ago
Ervin Teng	bcf073bf	Move console logging to ConsoleWriter	5 years ago
Ervin Teng	49df4038	Make progress bar a statswriter	5 years ago
Chris Elion	a5dd261b	make sure top-level timer is closed before writing	5 years ago
GitHub	a1f00b07	Merge pull request #3629 from Unity-Technologies/develop-timers-fix-writing make sure top-level timer is closed before writing	5 years ago
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 years ago
Andrew Cohen	ac261e36	Merge branch 'master' into self-play-mutex	5 years ago
GitHub	2ecd1d9b	remove obsolete code, offset worker seeds (#3645)	5 years ago
Andrew Cohen	eefc4811	Merge branch 'master' into self-play-mutex	5 years ago
Andrew Cohen	1269b555	docstrings/ghost_swap -> team_change	5 years ago
GitHub	5d4f7f08	cleanup port logic in UnityEnvironment (#3673)	5 years ago
GitHub	458e68f1	Remove "docker target" feature (#3687) The "docker target" feature and associated command-line flag --docker-target-name were created for use with the now-deprecated Docker setup. This feature redirects the paths used by learn.py for the environment and config files to be based from a directory other than the current working directory. Additionally it wrapped the environment execution with xvfb-run. This commit removes the "docker target" feature because: * Renaming the paths doesn't fix any problem. Absolute paths can already be passed for configs and environment executables. * Use of xserver, Xvfb, or xvfb-run are independent of mlagents-learn and can be used outside of the mlagents-learn call. Further, xvfb-run is not the only solution for software rendering.	5 years ago
GitHub	4ecd6ad3	Fix how we set logging levels (#3703) * cleanup logging * comments and cleanup * pylint, gym	5 years ago
GitHub	bc1fdf07	[refactor] CLI changes (#3705)	5 years ago
Anupam Bhatnagar	50e52d9c	Merge branch 'master' into distributed-training	5 years ago
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710)	5 years ago
Anupam Bhatnagar	001fce2a	first commit	5 years ago
Anupam Bhatnagar	06c6de13	activate environment from executable	5 years ago
GitHub	c79475eb	[MLA-803] Add timer metadata to C# and python (#3758) * Add timer metadata to C# and python * end time last * changelog * add commandline args	5 years ago
GitHub	8c5edc99	Improvements to Training-ML-Agents (#3776) * Improvements to Training-ML-Agents - Removed duplicate documentation - Moved CLI descriptions to learn.py - Reorganized "Training with mlagents-learn" into 5 sub-sections * fixed formatting errors and incorporated minor feedback * minor improvement * Minor formatting. * fixed run-id references * Keeping link to use Inference consistent with master Will update the UIE page in a separate PR. * Squashed commit of the following: commit 9600d0fbe6684eca69fb5bab84ab0f6754fc8b0f Author: Marwan Mattar <marwan@unity3d.com> Date: Tue Apr 14 17:45:33 2020 -0700 Various doc improvements (#3775) * Various doc improvements For Using-Virtual-Environment.md: - Made a note regarding updating setuptools and pip. - Changed lists from "-" to "*" For Using-Tensorboard.md: - Changed the ordered list to use "1." For Training-on-Microsoft-Azure-Custom-Instance.md: - Deleted ...	5 years ago
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 years ago
Andrew Cohen	ddb6787c	hard reset when team changes	5 years ago
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829)	5 years ago
GitHub	fccbcdd2	Removed the default for width and height of the executable training. (#3867) * Removed the default for width and height of the executable training. This is to help relove #3835 since setting the screen resolution on Linux 2019.3 can cause issues. * Editing the changelog * Making fields in EngineConfig optional	5 years ago
GitHub	f86fc81d	[refactor] Move configuration files to single YAML file (#3791)	5 years ago
GitHub	7e0032f5	[refactor] Allow full RunOptions to be specified in trainer configuration YAML (#3815)	5 years ago
Chris Elion	68b68396	Merge remote-tracking branch 'origin/master' into release_1_to_master	5 years ago
GitHub	812983c0	Some improvements to the UnityEnvironment class (#3939) * Fix typo * Made a side channel utils to reduce the complexity of UnityEnvironment * Added a get_side_channel_dict utils method * Better executable launcher (unarguably) * Fixing the broken test * Addressing comments * [skip ci] Update ml-agents-envs/mlagents_envs/side_channel/side_channel_manager.py Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com> * No catch all Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	5 years ago
GitHub	9083752d	Making some things private in UnityEnvironment (#3951) * Making some things private in UnityEnvironment * Readding the default ports as public * removing _SCALAR_ACTION_TYPES and _SINGLE_BRAIN_ACTION_TYPES * Removing unused method	5 years ago
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936)	4 years ago
Andrew Cohen	fe0a077e	passing sampler configs to c#	4 years ago
Andrew Cohen	4464ca46	ignoring commit checks	4 years ago
Andrew Cohen	91217b0d	use settings.py to check PR config	4 years ago
GitHub	f5435876	[refactor] Store and restore state along with checkpoints (#4025)	4 years ago
Andrew Cohen	e7750fc9	Merge branch 'master' into develop-sampler-refactor	4 years ago
GitHub	09853e13	[refactor] Move checkpoint saving into trainer (#4034)	4 years ago
Andrew Cohen	b790ce76	error properly when a keyword is not followed by a valid config in yaml	4 years ago
Andrew Cohen	72e4a9c6	use run_seed if no seed specified in yaml	4 years ago
Andrew Cohen	c0f7052b	Merge branch 'master' into develop-sampler-refactor	4 years ago
GitHub	09c7787c	[bug-fix] Fix regression in --initialize-from feature (#4086)	4 years ago
Andrew Cohen	56479c12	add docstring for maybe_add_samplers	4 years ago
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 years ago
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
Anupam Bhatnagar	4afd8f92	first commit	4 years ago
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065)	4 years ago
GitHub	8c2ade77	Separate send environment data from reset (#4128)	4 years ago
Andrew Cohen	b61334a5	add transfer relaunch for cloud	4 years ago
Anupam Bhatnagar	24d5f881	first commit	4 years ago
yanchaosun	1e52ad3d	ready for cloud training	4 years ago
yanchaosun	e338ab91	test cloud training	4 years ago
yanchaosun	f0881a94	fix commands for cloud training	4 years ago
GitHub	05a11c96	Develop add fire exp framework (#4213) * Experiment branch for comparing torch * Updates and merging ervin changes * improvements on experiment_torch.py * Better printing of results * preliminary gpu experiment * Testing gpu * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * _ * _ * _ * _ * _ * _ * _ * _ * Attempt at gpu on tf. Does not work * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * Fixing learn.py	4 years ago
yanchaosun	44fa16fa	fix issues with cloud training	4 years ago
yanchaosun	ad95032b	transfer path	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
yanchaosun	5ba3031a	comment out transfer	4 years ago
yanchaosun	59e93b0b	transfer config	4 years ago
yanchaosun	5eccb4c9	new transfer test for cloud	4 years ago
GitHub	3de1e660	[bug-fix] Initialize-from being incorrectly loaded as "None" rather than None (#4175)	4 years ago
GitHub	0e0daf47	[add-fire] Merge post-0.19.0 master into add-fire (#4328)	4 years ago
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 years ago
GitHub	84440f05	Convert checkpoints to .NN (#4127) This change adds an export to .nn for each checkpoint generated by RLTrainer and adds a NNCheckpointManager to track the generated checkpoints and final model in training_status.json. Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	4 years ago
GitHub	7f3e2e22	add numpy version to timer metadata, log random seed (#4285)	4 years ago
GitHub	493793a6	[MLA-1233] Remove stats.CSVWriter (#4300)	4 years ago
HH	af4792a6	reset ppo learn.py to master	4 years ago
Anupam Bhatnagar	e9d3de8e	[skip ci] adding initializer	4 years ago
Anupam Bhatnagar	90435403	[skip ci] adding statement to import rank	4 years ago
Anupam Bhatnagar	5d9c110f	[skip ci] clean up around initializer	4 years ago
Anupam Bhatnagar	4d19245f	[skip ci] adding PluginSettings	4 years ago
Anupam Bhatnagar	dbd21c95	[skip ci] adding distributed trainers	4 years ago
Anupam Bhatnagar	07daf8b5	[skip ci] adding type annotations	4 years ago
GitHub	df685184	Make --torch use torch even without config (#4400) * Make --torch use torch even without config * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * renaming use_torch to force_torch Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 years ago
Ruo-Ping Dong	a0b14228	set OMP_NUM_THREADS	4 years ago
Anupam Bhatnagar	5e8aa485	renaming file from globals.py to global_values.py	4 years ago
Anupam Bhatnagar	71c301bc	minor fixes	4 years ago
Scott Jordan	56745026	Initial commit of running active learning code Active learning code is running on walker variable speed. Needs to be tested to see if it is working.	4 years ago
Anupam Bhatnagar	d7f0d457	[skip ci] removing package import statements	4 years ago
Scott Jordan	78f8a9a2	Updated task manager active learning is no optional and defaults to uniform sampling of tasks. Renamed ActiveLearningTaskManager to just TaskManager	4 years ago
Ruo-Ping Dong	7eceb27f	revert changes	4 years ago
Ervin Teng	8cc75388	Changes for experiment	4 years ago
Ruo-Ping Dong	fb50b0ec	add wb	4 years ago
Ervin Teng	fdc887a1	Some experimental stuff	4 years ago
Ruo-Ping Dong	4a2512f3	update	4 years ago
Ervin Teng	3a7cd3ad	Merge experiments	4 years ago
vincentpierre	d9e2f974	-	4 years ago
vincentpierre	dda6dc1b	-	4 years ago
vincentpierre	31ea11e0	-	4 years ago
vincentpierre	a8137478	-	4 years ago
GitHub	c188781b	[life improvement] Moving Python files around (#4531) * Moved components to the tf folder and moved the TrainerFactory to the `trainer` folder * Addressing comments * Editing the migrating doc * fixing test	4 years ago
GitHub	a690af74	[refactor] Make PyTorch the default and TensorFlow optional (#4517) * Torch setup.py * Set torch to default * Make torch default in setup.py * Remove indents * Remove other instances of TF being used * Add tensorboard to setup.py * Adding correst setup commands for verifying torch is installed (#4524) * Adding correst setup commands for verifying torch is installed * Editing the test_requirments to add tf and remove torch * Develop torchdefault raise outside setup (#4530) * Torch not imported error to raise at first usage * Torch not imported error to raise at first usage * [refactor] Use PyTorch TensorBoard utils (#4518) * Convert stats writer to use PyTorch TB support * Use common function to print params * Update test * Bump tensorboard to 1.15 to fix the tests * putting tensorboard 1.15.0 as min version requirement Co-authored-by: vincentpierre <vincentpierre@unity3d.com> * [Docs] Initial documentation changes for making...	4 years ago
Ervin Teng	3b15cc32	Multiprocessing but Stats are quite broken	4 years ago
vincentpierre	b863af57	Removing TensorFlow Trainers	4 years ago
GitHub	7387a77f	remove pylint (#4836) * remove pylint * remove other pylint disables	4 years ago
GitHub	457ed0b8	Set torch device from commandline (#4888)	4 years ago
GitHub	d8835857	[MLA-1540] Training Analytics (#4780)	4 years ago
GitHub	7954bd26	setuptools-based plugin for StatsWriters (#4788)	4 years ago
GitHub	2e19759c	Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) * turning some logger.info into logger.debug and remove some logging overhead when not using debug * Addressing comments * Adding to changelog	4 years ago
vincentpierre	e3b67e9f	Remove some dependencies of the trainers on UnityEnvironment (and use BaseEnv instead)	3 years ago


182 commits (2ca5cd21-4e15-4ff9-b2b4-a3d1651484d3)