ml-agents

Author	SHA1	Message	Commit date
GitHub	30930383	Move trainer initialization into a utility function (#2412) This change moves trainer initialization outside of TrainerController, reducing some of the constructor arguments of TrainerController and setting up the ability for trainers to be initialized in the case where a TrainerController isn't needed.	5 years ago
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 years ago
GitHub	67d754c5	Fix flake8 import warnings (#2584) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 years ago
GitHub	473a8758	Develop yaml json loading errors (#2601) * WIP cleanup loading * better exceptions for parser errors - refer to online lint tools * feedback - rename variable	5 years ago
GitHub	cb144f20	small mypy cleanup (#2637) * small mypy cleanup * sac cleanup * types for ppo policy init	5 years ago
GitHub	5f5ccfa0	Feature Deprecation : Online Behavioral Cloning (#2659) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature.	5 years ago
GitHub	39f280d6	Develop spawn brains (#2676) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 years ago
Andrew Cohen	184af227	splitting brain params into brain name and identifiers	5 years ago
Andrew Cohen	e96b80db	recieves brain_name and identifier on python side	5 years ago
GitHub	36048cb6	Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. * Moving Env Manager to Trainers * fix pylint madness	5 years ago
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049)	5 years ago
Andrew Cohen	8f62c69e	splitting brain params into brain name and identifiers	5 years ago
GitHub	1fa07edb	Remove Standalone Offline BC Training (#2969)	5 years ago
GitHub	8ca0d810	Better error handling if trainer config doesn't contain "default" section (#3063)	5 years ago
Andrew Cohen	c7f283df	splitting brain params into brain name and identifiers	5 years ago
GitHub	2c3794a6	handle mismatch between brain and metacurriculum (#3034) * handle mismatch between brain and metacur * add unit tests * use os.path.splitext in metacurriculum * fix type	5 years ago
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067)	5 years ago
GitHub	0b5b1b01	Develop magic string + trajectory (#3122) * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * trainer_controller expects name_behavior_ids * add_policy and create_policy separated * adjusting tests to expect trainer.add_policy to be called * fixing tests * fixed naming ...	5 years ago
Andrew Cohen	082789ea	Merge branch 'master' into develop-magic-string	5 years ago
GitHub	0d56f6ba	Merge branch 'master' into develop-magic-string	5 years ago
Andrew Cohen	654b0c79	Merge branch 'master' into develop-magic-string	5 years ago
GitHub	c6152459	Allow curricula to be created without files (#3145) Previously the Curriculum and MetaCurriculum classes required file / folder paths for initialization. These methods loaded the configuration for the curricula from the filesystem. Requiring files for configuring curricula makes testing and updating our config format more difficult. This change moves the file loading into static methods, so that Curricula / MetaCurricula can be initialized from dictionaries only.	5 years ago
GitHub	45010af3	Add stats reporter class and re-enable missing stats (#3076)	5 years ago
GitHub	0fe7e731	use absolute path in error (#3230)	5 years ago
Ervin Teng	9ad99eb6	Combined model and policy for PPO	5 years ago
GitHub	14193ada	Self-play for symmetric games (#3194)	5 years ago
Ervin Teng	db249ceb	Merge branch 'master' into develop-splitpolicyoptimizer	5 years ago
Ervin Teng	00017bab	Temporarily remove multi-GPU	5 years ago
Andrew Cohen	94654de4	ghost controller	5 years ago
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 years ago
Andrew Cohen	1269b555	docstrings/ghost_swap -> team_change	5 years ago
Andrew Cohen	bc611906	removed team-change CLI	5 years ago
GitHub	bc1fdf07	[refactor] CLI changes (#3705)	5 years ago
Andrew Cohen	59b88be6	Merge branch 'master' into self-play-mutex	5 years ago
GitHub	9cbc3fa2	Asymmetric self-play (#3653)	5 years ago
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710)	5 years ago
Andrew Cohen	ddb6787c	hard reset when team changes	5 years ago
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829)	5 years ago
GitHub	f86fc81d	[refactor] Move configuration files to single YAML file (#3791)	5 years ago
Christopher Goy	ba80b292	format files with pre-commit.	4 years ago
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936)	4 years ago
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 years ago
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
yanchaosun	3ef4196e	Added the algorithm named ppo_transfer	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 years ago
yanchaosun	f81feec4	config fix; basic sac	4 years ago
Anupam Bhatnagar	dbd21c95	[skip ci] adding distributed trainers	4 years ago
Anupam Bhatnagar	07daf8b5	[skip ci] adding type annotations	4 years ago
Anupam Bhatnagar	9d8fc301	[skip ci] fix distributed sac import statement	4 years ago
GitHub	df685184	Make --torch use torch even without config (#4400) * Make --torch use torch even without config * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * Update ml-agents/mlagents/trainers/trainer_util.py Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> * renaming use_torch to force_torch Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 years ago
Anupam Bhatnagar	890cc572	[skip ci] fix import statement	4 years ago
Anupam Bhatnagar	4398d7b8	[skip ci] renaming trainer to ppo_trainer and sac_trainer to avoid name collision	4 years ago
Scott Jordan	56745026	Initial commit of running active learning code Active learning code is running on walker variable speed. Needs to be tested to see if it is working.	4 years ago
Anupam Bhatnagar	d7f0d457	[skip ci] removing package import statements	4 years ago
Anupam Bhatnagar	3d7956e9	[skip ci] fix key name	4 years ago

56 commits (7d8651ac-4808-4d52-86d5-a5423bd3329b)