ml-agents

作者	SHA1	备注	提交日期
GitHub	9178b5d2	Improve test_simple.py and check discrete actions (#2345 ) * discrete action coverage * undo change * rename test * move test file * Revert "move test file" This reverts commit 2e72b2dbf9ce9163c92066036b06591dc4173e5c. * move files post merge	5 年前
GitHub	a9fe719c	Add Multi-GPU implementation for PPO (#2288 ) Add MultiGpuPPOPolicy class and command line options to run multi-GPU training	5 年前
GitHub	30930383	Move trainer initialization into a utility function (#2412 ) This change moves trainer initialization outside of TrainerController, reducing some of the constructor arguments of TrainerController and setting up the ability for trainers to be initialized in the case where a TrainerController isn't needed.	5 年前
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341 ) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 年前
GitHub	6f67cf40	unit test - don't use global random generator (#2521 ) * unit test - don't use global random generator * Update test_simple_rl.py	5 年前
GitHub	9e2c30ee	Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520 )	5 年前
GitHub	0390c78b	Fix determinism in unit test (#2530 ) * initialize random instance correctly * restore threshold (I hope)	5 年前
GitHub	3df585d9	Fix issue where SAC encoder type is always simple (#2548 )	5 年前
GitHub	babe9e2f	Develop remove academy done (#2519 ) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 年前
GitHub	67d754c5	Fix flake8 import warnings (#2584 ) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 年前
GitHub	2f74b3cc	Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646 )	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
GitHub	d39b1881	speed up unit test (#2847 )	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	e6f549dc	[MLA-12] update protobuf for vector observations (#2862 )	5 年前
GitHub	69d1a033	Develop remove past action communication (#2913 ) * Modifying the .proto files * attempt 1 at refactoring Python * works for ppo hallway * changing the documentation * now works with both sac and ppo both training and inference * Ned to fix the tests * TODOs : - Fix the demonstration recorder - Fix the demonstration loader - verify the intrinsic reward signals work - Fix the tests on Python - Fix the C# tests * Regenerating the protos * fix proto typo * protos and modifying the C# demo recorder * modified the demo loader * Demos are loading * IMPORTANT : THESE ARE THE FILES USED FOR CONVERSION FROM OLD TO NEW FORMAT * Modified all the demo files * Fixing all the tests * fixing ci * addressing comments * removing reference to memories in the ll-api	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	a6df9f43	Develop new ll api (#3022 ) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 年前
GitHub	36048cb6	Moving Env Manager to Trainers (#3062 ) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. * Moving Env Manager to Trainers * fix pylint madness	5 年前
GitHub	8ca0d810	Better error handling if trainer config doesn't contain "default" section (#3063 )	5 年前
GitHub	2c3794a6	handle mismatch between brain and metacurriculum (#3034 ) * handle mismatch between brain and metacur * add unit tests * use os.path.splitext in metacurriculum * fix type	5 年前
Chris Elion	fdc810ff	move (first pass)	5 年前
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067 )	5 年前
GitHub	7fbf6b1d	add flake8-bugbear (#3137 ) * unused loop variables * change loop variable	5 年前
GitHub	45010af3	Add stats reporter class and re-enable missing stats (#3076 )	5 年前
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207 )	5 年前
GitHub	14193ada	Self-play for symmetric games (#3194 )	5 年前
Ervin Teng	d680ed32	Fix metacurriculum test (for good)	5 年前
GitHub	7d954797	[change] Separate action outputs into OutputDistributions object (#3514 )	5 年前
GitHub	f469cbb0	Simple1DEnv refactor and additional ghost trainer tests (#3537 )	5 年前
GitHub	323f104c	[tests] LSTM end-to-end tests (#3544 )	5 年前
Andrew Cohen	0cc2956d	write to proto	5 年前
GitHub	bcce774f	[tests] Visual observation tests (#3549 )	5 年前
GitHub	213d2466	[bug-fix] Change Simple1DEnvironment to spawn new agent IDs on reset (#3558 )	5 年前
GitHub	b6e3fd67	[tests] Add additional unit tests (#3581 )	5 年前
Andrew Cohen	b1cfa74d	Merge branch 'master' into develop-test-imitation	5 年前
Andrew Cohen	e7836fb5	record demos 1d env	5 年前
Andrew Cohen	7aaf1fb6	gail and bc tests	5 年前
Andrew Cohen	f1eeed9c	success threshold to .9 for imitation	5 年前
Andrew Cohen	f6d6e3d0	reccurent gail tests	5 年前
GitHub	25cc9f15	[change] Move hyperparameter printing entirely into StatsWriters (#3630 )	5 年前
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698 ) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 年前
GitHub	2912c883	Basic and visual GAIL and BC integration tests (#3626 )	5 年前
Andrew Cohen	79076b70	ELO calculation done in ghost controller	5 年前
GitHub	29f82921	[bug-fix] Improve performance for PPO with continuous actions (#3662 )	5 年前
Ervin Teng	ee27e2cc	Fix tests	5 年前
Andrew Cohen	c4e54218	replaced ghost_swap with team_change in tests	5 年前
GitHub	104f2c46	[tests] Add tests for multiple actions/action branches (#3672 )	5 年前
GitHub	56b75555	[tests] Make end-to-end tests more stable (#3697 )	5 年前
GitHub	141831da	[bug-fix] Fix entropy computation for GaussianDistribution (#3684 )	5 年前
Andrew Cohen	93d344ff	simple rl asymm ghost tests	5 年前
Andrew Cohen	345fa382	current_best_ratio -> latest_model_ratio	5 年前
Andrew Cohen	62c87031	Merge branch 'master' into self-play-mutex	5 年前
Ervin Teng	06fa3d39	Merge branch 'master' into develop-sac-apex	5 年前
Andrew Cohen	7006b5ff	asymm ghost test consistent	5 年前
Ervin Teng	971e4b2d	Don't block when disabling threading	5 年前
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681 ) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 年前
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 年前
Ervin Teng	51e76f00	Adjust SAC recurrent	5 年前
Ervin Teng	9fe104d6	Make threading disable-able per trainer	5 年前
Ervin Teng	92158d54	Remove threaded from trainer_controller	5 年前
Ervin Teng	23039746	Disable threading for all simple_rl tests	5 年前
GitHub	1536b9f2	Increasing steps on asymmetric ghost test (#3802 )	5 年前
GitHub	4d23200b	[refactor] Run Trainers in separate threads (#3690 )	5 年前
GitHub	7e5513a4	[bug-fix] Increase buffer size for SAC tests (#3813 )	5 年前
vincentpierre	cad57a00	[skip ci] Added some tests but they do not pass (too hard)	5 年前
GitHub	adeb6536	Catch dimension mismatches between demos and policy (#3821 )	5 年前
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807 ) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 年前
GitHub	7b78ffeb	support newer versions of tensorflow (2.1+) (#3830 ) * support tf2.x and python3.8 * tensorflow==2.2.0rc3 for python3.8 * stick with tf2.1 and py3.7 for now * More gail visual steps in simple test (#3836) * increase gail visual ppo steps * increase to 2000 * tune steps down to 750 Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前
vincentpierre	c34dd5b6	Merge branch 'master' into develop-gym-wrapper	5 年前
vincentpierre	67027af3	Removed the failing gym tests	5 年前
GitHub	c5b94ca6	Use LR schedule for beta and epsilon (#3940 )	5 年前
Christopher Goy	ba80b292	format files with pre-commit.	4 年前
vincentpierre	6ddfe74f	Merge branch 'master' into develop-gym-wrapper	5 年前
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936 )	4 年前
GitHub	335cff3e	[versioning] Save ML-Agents version in checkpoints and check on load (#4035 )	4 年前
GitHub	21fe203e	[tests] Increase buffer_init_steps for recurrent sac test (#4051 )	4 年前
GitHub	09853e13	[refactor] Move checkpoint saving into trainer (#4034 )	4 年前
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102 ) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	f76780f1	fix tests	4 年前
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065 )	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
vincentpierre	599d7e9f	Merging master	4 年前
vincentpierre	d031c7a9	Merging master	4 年前
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160 ) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 年前
GitHub	129f9ddc	[MLA-427] make pyupgrade convert f-strings too (#4244 ) * make pyupgrade convert f-strings too	4 年前
Ruo-Ping Dong	95858e25	update saver interface and add tests	4 年前
GitHub	25dc8c3d	Add Saver Class to handle all save/load/checkpoint/export work (#4323 )	4 年前
Andrew Cohen	af7d3800	add test_simple_rl tests to torch	4 年前
GitHub	2332bc32	Add fire to test_simple_rl.py (#4378 ) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
GitHub	bf6506fc	[feature] Add small CNN for grids 5x5 and up (#4434 )	4 年前

1 2

92 次代码提交 (49d6b70c-989e-4b5d-9010-2c19a966f11c)