* Implement behavioral cloning for continuous and discrete control (cc/dc), feed-forward and recurrent networks (fc/rnn), and vector/visual observations.
* Re-organize folder structure in anticipation of unitytrainers as a package.
* Create demo environment BananaImitation to validate behavioral cloning.
* Fixes #336
* Reorganized Python tests into a separate folder, and made individual test files for different (sub)modules.
* Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon.
* Minor bug fixes discovered while writing tests.
* Reworked GridWorld to reset much faster.
* Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.
* On Demand Decision: Use RequestDecision and RequestAction
* New Agent Inspector: Use it to set On Demand Decision
* New BrainParameters interface
* LSTM memory size is now set in python
* New C# API
* Semantic Changes
* Replaced RunMDP
* New Bouncer Environment to test On Demand Decision
Fixes the following issues:
* Missing component reference in BananaRL environment.
* Neural Network for multiple visual observations was not properly generated.
* Episode time-out value estimate bootstrapping used incorrect observation as input.
Fixes the issue raised by @hsaikia in #552
Added the memory_size variable to the BC model
Added memory_size and recurrent_out to the output nodes of the graph when using BC with LSTM
* [containers] Enables container support for scenes that use visual observations
* [Initial Commit] Works only with simple balance ball
* [Optimization] Store the academy in the brainBatcher as a temporary measure
* [Modifications] Made it work from the editor as a prototype
* [Made socket communicator and reimplemented all functionalities]
* [Forgotten file] removed .meta file
* [Forgot the meta file]
* [Metafile] deleted metafile
* [Comments] Removed dead code
* [Comments] Added some descriptions
* [Bug Fix] Multi brain scenario
* [improved AgentInfo converter]
* [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object
* [Timeout] Implemented a timeout for the rpc communicator in Unity
* [Libraries] Added the C# Protobuf and Grpc libraries
* [Requirements] Added protobuf 3.5.2 to the requirements
* [Code Formatting] Removed dead code and split some lines
...
* Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer.
* To enable, set the use_curiosity flag to true in the hyperparameter file (see the sketch below).
* Includes refactor of unitytrainers model code to accommodate new feature.
* Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.
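A minimal sketch of how the curiosity settings mentioned above might look, written here as a Python dict for illustration. Only `use_curiosity` is named in the change itself; the other keys and values are assumptions.

```python
# Hypothetical sketch of a PPO trainer configuration enabling curiosity.
pyramids_trainer_config = {
    "trainer": "ppo",
    "use_curiosity": True,        # turn on the intrinsic curiosity module
    "curiosity_strength": 0.01,   # assumed: weight of the intrinsic reward
    "curiosity_enc_size": 128,    # assumed: size of the curiosity encoding
}
```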
- Raises MetaCurriculumError when curriculum_folder is not a folder.
- Removed the ability to set curriculum_folder to None.
trainer_controller.py has been refactored to not depend on this
functionality, which will make curriculums more stable.
- The old Curriculum object would accept None
as a location for the curriculum. If the
location was None, it would return default
values as its config and lesson number.
- The new MetaCurriculum does not accept
None as a location for the curriculum
folder. This was done to remove unnecessary
edge case functionality from curriculums.
- None checks have been added into
trainer_controller. In the future,
it should be possible to better refactor
trainer_controller so that these None
checks can be removed. This is preferable
to hard-coding default behavior into
MetaCurriculum objects for cases where no
metacurriculum is in place.
* Changing learn.py log messages.
- learn.py refers to the mlagents-learn script now.
- If a non-existent trainer config is passed, the log message
correctly points that out now.
* Changing the curriculum arg from file to dir.
* Fixing learn.py, trainer_controller.py, and Docker
- learn.py has been moved under trainers.
- this was a two line change
- learn.py will no longer be run as a main method
- docopt arguments are strings by default. learn.py now uses
this assumption to correctly parse arguments.
- trainer_controller.py now considers the Docker volume when
accepting a trainer config file path.
- the Docker container now uses mlagents-learn.
* Removing extraneous unity-volume ref.
* Documentation tweaks and updates (#1479)
* Add blurb about using the --load flag in the intro guide, and typo fix.
* Add section in tutorial to create multiple area learning environment.
* Add mention of Done() method in agent design
* fixed the windows ctrl-c bug
* fixed typo
* removed some unnecessary printing
* nothing
* make the import of the win api conditional
* removed the duplicate code
* added the ability to use python debugger on ml-agents
* added newline at the end, changed the import to be complete path
* changed the info.log into policy.export_model, changed the sys.platform to use startswith
* fixed a bug
* remove the printing of the path
* tweaked the info message to notify the user about the expected error message
* removed some logging according to comments
* removed the sys import
* Revert "Documentation tweaks and updates (#1479)"
This reverts commit 84ef07a4525fa8a89f4...
* Remove env creation logic from TrainerController
Currently TrainerController includes logic related to creating the
UnityEnvironment, which causes poor separation of concerns between
the learn.py application script, TrainerController and UnityEnvironment:
* TrainerController must know about the proper way to instantiate the
UnityEnvironment, which may differ from application to application.
This also makes mocking or subclassing UnityEnvironment more
difficult.
* Many arguments are passed by learn.py to TrainerController and passed
along to UnityEnvironment.
This change moves environment construction logic into learn.py, as part
of the greater refactor to separate trainer logic from actor / environment.
* Switched default Mac GFX API to Metal
* Added Barracuda pre-0.1.5
* Added basic integration with Barracuda Inference Engine
* Use predefined outputs the same way as for TF engine
* Fixed discrete action + LSTM support
* Switch Unity Mac Editor to Metal GFX API
* Fixed null model handling
* All examples converted to support Barracuda
* Added model conversion from Tensorflow to Barracuda
copied the barracuda.py file to ml-agents/mlagents/trainers
copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers
modified the tensorflow_to_barracuda.py file so it could be called from mlagents
modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file
* Added missing iOS BLAS plugin
* Added forgotten prefab changes
* Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders
* Exposed GPU support for LearningBrain inference
...
* Move 'take_action' into Policy class
This refactor is part of Actor-Trainer separation. Since policies
will be distributed across actors in separate processes which share
a single trainer, taking an action should be the responsibility of
the policy.
This change makes a few smaller changes:
* Combines `take_action` logic between trainers, making it more
generic
* Adds an `ActionInfo` data class (sketched below) to be more explicit
about the data returned by the policy, only used by TrainerController
and the policy for now.
* Moves trainer stats logic out of `take_action` and into
`add_experiences`
* Renames 'take_action' to 'get_action'
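A rough sketch of what an `ActionInfo` data class like the one mentioned above could look like; the exact field names are assumptions for illustration only.

```python
from typing import Any, Dict, NamedTuple, Optional

class ActionInfo(NamedTuple):
    """Bundles what a policy returns for one step (field names are illustrative)."""
    action: Any                                # actions to apply in the environment
    value: Optional[Any] = None                # value estimates, if produced
    outputs: Optional[Dict[str, Any]] = None   # raw network outputs kept for the trainer
```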
This commit adds support for running Unity environments in parallel.
An abstract base class was created for UnityEnvironment, from which a
new SubprocessUnityEnvironment inherits.
SubprocessUnityEnvironment communicates with its workers through pipes
in order to send them commands that are run in parallel.
A few significant changes needed to be made as a side-effect:
* UnityEnvironments are created via a factory method (a closure)
rather than being directly created by the main process (see the
sketch after this list).
* In mlagents-learn "worker-id" has been replaced by "base-port"
and "num-envs", and worker_ids are automatically assigned across runs.
* BrainInfo objects now convert all fields to numpy arrays or lists to
avoid serialization issues.
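A hedged sketch of the factory-closure pattern described above. The import path matches the example elsewhere in this log, but the constructor arguments shown (`base_port` in particular) are assumptions about the API of that era.

```python
from mlagents.envs.environment import UnityEnvironment  # assumed import path for this era

def make_env_factory(env_path: str, base_port: int):
    """Return a closure that builds a UnityEnvironment for a given worker id."""
    def create_env(worker_id: int) -> UnityEnvironment:
        # each subprocess worker gets its own id, and therefore its own port
        return UnityEnvironment(
            file_name=env_path,
            base_port=base_port,   # assumed parameter name
            worker_id=worker_id,
        )
    return create_env
```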
On Windows the interrupt for subprocesses works in a different
way from OSX/Linux. The result is that child subprocesses and
their pipes may close while the parent process is still running
during a keyboard (ctrl+C) interrupt.
To handle this, this change adds handling for EOFError and
BrokenPipeError exceptions when interacting with subprocess
environments. Additional management is also added to be sure
that, when running parallel runs with the "num-runs" option,
the threads for each run are joined and KeyboardInterrupts are
handled.
These changes made the "_win_handler" we previously used to
manage interrupts on Windows unnecessary, so it has been
removed.
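A minimal sketch of the kind of guard described above, assuming a multiprocessing pipe connection to the worker; the function name is illustrative, not the actual ML-Agents code.

```python
def recv_from_worker(connection):
    """Read a message from a worker pipe, treating a dead pipe as a shutdown signal."""
    try:
        return connection.recv()
    except (EOFError, BrokenPipeError):
        # On Windows a Ctrl+C can close the child and its pipe while the
        # parent is still running, so a broken pipe is handled as "worker gone".
        return None
```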
When SubprocessUnityEnvironment was added, a change to the way the
"train_mode" flag was used by environments was included that was
intended to be part of a separate change set. This broke the CLI
'--slow' flag. This change undoes those changes, so that the slow /
fast simulation option works correctly.
As a minor additional change, the remaining tests from top level
'tests' folders have been moved into the new test folders.
When using parallel SubprocessUnityEnvironment instances along
with Academy Done(), a new step might be taken when reset should
have been called because some environments may have been done while
others were not (making "global done" less useful).
This change manages the reset on `global_done` at the level of the
environment worker, and removes the global reset from
TrainerController.
* WIP precommit on top level
* update CI
* circleci fixes
* intentionally fail black
* use --show-diff-on-failure in CI
* fix command order
* rebreak a file
* apply black
* WIP enable mypy
* run mypy on each package
* fix trainer_metrics mypy errors
* more mypy errors
* more mypy
* Fix some partially typed functions
* types for take_action_outputs
* fix formatting
* cleanup
* generate stubs for proto objects
* fix ml-agents-env mypy errors
* disallow-incomplete-defs for gym-unity
* Add CI notes to CONTRIBUTING.md
At each step, an unused `last_reward` variable in the TF graph is
updated in our PPO trainer. There are also related unused methods
in various places in the codebase. This change removes them.
Previously in v0.8 we added parallel environments via the
SubprocessUnityEnvironment, which exposed the same abstraction as
UnityEnvironment while actually wrapping many parallel environments
via subprocesses.
Wrapping many environments with the same interface as a single
environment had some downsides, however:
* Ordering needed to be preserved for agents across different envs,
complicating the SubprocessEnvironment logic
* Asynchronous environments with steps taken out of sync with the
trainer aren't viable with the Environment abstraction
This PR introduces a new EnvManager abstraction which exposes a
reduced subset of the UnityEnvironment abstraction and a
SubprocessEnvManager implementation which replaces the
SubprocessUnityEnvironment.
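A rough sketch of what a reduced EnvManager interface might look like; the method names and return types here are assumptions for illustration, not the exact ML-Agents API.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, List, Optional

class EnvManager(ABC):
    """Reduced interface the trainer needs, regardless of how envs are run."""

    @abstractmethod
    def step(self) -> List[Any]:
        """Advance the managed environments and return the collected step results."""

    @abstractmethod
    def reset(self, config: Optional[Dict[str, float]] = None) -> List[Any]:
        """Reset the managed environments, optionally with reset parameters."""

    @abstractmethod
    def close(self) -> None:
        """Shut down the environments and any worker processes."""
```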
TrainerController depended on an external_brains dictionary with
brain params in its constructor but only used it in a single function
call. The same function call (start_learning) takes the environment
as an argument, which is the source of the external_brains.
This change removes the dependency of TrainerController on external
brains and removes the two class members related to external_brains
and retrieves the brains directly from the environment.
* Timer proof-of-concept
* micro optimizations
* add some timers
* cleanup, add asserts
* Cleanup (no start/end methods) and handle exceptions
* unit test and decorator
* move output code, add a decorator
* cleanup
* module docstring
* actually write the timings when done with training
* use __qualname__ instead
* add a few more timers
* fix mock import
* fix unit test
* don't need fwd reference
* cleanup root
* always write timers, add comments
* undo accidental change
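A hypothetical sketch of the decorator-based timing mentioned in this list; the real ML-Agents module and function names may differ.

```python
import functools
import time
from collections import defaultdict

_timings = defaultdict(float)  # accumulated seconds, keyed by qualified name

def timed(func):
    """Accumulate wall-clock time spent in `func`, keyed by its __qualname__."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            _timings[func.__qualname__] += time.perf_counter() - start
    return wrapper
```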
* Removes unused SubprocessEnvManager import in trainer_controller
* Removes unused `steps` argument to `TrainerController._save_model`
* Consolidates unnecessary branching for curricula in
`TrainerController.advance`
* Moves `reward_buffer` into `TFPolicy` from `PPOPolicy` and adds
`BCTrainer` support so that we don't have a broken interface /
undefined behavior when BCTrainer is used with curricula.
* Add Sampler and SamplerManager
* Enable resampling of reset parameters during training
* Documentation for Sampler and example YAML configuration file
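An illustrative sketch of the resampling idea, assuming a sampler is asked for a fresh value for each reset parameter whenever the environment resets; class and method names are assumptions, not the exact ML-Agents API.

```python
import random

class UniformSampler:
    """Draws a reset-parameter value uniformly from [min_value, max_value]."""
    def __init__(self, min_value: float, max_value: float):
        self.min_value = min_value
        self.max_value = max_value

    def sample_parameter(self) -> float:
        return random.uniform(self.min_value, self.max_value)

class SamplerManager:
    """Holds one sampler per reset parameter and produces a config per reset."""
    def __init__(self, samplers: dict):
        self.samplers = samplers

    def sample_all(self) -> dict:
        return {name: s.sample_parameter() for name, s in self.samplers.items()}

# e.g. vary the ball scale of 3DBall on every environment reset
manager = SamplerManager({"scale": UniformSampler(0.75, 3.0)})
reset_config = manager.sample_all()
```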
This fixes an issue where stopping the game when training in the Editor won't end training, due to the new asynchronous SubprocessEnvManager changes. Another minor change was made to move the `env_manager.close()` in TrainerController to the end of `start_learning` so that we are more likely to save the model if something goes wrong during the environment shutdown (this occurs sometimes on Windows machines).
This change moves trainer initialization outside of TrainerController,
reducing some of the constructor arguments of TrainerController and
setting up the ability for trainers to be initialized in the case where
a TrainerController isn't needed.
We have been ignoring unused imports and star imports via flake8. These are
both bad practice and grow over time without automated checking. This
commit attempts to fix all existing import errors and add back the corresponding
flake8 checks.
* Feature Deprecation : Online Behavioral Cloning
In this PR :
- Delete the online_bc_trainer
- Delete the tests for online bc
- delete the configuration file for online bc training
* Deleting the BCTeacherHelper.cs Script
TODO :
- Remove usages in the scene
- Documentation Edits
*DO NOT MERGE*
* IMPORTANT : REMOVED ALL IL SCENES
- Removed all the IL scenes from the Examples folder
* Removed all mentions of online BC training in the Documentation
* Made a note in the Migrating.md doc about the removal of the Online BC feature.
* Modified the Academy UI to remove the control checkbox and replaced it with a "train in the editor" checkbox
* Removed the Broadcast functionality from the non-Learning brains
* Bug fix
* Note that the scenes are broken since the BroadcastHub has changed
* Modified the LL-API for Python to remove the broadcasting functionality.
* All unit tests are running
* Modified the scen...
* [WIP] Side Channel initial layout
* Working prototype for raw bytes
* fixing format mistake
* Added some errors and some unit tests in C#
* Added the side channel for the Engine Configuration. (#2958)
* Added the side channel for the Engine Configuration.
Note that this change does not require modifying a lot of files:
- Adding a sender in Python
- Adding a receiver in C#
- Subscribing the receiver to the communicator (a one-liner in the Academy)
- Adding the side channel to the Python UnityEnvironment (not represented here)
Adding the side channel to the environment would look like this:
```python
from mlagents.envs.environment import UnityEnvironment
from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel
from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel
channel0 = RawBytesChannel()
channel1 = EngineConfigurationChannel()
```
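To complete the picture, a hedged sketch of passing the channels to the environment, assuming the UnityEnvironment constructor of this era accepts a side_channels list; the file name is a placeholder.

```python
# Assumed constructor keyword; "3DBall" is a placeholder for the built environment.
env = UnityEnvironment(file_name="3DBall", side_channels=[channel0, channel1])
```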
* added team id and identifier concat to behavior parameters
* splitting brain params into brain name and identifiers
* set team id in prefab
* receives brain_name and identifier on python side
* rebased with develop
* Correctly calls concatBehaviorIdentifiers
* trainer_controller expects name_behavior_ids
* add_policy and create_policy separated
* adjusting tests to expect trainer.add_policy to be called
* fixing tests
* fixed naming ...
Previously the Curriculum and MetaCurriculum classes required file / folder
paths for initialization. These methods loaded the configuration for the
curricula from the filesystem. Requiring files for configuring curricula
makes testing and updating our config format more difficult.
This change moves the file loading into static methods, so that Curricula /
MetaCurricula can be initialized from dictionaries only.
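A small sketch of the pattern described above, with the filesystem concern isolated in a static helper; the names and the JSON-config assumption are illustrative.

```python
import json

class Curriculum:
    def __init__(self, brain_name: str, config: dict):
        # construction now takes a plain dict, so tests can build one in memory
        self.brain_name = brain_name
        self.config = config

    @staticmethod
    def load_curriculum_file(location: str) -> dict:
        # the static method keeps file loading out of the constructor
        with open(location) as f:
            return json.load(f)

# from a file ...
curriculum = Curriculum("3DBall", Curriculum.load_curriculum_file("config/curricula/3DBall.json"))
# ... or directly from a dict in a unit test
test_curriculum = Curriculum("3DBall", {"measure": "progress", "thresholds": [0.5]})
```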
This PR makes it so that the env_manager only sends one current BrainInfo and the previous actions (if any) to the AgentManager. The list of agents was added to the ActionInfo and used appropriately.
This PR moves the AgentManagers from the TrainerController into the env_manager. This way, the TrainerController only needs to create the components (Trainers, AgentManagers) and call advance() on the EnvManager and the Trainers.
In the previous PR, steps were processed when the env manager was reset. This was an issue for the very first reset, where we don't actually know which agent groups (and AgentManagers) we needed to send the steps to. These steps were being thrown away.
This PR moves the processing of steps to advance(), so that the initial reset steps are simply processed on the next advance() call. This also removes the need for an additional block of code in TrainerController to handle the initial reset.
* [bug-fix] Increase height of wall in CrawlerStatic (#3650)
* [bug-fix] Improve performance for PPO with continuous actions (#3662)
* Corrected a typo in a name of a function (#3670)
OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document
* Add Academy.AutomaticSteppingEnabled to migration (#3666)
* Fix editor port in Dockerfile (#3674)
* Hotfix memory leak on Python (#3664)
* Hotfix memory leak on Python
* Fixing
* Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done
* [bug-fix] Make Python able to deal with 0-step episodes (#3671)
* adding some comments
Co-authored-by: Ervin T <ervin@unity3d.com>
* Remove vis_encode_type from list of required (#3677)
* Update changelog (#3678)
* Shorten timeout duration for environment close (#3679)
The timeout duration for closing an environment was set to the
same duration as the timeout when waiting ...
This commit surfaces exceptions from environment worker subprocesses,
and changes the SubprocessEnvManager to raise those exceptions when
caught. Additionally TrainerController was changed to treat environment
exceptions differently than KeyboardInterrupts. We now raise the
environment exceptions after exporting the model, so that ML-Agents will
correctly exit with a non-zero return code.
* Update Dockerfile
* Separate send environment data from reset (#4128)
* Fixed a typo on ML-Agents-Overview.md (#4130)
Removed a redundant "to" from the sentence, since it was probably a typo in the document.
* Updated the badge’s link to point to the newest doc version
* Updated all of the doc links to point to release_3_doc
* Fix 3DBall and 3DBallHard SAC regressions (#4132)
* Move memory validation to settings
* Update docs
* Add settings test
* Update to release_3 in installation.md (#4144)
* rename to SideChannelManager +backcompat (#4137)
* Remove comment about logo with --help (#4148)
* [bugfix] Make FoodCollector heuristic playable (#4147)
* Make FoodCollector heuristic playable
* Update changelog
* script to check for old release links and references (#4153)
* Remove package validation suite from Project (#4146)
* RayPerceptionSensor: handle empty and invalid tags (#4155...
* Introduced the Constant Parameter Sampler, which will be useful later since samplers and floats can be used interchangeably
* Refactored settings.py to reflect the new format of the config.yaml
* First working version
* Added the unit tests
* Update to Upgrade for Updates
* fixing the tests
* Upgraded the config files
* Fixes
* Additional error catching
* addressing some comments
* Making the code nicer with cattr
* Added and registered an unstructure hook for ParameterRandomization
* Updating C# Walljump
* Adding comments
* Add test for settings export (#4164)
* Add test for settings export
* Update ml-agents/mlagents/trainers/tests/test_settings.py
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
* Including environment parameters for the test for settings export
* First documentation up...
This change adds an export to .nn for each checkpoint generated by
RLTrainer and adds a NNCheckpointManager to track the generated
checkpoints and final model in training_status.json.
Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>
* Add torch_utils
* Use torch from torch_utils
* Add torch to banned modules in CI
* Better import error handling
* Fix flake8 errors
* Address comments
* Move networks to GPU if enabled
* Switch to torch_utils
* More flake8 problems
* Move reward providers to GPU/CPU
* Remove another set default tensor
* Fix banned import in test
* Moved components to the tf folder and moved the TrainerFactory to the `trainer` folder
* Addressing comments
* Editing the migrating doc
* fixing test
* Torch setup.py
* Set torch to default
* Make torch default in setup.py
* Remove indents
* Remove other instances of TF being used
* Add tensorboard to setup.py
* Adding correct setup commands for verifying torch is installed (#4524)
* Adding correct setup commands for verifying torch is installed
* Editing the test_requirements to add tf and remove torch
* Develop torchdefault raise outside setup (#4530)
* Torch not imported error to raise at first usage
* Torch not imported error to raise at first usage
* [refactor] Use PyTorch TensorBoard utils (#4518)
* Convert stats writer to use PyTorch TB support
* Use common function to print params
* Update test
* Bump tensorboard to 1.15 to fix the tests
* putting tensorboard 1.15.0 as min version requirement
Co-authored-by: vincentpierre <vincentpierre@unity3d.com>
* [Docs] Initial documentation changes for making...