ml-agents

作者	SHA1	备注	提交日期
Deric Pang	634280a6	Fixed imports, all tests are passing.	6 年前
Deric Pang	cdb41480	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
GitHub	3900ed66	Merge pull request #1083 from Unity-Technologies/develop-flat-code-restructure ML-Agents Code Restructure	6 年前
GitHub	fbf92810	Refactor Trainers to use Policy (#1098 )	6 年前
GitHub	10d2a19d	Release v0.5 (Develop) (#1203 )	6 年前
GitHub	f8df71a0	Revert "Release v0.5 (Develop) (#1203 )" (#1222 ) This reverts commit 448aac65dc891bad04a23a02d275f6a1d2704e1e.	6 年前
GitHub	29084e77	Curriculum learning reward thresholding bug fix (#1141 )	6 年前
GitHub	a54714f8	Update API to version 5 (#1179 )	6 年前
GitHub	560f1bd7	Merge pull request #1224 from Unity-Technologies/release-v0.5 Release v0.5	6 年前
GitHub	d2c320dd	Remove graph scope (#1205 ) * initial commit : Only works with PPO balance ball * Fix for recurrent * [Fix indentation error] * Fixed BC * Remove Dead code * Addressing comment : Removing dead code * Fixing the Pytest * edited comments * Removing GraphScope from the InternalBrain (#1227) * Documentation changes for removing graph scope (#1226) * Documentation changes * removed the keep checkpoint printing	6 年前
GitHub	3c9603d6	Demonstration Recorder (#1240 )	6 年前
GitHub	2b6b4570	Fix the Python Tests (#1327 )	6 年前
GitHub	547f0e98	Merge pull request #1361 from Unity-Technologies/release-v0.6 Merge Release v0.6 into develop	6 年前
vincentpierre	99aaa15e	made the pytest directory agnostic	6 年前
GitHub	b946047a	Merge pull request #1470 from Unity-Technologies/release-v0.6-make-test-directory-agnostic made the pytest directory agnostic	6 年前
GitHub	c8cc5a29	Merge pull request #1495 from Unity-Technologies/release-v0.6 release-v0.6 --> develop	6 年前
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 年前
GitHub	517e3a0a	Remove env creation logic from TrainerController (#1562 ) * Remove env creation logic from TrainerController Currently TrainerController includes logic related to creating the UnityEnvironment, which causes poor separation of concerns between the learn.py application script, TrainerController and UnityEnvironment: * TrainerController must know about the proper way to instantiate the UnityEnvironment, which may differ from application to application. This also makes mocking or subclassing UnityEnvironment more difficult. * Many arguments are passed by learn.py to TrainerController and passed along to UnityEnvironment. This change moves environment construction logic into learn.py, as part of the greater refactor to separate trainer logic from actor / environment.	6 年前
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557 ) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 年前
GitHub	cd087609	added the pypiwin32 package (#1668 ) * added the pypiwin32 package * fixed the break on mac, fixed part of pytest above version 4 * added something to the windows to help unstuck people * resolved the comment	6 年前
GitHub	c258b1c3	Move 'take_action' into Policy class (#1669 ) * Move 'take_action' into Policy class This refactor is part of Actor-Trainer separation. Since policies will be distributed across actors in separate processes which share a single trainer, taking an action should be the responsibility of the policy. This change makes a few smaller changes: * Combines `take_action` logic between trainers, making it more generic * Adds an `ActionInfo` data class to be more explicit about the data returned by the policy, only used by TrainerController and policy for now. * Moves trainer stats logic out of `take_action` and into `add_experiences` * Renames 'take_action' to 'get_action'	6 年前
GitHub	275ff5d6	Merge pull request #1764 from Unity-Technologies/release-v0.7 Release v0.7 into master	6 年前
Ervin T	b30f4c90	Split `mlagents` into two packages (#1812 ) * Reogranize project * Fix all tests * Address comments * Delete init file * Update requirements * Tick version * Add timeout wait parameter (mlagents_envs) (#1699) * Add timeout wait param * Remove unnecessary function * Add new meta files for communicator objects * Fix all tests * update circleci * Reorganize mlagents_envs tests * WIP: test removing circleci cache * Move gym tests * Namespaced packages * Update installation instructions for separate packages * Remove unused package from setup script * Add Readme for ml-agents-envs * Clarify docs and re-comment compiler in make.bat * Add more doc to installation * Add back fix for Hololens * Recompile Protobufs * Change mlagents_envs to mlagents.envs in trainer_controller * Remove extraneous files, fix win bat script * Support Python 3.7 for envs package	6 年前
eshvk	cc9bdf17	Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return	6 年前
eshvk	fb04c40c	Reorganize to make metrics collection more accurate	6 年前
GitHub	a0b44f1b	Merge pull request #1858 from Unity-Technologies/develop-esh-metrics Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return per policy	6 年前
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751 ) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 年前
Jonathan Harper	7a0d1531	Fix subprocess model saving on Windows On Windows the interrupt for subprocesses works in a different way from OSX/Linux. The result is that child subprocesses and their pipes may close while the parent process is still running during a keyboard (ctrl+C) interrupt. To handle this, this change adds handling for EOFError and BrokenPipeError exceptions when interacting with subprocess environments. Additional management is also added to be sure when using parallel runs using the "num-runs" option that the threads for each run are joined and KeyboardInterrupts are handled. These changes made the "_win_handler" we used to specially manage interrupts on Windows unnecessary, so they have been removed.	6 年前
Jonathan Harper	e91e847c	Fix '--slow' flag after environment updates A change was made to the way the "train_mode" flag was used by environments when SubprocessUnityEnvironment was added which was intended to be part of a separate change set. This broke the CLI '--slow' flag. This change undoes those changes, so that the slow / fast simulation option works correctly. As a minor additional change, the remaining tests from top level 'tests' folders have been moved into the new test folders.	6 年前
GitHub	c613df3a	Merge pull request #1922 from Unity-Technologies/release-v08-slowflag Fix '--slow' flag after environment updates	6 年前
GitHub	2d1bda57	Merge pull request #1931 from Unity-Technologies/release-v0.8 Release v0.8	6 年前
GitHub	ba57eaad	Merge pull request #1932 from Unity-Technologies/release-v0.8 Release v0.8	6 年前
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black ). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 年前
GitHub	70d14910	Merge pull request #1934 from Unity-Technologies/develop-black Black formatting	6 年前
Jonathan Harper	d9a7e5b6	Fix failure on Academy Done() with parallel envs When using parallel SubprocessUnityEnvironment instances along with Academy Done(), a new step might be taken when reset should have been called because some environments may have been done while others were not (making "global done" less useful). This change manages the reset on `global_done` at the level of the environment worker, and removes the global reset from TrainerController.	6 年前
GitHub	e916dc48	use yaml.safe_load instead of yaml.load (#2124 )	6 年前
GitHub	d5f6b7f8	Merge pull request #2157 from Unity-Technologies/release-v0.8.2 Release v0.8.2	6 年前
GitHub	2671e1a0	Enable mypy in precommit checks (#2177 ) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md	6 年前
GitHub	40c7fc48	Merge branch 'develop' into protobuf_update	6 年前
GitHub	4ac79742	Refactor reward signals into separate class (#2144 ) * Create new class (RewardSignal) that represents a reward signal. * Add value heads for each reward signal in the PPO model. * Make summaries agnostic to the type of reward signals, and log weighted rewards per reward signal. * Move extrinsic and curiosity rewards into this new structure. * Allow defining multiple reward signals in YAML file. Add documentation for this new structure.	5 年前
Jonathan Harper	177ee5b8	Remove unused "last reward" logic, TF nodes At each step, an unused `last_reward` variable in the TF graph is updated in our PPO trainer. There are also related unused methods in various places in the codebase. This change removes them.	5 年前
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209 ) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 年前
GitHub	966d8efb	Remove "external_brains" arg for TrainerController (#2213 ) TrainerController depended on an external_brains dictionary with brain params in its constructor but only used it in a single function call. The same function call (start_learning) takes the environment as an argument, which is the source of the external_brains. This change removes the dependency of TrainerController on external brains and removes the two class members related to external_brains and retrieves the brains directly from the environment.	5 年前
GitHub	9c50abcf	GAIL and Pretraining (#2118 ) Based on the new reward signals architecture, add BC pretrainer and GAIL for PPO. Main changes: - A new GAILRewardSignal and GAILModel for GAIL/VAIL - A BCModule component (not a reward signal) to do pretraining during RL - Documentation for both of these - Change to Demo Loader that lets you load multiple demo files in a folder - Example Demo files for all of our tested sample environments (for future regression testing)	5 年前
GitHub	a5b7cf95	Fix get_value_estimate and buffer append (#2276 ) Fixes shuffling issue with newer versions of numpy (#1798). * make get_value_estimates output a dict of floats * Use np.append instead of convert to list, unconvert * Add type hints and test for get_value_estimates	5 年前
Chris Elion	5d07ca1f	Merge remote-tracking branch 'origin/develop' into enable-flake8	5 年前
GitHub	be4292fb	Add different types of visual encoder (nature cnn/resnet) Add resnet and nature cnn in addition to default visual encoder	5 年前
GitHub	19283bfa	Very simple environment for testing (#2266 ) * WIP doesn't crash * return stats and assert convergence * pass lint checks * rename * fix-reset-params * add time penalty * _get_measure_vals always returns something * fix tests * unused import * single env, fix double step * move LocalEnvManager to ml-agents-envs * move and rename EnvManager * remove obsolete docstring and method * clean up	5 年前
GitHub	6a212f73	Improvements for GAIL (#2296 ) * Don't 0 value bootstrap for GAIL and Curiosity * Add gradient penalties to GAN to help with stability * Add gail_config.yaml with GAIL examples * Cleaned up trainer_config.yaml and unnecessary gammas * Documentation updates * Code cleanup	5 年前
GitHub	9eb3f049	Cleanup unused code in TrainerController (#2315 ) * Removes unused SubprocessEnvManager import in trainer_controller * Removes unused `steps` argument to `TrainerController._save_model` * Consolidates unnecessary branching for curricula in `TrainerController.advance` * Moves `reward_buffer` into `TFPolicy` from `PPOPolicy` and adds `BCTrainer` support so that we don't have a broken interface / undefined behavior when BCTrainer is used with curricula.	5 年前
GitHub	6225317d	refactor vis_encoder_type and add to doc refactor vis_encoder_type and add to doc	5 年前
Ervin T	a46f3faa	Enable generalization training (#2232 ) * Add Sampler and SamplerManager * Enable resampling of reset parameters during training * Documentation for Sampler and example YAML configuration file	5 年前
GitHub	9178b5d2	Improve test_simple.py and check discrete actions (#2345 ) * discrete action coverage * undo change * rename test * move test file * Revert "move test file" This reverts commit 2e72b2dbf9ce9163c92066036b06591dc4173e5c. * move files post merge	5 年前
GitHub	78c0c202	fix mock_brain (#2377 ) fix mock_brain	5 年前
GitHub	53475207	Merge pull request #2380 from Unity-Technologies/release-0.9.0 Release v0.9.0	5 年前
GitHub	b498c19d	Fix BCTrainer increment_steps (#2384 )	5 年前
GitHub	a9fe719c	Add Multi-GPU implementation for PPO (#2288 ) Add MultiGpuPPOPolicy class and command line options to run multi-GPU training	5 年前
GitHub	d7ebaae1	Return list instead of np array for make_mini_batch() (#2371 ) Return list instead of np array for make_mini_batch() to reduce time copying data	5 年前
GitHub	30930383	Move trainer initialization into a utility function (#2412 ) This change moves trainer initialization outside of TrainerController, reducing some of the constructor arguments of TrainerController and setting up the ability for trainers to be initialized in the case where a TrainerController isn't needed.	5 年前
GitHub	7b69bd14	Refactor Trainer and Model (#2360 ) - Move common functions to trainer.py, model.pyfromppo/trainer.py, ppo/policy.pyandppo/model.py' - Introduce RLTrainer class and move most of add_experiences and some common reward signal code there. PPO and SAC will inherit from this, not so much BC Trainer. - Add methods to Buffer to enable sampling, truncating, and save/loading. - Add scoping to create encoders in model.py	5 年前
GitHub	afb6ede5	Merge pull request #2393 from Unity-Technologies/hotfix-v0.9.0a - Fix issue with BC Trainer `increment_steps`. - Fix issue with Demonstration Recorder and visual observations (memory leak fix was deleting vis obs too early). - Make Samplers sample from the same random seed every time, so generalization runs are repeatable. - Fix crash when using GAIL, Curiosity, and visual observations together.	5 年前
Ervin Teng	072d2ef8	Merge latest develop	5 年前
GitHub	4472838e	Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1 Hotfix v0.9.1 - develop	5 年前
GitHub	bd7eb286	Update reward signals in parallel with policy (#2362 )	5 年前
GitHub	689765d6	Modification of reward signals and rl_trainer for SAC (#2433 ) * Adds evaluate_batch to reward signals. Evaluates on minibatch rather than on BrainInfo. * Changes the way reward signal results are reported in rl_trainer so that we get the pure, unprocessed environment reward separate from the reward signals. * Moves end_episode to rl_trainer * Fixed bug with BCModule with RNN	5 年前
GitHub	43696d60	Fix bug in add_rewards_output and add test (#2442 )	5 年前
GitHub	0a163871	Merge pull request #2469 from Unity-Technologies/release-0.9.2 Release 0.9.2	5 年前
GitHub	b73fa378	Add more extensive tests for BC trainer (#2506 ) * Add more extensive tests for BC trainer * Break up tests for BC trainer	5 年前
GitHub	dc3ab81a	Merge pull request #2514 from Unity-Technologies/hotfix-0.9.3 Hotfix 0.9.3	5 年前
Ervin Teng	e0da93d1	Fix bug with construct_curr_info and test	5 年前
Ervin Teng	aca81efb	Add more tests	5 年前
Ervin Teng	28ef8983	Add 2 visual obs test	5 年前
GitHub	4bb97e25	Fix bug with construct_curr_info (#2490 ) * Fix bug with construct_curr_info * Add more tests	5 年前
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341 ) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 年前
Jonathan Harper	2f083c8a	Renamed "StepInfo" to "EnvironmentStep" This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.	5 年前
GitHub	7ec3d7ad	Merge pull request #2516 from Unity-Technologies/master Merege 0.9.3 changes to develop	5 年前
GitHub	6f67cf40	unit test - don't use global random generator (#2521 ) * unit test - don't use global random generator * Update test_simple_rl.py	5 年前
GitHub	9e2c30ee	Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520 )	5 年前
GitHub	0390c78b	Fix determinism in unit test (#2530 ) * initialize random instance correctly * restore threshold (I hope)	5 年前
GitHub	3df585d9	Fix issue where SAC encoder type is always simple (#2548 )	5 年前
GitHub	babe9e2f	Develop remove academy done (#2519 ) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 年前
GitHub	67d754c5	Fix flake8 import warnings (#2584 ) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 年前
GitHub	0d48a352	Use argparse for arg parsing (#2586 ) * encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow	5 年前
GitHub	d64a01e1	Added option to use environment arguments in learn (#2594 ) * Added option to use environment arguments in learn * hook into argparse * add example to readme	5 年前
GitHub	149ebd67	Fix crash with VAIL + GAIL (#2598 )	5 年前
GitHub	473a8758	Develop yaml json loading errors (#2601 ) * WIP cleanup loading * better exceptions for parser errors - refer to online lint tools * feedback - rename variable	5 年前
GitHub	2f74b3cc	Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646 )	5 年前
GitHub	b2fa2268	Merge pull request #2648 from Unity-Technologies/release-0.10.0 Release 0.10.0	5 年前
GitHub	8e931d8d	Merge branch 'develop' into release-0.10.0	5 年前
Ervin Teng	094cbe4d	Fix bug when batch size is a non-multiple of sequence length (#2661 )	5 年前
Anupam Bhatnagar	cc208c00	resolving conflicts	5 年前
Ervin Teng	e826f4bb	Bugfix for LSTM+BC (#2679 ) * Fix LSTM+BC in discrete case * Add test for Barracuda export * Fix LSTM training for BC	5 年前
GitHub	68965c7b	Use a class for camera res, not dict (#2656 )	5 年前
Ervin Teng	df44ee8d	Fix crash in trainer tests (trainer_metrics)	5 年前
GitHub	5f5ccfa0	Feature Deprecation : Online Behavioral Cloning (#2659 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature.	5 年前
GitHub	b2a2047e	Fix bug when batch size is a non-multiple of sequence length (#2661 )	5 年前
Chris Elion	43e23941	rough pass at tf2 support, needs cleanup	5 年前
Chris Elion	806c77e4	centralize tensorflow imports	5 年前
Ervin Teng	bd5b3c7d	Revert "Fix crash in trainer tests (trainer_metrics)" This reverts commit e7a4270db7dc14c35034b6e28f780bbe7d6ac6e3.	5 年前
GitHub	24ba9d58	Develop deprecate broadcasting (#2669 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modifie...	5 年前
GitHub	f22c41db	Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 Merge Hotfix 0.10.1	5 年前
GitHub	e6240c7a	Bugfix for LSTM+BC (#2679 ) * Fix LSTM+BC in discrete case * Add test for Barracuda export * Fix LSTM training for BC	5 年前
Anupam Bhatnagar	b733b34c	resolving conflicts	5 年前
Chris Elion	a1967c19	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
Chris Elion	254c7d86	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
GitHub	5d3e05d1	Fix "memory leak" during inference (#2722 ) * Clear buffer if not training * Add tests	5 年前
GitHub	619465e1	Fix crash when SAC is used with Curiosity and Continuous Actions (#2740 ) * Add test for curiosity + SAC * Use actions for all curiosity (need to test on PPO) * Fix issue with reward signals updating multiple times * Put curiosity actions in the right placeholder * Test PPO curiosity update	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前
Chris Elion	3d8a70fb	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
GitHub	0fe5adc2	Develop remove memories (#2795 ) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 年前
GitHub	495873e5	Merge pull request #2833 from Unity-Technologies/release-0.11.0 Release 0.11.0	5 年前
Chris Elion	691d21e6	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
Jonathan Harper	8550679d	Merge branch 'develop' into release-0.11.0	5 年前
GitHub	d39b1881	speed up unit test (#2847 )	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	e6f549dc	[MLA-12] update protobuf for vector observations (#2862 )	5 年前
Chris Elion	fca51de8	Merge remote-tracking branch 'origin/develop' into try-tf2-support	5 年前
Chris Elion	73a346cb	cleanup	5 年前
GitHub	f57b7ac6	Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665 )	5 年前
Ervin Teng	987e0e3a	Merge tf2 branch	5 年前
Andrew Cohen	13fe9cf8	Bubbled up indexing of AllBrainInfo to trainer controller from trainers	5 年前
Andrew Cohen	b11f04ea	Fixed test code by creating brain_name variable instead of hardcoding	5 年前
GitHub	c0453ae1	Merge pull request #2912 from Unity-Technologies/develop-allbraininfo Bubbled up indexing of AllBrainInfo to trainer controller from trainers	5 年前
GitHub	69d1a033	Develop remove past action communication (#2913 ) * Modifying the .proto files * attempt 1 at refactoring Python * works for ppo hallway * changing the documentation * now works with both sac and ppo both training and inference * Ned to fix the tests * TODOs : - Fix the demonstration recorder - Fix the demonstration loader - verify the intrinsic reward signals work - Fix the tests on Python - Fix the C# tests * Regenerating the protos * fix proto typo * protos and modifying the C# demo recorder * modified the demo loader * Demos are loading * IMPORTANT : THESE ARE THE FILES USED FOR CONVERSION FROM OLD TO NEW FORMAT * Modified all the demo files * Fixing all the tests * fixing ci * addressing comments * removing reference to memories in the ll-api	5 年前
Ervin Teng	54644477	Merge branch 'develop' of github.com:Unity-Technologies/ml-agents into develop-nomaxstep-test	5 年前
Ervin Teng	df5ee7bf	Split buffer into two buffers (PPO works)	5 年前
GitHub	a2194ea7	Fix batch size issue with BC (#2965 )	5 年前
GitHub	2c7e6d51	Fix bug where constant LR in pretraining will throw TF error (#2977 )	5 年前
Ervin Teng	9053610f	Fix buffer tests and truncate	5 年前
GitHub	b5eb34dc	Fix batch size issue with BC (#2965 ) (#2966 )	5 年前
Ervin Teng	29cdf77a	Fix RL tests	5 年前
Ervin Teng	a80b47d1	Fix demo loader and remaining tests	5 年前
Ervin Teng	3a4fa244	Switch to tanh squash in PPO	5 年前
GitHub	b1dc1015	Fix bug where constant LR in pretraining will throw TF error (#2978 )	5 年前
Ervin Teng	fd0647a6	Rename append_update_buffer to append_to_update_buffer	5 年前
Ervin Teng	73000a6b	Merge branch 'develop' into develop-splitbuffer	5 年前
GitHub	d4780a55	Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master Merge Release 0.12.0 to master	5 年前
GitHub	652488d9	check for numpy float64 (#2948 )	5 年前
GitHub	213cd68d	Split Buffer into processing and update buffers (#2964 ) This is the first in a series of PRs that intend to move the agent processing logic (add_experiences and process_experiences) out of the trainer and into a separate class. The plan is to do so in steps: - Split the processing buffers (keeping track of agent trajectories and assembling trajectories) and update buffer (complete trajectories to be used for training) within the Trainer (this PR) - Move the processing buffer and add/process experiences into a separate, outside class - Change the data type of the update buffer to be a Trajectory - Place and read Trajectories from queues, add subscription mechanism for both AgentProcessor and Trainers	5 年前
Ervin Teng	34f9577c	Merge branch 'develop' into develop-agentprocessor	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	35c995e9	Merge pull request #3038 from Unity-Technologies/develop Merge develop to master	5 年前
Ervin Teng	eb4a04a5	Merge branch 'master' into develop-tanhsquash	5 年前
GitHub	3b4b0d55	Remove random normal epsilon (#3039 )	5 年前
GitHub	e7bf6fff	Close environment if step raises an exception. (#3043 ) * close env manager in finally * rename to env_manager * remove obsolete mock checks	5 年前
GitHub	a6df9f43	Develop new ll api (#3022 ) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 年前
Ervin Teng	88b1123a	Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor	5 年前
Andrew Cohen	ef2dfd4c	adjusting tests to expect trainer.add_policy to be called	5 年前
GitHub	36048cb6	Moving Env Manager to Trainers (#3062 ) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. * Moving Env Manager to Trainers * fix pylint madness	5 年前
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049 )	5 年前
Ervin Teng	336ca456	Kill the ProcessingBuffer	5 年前
GitHub	90db165f	Add --namespace-packages to mypy for mlagents (#3075 )	5 年前
GitHub	1fa07edb	Remove Standalone Offline BC Training (#2969 )	5 年前
GitHub	8ca0d810	Better error handling if trainer config doesn't contain "default" section (#3063 )	5 年前
Ervin Teng	62d609f8	Fix some of the tests	5 年前
GitHub	2c3794a6	handle mismatch between brain and metacurriculum (#3034 ) * handle mismatch between brain and metacur * add unit tests * use os.path.splitext in metacurriculum * fix type	5 年前
Ervin Teng	3449b551	Add test for trajectory	5 年前
Chris Elion	fdc810ff	move (first pass)	5 年前
Ervin Teng	38ff674e	Fix BC and tests	5 年前
GitHub	58b6c7c2	Rename mlagents.envs to mlagents_envs (#3083 )	5 年前
Chris Elion	f5e6b0ed	more cleanup	5 年前
Ervin Teng	27c2a55b	Lots of test fixes	5 年前
Ervin Teng	97d66e71	Remove BootstrapExperience	5 年前
Ervin Teng	324d217b	Move agent_id to Trajectory	5 年前
Jonathan Harper	9f166f9e	Update tests to support pytest 5.x Our tests were using pytest fixtures by actually calling the fixture methods, but in newer 5.x versions of pytest this causes test failures. The recommended method for using fixtures is dependency injection. This change updates the relevant test fixtures to either not use `pytest.fixture` or to use dependency injection to pass the fixture. The version range requirements in `test_requirements.txt` were also updated accordingly.	5 年前
Ervin Teng	43c0acfb	Fix test again	5 年前
Ervin Teng	83126bb2	Fix PPO value tests	5 年前
GitHub	9f522176	Merge pull request #3097 from Unity-Technologies/develop-pytest5 Update tests to support pytest 5.x	5 年前
Andrew Cohen	70357569	adjusting tests to expect trainer.add_policy to be called	5 年前
Ervin Teng	77aea4cd	Fix np float32 errors	5 年前
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067 )	5 年前
Ervin Teng	c330f6f6	Merge branch 'master' into develop-agentprocessor	5 年前
GitHub	3de3c1f1	check min size for visual encoders (#3112 ) * check min size for visual encoders * friendlier exception * fix typo	5 年前
Andrew Cohen	de902fbb	passes all pytest and C# tests	5 年前
Ervin Teng	47f8fa7a	Fix some import errors	5 年前
GitHub	2ac242f7	Remove TrainerMetrics and add CSVWriter using new StatsWriter API (#3108 )	5 年前
Ervin Teng	fdf9aea7	Make conversion methods part of NamedTuples	5 年前
GitHub	0b5b1b01	Develop magic string + trajectory (#3122 ) * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * trainer_controller expects name_behavior_ids * add_policy and create_policy separated * adjusting tests to expect trainer.add_policy to be called * fixing tests * fixed naming ...	5 年前
Andrew Cohen	082789ea	Merge branch 'master' into develop-magic-string	5 年前
Ervin Teng	1bd791e5	Merge branch 'master' into develop-agentprocessor	5 年前
Ervin Teng	abc8ca9a	Fix tests	5 年前
GitHub	7fbf6b1d	add flake8-bugbear (#3137 ) * unused loop variables * change loop variable	5 年前
GitHub	0d56f6ba	Merge branch 'master' into develop-magic-string	5 年前
Andrew Cohen	b28b3835	fixed default trainer_util test to expect brain_name	5 年前
Andrew Cohen	654b0c79	Merge branch 'master' into develop-magic-string	5 年前
GitHub	c6152459	Allow curricula to be created without files (#3145 ) Previously the Curriculum and MetaCurriculum classes required file / folder paths for initialization. These methods loaded the configuration for the curricula from the filesystem. Requiring files for configuring curricula makes testing and updating our config format more difficult. This change moves the file loading into static methods, so that Curricula / MetaCurricula can be initialized from dictionaries only.	5 年前
GitHub	bec2e8f0	Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113 )	5 年前
Ervin Teng	db743971	Move private methods out of trainer, simplify interface	5 年前
Andrew Cohen	c8514c18	Merge branch 'master' into develop-magic-string	5 年前
GitHub	45010af3	Add stats reporter class and re-enable missing stats (#3076 )	5 年前
Ervin Teng	48793ec1	Fix test	5 年前
Ervin Teng	3d25f9d2	Merge branch 'master' into develop-agentprocessor	5 年前
Jonathan Harper	481e0842	Remove the --num-runs option The "num-runs" command-line option provides the ability to run multiple identically-configured training runs in separate processes by running mlagents-learn only once. This is a rarely used ML-Agents feature, but it adds complexity to other parts of the system by adding the need to support multiprocessing and managing of ports for the parallel training runs. It also doesn't provide truly reproducible experiments, since there is no guarantee of resource isolation between the trials. This commit removes the --num-runs option, with the idea that users will manage parallel or sequential runs of the same experiment themselves in the future.	5 年前
GitHub	29c91b14	update flake8 plugin version and fix warnings (#3180 )	5 年前
Ervin Teng	ce75b378	update flake8 plugin version and fix warnings (#3180 )	5 年前
GitHub	d985dded	Merge branch 'master' into merge-release-0.13.0	5 年前
Ervin Teng	c48ddcf2	Fix pre-commit error	5 年前
GitHub	b0a2a54f	Add 'run-experiment' script, simpler curriculum config (#3186 ) This change adds a new 'mlagents-run-experiment' endpoint which accepts a single YAML/JSON file providing all of the information that mlagents-learn accepts via command-line arguments and file inputs. As part of this change the curriculum configuration is simplified to accept only a single file for all the curricula in an environment rather than a file for each behavior.	5 年前
Yuan Gao	0817c44b	Moved the demo files	5 年前
GitHub	b3d3a9d6	Merge pull request #3202 from Unity-Technologies/develop-move-demo Move the demo files into corresponding example/[env_name]/Demos/ folder	5 年前
Ervin Teng	98ed88b1	Merge branch 'master' into develop-separatevalue	5 年前
GitHub	4c241a80	Only send previous action and current BrainInfo (#3187 ) This PR makes it so that the env_manager only sends one current BrainInfo and the previous actions (if any) to the AgentManager. The list of agents was added to the ActionInfo and used appropriately.	5 年前
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207 )	5 年前
GitHub	a64e7850	Fix issue with BatchedStepResult with no agents (#3240 )	5 年前
Chris Elion	45e6e53c	Refactor file logic in demo_loader and add unit tests. (#3241 )	5 年前
Ervin Teng	29f3330f	Merge master into hotfix-0.13.1	5 年前
Ervin Teng	e83276f6	Fix PPO test	5 年前
GitHub	d52fb483	Merge pull request #3264 from Unity-Technologies/hotfix-0.13.1 Merge hotfix 0.13.1 into master	5 年前
GitHub	ca96b293	Move advance() logic for environment manager out of trainer_controller (#3234 ) This PR moves the AgentManagers from the TrainerController into the env_manager. This way, the TrainerController only needs to create the components (Trainers, AgentManagers) and call advance() on the EnvManager and the Trainers.	5 年前
Ervin Teng	9ad99eb6	Combined model and policy for PPO	5 年前
GitHub	329b23e0	Fix extra summary being written when loading from checkpoint (#3272 ) * Load next summary properly * Add tests for add_policy and get_policy	5 年前
Ervin Teng	164732a9	Move optimizer creation to Trainer, fix some of the reward signals	5 年前
Ervin Teng	151e3b1c	Move policy to common location, remove epsilon	5 年前
GitHub	14193ada	Self-play for symmetric games (#3194 )	5 年前
GitHub	0ff8f9af	Create ML-Agents Package (#3267 ) Convert the UnitySDK to a Packman Package. - Separate Examples into a sample project. - Move core UnitySDK Code into com.unity.ml-agents. - Create asmdefs for the ml-agents package. - Add package validation tests for win/linux/max. - Update protobuf generation scripts. - Add Barracuda as a package dependency for ML-Agents. (users no longer have to install it themselves).	5 年前
Ervin Teng	db249ceb	Merge branch 'master' into develop-splitpolicyoptimizer	5 年前
Ervin Teng	cfc2f455	Fix BC and tests	5 年前
Ervin Teng	78671383	Move initialization call around	5 年前
Ervin Teng	cadf6603	Fix SAC CC and some reward signal tests	5 年前
Ervin Teng	aec5fcc0	Fix policy tests	5 年前
Ervin Teng	dc43b0c6	Add test for NN policy	5 年前
Ervin Teng	d02bfbd4	Remove PPO policy tests	5 年前
Ervin Teng	1c4f60d4	remove more PPO tests	5 年前
Ervin Teng	48b39b80	Fix ghost trainer and all tests	5 年前
Ervin Teng	7b0f700b	Add test for deletion calls	5 年前
Ervin Teng	f64bdc4b	Fix SAC RNN test	5 年前
GitHub	3939ca52	Change AgentProcessor logic to fix memory leak (#3383 )	5 年前
Ervin Teng	00017bab	Temporarily remove multi-GPU	5 年前
Ervin Teng	faa9c702	Fix one more test for multi_gpu	5 年前
Ervin Teng	c68b5643	Remove multi_gpu from learn test	5 年前
GitHub	1f9d04f2	Fix clear update buffer when trainer stops training, add test (#3422 ) * Fix clear update buffer when trainer stops training, add test * Fix buffer changing types when truncated	5 年前
GitHub	f20a27e0	Clear agent processor properly on episode reset (#3437 )	5 年前
Anupam Bhatnagar	c70d0243	[bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451 )	5 年前
GitHub	9a4b151c	Merge pull request #3441 from Unity-Technologies/master-into-release-0.14.0 Master into release 0.14.0 copy	5 年前
Ervin Teng	5ef902bf	Merge branch 'master' into develop-splitpolicyoptimizer	5 年前
GitHub	6876a1d6	[bug-fix] Empty ignored trajectory queues, make sure queues don't overflow (#3451 )	5 年前
GitHub	423e8d80	Update the test demo (#3466 )	5 年前
GitHub	be14dd42	Make the timer output format consistent (#3472 )	5 年前
Andrew Cohen	e4d776c3	Merge branch 'master' into soccer-fives	5 年前
Ervin Teng	bcc25d59	Merge branch 'master' into develop-splitpolicyoptimizer	5 年前
GitHub	472f9f0e	Merge branch 'master' into develop-badEnvReturnCode	5 年前
Alphonso Crawford	802593a2	Adding test for bad env_path on create_environment_factory	5 年前
Alphonso Crawford	26d44958	Update test_bad_env_path	5 年前
Alphonso Crawford	1a7f9ad0	change test_learn.py	5 年前
Ervin Teng	847725f1	extend meta curriculum test steps	5 年前
GitHub	24145c22	Merge pull request #3438 from Unity-Technologies/develop-badEnvReturnCode Raise Exception if path does not exist [Bug Fix]	5 年前
GitHub	c145e75b	Split Policy and Optimizer, common Policy for PPO and SAC (#3345 )	5 年前
Andrew Cohen	5b0aca29	Merge branch 'master' into soccer-fives	5 年前
Ervin Teng	14f2a7f2	Rename LearningModel to ModelUtils	5 年前
Ervin Teng	1156b9b3	Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder	5 年前
Ervin Teng	d57124b4	Merge 'master' into develop-removeactionholder	5 年前
Ervin Teng	d680ed32	Fix metacurriculum test (for good)	5 年前
Anupam Bhatnagar	e04fcd71	Merge branch 'master' into master-into-release-0.14.1	5 年前
Ervin Teng	d10d27e2	Merge commit '9450d3fc0dda4547a14c5ed1b7e13fc6e3a15413' into develop-nopreviousactions	5 年前
GitHub	30a196eb	Fix metacurriculum test (for good) (#3511 )	5 年前
Andrew Cohen	de73baa9	Merge branch 'master' into soccer-fives	5 年前
GitHub	b2cc1c25	[bug-fix] Fix continuous LSTMs and add test (#3521 )	5 年前
GitHub	7d954797	[change] Separate action outputs into OutputDistributions object (#3514 )	5 年前
GitHub	f469cbb0	Simple1DEnv refactor and additional ghost trainer tests (#3537 )	5 年前
GitHub	e4177de0	[change] Organize trainer files a bit better (#3538 )	5 年前
GitHub	a6bf50db	Revert obs to goal in simple 1d test (#3540 )	5 年前
GitHub	323f104c	[tests] LSTM end-to-end tests (#3544 )	5 年前
GitHub	870338b4	[bug-fix] Fix issue with more than one continuous actions (#3547 )	5 年前
Andrew Cohen	573b1f6d	Merge branch 'master' into soccer-fives	5 年前
Andrew Cohen	0cc2956d	write to proto	5 年前
GitHub	bcce774f	[tests] Visual observation tests (#3549 )	5 年前
GitHub	213d2466	[bug-fix] Change Simple1DEnvironment to spawn new agent IDs on reset (#3558 )	5 年前
Jason Bowman	c3b15492	Modify demo loader to support gzip comression and reduce memory usage by seeking for individual reads	5 年前
GitHub	b6e3fd67	[tests] Add additional unit tests (#3581 )	5 年前
GitHub	ffd8f855	[bug-fix] Fix crash when demo size is smaller than batch size (#3591 )	5 年前
Chris Elion	7f2e815a	Merge remote-tracking branch 'origin/master' into develop-sidechannel-usability	5 年前
Chris Elion	fa5e7e6d	Merge remote-tracking branch 'origin/master' into develop-BehaviorParams-public	5 年前
GitHub	ed2eb6ef	[bug-fix] Fix entropy computation in MultiCategorialDistribution (#3607 )	5 年前
GitHub	873ba7fd	[bug-fix] Fix stats reporting for reward signals in SAC (#3606 )	5 年前
GitHub	c42a11c3	[change] Throw a proper error when sequence length is greater than batch size. (#3583 )	5 年前
Andrew Cohen	b1cfa74d	Merge branch 'master' into develop-test-imitation	5 年前
Andrew Cohen	e7836fb5	record demos 1d env	5 年前
Ervin Teng	98d5b8e3	Add test	5 年前
Andrew Cohen	7aaf1fb6	gail and bc tests	5 年前
Ervin Teng	6b578de4	Merge branch 'develop-refactorprint' into develop-progress-bar	5 年前
Andrew Cohen	f1eeed9c	success threshold to .9 for imitation	5 年前
Andrew Cohen	f6d6e3d0	reccurent gail tests	5 年前
GitHub	320175d5	[change] Move console printing to StatsWriter class (#3616 )	5 年前
GitHub	25cc9f15	[change] Move hyperparameter printing entirely into StatsWriters (#3630 )	5 年前
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698 ) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 年前
GitHub	2912c883	Basic and visual GAIL and BC integration tests (#3626 )	5 年前
Andrew Cohen	53bea15c	Merge branch 'master' into soccer-fives	5 年前
Andrew Cohen	ac261e36	Merge branch 'master' into self-play-mutex	5 年前
GitHub	6709a9bf	[change] Clean up trainer interface, clean up GhostTrainer stats (#3634 )	5 年前
Andrew Cohen	eefc4811	Merge branch 'master' into self-play-mutex	5 年前
GitHub	ceaac645	[tests] Make subprocess manager test easier (#3651 )	5 年前
Andrew Cohen	79076b70	ELO calculation done in ghost controller	5 年前
Andrew Cohen	579bbd88	passing all tests locally	5 年前
GitHub	29f82921	[bug-fix] Improve performance for PPO with continuous actions (#3662 )	5 年前
Andrew Cohen	fb993986	Merge branch 'master' into self-play-mutex	5 年前
Ervin Teng	ee27e2cc	Fix tests	5 年前
Andrew Cohen	b42c9482	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Andrew Cohen	c4e54218	replaced ghost_swap with team_change in tests	5 年前
Andrew Cohen	d9cdb582	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
GitHub	104f2c46	[tests] Add tests for multiple actions/action branches (#3672 )	5 年前
Ervin Teng	e4d1df01	Fix TC test	5 年前
GitHub	de3fc4e8	Hotfix memory leak on Python (#3664 ) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com>	5 年前
GitHub	11c518a3	Stats SideChannel (for custom TensorBoard metrics) (#3660 )	5 年前
GitHub	458e68f1	Remove "docker target" feature (#3687 ) The "docker target" feature and associated command-line flag --docker-target-name were created for use with the now-deprecated Docker setup. This feature redirects the paths used by learn.py for the environment and config files to be based from a directory other than the current working directory. Additionally it wrapped the environment execution with xvfb-run. This commit removes the "docker target" feature because: * Renaming the paths doesn't fix any problem. Absolute paths can already be passed for configs and environment executables. * Use of xserver, Xvfb, or xvfb-run are independent of mlagents-learn and can be used outside of the mlagents-learn call. Further, xvfb-run is not the only solution for software rendering.	5 年前
GitHub	807a1441	Raise exceptions from environment subprocesses (#3680 ) This commit surfaces exceptions from environment worker subprocesses, and changes the SubprocessEnvManager to raise those exceptions when caught. Additionally TrainerController was changed to treat environment exceptions differently than KeyboardInterrupts. We now raise the environment exceptions after exporting the model, so that ML-Agents will correctly exit with a non-zero return code.	5 年前
Andrew Cohen	7219f60b	fixed tests that expected old hyperparam team-change	5 年前
GitHub	56b75555	[tests] Make end-to-end tests more stable (#3697 )	5 年前
Andrew Cohen	650ec121	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
GitHub	141831da	[bug-fix] Fix entropy computation for GaussianDistribution (#3684 )	5 年前
Andrew Cohen	4c9ac553	Merge branch 'master' into self-play-mutex	5 年前
Andrew Cohen	93d344ff	simple rl asymm ghost tests	5 年前
Andrew Cohen	a7a372b9	Merge branch 'master' into self-play-mutex	5 年前
Andrew Cohen	cd677346	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Andrew Cohen	345fa382	current_best_ratio -> latest_model_ratio	5 年前
Andrew Cohen	c7a34413	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
GitHub	bc1fdf07	[refactor] CLI changes (#3705 )	5 年前
Andrew Cohen	837886e1	Merge branch 'master' into self-play-mutex	5 年前
Andrew Cohen	6ade2ddc	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Andrew Cohen	62c87031	Merge branch 'master' into self-play-mutex	5 年前
GitHub	9cbc3fa2	Asymmetric self-play (#3653 )	5 年前
Ervin Teng	06fa3d39	Merge branch 'master' into develop-sac-apex	5 年前
Anupam Bhatnagar	50e52d9c	Merge branch 'master' into distributed-training	5 年前
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710 )	5 年前
Andrew Cohen	3013774b	alternative to internal-policy fix	5 年前
Andrew Cohen	d1bee64b	fixed test_ghost and test_ppo	5 年前
Andrew Cohen	1b9c643b	Merge branch 'master' into self-play-mutex	5 年前
Andrew Cohen	7006b5ff	asymm ghost test consistent	5 年前
Andrew Cohen	0af2a651	fixed test_sac	5 年前
Ervin Teng	971e4b2d	Don't block when disabling threading	5 年前
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681 ) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 年前
Andrew Cohen	09a53bb8	make reward threshold consistent across ghosts tests	5 年前
Andrew Cohen	a870d453	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
GitHub	b841c9ab	Wrapped trainer has internal policy in GhostTrainer	5 年前
Andrew Cohen	7a7eb324	Merge branch 'master' into internal-policy-ghost	5 年前
Ervin Teng	441fbb91	Fix subprocess test	5 年前
GitHub	55b26417	check demonstration version before loading (#3745 ) * check demonstration version before loading * I guess we have version 0 too * fix mock import (works on my machine)	5 年前
Ervin Teng	f29b17a9	Don't block one policy queue Only put policies when policy is actually updated	5 年前
Ervin Teng	99ce4b59	Improve tests	5 年前
GitHub	aae58330	Merge branch 'master' into develop-add-inference-examples	5 年前
Ervin Teng	d2d88b6a	Fix env_manager test	5 年前
Andrew Cohen	933d7b32	Merge branch 'self-play-mutex' into soccer-2v1	5 年前
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 年前
Ervin Teng	51e76f00	Adjust SAC recurrent	5 年前
Ervin Teng	46d83839	Adjust subprocessor test	5 年前
Andrew Cohen	f41695b9	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Ervin Teng	e90ef688	Revert to get_nowait method in AgentManagerQueue	5 年前
Andrew Cohen	ad6ba833	Merge branch 'internal-policy-ghost' into soccer-2v1	5 年前
Ervin Teng	370b3c40	Fix subprocess env manager test	5 年前
Andrew Cohen	1c2005a8	Merge branch 'internal-policy-ghost' into soccer-2v1	5 年前
Andrew Cohen	80469267	Merge branch 'internal-policy-ghost' into soccer-2v1	5 年前
Ervin Teng	9fe104d6	Make threading disable-able per trainer	5 年前
Andrew Cohen	9ae19e9d	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Andrew Cohen	89db8428	Merge branch 'internal-policy-ghost-alternate' into soccer-2v1	5 年前
Ervin Teng	92158d54	Remove threaded from trainer_controller	5 年前
Andrew Cohen	4468280a	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Andrew Cohen	26c0033c	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Andrew Cohen	cde8360e	update tests	5 年前
Ervin Teng	23039746	Disable threading for all simple_rl tests	5 年前
Andrew Cohen	cb83a467	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Andrew Cohen	9bec75ee	Merge branch 'master' into soccer-2v1	5 年前
GitHub	1536b9f2	Increasing steps on asymmetric ghost test (#3802 )	5 年前
Arthur Juliani	3769d943	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
GitHub	4d23200b	[refactor] Run Trainers in separate threads (#3690 )	5 年前
Ervin Teng	9cd2c034	Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-sac-apex	5 年前
GitHub	7e5513a4	[bug-fix] Increase buffer size for SAC tests (#3813 )	5 年前
vincentpierre	cad57a00	[skip ci] Added some tests but they do not pass (too hard)	5 年前
Andrew Cohen	185d4b35	Merge branch 'soccer-2v1' into asymm-envs	5 年前
Arthur Juliani	3c82bf59	Training runs, but doesn’t actually work	5 年前
GitHub	adeb6536	Catch dimension mismatches between demos and policy (#3821 )	5 年前
Andrew Cohen	b217f8bf	Merge branch 'master' into soccer-2v1	5 年前
Andrew Cohen	b4f52c88	Merge branch 'soccer-2v1' into asymm-envs	5 年前
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807 ) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 年前
GitHub	7b78ffeb	support newer versions of tensorflow (2.1+) (#3830 ) * support tf2.x and python3.8 * tensorflow==2.2.0rc3 for python3.8 * stick with tf2.1 and py3.7 for now * More gail visual steps in simple test (#3836) * increase gail visual ppo steps * increase to 2000 * tune steps down to 750 Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	5 年前
GitHub	4092d937	[Bug fix] Hard reset when team changes (#3870 )	5 年前
Arthur Juliani	212e2d1d	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前
GitHub	f86fc81d	[refactor] Move configuration files to single YAML file (#3791 )	5 年前
GitHub	7e0032f5	[refactor] Allow full RunOptions to be specified in trainer configuration YAML (#3815 )	5 年前
Arthur Juliani	ca887743	Support tf and pytorch alongside one another	5 年前
GitHub	d8b93f8f	[Bug fix] Hard reset when team changes (#3870 ) (#3899 )	5 年前
Chris Elion	68b68396	Merge remote-tracking branch 'origin/master' into release_1_to_master	5 年前
GitHub	d2bc86c8	Release 2 cherry pick (#3971 ) * [bug-fix] Fix issue with initialize not resetting step count (#3962) * Develop better error message for #3953 (#3963) * Making the error for wrong number of agents raise consistently * Better error message for inputs of wrong dimensions * Fix #3932, stop the editor from going into a loop when a prefab is selected. (#3949) * Minor doc updates to release * add unit tests and fix exceptions (#3930) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com>	5 年前
GitHub	4641038e	Renaming max_step to interrupted in TermialStep(s) (#3908 )	5 年前
vincentpierre	c34dd5b6	Merge branch 'master' into develop-gym-wrapper	5 年前
vincentpierre	67027af3	Removed the failing gym tests	5 年前
Andrew Cohen	a2f8319a	Merge branch 'master' into asymm-envs	5 年前
Arthur Juliani	89ad3020	Merge remote-tracking branch 'origin/master' into develop-add-fire # Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py	5 年前
Andrew Cohen	0ec2a890	Merge branch 'master' into asymm-envs	5 年前
GitHub	c5b94ca6	Use LR schedule for beta and epsilon (#3940 )	5 年前
Arthur Juliani	2b3a6347	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
GitHub	812983c0	Some improvements to the UnityEnvironment class (#3939 ) * Fix typo * Made a side channel utils to reduce the complexity of UnityEnvironment * Added a get_side_channel_dict utils method * Better executable launcher (unarguably) * Fixing the broken test * Addressing comments * [skip ci] Update ml-agents-envs/mlagents_envs/side_channel/side_channel_manager.py Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com> * No catch all Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	5 年前
GitHub	c6ed3789	Replaced get_behavior_names and get_behavior_spec with behavior_specs property (#3946 ) * Replaced get_behavior_names and get_behavior_spec with behavior_specs property * Fixing the test * [ci] * addressing some comments * use typing.Mapping (#3948) * Update ml-agents-envs/mlagents_envs/base_env.py Co-authored-by: Chris Elion <chris.elion@unity3d.com> * Adding the documentation Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
Christopher Goy	ba80b292	format files with pre-commit.	4 年前
GitHub	f7373172	Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 update verified brach with barracuda 1.0.2	4 年前
GitHub	abbc6424	[bug-fix] Fix issue with initialize not resetting step count (#3962 )	5 年前
vincentpierre	6ddfe74f	Merge branch 'master' into develop-gym-wrapper	5 年前
Arthur Juliani	28e095e0	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
Ruo-Ping Dong	2ca79207	[bug-fix] Don't load non-wrapped policy (#4593 ) * Always initialize non-wrapped policy * Load ghosted policy * Update changelog * Resume test * Add test * Add torch test and fix torch.	4 年前
Andrew Cohen	59a60c1e	Merge branch 'master' into asymm-envs	5 年前
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936 )	5 年前
GitHub	335cff3e	[versioning] Save ML-Agents version in checkpoints and check on load (#4035 )	5 年前
GitHub	a7323393	[bug-fix] Fix issue with SAC updating too much on resume (#4038 )	5 年前
GitHub	21fe203e	[tests] Increase buffer_init_steps for recurrent sac test (#4051 )	5 年前
GitHub	f5435876	[refactor] Store and restore state along with checkpoints (#4025 )	5 年前
Andrew Cohen	e7750fc9	Merge branch 'master' into develop-sampler-refactor	5 年前
GitHub	ee1098d1	[refactor] Improve config upgrade script and add test (#4056 )	5 年前
GitHub	09853e13	[refactor] Move checkpoint saving into trainer (#4034 )	5 年前
Andrew Cohen	fa5dae1a	tests for settings	5 年前
Andrew Cohen	22786526	Merge branch 'master' into asymm-envs	5 年前
Andrew Cohen	c0f7052b	Merge branch 'master' into develop-sampler-refactor	5 年前
GitHub	09c7787c	[bug-fix] Fix regression in --initialize-from feature (#4086 )	5 年前
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 年前
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102 ) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	5 年前
GitHub	8a49e8e0	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087 )	5 年前
GitHub	8fb66c2d	[bug-fix] Fix issue where curriculum was advancing too early (#4107 )	5 年前
Andrew Cohen	f76780f1	fix tests	5 年前
GitHub	fefbc038	Merge pull request #4109 from Unity-Technologies/release_3_merge_master Release 3 merge to master	5 年前
Andrew Cohen	6554ccb7	Merge branch 'master' into asymm-envs	5 年前
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065 )	5 年前
Arthur Juliani	9724c9ac	Merge master	5 年前
Jonathan Harper	80127232	Convert checkpoints to .nn format Fixed style Fixed more style Nit changes Fixed signature Convert checkpoints to .nn format Fixed style Nit changes Fixed tests, checkpoint management and style Check checkpoint management Modify statement on artifacts Nit changes Fixed signature Nit changes Fixed signature Fixed tests, checkpoint management and style Check checkpoint management Modify statement on artifacts	5 年前
Ervin Teng	2b0c0163	Add settings test	5 年前
GitHub	bb675bf4	Merge pull request #4134 from Unity-Technologies/develop-removebrainnamepolicy [refactor] Remove references to brain_name in policy	5 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	69579611	[refactor] Refactor Actor and Critic classes (#4287 )	4 年前
GitHub	17f03980	[bug-fix] Fix non-LSTM SeparateActorCritic (#4306 )	4 年前
GitHub	5bcbef8d	[tests] Add tests for core PyTorch files (#4292 )	4 年前
GitHub	93517833	[feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305 )	4 年前
Andrew Cohen	f74d301a	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
GitHub	b4749b31	Test fixes on add-fire (#4317 )	4 年前
Ervin Teng	a172fb46	Halve entropy	4 年前
Ervin Teng	b2872adf	Merge branch 'develop-add-fire' into develop-add-fire-halfentropy	4 年前
Andrew Cohen	6df8d32c	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
GitHub	69d29b86	[add-fire] Halve Gaussian entropy (#4319 ) * Halve entropy * Fix utils test	4 年前
Ervin Teng	50a7e952	Fix utils test	4 年前
vincentpierre	599d7e9f	Merging master	4 年前
GitHub	3a982317	[add-fire] Add learning rate and beta/epsilon decay to PyTorch (#4318 )	4 年前
vincentpierre	d031c7a9	Merging master	4 年前
GitHub	7ddfd81f	Added Reward Providers for Torch (#4280 ) * Added Reward Providers for Torch * Use NetworkBody to encode state in the reward providers * Integrating the reward prodiders with ppo and torch * work in progress, integration with PPO. Not training properly Pyramids at the moment * Integration in PPO * Removing duplicate file * Gail and Curiosity working * addressing comments * Enfore float32 for tests * enfore np.float32 in buffer	4 年前
Andrew Cohen	bf8b2328	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
HH	7afa1761	Merge branch 'master' into hh/develop/ragdoll-updates	5 年前
GitHub	36613cad	[add-fire] Fix CategoricalDistInstance test and replace `range` with `arange` (#4327 )	4 年前
Ervin Teng	6b29a4c9	Fix test and replace range with arange	4 年前
GitHub	6b193d03	Develop add fire layers (#4321 ) * Layer initialization + swish as a layer * integrating with the existing layers * fixing tests * setting the seed for a test * Using swish and fixing tests	4 年前
GitHub	3de1e660	[bug-fix] Initialize-from being incorrectly loaded as "None" rather than None (#4175 )	4 年前
Ervin Teng	5bf72236	Fix util test	4 年前
Ervin Teng	cded4c6c	Fix SeparateActorCritic and add test	4 年前
Ruo-Ping Dong	71fe4df6	fix formatting and test	4 年前
GitHub	0e0daf47	[add-fire] Merge post-0.19.0 master into add-fire (#4328 )	4 年前
GitHub	9d2e4268	Revert "[add-fire] Merge post-0.19.0 master into add-fire (#4328 )" (#4330 ) This reverts commit 9913e71b6f35f1e11027a4a571a65533caf285ac.	4 年前
Ruo-Ping Dong	79d89158	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 年前
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138 )	4 年前
Ruo-Ping Dong	e06812aa	fix tests	4 年前
HH	a1f2748e	Merge branch 'master' into hh/develop/crawler-ragdoll-updates	5 年前
GitHub	839eb2cb	Develop model transfer test (#4214 ) * test env, and code integration * delete results	4 年前
yanchaosun	7e3216ae	simple env test	4 年前
yanchaosun	cdaaa318	bisim	4 年前
yanchaosun	3d0d359c	bisimulation draft	4 年前
yanchaosun	1fdbfe65	no normalization	4 年前
yanchaosun	5a778ca3	fix normalization	4 年前
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160 ) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 年前
yanchaosun	a212fef9	new bisim implementation	4 年前
GitHub	e318f96a	don't allow --num-envs >1 with no --env (#4203 ) * don't allow --num-envs >1 with no --env * changelog * PR feedback	4 年前
GitHub	09c63636	[MLA-1145] don't allow --num-envs >1 with no --env (#4209 ) * don't allow --num-envs >1 with no --env (#4203)	4 年前
yanchaosun	aca8cd58	update with new alternating	4 年前
HH	0fdac847	Merge branch 'master' into hh/develop/crawler-ragdoll-updates	5 年前
yanchaosun	0e2f6e19	small fix	4 年前
yanchaosun	ec929746	minor update	4 年前
GitHub	84440f05	Convert checkpoints to .NN (#4127 ) This change adds an export to .nn for each checkpoint generated by RLTrainer and adds a NNCheckpointManager to track the generated checkpoints and final model in training_status.json. Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	4 年前
Andrew Cohen	d0133066	working	4 年前
yanchaosun	9bc90956	fix bug with bisimulation	4 年前
Andrew Cohen	b6bf1860	fix bisim metric	4 年前
Andrew Cohen	617aefc9	resolve conflict	4 年前
yanchaosun	ce36349b	some changes	4 年前
Andrew Cohen	1b17ae56	add tanh activ	4 年前
Arthur Juliani	6bee0fd1	Merge master	4 年前
yanchaosun	caeffa3e	add two envs	4 年前
yanchaosun	447124f1	new test	4 年前
Andrew Cohen	5fa28f5f	merge YC changes	4 年前
yanchaosun	28355444	bisim fix, disable stop gradient	4 年前
Arthur Juliani	c63b3d09	Fix lesson incrementing (#4279 )	4 年前
Andrew Cohen	dad084ee	old crawler config	4 年前
yanchaosun	8fc18e5d	plotting	4 年前
yanchaosun	3246570c	added action encoder, and flags related with action training/transferring; set model_schedule as a changable hyperparameter	4 年前
GitHub	20f1386a	Don't drop multiple stats from the same step (#4236 )	4 年前
GitHub	9f041970	Develop bisim action encoder, incorporate related hyperparameter settings (#4253 )	4 年前
yanchaosun	fb5c33c1	test code	4 年前
GitHub	1f5eb9da	add pyupgrade to pre-commit and run (#4239 )	4 年前
yanchaosun	696ec0cc	new plots	4 年前
GitHub	129f9ddc	[MLA-427] make pyupgrade convert f-strings too (#4244 ) * make pyupgrade convert f-strings too	4 年前
yanchaosun	80bad241	init sac transfer, and added action encoder to bisim; configs for crawler	4 年前
yanchaosun	f81feec4	config fix; basic sac	4 年前
yanchaosun	a505cb16	new config	4 年前
yanchaosun	9a19f6e5	disable bisim	4 年前
yanchaosun	b991096b	update target encoder soft copy	4 年前
Andrew Cohen	d8c123a0	Merge branch 'master' into sensitivity	4 年前
GitHub	ac36b31f	[MLA-1172] Reduce calls to training_behaviors (#4259 )	4 年前
yanchaosun	b74294bf	target encoders and new forward loss	4 年前
GitHub	1b098c9a	Refactor TFPolicy and Policy (#4254 ) * Refactor TFPolicy and Policy	4 年前
GitHub	380fef57	[refactor] Move TF-specific files to tf/ folder (#4266 )	4 年前
Andrew Cohen	06e4356c	Merge branch 'master' into sensitivity	4 年前
GitHub	d1bf56e9	Fix lesson incrementing (#4279 ) * Fix lesson incrementing * Add warning and test * Add test for lesson pasing Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Arthur Juliani	1a123641	Merge remote-tracking branch 'origin/master' into r5-master	4 年前
GitHub	493793a6	[MLA-1233] Remove stats.CSVWriter (#4300 )	4 年前
Andrew Cohen	4b094d25	large normalization obs unit test	4 年前
Ervin Teng	dc937d5c	Merge branch 'master' into develop-add-fire-mm	4 年前
GitHub	1e76f8d0	Merge pull request #4331 from Unity-Technologies/develop-add-fire-mm2 [add-fire] Merge post-0.19.0 master into add-fire (ver. 2)	4 年前
Ervin Teng	4ebccf97	Merge branch 'develop-add-fire' into develop-add-fire-sac-lst	4 年前
Andrew Cohen	598826fe	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
Ruo-Ping Dong	d3eb6c46	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 年前
Andrew Cohen	ae2c83e2	added torch bc tests	4 年前
Ruo-Ping Dong	95858e25	update saver interface and add tests	4 年前
GitHub	6b255790	Behavioral Cloning Pytorch (#4293 )	4 年前
Ruo-Ping Dong	523248be	update	4 年前
GitHub	9dc1d99e	Initialize normalizer with mean/variance from first trajectory (#4299 ) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	ab8e5afa	Release 6 fix nan (#4343 ) * test initalize steps to 100 * use mean of first trajectory to initialize the normalizer * remove blank line * update changelog * cleaned up initialization of variance/mean * large normalization obs unit test * add --upgrade to pip to get newer downloader (#4338) * Fix format of the changelog for validation. (#4340) Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com>	4 年前
GitHub	f374f87a	[add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324 )	4 年前
Andrew Cohen	c25ddc5d	fix tests	4 年前
Andrew Cohen	0a7444f9	revert bc default batch/epoch	4 年前
Anupam Bhatnagar	abc1220f	Merge branch 'master' into global-variables	4 年前
Andrew Cohen	9f25f53b	fix default bc test	4 年前
GitHub	705a0e0e	Curriculum: If no behavior specified, do magic (#4346 ) * Make behavior in curriculum a required attrib * Re-adding the test	4 年前
Ervin Teng	fe4472cb	Add decoders, distributions, encoders, layers, networks, and utils	4 年前
HH	8eaddb61	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 年前
Ervin Teng	69bae3cc	Add test for lstm layer	4 年前
Andrew Cohen	53185b7e	fix tf bc default test	4 年前
Ruo-Ping Dong	59cc1a9f	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 年前
Ruo-Ping Dong	409a161c	fix bc tests	4 年前
Ervin Teng	89890bf2	Update with newest changes	4 年前
GitHub	25dc8c3d	Add Saver Class to handle all save/load/checkpoint/export work (#4323 )	4 年前
Ervin Teng	13f15086	Merge branch 'develop-add-fire' into develop-add-fire-amrl	4 年前
Ervin Teng	d56e53bb	Fix LSTM tests	4 年前
GitHub	e3bc3352	[pytorch] Add decoders, distributions, encoders, layers, networks, and utils (#4349 )	4 年前
Ervin Teng	d65a9326	Merge branch 'master' into develop-add-fire-mm3	4 年前
Ervin Teng	a88d3581	Fix and test for masked_mean	4 年前
Ruo-Ping Dong	d57aa9ab	Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint	4 年前
GitHub	bd6bcd2f	Merge master and add Saver class for save/load checkpoints	4 年前
Ervin Teng	d218bf4d	Merge branch 'develop-add-fire' into develop-add-fire-sac-lst	4 年前
GitHub	6de31a03	[add-fire] Fix masked mean for 2d tensors (#4364 )	4 年前
Ervin Teng	5c1717d1	Bugfixes for continuous case	4 年前
Ervin Teng	42e25b25	Merge branch 'develop-add-fire' into develop-add-fire-memoryclass	4 年前
Ervin Teng	6e946dba	Policy bugfixes and policy tests	4 年前
Christopher Goy	5a233353	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 年前
GitHub	03eac72c	[add-fire] Add tests and fix issues with Policy (#4372 )	4 年前
Andrew Cohen	a65d08c7	ghost trainer tests	4 年前
GitHub	49545ce1	Pytorch ghost trainer (#4370 )	4 年前
Ervin Teng	020ce8ad	Remove some unneeded stuff	4 年前
GitHub	6a1d993f	[add-fire] Memory class abstraction (#4375 )	4 年前
Andrew Cohen	af7d3800	add test_simple_rl tests to torch	4 年前
Andrew Cohen	39bca7d2	fix tf ghost tests	4 年前
Ervin Teng	554ca0b9	Fix test typing	4 年前
GitHub	2332bc32	Add fire to test_simple_rl.py (#4378 ) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	d1c0f217	revert tests	4 年前
Ervin Teng	b107a8d5	Fix network tests	4 年前
HH	2080c287	Merge branch 'master' into hh/develop/loco-crawler-variable-speed	4 年前
HH	d4bd7fe6	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 年前
Ervin Teng	a04e68a4	Merge branch 'develop-add-fire' into develop-add-fire-memoryclass	4 年前
Ervin Teng	d63aacd0	Cleanup, add test	4 年前
GitHub	0d0d2ead	[add-fire] Revert unneeded changes back to master (#4389 )	4 年前
Ervin Teng	987ea2d0	Revert unneeded changes back to master	4 年前
Ervin Teng	8ff8c401	Merge branch 'develop-add-fire' into develop-add-fire-export	4 年前
GitHub	1955af9e	[feature] Add experimental PyTorch support (#4335 ) * Begin porting work * Add ResNet and distributions * Dynamically construct actor and critic * Initial optimizer port * Refactoring policy and optimizer * Resolving a few bugs * Share more code between tf and torch policies * Slightly closer to running model * Training runs, but doesn’t actually work * Fix a couple additional bugs * Add conditional sigma for distribution * Fix normalization * Support discrete actions as well * Continuous and discrete now train * Mulkti-discrete now working * Visual observations now train as well * GRU in-progress and dynamic cnns * Fix for memories * Remove unused arg * Combine actor and critic classes. Initial export. * Support tf and pytorch alongside one another * Prepare model for onnx export * Use LSTM and fix a few merge errors * Fix bug in probs calculation * Optimize np -> tensor operations * Time action sample funct...	4 年前
Ruo-Ping Dong	c47ffc20	Rename saver	4 年前
Ruo-Ping Dong	09c22679	fix NNCheckpointManager for Torch	4 年前
Ruo-Ping Dong	f2a8c421	add torch saver test	4 年前
GitHub	70197342	Add torch saver test Add torch saver test	4 年前
vincentpierre	ba7eb360	Merge branch 'master' into develop-torch-save-rp	4 年前
Ruo-Ping Dong	6ae17cd0	fix test	4 年前
Ruo-Ping Dong	a74c904a	Merge branch 'master' into develop-saver-name	4 年前
vincentpierre	25454a48	adding tests	4 年前
GitHub	347bde3d	Fix export	4 年前
GitHub	38e9387b	Fix NNCheckpointManager for Torch Fix NNCheckpointManager for Torch	4 年前
vincentpierre	108fac9a	Replace torch.detach().cpu().numpy() with a utils method	4 年前
Ruo-Ping Dong	07e82899	update torch saver test	4 年前
vincentpierre	44fa3a65	Moved the tests around	4 年前
HH	d9962254	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 年前
GitHub	328353bc	Torch : Saving/Loading of the reward providers (#4405 ) * Saving the reward providers * adding tests * Moved the tests around * Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py * Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py * Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com> * Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com> Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>	4 年前
Ruo-Ping Dong	e60c7038	Merge branch 'master' into develop-saver-name	4 年前
GitHub	80b7a6d3	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py	4 年前
vincentpierre	fdd343b2	more use of item() and additional tests	4 年前
Ruo-Ping Dong	88eff042	Merge branch 'master' into develop-saver-name	4 年前
GitHub	82bd7fd0	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py	4 年前
Ruo-Ping Dong	56feb8af	update test_saver_reward_providers.py	4 年前
GitHub	4dda2983	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>	4 年前
GitHub	48f217b9	Rename Saver to ModelSaver (#4402 ) Rename Saver to ModelSaver to avoid confusion with tf.Saver	4 年前
GitHub	83e21972	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>	4 年前
Anupam Bhatnagar	f4f1a8d9	merge master into trainer-plugin branch	4 年前
GitHub	12e15e29	Fix on GAIL Torch when using actions (#4407 )	4 年前
GitHub	498934f9	Replace torch.detach().cpu().numpy() with a utils method (#4406 ) * Replace torch.detach().cpu().numpy() with a utils method * Using item() in place of to_numpy() * more use of item() and additional tests	4 年前
Ruo-Ping Dong	27fb4270	brain_name to behavior_name	4 年前
GitHub	bfda9576	Replace brain_name with behavior_name (#4419 ) brain_name -> behavior_name some prob -> log_prob in comments rename files optimizer -> optimizer_tf for tensorflow	4 年前
Ruo-Ping Dong	fd1dc3a6	Merge branch 'master' into develop-torch-omp	4 年前
GitHub	7b4d0865	[Bug fix] Fix bug in GAIL gradient penalty (#4425 )	4 年前
GitHub	4e93cb6e	[torch] Restructure PyTorch encoders (#4421 ) * Move linear encoding to NetworkBody * moved encoders to processors (#4420) * fix bad merge * Get it running * Replace mentions of visual_encoders * Remove output_size property * Fix tests * Fix some references * Revert test_simple_rl * Fix networks test * Make curiosity test more accomodating * Rename total_input_size * [Bug fix] Fix bug in GAIL gradient penalty (#4425) (#4426) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Up number of steps * Rename to visual_processors and vector_processors Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	4e6d46cc	[tests] Add tests for Torch PPO (#4429 )	4 年前
GitHub	beb5eb30	[bug-fix] Fixes for Torch SAC and tests (#4408 ) * Fixes for Torch SAC and tests * FIx recurrent sac test * Properly update normalization for SAC-continuous * Fix issue with log ent coef reporting in SAC Torch	4 年前
GitHub	6f534366	Add torch_utils class, auto-detect CUDA availability (#4403 ) * Add torch_utils * Use torch from torch_utils * Add torch to banned modules in CI * Better import error handling * Fix flake8 errors * Address comments * Move networks to GPU if enabled * Switch to torch_utils * More flake8 problems * Move reward providers to GPU/CPU * Remove anothere set default tensor * Fix banned import in test	4 年前
GitHub	676f5f7c	[refactor] Refactor GAIL to use new encoder structure (#4433 ) Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Ervin Teng	60eacc0d	Merge branch 'master' into develop-adjust-cpu-settings	4 年前
GitHub	bf6506fc	[feature] Add small CNN for grids 5x5 and up (#4434 )	4 年前
yanchaosun	1a9aaaf6	model weights and large transfer learning weight	4 年前
GitHub	94c7111e	[feature] Enable default settings for TrainerSettings (#4448 ) * Enable default settings for TrainerSettings * Improve comments * Fix bugs and add tests * Remove unneccessary changes * Update docs * Update changelog * spelling correction	4 年前
GitHub	2dc34612	Prevent init normalize on --resume (#4463 ) Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
Andrew Cohen	3997b14b	Merge branch 'master' into develop-hybrid-actions	4 年前
Ervin Teng	7754ad7b	Don't run value during inference	4 年前
Andrew Cohen	85602279	add action_out to dist	4 年前
vincentpierre	181bdec0	-	4 年前
GitHub	4e4ad7b0	Don't run value during policy evaluate, optimized soft update function (#4501 ) * Don't run value during inference * Execute critic with LSTM * Address comments * Unformat * Optimized soft update * Move soft update to model utils * Add test for soft update	4 年前
Ervin Teng	f9ff3efe	Merge branch 'develop-policyonly' into develop-sac-targetq	4 年前
Andrew Cohen	7c0aa77b	Merge branch 'develop-actions-out' into develop-hybrid-actions	4 年前
GitHub	60b76790	Random Network Distillation for Torch (#4473 ) * initial commit * works with Pyramids * added unit tests and a separate config file * Adding first batch of documentation * adding in the docs that rnd is only for PyTorch * adding newline at the end of the config files * adding some docs * Code comments * no normalization of the reward * Fixing the tests * [skip ci] * [skip ci] Make sure RND will only work for Torch by editing the config file * [skip ci] Additional information in the Documentation * Remove the _has_updated_once flag	4 年前
GitHub	e471bd8b	Refactoring of the tests folder for the trainers (#4510 ) * Refactoring of the tests folder for the trainers * Fixing issues * Fixing issues * Fixing issues	4 年前
GitHub	827525f9	Add test env for hybrid actions, clean up BehaviorSpec (#4522 )	4 年前
GitHub	400e14cb	[Bug-fix] RND would not be saved correctly. Added tests (#4514 )	4 年前
Andrew Cohen	db37db34	fixing errors	4 年前
GitHub	2b300088	Better hybrid actions test env (#4523 ) * Add test env for hybrid actions, clean up BehaviorSpec * Add reward function and proper step	4 年前
Andrew Cohen	53176dc0	Merge branch 'develop-hybrid-actions' of https://github.com/Unity-Technologies/ml-agents into develop-hybrid-actions	4 年前
Andrew Cohen	44c9879e	action models	4 年前
HH	a3bf96fd	Merge branch 'master' into hh/develop/gridsensor-tests	4 年前
Andrew Cohen	c494bfcc	trains successfully	4 年前
GitHub	badca342	Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540 )	4 年前
Ervin Teng	8dec4771	Add hybrid actions to SAC	4 年前
GitHub	c188781b	[life improvement] Moving Python files around (#4531 ) * Moved components to the tf folder and moved the TrainerFactory to the `trainer` folder * Addressing comments * Editing the migrating doc * fixing test	4 年前
Andrew Cohen	e686a785	removed abstract class	4 年前
Ervin Teng	81342148	Revert "Add hybrid actions to SAC" This reverts commit a759b36a51df4f8f1fd296f9f148269f0f026e42.	4 年前
Andrew Cohen	63757004	experiment with 1/1 test	4 年前
Andrew Cohen	35b88994	simple rl tests pass	4 年前
Andrew Cohen	4b9a7db6	remove old behaviorspec	4 年前
GitHub	efa2a704	add to_string for samplers (#4484 ) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Andrew Cohen	fc3027ac	tf tests except gail pass	4 年前
GitHub	b3bc7896	Cherrypick bug fixes to release_9_branch (#4617 ) * [bug-fix] Don't load non-wrapped policy (#4593) * pin cattrs version * cap PyTorch version * use v2 action and pin python version (#4568) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Andrew Cohen	e5f14400	Merge branch 'master' into develop-hybrid-actions-singleton	4 年前
Andrew Cohen	601f02a8	update simple rl tests	4 年前
GitHub	e4db5dc5	ActionSpec and ActionBuffer (#4578 )	4 年前
Andrew Cohen	7827ca06	add ActionSpec; test_simple_rl torch passes	4 年前
GitHub	be723c66	Change BrainParametersProto to support ActionSpec (#4579 )	4 年前
Andrew Cohen	da978fc6	add separate hybrid test file	4 年前
GitHub	a690af74	[refactor] Make PyTorch the default and TensorFlow optional (#4517 ) * Torch setup.py * Set torch to default * Make torch default in setup.py * Remove indents * Remove other instances of TF being used * Add tensorboard to setup.py * Adding correst setup commands for verifying torch is installed (#4524) * Adding correst setup commands for verifying torch is installed * Editing the test_requirments to add tf and remove torch * Develop torchdefault raise outside setup (#4530) * Torch not imported error to raise at first usage * Torch not imported error to raise at first usage * [refactor] Use PyTorch TensorBoard utils (#4518) * Convert stats writer to use PyTorch TB support * Use common function to print params * Update test * Bump tensorboard to 1.15 to fix the tests * putting tensorboard 1.15.0 as min version requirement Co-authored-by: vincentpierre <vincentpierre@unity3d.com> * [Docs] Initial documentation changes for making...	4 年前
Andrew Cohen	eaecb59e	torch utils to and from buffer	4 年前
Andrew Cohen	6e23bafd	ActionFlattener Refactor	4 年前
Andrew Cohen	8013e544	ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member)	4 年前
GitHub	b5dd43f2	[bug-fix] Don't load non-wrapped policy (#4593 ) * Always initialize non-wrapped policy * Load ghosted policy * Update changelog * Resume test * Add test * Add torch test and fix torch.	4 年前
Andrew Cohen	f654df34	fixing tensorflow tests	4 年前
GitHub	e0ef30a5	[bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538 ) * Proper dimensions for entropy, sum before bonus in PPO * Make entropy reporting same as TF * Always use separate critic * Revert to shared * Remove unneeded extra line * Change entropy shape in test * Change another entropy shape * Add entropy summing to evaluate_actions * Add notes about torch.abs(policy_loss)	4 年前
GitHub	cb8e4d25	Add ActionSpec (#4586 ) Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	b40e7793	fix mlagents-envs tests	4 年前
GitHub	60b173df	[bug-fix] Fix Gym and some Policy tests for ActionSpec (#4590 ) * Fix Gym for ActionSpec * Fix TF policy test	4 年前
Ervin Teng	ceeea719	Fix TF policy test	4 年前
Andrew Cohen	9689cf2c	remove _action_ from function names	4 年前
GitHub	64e998a2	[bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610 ) * Use float64 in GAIL tests * Use float32 when converting np arrays by default * Enforce torch 1.7.x or below * Add comment about Windows install * Adjust tests	4 年前
Andrew Cohen	590adc01	make_fake_trajectory/step take ActionSpec arg	4 年前
vincentpierre	96452986	Initial commit for multi head attention	4 年前
vincentpierre	a3a9a56b	Merge branch 'exp-multi-head-attention' into exp-bullet-hell	4 年前
Ruo-Ping Dong	9e08be87	Merge branch 'master' into release_9_branch_merge	4 年前
Andrew Cohen	97dfa142	fix action_spec refs	4 年前
GitHub	b853e5ba	Action buffer (#4612 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	3c96a3a2	Action Model (#4580 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	0e28dd8f	add static method to create continuous/discrete	4 年前
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704 )	4 年前
Andrew Cohen	ccd7cc4c	fix recurrent sac test	4 年前
Andrew Cohen	ae920478	resolve conflicts	4 年前
GitHub	87a7ccf8	use int64 steps, check for NaN actions (#4607 ) * use int64 steps * check for NaN actions Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>	4 年前
GitHub	23800f33	Merge branch 'master' into develop-action-spec	4 年前
GitHub	85a7c0f7	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649 )	4 年前
Ervin Teng	184f27c6	Make buffer type-agnostic	4 年前
GitHub	733bffbf	use int64 steps, check for NaN actions (#4607 ) (#4654 ) * use int64 steps * check for NaN actions Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Andrew Cohen	b6d10456	removed action_spec.size	4 年前
GitHub	8175d558	[bug-fix] Fix BC module + action clipping (#4667 )	4 年前
GitHub	2a8c6800	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649 ) (#4662 )	4 年前
vincentpierre	e14e1c4d	Improvements and new tests	4 年前
Ruo-Ping Dong	953cb6bb	Merge branch 'master' into develop-windows-delay	4 年前
Ruo-Ping Dong	ee5313e4	Merge branch 'master' into develop-windows-delay	4 年前
GitHub	f0ed3a38	Cherry-pick BC fixes to Release 10 (#4668 )	4 年前
Andrew Cohen	dca09bd9	add docstrings	4 年前
Andrew Cohen	afd16cc9	rename make_x to creat_x/remove redundant properties	4 年前
Andrew Cohen	5b9aab58	fix advanced vis encoder simple rl	4 年前
Andrew Cohen	505dcf80	fix recurrent/advanced ppo tests	4 年前
Andrew Cohen	4f66ebc2	fix recurrent sac	4 年前
Andrew Cohen	8df63dab	reduce visual advanced steps	4 年前
Andrew Cohen	95892058	reduce recurrent step/increase batch size	4 年前
Andrew Cohen	3f771e61	add ActionBuffers and utils	4 年前
Andrew Cohen	b70e6078	reduce steps_per_update recurrent sac	4 年前
Ervin Teng	3765c15a	Merge branch 'develop-multitype-buffer' into develop-unified-obs	4 年前
Andrew Cohen	667d295c	recurrent sac passes locally but fails on CI for inexplicable reasons	4 年前
Andrew Cohen	a343f4e1	increase seq length	4 年前
Andrew Cohen	e5cc57f9	rename create random to random action	4 年前
vincentpierre	b863af57	Removing TensorFlow Trainers	4 年前
Ervin Teng	3b614302	Merge branch 'develop-multitype-buffer' into develop-centralizedcritic	4 年前
GitHub	278911a5	Fix staging tests (#4708 )	4 年前
GitHub	94c59e31	C# changes for hybrid action spaces (#4587 ) * Add hybrid action capability flag (#4576) * Change BrainParametersProto to support ActionSpec (#4579) * Assign new BrainParametersProto fields based on capabilities (#4581) * ActionBuffer with hybrid actions for RemotePolicy (#4592) * Barracuda inference for hybrid actions (#4611) * Refactor BarracudaModel loader checks (#4629) * Export separate nodes for continuous/discrete actions (#4655) * Separate continuous/discrete actions in AgentActionProto (#4698) * Force different nodes for new and deprecated action output (#4705)	4 年前
Andrew Cohen	f6355ba9	Merge branch 'develop-action-spec' into develop-action-buffer	4 年前
GitHub	a4c9f58e	Fix SubprocessEnvManager hanging on unexpected exceptions. (#4699 ) * Add shutdown sentinel value to subprocess_env_manager. * Add Sanity Check for Zombie Workers	4 年前
vincentpierre	713e65fb	removing tensorflow testing for pytest and yamato	4 年前
Andrew Cohen	d624b54b	Merge branch 'master' into fix-conflict-base-env	4 年前
Andrew Cohen	bd917c9c	action buffer passes continuous	4 年前
Andrew Cohen	b36fcf16	discrete runs/cont passes	4 年前
Andrew Cohen	ad951493	debugging discrete	4 年前
Andrew Cohen	fcf6471e	2d discrete passes	4 年前
Andrew Cohen	056630d7	sac continuous and discrete train	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
vincentpierre	735fcd52	[WIP] Refactor trainers to use list of obs rather than vec and vis obs	4 年前
Andrew Cohen	85e4db33	bc tests pass	4 年前
Arthur Juliani	b8f22fd7	Update second half of tests	4 年前
vincentpierre	93ca1409	fixing the tests	4 年前
vincentpierre	7a5cc9ec	Merge master into develop-rm-tf	4 年前
Andrew Cohen	24fd9b3c	torch reward providers all pass	4 年前
Arthur Juliani	b074c252	Fix remaining tests	4 年前
Andrew Cohen	dee6b805	fixed bug in discrete	4 年前
Arthur Juliani	ba495418	Resolve pre-commit issues	4 年前
vincentpierre	c1587bce	Solving merge conflicts	4 年前
Andrew Cohen	8172b3d6	test_simple_rl/reward providers pass tf/torch	4 年前
Andrew Cohen	4ebc6c44	ml-agents-envs pass	4 年前
GitHub	ded1f79b	Merge pull request #4732 from Unity-Technologies/goal-sensors Adds SensorTypes and GoalSensors	4 年前
Andrew Cohen	b5d1c071	Merge branch 'master' into develop-action-buffer	4 年前
Arthur Juliani	0d2f8887	Merge remote-tracking branch 'origin/master' into goal-conditioning # Conflicts: # ml-agents-envs/mlagents_envs/base_env.py # ml-agents-envs/mlagents_envs/rpc_utils.py # ml-agents/mlagents/trainers/tests/mock_brain.py # ml-agents/mlagents/trainers/tests/simple_test_envs.py	4 年前
GitHub	a0d1c829	Action Docs part2 (#4739 ) * reduce usage of "vector action" and "action space" * more cleanup * undo GettingStarted change for now * batch size description * Apply suggestions from code review Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
Andrew Cohen	762274d9	agent processor tests	4 年前
Arthur Juliani	2be6af80	Fix black	4 年前
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 年前
Andrew Cohen	94179947	fix demo loader tests	4 年前
GitHub	ba21e419	Merge pull request #4737 from Unity-Technologies/goal-gridworld-sensor Use GoalSensor in GridWorld	4 年前
vincentpierre	bc9d3975	merge master	4 年前
Andrew Cohen	cd73cce2	test_trajectory fixed	4 年前
GitHub	ad5f878c	[refactor] Remove critic pass during inference (#4743 )	4 年前
GitHub	11687f8d	[cherry-pick] Cherry-pick #4743 into Release 11 (#4756 )	4 年前
Andrew Cohen	3c65b964	fixed recurrent prev_action issue	4 年前
GitHub	903d3afe	Merge pull request #4707 from Unity-Technologies/develop-rm-tf Removing TensorFlow Trainers	4 年前
vincentpierre	14378aa5	Merging master	4 年前
Andrew Cohen	97d94a83	fix test_tf_policy	4 年前
Andrew Cohen	293bd20b	fix torch test_ppo	4 年前
vincentpierre	1a1070b1	forgot a file	4 年前
Andrew Cohen	230497f5	fix torch utils test	4 年前
Andrew Cohen	eef14922	discrete/contionuous unity envs train	4 年前
Andrew Cohen	e9cb1066	agent processor tests	4 年前
Andrew Cohen	a545859e	fix torch test policy	4 年前
vincentpierre	8cb050ef	WIP Made initial changes to enale dimension properties and added attention module	4 年前
Andrew Cohen	498b1ee6	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 年前
GitHub	a73f7d73	Turn down gain on GAIL discriminator output (#4762 )	4 年前
Andrew Cohen	157f9e77	rename to ActionTuple	4 年前
Andrew Cohen	06f1f254	1:1 and continuous/discrete train	4 年前
GitHub	b6bb01b9	Turn down gain on GAIL discriminator output (#4762 ) (#4772 )	4 年前
vincentpierre	c3699de8	merging master and addressing comments	4 年前
Andrew Cohen	453a2bba	ActionTuple default is now np.array, not None	4 年前
Andrew Cohen	60466287	fix simple test env	4 年前
GitHub	29d94c7c	Merge pull request #4734 from Unity-Technologies/develop-obs-as-list Refactor trainers to use list of obs rather than vec and vis obs	4 年前
Andrew Cohen	1d234d1d	bc works	4 年前
vincentpierre	719c969c	addressing comments. ObservationSpec is no longer a list	4 年前
vincentpierre	4bba4e8e	Renaming ObservationSpec to SensorSpec	4 年前
Andrew Cohen	c0d01baf	Merge branch 'master' into merge-release11-master	4 年前
Andrew Cohen	95566e44	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 年前
vincentpierre	8dee7970	Fixing the tests	4 年前
Andrew Cohen	5f0f7e3a	fix reward provider tests	4 年前
Andrew Cohen	88b8f4b4	replace use_discrete with action_sizes in simple_rl	4 年前
vincentpierre	c5a057d2	renaming obs_spec variables	4 年前
vincentpierre	44ed3258	Merging master	4 年前
Andrew Cohen	3457cd3c	save only discrete actions as prev	4 年前
Andrew Cohen	9c3e4bab	fix mock brain prev action	4 年前
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 年前
Andrew Cohen	35769b53	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 年前
Andrew Cohen	272affe0	preliminary aciton model tests	4 年前
Andrew Cohen	17496265	move AgentAction, ActionLogProbs, and ActionFlattener to separate files	4 年前
Chris Elion	76ebc20c	Merge remote-tracking branch 'origin/master' into r12-to-master	4 年前
Andrew Cohen	d984af1f	action model and network tests	4 年前
GitHub	458fee17	Merge pull request #4763 from Unity-Technologies/develop-att WIP Made initial changes to enable dimension properties and added attention module	4 年前
Ervin Teng	330fc1d0	Merge branch 'master' into develop-centralizedcritic-mm	4 年前
Andrew Cohen	60309d8f	fix torch policy tests	4 年前
vincentpierre	519c5f47	merging master	4 年前
Andrew Cohen	89bb11d3	remove actionspec logic simple test env	4 年前
Ruo-Ping Dong	8ed14762	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 年前
Arthur Juliani	0a22af55	Add SensorType field to SensorSpec	4 年前
Andrew Cohen	11e2f5e4	remove unused imports test_hybrid	4 年前
Andrew Cohen	6ffbf209	fix imports in test utils	4 年前
GitHub	8a40c58a	Added SUM as aggregation type for custom statistics (#4816 )	4 年前
GitHub	7387a77f	remove pylint (#4836 ) * remove pylint * remove other pylint disables	4 年前
Andrew Cohen	886883b3	Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton	4 年前
Arthur Juliani	e4b8e7e2	Rename to ObservationType	4 年前
GitHub	14129a08	[MLA-470] Barracuda + TF cleanup (#4837 ) * remove barracuda conversion, tensorflow cleanup * unused var	4 年前
Arthur Juliani	986717d0	More renaming	4 年前
Andrew Cohen	0c5934ec	fix test agent processor	4 年前
GitHub	9689449f	Refactor of attention (#4840 ) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Arthur Juliani	0b4b0992	Rename more files	4 年前
Andrew Cohen	1812f08b	fix test trajectory	4 年前
GitHub	af5f6ad0	make sure DefaultTrainerDict is pickle-able (#4842 )	4 年前
Arthur Juliani	7c37c759	Fix some mis-renamings	4 年前
Andrew Cohen	701c1a3f	fix test torch distributions	4 年前
GitHub	b7e6efa3	Allow setting maximum number of elements in self-attention to None (#4841 ) * separate entity encoder and RSA * clean up args in mha * more cleanups * fixed tests * entity embeddings have no max option * Add exceptions for variable export * Fix test * Add docstrings Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>	4 年前
vincentpierre	56972f56	WIP integrate attention to networkbody	4 年前
Arthur Juliani	5342f426	One more rename	4 年前
Ervin Teng	aba633b2	Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm	4 年前
Andrew Cohen	e88558c3	fix torch test policy	4 年前
Andrew Cohen	631ac7f4	fixed tests	4 年前
Ervin Teng	30a09c6f	Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm	4 年前
Andrew Cohen	22f42f5b	fix torch test ppo	4 年前
GitHub	eb78a477	Add default init/gain to LinearEncoder (#4846 )	4 年前
vincentpierre	7f8e6a0d	fix tests	4 年前
Andrew Cohen	85b18389	fix test tf policy	4 年前
GitHub	0ac990e0	add LayerNorm (#4847 )	4 年前
Andrew Cohen	4bf182aa	fix tensorflow test simple rl	4 年前
Ruo-Ping Dong	a7d04be6	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 年前
Andrew Cohen	8c42dcc7	fix tensorflow test ppo	4 年前
vincentpierre	5039b65a	Merge branch 'master' into develop-att-network-integration	4 年前
Arthur Juliani	0a876b9c	Fix typos	4 年前
Ervin Teng	2085e17c	Merge branch 'master' into develop-centralizedcritic-mm	4 年前
Andrew Cohen	ff324d0c	fixed sac recurrent tf simple rl	4 年前
Arthur Juliani	e3de0406	Plurals	4 年前
Ruo-Ping Dong	180d3e20	Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager	4 年前
HH	0024a286	merge ervin's new stuff	4 年前
GitHub	12e1fc28	[feature] Hybrid SAC (#4574 )	4 年前
Andrew Cohen	7af25330	fixed torch test sac	4 年前
Andrew Cohen	9bcd3c39	fix 2d sac	4 年前
Arthur Juliani	7b230bdf	Change seed for two offending tests	4 年前
Andrew Cohen	b0c02ee0	Merge branch 'develop-hybrid-actions-csharp' into develop-actionmodel-csharp	4 年前
Arthur Juliani	fc756e5a	Formatting	4 年前
Arthur Juliani	a0876939	Extend test time	4 年前
Arthur Juliani	880d390b	Change seed	4 年前
Arthur Juliani	b4d8cf54	Change learning rate	4 年前
GitHub	67ad9651	Merge pull request #4825 from Unity-Technologies/sensor-types [WIP] Observation Types	4 年前
vincentpierre	8660b1c2	merging master	4 年前
GitHub	a02cf933	Add predict minimum attention test (#4853 )	4 年前
vincentpierre	24d2f335	fixing test	4 年前
vincentpierre	38fc2536	addresing some comments	4 年前
GitHub	01e0ee00	refactor entityembedding/network body (#4857 )	4 年前
GitHub	89b6c949	use singular entity embedding (#4873 )	4 年前
Andrew Cohen	6dafe05c	fix tests	4 年前
brccabral	457fb612	Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents	4 年前
brccabral	f21a1f85	Increase sleep time to assert given exception as UnityEnvironmentException	4 年前
GitHub	67594fa5	Merge pull request #4868 from brccabral/PytestWSL Increase sleep time to assert given exception as UnityEnvironmentException	4 年前
vincentpierre	52b011d6	_	4 年前
vincentpierre	03c905b2	Fix equation for entropy	4 年前
vincentpierre	396bc43c	Merging master	4 年前
GitHub	d4455936	Merge pull request #4869 from Unity-Technologies/fix-normal-entropy Fix equation for entropy	4 年前
vincentpierre	b7c7d773	Adding some tests	4 年前
vincentpierre	6f3ea7b8	_	4 年前
Arthur Juliani	372c784c	Fix tests	4 年前
vincentpierre	ff826bd2	added a test	4 年前
vincentpierre	aaec009a	Formatting	4 年前
vincentpierre	2f48cb82	Fixing a test	4 年前
vincentpierre	52e4069f	fixing formatting	4 年前
Arthur Juliani	987800f2	Change StatsSummary to use properties	4 年前
GitHub	bd4bc66b	Merge branch 'master' into fix-numti-env-delayed-spawn	4 年前
vincentpierre	77eecc6b	Merge branch 'master' into develop-att-network-integration	4 年前
GitHub	db4436e9	Merge pull request #4872 from Unity-Technologies/fix-numti-env-delayed-spawn [Bug Fix] Fix crash if spawn is delayed in multi-env	4 年前
vincentpierre	7e47f94b	addressing comments	4 年前
GitHub	d7f549f9	Run pytest on GPU (#4865 ) * make tests device-friendly * mark all tests in test_simple_rl	4 年前
vincentpierre	c27a95f0	Make a self encoder before EntityEmbedding	4 年前
Arthur Juliani	ff70c5c4	Merge branch 'master' into goal-conditioning-new	4 年前
vincentpierre	fd007f53	Attempting to use EntityEmbedding directly as processor	4 年前
vincentpierre	f5ec393b	added a test to make sure that a mask of all zeros or all ones would not break backpropagation	4 年前
vincentpierre	1cff7848	no need for large number of steps in test	4 年前
GitHub	457ed0b8	Set torch device from commandline (#4888 )	4 年前
GitHub	d8835857	[MLA-1540] Training Analytics (#4780 )	4 年前
GitHub	2fb87e4f	Merge branch 'master' into reward-dist	4 年前
GitHub	212ebfb9	Merge pull request #4844 from Unity-Technologies/develop-att-network-integration Integrate attention to networkbody	4 年前
Chris Elion	9d70220e	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 年前
GitHub	f027e12d	Merge pull request #4878 from Unity-Technologies/reward-dist Track histogram of environment reward	4 年前
GitHub	64fc7f43	Buffer key enums (#4907 )	4 年前
Ervin Teng	b6f88d6d	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Andrew Cohen	543f22bc	fix test_networks	4 年前
Ervin Teng	1831044a	Update SAC to use separate policy	4 年前
GitHub	4d32857d	Merge branch 'master' into develop-var-len-obs-feature	4 年前
Ruo-Ping Dong	471a2e82	fix tests	4 年前
GitHub	5022d710	Add additional logic to avoid load being called on every advance (#4934 )	4 年前
Ruo-Ping Dong	d1107648	fix tests	4 年前
Ervin Teng	c7054d76	Use attention tests from master	4 年前
Andrew Cohen	6828713c	fix saver test	4 年前
Ervin Teng	da6a55a0	Revert "Use attention tests from master" This reverts commit 78e052be8f36381bb6857817ff0f505716be83b9.	4 年前
Ervin Teng	281fcdbe	Merge remote-tracking branch 'origin/develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Ervin Teng	24ee4bd5	Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer	4 年前
Ervin Teng	bac2fb68	Use attention from master	4 年前
Andrew Cohen	66742dc8	test for SharedActorCritic	4 年前
Ruo-Ping Dong	c87bce9e	Merge branch 'master' into develop-base-teammanager	4 年前
Andrew Cohen	d81d0be3	fix agent processor test	4 年前
Ervin Teng	e112ede0	Fix mock brain	4 年前
Andrew Cohen	3f7d68b8	fix test policy	4 年前
Ervin Teng	aa6d4de2	np float32 fixes	4 年前
Andrew Cohen	531695fb	adjust step size gail visual ppo	4 年前
Ervin Teng	219e773b	Merge branch 'develop-fix-lstms' into develop-critic-op-lstm	4 年前
Ervin Teng	44073593	Test for team obs in agentprocessor	4 年前
Ervin Teng	a81512c9	Test for group and add team reward	4 年前
Christopher Goy	9cadfa7a	Merge master -> release_13_branch-to-master	4 年前
vincentpierre	e1b94b8b	Merge branch 'master' into develop-var-len-obs-feature	4 年前
Andrew Cohen	dc8e8494	Merge branch 'master' into develop-critic-optimizer	4 年前
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 年前
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Ervin Teng	566efa52	Fix Trajectory test	4 年前
Ervin Teng	4a33be31	Tweak SAC tests	4 年前
Ervin Teng	7471a2fd	Fix AgentProcessor tests	4 年前
Ervin Teng	40f51774	Fix PPO tests	4 年前
Ervin Teng	180f7d03	Fix SAC test	4 年前
Chris Elion	c3bc8991	cleanup, don't store mask	4 年前
GitHub	ddb01eb2	MultiAgentGroup Interface (#4923 ) * add SimpleMultiAgentGroup * add group reward field to agent and proto	4 年前
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 年前
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 年前
Ervin Teng	08db7c2f	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm	4 年前
Ervin Teng	2f209c12	Buffer fixes (cherry picked from commit 2c03d2b544d0c615e7b60d939f01532674d80753)	4 年前
Ervin Teng	12cef7af	Add test for GroupObs	4 年前
Ervin Teng	1fc3640e	Change AgentAction back to 0 pad and add tests	4 年前
GitHub	338af2ec	Move the Critic into the Optimizer (#4939 ) Co-authored-by: Ervin Teng <ervin@unity3d.com>	4 年前
HH	4c947151	Merge branch 'main' into hh/develop/dodgeball	4 年前
Ervin Teng	61781a1a	Merge branch 'main' into develop-agentprocessor-teammanager	4 年前
Andrew Cohen	9060da06	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer	4 年前
Ervin Teng	50ab983e	Fix slicing typing and string printing in AgentBufferField	4 年前
Ervin Teng	bc3d3a95	Fix slicing typing and string printing in AgentBufferField	4 年前
Ervin Teng	56d4c1f9	Fix to-flat and add tests	4 年前
Andrew Cohen	5d517c5e	clean ups	4 年前
Andrew Cohen	e2d46ca0	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer	4 年前
Andrew Cohen	8562471e	add inital coma optimizer tests	4 年前
Andrew Cohen	43955c5b	get value estimate test	4 年前
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 年前
GitHub	d36a5242	Python Dataflow for Group Manager (#4926 ) * Make buffer type-agnostic * Edit types of Apped method * Change comment * Collaborative walljump * Make collab env harder * Add group ID * Add collab obs to trajectory * Fix bug; add critic_obs to buffer * Set group ids for some envs * Pretty broken * Less broken PPO * Update SAC, fix PPO batching * Fix SAC interrupted condition and typing * Fix SAC interrupted again * Remove erroneous file * Fix multiple obs * Update curiosity reward provider * Update GAIL and BC * Multi-input network * Some minor tweaks but still broken * Get next critic observations into value estimate * Temporarily disable exporting * Use Vince's ONNX export code * Cleanup * Add walljump collab YAML * Lower max height * Update prefab * Update prefab * Collaborative Hallway * Set num teammates to 2 * Add config and group ids to HallwayCollab * Fix bug with hallway collab * E...	4 年前
Ervin Teng	fd0dd35c	Merge branch 'main' into develop-coma2-trainer	4 年前
Ervin Teng	c8137dcd	Merge branch 'main' into develop-superpush-int	4 年前
GitHub	af36ef3b	[bug-fix] Fix typo (#5035 ) * Fix typo * Add test	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	47db8ce1	[bug-fix] Fix padding for List entries in buffer (#5046 ) * Fix padding for List entries in buffer * Revert to coonverting to np.array * Fix dtype in PPO trainer	4 年前
Christopher Goy	921ba4f0	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	ba2af269	[coma2] Make group extrinsic reward part of extrinsic (#5033 ) * Make group extrinsic part of extrinsic * Fix test and init * Fix tests and bug * Add baseline loss to TensorBoard	4 年前
GitHub	d24b0966	[bug-fix] Fix memory leak when using LSTMs (#5048 ) * Detach memory before storing * Add test * Evaluate with no_grad	4 年前
Christopher Goy	ebe45056	Merge branch 'main' into release_14_branch-to-main	4 年前
GitHub	d2635e58	Action slice (#5047 ) * add slice function to agent action * add type/docstring to slice * add test	4 年前
Ervin Teng	8902c058	Merge branch 'main' into develop-coma2-trainer	4 年前
Andrew Cohen	95f62362	add test	4 年前
Andrew Cohen	853b44d5	torch coma tests: lstm, cur, gail	4 年前
GitHub	46461986	pass sensor name through to ObservationSpec (#5036 )	4 年前
GitHub	fc5d0a3f	[bug-fix] Fix save/restore critic, add test (#5062 ) * Fix save/restore critic, add test * Rename module for PPO * Use correct policy in test	4 年前
Chris Elion	970f1d40	Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec	4 年前
Andrew Cohen	cd349985	add negative constant extrinsic to gail	4 年前
GitHub	ffca08c4	Upgrade PyTorch version for python 3.9 (#5028 )	4 年前
Ervin Teng	1f026c70	Merge branch 'main' into develop-superpush-branch-cleanup	4 年前
Andrew Cohen	e547f26c	adjust step size	4 年前
Ervin Teng	ce872033	Revert "Merge branch 'main' into develop-superpush-branch-cleanup" This reverts commit 5bea802525381f931a5e0f8b8778fe27a12f03af, reversing changes made to cee3524e85161e13689d95f66bc6bff994d2cdfd.	4 年前
GitHub	8f35bdd3	POCA trainer (#5005 ) Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	9e77d7e1	Merge branch 'main' into develop-soccer-groupman	4 年前
GitHub	e81e038b	Fix end episode for POCA, add warning for group reward if not POCA (#5113 ) * Fix end episode for POCA, add warning for group reward if not POCA * Add missing imports	4 年前
GitHub	63169e2c	[cherry-pick] Fix group rewards for POCA, add warning for non-POCA trainers (#5120 ) * Fix end episode for POCA, add warning for group reward if not POCA (#5113) * Fix end episode for POCA, add warning for group reward if not POCA * Add missing imports * Use np.any, which is faster	4 年前
GitHub	ef3d6e0d	Adding Hypernetwork modules and unit tests (#5141 )	4 年前
GitHub	8387e252	[release] Fix rl trainer warning (#5144 ) * Fix rl trainer warning * Fix typo	4 年前
Ervin Teng	41dd16e8	Merge branch 'main' into release_15_mm	4 年前
Ervin Teng	d1c24251	[bug-fix] When agent isn't training, don't clear update buffer (#5205 ) * Don't clear update buffer, but don't append to it either * Update changelog * Address comments * Make experience replay buffer saving more verbose (cherry picked from commit 63e7ad44d96b7663b91f005ca1d88f4f3b11dd2a)	4 年前
GitHub	3607f062	Merge release 15 into Main [release_15] Release 15 Merge into Main	4 年前
Ervin Teng	c108da4a	[bug-fix] Fix POCA LSTM, pad sequences in the back (#5206 ) * Pad buffer at the end * Fix padding in optimizer value estimate * Fix additional bugs and POCA * Fix groupmate obs, add tests * Update changelog * Improve tests * Address comments * Fix poca test * Fix buffer test * Increase entropy for Hallway * Add EOF newline * Fix Behavior Name * Address comments (cherry picked from commit 2ce6810846ba9268e4fb5fb082fa54e90414c980)	4 年前
Ervin Teng	bed4bf36	Load individual elements if state dict load fails (#5213 ) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com> (cherry picked from commit ac4f43cf18b98d0fc7063b9b831e07429f7ea39e)	4 年前
Andrew Cohen	18be47e8	Merge branch 'main' into develop-soccer-groupman-mod	4 年前
GitHub	81705d6d	Goal conditioning integration (#5142 ) * Adding Hypernetwork modules and unit tests * Edits * Integration of the hypernetowrk to the trainer * Update ml-agents/mlagents/trainers/torch/networks.py Co-authored-by: Arthur Juliani <awjuliani@gmail.com> * Making the default hyper and added the conditioning type None * Reducing the number of hypernetwork layers * addressing comments Co-authored-by: Arthur Juliani <awjuliani@gmail.com>	4 年前
vincentpierre	d4716caa	Merge branch 'main' into goal-conditioning-sensors-3	4 年前
GitHub	c37cfac1	Adding the goal conditioning sensors with the new observation specs (#5159 ) * Fixing networks.py for the merge * fix compile error * Adding the goal conditioning sensors with the new observation specs * addressing feedback * I forgot to change the m_observationType * Renaming Goal to GoalSignal (#5190) * Renaming GOAL to GOAL_SIGNAL * VectorSensorComponent to use new API * Adding docstrings * verbose pytest on github action Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
vincentpierre	1b4fd8fb	Renaming GOAL to GOAL_SIGNAL	4 年前
Ervin Teng	c05ec9af	Fix groupmate obs, add tests	4 年前
Ervin Teng	b3499848	Improve tests	4 年前
Ervin Teng	6e04aaf3	Fix poca test	4 年前
GitHub	ff21216d	[bug-fix] When agent isn't training, don't clear update buffer (#5205 ) * Don't clear update buffer, but don't append to it either * Update changelog * Address comments * Make experience replay buffer saving more verbose	4 年前
Andrew Cohen	42105f23	add load different reward tests	4 年前
Andrew Cohen	98dcb548	test convolutions can be loaded properly	4 年前
Andrew Cohen	2e5b1352	add check that layers still have different dimensions	4 年前
GitHub	cb1f5462	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	f3d586bc	Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	c5589b59	[bug-fix] Fix POCA LSTM, pad sequences in the back (#5206 ) * Pad buffer at the end * Fix padding in optimizer value estimate * Fix additional bugs and POCA * Fix groupmate obs, add tests * Update changelog * Improve tests * Address comments * Fix poca test * Fix buffer test * Increase entropy for Hallway * Add EOF newline * Fix Behavior Name * Address comments	4 年前
GitHub	9dfe6c7f	Load individual elements if state dict load fails (#5213 ) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
GitHub	fd79d92c	Extend StatsWriter to allow handling of individual stat updates (#5249 ) * Extend StatsWriter to allow callback handling of individual stat updates * Update documentation and expand test coverage.	4 年前
vincentpierre	51adab1c	Fix the attention module embedding size	4 年前
GitHub	4c776283	Fix --results-dir (#5269 )	4 年前
GitHub	353b1566	Fix the attention module embedding size (#5272 ) * Fix the attention module embedding size * editing the changelog	4 年前
GitHub	28eb43dd	[bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271 ) * Manage non-ONNX files with checkpoint manager too * Update tests * Update training status version * Change ticking of status file version	4 年前
GitHub	ed69fd2b	collecting latest step as a stat (#5264 ) * collecting latest step as a stat * adding a list of hidden_keys to TB summarywriter to hide unnecessary stats from user * fixing precommit * fixing precommit * formating * defined the property types * moving custom defaults to get_default_stats_writers * new test for TensorboardWriter.hidden_keys * improved testing * explicit None evaluation Co-authored-by: Ervin T. <ervin@unity3d.com> * make hidden_keys optional Co-authored-by: Ervin T. <ervin@unity3d.com> * adding optional argument * lowering the training threshold to 0.8 on test_var_len_obs_and_goal_poca * Update pytest.yml * Do not merge! droping pytest 3.9 job * -add back pytest -format imports and comments * back to default threshold for test_var_len_obs_and_goal_poca Co-authored-by: mahon94 <maryam.honari@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
GitHub	4995a765	[debug] Require all behavior names to have a matching YAML entry (#5210 ) * Add strict check to settings.py * Remove warning from trainer factory, add test * Add changelog * Fix test * Update changelog * Remove strict CLI options * Remove strict option, rename, make strict default * Remove newline * Update comments * Set default dict to actually default to a default dict * Fix tests * Fix tests again * Default trainer dict to requiring all fields * Fix settings typing * Use logger * Add default_settings to error	4 年前
GitHub	ae01cfc9	collecting latest step as a stat (#5264 ) (#5295 ) * collecting latest step as a stat * adding a list of hidden_keys to TB summarywriter to hide unnecessary stats from user * fixing precommit * formating * defined the property types * moving custom defaults to get_default_stats_writers * new test for TensorboardWriter.hidden_keys * improved testing * explicit None evaluation Co-authored-by: Ervin T. <ervin@unity3d.com> * make hidden_keys optional Co-authored-by: Ervin T. <ervin@unity3d.com> * adding optional argument * lowering the training threshold to 0.8 on test_var_len_obs_and_goal_poca * Update pytest.yml * Do not merge! droping pytest 3.9 job * -add back pytest -format imports and comments * back to default threshold for test_var_len_obs_and_goal_poca Co-authored-by: mahon94 <maryam.honari@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com> Co-authored-by: mahon94 <maryam.honari@unity3d.com> Co-authored-by: Ervin...	4 年前
GitHub	bff0a5d2	[debug] Require all behavior names to have a matching YAML entry (#5210 ) (#5296 ) * Add strict check to settings.py * Remove warning from trainer factory, add test * Add changelog * Fix test * Update changelog * Remove strict CLI options * Remove strict option, rename, make strict default * Remove newline * Update comments * Set default dict to actually default to a default dict * Fix tests * Fix tests again * Default trainer dict to requiring all fields * Fix settings typing * Use logger * Add default_settings to error (cherry picked from commit 86a4070bad4f5bca201db57f29117362c62617d0)	4 年前
Miguel Alonso Jr	4846cf0f	Merge branch 'main' into develop-api-documentation-update Updating with main.	4 年前
GitHub	806f04bd	Readding the validation of the minimal cnn input size (#5345 ) (#5346 )	4 年前
GitHub	15440c24	Readding the validation of the minimal cnn input size (#5345 )	4 年前
GitHub	bb07eb45	Adding a fully connected visual encoder for super small visual input + tests (#5351 ) * initial commit for a fully connected visual encoder * adding a test * addressing comments * Fixing error with minimal size of fully connected network * adding documentation and changelog	4 年前
Miguel Alonso Jr	97b7d5c6	Merge branch 'main' into develop-api-documentation-update Syncing with main.	4 年前
GitHub	b767b66b	Exclude test_visual_encoder_trains from GPU test (#5367 )	4 年前
GitHub	fc6e8c35	[🐛🔨 ] Fix sac target for continuous actions (#5372 ) * Fix of the target entropy for continuous SAC * Lowering required steps of test and remove unecessary unsqueeze * Changing the target from -dim(a)^2 to -dim(a) by removing implicit broadcasting	4 年前
GitHub	2933f235	Fix the reporting of histogram stats and adding a test (#5410 ) * Fix the reporting of histogram stats and adding a test * Appending to the Changelog	4 年前

1 2 3 4 5 ...

976 次代码提交 (a1967c19-b5e1-48c8-a4d5-7bad9f5c1420)