ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	cde3c8f7	formating and added documentation	7 年前
vincentpierre	c4745ba7	fix on the socket timeout error on windows due to the use of signal.SIGALRM	7 年前
vincentpierre	bddfb85e	changed the connection to non-blocking	7 年前
vincentpierre	e36b8bf0	added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards	7 年前
vincentpierre	7118a209	bug fix : The environment only requests actions from external brains when unique	7 年前
vincentpierre	e191fbef	added warning in case no brins are set to external	7 年前
vincentpierre	65df8ae9	fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step	7 年前
vincentpierre	0df8326e	minor fixes	7 年前
vincentpierre	5cae720d	modified Environment to send a specific error when no external brains are in the environment	7 年前
vincentpierre	ac910514	initial commit of the curriculum with broadcast. Improved the Unity python handshake	7 年前
vincentpierre	e8429059	bug fix for python3	7 年前
vincentpierre	250eb8e1	better checking of the format of the curriculum file	7 年前
vincentpierre	c16e0ac3	modified the socket to receive states and images of any size	7 年前
Arthur Juliani	b6ce30bf	Add curriculum support to PPO	7 年前
Arthur Juliani	4a11c005	Add curriculum code to notebook and simplify	7 年前
vincentpierre	6e950cd3	Can now switch inference configuration on/off in the editor. Reintroduced the broadcast feature for the non-External brains. Introduced the API number to check the compatibility between Unity and Python.	7 年前
Arthur Juliani	d1b81a32	Add push curriculum	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
Arthur Juliani	b56259f6	Fix cumulative reward (Unity) and Nan reward (python) bugs	7 年前
Arthur Juliani	216888ee	Fixed to give lesson index parameter when start up (#179 ) * fixed to give lesson parameter when start up * applied to PPO.ipynb and modified ppo.py a bit	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
vincentpierre	053c3739	Launching the environment with absolute path. Need testing on Windows and Linux	7 年前
vincentpierre	22bfd276	simplifications on launching from absolute path, bug fix : closing the environment when the file_name was wrong.	7 年前
GitHub	00534390	Refactored GridWorld (#225 ) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 年前
Arthur Juliani	2a0e9e6f	Fixed issue with unity environment not being found on MacOS (#236 ) If the internal executable can't be found, it will look for any file in the folder and run it.	7 年前
Arthur Juliani	9ded88f3	Provide support with incompatible API	7 年前
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166 ) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
vincentpierre	539c081f	modified the python side to read the logfile path from the academy parameters	7 年前
vincentpierre	5e1d05af	added the logfile_path property to the environment class. Give a link to the logfile when the timeout error is launched. Note: still need testing on windows	7 年前
vincentpierre	34b6e786	made the UnityTimeOutException that reads into the logfile when available	7 年前
vincentpierre	50f91f66	use logging instead of print Replaced the print statements with logging statements in the exception.py file Uses the same logger as the environment one named the logger unityagents	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
vincentpierre	d8f74dc9	If reset does not take either config or progress, no information is logged. Bug fix : Environment handles invalid configurations better	7 年前
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353 ) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 年前
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 年前
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 年前
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
GitHub	989dea4a	Merge pull request #132 from Unity-Technologies/dev-logfile Dev logfile	7 年前
GitHub	9ad4182e	Merge pull request #366 from Unity-Technologies/feature/cleanup [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	dcf58f75	Feature/previous text action (#375 ) * [Previous Text Actions] Renamed previous_action to previous_vector_action added previous_text_action to the BrainInfo * [Semantics] Carried the modifications to the semantics of previous_vector_action to the trainers	7 年前
GitHub	cfc6bdc8	[Fix] The environment logs information about itself when lauched. (#395 )	7 年前
GitHub	f19739cb	Update API version in anticipation of v0.3 release (#437 ) * Update API version in anticipation of v0.3 release * Use _version_ across both Unity/Python	7 年前
eshvk	2d2eb64b	[containers] Enables container support for scenes that use visual observations	7 年前
GitHub	7914387f	Develop communicator redesign (#638 ) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 年前
vincentpierre	3c2283e8	[fix tennis]	6 年前
vincentpierre	85b844cc	[Better version of the fix]	6 年前
Arthur Juliani	5d402be9	Minor Optimizations (#836 )	6 年前
GitHub	dda6ad8b	Replaced message printed in Python and in documentation. (#881 )	6 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
vincentpierre	e47cec56	[Initial Commit]	6 年前
Deric Pang	8380f2f2	Moved curriculum code out of environment code.	6 年前
Deric Pang	e580e544	Removing commented out code.	6 年前
Deric Pang	db031b07	Updating tests for refactored curriculum learning.	6 年前
Arthur Juliani	1eb701af	Merge remote-tracking branch 'origin/develop' into develop-value-estimates-ppo	6 年前
GitHub	e60272f2	New error when using In Editor Training with a non-zero worker-id (#1012 )	6 年前
Deric Pang	ff4ce695	Updated logging in trainer. - The logger in trainer.py is now unitytrainers. This makes it easier to differentiate it from unityagents logs.	6 年前
vincentpierre	7f74131d	Nan Rewards converted to 0 and throwing a warning	6 年前
Arthur Juliani	708e2bb9	Check NaN in observations (#1063 ) * Check NaN in observations * Replace math with np	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
Deric Pang	634280a6	Fixed imports, all tests are passing.	6 年前
GitHub	ded0d8c7	Develop action masking (#1080 ) * [Initial Commit] Modified the model.py file and the ppo/trainer.py file to use masked actions * Preliminary modifications to the python side of the code to enable action masking * Preliminary modifications to the C# side of the code to enable action masking * Preliminary modifications to the communication side of the code to enable action masking * Implemented action masking for BC Note : The actions of the teacher are not masked * More error messages for the action masking * fix pytests * Added Documentation * Address comment * Addressed Comments on docs * Addressed second comment on docs * Addressed comments for the python side of the code * Created the action masker and associated unit tests * Addressed comments on the C# side * Addressed the comment regarding action_masking_name * Addressed the comments	6 年前
GitHub	9ba493ef	Fixing develop after merging action masking (#1114 ) Ran into problems due to inacurate merging of develop into the branch	6 年前
GitHub	106d562d	Fix for Windows (#1120 ) addresses #1113	6 年前
Deric Pang	cdb41480	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
Deric Pang	e2940d96	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
GitHub	10d2a19d	Release v0.5 (Develop) (#1203 )	6 年前
GitHub	a54714f8	Update API to version 5 (#1179 )	6 年前
GitHub	3c9603d6	Demonstration Recorder (#1240 )	6 年前
GitHub	d7224351	Brains as Scriptable Objects (#1250 ) * Initial Commit Ported most functionalities, still need to : - Documentation - Add Comments - Custom drawer for BrainParameters - Fix the UnitTests - Review Functionalities * Added Custom Drawer for the Brain Parameters * Improvements to the HubDrawer * Modified the Brain Editors * Minor bug fixes and UI changes * Modified the Help Boxes of the Drawers * Modified Brain class, renamed Initialize and made DecideAction virtual * Fix the UnityTests * Simpler Brain creation menu * Renamed Internal Brain to Learning Brain * modified the parameters to remove reference to External or Internal in the Protobuf objects * Updated the protobuf generated files * Fix the Pytests * Removed the graph scope from the Learning Brain * cleaner logic than try catch * Removed the isExternal field of the brain and put the isTraining logic into LearningBrain and Training Hub * Modified how the Brain finds the A...	6 年前
GitHub	c4fa3893	Add file check & reuse protobuf conversion functions (#1316 )	6 年前
vincentpierre	a9b5ad37	Ticked the communication version to API-6	6 年前
GitHub	87a30e34	Support both 32-bit and 64-bit types in UnityEnvironment (#1471 ) We check for the single brain case in UnityEnvironment by checking for applicable non-dict types in the step arguments. However for ints and floats we just use `np.int_` and `np.float_` for the check, which are the defaults for your system. This means if you are using an application (like baselines in #1448) which uses the wrong int/float size an error will be thrown. This change explicitly allows both 32 and 64-bit numbers.	6 年前
GitHub	249e86a4	Ticked API : (#1696 ) * Ticked API : - Ticked API for pypi for mlagents - Ticked API for pypi for unity-gym - Ticked Communication number for API - Ticked Model Loader number for API * Ticked the API for the pytest	6 年前
GitHub	4846907e	Add timeout wait param (Develop) (#1700 ) * Add timeout wait param * Remove unnecessary function	6 年前
Jonathan Harper	35eb595d	Add back 'get_communicator' in UnityEnvironment Removing this function breaks some tests, and the only way around this at this time is a bigger refactor or hacky fixes to tests. For now, I'd suggest we just revert this small part of a change and keep a refactor in mind for the future.	6 年前
GitHub	20ff1436	Merge pull request #1765 from Unity-Technologies/release-v0.7 Release v0.7 into develop	6 年前
Vincent-Pierre BERGES	bc636075	API for sending custom protobuf messages to and from Unity. (#1595 ) * API for sending custom protobuf messages to and from Unity. * Rename custom_output to custom_outputs. * Move custom protos to their own files. * Add SetCustomOutput method. * Add docstrings. * Various adjustments. * Rename CustomParameters -> CustomResetParameters * Rename CustomOutput -> CUstomObservation * Add CustomAction * Add CustomActionResult * Remove custom action result. * Remove custom action result from Python API * Start new documentation. * Add some docstrings * Expand documentation. * Typos * Tweak doc. Also eliminate GetCustomObservation. * Fix typo. * Clarify docs. * Remove trailing whitspace	6 年前
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751 ) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 年前
eshvk	a50aadda	* Ticked API : - Ticked API for pypi for mlagents - Ticked API for pypi for mlagents_envs - Ticked Communication number for API - Ticked API for unity-gym * Ticked the API for the pytest	6 年前
Jonathan Harper	e91e847c	Fix '--slow' flag after environment updates A change was made to the way the "train_mode" flag was used by environments when SubprocessUnityEnvironment was added which was intended to be part of a separate change set. This broke the CLI '--slow' flag. This change undoes those changes, so that the slow / fast simulation option works correctly. As a minor additional change, the remaining tests from top level 'tests' folders have been moved into the new test folders.	6 年前
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black ). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 年前
GitHub	2671e1a0	Enable mypy in precommit checks (#2177 ) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md	5 年前
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209 ) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	5 年前
Jonathan Harper	9a170db5	Add Unity command line arguments	5 年前
Jonathan Harper	2f203f89	fix lint checks	5 年前
GitHub	f82f0f37	Get timers from subprocess (#2268 ) * Timer proof-of-concept * micro optimizations * add some timers * cleanup, add asserts * Cleanup (no start/end methods) and handle exceptions * unit test and decorator * move output code, add a decorator * cleanup * module docstring * actually write the timings when done with training * use __qualname__ instead * add a few more timers * fix mock import * fix unit test * get timers from worker process (WIP) * clean up timer merging * typo * WIP * cleanup merging code * bad merge * undo accidental change * remove reset command * fix style * fix unit tests * fix unit tests (they got overwrote in merge) * get timer root though a function * timer around communicate	5 年前
GitHub	33cb438b	Tick version number for 0.9 (#2331 ) * Tick versions of gym, ml-agents, ml-agents-envs * Tick communication API to 9	5 年前
Jonathan Harper	98297be9	Fix training not quitting when play button is unchecked (#2376 ) This fixes an issue where stopping the game when training in the Editor won't end training, due to the new asynchronous SubprocessEnvManager changes. Another minor change was made to move the `env_manager.close()` in TrainerController to the end of `start_learning` so that we are more likely to save the model if something goes wrong during the environment shutdown (this occurs sometimes on Windows machines).	5 年前
GitHub	eaf8ca35	Add clearer message for bad permissions (#2539 )	5 年前
GitHub	babe9e2f	Develop remove academy done (#2519 ) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 年前
GitHub	67d754c5	Fix flake8 import warnings (#2584 ) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 年前
GitHub	e1d93a0e	Allow mypy to reject incomplete defs for mlagents-envs (#2585 ) This wasn't working before because of several remaining partially defined function definitions.	5 年前
GitHub	d64a01e1	Added option to use environment arguments in learn (#2594 ) * Added option to use environment arguments in learn * hook into argparse * add example to readme	5 年前
GitHub	6f9a2dfa	Tick version of API and pypi packages to 10 (#2610 ) * Tick versions for pip packages * Tick API version to 10	5 年前
GitHub	89b1c7a8	Better environment shutdown (#2620 ) * Wait for env process to exit before killing it * don't propagate signals, better error logging * set proc1 to None when done * comments	5 年前
Jonathan Harper	3fc14963	EXPERIMENTAL horovod support	5 年前
GitHub	2f74b3cc	Rename protobuf objects to be suffixed with 'Proto' in python and C#. (#2646 )	5 年前
GitHub	8e931d8d	Merge branch 'develop' into release-0.10.0	5 年前
GitHub	24ba9d58	Develop deprecate broadcasting (#2669 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modifie...	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
GitHub	0892ef2c	[WIP] ISensor interface and use for visual observations (#2731 ) * ISensor and SensorBase * camera and rendertex first pass * use isensors for visual obs * Update gridworld with CameraSensors * compressed obs for reals * Remove AgentInfo.visualObservations * better separation of train and inference sensor calls * compressed obs proto - need CI to generate code * int32 * get proto name right * run protoc locally for new fiels * apply generated proto patch (pyi files were weird) * don't repeat bytes * hook up compressedobs * dont send BrainParameters until there's an AgentInfo * python BrainParameters now needs an AgentInfo to create * remove last (I hope) dependency on camerares * remove CameraResolutions and AgentInfo.visual_observations * update mypy-protobuf version * cleanup todos * python cleanup * more unit test fixes * more unit test fix * camera sensors for VisualFood collector, record demo * SensorCompon...	5 年前
Jonathan Harper	6fb6bb4c	Update package and communicator versions to 0.11	5 年前
GitHub	0fe5adc2	Develop remove memories (#2795 ) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 年前
Jonathan Harper	bae94a76	Add timeout for communicator exchange When we initially connect to the environment using RPCCommunicator, the connection is polled so we don't hang forever on `.recv()` when the environment wasn't launched or failed. However we don't currently have any similar check for the exchanges mid-training-run. This change applies the same timeout from initialization to each exchange, and extends the default `timeout_wait` to 60 seconds to generally improve the chances we won't have a mismatch between environment launch time and the trainer timeout. Tested on: single-env and multi-env cases. Killed 1 environment process manually and saw that the model was saved appropriately and all processes closed.	5 年前
GitHub	9957b699	class variable for API verison, fix env tests (#2817 )	5 年前
GitHub	6ba6f08c	Merge 0.11.0 to develop (#2825 ) * Update package and communicator versions to 0.11 * Remove pip cache fallback for CircleCI This change removes the caching fallback in the case where dependencies change, since it can cause CI failures when we have incompatible dependencies in the cache. * Limit Tensorflow version for tests to <2.0 * Use stable bokken image. (#2815) * build fixes for 2018+ (#2808) * rename CompressionType enum * fix standalone build test for 2018+ * Add more editor versions for testing. (#2809) * class variable for API verison, fix env tests (#2817) * fixed area prefab agents were pointing to the wrong laser gameObject.	5 年前
GitHub	c6c01a03	Enable pylint and fix a few things (#2767 ) * enable pylint, disable some messages and fix a few * SAC memories in init	5 年前
GitHub	ccb7eab4	Remove {text,custom} {action,observations} (#2839 ) * delete text actions and obs * delete custom actions and obs * regenerate protos * cleanup C# * format * fix tests * fix base env signature * doc cleanup	5 年前
GitHub	99981937	fix errors from new flake8-comprehensions (#2917 )	5 年前
GitHub	e2eef3c4	Clean up env logging on initialization (#2950 )	5 年前
GitHub	c57884dc	bump version strings (#2955 ) * bump version strings * API version strings too	5 年前
GitHub	11243348	Develop side channel (#2956 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChannel() env = UnityEnvironme...	5 年前
GitHub	a4c111f4	Merge pull request #3012 from Unity-Technologies/release-0.12.0-to-develop Release 0.12.0 to develop	5 年前
GitHub	a71c67d9	better logging for ports and versions (#3048 ) (#3069 )	5 年前
GitHub	8ec5ab62	Develop side channels: migrate reset parameters (#2990 ) * [WIP] Side Channel initial layout * Working prototype for raw bytes * fixing format mistake * Added some errors and some unit tests in C# * Added the side channel for the Engine Configuration. (#2958) * Added the side channel for the Engine Configuration. Note that this change does not require modifying a lot of files : - Adding a sender in Python - Adding a receiver in C# - subscribe the receiver to the communicator (here is a one liner in the Academy) - Add the side channel to the Python UnityEnvironment (not represented here) Adding the side channel to the environment would look like such : ```python from mlagents.envs.environment import UnityEnvironment from mlagents.envs.side_channel.raw_bytes_channel import RawBytesChannel from mlagents.envs.side_channel.engine_configuration_channel import EngineConfigurationChannel channel0 = RawBytesChannel() channel1 = EngineConfigurationChanne...	5 年前
GitHub	a6df9f43	Develop new ll api (#3022 ) * initial commit for LL-API * fixing ml-agents-envs tests * Implementing action masks * training is fixed for 3DBall * Tests all fixed, gym is broken and missing documentation changes * adding case where no vector obs * Fixed Gym * fixing tests of float64 * fixing float64 * reverting some of brain.py * removing old proto apis * comment type fixes * added properties to AgentGroupSpec and edited the notebooks. * clearing the notebook outputs * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update gym-unity/gym_unity/tests/test_gym.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update ml-agents-envs/mlagents/envs/base_env.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * addressing first comments * NaN checks for r...	5 年前
GitHub	15050bc4	better logging for ports and versions (#3048 )	5 年前
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049 )	5 年前
Chris Elion	fdc810ff	move (first pass)	5 年前
GitHub	58b6c7c2	Rename mlagents.envs to mlagents_envs (#3083 )	5 年前
GitHub	c0c26d2f	tick communication API (#3154 )	5 年前
GitHub	5ce669f9	add dev0 suffix to versions (#3268 )	5 年前
GitHub	88ad0eea	remove obsolete param from docstring (#3317 )	5 年前
GitHub	77c3c343	Updating version number (#3367 ) * updating version number * fixing version numbers	5 年前
GitHub	b8bfc79c	set package and API to 0.15.0-dev0 (#3369 )	5 年前
GitHub	2ac92182	constant for editor port (#3396 ) * constant for editor port * undo stupid pycharm * cleanup	5 年前
Alphonso Crawford	2c14779c	moving launch check to static method	5 年前
Alphonso Crawford	cff1a003	pylint error resolution	5 年前
GitHub	6f65b0e3	Update ml-agents-envs/mlagents_envs/environment.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com>	5 年前
Alphonso Crawford	40f1f6ed	validate_environment_path	5 年前
Alphonso Crawford	d308d713	Returning None for early exit	5 年前
Alphonso Crawford	2a154bf3	Moving env_strip to validate_environment_path	5 年前
Alphonso Crawford	d495f4c8	improper indendtaton in validate_environment_path	5 年前
GitHub	c38dd44c	Develop SideChannel new api (#3425 ) * Make ChannelId a property and renamed ReservedChannelId * Changes on the Python side for consistency * Modified the tutorial appropriately * fixing bugs * Update ml-agents-envs/mlagents_envs/environment.py Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update com.unity.ml-agents/Runtime/Grpc/RpcCommunicator.cs Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Addressing comments * Update docs/Python-API.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Added a Utils class on the side channel (#3447) - No change in user facing API - Simplifies the code in the side channel implementations as it makes it easier to check if a side channel id is within ranges - No changes to tests - No changes to Documentation * Simplifying * Fixing a bug * Replace the int ChannelId with a GUID/UUID ChannelId (#3454) * renaming channel_type to channel_id * Making the constant GUID const...	5 年前
GitHub	472f9f0e	Merge branch 'master' into develop-badEnvReturnCode	5 年前
GitHub	b5a3b1de	Rename --port commandline arg to --mlagents-port (#3477 )	5 年前
GitHub	414cd0a6	Update environment.py __init__ params were over indented	5 年前
Alphonso Crawford	35e49f5d	Using f-strings for exception strings	5 年前
GitHub	24145c22	Merge pull request #3438 from Unity-Technologies/develop-badEnvReturnCode Raise Exception if path does not exist [Bug Fix]	5 年前
GitHub	e5108d2c	Communication protocol versioning (#3535 )	5 年前
Anupam Bhatnagar	abc369a6	Adding a logging utility for improved logs	5 年前
Anupam Bhatnagar	e8e0078e	first commit	5 年前
Chris Elion	841b0937	SideChannel helper messages	5 年前
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698 ) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 年前
GitHub	5d4f7f08	cleanup port logic in UnityEnvironment (#3673 )	5 年前
GitHub	a46e7237	Shorten timeout duration for environment close (#3679 ) The timeout duration for closing an environment was set to the same duration as the timeout when waiting for a response from the still-running environment. This led to long waits for the error response when communication version wasn't matching. This change forces a timeout duration of 0 when handling errors.	5 年前
GitHub	458e68f1	Remove "docker target" feature (#3687 ) The "docker target" feature and associated command-line flag --docker-target-name were created for use with the now-deprecated Docker setup. This feature redirects the paths used by learn.py for the environment and config files to be based from a directory other than the current working directory. Additionally it wrapped the environment execution with xvfb-run. This commit removes the "docker target" feature because: * Renaming the paths doesn't fix any problem. Absolute paths can already be passed for configs and environment executables. * Use of xserver, Xvfb, or xvfb-run are independent of mlagents-learn and can be used outside of the mlagents-learn call. Further, xvfb-run is not the only solution for software rendering.	5 年前
GitHub	4ecd6ad3	Fix how we set logging levels (#3703 ) * cleanup logging * comments and cleanup * pylint, gym	5 年前
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681 ) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 年前
GitHub	078c6502	Make the DecisionRequester public and customizable. (#3716 ) * [API] Make the DecisionRequester public and add a delegate to its API to allow users to customize it's behavior. - Rename Academy.AgentSetStatus to Academy.AgentPreStep and make it public. - Fix Unity library cache issues for backwards compatibility tests. - Collect standalone build and logs to artifacts for standalone build jobs. - cat standalone build log if the build fails. - Default verbose to False for standalone build test. * disable backward compatibility test, bump communication version. * still run training tests on latest. * fix yml parse error.	5 年前
GitHub	989cc89f	Use SemVer to check communication compatibility between C# and Python (#3760 ) * [communication] Use semantic versioning to test communication compatibility between C# and Python. - Add tests for the change. Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
GitHub	ea0c6fa0	[WIP] Side Channel Design Changes (#3807 ) * Make EnvironmentParameters a first-class citizen in the API Missing: Python conterparts and testing. * Minor comment fix to Engine Parameters * A second minor fix. * Make EngineConfigChannel Internal and add a singleton/sealed accessor * Make StatsSideChannel Internal and add a singleton/sealed accessor * Changes to SideChannelUtils - Disallow two sidechannels of the same type to be added - Remove GetSideChannels that return a list as that is now unnecessary - Make most methods except (register/unregister) internal to limit users impacting the “system-level” side channels - Add an improved comment to SideChannel.cs * Added Dispose methods to system-level sidechannel wrappers - Specifically to StatsRecorder, EnvironmentParameters and EngineParameters. - Updated Academy.Dispose to take advantage of these. - Updated Editor tests to cover all three “system-level” side channels. Kudos to Unit Tests (TestAcade...	5 年前
GitHub	85789ded	Add capabilities checks bewteen C# and Python codebases. (#3831 )	5 年前
GitHub	1e582745	Doc link fix (#3865 ) * Make all doc links point to release_1_docs tag * fix 0.15.1 link * relative links in readme * fix link in env warnings * more link fixes	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前
GitHub	ebe12502	[bug-fix] Fix exception thrown when quitting in-editor training from editor (#3885 )	5 年前
GitHub	ce6e906d	set communication to 1.0.0 (#3896 )	5 年前
Chris Elion	68b68396	Merge remote-tracking branch 'origin/master' into release_1_to_master	5 年前
GitHub	d2bc86c8	Release 2 cherry pick (#3971 ) * [bug-fix] Fix issue with initialize not resetting step count (#3962) * Develop better error message for #3953 (#3963) * Making the error for wrong number of agents raise consistently * Better error message for inputs of wrong dimensions * Fix #3932, stop the editor from going into a loop when a prefab is selected. (#3949) * Minor doc updates to release * add unit tests and fix exceptions (#3930) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com>	5 年前
GitHub	812983c0	Some improvements to the UnityEnvironment class (#3939 ) * Fix typo * Made a side channel utils to reduce the complexity of UnityEnvironment * Added a get_side_channel_dict utils method * Better executable launcher (unarguably) * Fixing the broken test * Addressing comments * [skip ci] Update ml-agents-envs/mlagents_envs/side_channel/side_channel_manager.py Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com> * No catch all Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	5 年前
GitHub	c6ed3789	Replaced get_behavior_names and get_behavior_spec with behavior_specs property (#3946 ) * Replaced get_behavior_names and get_behavior_spec with behavior_specs property * Fixing the test * [ci] * addressing some comments * use typing.Mapping (#3948) * Update ml-agents-envs/mlagents_envs/base_env.py Co-authored-by: Chris Elion <chris.elion@unity3d.com> * Adding the documentation Co-authored-by: Chris Elion <chris.elion@unity3d.com>	5 年前
Christopher Goy	ba80b292	format files with pre-commit.	4 年前
GitHub	9083752d	Making some things private in UnityEnvironment (#3951 ) * Making some things private in UnityEnvironment * Readding the default ports as public * removing _SCALAR_ACTION_TYPES and _SINGLE_BRAIN_ACTION_TYPES * Removing unused method	5 年前
GitHub	8bae4088	Develop better error message for #3953 (#3963 ) * Making the error for wrong number of agents raise consistently * Better error message for inputs of wrong dimensions	5 年前
Chris Elion	7f8258a8	update yamato tests from master	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	9d840f22	Increase communicator version for concatenated PNGs. (#4462 )	4 年前
GitHub	c3d2b902	Support multi-dimensional and compressed observations stacking (#4476 ) Added stacking to multi-dimensional and compressed observations and added compressed channel mapping in communicator to support decompression. Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Ruo-Ping Dong	91700916	add hybrid action capability flag	4 年前
Andrew Cohen	60510f45	use proper spec in environment.py	4 年前
GitHub	cb8e4d25	Add ActionSpec (#4586 ) Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	9689cf2c	remove _action_ from function names	4 年前
GitHub	b853e5ba	Action buffer (#4612 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	55a40cae	moved type and shape checking into ActionSpec	4 年前
Andrew Cohen	1bbe492c	fixed tests/ -> single validate_action func	4 年前
Andrew Cohen	a4c3e26a	make validate action private	4 年前
Andrew Cohen	3f771e61	add ActionBuffers and utils	4 年前
GitHub	559e0ee5	Fix set_action_for_agent (#4691 ) * fixes * Update CHANGELOG.md Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
GitHub	94c59e31	C# changes for hybrid action spaces (#4587 ) * Add hybrid action capability flag (#4576) * Change BrainParametersProto to support ActionSpec (#4579) * Assign new BrainParametersProto fields based on capabilities (#4581) * ActionBuffer with hybrid actions for RemotePolicy (#4592) * Barracuda inference for hybrid actions (#4611) * Refactor BarracudaModel loader checks (#4629) * Export separate nodes for continuous/discrete actions (#4655) * Separate continuous/discrete actions in AgentActionProto (#4698) * Force different nodes for new and deprecated action output (#4705)	4 年前
Andrew Cohen	f6355ba9	Merge branch 'develop-action-spec' into develop-action-buffer	4 年前
Andrew Cohen	d624b54b	Merge branch 'master' into fix-conflict-base-env	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
Andrew Cohen	4ebc6c44	ml-agents-envs pass	4 年前
GitHub	43d74c0a	[MLA-1587] Don't warn about minor version mismatch, add links in specific messages (#4688 ) (#4759 ) * Don't warn about minor version mismatch, add links in specific messages * changelog * fix tests	4 年前
Andrew Cohen	157f9e77	rename to ActionTuple	4 年前
Andrew Cohen	f2c9d184	fix set_actions_for_agent	4 年前
GitHub	caac7324	Update ml-agents-envs/mlagents_envs/environment.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 年前
Ruo-Ping Dong	8ed14762	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 年前
GitHub	d7c3022d	Export separate nodes for continuous/discrete actions (#4655 )	4 年前
GitHub	7387a77f	remove pylint (#4836 ) * remove pylint * remove other pylint disables	4 年前
GitHub	2af86534	[MLA-1712] Make UnityEnvironment fail fast if the env crashes (#4880 )	4 年前
GitHub	d8835857	[MLA-1540] Training Analytics (#4780 )	4 年前
vincentpierre	b7e8e4f6	Adding a capabilities flag and bumping the communication to 1.5.0	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
Ervin Teng	fd2dc688	[🐛 🔨 ] set_action_for_agent expects a ActionTuple with batch size 1. (#5208 ) * [Bug Fix] set_action_for_agent expects a ActionTuple with batch size 1. * moving a line around (cherry picked from commit aac2ee6cb650e6969a6d8b9f7c966f69b9e2df04)	4 年前
GitHub	2e19759c	Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211 ) * turning some logger.info into logger.debug and remove some logging overhead when not using debug * Addressing comments * Adding to changelog	4 年前

1 2 3 4 5

202 次代码提交 (04dd1c46-97c8-419b-988c-77b28e294388)