ml-agents

作者	SHA1	备注	提交日期
Jonathan Harper	7a0d1531	Fix subprocess model saving on Windows On Windows the interrupt for subprocesses works in a different way from OSX/Linux. The result is that child subprocesses and their pipes may close while the parent process is still running during a keyboard (ctrl+C) interrupt. To handle this, this change adds handling for EOFError and BrokenPipeError exceptions when interacting with subprocess environments. Additional management is also added to be sure when using parallel runs using the "num-runs" option that the threads for each run are joined and KeyboardInterrupts are handled. These changes made the "_win_handler" we used to specially manage interrupts on Windows unnecessary, so they have been removed.	6 年前
Jonathan Harper	18bedf6a	Fix parallel writes to UnitySDK.log on Windows When using the SubprocessUnityEnvironment, parallel writes are made to UnitySDK.log. This causes file access violation issues in Windows/C#. This change modifies the access and sharing mode for our writes to UnitySDK.log to fix the issue.	6 年前
GitHub	d906273a	Fix environment factory pickling on Windows (#1912 ) SubprocessUnityEnvironment sends an environment factory function to each worker which it can use to create a UnityEnvironment to interact with. We use Python's standard multiprocessing library, which pickles all data sent to the subprocess. The built-in pickle library doesn't pickle function objects on Windows machines (tested with Python 3.6 on Windows 10 Pro). This PR adds cloudpickle as a dependency in order to serialize the environment factory. Other implementations of subprocess environments do the same: https://github.com/openai/baselines/blob/master/baselines/common/vec_env/subproc_vec_env.py	6 年前
Jonathan Harper	8616db5d	Fix not saving .nn file after max_timesteps (#1896 ) Sends close command when closing workers.	6 年前
Jonathan Harper	d8549567	Add documentation for new multi-env CLI flags We need to document the meaning of the two new flags added for multi-environment training. We may also want to add more specific instructions for people wanting to speed up training in the future.	6 年前
Ervin T	6ca50994	Install dependencies for ml-agents-envs and ml-agents in Docker (#1895 )	6 年前
Vincent(Yuan) Gao	a28b9c58	Included TFS page to redirect (#1893 ) * Create Using-TensorFlow-Sharp-in-Unity.md * Update Using-TensorFlow-Sharp-in-Unity.md * Update Using-TensorFlow-Sharp-in-Unity.md	6 年前
GitHub	eb90ad80	Merge pull request #1894 from Unity-Technologies/develop-esh-docker Install dependencies for ml-agents-envs and ml-agents in Docker	6 年前
eshvk	515c15a2	Install dependencies for ml-agents-envs and ml-agents in Docker	6 年前
eshvk	a50aadda	* Ticked API : - Ticked API for pypi for mlagents - Ticked API for pypi for mlagents_envs - Ticked Communication number for API - Ticked API for unity-gym * Ticked the API for the pytest	6 年前
GitHub	d32f8b81	Soccer Twos - Fixes missing tag change, plus code cleanup (#1813 ) * Fixes missing tag change, plus code cleanup * Fix bug in agent position setting	6 年前
GitHub	93760bc4	Adds SubprocessUnityEnvironment for parallel envs (#1751 ) This commit adds support for running Unity environments in parallel. An abstract base class was created for UnityEnvironment which a new SubprocessUnityEnvironment inherits from. SubprocessUnityEnvironment communicates through a pipe in order to send commands which will be run in parallel to its workers. A few significant changes needed to be made as a side-effect: * UnityEnvironments are created via a factory method (a closure) rather than being directly created by the main process. * In mlagents-learn "worker-id" has been replaced by "base-port" and "num-envs", and worker_ids are automatically assigned across runs. * BrainInfo objects now convert all fields to numpy arrays or lists to avoid serialization issues.	6 年前
GitHub	a0b44f1b	Merge pull request #1858 from Unity-Technologies/develop-esh-metrics Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return per policy	6 年前
eshvk	fb04c40c	Reorganize to make metrics collection more accurate	6 年前
eshvk	cc9bdf17	Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return	6 年前
Arthur Juliani	2409fc6a	Adding instructions to Basic Guide for Running in Python (#1725 ) * add "Control" check instruction (#1719)	6 年前
GitHub	65ba0eb7	Update to documentation (#1872 ) * Update to documentation * Update Custom-Protos.md	6 年前
Ervin T	b30f4c90	Split `mlagents` into two packages (#1812 ) * Reogranize project * Fix all tests * Address comments * Delete init file * Update requirements * Tick version * Add timeout wait parameter (mlagents_envs) (#1699) * Add timeout wait param * Remove unnecessary function * Add new meta files for communicator objects * Fix all tests * update circleci * Reorganize mlagents_envs tests * WIP: test removing circleci cache * Move gym tests * Namespaced packages * Update installation instructions for separate packages * Remove unused package from setup script * Add Readme for ml-agents-envs * Clarify docs and re-comment compiler in make.bat * Add more doc to installation * Add back fix for Hololens * Recompile Protobufs * Change mlagents_envs to mlagents.envs in trainer_controller * Remove extraneous files, fix win bat script * Support Python 3.7 for envs package	6 年前
GitHub	6f8fc130	External Contribution: Use RenderTexture instead of Camera for Visual Observation (#1824 ) * Added RenderTexture support for visual observations * Cleaned up new ObservationToTexture function * Added check for to width/height of RenderTexture * Added check to hide HelpBox unless both cameras and RenderTextures are used * Added documentation for Visual Observations using RenderTextures * Added GridWorldRenderTexture Example scene * Adjusted image size of doc images * Added GridWorld example reference * Fixed missing reference in the GridWorldRenderTexture scene and resaved the agent prefab * Fix prefab instantiation and render timing in GridWorldRenderTexture * Added screenshot and reworded documentation * Unchecked control box * Rename renderTexture * Make RenderTexture scene default for GridWorld Co-authored-by: Mads Johansen <pyjamads@gmail.com>	6 年前
Vincent-Pierre BERGES	85c82247	Fix typos in Gym Wrapper README.md (#1823 )	6 年前
Vincent-Pierre BERGES	8373b998	Update BrainParametersDrawer.cs (#1840 )	6 年前
Vincent-Pierre BERGES	bc636075	API for sending custom protobuf messages to and from Unity. (#1595 ) * API for sending custom protobuf messages to and from Unity. * Rename custom_output to custom_outputs. * Move custom protos to their own files. * Add SetCustomOutput method. * Add docstrings. * Various adjustments. * Rename CustomParameters -> CustomResetParameters * Rename CustomOutput -> CUstomObservation * Add CustomAction * Add CustomActionResult * Remove custom action result. * Remove custom action result from Python API * Start new documentation. * Add some docstrings * Expand documentation. * Typos * Tweak doc. Also eliminate GetCustomObservation. * Fix typo. * Clarify docs. * Remove trailing whitspace	6 年前
Vincent-Pierre BERGES	db1ff84b	Fixed compilation under Scripting API compatibility level ".NET Standard 2.0" (#1869 ) Fixing #1779	6 年前
GitHub	52ea887d	Add doc about AVX support (#1865 )	6 年前
GitHub	5662622c	Develop codacy test (#1667 ) * fixed the test break on pytest > 4.0, added the pytest cov * added the pytest-cov package * added the logic to upload coverage.yml report to codacy * remove the warning message in during the pytest * added the codacy badge to show what it looks like * added a space * removed the space * removed the duplicate pytest * removed the extra spaces * added the test coverage badge * point the badge to the test branch * changed * moved the python test coverage to circleci * removed the badge * added the badge * fixed the link * Added the gym_unity test to the circleci * Fixed the gym_unity installation * Changed the test-reports from the ml-agents subfolder to the root folder, so that it covers gym_unity’s pytest also	6 年前
Vincent-Pierre BERGES	eefe0d6b	Optimisation - Removed a lot of garbage allocation (#1804 ) * Garbage collection optimisations: - Changed a few IEnumerable instances to IReadOnlyList. This avoids some unnecessary GC allocs that cast the Lists to IEnumerables. - Moved cdf allocation outside of the loop to avoid unnecessary GC allocation. - Changed GeneratorImpl to use plain float and int arrays instead of Array during generation. This avoids SetValue performing boxing on the arrays, which eliminates an awful lot of GC allocs. * Convert InferenceBrain to use IReadOnlyList to avoid garbage creation.	6 年前
GitHub	be0d2709	Refactor RayPerception and add RayPerception2D (#1793 ) * Fix typos * Use abstract class for rayperception * Created RayPerception2D. (#1721) * Incorporate RayPerception2D * Fix typo * Make abstract class * Add tests	6 年前
Vincent-Pierre BERGES	9b00c012	Fix for Brains not reinitialising when the scene is reloaded. (#1758 ) * Fix for Brains not reinitialising when the scene is reloaded. This was a bug caused by the conversion of Brains over to ScriptableObjects. ScriptableObjects persist in memory between scene changes, which means that after a scene change the Brains would still be initialised and the agentInfos list would contain invalid references to the Agents from the previous scene. The fix is to have the Academy notify the Brains when it is destroyed. This allows the Brains to clean themselves up and transition back to an uninitialised state. After the new scene is loaded, the Brain's LazyInitialise will reconnect the Brain to the new Academy as expected. * Fix for Brains not reinitialising when the scene is reloaded. This was a bug caused by the conversion of Brains over to ScriptableObjects. ScriptableObjects persist in memory between scene changes, which means that after a scene change the Brains would still be...	6 年前
Arthur Juliani	bf10ed81	Add line to describe what Branch Descriptions is (#1772 )	6 年前
GitHub	b99aa703	Merge pull request #1771 from markovuksanovic/patch-2 Fixup link	6 年前
GitHub	a217c6bc	Fixup link Fixup link which point to instructions on how to install tensorflow using anaconda.	6 年前
GitHub	20ff1436	Merge pull request #1765 from Unity-Technologies/release-v0.7 Release v0.7 into develop	6 年前
Vincent-Pierre BERGES	ed1b7f33	Retrained models for Release 0.7 and deleted random prefab for bouncer (#1761 )	6 年前
GitHub	cb9816d7	Adding the new icon for the NNModel (#1748 )	6 年前
GitHub	cfb8f208	Release v0.7 minor fixes (#1759 ) * Fix typo * Updated some of the scenes	6 年前
GitHub	a84dccab	Update Timeout error messages (#1750 )	6 年前
vincentpierre	b5d055c0	Linking Tennis Model to Tennis Brain	6 年前
Vincent-Pierre BERGES	d67eaf05	Release 0.7 Fix gym_unity Tests (#1744 ) * Change gym-unity tests to use Mock instead of MockCommunicator * move creation of mock objects into helper functions * Fix comment * Fix Codacy errors * Fix ending whitespace * Minor fixes	6 年前
GitHub	7703355e	Edited the Tennis code and retrained the model (#1746 ) Addressing #1739	6 年前
GitHub	90a66686	Release v0.7 disable gRPC on non supported platforms (#1743 ) * Fix for GRPC, need documentation * Edits * typo * Fixes * Missing typo * Modified the documentation * Updated the documentation	6 年前
GitHub	eb90772f	Added comment on OpenGL 3.0 emulation (#1735 ) * Added comment on OpenGL 3.0 emulation * Updated line change	6 年前
vincentpierre	06caa8d0	Set the default Package Name on Android to com.Company.ProductName	6 年前
vincentpierre	1a13ab44	Added missing meta files	6 年前
vincentpierre	b6864a25	Removing dead Links	6 年前
vincentpierre	3f85800b	Capitalization	6 年前
Vincent-Pierre BERGES	018d0793	Develop windows install instructions update (#1760 ) * As discussed here: https://github.com/Unity-Technologies/ml-agents/issues/1706 Fixed typos on Installation-Windows.md that made instructions unclear. Added warning about overwriting drivers. * edits * edits, formatting * Incorporated feedback from @vincentpierre	6 年前
vincentpierre	781d661c	Fixing Typos	6 年前
GitHub	c258b1c3	Move 'take_action' into Policy class (#1669 ) * Move 'take_action' into Policy class This refactor is part of Actor-Trainer separation. Since policies will be distributed across actors in separate processes which share a single trainer, taking an action should be the responsibility of the policy. This change makes a few smaller changes: * Combines `take_action` logic between trainers, making it more generic * Adds an `ActionInfo` data class to be more explicit about the data returned by the policy, only used by TrainerController and policy for now. * Moves trainer stats logic out of `take_action` and into `add_experiences` * Renames 'take_action' to 'get_action'	6 年前
Jonathan Harper	35eb595d	Add back 'get_communicator' in UnityEnvironment Removing this function breaks some tests, and the only way around this at this time is a bigger refactor or hacky fixes to tests. For now, I'd suggest we just revert this small part of a change and keep a refactor in mind for the future.	6 年前
GitHub	4846907e	Add timeout wait param (Develop) (#1700 ) * Add timeout wait param * Remove unnecessary function	6 年前

1 2 3 4 5 ...

1182 次代码提交 (430b9354-7460-40c4-a94c-f760a3296cd5)