ml-agents

Author	SHA1	Message	Commit date
eshvk	680b0767	[Imitation Learning] Minor fix to make sure that step increment loads from the last saved global step if the model is being trained after loading	6 years ago
GitHub	678e5dab	Merge pull request #837 from Unity-Technologies/develop-fix-tennis [fix tennis]	6 years ago
vincentpierre	85b844cc	[Better version of the fix]	6 years ago
vincentpierre	3c2283e8	[fix tennis]	6 years ago
Arthur Juliani	d7338050	Enable concurrent sessions	6 years ago
Arthur Juliani	463ca9af	[removed playground] (#831)	7 years ago
Arthur Juliani	d4a2df66	Namespacification (#814) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 years ago
vincentpierre	a22c0f65	[fixing encoding_size]	7 years ago
Arthur Juliani	5abb001b	[Add curiosity_enc_size: 128 to the trainer_config.yaml] (#826)	7 years ago
GitHub	cd6559e5	Capitalize material names (#822) * Capitalize material names	7 years ago
GitHub	9ab98584	Additional Environment Variations (#791) * Add Visual (Camera) and Imitation Learning variations to example environments	7 years ago
GitHub	c7890e88	[Removed the JSON library] (#816) Removed the JSON dll and associated meta files Removed references to JSON libraries in the GridwWorld Agent script	7 years ago
GitHub	c17937ef	Curiosity Driven Exploration & Pyramids Environments (#739) * Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer. * To enable, set use_curiosity flag to true in hyperparameter file. * Includes refactor of unitytrainers model code to accommodate new feature. * Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.	7 years ago
GitHub	702d98c6	[Fix] The summary writer is now implemented in the abtract trainer class. (#806) Summary writer now displays {}: Step: {}. No episode was completed since last summary. when there was no completed episodes	7 years ago
vincentpierre	2a591aaa	Update project version to 2018.1 release (#795)	7 years ago
GitHub	88bd0b54	[Documentation for in Editor Training] (#773) * [Documentation for in Editor Training] * [Addressed the comments] * [Addressing unofficial comments] * [Addressed comments] * [Addressed more comments]	7 years ago
GitHub	7914387f	Develop communicator redesign (#638) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 years ago
GitHub	bdeb506c	TensorFlowSharp 1.7 upgrade package (#746) * some random change so that I can create this PR * docs update for TensorFlowSharp new version * changed the links to the new unitypackage file * resolved conflicts, updated the pictures for CUDA 9.0 * fixed a typo * resolved arthur's comment * blurred the usernames * modified the AWS doc * resolved Vince's comment	7 years ago
GitHub	38700012	Merge pull request #786 from LeighS/patch-1 Fixed path variables for Anacondo3	7 years ago
Leigh Shayler	122c5f7a	Fixed path variables for Anacondo3	7 years ago
GitHub	ffcf8c9c	Newer Ascii Art (#780) Replaced UNITY ML AGENTS with the unity logo	7 years ago
Arthur Juliani	0264de49	[Update Curriculum for WallJump] Updating the curriculum for WallJump (#774)	7 years ago
Arthur Juliani	f66a306c	Update Learning-Environment-Create-New.md (#770) Shouldn't Done(); be placed after the rewards are given?	7 years ago
Arthur Juliani	d36b370c	Update Learning-Environment-Create-New.md (#769) Some suggestions to avoid ambiguity	7 years ago
Arthur Juliani	ce5e2dba	[Added Ascii art on learn.py] (#727) * [Added Ascii art on learn.py] Note : This is by far the best feature of 0.4	7 years ago
Arthur Juliani	4d98b4c7	Monitor without JSON Conversion (#724) * [Refactor] Fixed line indentation * Removed the library Newtonsoft.Json from the monitor * Replaced calls to JSON converstion with manual conversion * [Modified] The Monitor now has multiple * Log methods that take different object types	7 years ago
Arthur Juliani	5590abfb	Fix Explicit Documentation Issue (#776)	7 years ago
Arthur Juliani	b1a30f84	* Add benchmark thresholds for example environments	7 years ago
Arthur Juliani	89f89099	Clarify documentation for step() in Python-API.md (#775) - Indent the section about providing actions to multiple brains to be in line with the rest of the step() docs. - Move the line about what step() returns closer to the top of the docs so it's harder to overlook. - Add a small code snippet about how to get BrainInfo belonging to a specific brain and how to get data from that BrainInfo object.	7 years ago
GitHub	38098a12	[Fixed BC with LSTM] (#766) Fixes the issue raised by @hsaikia in #552 Added the memory_size variable to the BC model Added memory_size and recurrent_out to the output nodes of the graph when using BC with LSTM	7 years ago
GitHub	d2da48e0	Merge pull request #772 from Unity-Technologies/develop-doc-videolinks added video links	7 years ago
Vincent Gao	d8e1a24e	added video links	7 years ago
Arthur Juliani	898874c7	Develop docs azure (#744) * First draft of Azure support docs * Correcting links to other docs * Adding additional links and cleaning instructions * Adding references to Azure docs in other appropriate places	7 years ago
GitHub	9594f3d8	Walker Environment (#720) * Add `Walker` example environment and documentation.	7 years ago
Arthur Juliani	9477eaa9	Develop fix cumulative reward (#725) * [Cold Fix] Split the way cummulative rewards and episode length are counted The reward is appended at each step to the cummulative reward The episode count is ONLY incremented when d_t+1 is false	7 years ago
GitHub	20aa6424	Merge pull request #730 from Unity-Technologies/develop-ignore-plugins Add ignore for plugins folder	7 years ago
Arthur Juliani	6a845a0d	Add ignore for plugins folder	7 years ago
GitHub	d1161180	Merge pull request #703 from Unity-Technologies/develop-docs-imitation-video-link added the video link	7 years ago
Vincent Gao	69f106a4	resolved comments	7 years ago
Vincent Gao	80494240	added the video link	7 years ago
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
GitHub	c2902dfe	Merge pull request #599 from Unity-Technologies/docs-refactor Quick start integrated into Installation	7 years ago
Vincent Gao	0e7c88ee	refactored the quick start and installation guide, added faq	7 years ago
GitHub	755be43e	[Cold Fix] Making the episode length and mean reward more accurate for the first episode (#657)	7 years ago
GitHub	b2675216	Hotfix 0.3.1b (#656) * [Fix] Use the stored agent info instead of the previous agent info when bootstraping the value * [Bug Fix] Addressed #643 * [Added Line Break]	7 years ago
vincentpierre	063ee530	Update Python-API.md	7 years ago
vincentpierre	4c40ef6d	Explicitly document `Basics.ipynb` location.	7 years ago
vincentpierre	076c8744	Report means instead of totals for losses (#580) * Report means instead of totals for losses. * Report absolute loss for policy.	7 years ago
vincentpierre	086dd450	CrawlerLegContact incorrectly refers to Ground (#589) No game object is named "Platform" therefore the legs never make contact with anything	7 years ago
GitHub	a99aad13	Hotfix 0.3.1a (#625) * [CoreBrain] Bug fix in the internal brain Discrete vector observations did not have the right size * [Docs] Removed all references to the unitypackages other than the TensorFlowSharp.unitypackage . * [Basic] Updated the bytes file of basic * [Docs] Addressed comments * [Docs] Re-addressed the comments * [Bug Fix] Scalling the visual input between 0 and 1 * [Comments] Added comments to the BatchVisualObservations method of the CoreInternalBrain. * [Renaming] Renamed BlackAndWhite to blackAndWhite	7 years ago

610 commits (d993c549-0b16-4f42-8e8e-5bea39334e27)