ml-agents

作者	SHA1	备注	提交日期
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	69481d2d	Imitation Learning Helper (#371 ) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 年前
GitHub	a809630f	Add config for crawler, and change crawler scene (#376 ) * Add config for crawler, and change crawler scene * Changed number of crawlers in scene to 12 * Changed Max-steps for crawlers to 5000 * Newer hyperparameters and newly trained crawler model * Clean up crawler code, and improve efficency	7 年前
Vincent Gao	38bd3e40	replaced all the tabs to 4 spaces in the project	7 年前
Vincent Gao	ba0ecf24	fixed other tabs and spaces	7 年前
GitHub	1409236e	made AgentAction take vectorAction and textAction (#397 )	7 年前
Vincent Gao	1bc43933	Merge branch 'development-0.3' into hotfix/issue#333	7 年前
GitHub	8d6bf190	Merge pull request #384 from Unity-Technologies/hotfix/issue#333 hotfix on issue#333, and a few comments fixes	7 年前
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
Marwan Mattar	72a71a08	Merge branch 'development-0.3' into dev-api-doc-decision	7 年前
Joe Ward	9163a54a	resolved merge conflict with dev-0.3 branch	7 年前
GitHub	6dd3c284	Hotfix 0.3.0b (#519 ) * Fixes internal brain for Banana Imitation. * Fixes Discrete Control training for Imitation Learning. * Fixes Visual Observations in internal brain with non-square inputs.	7 年前
GitHub	a6385cbf	Merge pull request #536 from Unity-Technologies/master Bring develop to v0.3.0b	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	9ab98584	Additional Environment Variations (#791 ) * Add Visual (Camera) and Imitation Learning variations to example environments	7 年前
Arthur Juliani	d4a2df66	Namespacification (#814 ) * [Namespace created] Added the namespace MLAgents on the C# scripts	7 年前
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	2d715dc5	Revert "Release v0.5 (#1202 )" (#1221 ) This reverts commit 983c4029cb435fc7ad27a796e79a1d59904e53e5.	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
GitHub	25495874	Merge pull request #1223 from Unity-Technologies/release-v0.5 Release v0.5	6 年前
Arthur Juliani	3409ea3a	Remove dependency of prefab on external gameobject	6 年前
GitHub	52bb4c99	Merge pull request #1253 from Unity-Technologies/hotfix-bananas Remove dependency of prefab on external gameobject	6 年前
GitHub	45a86c85	Merge pull request #1264 from Unity-Technologies/hotfix-050a Hotfix 050a	6 年前
GitHub	fe47d896	Merge pull request #1261 from Unity-Technologies/hotfix-050a Hotfix v0.5.0a	6 年前
GitHub	be0d2709	Refactor RayPerception and add RayPerception2D (#1793 ) * Fix typos * Use abstract class for rayperception * Created RayPerception2D. (#1721) * Incorporate RayPerception2D * Fix typo * Make abstract class * Add tests	6 年前
GitHub	2d1bda57	Merge pull request #1931 from Unity-Technologies/release-v0.8 Release v0.8	6 年前
Ervin T	b4675aa0	Fix respawn part of BananaLogic (#2277 ) Fix the bug of "respawn" part that cause all the banana respawn in the first Area.	5 年前
Ervin T	5465c2e0	Implemented the reset parameters for Banana Collectors and Bouncer (#2258 ) Banana Collectors: Length of laser and agent scale Bouncer: Size of the banana	5 年前
GitHub	53475207	Merge pull request #2380 from Unity-Technologies/release-0.9.0 Release v0.9.0	5 年前
GitHub	afbf46bd	fix BananaIL scene (#2512 ) * add reset parameters to scene * default values for BananaAgent reset parameters	5 年前
GitHub	dc3ab81a	Merge pull request #2514 from Unity-Technologies/hotfix-0.9.3 Hotfix 0.9.3	5 年前
GitHub	7ec3d7ad	Merge pull request #2516 from Unity-Technologies/master Merege 0.9.3 changes to develop	5 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前

39 次代码提交 (d993c549-0b16-4f42-8e8e-5bea39334e27)