ml-agents

作者	SHA1	备注	提交日期
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
Marwan Mattar	72a71a08	Merge branch 'development-0.3' into dev-api-doc-decision	7 年前
Joe Ward	9163a54a	resolved merge conflict with dev-0.3 branch	7 年前
GitHub	976c56c5	Environment Aesthetic Unification (#459 ) * Aesthetic unification * Add new environment images	7 年前
GitHub	c1e930b5	Minor Visual Changes for Environments (#470 ) * Minor changes to ensure a common visual language. * Agents are blue (or additionally red in competitive scenarios). * Interactable objects are orange. * Goals are green when objects, and checkerboards when places. * Not everything perfectly follows this, but things are mostly consistent now. * Renamed "Banana" folder to "BananaCollectors" * Ensured all brains were set to "Player" * Moved non-shared assets out of the "SharedAssets" folder.	7 年前
Marwan Mattar	4d1b3ae3	Merge branch 'development-0.3' into docs/doxygen # Conflicts: # docs/doxygen/Readme.md	7 年前
GitHub	8c228f99	[Fix] Renamed the bananas and badBananas using git mv command (#482 ) * [Fix] Renamed the bananas and badBananas using git special command * [Fix] Renamed metafiles	7 年前
Marwan Mattar	06cc85cc	Merge branch 'development-0.3' into docs/random-fixes # Conflicts: # docs/Learning-Environment-Create-New.md	7 年前
vincentpierre	c18b6860	[HotFix] Added missing RayPerception.cs scripts to the agents in the imitation scene	7 年前
GitHub	ad5ac4ab	Merge pull request #499 from Unity-Technologies/hotfix-imitation-banana [HotFix] Added missing RayPerception.cs scripts to the agents	7 年前
GitHub	a8797f54	Merge pull request #500 from Unity-Technologies/master Merging 0.3.0a into develop	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	9ab98584	Additional Environment Variations (#791 ) * Add Visual (Camera) and Imitation Learning variations to example environments	7 年前
GitHub	696c13d2	Fix BananaIL frozen agent material (#865 ) * Fix BananaIL frozen agent material * [Fix] Added texture on the imitation learning scene and linked the models to the internal brains	6 年前
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
GitHub	4b3c6c9f	Merge pull request #885 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
Arthur Juliani	195ac934	Merge branch 'develop' into develop-runs # Conflicts: # python/learn.py # python/unitytrainers/trainer.py	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	2d715dc5	Revert "Release v0.5 (#1202 )" (#1221 ) This reverts commit 983c4029cb435fc7ad27a796e79a1d59904e53e5.	6 年前
GitHub	1cf194d8	Fix Banana scene lightmap (#1095 )	6 年前
GitHub	b146cce3	Fix Banana IL scene (#1097 )	6 年前
GitHub	325fb849	Add max-step to agents in BananaVisual (#1119 )	6 年前
Deric Pang	3e6497dd	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
Deric Pang	fe697b9b	Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure	6 年前
GitHub	25495874	Merge pull request #1223 from Unity-Technologies/release-v0.5 Release v0.5	6 年前
Vincent(Yuan) Gao	e8a226a0	Added 3DBall, Banana, Basic, Walker, Hallway and PushBlock (#1312 )	6 年前
vincentpierre	5c060417	Added PushBlock models, fixed trainer config and fixed Learning brain asset (#1344 ) * Added PushBlock models, fixed trainer config and fixed Learning brain asset * Fixed PushBlock model to be in correct place * Added BananaLearning, deleted bytes files for PushBlock, fixed PushBlockLearning.asset * Deleted stray file * Added WallJumpArea training mods * Fixed Banana collector	6 年前
GitHub	547f0e98	Merge pull request #1361 from Unity-Technologies/release-v0.6 Merge Release v0.6 into develop	6 年前
vincentpierre	09657954	Made the default brains of the BananaRL prefab the Player brain	6 年前
GitHub	9d4d1a84	Merge pull request #1472 from Unity-Technologies/release-v0.6-bananaRL-player-brain Made the default brains of the BananaRL prefab the Player brain	6 年前
GitHub	c8cc5a29	Merge pull request #1495 from Unity-Technologies/release-v0.6 release-v0.6 --> develop	6 年前
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 年前
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557 ) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 年前
GitHub	275ff5d6	Merge pull request #1764 from Unity-Technologies/release-v0.7 Release v0.7 into master	6 年前

37 次代码提交 (eb90772f-4aea-48f5-a82d-2d3a0d2fde92)