ml-agents

作者	SHA1	备注	提交日期
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
Marwan Mattar	72a71a08	Merge branch 'development-0.3' into dev-api-doc-decision	7 年前
Joe Ward	9163a54a	resolved merge conflict with dev-0.3 branch	7 年前
GitHub	6dd3c284	Hotfix 0.3.0b (#519 ) * Fixes internal brain for Banana Imitation. * Fixes Discrete Control training for Imitation Learning. * Fixes Visual Observations in internal brain with non-square inputs.	7 年前
GitHub	a6385cbf	Merge pull request #536 from Unity-Technologies/master Bring develop to v0.3.0b	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
GitHub	0c417c55	Release v0.5 (#1202 )	6 年前
GitHub	2d715dc5	Revert "Release v0.5 (#1202 )" (#1221 ) This reverts commit 983c4029cb435fc7ad27a796e79a1d59904e53e5.	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
GitHub	25495874	Merge pull request #1223 from Unity-Technologies/release-v0.5 Release v0.5	6 年前
vincentpierre	5c060417	Added PushBlock models, fixed trainer config and fixed Learning brain asset (#1344 ) * Added PushBlock models, fixed trainer config and fixed Learning brain asset * Fixed PushBlock model to be in correct place * Added BananaLearning, deleted bytes files for PushBlock, fixed PushBlockLearning.asset * Deleted stray file * Added WallJumpArea training mods * Fixed Banana collector	6 年前
GitHub	547f0e98	Merge pull request #1361 from Unity-Technologies/release-v0.6 Merge Release v0.6 into develop	6 年前
vincentpierre	ec18f6d6	Added the new models	6 年前
GitHub	e9121bb5	Merge pull request #1451 from Unity-Technologies/release-v0.6-revertTF1 Release v0.6 revert tf1	6 年前
GitHub	c8cc5a29	Merge pull request #1495 from Unity-Technologies/release-v0.6 release-v0.6 --> develop	6 年前
GitHub	a196dde2	Merge pull request #1494 from Unity-Technologies/release-v0.6 v0.6 Release	6 年前
Vincent-Pierre BERGES	4a6ae4e0	Barracuda integration into ML-Agents (#1557 ) * Switched default Mac GFX API to Metal * Added Barracuda pre-0.1.5 * Added basic integration with Barracuda Inference Engine * Use predefined outputs the same way as for TF engine * Fixed discrete action + LSTM support * Switch Unity Mac Editor to Metal GFX API * Fixed null model handling * All examples converted to support Barracuda * Added model conversion from Tensorflow to Barracuda copied the barracuda.py file to ml-agents/mlagents/trainers copied the tensorflow_to_barracuda.py file to ml-agents/mlagents/trainers modified the tensorflow_to_barracuda.py file so it could be called from mlagents modified ml-agents/mlagents/trainers/policy.py to convert the tf models to barracuda compatible .bytes file * Added missing iOS BLAS plugin * Added forgotten prefab changes * Removed GLCore GFX backend for Mac, because it doesn't support Compute shaders * Exposed GPU support for LearningBrain inference ...	6 年前
Vincent-Pierre BERGES	ed1b7f33	Retrained models for Release 0.7 and deleted random prefab for bouncer (#1761 )	6 年前
GitHub	275ff5d6	Merge pull request #1764 from Unity-Technologies/release-v0.7 Release v0.7 into master	6 年前
GitHub	20ff1436	Merge pull request #1765 from Unity-Technologies/release-v0.7 Release v0.7 into develop	6 年前
Yuan Gao	c2c25bf6	Updated all the scenes’s model and the bouncer’s expected reward	6 年前
GitHub	74bd5e1a	Merge pull request #1928 from Unity-Technologies/release-v0.8-model-update Updated all the scenes’s model and the bouncer’s expected reward	6 年前
GitHub	2d1bda57	Merge pull request #1931 from Unity-Technologies/release-v0.8 Release v0.8	6 年前
GitHub	ba57eaad	Merge pull request #1932 from Unity-Technologies/release-v0.8 Release v0.8	6 年前
Mantas Puida	27567062	First stage of ML Agents update to Barracuda 0.2.x	6 年前
GitHub	f13d0f11	Merge pull request #2049 from Unity-Technologies/develop-barracuda-0.2.0 Barracuda 0.2.1 -> develop	5 年前
GitHub	610b8852	Release v0.8.2 update models (#2178 ) * ignore the idea file * Retrained most of the models * Updated the remaining models	5 年前
GitHub	d5f6b7f8	Merge pull request #2157 from Unity-Technologies/release-v0.8.2 Release v0.8.2	5 年前
GitHub	dcef9f69	Merge pull request #2179 from Unity-Technologies/release-v0.8.2 Merge from release 0.8.2 to develop	5 年前
GitHub	40c7fc48	Merge branch 'develop' into protobuf_update	5 年前
Ervin T	cf5e09fc	Updated the models for v0.9 (#2374 )	5 年前
GitHub	53475207	Merge pull request #2380 from Unity-Technologies/release-0.9.0 Release v0.9.0	5 年前
GitHub	c7f0ed04	Merge pull request #2381 from Unity-Technologies/release-0.9.0	5 年前
Yuan Gao	0c492fb7	Updated the model	5 年前
GitHub	0a163871	Merge pull request #2469 from Unity-Technologies/release-0.9.2 Release 0.9.2	5 年前
GitHub	cf9e67fb	Merge pull request #2470 from Unity-Technologies/release-0.9.2 Release 0.9.2 to develop	5 年前

39 次代码提交 (d993c549-0b16-4f42-8e8e-5bea39334e27)