ml-agents

作者	SHA1	备注	提交日期
GitHub	6a212f73	Improvements for GAIL (#2296 ) * Don't 0 value bootstrap for GAIL and Curiosity * Add gradient penalties to GAN to help with stability * Add gail_config.yaml with GAIL examples * Cleaned up trainer_config.yaml and unnecessary gammas * Documentation updates * Code cleanup	6 年前
GitHub	19283bfa	Very simple environment for testing (#2266 ) * WIP doesn't crash * return stats and assert convergence * pass lint checks * rename * fix-reset-params * add time penalty * _get_measure_vals always returns something * fix tests * unused import * single env, fix double step * move LocalEnvManager to ml-agents-envs * move and rename EnvManager * remove obsolete docstring and method * clean up	6 年前
GitHub	b11efed9	fix bug in RandomNormal (#2294 ) * fix bug in RandomNormal, add test for distribution * extract epsilon, rename vars	6 年前
GitHub	49f20394	Fix for vis obs memory leak in docker (#2274 ) * Fix for vis obs memory leak in docker * Remove reversions from code	6 年前
GitHub	be4292fb	Add different types of visual encoder (nature cnn/resnet) Add resnet and nature cnn in addition to default visual encoder	6 年前
GitHub	a802d0d7	Make SubprocessEnvManager take asynchronous steps (#2265 ) SubprocessEnvManager takes steps synchronously to reproduce old behavior, meaning all parallel environments will need to wait for the slowest environment to take a step. If some steps take much longer than others, this can lead to a substantial overall slowdown in practice. We've seen extreme cases where we see almost a 2x speedup from using asynchronous stepping, with no downside for our faster environments. (Bouncer 16% improvement, Walker 14% improvement in tests). This PR changes the SubprocessEnvManager to use async stepping. This means on the "step" call the environment manager will enqueue step requests to workers, and then only wait until at least one step has been completed before returning.	6 年前
GitHub	f8041534	Merge pull request #2236 from Unity-Technologies/enable-flake8 Enable flake8	6 年前
Chris Elion	d29289cd	update mypy version	6 年前
Chris Elion	9924c40e	one more unused	6 年前
Chris Elion	c58c2600	remove unused variables	6 年前
Ervin T	b4675aa0	Fix respawn part of BananaLogic (#2277 ) Fix the bug of "respawn" part that cause all the banana respawn in the first Area.	6 年前
Chris Elion	dfdf7b83	fix whitespace and line breaks	6 年前
GitHub	08672c47	remove codacy (#2287 ) * remove codacy * Cleanup name	6 年前
Chris Elion	5d07ca1f	Merge remote-tracking branch 'origin/develop' into enable-flake8	6 年前
GitHub	e0544e8f	Merge pull request #2267 from Unity-Technologies/develop-enforce-coverage Enforce min coverage percentage	6 年前
GitHub	39e693b5	Merge pull request #2280 from Unity-Technologies/develop-newResetParams-3DBall-Tennis-TwoSoccer Develop new reset params 3 d ball tennis two soccer	6 年前
sankalp04	a441c374	Ported documentation from other branch	6 年前
sankalp04	c6fba86a	tennis reset parameter implementation ported over	6 年前
sankalp04	ca644b3b	Fixed the default value to match the value in the docs	6 年前
sankalp04	22c3331a	two soccer reset parameter implementation ported over	6 年前
sankalp04	ae620f59	3D ball reset parameter implementation ported over	6 年前
sankalp04	2c8bdda0	3D ball reset parameter implementation ported over	6 年前
GitHub	d6e4eee2	Relax the cloudpickle version restriction (#2279 )	6 年前
GitHub	a5b7cf95	Fix get_value_estimate and buffer append (#2276 ) Fixes shuffling issue with newer versions of numpy (#1798). * make get_value_estimates output a dict of floats * Use np.append instead of convert to list, unconvert * Add type hints and test for get_value_estimates	6 年前
Jonathan Harper	2f203f89	fix lint checks	6 年前
Jonathan Harper	9a170db5	Add Unity command line arguments	6 年前
GitHub	1c18bd18	Swap 0 set and reward buffer append (#2273 ) Fix bug with reward_buffer always 0	6 年前
GitHub	9c50abcf	GAIL and Pretraining (#2118 ) Based on the new reward signals architecture, add BC pretrainer and GAIL for PPO. Main changes: - A new GAILRewardSignal and GAILModel for GAIL/VAIL - A BCModule component (not a reward signal) to do pretraining during RL - Documentation for both of these - Change to Demo Loader that lets you load multiple demo files in a folder - Example Demo files for all of our tested sample environments (for future regression testing)	6 年前
GitHub	fca0048d	Updated links to point to KR blog site (#2272 )	6 年前
GitHub	24d1f803	link from readme to KR docs (#2271 ) added link	6 年前
Jeffrey Shih	26f20508	add kor ver of README.md and empty docs, images (#2221 )	6 年前
Chris Elion	b00f17c5	comments	6 年前
GitHub	60cf98cf	[docs] Fix typo. (#2260 )	6 年前
Chris Elion	430b9354	enforce a min % of code coverage	6 年前
GitHub	6c37c9df	[docs] Reorder the instructions for intalling python/mlagents for sequential clarity. (#2259 )	6 年前
GitHub	5b494ffb	[docs] Fix a small spelling error. (#2256 )	6 年前
Chris Elion	a523d60b	Using-Docker.md miss a backslash in 3DBall command (#2239 ) * Using-Docker.md miss a backslash in 3DBall command Hi, Just a quick edit because a backslash seems to be missing from the 3DBall command example. * Added interactive options and Tensorboard documentation for Docker training	6 年前
Chris Elion	165d4312	Merge remote-tracking branch 'origin/master' into enable-flake8	6 年前
GitHub	d80d5852	add some types to the reward signals (#2215 ) * WIP add some types to the reward signals * fix next_visual_in * cleanup TODO * fix bad merge	6 年前
Chris Elion	85809f78	remove unused variables	6 年前
GitHub	d415528a	fix subprocess test and style checks on develop (#2248 ) * fix tests that broke with new arg * fix black	6 年前
Chris Elion	731e129b	fix accidental change	6 年前
GitHub	84d9d622	python timers (#2180 ) * Timer proof-of-concept * micro optimizations * add some timers * cleanup, add asserts * Cleanup (no start/end methods) and handle exceptions * unit test and decorator * move output code, add a decorator * cleanup * module docstring * actually write the timings when done with training * use __qualname__ instead * add a few more timers * fix mock import * fix unit test * don't need fwd reference * cleanup root * always write timers, add comments * undo accidental change	6 年前
Chris Elion	cf8a3237	precommit autoupdate	6 年前
Jeffrey Shih	4bd384a3	Make Gym interface work with grayscale and RGB visual observations (#2192 )	6 年前
Chris Elion	e69ddc53	cleanup setup.cfg	6 年前
Jonathan Harper	c2cd5a87	Add custom reset parameters to subprocess env manager This mirrors functionality already found in UnityEnvironment	6 年前
Chris Elion	2f9c3ed5	enforce line length	6 年前
Chris Elion	af4699ac	Fix reference to external_brains in TrainerController (#2237 ) PR #2213 conflicted with PR #2209 on a reference to external_brains. This change fixes the conflict.	6 年前
Chris Elion	01e11360	add setup.cfg	6 年前

1 2 3 4 5 ...

1214 次代码提交 (6a212f73-86c2-48a2-a399-146d66aae08e)