This change moves trainer initialization outside of TrainerController,
reducing the number of TrainerController constructor arguments and
allowing trainers to be initialized in cases where a TrainerController
isn't needed.
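A minimal sketch of the intended wiring, using hypothetical names (`initialize_trainers`, simplified `Trainer` and `TrainerController` constructors) rather than the real ML-Agents API:

```python
from typing import Dict


class Trainer:
    """Simplified stand-in for an ML-Agents trainer."""

    def __init__(self, brain_name: str, config: dict):
        self.brain_name = brain_name
        self.config = config


def initialize_trainers(trainer_configs: Dict[str, dict]) -> Dict[str, Trainer]:
    # Build trainers up front, independently of any controller, so they
    # can also be used where no TrainerController exists.
    return {name: Trainer(name, cfg) for name, cfg in trainer_configs.items()}


class TrainerController:
    # The controller now receives ready-made trainers instead of all the
    # arguments needed to construct them itself.
    def __init__(self, trainers: Dict[str, Trainer], model_path: str):
        self.trainers = trainers
        self.model_path = model_path


trainers = initialize_trainers({"3DBall": {"max_steps": 50000}})
controller = TrainerController(trainers, model_path="./models/3DBall")
```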
This fixes an issue where stopping the game while training in the Editor wouldn't end training, due to the new asynchronous SubprocessEnvManager changes. Another minor change moves the `env_manager.close()` call in TrainerController to the end of `start_learning`, so that we are more likely to save the model if something goes wrong during environment shutdown (this occasionally happens on Windows machines).
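A rough sketch of the ordering change, with hypothetical, heavily simplified stand-ins for the real classes:

```python
class EnvManagerStub:
    """Hypothetical stand-in for SubprocessEnvManager, for this sketch only."""

    def close(self) -> None:
        print("environment closed")


class TrainerController:
    """Sketch only: greatly simplified relative to the real class."""

    def __init__(self, max_steps: int = 3):
        self.max_steps = max_steps
        self.step = 0

    def advance(self, env_manager) -> None:
        self.step += 1  # stand-in for one training iteration

    def _save_model(self) -> None:
        print("model saved")

    def start_learning(self, env_manager) -> None:
        try:
            while self.step < self.max_steps:
                self.advance(env_manager)
        finally:
            # Save first, close last: even if environment shutdown fails
            # (as it occasionally does on Windows), the model is on disk.
            self._save_model()
            env_manager.close()


TrainerController().start_learning(EnvManagerStub())
```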
* Included explicit version number for ZN docs
* added explicit version for KR docs
* minor fix in installation doc
* Consistency with numbers for reset parameters
* Removed extra verbiage; minor consistency fixes
* minor consistency
* Cleaned up IL language
* moved parameter sampling above in list
* Cleaned up language in Env Parameter sampling
* Cleaned up migrating content
* updated consistency of Reset Parameter Sampling
* Rename Training-Generalization-Learning.md to Training-Generalization-Reinforcement-Learning-Agents.md
* Updated doc link for generalization
* Rename Training-Generalization-Reinforcement-Learning-Agents.md to Training-Generalized-Reinforcement-Learning-Agents.md
* Re-wrote the intro paragraph for generalization
* add titles, cleaned up language for reset params
* Update Training-Generalized-Reinforcement-Learning-Agents.md
* cleanup of generalization doc
* More cleanu...
* add kor ver of README.md and empty docs, images
* add Installation.md translated to korean
* Fixed main readme docs and move all the English documents in the docs folder
* modify contents of 'Installation.md' and add kr version 'Installation-Windows.md' (not completed) with related images
* completed 1st translation of 'Installation-Windows.md' and added related images for korean docs
* add kr version 'Using-Docker.md'(not completed)
* translate Training-PPO.md to Korean
* Change word about epsilon in Training-PPO.md
* Fix Training PPO about epsilon
* completed korean translation of 'Using-Docker.md'
* Korean translation of Training Imitation Learning is finished! Also, information about the translators is added
* modified all 'blogs.unity3d.com/' to 'blogs.unity3d.com/kr'
* removed all non-translated doc
* add translator information
* Removed obsolete 'TestDstWrongShape' test as it does not reflect how Barracuda tensors work
* Added proper test cleanup, to avoid warning messages from finalizer thread.
* Hotfix for recurrent + continuous action nets in ML Agents
* Fix naming conventions for consistency
* Add generalization link to ML-Agents Overview
* Add generalization to main Readme
* Include types of samplers available for use
* Add Sampler and SamplerManager (a sketch follows after this list)
* Enable resampling of reset parameters during training
* Documentation for Sampler and example YAML configuration file
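A minimal sketch of the sampling idea; the method names and constructor signatures here are illustrative, not the exact ML-Agents implementation:

```python
import random
from typing import Dict


class UniformSampler:
    """Draws a reset parameter value uniformly from [min_value, max_value]."""

    def __init__(self, min_value: float, max_value: float):
        self.min_value = min_value
        self.max_value = max_value

    def sample(self) -> float:
        return random.uniform(self.min_value, self.max_value)


class SamplerManager:
    """Produces a fresh set of reset parameters for the next environment reset."""

    def __init__(self, samplers: Dict[str, UniformSampler]):
        self.samplers = samplers

    def sample_all(self) -> Dict[str, float]:
        return {name: s.sample() for name, s in self.samplers.items()}


# During training, reset parameters can be resampled at each reset interval:
manager = SamplerManager({
    "mass": UniformSampler(0.5, 10.0),
    "gravity": UniformSampler(7.0, 12.0),
})
new_reset_params = manager.sample_all()
```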
Brings a bucket of temp memory allocation optimizations:
* switched to Barracuda-backed tensors across the board, which helps leverage allocators and reuse internal buffers
* added the Barracuda 0.2.4 release, which brings another set of temp memory allocation fixes
* Timer proof-of-concept
* micro optimizations
* add some timers
* cleanup, add asserts
* Cleanup (no start/end methods) and handle exceptions
* unit test and decorator
* move output code, add a decorator (see the timing sketch after this list)
* cleanup
* module docstring
* actually write the timings when done with training
* use __qualname__ instead
* add a few more timers
* fix mock import
* fix unit test
* get timers from worker process (WIP)
* clean up timer merging
* typo
* WIP
* cleanup merging code
* bad merge
* undo accidental change
* remove reset command
* fix style
* fix unit tests
* fix unit tests (they got overwritten in the merge)
* get timer root through a function
* timer around communicate
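A rough sketch of a hierarchical timing decorator in the spirit of the commits above; the names (`TimerNode`, `timed`, `_root`) and structure are illustrative, not the actual ML-Agents timer module:

```python
import time
from functools import wraps
from typing import Dict


class TimerNode:
    def __init__(self):
        self.total: float = 0.0
        self.count: int = 0
        self.children: Dict[str, "TimerNode"] = {}

    def merge(self, other: "TimerNode") -> None:
        # Fold timings gathered elsewhere (e.g. a worker process) into this tree.
        self.total += other.total
        self.count += other.count
        for name, child in other.children.items():
            self.children.setdefault(name, TimerNode()).merge(child)


_root = TimerNode()
_stack = [_root]


def timed(func):
    # Record wall-clock time under the function's qualified name,
    # nested beneath whatever timer is currently active.
    @wraps(func)
    def wrapper(*args, **kwargs):
        node = _stack[-1].children.setdefault(func.__qualname__, TimerNode())
        _stack.append(node)
        start = time.perf_counter()
        try:
            return func(*args, **kwargs)
        finally:
            node.total += time.perf_counter() - start
            node.count += 1
            _stack.pop()

    return wrapper


@timed
def policy_update():
    time.sleep(0.01)


policy_update()
print(_root.children["policy_update"].count)  # -> 1
```

Timings gathered in worker processes can be sent back over the result pipe and folded into the main tree with `merge`, then written out once training finishes.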
* Removes unused SubprocessEnvManager import in trainer_controller
* Removes unused `steps` argument to `TrainerController._save_model`
* Consolidates unnecessary branching for curricula in
`TrainerController.advance`
* Moves `reward_buffer` into `TFPolicy` from `PPOPolicy` and adds
`BCTrainer` support so that we don't have a broken interface /
undefined behavior when BCTrainer is used with curricula.
* Don't bootstrap with a value of 0 for GAIL and Curiosity
* Add gradient penalties to the GAN to help with stability (see the sketch after this list)
* Add gail_config.yaml with GAIL examples
* Cleaned up trainer_config.yaml and removed unnecessary gammas
* Documentation updates
* Code cleanup
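For reference, a hedged sketch of a WGAN-GP-style gradient penalty on a discriminator, written in PyTorch for brevity (the actual ML-Agents GAIL code is TensorFlow-based and may differ in detail):

```python
import torch


def gradient_penalty(discriminator, expert_batch, policy_batch, coef=10.0):
    # Interpolate between expert and policy samples, then penalize the
    # discriminator's gradient norm where it deviates from 1.
    alpha = torch.rand(expert_batch.size(0), 1)
    interp = alpha * expert_batch + (1.0 - alpha) * policy_batch
    interp.requires_grad_(True)
    scores = discriminator(interp)
    grads = torch.autograd.grad(
        outputs=scores,
        inputs=interp,
        grad_outputs=torch.ones_like(scores),
        create_graph=True,
    )[0]
    grad_norm = grads.norm(2, dim=1)
    return coef * ((grad_norm - 1.0) ** 2).mean()


# Toy usage with a random discriminator and random batches:
disc = torch.nn.Sequential(torch.nn.Linear(8, 64), torch.nn.ReLU(), torch.nn.Linear(64, 1))
penalty = gradient_penalty(disc, torch.randn(32, 8), torch.randn(32, 8))
```

The penalty is added to the discriminator loss, discouraging the sharp decision boundaries that tend to destabilize adversarial training.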
SubprocessEnvManager currently takes steps synchronously to reproduce the old behavior, meaning all parallel environments must wait for the slowest environment to take a step. If some steps take much longer than others, this can lead to a substantial overall slowdown in practice. In extreme cases we've seen almost a 2x speedup from using asynchronous stepping, with no downside for our faster environments (Bouncer 16% improvement, Walker 14% improvement in tests).
This PR changes the SubprocessEnvManager to use asynchronous stepping. This means that on a "step" call, the environment manager enqueues step requests to the workers and then waits only until at least one step has been completed before returning.
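A simplified sketch of the asynchronous stepping pattern described above, built directly on Python's `multiprocessing`; the worker protocol and class names are illustrative, not the real SubprocessEnvManager API:

```python
import multiprocessing as mp
import random
import time
from typing import List, Tuple


def env_worker(worker_id: int, requests: mp.Queue, results: mp.Queue) -> None:
    # Stand-in for a subprocess running one Unity environment.
    while True:
        cmd = requests.get()
        if cmd == "close":
            break
        time.sleep(random.uniform(0.01, 0.05))  # pretend env.step() takes a while
        results.put((worker_id, "step_result"))


class AsyncEnvManager:
    def __init__(self, n_workers: int = 4):
        self.results: mp.Queue = mp.Queue()
        self.requests: List[mp.Queue] = [mp.Queue() for _ in range(n_workers)]
        self.workers = [
            mp.Process(target=env_worker, args=(i, q, self.results), daemon=True)
            for i, q in enumerate(self.requests)
        ]
        for w in self.workers:
            w.start()
        self.busy = [False] * n_workers

    def step(self) -> List[Tuple[int, str]]:
        # Enqueue step requests to every idle worker...
        for i, q in enumerate(self.requests):
            if not self.busy[i]:
                q.put("step")
                self.busy[i] = True
        # ...then wait only until at least one step has completed,
        # draining any other results that happen to be ready.
        completed = [self.results.get()]
        while not self.results.empty():
            completed.append(self.results.get())
        for worker_id, _ in completed:
            self.busy[worker_id] = False
        return completed

    def close(self) -> None:
        for q in self.requests:
            q.put("close")
        for w in self.workers:
            w.join()


if __name__ == "__main__":
    manager = AsyncEnvManager(n_workers=4)
    for _ in range(5):
        print(manager.step())
    manager.close()
```

Slow environments keep stepping in the background and simply contribute their results on a later `step` call, which is where the Bouncer and Walker speedups quoted above come from.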