ml-agents

作者	SHA1	备注	提交日期
Yuan Gao	33404e1b	Fixed the flake8	5 年前
GitHub	df0196f9	Merge pull request #2472 from Unity-Technologies/release-0.9.2-multi-gpu-doc Added the doc for multi-gpu	5 年前
Yuan Gao	b9210f4c	Updated the comment for —multi-gpu option.	5 年前
Yuan Gao	66205c3e	Added the doc for multi-gpu	5 年前
GitHub	be66102d	Merge pull request #2471 from Unity-Technologies/setup.py-h5py-version More flexibility on the h5py version	5 年前
GitHub	0afd58fc	More flexibility on the h5py version	5 年前
Yuan Gao	0c492fb7	Updated the model	5 年前
Yuan Gao	f33830bc	Updated the python packages version to 0.9.2	5 年前
GitHub	261ee0b6	Merge pull request #2457 from Unity-Technologies/hh/fix-training-NaN-errors-crawler Fix NaN training errors for crawler	5 年前
Hunter	83703d20	moved look rotation logic to avoid potential NaN LookRotation	5 年前
GitHub	1b7045bf	Merge pull request #2448 from Unity-Technologies/hh/fix-broken-crawler-prefabs fixed broken crawler prefabs	5 年前
Hunter	45fe60db	fixed broken prefabs	5 年前
GitHub	3880fd3a	Update development release version to 0.10.0.dev0 (#2443 ) In order for downstream packages to make use of the latest pre-release features, we can pre-release versions of our packages. For packages ending in `devN` pip will not install that package version by default. This change manually updates our package version to a development version with the idea that we can manually perform development versions with the potential for future automated / nightly dev releases.	5 年前
GitHub	43696d60	Fix bug in add_rewards_output and add test (#2442 )	5 年前
GitHub	689765d6	Modification of reward signals and rl_trainer for SAC (#2433 ) * Adds evaluate_batch to reward signals. Evaluates on minibatch rather than on BrainInfo. * Changes the way reward signal results are reported in rl_trainer so that we get the pure, unprocessed environment reward separate from the reward signals. * Moves end_episode to rl_trainer * Fixed bug with BCModule with RNN	5 年前
Arthur Juliani	fa46be7f	Merge branch 'RunSwimFlyRich-master' into develop	5 年前
GitHub	4abe89bc	Only call get_action on brains with policies (#2437 )	5 年前
GitHub	bd7eb286	Update reward signals in parallel with policy (#2362 )	5 年前
Jonathan Harper	e333abf8	Fixing compile error Variable "model" is undefined.	5 年前
GitHub	4472838e	Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1 Hotfix v0.9.1 - develop	5 年前
GitHub	7b69bd14	Refactor Trainer and Model (#2360 ) - Move common functions to trainer.py, model.pyfromppo/trainer.py, ppo/policy.pyandppo/model.py' - Introduce RLTrainer class and move most of add_experiences and some common reward signal code there. PPO and SAC will inherit from this, not so much BC Trainer. - Add methods to Buffer to enable sampling, truncating, and save/loading. - Add scoping to create encoders in model.py	5 年前

1 2 3 4

171 次代码提交 (cd46c9c2-6692-44ed-ba47-4373c2963f36)