ml-agents

作者	SHA1	备注	提交日期
Chris Elion	cf8a3237	precommit autoupdate	6 年前
Jeffrey Shih	4bd384a3	Make Gym interface work with grayscale and RGB visual observations (#2192 )	6 年前
Chris Elion	e69ddc53	cleanup setup.cfg	6 年前
Jonathan Harper	c2cd5a87	Add custom reset parameters to subprocess env manager This mirrors functionality already found in UnityEnvironment	6 年前
Chris Elion	2f9c3ed5	enforce line length	6 年前
Chris Elion	af4699ac	Fix reference to external_brains in TrainerController (#2237 ) PR #2213 conflicted with PR #2209 on a reference to external_brains. This change fixes the conflict.	6 年前
Chris Elion	01e11360	add setup.cfg	6 年前
GitHub	966d8efb	Remove "external_brains" arg for TrainerController (#2213 ) TrainerController depended on an external_brains dictionary with brain params in its constructor but only used it in a single function call. The same function call (start_learning) takes the environment as an argument, which is the source of the external_brains. This change removes the dependency of TrainerController on external brains and removes the two class members related to external_brains and retrieves the brains directly from the environment.	6 年前
Chris Elion	bb7773c1	add flake8 to precommit	6 年前
GitHub	b05c9ac1	Add environment manager for parallel environments (#2209 ) Previously in v0.8 we added parallel environments via the SubprocessUnityEnvironment, which exposed the same abstraction as UnityEnvironment while actually wrapping many parallel environments via subprocesses. Wrapping many environments with the same interface as a single environment had some downsides, however: * Ordering needed to be preserved for agents across different envs, complicating the SubprocessEnvironment logic * Asynchronous environments with steps taken out of sync with the trainer aren't viable with the Environment abstraction This PR introduces a new EnvManager abstraction which exposes a reduced subset of the UnityEnvironment abstraction and a SubprocessEnvManager implementation which replaces the SubprocessUnityEnvironment.	6 年前
Jonathan Harper	177ee5b8	Remove unused "last reward" logic, TF nodes At each step, an unused `last_reward` variable in the TF graph is updated in our PPO trainer. There are also related unused methods in various places in the codebase. This change removes them.	6 年前
GitHub	4ac79742	Refactor reward signals into separate class (#2144 ) * Create new class (RewardSignal) that represents a reward signal. * Add value heads for each reward signal in the PPO model. * Make summaries agnostic to the type of reward signals, and log weighted rewards per reward signal. * Move extrinsic and curiosity rewards into this new structure. * Allow defining multiple reward signals in YAML file. Add documentation for this new structure.	6 年前
GitHub	e6b8140a	enable precommit for line endings, fix 1 failure (#2208 )	6 年前
GitHub	9a68742f	Merge pull request #2138 from rsfutch77/protobuf_update Fix Protobuf Install Instructions	6 年前
GitHub	40c7fc48	Merge branch 'develop' into protobuf_update	6 年前
GitHub	6660608c	Merge pull request #2190 from gregnz/develop-rayperception-docs Develop rayperception docs	6 年前
GitHub	342641d6	Merge pull request #2172 from quevedin/patch-1 Updated minor typos (sh for console)	6 年前
gregday	02ca8cdc	Adding docs clarifying the construction of the ray sublist. The main point to clarify is the behaviour of returning a '1' on a no hit.	6 年前
GitHub	202ed600	Update to include CLA language (#2195 )	6 年前
GitHub	2671e1a0	Enable mypy in precommit checks (#2177 ) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * WIP enable mypy * run mypy on each package * fix trainer_metrics mypy errors * more mypy errors * more mypy * Fix some partially typed functions * types for take_action_outputs * fix formatting * cleanup * generate stubs for proto objects * fix ml-agents-env mypy errors * disallow-incomplete-defs for gym-unity * Add CI notes to CONTRIBUTING.md	6 年前
GitHub	e910ccc5	Use pre-commit for CI tests, add black hook (#2163 ) * WIP precommit on top level * update CI * circleci fixes * intentionally fail black * use --show-diff-on-failure in CI * fix command order * rebreak a file * apply black * run on whole repo	6 年前
GitHub	dcef9f69	Merge pull request #2179 from Unity-Technologies/release-v0.8.2 Merge from release 0.8.2 to develop	6 年前
GitHub	610b8852	Release v0.8.2 update models (#2178 ) * ignore the idea file * Retrained most of the models * Updated the remaining models	6 年前
GitHub	4966b888	Fixed the import issue (#2158 ) * Fixed the import issue * make black happy * Use find_namespace_packages	6 年前
rsfutch77	bb6acc1c	Fix mlagents re-install directions - Fix re-install directions to include -e modifer - Move re-install directions from creating-custom... to protobuf readme - Add how to see confirmation that install worked	6 年前
Vincent(Yuan) Gao	185b9a18	Release v0.8.2 doc fixes (#2155 ) * Minor basic guide fix * made it clear for training instruction	6 年前
rsfutch77	9f7c8428	Update protobuf readme - Include grpc required versions - Clarify which steps are install and which are run every time	6 年前
Yuan Gao	fa4d61b5	Bumping the ml-agents, ml-agents-envs, gym_unity versions	6 年前
rsfutch77	fdd452ac	Update protobuf readme - Add required version for grpc tools. Newer versions cause UnityToExternalGrpc.cs to fail to compile inside Unity due to a new function in the file	6 年前
GitHub	b3432dda	Automate meta file check (#2133 ) * Script to validate .meta files are set up correctly * Add command to CI * don't gitignore Gizmos, add .meta * Move to utils directory	6 年前
rsfutch77	72042bf8	Update make_for_win_.bat to match make.bat - make_for_win_.bat now has the same comment as make.bat - The instructions for editing those files will both use line 7 now.	6 年前
GitHub	cf3ccc4a	Barracuda - remove unused private variable (#2128 )	6 年前
rsfutch77	d8319227	Update .gitignore - Ignore grpc installation on any platform	6 年前
Ervin T	f77984db	Change guide to not extend two bases classes in c# (#2104 ) Documentation change to Heuristic Brain	6 年前
rsfutch77	57868828	Update make_for_win.bat - Revert version to what is currently in develop branch to clarify what changed.	6 年前
GitHub	e916dc48	use yaml.safe_load instead of yaml.load (#2124 )	6 年前
rsfutch77	8d53b8cc	Update .gitignore - Update protobuf folder ignores	6 年前
Ervin T	a3d03fb4	run black on ml-agents (#2125 ) Run black after Barracuda 0.2 merge	6 年前
rsfutch77	236934a4	Update README.md - Notes for where to enter commands to start with - Select a particular version of grpcio-tools - Note how to get nuget if needed - Directory independent nuget install - Remove instruction to download protoc since it comes with grpc.tools - Add instructions for windows in ##Running and directories for clarification	6 年前
GitHub	a4d5b2d3	Doc/comment cleanup - Fix some occurrences of 'the the' (#2119 )	6 年前
rsfutch77	b3c65637	Update make_for_win.bat - Put rem back in make_fo_win.bat	6 年前
Vincent(Yuan) Gao	a15763f8	Clear cumulative_returns_since_policy_update (#2120 ) Before the CSV file's mean rewards would lag much behind the rest of the code since this buffer was never cleared.	6 年前
rsfutch77	412dbe92	Update Creating-Custom-Protobuf-Messages.md - Note to use the windows batch file on windows	6 年前
GitHub	f13d0f11	Merge pull request #2049 from Unity-Technologies/develop-barracuda-0.2.0 Barracuda 0.2.1 -> develop	6 年前
rsfutch77	89eb9270	Update make_for_win.bat - Fix slash direction for windows in COMPILER definition - Fix missing COMPILER variables when calling protoc - Fix call to "python" instead of "python3"	6 年前
Jeffrey Shih	3ecbabc3	[Documentation] Clarify Create-New.md (#2100 ) * Update Learning-Environment-Create-New.md - Clarify that training is done in the original ml-agents project folder - Remove mistype - In the future it could help to show the user that they can copy the config folder and run training in a new project folder so they don't have to mix project settings in the original config folder * Update Learning-Environment-Create-New.md Add file paths	6 年前
Jonathan Harper	7d9eea6b	Fix formatting for gym env	6 年前
Jonathan Harper	0423c341	Update pip and setuptools in CircleCI The CircleCI checks have been broken because of outdated setuptools, this change should fix the issue.	6 年前
Jonathan Harper	d9a7e5b6	Fix failure on Academy Done() with parallel envs When using parallel SubprocessUnityEnvironment instances along with Academy Done(), a new step might be taken when reset should have been called because some environments may have been done while others were not (making "global done" less useful). This change manages the reset on `global_done` at the level of the environment worker, and removes the global reset from TrainerController.	6 年前
Mantas Puida	1862b6be	Multiple LSTM cell handling added to Barracuda code path	6 年前

1 2 3 4 5 ...

1221 次代码提交 (dba466e3-4776-435a-8d5a-e60042257826)