ml-agents

作者	SHA1	备注	提交日期
GitHub	4ac79742	Refactor reward signals into separate class (#2144 ) * Create new class (RewardSignal) that represents a reward signal. * Add value heads for each reward signal in the PPO model. * Make summaries agnostic to the type of reward signals, and log weighted rewards per reward signal. * Move extrinsic and curiosity rewards into this new structure. * Allow defining multiple reward signals in YAML file. Add documentation for this new structure.	5 年前
GitHub	d80d5852	add some types to the reward signals (#2215 ) * WIP add some types to the reward signals * fix next_visual_in * cleanup TODO * fix bad merge	5 年前
GitHub	ab690b93	Fix naming conflict between Curiosity and GAIL (#2406 )	5 年前
Chris Elion	43e23941	rough pass at tf2 support, needs cleanup	5 年前
Chris Elion	806c77e4	centralize tensorflow imports	5 年前
Chris Elion	73a346cb	cleanup	5 年前
Ervin Teng	151e3b1c	Move policy to common location, remove epsilon	5 年前
Ervin Teng	b61d2fa1	Fix some typing issues with curiosity	5 年前
Ervin Teng	cadf6603	Fix SAC CC and some reward signal tests	5 年前
Ervin Teng	5bfc0b87	Update docstring	5 年前
GitHub	c145e75b	Split Policy and Optimizer, common Policy for PPO and SAC (#3345 )	5 年前
Ervin Teng	53c25fb1	Move one-hot out of policy and remove selected_actions	5 年前
GitHub	e4177de0	[change] Organize trainer files a bit better (#3538 )	5 年前
Christopher Goy	ba80b292	format files with pre-commit.	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138 )	4 年前
GitHub	1f5eb9da	add pyupgrade to pre-commit and run (#4239 )	4 年前
GitHub	129f9ddc	[MLA-427] make pyupgrade convert f-strings too (#4244 ) * make pyupgrade convert f-strings too	4 年前
Andrew Cohen	f654df34	fixing tensorflow tests	4 年前
GitHub	cb8e4d25	Add ActionSpec (#4586 ) Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	9689cf2c	remove _action_ from function names	4 年前

21 次代码提交 (336e1c34-b54d-4486-bdc5-42b46d0b5284)