ml-agents

Author	SHA1	Message	Commit date
GitHub	3b866e9f	Use Clipped Gaussian (#649) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 years ago
sankalp04	c6fba86a	tennis reset parameter implementation ported over	5 years ago
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550)	5 years ago
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555)	5 years ago
GitHub	5d2e466f	Fix Code convention warnings in Rider. (#2801)	5 years ago
GitHub	14193ada	Self-play for symmetric games (#3194)	5 years ago
GitHub	411bb64a	Renaming Agent's methods (#3557) * [skip ci] Renamed methods in the Agent class WARNING, the user when implementing obsolete methods will see the message :Member `old method` overrides obsolete member `old method`. Add the Obsolete attribute to `old method`. It will not suggest the new method to override. * [skip ci] Updated the example environment * [skip ci] Updated migrating and changelog * [skip ci] Editing the docs * [skip ci] Missing docs * :+1 * Update docs/Getting-Started-with-Balance-Ball.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * Update docs/Learning-Environment-Create-New.md Co-Authored-By: Chris Elion <chris.elion@unity3d.com> * [skip ci] documentation changes * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Getting-Started-with-Balance-Ball.md * [skip ci] Update docs/Gett...	5 years ago
Andrew Cohen	d9f1a2f5	more experiments for self-play	5 years ago
Andrew Cohen	8431ecb5	tennis reward fix	5 years ago
Andrew Cohen	5d659946	update tennis reward function	5 years ago
Andrew Cohen	9f36cd36	added floorhit obs tennis	5 years ago
Andrew Cohen	e5b883db	added bounce obs to agent/more downward force on ball	5 years ago
Andrew Cohen	1c4ba1a5	add timestep bonus to loss	5 years ago
Andrew Cohen	a6e6e63e	timestep penalty on loss only	5 years ago
Andrew Cohen	251dcc76	remove timepenalty from tennis	5 years ago
Andrew Cohen	b7bd4c2c	reduce winning reward	5 years ago
Andrew Cohen	1c2e1d79	increase beta	5 years ago
Andrew Cohen	d77f2566	energy usage penalty to prevent superstition on serve	5 years ago
Andrew Cohen	a8f2f613	no energy penalty	5 years ago
Andrew Cohen	69acdeec	fixed reset tennis	5 years ago
Andrew Cohen	84f231ce	time penalty	5 years ago
Andrew Cohen	43d5ef17	fixed opponent setting	5 years ago
Andrew Cohen	0c17dc1b	cannot hit scenery tennis	5 years ago
Andrew Cohen	7475ad11	tunneling is a loss	5 years ago
GitHub	e7916b08	add pre-commit hook for dotnet-format (#4362)	4 years ago

25 commits (7e7743d1-03a2-4a84-a127-380dea067341)
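
The message for commit 3b866e9f describes clipping internal continuous-control values to [-3, 3], rescaling them to [-1, 1] for output, and a ScaleAction() helper that maps [-1, 1] onto an arbitrary range. The sketch below illustrates that arithmetic only. The function names `clip_and_normalize` and `scale_action`, their signatures, and the linear mapping are assumptions made for illustration; they are not taken from the repository and may differ from the actual implementation.

```python
def clip_and_normalize(raw, limit=3.0):
    """Clip a raw value to [-limit, limit], then rescale it to [-1, 1].

    Hypothetical helper: limit=3.0 mirrors the [-3, 3] range named in the
    commit message; the real model code may be structured differently.
    """
    clipped = max(-limit, min(limit, raw))
    return clipped / limit


def scale_action(value, low, high):
    """Linearly map a value in [-1, 1] onto [low, high].

    Hypothetical stand-in for the ScaleAction() function mentioned in the
    commit message; a linear mapping is assumed here.
    """
    return low + (value + 1.0) * 0.5 * (high - low)


if __name__ == "__main__":
    # A raw sample outside the clip range saturates at the boundary.
    action = clip_and_normalize(4.5)
    # Map the normalized action onto an example range of [0, 10].
    print(scale_action(action, 0.0, 10.0))
```

Under these assumptions, an environment author would keep reading model output in [-1, 1] and call the scaling helper only where an actuator needs a different range.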