ml-agents

目录树: d993c549

作者	SHA1	备注	提交日期
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 年前
Arthur Juliani	22d931c0	Add comments to Reacher and re-train model w/ epsilon needed	7 年前
Vincent Gao	3a9f500b	Updated the Reacher's model file, also updated the Reacher's agent code from eulerAngle to quaternion Updated the Reacher's model file, also updated the Reacher's agent code from eulerAngle to quaternion	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前