* Added broadcast to the Player and Heuristic brains.
This allows the Python API to record the actions taken along with the states and rewards (a sketch follows).
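A minimal sketch of how a recorder built on the Python API might use this. The `BrainInfo` attribute names used here (`states`, `actions`, `rewards`, `local_done`) follow the wording of these notes but should be treated as assumptions, not the exact API:

```python
# Hypothetical recorder: steps a broadcasting Player/Heuristic brain and logs
# (states, actions, rewards) triples. BrainInfo field names are assumptions.
from unityagents import UnityEnvironment

env = UnityEnvironment(file_name="Tennis")
brain_name = env.brain_names[0]

trajectory = []
info = env.reset()[brain_name]
for _ in range(1000):
    # No actions are passed in: the broadcasting brain picks its own actions
    # and reports them back through the communicator.
    info = env.step()[brain_name]
    trajectory.append((info.states, info.actions, info.rewards))
    if any(info.local_done):
        info = env.reset()[brain_name]
env.close()
```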
* Removed the broadcast checkbox.
Added a Handshake method for the communicator.
The academy will attempt the handshake regardless of the brains present.
Player and Heuristic brains will send their information through the communicator but will not receive commands.
* Bug fix: the environment only requests actions from external brains when they are unique.
* Added a warning in case no brains are set to external.
* Fixed the instantiation of coreBrains.
Fixed the conversion of actions to arrays in the BrainInfo received from step.
* The default discrete action is now 0.
Bug fix for the discrete broadcast action (the action size should be one in Agents.cs).
Modified Tennis so that the default action is no action.
Modified TemplateDecision.cs to ensure non-null values are returned from Decide() and MakeMemory().
* Minor fixes.
* need to convert the s...
* Add support for stacking the past n states, allowing the network to learn temporal dependencies (see the sketch after this group of changes).
* Add Banana Collector environment for demonstrating partially observable multi-agent environments.
* Add 3DBall Hard, which lacks velocity information in its state representation. Used as a test for the LSTM and state-stacking features.
* Rework Tennis environment to be continuous control and trainable in 100k steps.
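As a rough illustration of the state-stacking idea above (class and parameter names here are illustrative, not the trainer's actual implementation), the last n observations can be kept in a fixed-length buffer and concatenated before being fed to the network:

```python
from collections import deque
import numpy as np

class StateStacker:
    """Keeps the last n state vectors and returns their concatenation, so a
    feed-forward network can infer velocities and other temporal structure
    (e.g. for 3DBall Hard, which omits velocity from the state)."""

    def __init__(self, state_size, n_stack=4):
        self.state_size = state_size
        self.buffer = deque(maxlen=n_stack)
        self.reset()

    def reset(self):
        # Pad with zeros so the stacked vector has a fixed size from step one.
        self.buffer.extend(
            np.zeros(self.state_size, dtype=np.float32)
            for _ in range(self.buffer.maxlen))

    def add(self, state):
        self.buffer.append(np.asarray(state, dtype=np.float32))
        return np.concatenate(list(self.buffer))  # shape: (state_size * n_stack,)
```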
This PR makes the following changes:
* Moves clipping of the continuous control model into the model itself. Output is now always in [-1, 1].
* Internal model values are now clipped to [-3, 3] before being rescaled to [-1, 1] for output. This improves training performance by providing a wider range of values within which the PDF of the Gaussian can fall; an output range of [-1, 1] is used to be more environment-creator friendly (sketched after this list).
* Fixes an issue where epsilon was erroneously being used to reconstruct the old probabilities during the PPO update, leading to reduced learning performance (see the ratio sketch after this list).
* Introduces a ScaleAction() function within Python to easily rescale values from [-1, 1] to an arbitrary range (sketched after this list).
* Re-trains all continuous control (CC) models using the improved algorithm. All performance levels are equal or better; in the case of Crawler, the improvement is drastic.
* Update documentation appropriately.
* Makes miscellaneous minor code style and optimization improvements within the environments.
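The clip-and-rescale step from the second bullet, sketched in plain numpy (the real change lives inside the TensorFlow model graph, so this is an illustration of the behavior, not the actual code):

```python
import numpy as np

def clip_and_rescale(raw_output):
    """Clip sampled model values to [-3, 3], then rescale to [-1, 1].

    Clipping at +/-3 (rather than +/-1) leaves a wider range in which the
    Gaussian's PDF can fall, which this PR reports improves training.
    """
    return np.clip(raw_output, -3.0, 3.0) / 3.0
```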
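For context on the epsilon fix: in PPO, the probability ratio must be computed against the old probabilities stored at collection time; re-deriving them during the update (e.g. from the Gaussian sampling epsilon, as in the bug) distorts the ratio. A generic sketch, not the repo's actual code, with `clip` here being the standard PPO clipping parameter (distinct from the sampling epsilon in the bug):

```python
import numpy as np

def ppo_clipped_surrogate(new_log_probs, old_log_probs, advantages, clip=0.2):
    """Clipped PPO objective using *stored* old log-probabilities.

    `old_log_probs` must come from the rollout buffer; reconstructing them
    during the update is exactly the bug this PR fixes.
    """
    ratio = np.exp(new_log_probs - old_log_probs)
    return np.mean(np.minimum(ratio * advantages,
                              np.clip(ratio, 1.0 - clip, 1.0 + clip) * advantages))
```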
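Finally, a sketch of what ScaleAction() could look like (the actual name casing and signature in the codebase may differ): a linear map from [-1, 1] to [low, high].

```python
def scale_action(value, low, high):
    """Linearly map a value (or numpy array) from [-1, 1] to [low, high]."""
    return low + 0.5 * (value + 1.0) * (high - low)

# Example: scale_action(0.0, 0.0, 10.0) == 5.0
```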