ml-agents

941 提交

337 分支

128 Plastic标签

目录树: 3bc092d2

作者	SHA1	备注	提交日期
GitHub	d73b6aa0	Update New Environment Doc (#1404 ) * Simplified rewards and observations; Determined better settings for training within a reasonable amount of time. * Simplified Agent rewards; Added training section that discusses hyperparameters. * Added note about DecisionFrequency. * Updated screenshots and a small clarification in the text. * Tested and updated using v0.6. * Update a couple of images, minor text edit. * Replace with more recent training stats. * resolve a couple of minor review commnts. * Increased the recommended batch and buffer size hyperparameter values. * Fix 2 typos.	6 年前

作者

SHA1

备注

提交日期

GitHub

d73b6aa0

Update New Environment Doc (#1404 )

* Simplified rewards and observations; Determined better settings for training within a reasonable amount of time.

* Simplified Agent rewards; Added training section that discusses hyperparameters.

* Added note about DecisionFrequency.

* Updated screenshots and a small clarification in the text.

* Tested and updated using v0.6.

* Update a couple of images, minor text edit.

* Replace with more recent training stats.

* resolve a couple of minor review commnts.

* Increased the recommended batch and buffer size hyperparameter values.

* Fix 2 typos.

6 年前

1 次代码提交 (3bc092d2-28fb-4645-83fa-3f3ed898cc90)