ml-agents

15 commits

337 branches

128 Plastic labels

Tree: 78b5933f

Author

SHA1

Message

Commit date

GitHub

51621334

State Stacking & Banana Environment (#262)

* Add support for stacking past n states to allow network to learn temporal dependencies.
* Add Banana Collector environment for demonstrating partially observable multi-agent environments.
* Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features.
* Rework Tennis environment to be continuous control and trainable in 100k steps.

7 years ago
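
The state stacking described in this commit can be illustrated with a minimal sketch. This is not the ml-agents implementation: the class name `StateStacker`, its parameters, and the choice to zero-pad the history at the start of an episode are all assumptions made for illustration. The idea is that the network receives the last n state vectors concatenated together, so it can infer quantities such as velocity that a single state (as in 3DBall Hard) does not contain.

```python
from collections import deque


class StateStacker:
    """Hypothetical helper: keeps the last n states and concatenates them, oldest first."""

    def __init__(self, state_size, n_stacked):
        self.state_size = state_size
        self.n_stacked = n_stacked
        self.reset()

    def reset(self):
        # Assumption: pad with zeros so the stacked vector has a fixed
        # length (state_size * n_stacked) from the very first step.
        self.history = deque(
            ([0.0] * self.state_size for _ in range(self.n_stacked)),
            maxlen=self.n_stacked,
        )

    def push(self, state):
        if len(state) != self.state_size:
            raise ValueError("unexpected state size")
        # Appending to a full deque drops the oldest state automatically.
        self.history.append(list(state))
        # Flatten the history into one vector for the network input.
        return [value for past in self.history for value in past]


stacker = StateStacker(state_size=2, n_stacked=3)
first = stacker.push([1.0, 2.0])
second = stacker.push([3.0, 4.0])
```

With `n_stacked=3`, the network input is three times the size of a single state, which is the trade-off against using a recurrent (LSTM) memory instead.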

GitHub

f134016b

On Demand Decision (#308)

* On Demand Decision: Use RequestDecision and RequestAction
* New Agent Inspector: Use it to set On Demand Decision
* New BrainParameters interface
* LSTM memory size is now set in Python
* New C# API
* Semantic Changes
* Replaced RunMDP
* New Bouncer Environment to test On Demand Decision

7 years ago

2 commits (78b5933f-4750-4143-8900-0be5b403ab75)