ml-agents

6 提交

337 分支

128 Plastic标签

目录树: 7e7743d1

作者	SHA1	备注	提交日期
GitHub	101a8e00	Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037 ) * about to implement orientation cube * oCube spawining works. ready to train * working. about to try com * ready for training * add random rot on episode start * feet now alternate but runs backwards * still running with right leg in front * increased joint strength to 40k * removed texture example * reduced maxAngVel, enabled enhanced determinism, cont spec * rebuilt walker ragdoll to scale 1 * rebuilt ragdoll ready * update walker pair prefab * fixed bp heirarchy * added trained model, renamed scene, usecollisioncallbacks * updated dynamic platforms * added dynamic walker tf file. max speed 5 * DynamicWalker working. has working nn file * collect local rotations * added new dynamic nn file * hip facing reward * Create WalkerDynamic.yaml * fix hip rotation * about to clean up code * added dirIndicator and orentCubeGizmo * clean up * clea...	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	b51347ac	New Variable Speed Walker Environments (#4301 ) * init * Add reward manager and hurryUpReward * fix hurry reward/ add awful first training * Turn off head height and hurry rew * changed max speed to 15. added small hh rew * add NaN check for reward manager. start vel penalty * add bpVel pen * add new BPVelPen nn file * remove outdated nn file * add randomize speed bool * try rewad product * change coeff to 1 * try avg vel of all bp for reward * move outside loop * try linear inverselerp for vel * add avg rew matchspeed15 nn file. looks much better * save scene * no hand penalty, random walk speed * fix inverse lerp * try new reward falloff * cleanup * added new nn file. don't allow hand contact * update obsv * remove hh rew. add trained no-hh model * add new nn file * new curve * add new models. try no reset * add hh rew * clamp hh * zero rewards if ground contact * switch to approved with movi...	4 年前

作者

SHA1

备注

提交日期

GitHub

101a8e00

Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037 )

* about to implement orientation cube

* oCube spawining works. ready to train

* working. about to try com

* ready for training

* add random rot on episode start

* feet now alternate but runs backwards

* still running with right leg in front

* increased joint strength to 40k

* removed texture example

* reduced maxAngVel, enabled enhanced determinism, cont spec

* rebuilt walker ragdoll to scale 1

* rebuilt ragdoll ready

* update walker pair prefab

* fixed bp heirarchy

* added trained model, renamed scene, usecollisioncallbacks

* updated dynamic platforms

* added dynamic walker tf file. max speed 5

* DynamicWalker working. has working nn file

* collect local rotations

* added new dynamic nn file

* hip facing reward

* Create WalkerDynamic.yaml

* fix hip rotation

* about to clean up code

* added dirIndicator and orentCubeGizmo

* clean up

* clea...

4 年前

GitHub

a28e2767

Update add-fire to latest master, including Policy refactor (#4263 )

* Update Dockerfile

* Separate send environment data from reset (#4128)

* Fixed a typo on ML-Agents-Overview.md (#4130)

Fixed redundant "to" word from the sentence since it is probably a typo in document.

* Updated the badge’s link to point to the newest doc version

* Replaced all of the doc to release_3_doc

* Fix 3DBall and 3DBallHard SAC regressions (#4132)

* Move memory validation to settings

* Update docs

* Add settings test

* Update to release_3 in installation.md (#4144)

* rename to SideChannelManager +backcompat (#4137)

* Remove comment about logo with --help (#4148)

* [bugfix] Make FoodCollector heuristic playable (#4147)

* Make FoodCollector heuristic playable

* Update changelog

* script to check for old release links and references (#4153)

* Remove package validation suite from Project (#4146)

* RayPerceptionSensor: handle empty and invalid tags (#4155...

4 年前

GitHub

b51347ac

New Variable Speed Walker Environments (#4301 )

* init

* Add reward manager and hurryUpReward

* fix hurry reward/ add awful first training

* Turn off head height and hurry rew

* changed max speed to 15. added small hh rew

* add NaN check for reward manager. start vel penalty

* add bpVel pen

* add new BPVelPen nn file

* remove outdated nn file

* add randomize speed bool

* try rewad product

* change coeff to 1

* try avg vel of all bp for reward

* move outside loop

* try linear inverselerp for vel

* add avg rew matchspeed15 nn file. looks much better

* save scene

* no hand penalty, random walk speed

* fix inverse lerp

* try new reward falloff

* cleanup

* added new nn file. don't allow hand contact

* update obsv

* remove hh rew. add trained no-hh model

* add new nn file

* new curve

* add new models. try no reset

* add hh rew

* clamp hh

* zero rewards if ground contact

* switch to approved with movi...

4 年前

3 次代码提交 (7e7743d1-03a2-4a84-a127-380dea067341)