ml-agents

目录树: 54c20b78

作者	SHA1	备注	提交日期
GitHub	101a8e00	Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037 ) * about to implement orientation cube * oCube spawining works. ready to train * working. about to try com * ready for training * add random rot on episode start * feet now alternate but runs backwards * still running with right leg in front * increased joint strength to 40k * removed texture example * reduced maxAngVel, enabled enhanced determinism, cont spec * rebuilt walker ragdoll to scale 1 * rebuilt ragdoll ready * update walker pair prefab * fixed bp heirarchy * added trained model, renamed scene, usecollisioncallbacks * updated dynamic platforms * added dynamic walker tf file. max speed 5 * DynamicWalker working. has working nn file * collect local rotations * added new dynamic nn file * hip facing reward * Create WalkerDynamic.yaml * fix hip rotation * about to clean up code * added dirIndicator and orentCubeGizmo * clean up * clea...	5 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	b51347ac	New Variable Speed Walker Environments (#4301 ) * init * Add reward manager and hurryUpReward * fix hurry reward/ add awful first training * Turn off head height and hurry rew * changed max speed to 15. added small hh rew * add NaN check for reward manager. start vel penalty * add bpVel pen * add new BPVelPen nn file * remove outdated nn file * add randomize speed bool * try rewad product * change coeff to 1 * try avg vel of all bp for reward * move outside loop * try linear inverselerp for vel * add avg rew matchspeed15 nn file. looks much better * save scene * no hand penalty, random walk speed * fix inverse lerp * try new reward falloff * cleanup * added new nn file. don't allow hand contact * update obsv * remove hh rew. add trained no-hh model * add new nn file * new curve * add new models. try no reset * add hh rew * clamp hh * zero rewards if ground contact * switch to approved with movi...	4 年前
HH	6c67bf4e	cleanup	4 年前

4 次代码提交 (54c20b78-6288-4d85-9a01-e324aab0f3ec)