ml-agents

作者	SHA1	备注	提交日期
GitHub	101a8e00	Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037 ) * about to implement orientation cube * oCube spawining works. ready to train * working. about to try com * ready for training * add random rot on episode start * feet now alternate but runs backwards * still running with right leg in front * increased joint strength to 40k * removed texture example * reduced maxAngVel, enabled enhanced determinism, cont spec * rebuilt walker ragdoll to scale 1 * rebuilt ragdoll ready * update walker pair prefab * fixed bp heirarchy * added trained model, renamed scene, usecollisioncallbacks * updated dynamic platforms * added dynamic walker tf file. max speed 5 * DynamicWalker working. has working nn file * collect local rotations * added new dynamic nn file * hip facing reward * Create WalkerDynamic.yaml * fix hip rotation * about to clean up code * added dirIndicator and orentCubeGizmo * clean up * clea...	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
HH	bdc9c1a4	add WalkerDynamicVarialbeSpeed scene and update prefabs	4 年前
HH	950f9a8f	add trained static walker nn file	4 年前
HH	292e8743	try 15k strength. reset jdcontroller to master	4 年前
HH	4a1a3210	remove h rew 10k strength	4 年前
HH	7083559c	about to train 20k strength, no hh, no rolling targ 30M	4 年前
HH	0560848f	implemented distToTarget Instead of targetPos	4 年前
HH	51cbb887	reduce maxSpeed to 10, update prefabs	4 年前
HH	c34ea517	max dist 50 avg core vel	4 年前
GitHub	b51347ac	New Variable Speed Walker Environments (#4301 ) * init * Add reward manager and hurryUpReward * fix hurry reward/ add awful first training * Turn off head height and hurry rew * changed max speed to 15. added small hh rew * add NaN check for reward manager. start vel penalty * add bpVel pen * add new BPVelPen nn file * remove outdated nn file * add randomize speed bool * try rewad product * change coeff to 1 * try avg vel of all bp for reward * move outside loop * try linear inverselerp for vel * add avg rew matchspeed15 nn file. looks much better * save scene * no hand penalty, random walk speed * fix inverse lerp * try new reward falloff * cleanup * added new nn file. don't allow hand contact * update obsv * remove hh rew. add trained no-hh model * add new nn file * new curve * add new models. try no reset * add hh rew * clamp hh * zero rewards if ground contact * switch to approved with movi...	4 年前
HH	f5d3ef52	reimplement cube relTargetPos	4 年前
HH	6c67bf4e	cleanup	4 年前
GitHub	bc0ba098	add option for Burst inference (#4925 )	4 年前
GitHub	85f8b40b	Removing some scenes (#4997 ) * Removing some scenes, All the Static and all the non variable speed environments. Also removed Bouncer, PushBlock, WallJump and reacher. Removed a bunch of visual environements as well. Removed 3DBallHard and FoodCollector (kept Visual and Grid FoodCollector) * readding 3DBallHard * readding pushblock and walljump * Removing tennis * removing mentions of removed environments * removing unused images * Renaming Crawler demos * renaming some demo files * removing and modifying some config files * new examples image? * removing Bouncer from build list * replacing the Bouncer environment with Match3 for llapi tests * Typo in yamato test	4 年前
vincentpierre	4e14879d	Updating the barracuda 1.4.0 (#5291 ) Initial commit second commit. The no-extrinsic was trained without the log reward (reward = prob) while the new one is (reward = log_prob - log_prior) A few results, it looks like Walker-diverse-r05-bigger.onnx is doing something Modified pushblock using next state and action. Did not help Fixing bug that had 9 diversity settings instead of 8 removing results	4 年前
vincentpierre	5985959d	Got 2 modes on Wlker I think	4 年前

17 次代码提交 (49d6b70c-989e-4b5d-9010-2c19a966f11c)