32 次代码提交 (d45b1f73-7b63-401a-bf08-0264372e62e8)

作者 SHA1 备注 提交日期
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 年前
Ervin Teng b1bfb9e8 Delete VisualBanana 5 年前
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 年前
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 年前
GitHub bebdb293 ML-Agents Branding & Color Updates (#2583) 5 年前
GitHub aa861bef Improved SAC hyperparameters for Crawler, Walker (#2635) 5 年前
Vilmantas Balasevicius 2d032594 Further modifications to make PPO work 5 年前
Ervin Teng 258b5d00 Remove unneeded beta param from SAC config 5 年前
GitHub c9b71cee Better hyperparams for GridWorld/SAC (#2776) 5 年前
GitHub 99146e97 1 to 1 Brain to Agent (#2729) 5 年前
Ervin Teng 776b6c8b Add new trainer config for walljump 5 年前
Ervin Teng cc299259 Adjust SAC params 5 年前
GitHub 72bab623 reduce max_steps for Gridworld (#2973) 5 年前
GitHub 45c22d13 Run precommit in its own job, cache the data (#3094) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Yuan Gao 0817c44b Moved the demo files 5 年前
GitHub 14193ada Self-play for symmetric games (#3194) 5 年前
GitHub 0ff8f9af Create ML-Agents Package (#3267) 5 年前
Ervin Teng 9b0b2fed Reduce memory sizes 5 年前
Ervin Teng ab9b082a Fix Hallway summary freq 5 年前
GitHub 6284ea4a Reduce max steps for Bouncer, summary for Hallway (#3343) 5 年前
GitHub 0d6fffc1 Reduce num steps for walljump (#3377) 5 年前
Ervin Teng d4ee7346 Merge commit 'f9c05a61d574305497789b5997f1ae3ea1b1ad3b' into develop-splitpolicyoptimizer 5 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
Ervin Teng c825f13e Reduce PushBlock max_steps 5 年前
Ervin Teng 84e526fa Update trainer config 5 年前
Ervin Teng b7151b51 Remove num_update as param 5 年前
Ervin Teng 66bc2498 Trainer config adjustments 5 年前
Ervin Teng 9b0da1a4 Adjust walker params 5 年前
Ervin Teng 0ff591bc Adjust Reacher steps_per_update 5 年前
Ervin Teng d11f2f73 Increase PushBlock summary steps 5 年前
GitHub 98d4d5be Add worm config for SAC (#3879) 5 年前