766 次代码提交 (3607f062-47c9-441c-b8ff-003cd2baed04)

作者 SHA1 备注 提交日期
Deric Pang 20dd50c4 Addressing feedback from offline meeting. 6 年前
GitHub 3900ed66 Merge pull request #1083 from Unity-Technologies/develop-flat-code-restructure 6 年前
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 年前
GitHub f8df71a0 Revert "Release v0.5 (Develop) (#1203)" (#1222) 6 年前
GitHub 29084e77 Curriculum learning reward thresholding bug fix (#1141) 6 年前
GitHub ab6eb8dc Fix TF Nan bug (#1178) 6 年前
GitHub 63062b92 updated the Pyramids model (#1184) 6 年前
GitHub 50228570 updated the walljump model for the multi-discrete action space (#1198) 6 年前
GitHub 25495874 Merge pull request #1223 from Unity-Technologies/release-v0.5 6 年前
GitHub 560f1bd7 Merge pull request #1224 from Unity-Technologies/release-v0.5 6 年前
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 年前
Arthur Juliani 18cea1f2 Put Time Horizon back into the default training config for BC (#1291) 6 年前
GitHub bcd487a1 Develop environment bc fix and doc update (#1317) 6 年前
GitHub f99dc261 Rename brains to new names (#1321) 6 年前
vincentpierre b5edc64a typos in the config 6 年前
Arthur Juliani 107d734e New model for the dynamic crawler (#1322) 6 年前
GitHub 285d33c7 Fix brain name (#1349) 6 年前
vincentpierre 5c060417 Added PushBlock models, fixed trainer config and fixed Learning brain asset (#1344) 6 年前
Arthur Juliani 59126c8c Release v0.6 tennis (#1350) 6 年前
vincentpierre 6843dac6 Release v0.6 marwan tf (#1351) 6 年前
vincentpierre 148bd304 updated the models for the soccer, gridworld and 3dballhard (#1328) 6 年前
GitHub 547f0e98 Merge pull request #1361 from Unity-Technologies/release-v0.6 6 年前
GitHub 8c7c62f0 Doc clarification and typo fix for offline BC (#1481) 6 年前
GitHub c8cc5a29 Merge pull request #1495 from Unity-Technologies/release-v0.6 6 年前
GitHub a196dde2 Merge pull request #1494 from Unity-Technologies/release-v0.6 6 年前
Jonathan Harper 603485bd Update curricula brain names for 0.6 6 年前
GitHub 8b1f0a38 Merge pull request #1589 from Unity-Technologies/hotfix-0.6.0a 6 年前
GitHub c0c289cc Merge pull request #1588 from Unity-Technologies/hotfix-0.6.0a 6 年前
GitHub 610b8852 Release v0.8.2 update models (#2178) 5 年前
GitHub d5f6b7f8 Merge pull request #2157 from Unity-Technologies/release-v0.8.2 5 年前
GitHub dcef9f69 Merge pull request #2179 from Unity-Technologies/release-v0.8.2 5 年前
GitHub 40c7fc48 Merge branch 'develop' into protobuf_update 5 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub be4292fb Add different types of visual encoder (nature cnn/resnet) 5 年前
GitHub 6a212f73 Improvements for GAIL (#2296) 5 年前
Ervin T a46f3faa Enable generalization training (#2232) 5 年前
Ervin T ca32cadf Fix default for vis_encode_type (#2330) 5 年前
Ervin T 00a3b592 Fix docs for Generalization (#2334) 5 年前
GitHub 4991d83f Merge pull request #2346 from Unity-Technologies/release-0.9.0 5 年前
GitHub 53475207 Merge pull request #2380 from Unity-Technologies/release-0.9.0 5 年前
sankalp04 34127b76 Example parameter sampling file config 5 年前
GitHub 6a81a2f4 Add Soft Actor-Critic as trainer option (#2341) 5 年前
Ervin Teng b1bfb9e8 Delete VisualBanana 5 年前
GitHub 36528481 Merge pull request #2522 from Unity-Technologies/develop-cleanupconfig 5 年前
Yuan Gao 0c42db82 Update the offline_bc_config path 5 年前
GitHub d80812be Merge pull request #2526 from Unity-Technologies/develop-update-offline-bc 5 年前
GitHub 3df585d9 Fix issue where SAC encoder type is always simple (#2548) 5 年前
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 年前
GitHub bebdb293 ML-Agents Branding & Color Updates (#2583) 5 年前
GitHub aa861bef Improved SAC hyperparameters for Crawler, Walker (#2635) 5 年前
GitHub b2fa2268 Merge pull request #2648 from Unity-Technologies/release-0.10.0 5 年前
GitHub d1ebca5c Merge pull request #2649 from Unity-Technologies/release-0.10.0 5 年前
Vilmantas Balasevicius 2d032594 Further modifications to make PPO work 5 年前
Anupam Bhatnagar cc208c00 resolving conflicts 5 年前
GitHub 5f5ccfa0 Feature Deprecation : Online Behavioral Cloning (#2659) 5 年前
Ervin Teng 258b5d00 Remove unneeded beta param from SAC config 5 年前
GitHub f22c41db Merge pull request #2704 from Unity-Technologies/hotfix-0.10.1 5 年前
Anupam Bhatnagar b733b34c resolving conflicts 5 年前
Chris Elion a1967c19 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 7e68f08f Merge Hotfix 0.10.1 to Develop (#2708) 5 年前
Hunter c92a9008 init 5 年前
Hunter 47d31907 added new nn files 5 年前
GitHub c9b71cee Better hyperparams for GridWorld/SAC (#2776) 5 年前
Hunter 70e7a646 clean up config 5 年前
GitHub 99146e97 1 to 1 Brain to Agent (#2729) 5 年前
Ervin Teng 776b6c8b Add new trainer config for walljump 5 年前
Ervin Teng cc299259 Adjust SAC params 5 年前
Hunter 7c1a38e0 add drawspheres gizmo to perception 5 年前
Hunter 90457de5 added builder env. observing blocks pos 5 年前
Hunter 8b55f522 more testing with high targets 5 年前
Chris Elion 3d8a70fb Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
GitHub 495873e5 Merge pull request #2833 from Unity-Technologies/release-0.11.0 5 年前
GitHub 72bab623 reduce max_steps for Gridworld (#2973) 5 年前
Ervin Teng 58a4ea71 Increase max steps for 3DBall 5 年前
GitHub cdf307bb add BC FoodCollector config (#2987) 5 年前
GitHub a4c111f4 Merge pull request #3012 from Unity-Technologies/release-0.12.0-to-develop 5 年前
GitHub d4780a55 Merge pull request #3010 from Unity-Technologies/release-0.12.0-to-master 5 年前
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 年前
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 年前
GitHub 1fa07edb Remove Standalone Offline BC Training (#2969) 5 年前
GitHub 45c22d13 Run precommit in its own job, cache the data (#3094) 5 年前
Andrew Cohen 082789ea Merge branch 'master' into develop-magic-string 5 年前
Ervin Teng 1bd791e5 Merge branch 'master' into develop-agentprocessor 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Andrew Cohen c8514c18 Merge branch 'master' into develop-magic-string 5 年前
andrewcoh 317c59d2 Move FoodCollector to gail_config and remove offline_bc_config.yaml (#3170) 5 年前
GitHub d985dded Merge branch 'master' into merge-release-0.13.0 5 年前
GitHub ad42705d Merge pull request #3185 from Unity-Technologies/merge-release-0.13.0 5 年前
GitHub b0a2a54f Add 'run-experiment' script, simpler curriculum config (#3186) 5 年前
Yuan Gao 0817c44b Moved the demo files 5 年前
GitHub b3d3a9d6 Merge pull request #3202 from Unity-Technologies/develop-move-demo 5 年前
Ervin Teng 98ed88b1 Merge branch 'master' into develop-separatevalue 5 年前
Ervin Teng 29f3330f Merge master into hotfix-0.13.1 4 年前
GitHub 14193ada Self-play for symmetric games (#3194) 4 年前
GitHub 0ff8f9af Create ML-Agents Package (#3267) 4 年前
Ervin Teng db249ceb Merge branch 'master' into develop-splitpolicyoptimizer 4 年前
Ervin Teng 9b0b2fed Reduce memory sizes 4 年前
Ervin Teng ab9b082a Fix Hallway summary freq 4 年前
GitHub 6284ea4a Reduce max steps for Bouncer, summary for Hallway (#3343) 4 年前
GitHub 8eb8e279 Fix WallJump yaml indentation in docs and curriculum config (#3340) 4 年前
GitHub ae97ab3a Soccer refactor (#3331) 4 年前
GitHub 0d6fffc1 Reduce num steps for walljump (#3377) 4 年前
Ervin Teng d4ee7346 Merge commit 'f9c05a61d574305497789b5997f1ae3ea1b1ad3b' into develop-splitpolicyoptimizer 4 年前
GitHub c1340b0e Hotfix docs odd (#3379) 4 年前
Ervin Teng d2f67c50 Reduce num steps for walljump 4 年前
Andrew Cohen 23f74f21 soccer fives 4 年前
Andrew Cohen 5c7a1fbf cloud run 4 年前
Andrew Cohen 7d90b042 soccerfives curricula 4 年前
Anupam Bhatnagar d5617834 [bug-fix] Update the gail config for the new steps in 0.14.0 (#3475) 4 年前
Anupam Bhatnagar be7e2e3a Fix demo path for pushblock (#3489) 4 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 4 年前
GitHub 5e78e5d4 [bug-fix] Update the gail config for the new steps in 0.14.0 (#3475) 4 年前
GitHub 472f9f0e Merge branch 'master' into develop-badEnvReturnCode 4 年前
Andrew Cohen b7d77740 Merge branch 'master' into soccer-fives 4 年前
Ervin Teng 1859f252 Merge commit 'fbcdd83c087135f870e785cc72e5ff9a7e898e3a' into develop-splitpolicyoptimizer 4 年前
Andrew Cohen 39a76867 added more backward raycasts to twos and fives 4 年前
GitHub 3f8bbaf1 Fix demo path for pushblock (#3489) 4 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 4 年前
Andrew Cohen 5b0aca29 Merge branch 'master' into soccer-fives 4 年前
Andrew Cohen 4edb7f41 updated config/soccer brains 4 年前
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 4 年前
Ervin Teng d57124b4 Merge 'master' into develop-removeactionholder 4 年前
Anupam Bhatnagar e04fcd71 Merge branch 'master' into master-into-release-0.14.1 4 年前
Ervin Teng d10d27e2 Merge commit '9450d3fc0dda4547a14c5ed1b7e13fc6e3a15413' into develop-nopreviousactions 4 年前
Andrew Cohen 30725c27 2v1 soccer config and env 4 年前
Ervin Teng c825f13e Reduce PushBlock max_steps 4 年前
Ervin Teng c3ff4a31 Cut bouncer max steps 4 年前
Anupam Bhatnagar 21a526c5 [skip ci] shorter 3dball run 4 年前
GitHub 9d9c8a8a Merge pull request #3576 from Unity-Technologies/develop-shortentrainerconfigs 4 年前
GitHub e3af96ca Merge branch 'master' into develop-demo-load-seek 4 年前
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 4 年前
GitHub 0d3fd17e [bug-fix] Increase 3dballhard and GAIL default steps (#3636) 4 年前
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 4 年前
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 4 年前
GitHub 4fa9735e [bug-fix] Increase 3dballhard and GAIL default steps (#3636) (#3647) 4 年前
Andrew Cohen eefc4811 Merge branch 'master' into self-play-mutex 4 年前
GitHub 3a771afa Rename Generalization -> Environment Parameter Randomization (#3646) 4 年前
Andrew Cohen 072b4135 soccer 2v1 on the cloud 4 年前
Ervin Teng 84e526fa Update trainer config 4 年前
Andrew Cohen c70cfa63 running soccer for more steps 4 年前
Andrew Cohen fb993986 Merge branch 'master' into self-play-mutex 4 年前
Andrew Cohen b42c9482 Merge branch 'self-play-mutex' into soccer-2v1 4 年前
Andrew Cohen 5d21e211 tennis config 4 年前
Andrew Cohen a13f107f updated self-play doc for asymmetric games/changed current_self->current_best 4 年前
Andrew Cohen f7e76054 tennis config restored 4 年前
Andrew Cohen 6e43bbf4 soccer config 4 年前
Andrew Cohen bc611906 removed team-change CLI 4 年前
Andrew Cohen 42518d84 Merge branch 'self-play-mutex' into soccer-2v1 4 年前
Andrew Cohen 650ec121 Merge branch 'self-play-mutex' into soccer-2v1 4 年前
Andrew Cohen 941b8ae7 Strikers vs goalie added 4 年前
Andrew Cohen 1ba1bc22 tennis config 4 年前
Andrew Cohen 345fa382 current_best_ratio -> latest_model_ratio 4 年前
Andrew Cohen c7a34413 Merge branch 'self-play-mutex' into soccer-2v1 4 年前
Andrew Cohen 61d38b15 rerunning all self-play 4 年前
Andrew Cohen d9f1a2f5 more experiments for self-play 4 年前
Andrew Cohen d7b8cf16 CubeWars 4 年前
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 4 年前
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 4 年前
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 4 年前
Andrew Cohen 72706301 soccer curriculum 4 年前
Andrew Cohen c13259fd curriculum for small soldiers 4 年前
Ervin Teng b7151b51 Remove num_update as param 4 年前
Andrew Cohen e91f5233 reduced steps cubewars 4 年前
Andrew Cohen 9fed4985 tennis curriculum 4 年前
bhh 35736d30 added scripts 4 年前
Andrew Cohen e4f7f2a6 removed curriculum tennis 4 年前
Anupam Bhatnagar d94ae012 [skip ci] shorter 3dball run 4 年前
Andrew Cohen 2e7f8f41 Merge branch 'develop-cubewars' into asymm-envs 4 年前
Andrew Cohen a0985d94 increased striker goalie steps 4 年前
bhh dc9fcd46 loosened joints retrained looking good 4 年前
Anupam Bhatnagar 06a54ae8 step increment moved to _update_policy, fixed exit status issue 4 年前
Anupam Bhatnagar 5d180caf [skip ci] modify learning rate in horovod optimizer 4 年前
bhh 1ecc8924 final training done. ready to go. 4 年前
GitHub aae58330 Merge branch 'master' into develop-add-inference-examples 4 年前
Ervin Teng 66bc2498 Trainer config adjustments 4 年前
Andrew Cohen 5a5e13fa soccertwos config 4 年前
Anupam Bhatnagar d49ceecc [skip ci] moving summary writer to update_policy 4 年前
bhh 9e40ed64 update config to 3.5M steps 4 年前
Andrew Cohen 44e6fa7b soccer 1e8 timesteps/Tennis existential penalty 4 年前
Andrew Cohen 900ae050 new SoccerTwos brain 4 年前
Anupam Bhatnagar 86e16a64 [skip ci] tweaking 3dball configs 4 年前
Andrew Cohen 6f1f89f6 new soccertwos brain 4 年前
Hunter-Unity 2751b3a4 updated crawlerAgent code to match worm env 4 年前
Andrew Cohen a90812a0 soccer twos for 50mill 4 年前
Ervin Teng 9b0da1a4 Adjust walker params 4 年前
Andrew Cohen 384f6439 reduced laser cd/increased heal 4 年前
Ervin Teng 0ff591bc Adjust Reacher steps_per_update 4 年前
Ervin Teng d11f2f73 Increase PushBlock summary steps 4 年前
GitHub 9695b89a StrikerVsGoalie and SoccerTwos env improvements (#3699) 4 年前
Arthur Juliani c577ce26 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
Andrew Cohen 34349a2f reduce latest_model prob 4 年前
Andrew Cohen 72bd2c5d Merge branch 'soccer-2v1' into asymm-envs 4 年前
Andrew Cohen 8431ecb5 tennis reward fix 4 年前
Andrew Cohen 1ac4dfb3 update Tennis max step 4 年前
GitHub 4d23200b [refactor] Run Trainers in separate threads (#3690) 4 年前
Ervin Teng 9cd2c034 Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-sac-apex 4 年前
Andrew Cohen 9b552f08 increased latest_model soccer prob 4 年前
Andrew Cohen 3daff010 svg config 4 年前
Andrew Cohen 47548ee4 tennis curriculum 4 年前
Andrew Cohen 79276531 new goalie 4 年前
Andrew Cohen 3bd33889 Merge branch 'soccer-2v1' into asymm-envs 4 年前
Andrew Cohen e5c62cb8 update striker vs goalie brain/retrain 4 年前
Andrew Cohen d2cf07be increased tennis xurriculum 4 年前
Andrew Cohen 1d020fa7 Merge branch 'soccer-2v1' into asymm-envs 4 年前
Andrew Cohen 064bcdad Merge branch 'soccer-2v1' into asymm-envs 4 年前
GitHub f8909ab1 Add New 3 Joint Ragdoll Worm Environment (#3798) 4 年前
Andrew Cohen d54fdfbf increase batch/buff/erbeta 4 年前
Andrew Cohen 028a8d59 larger network/6 stacked obs 4 年前
Andrew Cohen ca6cdff3 fixed broken prefab... 4 年前
Andrew Cohen 32f562d9 striker goalie increase latest_mod ratio 4 年前
Andrew Cohen 3df4f4a3 smaller window cubewar 4 年前
Andrew Cohen 717fae65 reduce tennis latest_model_ratio 4 年前
Andrew Cohen 1c4ba1a5 add timestep bonus to loss 4 年前
Andrew Cohen a1143427 increased entro bonus tennis 4 年前
Andrew Cohen e9f570aa slightly larger beta tennis 4 年前
Andrew Cohen 3f806353 increased beta 4 年前
Andrew Cohen 54972202 tuning beta tennis 4 年前
Andrew Cohen 0871fc96 remove beta/no curr tennis 4 年前
Andrew Cohen 547f3192 beta .05 4 年前
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
Andrew Cohen 04ac54a3 reduced tennis time horizon 4 年前
GitHub f86fc81d [refactor] Move configuration files to single YAML file (#3791) 4 年前
Andrew Cohen a5ca5e0c reduce beta for new reward func 4 年前
Andrew Cohen fda39c3d more beta tuning... 4 年前
GitHub 2f80dd02 Worm SAC configs (#3912) 4 年前
GitHub 98d4d5be Add worm config for SAC (#3879) 4 年前
Andrew Cohen e3f6c716 higher granularity curr 4 年前
Andrew Cohen 39e0bbe9 remove debug log 4 年前
Andrew Cohen 1c2e1d79 increase beta 4 年前
Andrew Cohen e7922b68 trying larger beta 4 年前
Andrew Cohen 376af981 lower agent height 4 年前
Andrew Cohen 14df5d02 increase gamma 4 年前
Andrew Cohen 052f1c87 reduce gamma 4 年前
Andrew Cohen 8fba6faa increase network capacity 4 年前
Andrew Cohen bbc1014a reduce learning rate 4 年前
Andrew Cohen d5428487 addforce and static walls 4 年前
Chris Elion 68b68396 Merge remote-tracking branch 'origin/master' into release_1_to_master 4 年前
Andrew Cohen 8ef0b3a8 opponent observations 4 年前
Chris Elion ff7318c2 Merge remote-tracking branch 'origin/master' into release_1_to_master 4 年前
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 4 年前
Andrew Cohen b6784390 frequent swapping of diverse opponents tennis 4 年前
Andrew Cohen 69acdeec fixed reset tennis 4 年前
Andrew Cohen a2f8319a Merge branch 'master' into asymm-envs 4 年前
Andrew Cohen d9d6c172 remove threading 4 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 4 年前
Hunter-Unity e891d9b5 about to implement orientation cube 4 年前
Andrew Cohen f74ac6ae remove rotation hindrance 4 年前
Andrew Cohen d7c2c163 please no more 4 年前
Andrew Cohen a926aa7a remove threaded 4 年前
Hunter-Unity 3edca8d0 reduced maxAngVel, enabled enhanced determinism, cont spec 4 年前
Andrew Cohen d52168e9 threaded false 4 年前
Andrew Cohen 3f7f9883 remove thread 4 年前
Andrew Cohen de2ca11b no thread config 4 年前
Andrew Cohen c5ce18c7 remove x/y vel, smaller network 4 年前
Hunter-Unity 0b02b434 added new dynamic nn file 4 年前
Andrew Cohen fd7ee405 normalize by hand 4 年前
Hunter-Unity 9e20feef hip facing reward 4 年前
Hunter-Unity cb8eec30 Create WalkerDynamic.yaml 4 年前
Andrew Cohen e58a3f5e small swap 4 年前
Andrew Cohen f21304a9 ball 4 年前
GitHub c0d96ecd Increase 3DBall generalization sampling interval (#3995) 4 年前
Andrew Cohen fa66e9e9 beta.005 4 年前
Ervin Teng f214836a Changes for speed test 4 年前
Hunter-Unity da6d25c9 updated walker dynamic demo file. cleanup 4 年前
Andrew Cohen c3fd56b5 testing beta 4 年前
Andrew Cohen 13c2a209 added opp, decay eps removed 4 年前
Andrew Cohen 53f2f360 long tennis/soccer runs 4 年前
Hunter-Unity 99eadde6 try 100M steps on walkerdynamic 4 年前
Andrew Cohen a89d9791 changed striker vs goalie config 4 年前
Hunter-Unity c9821f85 100M steps 4 年前
Andrew Cohen 5dfa0014 increased beta for all self-play 4 年前
Hunter-Unity 07266f46 add dir vector obsv 4 年前
Hunter-Unity f4c8f344 2e7 steps 4 年前
Andrew Cohen 59a60c1e Merge branch 'master' into asymm-envs 4 年前
Hunter-Unity ffa4ce52 testing bigger batch size 4 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 年前
Hunter-Unity 85958dad try 8x mem for cloud 4 年前
Andrew Cohen 11815554 revert soccer hyper params 4 年前
Hunter-Unity b06dd988 8x batch size for cloud test 4 年前
Andrew Cohen 3c2ce7be beta... 4 年前
Hunter-Unity 6b92b01a epoch 10 4 年前
Andrew Cohen a0dc8789 test new sampling method 4 年前
Hunter-Unity e032db74 hyptest 4 年前
Andrew Cohen 4083e344 tennis window 10 4 年前
Andrew Cohen 46654d49 soccer 100 4 年前
Hunter-Unity f17b1075 increase timescale for cloudtraining 4 年前
Andrew Cohen bc249921 riker goalie 100 4 年前
HH ad90e6b9 about to implement orientation cube 4 年前
Hunter-Unity 769dbec5 cp 4 年前
Andrew Cohen 4671cf17 tnenis congif 4 年前
Hunter-Unity b3bf1418 try new cluster 4 年前
Andrew Cohen 78744111 test ghost 4 年前
Hunter-Unity a3f7b980 cp 4 年前
Andrew Cohen 5f8ef3ca .5 opponent tennis 4 年前
Andrew Cohen 4464ca46 ignoring commit checks 4 年前
Hunter-Unity aca47e1f 200k buff cloud 4 年前
Andrew Cohen 20d973c8 bug 4 年前
Andrew Cohen 91217b0d use settings.py to check PR config 4 年前
Andrew Cohen 4e4cf9e2 .5 4 年前
Chris Elion 20b5a157 update scenes and get them training 4 年前
Hunter-Unity 32feefee update configs 4 年前
Andrew Cohen 53d1a98d more entro 4 年前
Andrew Cohen b6b2c58e smaller window 4 年前
GitHub 8566ed4f [bug-fix] Fix hyperparameters for Walker-SAC and WallJump-SAC (#4049) 4 年前
Andrew Cohen 6568158f 3.o beta 4 年前
Andrew Cohen b6d9c58b beta 2 4 年前
HH f7e650a6 reduced maxAngVel, enabled enhanced determinism, cont spec 4 年前
Andrew Cohen bca3bd73 return to team change 4 年前
Andrew Cohen 4b8db5c3 test failure 4 年前
Andrew Cohen 55bafe1b control 4 年前
Andrew Cohen e7750fc9 Merge branch 'master' into develop-sampler-refactor 4 年前
GitHub 91f199cd Self play hyperparameter improvements (#4063) 4 年前
Andrew Cohen 6071c74f hard reset on team change 4 年前
Andrew Cohen 922136f3 usual tennis 4 年前
GitHub ee1098d1 [refactor] Improve config upgrade script and add test (#4056) 4 年前
Andrew Cohen 55e3e7f6 sanity check 4 年前
Andrew Cohen af364ac9 more exsp 4 年前
GitHub 101a8e00 Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037) 4 年前
Andrew Cohen d91a7cbd reduce time horizon tennis 4 年前
HH 8bee075b added new dynamic nn file 4 年前
HH 5bf43487 hip facing reward 4 年前
HH de87c750 Create WalkerDynamic.yaml 4 年前
Andrew Cohen 446bdeee hund 4 年前
Andrew Cohen 4ba0d98c cubewar and tennis stability test 4 年前
Andrew Cohen bd1d6c08 all self-play 4 年前
Andrew Cohen c0f7052b Merge branch 'master' into develop-sampler-refactor 4 年前
Andrew Cohen 150e7d73 cubewar threaded false 4 年前
Andrew Cohen 34ecc7e6 Merge branch 'master' into asymm-envs 4 年前
Andrew Cohen 33458d24 running cubewar 4 年前
HH 2d2844bd updated walker dynamic demo file. cleanup 4 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 4 年前
GitHub 8a49e8e0 [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) 4 年前
Andrew Cohen 34f3ac64 updated cube war 4 年前
HH d7e8c5e4 try 100M steps on walkerdynamic 4 年前
HH 8f463c55 100M steps 4 年前
Andrew Cohen 81cc5f69 reduce epsilon tennis ppo 4 年前
HH 005f377a add dir vector obsv 4 年前
Anupam Bhatnagar 4afd8f92 first commit 4 年前
Andrew Cohen 45293f01 larger batch size 4 年前
HH 999fc7ab 2e7 steps 4 年前
Andrew Cohen c68e865b opp 4 年前
Andrew Cohen 03eef40b constrain x tennis 4 年前
HH f8a22591 testing bigger batch size 4 年前
HH ef3be52c try 8x mem for cloud 4 年前
HH 25d7ba5e 8x batch size for cloud test 4 年前
Andrew Cohen 0c17dc1b cannot hit scenery tennis 4 年前
HH 2cce3bbe epoch 10 4 年前
HH 90c7d05f hyptest 4 年前
Andrew Cohen 31a5b2ee 4096 batch 4 年前
GitHub 5b0a5b9b Moving domain randomization to C# (#4065) 4 年前
Andrew Cohen 71d7c24b 0.0 latest model 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
HH f7dd600f increase timescale for cloudtraining 4 年前
Andrew Cohen 346a90ba move agent back 4 年前
HH 65b80abb cp 4 年前
yanchaosun 3ef4196e Added the algorithm named ppo_transfer 4 年前
HH fa937cb9 try new cluster 4 年前
yanchaosun c2d6f5c0 basic implementation 4 年前
HH ba835a22 cp 4 年前
HH ad2e63d6 200k buff cloud 4 年前
GitHub ca3bdbc0 Fix 3DBall and 3DBallHard SAC regressions (#4132) 4 年前
Andrew Cohen 8c0b3548 reduce batch size Tennis 4 年前
HH 48d78ac7 update configs 4 年前
Anupam Bhatnagar 24d5f881 first commit 4 年前
yanchaosun ac4c80c2 integrate the implementation and hyperparameters 4 年前
HH a121795d Merge branch 'hh/develop/dynamic-walker' of https://github.com/Unity-Technologies/ml-agents into hh/develop/dynamic-walker 4 年前
yanchaosun 1e52ad3d ready for cloud training 4 年前
HH ced14d9d update configs to new class format 4 年前
yanchaosun e338ab91 test cloud training 4 年前
yanchaosun f0881a94 fix commands for cloud training 4 年前
yanchaosun 05a96355 remove slim package 4 年前
Andrew Cohen 1f305f23 no latest model 4 年前
Jonathan Harper 4e7a1170 Adding training configs 4 年前
yanchaosun 44fa16fa fix issues with cloud training 4 年前
Jonathan Harper 7656f419 More experimentation 4 年前
yanchaosun ad95032b transfer path 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
yanchaosun b10b0895 test crawler 4 年前
yanchaosun 428f013e add old crawler 4 年前
HH 9570a5fe Delete trainer_config.yaml 4 年前
yanchaosun 59251abe change yamls 4 年前
Andrew Cohen 4839e040 try team change zero 4 年前
Andrew Cohen 48f02b61 int as team change 4 年前
yanchaosun cd1778ff added one yaml 4 年前
Andrew Cohen 12bc2143 large window 4 年前
yanchaosun a80915a8 yaml update 4 年前
Andrew Cohen 4f03be74 window 30 4 年前
yanchaosun 666c8ba9 new cloud training change 4 年前
Andrew Cohen 5efa1e92 time hor 4 年前
yanchaosun 59e93b0b transfer config 4 年前
yanchaosun d0714701 new setting for cloud 4 年前
Andrew Cohen 68c6d513 reduce time hor 4 年前
vincentpierre 599d7e9f Merging master 4 年前
yanchaosun d7402406 multiple sizes configs 4 年前
yanchaosun 5eccb4c9 new transfer test for cloud 4 年前
HH 5147f2c6 temp add robot arm training config 4 年前
yanchaosun fe4e057f test more configs 4 年前
HH 7afa1761 Merge branch 'master' into hh/develop/ragdoll-updates 4 年前
GitHub 1308b344 [CI] Better hyperparameters for Pyramids-SAC, WalkerStatic-SAC, and Reacher-PPO (#4154) 4 年前
GitHub 8b913a96 Add TargetController/OrientationCubeController Components & Bugfix (#4157) 4 年前
yanchaosun d8d418c4 walker configs 4 年前
GitHub 559549e4 Add dynamics change to crawler (#4218) 4 年前
yanchaosun 7e3216ae simple env test 4 年前
HH 84430eec update config to match master 4 年前
GitHub d42e82a8 Fix 3DBall PPO hard regression (#4133) 4 年前
yanchaosun cdaaa318 bisim 4 年前
yanchaosun bc4b7f98 walker config 4 年前
yanchaosun 3d0d359c bisimulation draft 4 年前
yanchaosun 1fdbfe65 no normalization 4 年前
yanchaosun 5a778ca3 fix normalization 4 年前
GitHub 8eefdcd3 Refactor of Curriculum and parameter sampling (#4160) 4 年前
yanchaosun 66c4e6ff new config 4 年前
yanchaosun a212fef9 new bisim implementation 4 年前
yanchaosun 5471699d crawler config 4 年前
HH b877d953 remove unneeded config 4 年前
HH 0fdac847 Merge branch 'master' into hh/develop/crawler-ragdoll-updates 4 年前
yanchaosun 6daa2ed7 cloud config 4 年前
yanchaosun 9599a8ec new config 4 年前
Andrew Cohen 5fa28f5f merge YC changes 4 年前
Andrew Cohen dad084ee old crawler config 4 年前
Andrew Cohen b46d3214 crawler configs 4 年前
Andrew Cohen 29af84da action encoder configs 4 年前
yanchaosun 80bad241 init sac transfer, and added action encoder to bisim; configs for crawler 4 年前
Andrew Cohen 1e05e727 fix crawler yaml 4 年前
yanchaosun f81feec4 config fix; basic sac 4 年前
HH 9e6edb6c try new reward falloff 4 年前
Andrew Cohen e6066ffd separate value train and model schedule to const 4 年前
yanchaosun a505cb16 new config 4 年前
HH c3c83920 cleanup 4 年前
Andrew Cohen 240919b1 2 layer policy 4 年前
yanchaosun 9a19f6e5 disable bisim 4 年前
Andrew Cohen 35e9df24 value layers 3 4 年前
yanchaosun c1bccaf5 diable bisim 4 年前
Andrew Cohen 36fa1614 model linear lr 4 年前
yanchaosun 62284176 change id 4 年前
Andrew Cohen 2213a071 policy linear lr 4 年前
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 年前
Andrew Cohen 33a906ad add forward layer 4 年前
yanchaosun 6657129c config: not reuse encoder 4 年前
HH e2217a9a new curve 4 年前
Andrew Cohen 0c7db26a target encoder 4 年前
Andrew Cohen 57f247d4 targ for both 4 年前
yanchaosun 0c468084 sac transfer implementation; disable action encoder 4 年前
Ruo-Ping Dong 262f38ea add basketball example 4 年前
Andrew Cohen 5d8b5274 add load model false to config 4 年前
yanchaosun 0a1a30d3 sac update 4 年前
Andrew Cohen 5524d6f3 test reuse 4 年前
yanchaosun 7226256d config: no alter 4 年前
Andrew Cohen cb60aa53 no separate vf 4 年前
yanchaosun a9c6105d configs 4 年前
Andrew Cohen 288eb0ed reuse encoder false 4 年前
yanchaosun 00bb821c fix sac transfer problems 4 年前
Andrew Cohen 6979a952 3dball transfers 4 年前
yanchaosun e2f0b3ca fix transfer 4 年前
Andrew Cohen 83bc38fd try reuse encoder 4 年前
HH 00cb4c89 add WalkerStaticVariableSpeedScene and PPO config 4 年前
yanchaosun cc9a38ae cloud config with shared encoder 4 年前
Andrew Cohen 89abe29d op buffer 4 年前
yanchaosun 2b67d1a6 fix crawler config 4 年前
HH 7c63197e start dynamic cleanup and more debug for NaNs 4 年前
yanchaosun 42c0c333 fig bug 4 年前
Andrew Cohen 9c012d6a no op buffer no acen 4 年前
yanchaosun d1f57dec separate value net config 4 年前
Andrew Cohen d94b81c0 sep value false 4 年前
yanchaosun 910707dd PPO 3dball config 4 年前
Andrew Cohen 2dc3c84c add forward layer 4 年前
yanchaosun f55fd920 remove transfer from yaml 4 年前
Andrew Cohen 2dec257c no encoder for single task 4 年前
yanchaosun d706f28c use off policy buffer to transfer 4 年前
HH 977287dd add all scenes 4 年前
Andrew Cohen 0198e41a 0 fwl 4 年前
yanchaosun f937aa96 3dball ppo: without var predict 4 年前
Andrew Cohen 3513d5a6 load policy/vf 4 年前
yanchaosun 36f36750 target critic for ppo 4 年前
Andrew Cohen bfd6a029 load value 4 年前
yanchaosun 6df774ed update: separate model train as an option 4 年前
Andrew Cohen e1ea3dca load pol 4 年前
yanchaosun aa0e896f linear value, no target 4 年前
Andrew Cohen 78943972 add l2 penalty 3dball 4 年前
yanchaosun c48b6429 numpy fix, config 3dball 4 年前
yanchaosun 8c03c82a use target 4 年前
HH b88434f8 increase to 30M 4 年前
Andrew Cohen efa9e471 inc 3dball steps 4 年前
yanchaosun 44312bdb linear policy and linear forward 4 年前
yanchaosun 57d3ba64 change path 4 年前
yanchaosun 42c9ba43 reuse encoder and linear 4 年前
Andrew Cohen a65bd13e no fw lay 4 年前
yanchaosun e8fcc4bb ppo new implementation 4 年前
Andrew Cohen bec3f28c no load policy 4 年前
Andrew Cohen 462b34fc fw lay 4 年前
yanchaosun 66bbdae9 sac crawler configs 4 年前
Andrew Cohen ad9e2eea fewer features 4 年前
yanchaosun 120d1c3a cloud config: non-linear policy 4 年前
yanchaosun f78940c1 less features 4 年前
Andrew Cohen 2cd0de04 action enc 4 年前
yanchaosun 2d1ffac5 ppo ball 4 年前
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Andrew Cohen 12f3786c Revert "action enc" 4 年前
yanchaosun 3ce88589 1 layer everything 4 年前
Andrew Cohen 014fc5fc new crawler 4 年前
yanchaosun 86da272d load pv 4 年前
yanchaosun 6220f7c7 linear model 4 年前
yanchaosun f1346bdf multiple seeds 4 年前
HH c038362c use all bp for avg vel 4 年前
yanchaosun de4870be new configs 4 年前
GitHub b51347ac New Variable Speed Walker Environments (#4301) 4 年前
Andrew Cohen 69bf67f3 fix config 4 年前
HH 1bbd76fe update prefabs 4 年前
Andrew Cohen 40f7b9e6 no val sep 4 年前
yanchaosun 4f64d0f5 new config 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 年前
Ervin Teng 7032fe82 Reduce max steps for striker vs. goalie 4 年前
HH ef62939e updating prefabs 4 年前
Andrew Cohen eace3365 linear 3dball 4 年前
yanchaosun 0646e095 crawler configs 4 年前
yanchaosun 6b8a6e45 fix path 4 年前
GitHub bd6bcd2f Merge master and add Saver class for save/load checkpoints 4 年前
yanchaosun 990d25e3 fix path again 4 年前
Andrew Cohen 12eda929 try reload all 4 年前
yanchaosun 09e1f0c4 another fix 4 年前
Ervin Teng 42e25b25 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
Andrew Cohen 70f05c39 reduce max step 4 年前
yanchaosun fec40537 ppo crawler 4 年前
Andrew Cohen b822283f merge add fire 4 年前
Christopher Goy 5a233353 Merge remote-tracking branch 'origin/master' into release_6-to-master 4 年前
Andrew Cohen 764122ac crawler update 4 年前
yanchaosun 15b2e80e action encoder 4 年前
yanchaosun b5e02978 sac crawler config 4 年前
yanchaosun 685c4d67 ppo crawler transfer 4 年前
yanchaosun 5ed6bd3e sac crawler 4 年前
Andrew Cohen 5f7a7e44 revert tennis config 4 年前
yanchaosun d6f8995a larger feature size 4 年前
yanchaosun ee48cca4 linear v 4 年前
GitHub abfadb3d Reduce max steps for striker vs. goalie (#4377) 4 年前
HH 7e7743d1 update static prefabs 4 年前
yanchaosun 49d6b70c crawler: max episode length=1000; new config: 1 forward layer 4 年前
Ervin Teng 6455654b Shorten max steps for strikergoalie 4 年前
yanchaosun 4b081de4 smaller feature size 4 年前
HH e3b1c5cf add nn files. update to 15M steps 4 年前
yanchaosun 96b5478f smaller 4 年前
GitHub a79aa854 [ci] Shorten max steps for strikergoalie (#4394) 4 年前
yanchaosun 0463bfe9 smaller state feature, large action feature 4 年前
yanchaosun 2e927257 separate policy net 4 年前
vincentpierre ba7eb360 Merge branch 'master' into develop-torch-save-rp 4 年前
yanchaosun 86830ac9 3dball mass=5 transfer test 4 年前
yanchaosun dd0ac8a3 mass=2 4 年前
HH 5bedaef6 add configs 4 年前
HH f0a12c70 update configs/prefabs 4 年前
yanchaosun 46817bed fix bug 4 年前
HH a9d9ea4c Merge branch 'master' into hh/develop/loco-crawler-variable-speed 4 年前
Scott Jordan 3d98516d incorporated task parameter channel branch 4 年前
yanchaosun b0f6f307 transfer from mass 2 to mass 1 4 年前
yanchaosun bcdc0a11 f512 4 年前
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 年前
Scott Jordan 56745026 Initial commit of running active learning code 4 年前
yanchaosun 4a23dbb3 fix mass 3dball 4 年前
Scott Jordan 78f8a9a2 Updated task manager 4 年前
yanchaosun db30f918 push block 4 年前
yanchaosun 4be4f1d1 new reacher env 4 年前
yanchaosun e9a3ea57 reacher self-transfer 4 年前
yanchaosun f1802c3a push block transfer setting 4 年前
vincentpierre 0dd5effa DO NOT MERGE 4 年前
vincentpierre 7cfb763d [DO NOT MERGE] 4 年前
yanchaosun 5cab2114 push block without action encoder 4 年前
vincentpierre 9b8924a6 - 4 年前
Scott Jordan e33168d6 Added comments and new yaml files for variable speed walker 4 年前
yanchaosun 4133fb35 no action 4 年前
vincentpierre e2e62cb9 - 4 年前
yanchaosun 191a1133 block forward 2 layers 4 年前
yanchaosun 1ee62100 reacher 4 年前
yanchaosun 5c3306ef large buffer size 4 年前
yanchaosun 4d5f5888 encoder layer 1 4 年前
GitHub a117c932 Grid Sensor (#4399) 4 年前
vincentpierre 3b8a8971 no threading 4 年前
yanchaosun e39986ed block larger feature size; reacher fix and new reward 4 年前
yanchaosun 7dac3284 push block more steps 4 年前
yanchaosun 51491a3e new dynamics change: scale 1 to 2 4 年前
GitHub 582859b6 New Crawler Variable Speed Scenes (#4382) 4 年前
yanchaosun a1859fb8 reacher multi seeds 4 年前
yanchaosun 854e10e1 3dball hard scale 4 年前
GitHub cc10cd82 Worm Ragdoll & Env Updates (#4413) 4 年前
yanchaosun b5a1b9b4 hard task name change 4 年前
yanchaosun 27dffa4d new reacher reward 4 年前
yanchaosun 16e63cb8 config fix 4 年前
yanchaosun 883361ee reacher new reward: action penalty and constant not-reaching-goal penalty 4 年前
yanchaosun 85549b2b reacher: stack observation. with the original reward function 4 年前
Ervin Teng 333af451 Turn off threading 4 年前
yanchaosun 92c3facf distance based penalty 4 年前
yanchaosun f15a4f2d 2 layers 4 年前
yanchaosun 716336bf larger feature size 4 年前
yanchaosun 63cec035 fix config 4 年前
Ervin Teng 3a7cd3ad Merge experiments 4 年前
yanchaosun 693c0ca4 feature size 32 4 年前
yanchaosun 1a9aaaf6 model weights and large transfer learning weight 4 年前
yanchaosun 1ebe7054 new config 4 年前
yanchaosun 8f67cd40 smaller learning rate 4 年前
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 年前
vincentpierre 49e08218 - 4 年前
Ervin Teng d4beb937 Make 3dball longer 4 年前
vincentpierre c10da7ef - 4 年前
GitHub 60b76790 Random Network Distillation for Torch (#4473) 4 年前
Ervin Teng b98e7c28 Use constant LR 4 年前
HH 0d42b277 train combo. added nn files. 4 年前
HH d02c90f6 added more variants 4 年前
HH 1912e47a Dynamic Sensor Benchmarks In 4 年前
GitHub 9e1a28c2 Add vector flag of agent's frozen state to VisualFoodCollector (#4511) 4 年前
GitHub b33e310f Add Visual3DBall scene (#4513) 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 2f870407 bullet hell game 4 年前
Ervin Teng 56196761 hyperparameteers and tweaks 4 年前
GitHub 90a9d214 Match3 example (#4515) 4 年前
Ervin Teng 89489ae0 Invert divide by 3 in log prob 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
Ervin Teng 7bec1df2 Better hyperparams 4 年前
HH 281e0be1 added sensors & controls UI 4 年前
Chris Elion 8cf87ed6 match3 settings 4 年前
Ervin Teng e1378efc Merge commit '6d729a0a2b2ba1fc946720cdb7871c9be3e38d45' into develop-fix-nan 4 年前
Ervin Teng 4c49f181 Change num envs 4 年前
vincentpierre e14e1c4d Improvements and new tests 4 年前
Andrew Cohen d62f6b0a modify bullet/attn 4 年前
GitHub edc2ae2f [bug-fix] Disable threading for self-play envs (#4679) 4 年前
Ervin Teng ce7d34a3 Revert "Invert divide by 3 in log prob" 4 年前
GitHub 63704803 [bug-fix] Disable threading for self-play envs (#4679) (#4681) 4 年前
Andrew Cohen ef8f70e8 Add WalljumpPushblock env 4 年前
Ervin Teng 5130c9b3 Add walljump collab YAML 4 年前
GitHub cc6b4564 Multi Directional Walker and Initial Hypernetwork (#4740) 4 年前
Ervin Teng d816513e Add config and group ids to HallwayCollab 4 年前
Andrew Cohen 8a95b0bb rays and disc 4 年前
Andrew Cohen 5b2e704f updated heuristic 4 年前
Andrew Cohen 5bbe796b update soccer raycasts 4 年前
Andrew Cohen 34420044 fix trainer c and soccer config 4 年前
Andrew Cohen ca5a5194 soccer comms on the cloud 4 年前
Andrew Cohen 12828bdc remove tau from diff for 4 年前
HH 16acb693 update max steps and add config 3 年前
HH fce83c8a try curiosity 3 年前
HH 9d17392a about to merge in master 3 年前
HH dd1fbd8a update config to train 5M steps 4 年前
Andrew Cohen c183040a update soccer scene 4 年前
vincentpierre f7a4a31f [Experiment] Bullet hell 4 年前
Andrew Cohen f57875e0 layer norm 4 年前
Andrew Cohen 6fae089e bullet config 4 年前
Andrew Cohen a6294e38 run bullet on cloud 4 年前
HH 5c5539af add zomb scene 4 年前
HH fd7d9c4a add trained models 4 年前
HH a738d235 add new env scene 4 年前
Andrew Cohen 32d77b5e Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen e2506856 sequence env 3 年前
Andrew Cohen bedf9886 update sequencer env 3 年前
Andrew Cohen 9effa1b5 update sorter yaml 3 年前
Ruo-Ping Dong a7d04be6 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
HH a29ce02c train 4 env 3 年前
Ruo-Ping Dong 224d2087 add team reward 3 年前
Ervin Teng 384bfaac Add configuration yaml for pushblockcollab 3 年前
Andrew Cohen fecddfed refactored sequence env 3 年前
Andrew Cohen 3a4aa513 COMAA runs 3 年前
Andrew Cohen 5741f8f6 no target net 3 年前
Arthur Juliani 1cf97635 Additional conditional experiments 3 年前
Andrew Cohen a4c336c2 value estimator 3 年前
Arthur Juliani d2526ce2 Modify CrawlerDynamic 3 年前
Andrew Cohen 2792cc87 update coma config 3 年前
Andrew Cohen 6c6d54b0 cubewars config 3 年前
Andrew Cohen bd341f7f no target, increase lambda 3 年前
Andrew Cohen 00e3c5c5 fix config 3 年前
GitHub 8cf3b93b Merge pull request #4741 from Unity-Technologies/walljump-pushblock 3 年前
Arthur Juliani 759fd2b5 PushJump modifications 3 年前
Andrew Cohen e997a5fc cloud config 3 年前
Arthur Juliani b84b4880 Add GoalNav environment 3 年前
Andrew Cohen fce842aa adding zombie to coma2 brnch 3 年前
Andrew Cohen b0bf7817 clipping values and updated zombie 3 年前
Andrew Cohen da4f4ae8 update configs 3 年前
vincentpierre 8dd003e6 - 3 年前
Andrew Cohen 869a2811 update zombie config 3 年前
Andrew Cohen 2047ab1f cubewars config 3 年前
vincentpierre 48bd37ee - 3 年前
Ervin Teng e9e80149 Change names of behaviors 3 年前
Andrew Cohen e1061302 config 3 年前
Ervin Teng f4f559da Remove a bunch of stuff from envs 3 年前
Ervin Teng 844b5955 Remove a bunch of extra files 3 年前
Ervin Teng 985c80d7 Remove remaining files 3 年前
GitHub ed28d1ba [MLA-1768] retrain Match3 scene (#4943) 3 年前
vincentpierre fdf21dbd addressing some of the comments 3 年前
GitHub 307d7cd2 Merge pull request #4912 from Unity-Technologies/develop-var-len-obs-feature-refactor-model-loader-checks 3 年前
vincentpierre 695c02fd [skip ci] Attempting new config 3 年前
vincentpierre 272097ed new curriculum 3 年前
vincentpierre 9f51d91a New curriculum, new model 3 年前
Christopher Goy 9cadfa7a Merge master -> release_13_branch-to-master 3 年前
GitHub 332e9b8b Merge pull request #4909 from Unity-Technologies/develop-var-len-obs-feature 3 年前
Ruo-Ping Dong b5da488d Merge branch 'master' into develop-base-teammanager 3 年前
Andrew Cohen dc8e8494 Merge branch 'master' into develop-critic-optimizer 3 年前
Chris Elion e4f51ca7 Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider 3 年前
Ervin Teng 93a59971 Merge branch 'develop-critic-optimizer' into develop-critic-op-lstm 3 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 3 年前
vincentpierre 3499a645 - 3 年前
GitHub 4d5545c8 Set ignore done=False in GAIL (#4971) 3 年前
Ervin Teng f409c40c Merge branch 'master' into develop-agentprocessor-teammanager 3 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 3 年前
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 3 年前
Ervin Teng 08db7c2f Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm 3 年前
vincentpierre 8f729b75 Fixing the number of layers in the config of PyramidsRND 3 年前
GitHub 5ce1083b Merge pull request #5006 from Unity-Technologies/fix-num-layers-rnd-pyramids 3 年前
Christopher Goy 747e2228 Merge branch 'master' into release_13_branch-to-master 3 年前
GitHub ccca1309 Merge pull request #5007 from Unity-Technologies/release_13_branch-to-master 3 年前
Ervin Teng 4b159789 Add PushBlockCollab config and fix some stuff 3 年前
Chris Elion f5bf6e08 simple TicTacToe example 3 年前
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 3 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 3 年前
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 3 年前
HH 1f8aa5c3 add simple training scene 3 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 3 年前
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 3 年前
GitHub 85f8b40b Removing some scenes (#4997) 3 年前
GitHub 21623b50 renaming of behavior name for imitation crawler (#5039) 3 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 3 年前
Ervin Teng d9cbae07 Dodgeball config update 3 年前
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 3 年前
GitHub ba2af269 [coma2] Make group extrinsic reward part of extrinsic (#5033) 3 年前
Ervin Teng f45afff3 Different YAML settings 3 年前
Ervin Teng d5aee550 Add num_envs for cloud run 3 年前
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 3 年前
Ervin Teng 8902c058 Merge branch 'main' into develop-coma2-trainer 3 年前
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 3 年前
Ervin Teng 1f026c70 Merge branch 'main' into develop-superpush-branch-cleanup 3 年前
Ervin Teng 8263eb52 Backup more changes 3 年前
Ervin Teng ce872033 Revert "Merge branch 'main' into develop-superpush-branch-cleanup" 3 年前
Ervin Teng 8ef2c390 Merge branch 'develop-superpush-branch-cleanup' into develop-pushcollabonly 3 年前
GitHub d015ef17 [environment] Push Block Collaborative (#5090) 3 年前
Andrew Cohen 9e77d7e1 Merge branch 'main' into develop-soccer-groupman 3 年前
GitHub 62aa3d47 Move PushBlockCollab config to poca directory (#5097) 3 年前
Ervin Teng 09e7e805 [cherry-pick] Move PushBlockCollab config to poca directory (#5097) 3 年前
Andrew Cohen d95d8d92 soccer fours, agent prefabs 3 年前
Andrew Cohen 9176247c Merge branch 'main' into develop-soccer-groupman-mod 3 年前
GitHub 6895ba50 Integrate Group Manager to soccer/retrain with POCA (#5115) 3 年前
Andrew Cohen 25be5ff7 increase beta 3 年前
HH 02ac5091 add actuated sensors & rbsensor 3 年前
GitHub d2ee2e6f [cherry-pick] Integrate Group Manager to soccer/retrain with POCA (#5115) (#5121) 3 年前
GitHub 31e72e67 Add DungeonEscape POCA Environment (#5128) 3 年前
GitHub fe1d3e26 Fix GridFoodCollector yaml (#5134) 3 年前
GitHub f7ab0cb0 [cherry-pick][docs] Add Dungeon Escape Environment (#5133) 3 年前
GitHub 6eef8929 Fix GridFoodCollector yaml (#5134) (#5136) 3 年前
GitHub 43147c1a Remove env settings from Sorter (#5146) 3 年前
GitHub 65cd8dab Remove env settings from Sorter (#5145) 3 年前
Christopher Goy eeeb7ba3 upate scene layout. 3 年前
Ervin Teng 75d9cf59 Fix path to PushBlock demo (#5198) 3 年前
Ervin Teng c108da4a [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 3 年前
vincentpierre 42a3732c Code improvements 3 年前
Andrew Cohen 18be47e8 Merge branch 'main' into develop-soccer-groupman-mod 3 年前
GitHub dc807346 Reduce pb collab steps to 15M (#5196) 3 年前
GitHub 119503db Fix path to PushBlock demo (#5198) 3 年前
vincentpierre 7fa8b242 Code improvements 3 年前
GitHub 2980ade0 Goal conditioning grid world : Example of goal conditioning (#5193) 3 年前
GitHub c5589b59 [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 3 年前
GitHub 45e75e01 [config] Disable `threading` by default (#5221) 3 年前
vincentpierre 4e14879d Updating the barracuda 1.4.0 (#5291) 3 年前
vincentpierre bab3ecb7 First version of MEDE, crawler does not seem to work properly, I suspect the actions make it distinguishable to the discriminator but not to the human eye 3 年前
Andrew Cohen d813bfd5 continuous, crawler integrated, new cube 3 年前
vincentpierre 8da21669 Adding some changes 3 年前
vincentpierre 47fa1682 - 3 年前
vincentpierre 7c74c967 _ 3 年前
vincentpierre 8450b154 - 3 年前
vincentpierre 5985959d Got 2 modes on Wlker I think 3 年前
Scott 130512b4 fixed episode length modification issue. 3 年前
Scott 97990611 Added decision frequency and evaluation metric 3 年前
GitHub f0159e00 Better hyperparameters for Hallway-SAC (#5339) 3 年前
GitHub 5e1df27b [ci] Shorten SAC runs (#5354) 3 年前
Miguel Alonso Jr 97b7d5c6 Merge branch 'main' into develop-api-documentation-update 3 年前