516 次代码提交 (c68e865b-0c2a-4cbc-acfe-2c65f1c309b0)

作者 SHA1 备注 提交日期
Andrew Cohen 9d5d6fa7 Merge branch 'master' into asymm-envs 5 年前
Andrew Cohen dab592d0 Merge branch 'master' into asymm-envs 5 年前
Andrew Cohen 8fba6faa increase network capacity 5 年前
Andrew Cohen 843ddfc3 updated heuristics of cubewars 5 年前
Andrew Cohen 2c42f577 Merge branch 'master' into asymm-envs 5 年前
Andrew Cohen 052f1c87 reduce gamma 5 年前
Andrew Cohen b80d6228 randomize ball/agent spawn 5 年前
Andrew Cohen 14df5d02 increase gamma 5 年前
Andrew Cohen 376af981 lower agent height 5 年前
Andrew Cohen a56643bb tennis brain to prefab 5 年前
Andrew Cohen 7da1869a new tennis brain 5 年前
Andrew Cohen e7922b68 trying larger beta 5 年前
Andrew Cohen e0723381 new tennis brain 5 年前
Andrew Cohen 1c2e1d79 increase beta 5 年前
Andrew Cohen b7bd4c2c reduce winning reward 5 年前
Andrew Cohen 39e0bbe9 remove debug log 5 年前
Andrew Cohen e3f6c716 higher granularity curr 5 年前
Andrew Cohen 4769cb1e proximity bonus 5 年前
Andrew Cohen 1b10ef6d clip speed 5 年前
Andrew Cohen b0243014 broken prefab.. 5 年前
Andrew Cohen 251dcc76 remove timepenalty from tennis 5 年前
Andrew Cohen fb7aa862 remove timepenalty obs 5 年前
Andrew Cohen b80c48bf new cube wars brains 5 年前
Andrew Cohen fda39c3d more beta tuning... 5 年前
Andrew Cohen a5ca5e0c reduce beta for new reward func 5 年前
Andrew Cohen a6e6e63e timestep penalty on loss only 5 年前
Andrew Cohen 547f3192 beta .05 5 年前
Andrew Cohen 5d22b819 added timepenalty to obs 5 年前
Andrew Cohen 0871fc96 remove beta/no curr tennis 5 年前
Andrew Cohen 54972202 tuning beta tennis 5 年前
Andrew Cohen 3f806353 increased beta 5 年前
Andrew Cohen e9f570aa slightly larger beta tennis 5 年前
Andrew Cohen 443935f4 remove time bonus loss 5 年前
Andrew Cohen a1143427 increased entro bonus tennis 5 年前
Andrew Cohen 428b1dfa Hit bonus for whole exp 5 年前
Andrew Cohen 1c4ba1a5 add timestep bonus to loss 5 年前
Andrew Cohen 717fae65 reduce tennis latest_model_ratio 5 年前
Andrew Cohen 3df4f4a3 smaller window cubewar 5 年前
Andrew Cohen 32f562d9 striker goalie increase latest_mod ratio 5 年前
Andrew Cohen ca6cdff3 fixed broken prefab... 5 年前
Andrew Cohen 028a8d59 larger network/6 stacked obs 5 年前
Andrew Cohen d54fdfbf increase batch/buff/erbeta 5 年前
Andrew Cohen e5b883db added bounce obs to agent/more downward force on ball 5 年前
Andrew Cohen 29627181 more downward force/constrain y 5 年前
Andrew Cohen f60df1c9 lower agent start 5 年前
Andrew Cohen 45d35fa4 downward force tennis agent 5 年前
Andrew Cohen 0c7b4ac4 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 13fd97de small per timestep reward 5 年前
Andrew Cohen c4946d31 increase curr to .2 5 年前
Andrew Cohen b4f52c88 Merge branch 'soccer-2v1' into asymm-envs 5 年前