Andrew Cohen
|
0d943676
|
inc energy pen
|
5 年前 |
Andrew Cohen
|
d77f2566
|
energy usage penalty to prevent superstition on serve
|
5 年前 |
Andrew Cohen
|
cde22a14
|
fix clipping
|
5 年前 |
Andrew Cohen
|
d5428487
|
addforce and static walls
|
5 年前 |
Andrew Cohen
|
bbc1014a
|
reduce learning rate
|
5 年前 |
Andrew Cohen
|
9d5d6fa7
|
Merge branch 'master' into asymm-envs
|
5 年前 |
Andrew Cohen
|
dab592d0
|
Merge branch 'master' into asymm-envs
|
5 年前 |
Andrew Cohen
|
8fba6faa
|
increase network capacity
|
5 年前 |
Andrew Cohen
|
843ddfc3
|
updated heuristics of cubewars
|
5 年前 |
Andrew Cohen
|
2c42f577
|
Merge branch 'master' into asymm-envs
|
5 年前 |
Andrew Cohen
|
052f1c87
|
reduce gamma
|
5 年前 |
Andrew Cohen
|
b80d6228
|
randomize ball/agent spawn
|
5 年前 |
Andrew Cohen
|
14df5d02
|
increase gamma
|
5 年前 |
Andrew Cohen
|
376af981
|
lower agent height
|
5 年前 |
Andrew Cohen
|
a56643bb
|
tennis brain to prefab
|
5 年前 |
Andrew Cohen
|
7da1869a
|
new tennis brain
|
5 年前 |
Andrew Cohen
|
e7922b68
|
trying larger beta
|
5 年前 |
Andrew Cohen
|
e0723381
|
new tennis brain
|
5 年前 |
Andrew Cohen
|
1c2e1d79
|
increase beta
|
5 年前 |
Andrew Cohen
|
b7bd4c2c
|
reduce winning reward
|
5 年前 |
Andrew Cohen
|
39e0bbe9
|
remove debug log
|
5 年前 |
Andrew Cohen
|
e3f6c716
|
higher granularity curr
|
5 年前 |
Andrew Cohen
|
4769cb1e
|
proximity bonus
|
5 年前 |
Andrew Cohen
|
1b10ef6d
|
clip speed
|
5 年前 |
Andrew Cohen
|
b0243014
|
broken prefab..
|
5 年前 |
Andrew Cohen
|
251dcc76
|
remove timepenalty from tennis
|
5 年前 |
Andrew Cohen
|
fb7aa862
|
remove timepenalty obs
|
5 年前 |
Andrew Cohen
|
b80c48bf
|
new cube wars brains
|
5 年前 |
Andrew Cohen
|
fda39c3d
|
more beta tuning...
|
5 年前 |
Andrew Cohen
|
a5ca5e0c
|
reduce beta for new reward func
|
5 年前 |
Andrew Cohen
|
a6e6e63e
|
timestep penalty on loss only
|
5 年前 |
Andrew Cohen
|
547f3192
|
beta .05
|
5 年前 |
Andrew Cohen
|
5d22b819
|
added timepenalty to obs
|
5 年前 |
Andrew Cohen
|
0871fc96
|
remove beta/no curr tennis
|
5 年前 |
Andrew Cohen
|
54972202
|
tuning beta tennis
|
5 年前 |
Andrew Cohen
|
3f806353
|
increased beta
|
5 年前 |
Andrew Cohen
|
e9f570aa
|
slightly larger beta tennis
|
5 年前 |
Andrew Cohen
|
443935f4
|
remove time bonus loss
|
5 年前 |
Andrew Cohen
|
a1143427
|
increased entro bonus tennis
|
5 年前 |
Andrew Cohen
|
428b1dfa
|
Hit bonus for whole exp
|
5 年前 |
Andrew Cohen
|
1c4ba1a5
|
add timestep bonus to loss
|
5 年前 |
Andrew Cohen
|
717fae65
|
reduce tennis latest_model_ratio
|
5 年前 |
Andrew Cohen
|
3df4f4a3
|
smaller window cubewar
|
5 年前 |
Andrew Cohen
|
32f562d9
|
striker goalie increase latest_mod ratio
|
5 年前 |
Andrew Cohen
|
ca6cdff3
|
fixed broken prefab...
|
5 年前 |
Andrew Cohen
|
028a8d59
|
larger network/6 stacked obs
|
5 年前 |
Andrew Cohen
|
d54fdfbf
|
increase batch/buff/erbeta
|
5 年前 |
Andrew Cohen
|
e5b883db
|
added bounce obs to agent/more downward force on ball
|
5 年前 |
Andrew Cohen
|
29627181
|
more downward force/constrain y
|
5 年前 |
Andrew Cohen
|
f60df1c9
|
lower agent start
|
5 年前 |