yanchaosun
|
0c468084
|
sac transfer implementation; disable action encoder
|
4 年前 |
yanchaosun
|
0a1a30d3
|
sac update
|
4 年前 |
yanchaosun
|
a9c6105d
|
configs
|
4 年前 |
yanchaosun
|
00bb821c
|
fix sac transfer problems
|
4 年前 |
yanchaosun
|
e2f0b3ca
|
fix transfer
|
4 年前 |
yanchaosun
|
cc9a38ae
|
cloud config with shared encoder
|
4 年前 |
yanchaosun
|
2b67d1a6
|
fix crawler config
|
4 年前 |
yanchaosun
|
42c0c333
|
fig bug
|
4 年前 |
yanchaosun
|
d1f57dec
|
separate value net config
|
4 年前 |
yanchaosun
|
6df774ed
|
update: separate model train as an option
|
4 年前 |
yanchaosun
|
aa0e896f
|
linear value, no target
|
4 年前 |
yanchaosun
|
c48b6429
|
numpy fix, config 3dball
|
4 年前 |
yanchaosun
|
8c03c82a
|
use target
|
4 年前 |
yanchaosun
|
44312bdb
|
linear policy and linear forward
|
4 年前 |
yanchaosun
|
57d3ba64
|
change path
|
4 年前 |
yanchaosun
|
42c9ba43
|
reuse encoder and linear
|
4 年前 |
yanchaosun
|
e8fcc4bb
|
ppo new implementation
|
4 年前 |
yanchaosun
|
66bbdae9
|
sac crawler configs
|
4 年前 |
yanchaosun
|
120d1c3a
|
cloud config: non-linear policy
|
4 年前 |
yanchaosun
|
f78940c1
|
less features
|
4 年前 |
yanchaosun
|
3ce88589
|
1 layer everything
|
4 年前 |
yanchaosun
|
86da272d
|
load pv
|
4 年前 |
yanchaosun
|
6220f7c7
|
linear model
|
4 年前 |
yanchaosun
|
f1346bdf
|
multiple seeds
|
4 年前 |
yanchaosun
|
de4870be
|
new configs
|
4 年前 |
yanchaosun
|
4f64d0f5
|
new config
|
4 年前 |
yanchaosun
|
0646e095
|
crawler configs
|
4 年前 |
yanchaosun
|
6b8a6e45
|
fix path
|
4 年前 |
yanchaosun
|
990d25e3
|
fix path again
|
4 年前 |
yanchaosun
|
09e1f0c4
|
another fix
|
4 年前 |
yanchaosun
|
15b2e80e
|
action encoder
|
4 年前 |
yanchaosun
|
b5e02978
|
sac crawler config
|
4 年前 |
yanchaosun
|
5ed6bd3e
|
sac crawler
|
4 年前 |
yanchaosun
|
d6f8995a
|
larger feature size
|
4 年前 |
yanchaosun
|
ee48cca4
|
linear v
|
4 年前 |
yanchaosun
|
49d6b70c
|
crawler: max episode length=1000; new config: 1 forward layer
|
4 年前 |
yanchaosun
|
4b081de4
|
smaller feature size
|
4 年前 |
yanchaosun
|
96b5478f
|
smaller
|
4 年前 |
yanchaosun
|
0463bfe9
|
smaller state feature, large action feature
|
4 年前 |
yanchaosun
|
2e927257
|
separate policy net
|
4 年前 |
yanchaosun
|
86830ac9
|
3dball mass=5 transfer test
|
4 年前 |
yanchaosun
|
dd0ac8a3
|
mass=2
|
4 年前 |
yanchaosun
|
46817bed
|
fix bug
|
4 年前 |
yanchaosun
|
b0f6f307
|
transfer from mass 2 to mass 1
|
4 年前 |
yanchaosun
|
bcdc0a11
|
f512
|
4 年前 |
yanchaosun
|
4a23dbb3
|
fix mass 3dball
|
4 年前 |
yanchaosun
|
db30f918
|
push block
|
4 年前 |
yanchaosun
|
4be4f1d1
|
new reacher env
|
4 年前 |
yanchaosun
|
e9a3ea57
|
reacher self-transfer
|
4 年前 |
yanchaosun
|
f1802c3a
|
push block transfer setting
|
4 年前 |
yanchaosun
|
5cab2114
|
push block without action encoder
|
4 年前 |
yanchaosun
|
4133fb35
|
no action
|
4 年前 |
yanchaosun
|
191a1133
|
block forward 2 layers
|
4 年前 |
yanchaosun
|
1ee62100
|
reacher
|
4 年前 |
yanchaosun
|
5c3306ef
|
large buffer size
|
4 年前 |
yanchaosun
|
4d5f5888
|
encoder layer 1
|
4 年前 |
yanchaosun
|
e39986ed
|
block larger feature size; reacher fix and new reward
|
4 年前 |
yanchaosun
|
7dac3284
|
push block more steps
|
4 年前 |
yanchaosun
|
51491a3e
|
new dynamics change: scale 1 to 2
|
4 年前 |
yanchaosun
|
a1859fb8
|
reacher multi seeds
|
4 年前 |
yanchaosun
|
854e10e1
|
3dball hard scale
|
4 年前 |
yanchaosun
|
b5a1b9b4
|
hard task name change
|
4 年前 |
yanchaosun
|
27dffa4d
|
new reacher reward
|
4 年前 |
yanchaosun
|
16e63cb8
|
config fix
|
4 年前 |
yanchaosun
|
883361ee
|
reacher new reward: action penalty and constant not-reaching-goal penalty
|
4 年前 |
yanchaosun
|
85549b2b
|
reacher: stack observation. with the original reward function
|
4 年前 |
yanchaosun
|
92c3facf
|
distance based penalty
|
4 年前 |
yanchaosun
|
f15a4f2d
|
2 layers
|
4 年前 |
yanchaosun
|
716336bf
|
larger feature size
|
4 年前 |
yanchaosun
|
63cec035
|
fix config
|
4 年前 |
yanchaosun
|
693c0ca4
|
feature size 32
|
4 年前 |
yanchaosun
|
1a9aaaf6
|
model weights and large transfer learning weight
|
4 年前 |
yanchaosun
|
1ebe7054
|
new config
|
4 年前 |
yanchaosun
|
8f67cd40
|
smaller learning rate
|
4 年前 |