Ervin Teng
|
88998fc9
|
Add add_policy docstrings
|
5 years ago |
Ervin Teng
|
ff607162
|
Move learning rate reporting
|
5 years ago |
Ervin Teng
|
847725f1
|
extend meta curriculum test steps
|
5 years ago |
Ervin Teng
|
63463bd1
|
Make TF graph seed deterministic
|
5 years ago |
Ervin Teng
|
30e4424c
|
Fix PPO optimizer creation
|
5 years ago |
Ervin Teng
|
1cfc461a
|
Remove and rename tf_optimizer
|
5 years ago |
Ervin Teng
|
0ab7aa58
|
Fix tensor names
|
5 years ago |
Ervin Teng
|
1859f252
|
Merge commit 'fbcdd83c087135f870e785cc72e5ff9a7e898e3a' into develop-splitpolicyoptimizer
|
5 years ago |
Ervin Teng
|
be9d772e
|
Add option to not condition sigma on obs
|
5 years ago |
Ervin Teng
|
c9fbb111
|
Fix entropy calculation
|
5 years ago |
Ervin Teng
|
bcc25d59
|
Merge branch 'master' into develop-splitpolicyoptimizer
|
5 years ago |
Ervin Teng
|
5ef902bf
|
Merge branch 'master' into develop-splitpolicyoptimizer
|
5 years ago |
Ervin Teng
|
c60e16c9
|
Correct memory size docs
|
5 years ago |
Ervin Teng
|
8abd4129
|
Clean up nn_policy
|
5 years ago |
Ervin Teng
|
7004604d
|
Used NamedTuple for create normalization tensors
|
5 years ago |
Ervin Teng
|
ffdc41bb
|
Removed floating constants
|
5 years ago |
Ervin Teng
|
441e6a0c
|
Add typing to optimizer, rename self.tf_optimizer
|
5 years ago |
Ervin Teng
|
5bfc0b87
|
Update docstring
|
5 years ago |
Ervin Teng
|
c350c6d8
|
Added enforcement of m_size to be divisible by 2
|
5 years ago |
Ervin Teng
|
c68b5643
|
Remove multi_gpu from learn test
|
5 years ago |
Ervin Teng
|
faa9c702
|
Fix one more test for multi_gpu
|
5 years ago |
Ervin Teng
|
00017bab
|
Temporarily remove multi-GPU
|
5 years ago |
Ervin Teng
|
f9c0f169
|
Delete extra tf_optimizer
|
5 years ago |
Ervin Teng
|
7d5c1b0b
|
Add docstring and make some methods private
|
5 years ago |
Ervin Teng
|
d969e013
|
Remove extra tf_optimizer
|
5 years ago |
Ervin Teng
|
4d94e180
|
Move optimizer to common folder
|
5 years ago |
Ervin Teng
|
cbfbff2c
|
Split optimizer and TFOptimizer
|
5 years ago |
Ervin Teng
|
632ff859
|
add init
|
5 years ago |
Ervin Teng
|
31c844e2
|
Change memory size definition in docs
|
5 years ago |
Ervin Teng
|
ce110201
|
Add optional burn-in for SAC as well
|
5 years ago |
Ervin Teng
|
f64bdc4b
|
Fix SAC RNN test
|
5 years ago |
Ervin Teng
|
328476d8
|
Move check for creation into nn_policy
|
5 years ago |
Ervin Teng
|
14720e2d
|
Remove burn-in
|
5 years ago |
Ervin Teng
|
1407db53
|
Fix Barracuda export for LSTM
|
5 years ago |
Ervin Teng
|
d4ee7346
|
Merge commit 'f9c05a61d574305497789b5997f1ae3ea1b1ad3b' into develop-splitpolicyoptimizer
|
5 years ago |
Ervin Teng
|
5f00782b
|
Clean up some SAC LSTM
|
5 years ago |
Ervin Teng
|
7a401feb
|
Remove float64 numpy
|
5 years ago |
Ervin Teng
|
dcbb90e1
|
Fix graph init in ghost trainer
|
5 years ago |
Ervin Teng
|
cb2d2526
|
Reformat using black
|
5 years ago |
Ervin Teng
|
48b39b80
|
Fix ghost trainer and all tests
|
5 years ago |
Ervin Teng
|
1c4f60d4
|
remove more PPO tests
|
5 years ago |
Ervin Teng
|
d02bfbd4
|
Remove PPO policy tests
|
5 years ago |
Ervin Teng
|
dc43b0c6
|
Add test for NN policy
|
5 years ago |
Ervin Teng
|
2eda5575
|
Fix discrete scoping
|
5 years ago |
Ervin Teng
|
cdd57468
|
Re-fix scoping and add method to get all variables
|
5 years ago |
Ervin Teng
|
1f094da9
|
Fix policy's scoping
|
5 years ago |
GitHub
|
dd86e879
|
Separate out optimizer creation and policy graph creation (#3355)
|
5 years ago |
Ervin Teng
|
85249afc
|
Fix SAC scoping
|
5 years ago |
Ervin Teng
|
aec5fcc0
|
Fix policy tests
|
5 years ago |
Ervin Teng
|
cadf6603
|
Fix SAC CC and some reward signal tests
|
5 years ago |