82 次代码提交 (1859f252-dd10-4aec-944f-914fa18f05f1)

作者 SHA1 备注 提交日期
Ervin Teng 1859f252 Merge commit 'fbcdd83c087135f870e785cc72e5ff9a7e898e3a' into develop-splitpolicyoptimizer 4 年前
Ervin Teng be9d772e Add option to not condition sigma on obs 4 年前
Ervin Teng c9fbb111 Fix entropy calculation 4 年前
Ervin Teng bcc25d59 Merge branch 'master' into develop-splitpolicyoptimizer 4 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 4 年前
Ervin Teng c60e16c9 Correct memory size docs 4 年前
Ervin Teng 8abd4129 Clean up nn_policy 4 年前
Ervin Teng 7004604d Used NamedTuple for create normalization tensors 4 年前
Ervin Teng ffdc41bb Removed floating constants 4 年前
Ervin Teng 441e6a0c Add typing to optimizer, rename self.tf_optimizer 4 年前
Ervin Teng 5bfc0b87 Update docstring 4 年前
Ervin Teng c350c6d8 Added enforcement of m_size to be divisible by 2 4 年前
Ervin Teng c68b5643 Remove multi_gpu from learn test 4 年前
Ervin Teng faa9c702 Fix one more test for multi_gpu 4 年前
Ervin Teng 00017bab Temporarily remove multi-GPU 4 年前
Ervin Teng f9c0f169 Delete extra tf_optimizer 4 年前
Ervin Teng 7d5c1b0b Add docstring and make some methods private 4 年前
Ervin Teng d969e013 Remove extra tf_optimizer 4 年前
Ervin Teng 4d94e180 Move optimizer to common folder 4 年前
Ervin Teng cbfbff2c Split optimizer and TFOptimizer 4 年前
Ervin Teng 632ff859 add init 4 年前
Ervin Teng 31c844e2 Change memory size definition in docs 4 年前
Ervin Teng ce110201 Add optional burn-in for SAC as well 4 年前
Ervin Teng f64bdc4b Fix SAC RNN test 4 年前
Ervin Teng 328476d8 Move check for creation into nn_policy 4 年前
Ervin Teng 14720e2d Remove burn-in 4 年前
Ervin Teng 1407db53 Fix Barracuda export for LSTM 4 年前
Ervin Teng d4ee7346 Merge commit 'f9c05a61d574305497789b5997f1ae3ea1b1ad3b' into develop-splitpolicyoptimizer 4 年前
Ervin Teng 5f00782b Clean up some SAC LSTM 4 年前
Ervin Teng 7a401feb Remove float64 numpy 4 年前
Ervin Teng dcbb90e1 Fix graph init in ghost trainer 4 年前
Ervin Teng cb2d2526 Reformat using black 4 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 4 年前
Ervin Teng 1c4f60d4 remove more PPO tests 4 年前
Ervin Teng d02bfbd4 Remove PPO policy tests 4 年前
Ervin Teng dc43b0c6 Add test for NN policy 4 年前
Ervin Teng 2eda5575 Fix discrete scoping 4 年前
Ervin Teng cdd57468 Re-fix scoping and add method to get all variables 4 年前
Ervin Teng 1f094da9 Fix policy's scoping 4 年前
GitHub dd86e879 Separate out optimizer creation and policy graph creation (#3355) 4 年前
Ervin Teng 85249afc Fix SAC scoping 4 年前
Ervin Teng aec5fcc0 Fix policy tests 4 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 4 年前
Ervin Teng 78671383 Move initialization call around 4 年前
Ervin Teng a6e28cf4 Fix for visual obs 4 年前
Ervin Teng cfc2f455 Fix BC and tests 4 年前
Ervin Teng 4871f49c Fix comments for PPO 4 年前
Ervin Teng 7d616651 Add burn-in for memory PPO 4 年前
Ervin Teng 08cb91de Remove __init__ for LearningModel static class 4 年前
Ervin Teng ab9b082a Fix Hallway summary freq 4 年前