25 次代码提交 (88998fc9-c589-4dd6-866a-bfc0a93d6e15)

作者 SHA1 备注 提交日期
Ervin Teng 0ef40c08 SAC CC working 5 年前
Ervin Teng 28f7608f Clean up value head creation 5 年前
Ervin Teng 1b6e175c Fix discrete SAC and clean up policy 5 年前
Ervin Teng a5caf4d6 Remove epsilon from everywhere 5 年前
Ervin Teng 8e300036 Add some typing to optimizer 5 年前
Ervin Teng 5ec49542 SAC LSTM isn't broken 5 年前
Ervin Teng 78671383 Move initialization call around 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
GitHub dd86e879 Separate out optimizer creation and policy graph creation (#3355) 5 年前
Ervin Teng cdd57468 Re-fix scoping and add method to get all variables 5 年前
Ervin Teng dcbb90e1 Fix graph init in ghost trainer 5 年前
Ervin Teng 5f00782b Clean up some SAC LSTM 5 年前
Ervin Teng 328476d8 Move check for creation into nn_policy 5 年前
Ervin Teng ce110201 Add optional burn-in for SAC as well 5 年前
Ervin Teng cbfbff2c Split optimizer and TFOptimizer 5 年前
Ervin Teng 4d94e180 Move optimizer to common folder 5 年前
Ervin Teng ffdc41bb Removed floating constants 5 年前
Ervin Teng 7c0fa1c4 Remove action_holder placeholder 5 年前
Ervin Teng 1cfc461a Remove and rename tf_optimizer 5 年前
Ervin Teng ff607162 Move learning rate reporting 5 年前
Ervin Teng da6daebd Make create losses private 5 年前
Ervin Teng 14f2a7f2 Rename LearningModel to ModelUtils 5 年前
Ervin Teng 1156b9b3 Merge branch 'develop-splitpolicyoptimizer' into develop-removeactionholder 5 年前
Ervin Teng 23088088 Remove outdated comment 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前