23 次代码提交 (1db18bd6-65c7-4859-bf35-891bb0856880)

作者 SHA1 备注 提交日期
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
Chris Elion dfdf7b83 fix whitespace and line breaks 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 9c5fdd31 Stats reporting is working 5 年前
GitHub 1fa07edb Remove Standalone Offline BC Training (#2969) 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng a5caf4d6 Remove epsilon from everywhere 5 年前
Ervin Teng cfc2f455 Fix BC and tests 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub adeb6536 Catch dimension mismatches between demos and policy (#3821) 4 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Andrew Cohen 7fe7f3fe fix tf bc test 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前