21 次代码提交 (1db18bd6-65c7-4859-bf35-891bb0856880)

作者 SHA1 备注 提交日期
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 5 年前
GitHub d80d5852 add some types to the reward signals (#2215) 5 年前
GitHub ab690b93 Fix naming conflict between Curiosity and GAIL (#2406) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
Chris Elion 73a346cb cleanup 5 年前
Ervin Teng 151e3b1c Move policy to common location, remove epsilon 5 年前
Ervin Teng b61d2fa1 Fix some typing issues with curiosity 5 年前
Ervin Teng cadf6603 Fix SAC CC and some reward signal tests 5 年前
Ervin Teng 5bfc0b87 Update docstring 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
Andrew Cohen f654df34 fixing tensorflow tests 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前