20 次代码提交 (cd46c9c2-6692-44ed-ba47-4373c2963f36)

作者 SHA1 备注 提交日期
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 年前
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
Ervin Teng 9053610f Fix buffer tests and truncate 5 年前
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 年前
GitHub 1f9d04f2 Fix clear update buffer when trainer stops training, add test (#3422) 5 年前
Ervin Teng 184f27c6 Make buffer type-agnostic 4 年前
GitHub 64fc7f43 Buffer key enums (#4907) 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 年前
Ervin Teng 2f209c12 Buffer fixes 4 年前
Ervin Teng 50ab983e Fix slicing typing and string printing in AgentBufferField 4 年前
GitHub af36ef3b [bug-fix] Fix typo (#5035) 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
Ervin Teng c108da4a [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 4 年前