8 次代码提交 (cd46c9c2-6692-44ed-ba47-4373c2963f36)

作者 SHA1 备注 提交日期
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 年前
GitHub 48578199 Fix brain name bug in offline bc (#1395) 6 年前
vincentpierre d1cb6ce0 Fix on the bc_offline_training using deep copies 6 年前
eshvk cc9bdf17 Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub a4d5b2d3 Doc/comment cleanup - Fix some occurrences of 'the the' (#2119) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
Chris Elion 3cb1755e When checking for the compatibility of the expert brain with the policy brain, we will remove the action descriptions from the dictionary of things we need to compare. This is to prevent the case where a user has different descriptions for his actions but still wants to train a brain using expert demonstrations. (#2517) 5 年前