* [Cold Fix] Split the way cumulative rewards and episode length are counted
  The reward is added to the cumulative reward at each step.
  The episode count is ONLY incremented when d_t+1 is false.
/develop-generalizationTraining-TrainerController
Arthur Juliani
7 years ago
Current commit
9477eaa9
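
The counting rule described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the commit itself: `EpisodeStats`, `update_stats`, and the `next_done` flag (standing in for d_t+1) are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class EpisodeStats:
    """Hypothetical per-agent counters; not the trainer's real data structure."""
    cumulative_reward: float = 0.0
    episode_length: int = 0


def update_stats(stats: EpisodeStats, reward: float, next_done: bool) -> EpisodeStats:
    """Apply one environment step to the counters.

    The reward is always added to the cumulative reward, while the
    length counter only advances when the next step is not terminal
    (next_done plays the role of d_t+1 in the commit message).
    """
    stats.cumulative_reward += reward
    if not next_done:
        stats.episode_length += 1
    return stats


if __name__ == "__main__":
    stats = EpisodeStats()
    # Three steps; the last one leads into a terminal state.
    for reward, next_done in [(1.0, False), (0.5, False), (2.0, True)]:
        update_stats(stats, reward, next_done)
    print(stats)
```

Keeping the two updates separate means the terminal step's reward still counts toward the total even though that step does not increment the length counter.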