浏览代码
* Normalize observations when adding experiences This change moves normalization of vector observations into the trainer's "add_experiences" interface. Prior to this change, normalization occurred at inference time. This was somewhat confusing since usually executing a forward pass shouldn't have side-effects which would change the training step. Also, in a asynchronous or distributed setting where we copy the neural network weights from a trainer to a remote actor / inference worker we'd end up with training issues because of the weights being different on the trainer than the workers./develop-gpu-test
GitHub
5 年前
当前提交
832e4a47
共有 5 个文件被更改,包括 16 次插入 和 14 次删除
正在加载...
Reference in new issue