* Fix the target entropy for continuous SAC
* Lower the required steps of the test and remove an unnecessary unsqueeze
* Change the target from -dim(a)^2 to -dim(a) by removing implicit broadcasting
/colab-links
GitHub
4 years ago
Current commit
fc6e8c35
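
The third bullet of the commit message can be illustrated with a small sketch. It is not the code from this commit: it uses NumPy rather than the project's own tensor library, and `act_dim`, `log_probs`, `broadcast_diff` and `summed_diff` are hypothetical names chosen for illustration. It assumes the target entropy is a scalar equal to -dim(a) and that log-probabilities are stored per action dimension.

```python
import numpy as np

act_dim = 3                        # hypothetical continuous action dimension
target_entropy = -float(act_dim)   # intended target: -dim(a)

# Hypothetical per-dimension log-probabilities, shape (batch, act_dim).
# Zeros are used so the printed values show only the target's contribution.
log_probs = np.zeros((2, act_dim))

# Implicit broadcasting: the scalar is added to every action dimension
# before the sum, so the target is counted act_dim times.
broadcast_diff = np.sum(log_probs + target_entropy, axis=1)

# Sum over the action dimensions first, then add the target exactly once.
summed_diff = np.sum(log_probs, axis=1) + target_entropy

print(broadcast_diff)  # effective target is -dim(a)^2
print(summed_diff)     # effective target is -dim(a)
```

Under these assumptions, adding the scalar before the reduction scales the target by dim(a), which matches the -dim(a)^2 versus -dim(a) discrepancy named in the commit message.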