
no entropy loss

/exp-continuous-div
Andrew Cohen, 3 years ago
Current commit
bcee3bf5
1 file changed, with 4 insertions and 4 deletions
  1. ml-agents/mlagents/trainers/sac/optimizer_torch.py (8 lines changed)



 total_value_loss.backward()
 self.value_optimizer.step()
-ModelUtils.update_learning_rate(self.entropy_optimizer, decay_lr)
-self.entropy_optimizer.zero_grad()
-entropy_loss.backward()
-self.entropy_optimizer.step()
+#ModelUtils.update_learning_rate(self.entropy_optimizer, decay_lr)
+#self.entropy_optimizer.zero_grad()
+#entropy_loss.backward()
+#self.entropy_optimizer.step()
 mede_loss = self._mede_network.loss(current_obs, sampled_actions, masks)
 ModelUtils.update_learning_rate(self._mede_optimizer, decay_lr)
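
This commit comments out the four lines that stepped the entropy optimizer, so the entropy coefficient presumably stops being trained, provided nothing else updates it. The sketch below is a dependency-free illustration of that effect, not the ml-agents code. It assumes the common SAC temperature loss, -mean(log_ent_coef * (log_prob + target_entropy)). The function names, the batch values, and the plain gradient-descent step standing in for `entropy_optimizer.step()` are all hypothetical.

```python
# Hypothetical sketch: what happens to a learned entropy coefficient
# when its optimizer step is skipped. Not the ml-agents implementation.


def entropy_loss_grad(log_probs, target_entropy):
    """Gradient of -mean(log_ent_coef * (log_prob + target_entropy))
    with respect to log_ent_coef (assumed loss form)."""
    return -sum(lp + target_entropy for lp in log_probs) / len(log_probs)


def train(steps, update_entropy, log_ent_coef=0.0, lr=0.1):
    log_probs = [-0.5, -1.5, -1.0]  # made-up batch of action log-probabilities
    target_entropy = 2.0            # made-up target entropy
    for _ in range(steps):
        if update_entropy:
            # Plain gradient descent, standing in for entropy_optimizer.step()
            log_ent_coef -= lr * entropy_loss_grad(log_probs, target_entropy)
        # With update_entropy=False the coefficient is never touched,
        # which mirrors commenting out the four optimizer lines.
    return log_ent_coef


with_update = train(steps=5, update_entropy=True)
without_update = train(steps=5, update_entropy=False)
print(with_update, without_update)
```

In this toy setup the coefficient drifts away from its initial value only when the update runs. With the update skipped it stays at whatever it was initialized to.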
