
Don't count buffer_init_steps

/develop/sac-apex
Ervin Teng, 5 years ago
Current commit
0fa2f4f7
1 file changed, with 6 insertions and 2 deletions
  1. ml-agents/mlagents/trainers/sac/trainer.py (8 changes)



        self.optimizer: SACOptimizer = None  # type: ignore
        self.step = 0
        self.update_steps = 0
        self.reward_signal_update_steps = 0
        # Don't count buffer_init_steps in steps_per_update ratio, but also don't divide-by-0
        self.update_steps = max(1, self.trainer_parameters["buffer_init_steps"])
        self.reward_signal_update_steps = max(
            1, self.trainer_parameters["buffer_init_steps"]
        )
        self.steps_per_update = (
            trainer_parameters["steps_per_update"]
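
The effect of the new initial value can be illustrated with a minimal standalone sketch. It assumes the trainer decides when to update with a loop of the form `while step / update_steps > steps_per_update`; that loop is not part of this diff, so treat it as an assumption. The helper names `initial_update_steps` and `updates_due` are hypothetical and exist only for this illustration.

```python
def initial_update_steps(buffer_init_steps: int) -> int:
    """Mirror the changed line: start the update counter at buffer_init_steps,
    clamped to at least 1 so a later division by it cannot fail."""
    return max(1, buffer_init_steps)


def updates_due(step: int, update_steps: int, steps_per_update: float) -> int:
    """Count the updates an ASSUMED loop of the form
    `while step / update_steps > steps_per_update` would run.
    The loop itself is not shown in the diff; it is a guess for illustration."""
    count = 0
    while step / update_steps > steps_per_update:
        update_steps += 1
        count += 1
    return count


if __name__ == "__main__":
    buffer_init_steps = 1000
    step = buffer_init_steps + 1  # first environment step after the buffer is filled

    # Counter started at 1: the warm-up steps count toward the ratio,
    # so a large backlog of updates is due at once.
    print(updates_due(step, 1, 1.0))

    # Counter started at buffer_init_steps: only the step taken after
    # warm-up counts, so a single update is due.
    print(updates_due(step, initial_update_steps(buffer_init_steps), 1.0))
```

Under that assumed loop, seeding the counter with `buffer_init_steps` keeps the warm-up phase out of the `steps_per_update` ratio, and the `max(1, ...)` clamp keeps the division defined when `buffer_init_steps` is 0.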
