Merge branch 'develop-sac-apex' of github.com:Unity-Technologies/ml-agents into develop-sac-apex

/develop/sac-apex
Ervin Teng, 4 years ago
Current commit
56cf2ddb
1 file changed, 2 insertions and 2 deletions
docs/Training-SAC.md (4 lines changed)

`steps_per_update` is the number of agent steps (actions) taken for each mini-batch sampled and used during training. In SAC, a single "update" corresponds to grabbing a batch of size `batch_size` from the experience
replay buffer, and using this mini-batch to update the models. Typically, this should be greater
than 1. Note that setting `steps_per_update` lower will improve sample efficiency (reduce the number of steps required to train)
but increase the CPU time spent performing updates. For most environments where steps are fairly fast (e.g. our example
environments), setting `steps_per_update` equal to the number of agents in the scene is a good balance. For slow environments (steps
take 0.1 seconds or more), reducing `steps_per_update` may improve training speed.
`steps_per_update` can also be set lower than 1 to update more often than once per step, though this is usually
not necessary.
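
To make the trade-off concrete, here is a minimal sketch of how the ratio between agent steps and updates could be enforced. It is not the ML-Agents trainer code: the function name `count_updates` is hypothetical, and the sketch assumes the trainer simply runs updates after each step until one update has been performed per `steps_per_update` steps, ignoring any initial buffer warm-up.

```python
def count_updates(total_steps: int, steps_per_update: float) -> int:
    """Return how many mini-batch updates would run over `total_steps` agent steps.

    Hypothetical illustration: after every agent step, run updates until
    the number of updates has caught up with step / steps_per_update.
    """
    if steps_per_update <= 0:
        raise ValueError("steps_per_update must be positive")

    updates = 0
    for step in range(1, total_steps + 1):
        # Perform every update that is "owed" at this step count.
        while step / steps_per_update >= updates + 1:
            updates += 1
    return updates
```

Under these assumptions, 100 agent steps yield 25 updates with `steps_per_update` set to 4, 100 updates with a value of 1, and 200 updates with a value of 0.5. A lower value extracts more updates from the same number of steps, at the cost of more CPU time spent updating.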
