浏览代码

Fxing test

/develop/rm-rf-new-models
vincentpierre 4 年前
当前提交
8f9634c2
共有 1 个文件被更改,包括 2 次插入3 次删除
  1. 5
      ml-agents/mlagents/trainers/sac/trainer.py

5
ml-agents/mlagents/trainers/sac/trainer.py


) / self.reward_signal_update_steps > self.reward_signal_steps_per_update:
# Get minibatches for reward signal update if needed
reward_signal_minibatches = {}
for name, signal in self.optimizer.reward_signals.items():
for name in self.optimizer.reward_signals.keys():
# Some signals don't need a minibatch to be sampled - so we don't!
if signal.update_dict:
if name != "extrinsic":
reward_signal_minibatches[name] = buffer.sample_mini_batch(
self.hyperparameters.batch_size,
sequence_length=self.policy.sequence_length,

正在加载...
取消
保存