浏览代码

Fix more indexing bugs

/develop/critic-op-lstm-currentmem
Ervin Teng 4 年前
当前提交
8d834f0b
共有 1 个文件被更改,包括 6 次插入3 次删除
  1. 9
      ml-agents/mlagents/trainers/optimizer/torch_optimizer.py

9
ml-agents/mlagents/trainers/optimizer/torch_optimizer.py


):
seq_obs = []
for _obs in tensor_obs:
start = seq_num * self.policy.sequence_length - leftover
end = (seq_num + 1) * self.policy.sequence_length - leftover
start = seq_num * self.policy.sequence_length - (
self.policy.sequence_length - leftover
)
end = (seq_num + 1) * self.policy.sequence_length - (
self.policy.sequence_length - leftover
)
assert _obs[start:end].shape[0] == self.policy.sequence_length
values, _mem = self.critic.critic_pass(
seq_obs, _mem, sequence_length=self.policy.sequence_length
)

正在加载...
取消
保存