浏览代码

Fix pypi issues

/develop/action-slice
Ervin Teng 4 年前
当前提交
ac0b56bb
共有 1 个文件被更改,包括 3 次插入3 次删除
  1. 6
      ml-agents/mlagents/trainers/coma/trainer.py

6
ml-agents/mlagents/trainers/coma/trainer.py


and not trajectory.interrupted,
)
if value_memories is not None:
if value_memories is not None and baseline_memories is not None:
agent_buffer_trajectory[BufferKey.CRITIC_MEMORY].set(value_memories)
agent_buffer_trajectory[BufferKey.BASELINE_MEMORY].set(baseline_memories)

dtype=np.float32,
)
baseline_estimates = agent_buffer_trajectory[
baseline_estimate = agent_buffer_trajectory[
RewardSignalUtil.baseline_estimates_key(name)
].get_batch()
v_estimates = agent_buffer_trajectory[

value_next=value_next[name],
)
local_advantage = np.array(lambd_returns) - np.array(baseline_estimates)
local_advantage = np.array(lambd_returns) - np.array(baseline_estimate)
agent_buffer_trajectory[RewardSignalUtil.returns_key(name)].set(
lambd_returns

正在加载...
取消
保存