浏览代码

disable bisim

/develop/bisim-sac-transfer
yanchaosun 4 年前
当前提交
9a19f6e5
共有 3 个文件被更改,包括 24 次插入5 次删除
  1. 2
      config/ppo_transfer/TransferCrawlerStatic.yaml
  2. 6
      ml-agents/mlagents/trainers/sac_transfer/trainer.py
  3. 21
      ml-agents/mlagents/trainers/tests/reward_plot.ipynb

2
config/ppo_transfer/TransferCrawlerStatic.yaml


use_var_predict: true
with_prior: false
predict_return: true
use_bisim: true
use_bisim: false
separate_value_net: true
use_transfer: true
load_policy: false

6
ml-agents/mlagents/trainers/sac_transfer/trainer.py


# Update the normalization
if self.is_training:
self.policy.update_normalization(agent_buffer_trajectory["vector_obs"])
self.policy.update_normalization(
agent_buffer_trajectory["vector_obs"],
agent_buffer_trajectory["next_vector_in"],
agent_buffer_trajectory["vector_obs"],
)
# Evaluate all reward functions for reporting purposes
self.collected_rewards["environment"][agent_id] += np.sum(

21
ml-agents/mlagents/trainers/tests/reward_plot.ipynb
文件差异内容过多而无法显示
查看文件

正在加载...
取消
保存