浏览代码

distance based penalty

/develop/bisim-sac-transfer
yanchaosun 5 年前
当前提交
92c3facf
共有 3 个文件被更改,包括 3 次插入3 次删除
  1. 2
      Project/Assets/ML-Agents/Examples/Reacher/Scripts/NewReacherAgent.cs
  2. 2
      Project/Assets/ML-Agents/Examples/Reacher/Scripts/ReacherAgent.cs
  3. 2
      config/sac_transfer/ReacherTransfer.yaml

2
Project/Assets/ML-Agents/Examples/Reacher/Scripts/NewReacherAgent.cs


// {
// AddReward(-0.002f);
// }
// AddReward( - 0.001f * (goal.transform.position - hand.transform.position).magnitude);
AddReward( - 0.001f * (goal.transform.position - hand.transform.position).magnitude);
// Debug.Log((goal.transform.position - hand.transform.position).magnitude);
var radians = m_GoalDegree * Mathf.PI / 180f;
var goalX = 8f * Mathf.Cos(radians);

2
Project/Assets/ML-Agents/Examples/Reacher/Scripts/ReacherAgent.cs


// {
// AddReward(-0.002f);
// }
// AddReward( - 0.001f * (goal.transform.position - hand.transform.position).magnitude);
AddReward( - 0.001f * (goal.transform.position - hand.transform.position).magnitude);
// Debug.Log((goal.transform.position - hand.transform.position).magnitude);
var radians = m_GoalDegree * Mathf.PI / 180f;
var goalX = 8f * Mathf.Cos(radians);

2
config/sac_transfer/ReacherTransfer.yaml


train_model: false
load_action: true
train_action: false
transfer_path: "results/reacher-ori-sta/Reacher"
transfer_path: "results/reacher-stack/Reacher"
network_settings:
normalize: true
hidden_units: 128

正在加载...
取消
保存