浏览代码

crawler: max episode length=1000; new config: 1 forward layer

/develop/bisim-sac-transfer
yanchaosun 4 年前
当前提交
49d6b70c
共有 7 个文件被更改,包括 27 次插入15 次删除
  1. 2
      Project/Assets/ML-Agents/Examples/Crawler/Prefabs/Crawler.prefab
  2. 6
      Project/Assets/ML-Agents/Examples/Crawler/Scenes/CrawlerStaticTarget.unity
  3. 2
      Project/Assets/ML-Agents/Examples/OldCrawler/Prefabs/OldCrawler.prefab
  4. 6
      Project/Assets/ML-Agents/Examples/OldCrawler/Scenes/OldCrawlerStaticTarget.unity
  5. 12
      config/sac_transfer/CrawlerStatic.yaml
  6. 10
      config/sac_transfer/CrawlerStaticTransfer.yaml
  7. 4
      ml-agents/mlagents/trainers/sac_transfer/network.py

2
Project/Assets/ML-Agents/Examples/Crawler/Prefabs/Crawler.prefab


agentParameters:
maxStep: 0
hasUpgradedFromAgentParameters: 1
MaxStep: 5000
MaxStep: 1000
maximumWalkingSpeed: 999
target: {fileID: 0}
targetSpawnRadius: 40

6
Project/Assets/ML-Agents/Examples/Crawler/Scenes/CrawlerStaticTarget.unity


propertyPath: m_LocalEulerAnglesHint.z
value: 0
objectReference: {fileID: 0}
- target: {fileID: 3386028169429758298, guid: 1b9d167a4e71146a883212e4f08bda88,
type: 3}
propertyPath: m_Model
value:
objectReference: {fileID: 11400000, guid: 2ddbe780c2a664a8db253b5eef2c21dc,
type: 3}
m_RemovedComponents: []
m_SourcePrefab: {fileID: 100100000, guid: 1b9d167a4e71146a883212e4f08bda88, type: 3}
--- !u!1001 &680193204

2
Project/Assets/ML-Agents/Examples/OldCrawler/Prefabs/OldCrawler.prefab


agentParameters:
maxStep: 0
hasUpgradedFromAgentParameters: 1
MaxStep: 5000
MaxStep: 1000
target: {fileID: 0}
ground: {fileID: 0}
detectTargets: 0

6
Project/Assets/ML-Agents/Examples/OldCrawler/Scenes/OldCrawlerStaticTarget.unity


propertyPath: m_LocalEulerAnglesHint.z
value: 0
objectReference: {fileID: 0}
- target: {fileID: 4245501741859669186, guid: a1a421bfb42ce4b29b585c08752f03a7,
type: 3}
propertyPath: m_Model
value:
objectReference: {fileID: 11400000, guid: 06e8d119a177741a0904878e0132931b,
type: 3}
m_RemovedComponents: []
m_SourcePrefab: {fileID: 100100000, guid: a1a421bfb42ce4b29b585c08752f03a7, type: 3}
--- !u!1001 &943263318

12
config/sac_transfer/CrawlerStatic.yaml


save_replay_buffer: false
init_entcoef: 1.0
reward_signal_steps_per_update: 20.0
encoder_layers: 3
policy_layers: 0
forward_layers: 0
value_layers: 0
feature_size: 512
encoder_layers: 2
policy_layers: 1
forward_layers: 1
value_layers: 2
feature_size: 128
action_feature_size: 256
action_feature_size: 128
separate_policy_train: true
separate_model_train: true
# separate_value_net: true

10
config/sac_transfer/CrawlerStaticTransfer.yaml


save_replay_buffer: false
init_entcoef: 1.0
reward_signal_steps_per_update: 20.0
encoder_layers: 3
policy_layers: 0
forward_layers: 0
encoder_layers: 2
policy_layers: 1
forward_layers: 1
feature_size: 300
feature_size: 128
action_layers: 2
action_feature_size: 128
separate_policy_train: true

train_model: false
load_action: true
train_action: false
transfer_path: "results/oldcs-f300/CrawlerStatic"
transfer_path: "results/oldcs-m1/CrawlerStatic"
network_settings:
normalize: true
hidden_units: 512

4
ml-agents/mlagents/trainers/sac_transfer/network.py


self.q1_heads, self.q2_heads, self.q1, self.q2 = self.create_q_heads(
self.stream_names,
hidden_q,
self.num_layers+2,
self.num_layers,
self.h_size,
self.join_scopes(scope, "q"),
)

self.num_layers+2,
self.num_layers,
self.h_size,
self.join_scopes(scope, "q"),
reuse=True,

正在加载...
取消
保存