浏览代码

Fix 3DBall PPO hard regression (#4133)

/MLA-1734-demo-provider
GitHub 4 年前
当前提交
d42e82a8
共有 1 个文件被更改,包括 3 次插入3 次删除
  1. 6
      config/ppo/3DBallHard.yaml

6
config/ppo/3DBallHard.yaml


3DBallHard:
trainer_type: ppo
hyperparameters:
batch_size: 1200
batch_size: 120
buffer_size: 12000
learning_rate: 0.0003
beta: 0.001

vis_encode_type: simple
reward_signals:
extrinsic:
gamma: 0.995
gamma: 0.99
max_steps: 5000000
max_steps: 500000
time_horizon: 1000
summary_freq: 12000
threaded: true
正在加载...
取消
保存