浏览代码
Don't run value during policy evaluate, optimized soft update function (#4501)
Don't run value during policy evaluate, optimized soft update function (#4501)
* Don't run value during inference * Execute critic with LSTM * Address comments * Unformat * Optimized soft update * Move soft update to model utils * Add test for soft update/MLA-1734-demo-provider
GitHub
4 年前
当前提交
4e4ad7b0
共有 7 个文件被更改,包括 75 次插入 和 46 次删除
-
37ml-agents/mlagents/trainers/policy/torch_policy.py
-
27ml-agents/mlagents/trainers/sac/optimizer_torch.py
-
4ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py
-
10ml-agents/mlagents/trainers/tests/torch/test_policy.py
-
17ml-agents/mlagents/trainers/tests/torch/test_utils.py
-
2ml-agents/mlagents/trainers/torch/components/bc/module.py
-
24ml-agents/mlagents/trainers/torch/utils.py
撰写
预览
正在加载...
取消
保存
Reference in new issue