浏览代码
Fix get_value_estimate and buffer append (#2276)
Fix get_value_estimate and buffer append (#2276)
Fixes shuffling issue with newer versions of numpy (#1798). * make get_value_estimates output a dict of floats * Use np.append instead of convert to list, unconvert * Add type hints and test for get_value_estimates/develop-generalizationTraining-TrainerController
GitHub
5 年前
当前提交
a5b7cf95
共有 3 个文件被更改,包括 53 次插入 和 7 次删除
-
18ml-agents/mlagents/trainers/ppo/policy.py
-
11ml-agents/mlagents/trainers/ppo/trainer.py
-
31ml-agents/mlagents/trainers/tests/test_ppo.py
撰写
预览
正在加载...
取消
保存
Reference in new issue