ml-agents

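The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents.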

Tree: ef2dfd4c

File history

GitHub 213cd68d Split Buffer into processing and update buffers (#2964)	5 years ago

This is the first in a series of PRs that intend to move the agent processing logic (add_experiences and process_experiences) out of the trainer and into a separate class. The plan is to do so in steps:
- Split the processing buffers (which keep track of agent trajectories and assemble them) from the update buffer (complete trajectories to be used for training) within the Trainer (this PR)
- Move the processing buffer and add/process experiences into a separate, outside class
- Change the data type of the update buffer to be a Trajectory
- Place and read Trajectories from queues, and add a subscription mechanism for both AgentProcessor and Trainers
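The split described in the first step can be pictured with a minimal sketch. This is not the code from the PR: the class `SplitBufferSketch`, its `add_experience` method, and the dictionary-based experience format are hypothetical names chosen only for illustration. It assumes one processing buffer per agent that collects experiences until the episode ends, at which point the finished trajectory is moved into a single update buffer used for training.

```python
from collections import defaultdict
from typing import Any, Dict, List

# Hypothetical experience record; the real project uses its own buffer types.
Experience = Dict[str, Any]


class SplitBufferSketch:
    """Illustrative only: per-agent processing buffers plus one update buffer."""

    def __init__(self) -> None:
        # In-progress trajectories, keyed by agent id.
        self.processing_buffers: Dict[str, List[Experience]] = defaultdict(list)
        # Experiences from completed trajectories, ready for training.
        self.update_buffer: List[Experience] = []

    def add_experience(self, agent_id: str, experience: Experience, done: bool) -> None:
        # Always record the step in the agent's own processing buffer first.
        self.processing_buffers[agent_id].append(experience)
        if done:
            # Episode finished: move the whole trajectory to the update buffer
            # and discard the agent's processing buffer.
            self.update_buffer.extend(self.processing_buffers.pop(agent_id))


buf = SplitBufferSketch()
buf.add_experience("agent-0", {"reward": 0.0}, done=False)
buf.add_experience("agent-1", {"reward": 1.0}, done=True)
buf.add_experience("agent-0", {"reward": 0.5}, done=True)
rewards = [e["reward"] for e in buf.update_buffer]
```

Keeping unfinished trajectories out of the update buffer means that training code only ever sees complete episodes, which is presumably what makes the later steps (moving the processing side into a separate class and passing trajectories through queues) possible.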
bc	Split Buffer into processing and update buffers (#2964)	5 years ago
reward_signals	Split Buffer into processing and update buffers (#2964)	5 years ago
__init__.py	Refactor reward signals into separate class (#2144)	5 years ago