ml-agents

Unity 机器学习代理工具包 (ML-Agents) 是一个开源项目，它使游戏和模拟能够作为训练智能代理的环境。

unity3d unity unity-tech reinforcement-le deep-learning deep-reinforcement-learning neural-networks

文件历史

yanchaosun 49d6b70c crawler: max episode length=1000; new config: 1 forward layer		4 年前
..
components	[refactor] Structure configuration files into classes (#3936)	4 年前
ghost	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087)	4 年前
optimizer	[refactor] Structure configuration files into classes (#3936)	4 年前
policy	fix action stop gradient	4 年前
ppo	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087)	4 年前
ppo_transfer	sac crawler config	4 年前
sac	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087)	4 年前
sac_transfer	crawler: max episode length=1000; new config: 1 forward layer	4 年前
tests	target encoders and new forward loss	4 年前
trainer	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087)	4 年前
__init__.py	Increment versions to 0.18.0 and 1.1.0 (#4075)	4 年前
action_info.py	Move advance() logic for environment manager out of trainer_controller (#3234)	5 年前
agent_processor.py	Renaming max_step to interrupted in TermialStep(s) (#3908)	5 年前
barracuda.py	fix errors from new flake8-comprehensions (#2917)	5 年前
behavior_id_utils.py	Asymmetric self-play (#3653)	5 年前
brain.py	removed extraneous logging imports and loggers	5 年前
brain_conversion_utils.py	WIP : Changes to the LL-API - Refactor of “done” logic (#3681)	5 年前
buffer.py	Fix clear update buffer when trainer stops training, add test (#3422)	5 年前
cli_utils.py	[refactor] Move checkpoint saving into trainer (#4034)	4 年前
curriculum.py	[refactor] Structure configuration files into classes (#3936)	4 年前
demo_loader.py	Catch dimension mismatches between demos and policy (#3821)	5 年前
distributions.py	Hotfixes for Release 0.15.1 (#3698)	5 年前
env_manager.py	[WIP] Side Channel Design Changes (#3807)	5 年前
exception.py	Combined model and policy for PPO	5 年前
learn.py	new transfer test for cloud	4 年前
meta_curriculum.py	[refactor] Store and restore state along with checkpoints (#4025)	4 年前
models.py	target critic for ppo	4 年前
run_experiment.py	[refactor] Structure configuration files into classes (#3936)	4 年前
settings.py	sac crawler config	4 年前
simple_env_manager.py	Moving domain randomization to C# (#4065)	4 年前
stats.py	Asymmetric self-play (#3653)	5 年前
subprocess_env_manager.py	Moving domain randomization to C# (#4065)	4 年前
tensorflow_to_barracuda.py	backport tf2bc changes from barracuda-release (#3341)	5 年前
trainer_controller.py	Merge branch 'master' into develop-sampler-refactor	4 年前
trainer_util.py	config fix; basic sac	4 年前
training_status.py	[refactor] Store and restore state along with checkpoints (#4025)	4 年前
trajectory.py	Renaming max_step to interrupted in TermialStep(s) (#3908)	5 年前
upgrade_config.py	Moving domain randomization to C# (#4065)	4 年前