Unity ML - Agents (Python API)

Python Setup

To install dependencies, run:

pip install .

or

pip3 install .

If your Python environment doesn't include pip, see these instructions on installing it.
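Before installing, it can help to confirm that pip is importable by the interpreter you plan to use. The following is a minimal sketch rather than part of this repository: the helper names `pip_available` and `install_command` are hypothetical, and it relies only on the standard library plus the fact that `python -m pip install .` invokes pip for that specific interpreter.

```python
import importlib.util
import sys
from typing import List


def pip_available() -> bool:
    """Return True if the current interpreter can import pip."""
    return importlib.util.find_spec("pip") is not None


def install_command(path: str = ".") -> List[str]:
    """Build the pip invocation for the current interpreter.

    Using `sys.executable -m pip` avoids ambiguity between `pip` and `pip3`.
    """
    return [sys.executable, "-m", "pip", "install", path]


if __name__ == "__main__":
    if pip_available():
        # Print the command instead of running it, so nothing is installed by accident.
        print("pip found; to install dependencies, run:")
        print(" ".join(install_command()))
    else:
        print("pip is not available for", sys.executable)
```

Running the printed command from this directory should be equivalent to the `pip install .` step above for whichever interpreter executed the script.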

Two Jupyter notebooks are provided:

Basic - Demonstrates usage of the UnityEnvironment class for launching and interfacing with Unity environments.
PPO - Used for training agents. Contains an implementation of the Proximal Policy Optimization (PPO) reinforcement learning algorithm.

To launch Jupyter, run:

jupyter notebook

Then navigate to localhost:8888 to access each training notebook.

To monitor training progress, run the following from the root directory of this repo:

tensorboard --logdir='./summaries'

Then navigate to localhost:6006 to monitor progress with TensorBoard.
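If a page does not load, a quick way to tell whether the server is actually listening is to attempt a TCP connection to its port. The sketch below is an illustration, not part of this repository: `port_is_open` is a hypothetical helper, and the ports are the ones named above (8888 for Jupyter, 6006 for TensorBoard), which may differ if either tool was started with a custom port.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    for name, port in (("Jupyter", 8888), ("TensorBoard", 6006)):
        state = "reachable" if port_is_open("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port}: {state}")
```

A "not reachable" result usually means the corresponding command has not been started yet or is bound to a different port.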

To train using PPO without the notebook, run:

python3 ppo.py <env_name> --train

For a list of additional hyperparameters, run:

python3 ppo.py --help
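When launching training from another script, the two commands above can be assembled programmatically. This is a hedged sketch: `ppo_train_command`, `ppo_help_command`, and `run` are hypothetical helpers, the environment name is a placeholder, and only the `--train` and `--help` flags shown above are used. Run `python3 ppo.py --help` to see which other options your copy of the script accepts.

```python
import subprocess
from typing import List


def ppo_train_command(env_name: str, script: str = "ppo.py") -> List[str]:
    """Build the argument list for `python3 ppo.py <env_name> --train`."""
    return ["python3", script, env_name, "--train"]


def ppo_help_command(script: str = "ppo.py") -> List[str]:
    """Build the argument list for `python3 ppo.py --help`."""
    return ["python3", script, "--help"]


def run(cmd: List[str]) -> int:
    """Run a command and return its exit code without raising on failure."""
    return subprocess.run(cmd, check=False).returncode


if __name__ == "__main__":
    # "my_env" is a placeholder; substitute the name of your built Unity environment.
    print(" ".join(ppo_train_command("my_env")))
    print(" ".join(ppo_help_command()))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the environment name contains spaces.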

See this wiki page for a detailed description of the functions and uses of the Python API.

See this related blog post for a description of how to run Unity environments on AWS EC2 instances with a GPU.