GitHub
5 年前
当前提交
d009511a
共有 48 个文件被更改,包括 483 次插入 和 480 次删除
-
3.pre-commit-config.yaml
-
4SURVEY.md
-
2UnitySDK/Assets/ML-Agents/Plugins/Barracuda.Core/Barracuda.md
-
2UnitySDK/Assets/ML-Agents/Plugins/Barracuda.Core/LICENSE.md
-
10UnitySDK/Assets/ML-Agents/Plugins/Barracuda.Core/ReleaseNotes.md
-
4UnitySDK/README.md
-
34docs/Basic-Guide.md
-
16docs/Creating-Custom-Protobuf-Messages.md
-
2docs/FAQ.md
-
8docs/Getting-Started-with-Balance-Ball.md
-
2docs/Glossary.md
-
16docs/Installation.md
-
2docs/Learning-Environment-Best-Practices.md
-
112docs/Learning-Environment-Create-New.md
-
80docs/Learning-Environment-Design-Agents.md
-
16docs/Learning-Environment-Design.md
-
8docs/Learning-Environment-Examples.md
-
10docs/ML-Agents-Overview.md
-
2docs/Migrating.md
-
2docs/Readme.md
-
34docs/Training-Behavioral-Cloning.md
-
4docs/Training-Curriculum-Learning.md
-
70docs/Training-Generalized-Reinforcement-Learning-Agents.md
-
20docs/Training-ML-Agents.md
-
2docs/Training-Using-Concurrent-Unity-Instances.md
-
2docs/Training-on-Amazon-Web-Service.md
-
2docs/Training-on-Microsoft-Azure-Custom-Instance.md
-
2docs/Training-on-Microsoft-Azure.md
-
22docs/Unity-Inference-Engine.md
-
8docs/Using-Tensorboard.md
-
30docs/Using-Virtual-Environment.md
-
8docs/localized/KR/README.md
-
40docs/localized/KR/docs/Installation-Windows.md
-
12docs/localized/KR/docs/Installation.md
-
34docs/localized/KR/docs/Training-Imitation-Learning.md
-
48docs/localized/KR/docs/Training-PPO.md
-
10docs/localized/KR/docs/Using-Docker.md
-
4docs/localized/zh-CN/README.md
-
66docs/localized/zh-CN/docs/Getting-Started-with-Balance-Ball.md
-
8docs/localized/zh-CN/docs/Installation.md
-
24docs/localized/zh-CN/docs/Learning-Environment-Create-New.md
-
8docs/localized/zh-CN/docs/Learning-Environment-Design.md
-
2docs/localized/zh-CN/docs/Learning-Environment-Examples.md
-
60docs/localized/zh-CN/docs/ML-Agents-Overview.md
-
4docs/localized/zh-CN/docs/Readme.md
-
92gym-unity/README.md
-
6ml-agents-envs/README.md
-
6protobuf-definitions/README.md
|
|||
# Unity ML-Agents Toolkit Survey |
|||
|
|||
Your opinion matters a great deal to us. Only by hearing your thoughts on the Unity ML-Agents Toolkit can we continue to improve and grow. Please take a few minutes to let us know about it. |
|||
Your opinion matters a great deal to us. Only by hearing your thoughts on the Unity ML-Agents Toolkit can we continue to improve and grow. Please take a few minutes to let us know about it. |
|||
[Fill out the survey](https://goo.gl/forms/qFMYSYr5TlINvG6f1) |
|||
[Fill out the survey](https://goo.gl/forms/qFMYSYr5TlINvG6f1) |
|
|||
Barracuda cross-platform Neural Net engine copyright © 2018 Unity Technologies ApS |
|||
|
|||
Licensed under the Unity Companion License for Unity-dependent projects--see [Unity Companion License](http://www.unity3d.com/legal/licenses/Unity_Companion_License). |
|||
Licensed under the Unity Companion License for Unity-dependent projects--see [Unity Companion License](http://www.unity3d.com/legal/licenses/Unity_Companion_License). |
|||
|
|||
Unless expressly provided otherwise, the Software under this license is made available strictly on an “AS IS” BASIS WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. Please review the license for details on these and other terms and conditions. |
|||
|
|
|||
# Unity ML-Agents SDK |
|||
|
|||
Contains the ML-Agents Unity Project, including |
|||
both the core plugin (in `Scripts`), as well as a set |
|||
Contains the ML-Agents Unity Project, including |
|||
both the core plugin (in `Scripts`), as well as a set |
|||
of example environments (in `Examples`). |
|
|||
# Training with Behavioral Cloning |
|||
|
|||
There are a variety of possible imitation learning algorithms which can |
|||
be used, the simplest one of them is Behavioral Cloning. It works by collecting |
|||
demonstrations from a teacher, and then simply uses them to directly learn a |
|||
policy, in the same way the supervised learning for image classification |
|||
There are a variety of possible imitation learning algorithms which can |
|||
be used, the simplest one of them is Behavioral Cloning. It works by collecting |
|||
demonstrations from a teacher, and then simply uses them to directly learn a |
|||
policy, in the same way the supervised learning for image classification |
|||
With offline behavioral cloning, we can use demonstrations (`.demo` files) |
|||
With offline behavioral cloning, we can use demonstrations (`.demo` files) |
|||
1. Choose an agent you would like to learn to imitate some set of demonstrations. |
|||
2. Record a set of demonstration using the `Demonstration Recorder` (see [here](Training-Imitation-Learning.md)). |
|||
For illustrative purposes we will refer to this file as `AgentRecording.demo`. |
|||
3. Build the scene(make sure the Agent is not using its heuristic). |
|||
4. Open the `config/offline_bc_config.yaml` file. |
|||
5. Modify the `demo_path` parameter in the file to reference the path to the |
|||
demonstration file recorded in step 2. In our case this is: |
|||
1. Choose an agent you would like to learn to imitate some set of demonstrations. |
|||
2. Record a set of demonstration using the `Demonstration Recorder` (see [here](Training-Imitation-Learning.md)). |
|||
For illustrative purposes we will refer to this file as `AgentRecording.demo`. |
|||
3. Build the scene(make sure the Agent is not using its heuristic). |
|||
4. Open the `config/offline_bc_config.yaml` file. |
|||
5. Modify the `demo_path` parameter in the file to reference the path to the |
|||
demonstration file recorded in step 2. In our case this is: |
|||
6. Launch `mlagent-learn`, providing `./config/offline_bc_config.yaml` |
|||
as the config parameter, and include the `--run-id` and `--train` as usual. |
|||
Provide your environment as the `--env` parameter if it has been compiled |
|||
6. Launch `mlagent-learn`, providing `./config/offline_bc_config.yaml` |
|||
as the config parameter, and include the `--run-id` and `--train` as usual. |
|||
Provide your environment as the `--env` parameter if it has been compiled |
|||
This will use the demonstration file to train a neural network driven agent |
|||
to directly imitate the actions provided in the demonstration. The environment |
|||
This will use the demonstration file to train a neural network driven agent |
|||
to directly imitate the actions provided in the demonstration. The environment |
|||
will launch and be used for evaluating the agent's performance during training. |