ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166 ) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
GitHub	e676017b	Reorganize learn.py (#302 ) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 年前
Vincent Gao	621ba3af	clarify the python docs and learn.py help message	7 年前
Vincent Gao	6806c801	resolved comments	7 年前
Vincent Gao	2f373c5a	fixed the learn.py with a better way	7 年前
Arthur Juliani	ce5e2dba	[Added Ascii art on learn.py] (#727 ) * [Added Ascii art on learn.py] Note : This is by far the best feature of 0.4	7 年前
GitHub	ffcf8c9c	Newer Ascii Art (#780 ) Replaced UNITY ML AGENTS with the unity logo	7 年前
GitHub	7914387f	Develop communicator redesign (#638 ) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 年前
Arthur Juliani	d7338050	Enable concurrent sessions	6 年前
Arthur Juliani	5d402be9	Minor Optimizations (#836 )	6 年前
Arthur Juliani	195ac934	Merge branch 'develop' into develop-runs # Conflicts: # python/learn.py # python/unitytrainers/trainer.py	6 年前
Arthur Juliani	11b50054	Replace Ray with multiprocess	6 年前
unityjeffrey	0d67f311	changed ml agents to ml-agents	6 年前
unityjeffrey	19fb437a	changed to Unity ML-Agents Toolkit (english)	6 年前
Arthur Juliani	f52d5a92	Merge remote-tracking branch 'origin/develop' into develop-runs	6 年前
Arthur Juliani	3b916dd9	Add exception for in-edtior training	6 年前
Arthur Juliani	ffe365dc	Add white space	6 年前
GitHub	9538d699	Move seed randomization to learn.py (#1071 ) * Move seed randomization to learn.py * Remove print statement	6 年前
Arthur Juliani	567ad3f0	fix Unity-Technologies/ml-agents#1041 (#1102 )	6 年前
GitHub	2edaf342	Clean up learn.py (#1106 )	6 年前
GitHub	106d562d	Fix for Windows (#1120 ) addresses #1113	6 年前

25 次代码提交 (ac5e6bc7-9a4d-4b3d-b879-f70a45febf0e)