* [Semantics] Modified the semantics for the documentation
* [Semantics] Updated the images
* [Semantics] Made further changes to the docs based on the comments received
* Adds an implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to the PPO trainer.
* To enable, set the use_curiosity flag to true in the trainer hyperparameter file (see the sketch below).
* Includes a refactor of the unitytrainers model code to accommodate the new feature.
* Adds a new Pyramids environment (with documentation). The environment contains a sparse reward and can only be solved using PPO + curiosity.
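As a rough illustration, a trainer-config entry enabling curiosity might look like the sketch below; the brain name and numeric values are hypothetical, and the exact key names may vary between releases:

```yaml
# Hypothetical excerpt from a trainer hyperparameter file (key names and
# values are illustrative; check the release's docs for the exact schema).
PyramidsLearning:
  trainer: ppo
  use_curiosity: true       # enables the intrinsic curiosity module
  curiosity_strength: 0.01  # scales the intrinsic reward added to the extrinsic one
  curiosity_enc_size: 256   # width of the ICM state-encoding layers
```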
* Documentation Update
* addressed comments
* new images for the recorder
* Improvements to the docs
* Address the comments
* Fixed Core ML typo
* Updated the links to the inference repo
* Put back Inference-Engine.md
* fix typos: brain
* Re-add deleted file
* fix typos
* Addressed comments
* Create new class (RewardSignal) that represents a reward signal.
* Add value heads for each reward signal in the PPO model.
* Make summaries agnostic to the type of reward signals, and log weighted rewards per reward signal.
* Move extrinsic and curiosity rewards into this new structure.
* Allow defining multiple reward signals in the trainer YAML file (see the sketch below). Add documentation for this new structure.
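A rough sketch of the new structure, assuming a hypothetical behavior name and illustrative values (exact key names may differ by release):

```yaml
# Hypothetical reward_signals section in a trainer config file.
BehaviorName:
  reward_signals:
    extrinsic:            # the environment's own reward
      strength: 1.0       # weight applied to this signal
      gamma: 0.99         # discount factor for this signal's value head
    curiosity:            # intrinsic curiosity reward
      strength: 0.02
      gamma: 0.99
      encoding_size: 256  # size of the ICM encoder
```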
Based on the new reward signals architecture, add a BC pretrainer and GAIL for PPO. Main changes (a config sketch follows this list):
- A new GAILRewardSignal and GAILModel for GAIL/VAIL
- A BCModule component (not a reward signal) to do pretraining during RL
- Documentation for both of these
- A change to the Demo Loader that lets you load multiple demo files from a folder
- Example Demo files for all of our tested sample environments (for future regression testing)
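A hedged sketch of how the two new pieces might be configured together; the behavior name, demo path, and values are all hypothetical:

```yaml
# Hypothetical config combining a GAIL reward signal with BC pretraining.
BehaviorName:
  reward_signals:
    gail:
      strength: 0.01                  # weight of the GAIL reward
      gamma: 0.99
      demo_path: demos/Expert.demo    # hypothetical demonstration file
  pretraining:                        # BCModule: runs alongside RL, not a reward signal
    demo_path: demos/Expert.demo
    strength: 0.5                     # scale of the BC loss relative to the RL loss
    steps: 10000                      # anneal the BC influence over this many steps
```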
* Included an explicit version number for the ZN docs
* added explicit version for KR docs
* minor fix in installation doc
* Consistency with numbers for reset parameters
* Removed extra verbiage; minor consistency fixes
* minor consistency fixes
* Cleaned up IL language
* moved parameter sampling above in list
* Cleaned up language in Env Parameter sampling
* Cleaned up migrating content
* updated consistency of Reset Parameter Sampling
* Rename Training-Generalization-Learning.md to Training-Generalization-Reinforcement-Learning-Agents.md
* Updated doc link for generalization
* Rename Training-Generalization-Reinforcement-Learning-Agents.md to Training-Generalized-Reinforcement-Learning-Agents.md
* Rewrote the intro paragraph for generalization
* added titles, cleaned up language for reset params
* Update Training-Generalized-Reinforcement-Learning-Agents.md
* cleanup of generalization doc
* More cleanu...
* Add the Soft Actor-Critic (SAC) model, trainer, and policy, plus sac_trainer_config.yaml (see the config sketch below).
* Add documentation for SAC and tweak the PPO documentation to reference the new pages.
* Add tests for SAC, and change the simple_rl test to run both PPO and SAC.
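For reference, an entry in sac_trainer_config.yaml might look roughly like this; the values are illustrative and some key names may differ by release:

```yaml
# Hypothetical sketch of a sac_trainer_config.yaml entry.
default:
  trainer: sac
  batch_size: 128
  buffer_size: 50000       # replay buffer capacity (SAC is off-policy)
  buffer_init_steps: 0     # random steps collected before learning starts
  init_entcoef: 1.0        # initial entropy coefficient
  tau: 0.005               # soft-update rate for the target network
  learning_rate: 3.0e-4
  max_steps: 5.0e5
```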
* Feature Deprecation: Online Behavioral Cloning
In this PR:
- Delete the online_bc_trainer
- Delete the tests for online BC
- Delete the configuration file for online BC training
* Deleting the BCTeacherHelper.cs Script
TODO:
- Remove usages in the scene
- Documentation Edits
*DO NOT MERGE*
* IMPORTANT: REMOVED ALL IL SCENES
- Removed all the IL scenes from the Examples folder
* Removed all mentions of online BC training in the Documentation
* Made a note in the Migrating.md doc about the removal of the Online BC feature.
* Modified the Academy UI to remove the control checkbox and replaced it with a "train in the editor" checkbox
* Removed the Broadcast functionality from the non-Learning brains
* Bug fix
* Note that the scenes are broken since the BroadcastHub has changed
* Modified the LL-API for Python to remove the broadcasting functionality.
* All unit tests are running
* Modified the scen...
* 1-to-1 Brain to Agent
This is a work in progress.
In this PR:
- Deleted all Brain objects
- Moved the BrainParameters into the Agent
- Gave the Agent a Heuristic method (see Balance Ball for an example, and the sketch after this list)
- Modified the Communicator and ModelRunner: Put can only take one agent at a time
- Made the IBrain interface with RequestDecision and DecideAction methods
No changes made to Python.
[Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#)
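A minimal C# sketch of the shapes described above, assuming only the method names stated in this PR; the class name, namespace, action sizes, and exact signatures are hypothetical:

```csharp
using MLAgents;     // assumed package namespace of this era
using UnityEngine;

// Sketch of the IBrain interface described above; exact signatures in the
// repo may differ.
public interface IBrain
{
    void RequestDecision(Agent agent); // queue an agent's observations for the next decision
    void DecideAction();               // compute and apply actions for all queued agents
}

// Hypothetical agent in the spirit of the Balance Ball example: Heuristic
// hand-codes actions for when no trained model is attached.
public class Ball3DHeuristicAgent : Agent
{
    public override float[] Heuristic()
    {
        var action = new float[2];
        action[0] = -Input.GetAxis("Horizontal"); // tilt around one axis
        action[1] = Input.GetAxis("Vertical");    // tilt around the other
        return action;
    }
}
```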
* Removing editorconfig
* Updating the BalanceBall scene
* fixed a grammar mistake
* Clearing the Agents of the Model runner
* Added Documentation on IBrain
* Modified comments on GiveModel
* Introduced a factory
* Split Learning Brain in two
* Changes to walljump
* Fixing the Unit tests
* Renaming the Brain to Policy
* Heuristic now has priority over training
* Edited code comments
* Fixing bugs
* Develop one to one scene edits...
Convert the UnitySDK to a Packman package.
- Separate the Examples into a sample project.
- Move the core UnitySDK code into com.unity.ml-agents.
- Create asmdefs for the ml-agents package.
- Add package validation tests for win/linux/mac.
- Update the protobuf generation scripts.
- Add Barracuda as a package dependency for ML-Agents (users no longer have to install it themselves).
* Merge agent & best practices doc. Plus other fixes
* Fix overly long lines
* Merge Getting Started and Basic Guides
* Rename guide and update links appropriately
* Fix broken link
Merged the "Overview" sections of a few pages into their respective sections in ML-Agents-Overview:
- Training-Using-Concurrent-Unity-Instances.md
- Training-Self-Play.md
- Training-SAC.md
- Training-PPO.md
- Training-Imitation-Learning.md
- Training-Environment-Parameter-Randomization.md
- Training-Curriculum-Learning.md
- Reward-Signals.md
- Feature-Monitor.md
- Feature-Memory.md
Organized ML-Agents-Overview into Training Methods and Training Options sections.
Follow-up action items (part of a separate PR):
- Smooth over the documentation in ML-Agents-Overview (right now, text was mostly pasted in from other pages). If we align on the new structure for this page, we can iterate on it.
- Update “Key Components” section with new graph and discuss side channels and revise use of Academy.
- Consolidate “Training-*” docs into Training-ML-Agents to offer a single guide for all hyperparameter selection