ml-agents

作者	SHA1	备注	提交日期
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341 ) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 年前
GitHub	12d57671	Changing Training-RewardSignals.md --> Reward-Signals.md (#2525 )	5 年前
GitHub	c8796488	Markdown link check in CI (#2543 ) * check using xargs * fix broken BC link * install npm, run precommit before unit tests * try to install npm * try a node image build * add workflow * don't use precommit on node run * sudo make me a sandwich * pass config arg * revert CI order change * retry precommit * sudo apt-get * sudo npm * make sure fails on bad link * cleanup and refix link	5 年前
GitHub	3683cc1c	Enable learning rate decay to be disabled (#2567 )	5 年前
GitHub	17b3a805	Fix spelling error in documentation (#2636 )	5 年前
GitHub	39f280d6	Develop spawn brains (#2676 ) * Feature Deprecation : Online Behavioral Cloning In this PR : - Delete the online_bc_trainer - Delete the tests for online bc - delete the configuration file for online bc training * Deleting the BCTeacherHelper.cs Script TODO : - Remove usages in the scene - Documentation Edits DO NOT MERGE * IMPORTANT : REMOVED ALL IL SCENES - Removed all the IL scenes from the Examples folder * Removed all mentions of online BC training in the Documentation * Made a note in the Migrating.md doc about the removal of the Online BC feature. * Modified the Academy UI to remove the control checkbox and replaced it with a train in the editor checkbox * Removed the Broadcast functionality from the non-Learning brains * Bug fix * Note that the scenes are broken since the BroadcastHub has changed * Modified the LL-API for Python to remove the broadcasting functiuonality. * All unit tests are running * Modified the scen...	5 年前
GitHub	99146e97	1 to 1 Brain to Agent (#2729 ) * 1 to 1 Brain to Agent This is a work in progess In this PR : - Deleted all Brain Objects - Moved the BrainParameters into the Agent - Gave the Agent a Heuristic method (see Balance Ball for example) - Modified the Communicator and ModelRunner : Put can only take one agent at a time - Made the IBrain Interface with RequestDecision and DecideAction method No changes made to Python [Design Doc](https://docs.google.com/document/d/1hBhBxZ9lepGF4H6fc6Hu6AW7UwOmnyX3trmgI3HpOmo/edit#) * Removing editorconfig * Updating BallanceBall scene * grammar mistake * Clearing the Agents of the Model runner * Added Documentation on IBrain * Modified comments on GiveModel * Introduced a factory * Split Learning Brain in two * Changes to walljump * Fixing the Unit tests * Renaming the Brain to Policy * Heuristic now has priority over training * Edited code comments * Fixing bugs * Develop one to one scene edits...	5 年前
GitHub	1fa07edb	Remove Standalone Offline BC Training (#2969 )	5 年前
Yuan Gao	0817c44b	Moved the demo files	5 年前
GitHub	0ff8f9af	Create ML-Agents Package (#3267 ) Convert the UnitySDK to a Packman Package. - Separate Examples into a sample project. - Move core UnitySDK Code into com.unity.ml-agents. - Create asmdefs for the ml-agents package. - Add package validation tests for win/linux/max. - Update protobuf generation scripts. - Add Barracuda as a package dependency for ML-Agents. (users no longer have to install it themselves).	5 年前
Ervin Teng	31c844e2	Change memory size definition in docs	5 年前
Ervin Teng	c60e16c9	Correct memory size docs	5 年前
GitHub	c145e75b	Split Policy and Optimizer, common Policy for PPO and SAC (#3345 )	5 年前
GitHub	92f1315e	Combine "Getting Started" and "Basic" Guides (#3644 ) * Merge agent & best practices doc. Plus other fixes * Fix overly long lines * Merge Getting Started and Basic Guides * Rename guide and update links appropriately * Fix broken link	5 年前
Ervin Teng	8bf8c9a9	Update docs	5 年前
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710 )	5 年前
Ervin Teng	8b52a2d0	Address comments in docs	5 年前
Ervin Teng	817aab95	Update steps_per_update documentation Add constant Tweak buffer max size	5 年前
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 年前
Marwan Mattar	9084db7b	Consolidate Feature descriptions into ML-Agents-Overview page Merged the "Overview" sections of a few pages into their respective sections in ML-Agents-Overview: - Training-Using-Concurrent-Unity-Instances.md - Training-Self-Play.md - Training-SAC.md - Training-PPO.md - Training-Imitation-Learning.md - Training-Environment-Parameter-Randomization.md - Training-Curriculum-Learning.md - Reward-Signals.md - Feature-Monitor.md - Feature-Memory.md Organized ML-Agents-Overview into Training Methods and Training Options sections. Follow-up action items (part of a separate PR): - Smooth over the documentation in ML-Agents-Overview (right now, we somewhat just pasted text from other pages). If we align on the new structure for this page, we can iterate on it. - Update “Key Components” section with new graph and discuss side channels and revise use of Academy. - Consolidate “Training-*” docs into Training-ML-Agents to offer a single guide for all hyperparameter selection	5 年前
GitHub	8a79612a	Update docs/Training-SAC.md Fix typo Co-Authored-By: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	5 年前
GitHub	216f7ca9	Update docs/Training-SAC.md Co-Authored-By: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	5 年前
Ervin Teng	55c876c8	Update SAC documentation	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前

24 次代码提交 (823fa3a5-ec34-4f3d-9781-77f244f9fbe0)