ml-agents

Author	SHA1	Message	Commit date
GitHub	213cd68d	Split Buffer into processing and update buffers (#2964) This is the first in a series of PRs that intend to move the agent processing logic (add_experiences and process_experiences) out of the trainer and into a separate class. The plan is to do so in steps: - Split the processing buffers (keeping track of agent trajectories and assembling trajectories) and update buffer (complete trajectories to be used for training) within the Trainer (this PR) - Move the processing buffer and add/process experiences into a separate, outside class - Change the data type of the update buffer to be a Trajectory - Place and read Trajectories from queues, add subscription mechanism for both AgentProcessor and Trainers	5 years ago
GitHub	42bea858	Improve mypy coverage by adding --namespace-packages (#3049)	5 years ago
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067)	5 years ago
GitHub	0b5b1b01	Develop magic string + trajectory (#3122) * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * trainer_controller expects name_behavior_ids * add_policy and create_policy separated * adjusting tests to expect trainer.add_policy to be called * fixing tests * fixed naming ...	5 years ago
GitHub	bec2e8f0	Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113)	5 years ago
GitHub	bed7debf	Fix issue with different decision intervals for different brains (#3181) * Move action check into agent_processor * Better loop for iterating over step_info * Add warning for agentmanager not found	5 years ago
GitHub	d985dded	Merge branch 'master' into merge-release-0.13.0	5 years ago
Andrew Cohen	4c260917	fix flake merge conflicts with master	5 years ago
GitHub	4c241a80	Only send previous action and current BrainInfo (#3187) This PR makes it so that the env_manager only sends one current BrainInfo and the previous actions (if any) to the AgentManager. The list of agents was added to the ActionInfo and used appropriately.	5 years ago
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207)	5 years ago
GitHub	56a67403	Fix lost trajectories when they are produced faster than they are consumed (#3233) * Fix bug when trajectories are produced faster than they are consumed * Cap max length	5 years ago
GitHub	a64e7850	Fix issue with BatchedStepResult with no agents (#3240)	5 years ago
GitHub	ca96b293	Move advance() logic for environment manager out of trainer_controller (#3234) This PR moves the AgentManagers from the TrainerController into the env_manager. This way, the TrainerController only needs to create the components (Trainers, AgentManagers) and call advance() on the EnvManager and the Trainers.	5 years ago
GitHub	590559e7	Make the Agent reset immediately after Done (#3291) * Made the Agent reset immediately * fixing the C# tests * Fixing the tests still * Trying with incremental episode ids * deleting buffer rather than using an empty list * Addressing the comments * Forgot to edit the comment on AgentInfo * Updating the migrating doc * Fixed an obvious bug * cleaning after an agent is done in agent processor * Fixing the pytest errors	5 years ago
Ervin Teng	7bbd91ad	Change logic to fix memory leak	5 years ago
GitHub	3939ca52	Change AgentProcessor logic to fix memory leak (#3383)	5 years ago
GitHub	f20a27e0	Clear agent processor properly on episode reset (#3437)	5 years ago
Ervin Teng	ff607162	Move learning rate reporting	5 years ago
GitHub	e4177de0	[change] Organize trainer files a bit better (#3538)	5 years ago
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 years ago
GitHub	6709a9bf	[change] Clean up trainer interface, clean up GhostTrainer stats (#3634)	5 years ago
Ervin Teng	3deb8e30	Make trainer in separate threads	5 years ago
GitHub	de3fc4e8	Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com>	5 years ago
GitHub	11c518a3	Stats SideChannel (for custom TensorBoard metrics) (#3660)	5 years ago
Ervin Teng	06fa3d39	Merge branch 'master' into develop-sac-apex	5 years ago
Ervin Teng	971e4b2d	Don't block when disabling threading	5 years ago
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 years ago
Ervin Teng	817aab95	Update steps_per_update documentation Add constant Tweak buffer max size	5 years ago
GitHub	83ac520a	Merge 0.15.1 to master (#3755) * Bumping version on the release (#3615) * Update examples project to 2018.4.18f1 (#3618) From 2018.4.14f1. An internal package dependency was updated as a side effect. * Remove dead components from the examples scenes (#3619) (#3624) * Improve warnings and exception if using unsupported combo * add meta file * fix unit test * enforce onnx conversion (expect tf2 CI to fail) (#3600) * Update error message * Updated the release branch docs (#3621) * Updated the release branch docs * Edited the README * make sure top-level timer is closed before writing * Remove space from Product Name for examples In #2588 it was suggested that the space in the Product Name for our example environments causes confusion when using a default build because of the need to escape the space in the build filename. This change removes the space from the Product Name in the project's player settings. * [bug-fix] Increase 3dbal...	5 years ago
Ervin Teng	f6fcf512	Clean up interface for AP	5 years ago
Ervin Teng	81f78aec	Make fields properties	5 years ago
Ervin Teng	f29b17a9	Don't block one policy queue Only put policies when policy is actually updated	5 years ago
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 years ago
Ervin Teng	d1fed8ae	Remove empty_queue interface	5 years ago
Ervin Teng	e90ef688	Revert to get_nowait method in AgentManagerQueue	5 years ago
Ervin Teng	e5fbfc35	Remove params from get_nowait	5 years ago
Ervin Teng	392fcb4e	Fix stall in ghost trainer non-threaded	5 years ago
GitHub	048d66fa	Update comment on time horizon in agent processor (#3842)	5 years ago
GitHub	4641038e	Renaming max_step to interrupted in TermialStep(s) (#3908)	5 years ago
Arthur Juliani	9724c9ac	Merge master	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138)	4 years ago
GitHub	20f1386a	Don't drop multiple stats from the same step (#4236)	4 years ago
Scott Jordan	d695c044	initial addition of active learning (incomplete)	4 years ago
Scott Jordan	56745026	Initial commit of running active learning code Active learning code is running on walker variable speed. Needs to be tested to see if it is working.	4 years ago
Scott Jordan	78f8a9a2	Updated task manager active learning is no optional and defaults to uniform sampling of tasks. Renamed ActiveLearningTaskManager to just TaskManager	4 years ago
Scott Jordan	87969325	added histogram recorded, fixed active learning bug added histogram recorder for task samples. Fixed a bug that prevented active learning from being used.	4 years ago
Andrew Cohen	9c2be310	commenting action pre continuous	4 years ago
Andrew Cohen	eaecb59e	torch utils to and from buffer	4 years ago
GitHub	b853e5ba	Action buffer (#4612) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	3c96a3a2	Action Model (#4580) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704)	4 years ago
vincentpierre	e14e1c4d	Improvements and new tests	4 years ago
Ervin Teng	3b15cc32	Multiprocessing but Stats are quite broken	4 years ago
Andrew Cohen	3f771e61	add ActionBuffers and utils	4 years ago
Ervin Teng	15c463cf	Add collab obs to trajectory	4 years ago
Ervin Teng	f479ce83	Fix bug; add critic_obs to buffer	4 years ago
Andrew Cohen	bd917c9c	action buffer passes continuous	4 years ago
Andrew Cohen	85e4db33	bc tests pass	4 years ago
Ervin Teng	56dcd75a	Get next critic observations into value estimate	4 years ago
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 years ago
Andrew Cohen	cd73cce2	test_trajectory fixed	4 years ago
Andrew Cohen	3c65b964	fixed recurrent prev_action issue	4 years ago
Andrew Cohen	e9cb1066	agent processor tests	4 years ago
Ruo-Ping Dong	fbfdc05b	send and process team manager id	4 years ago
Ruo-Ping Dong	413246c2	remove print	4 years ago
vincentpierre	f7a4a31f	[Experiment] Bullet hell	4 years ago
GitHub	8a40c58a	Added SUM as aggregation type for custom statistics (#4816)	4 years ago
GitHub	7387a77f	remove pylint (#4836) * remove pylint * remove other pylint disables	4 years ago
Andrew Cohen	231328ea	remove warning prints	4 years ago
Ervin Teng	aba633b2	Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm	4 years ago
Ruo-Ping Dong	180d3e20	Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager	4 years ago
GitHub	70220f95	Team manager prototype (#4850) * remove group id * very rough sketch for TeamManager interface * add team manager id to proto * team manager for hallway * add manager to hallway * send and process team manager id * remove print * small cleanup Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
Ervin Teng	a7e368b8	Fix AgentProcessor for TeamManager Should work for variable decision frequencies (untested)	4 years ago
Ervin Teng	fdf97d99	Add team reward to buffer	4 years ago
Ervin Teng	92fc78a5	Use new trajectory	4 years ago
Ruo-Ping Dong	910da750	change teammanager id from string to int	4 years ago
Ervin Teng	65b866b0	Actions added but untested	4 years ago
Ruo-Ping Dong	fb4a3bd2	fix grouping for int id	4 years ago
Ruo-Ping Dong	34a67a8e	fix passing manager id to trainer	4 years ago
Ruo-Ping Dong	e470fa12	make global manager id	4 years ago
Ruo-Ping Dong	d7ade5c3	update agent processor to use group id	4 years ago
Ervin Teng	30db9ef4	AgentProcessor fixes	4 years ago
Ervin Teng	514873bf	Use correct memories (t-1 instead of t) for training	4 years ago
Ervin Teng	eb13a14a	Renaming fest	4 years ago
Ervin Teng	a6b4917a	Use NamedTuples instead of attrs classes	4 years ago
Ervin Teng	a9116382	Bug fixes	4 years ago
Ervin Teng	4aee6787	more renaming	4 years ago
Ervin Teng	a25bb4d4	Global group ids	4 years ago
Ervin Teng	ae659ac4	Addressed some comments	4 years ago
Ervin Teng	ffdfd8ff	Address some comments	4 years ago
Ervin Teng	61781a1a	Merge branch 'main' into develop-agentprocessor-teammanager	4 years ago
GitHub	d36a5242	Python Dataflow for Group Manager (#4926) * Make buffer type-agnostic * Edit types of Apped method * Change comment * Collaborative walljump * Make collab env harder * Add group ID * Add collab obs to trajectory * Fix bug; add critic_obs to buffer * Set group ids for some envs * Pretty broken * Less broken PPO * Update SAC, fix PPO batching * Fix SAC interrupted condition and typing * Fix SAC interrupted again * Remove erroneous file * Fix multiple obs * Update curiosity reward provider * Update GAIL and BC * Multi-input network * Some minor tweaks but still broken * Get next critic observations into value estimate * Temporarily disable exporting * Use Vince's ONNX export code * Cleanup * Add walljump collab YAML * Lower max height * Update prefab * Update prefab * Collaborative Hallway * Set num teammates to 2 * Add config and group ids to HallwayCollab * Fix bug with hallway collab * E...	4 years ago
GitHub	2933f235	Fix the reporting of histogram stats and adding a test (#5410) * Fix the reporting of histogram stats and adding a test * Appending to the Changelog	3 years ago

94 commits (823fa3a5-ec34-4f3d-9781-77f244f9fbe0)