ml-agents

作者	SHA1	备注	提交日期
GitHub	7b69bd14	Refactor Trainer and Model (#2360 ) - Move common functions to trainer.py, model.pyfromppo/trainer.py, ppo/policy.pyandppo/model.py' - Introduce RLTrainer class and move most of add_experiences and some common reward signal code there. PPO and SAC will inherit from this, not so much BC Trainer. - Add methods to Buffer to enable sampling, truncating, and save/loading. - Add scoping to create encoders in model.py	5 年前
GitHub	689765d6	Modification of reward signals and rl_trainer for SAC (#2433 ) * Adds evaluate_batch to reward signals. Evaluates on minibatch rather than on BrainInfo. * Changes the way reward signal results are reported in rl_trainer so that we get the pure, unprocessed environment reward separate from the reward signals. * Moves end_episode to rl_trainer * Fixed bug with BCModule with RNN	5 年前
Ervin Teng	e0da93d1	Fix bug with construct_curr_info and test	5 年前
Ervin Teng	aca81efb	Add more tests	5 年前
Ervin Teng	28ef8983	Add 2 visual obs test	5 年前
GitHub	5d3e05d1	Fix "memory leak" during inference (#2722 ) * Clear buffer if not training * Add tests	5 年前
GitHub	0fe5adc2	Develop remove memories (#2795 ) * Initial commit removing memories from C# and deprecating memory fields in proto * initial changes to Python * Adding functionalities * Fixes * adding the memories to the dictionary * Fixing bugs * tweeks * Resolving bugs * Recreating the proto * Addressing comments * Passing by reference does not work. Do not merge * Fixing huge bug in Inference * Applying patches * fixing tests * Addressing comments * Renaming variable to reflect type * test	5 年前
Andrew Cohen	13fe9cf8	Bubbled up indexing of AllBrainInfo to trainer controller from trainers	5 年前
GitHub	69d1a033	Develop remove past action communication (#2913 ) * Modifying the .proto files * attempt 1 at refactoring Python * works for ppo hallway * changing the documentation * now works with both sac and ppo both training and inference * Ned to fix the tests * TODOs : - Fix the demonstration recorder - Fix the demonstration loader - verify the intrinsic reward signals work - Fix the tests on Python - Fix the C# tests * Regenerating the protos * fix proto typo * protos and modifying the C# demo recorder * modified the demo loader * Demos are loading * IMPORTANT : THESE ARE THE FILES USED FOR CONVERSION FROM OLD TO NEW FORMAT * Modified all the demo files * Fixing all the tests * fixing ci * addressing comments * removing reference to memories in the ll-api	5 年前
Ervin Teng	df5ee7bf	Split buffer into two buffers (PPO works)	5 年前
Ervin Teng	29cdf77a	Fix RL tests	5 年前
Ervin Teng	fd0647a6	Rename append_update_buffer to append_to_update_buffer	5 年前
GitHub	652488d9	check for numpy float64 (#2948 )	5 年前
GitHub	213cd68d	Split Buffer into processing and update buffers (#2964 ) This is the first in a series of PRs that intend to move the agent processing logic (add_experiences and process_experiences) out of the trainer and into a separate class. The plan is to do so in steps: - Split the processing buffers (keeping track of agent trajectories and assembling trajectories) and update buffer (complete trajectories to be used for training) within the Trainer (this PR) - Move the processing buffer and add/process experiences into a separate, outside class - Change the data type of the update buffer to be a Trajectory - Place and read Trajectories from queues, add subscription mechanism for both AgentProcessor and Trainers	5 年前
Andrew Cohen	ef2dfd4c	adjusting tests to expect trainer.add_policy to be called	5 年前
Ervin Teng	336ca456	Kill the ProcessingBuffer	5 年前
Ervin Teng	62d609f8	Fix some of the tests	5 年前
Jonathan Harper	9f166f9e	Update tests to support pytest 5.x Our tests were using pytest fixtures by actually calling the fixture methods, but in newer 5.x versions of pytest this causes test failures. The recommended method for using fixtures is dependency injection. This change updates the relevant test fixtures to either not use `pytest.fixture` or to use dependency injection to pass the fixture. The version range requirements in `test_requirements.txt` were also updated accordingly.	5 年前
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067 )	5 年前
GitHub	0b5b1b01	Develop magic string + trajectory (#3122 ) * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * trainer_controller expects name_behavior_ids * add_policy and create_policy separated * adjusting tests to expect trainer.add_policy to be called * fixing tests * fixed naming ...	5 年前
GitHub	bec2e8f0	Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113 )	5 年前
Ervin Teng	48793ec1	Fix test	5 年前
Ervin Teng	c48ddcf2	Fix pre-commit error	5 年前
GitHub	1f9d04f2	Fix clear update buffer when trainer stops training, add test (#3422 ) * Fix clear update buffer when trainer stops training, add test * Fix buffer changing types when truncated	5 年前
GitHub	e4177de0	[change] Organize trainer files a bit better (#3538 )	5 年前
GitHub	6709a9bf	[change] Clean up trainer interface, clean up GhostTrainer stats (#3634 )	5 年前
Ervin Teng	f29b17a9	Don't block one policy queue Only put policies when policy is actually updated	5 年前
Ervin Teng	99ce4b59	Improve tests	5 年前
Ervin Teng	e90ef688	Revert to get_nowait method in AgentManagerQueue	5 年前
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829 )	5 年前
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936 )	5 年前
GitHub	09853e13	[refactor] Move checkpoint saving into trainer (#4034 )	4 年前
Jonathan Harper	80127232	Convert checkpoints to .nn format Fixed style Fixed more style Nit changes Fixed signature Convert checkpoints to .nn format Fixed style Nit changes Fixed tests, checkpoint management and style Check checkpoint management Modify statement on artifacts Nit changes Fixed signature Nit changes Fixed signature Fixed tests, checkpoint management and style Check checkpoint management Modify statement on artifacts	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	93517833	[feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305 )	4 年前
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138 )	4 年前
Ruo-Ping Dong	e06812aa	fix tests	4 年前
Ruo-Ping Dong	d3eb6c46	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 年前
Ruo-Ping Dong	95858e25	update saver interface and add tests	4 年前
GitHub	25dc8c3d	Add Saver Class to handle all save/load/checkpoint/export work (#4323 )	4 年前
Ervin Teng	d65a9326	Merge branch 'master' into develop-add-fire-mm3	4 年前
Ruo-Ping Dong	d57aa9ab	Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint	4 年前
Ruo-Ping Dong	c47ffc20	Rename saver	4 年前
Ruo-Ping Dong	09c22679	fix NNCheckpointManager for Torch	4 年前
Ruo-Ping Dong	e60c7038	Merge branch 'master' into develop-saver-name	4 年前
GitHub	badca342	Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540 )	4 年前
GitHub	cb8e4d25	Add ActionSpec (#4586 ) Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	590adc01	make_fake_trajectory/step take ActionSpec arg	4 年前
Andrew Cohen	0e28dd8f	add static method to create continuous/discrete	4 年前
vincentpierre	b863af57	Removing TensorFlow Trainers	4 年前
vincentpierre	719c969c	addressing comments. ObservationSpec is no longer a list	4 年前
vincentpierre	4bba4e8e	Renaming ObservationSpec to SensorSpec	4 年前
vincentpierre	c5a057d2	renaming obs_spec variables	4 年前
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 年前
GitHub	8a40c58a	Added SUM as aggregation type for custom statistics (#4816 )	4 年前
GitHub	14129a08	[MLA-470] Barracuda + TF cleanup (#4837 ) * remove barracuda conversion, tensorflow cleanup * unused var	4 年前
Arthur Juliani	0b4b0992	Rename more files	4 年前
Arthur Juliani	7c37c759	Fix some mis-renamings	4 年前
Arthur Juliani	e3de0406	Plurals	4 年前
GitHub	67ad9651	Merge pull request #4825 from Unity-Technologies/sensor-types [WIP] Observation Types	4 年前
GitHub	8387e252	[release] Fix rl trainer warning (#5144 ) * Fix rl trainer warning * Fix typo	4 年前
Ervin Teng	d1c24251	[bug-fix] When agent isn't training, don't clear update buffer (#5205 ) * Don't clear update buffer, but don't append to it either * Update changelog * Address comments * Make experience replay buffer saving more verbose (cherry picked from commit 63e7ad44d96b7663b91f005ca1d88f4f3b11dd2a)	4 年前
GitHub	28eb43dd	[bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271 ) * Manage non-ONNX files with checkpoint manager too * Update tests * Update training status version * Change ticking of status file version	4 年前

1 2

63 次代码提交 (09b37fb2-6063-411c-8038-4cd572fc9d0b)