ml-agents

Author	SHA1	Message	Commit date
GitHub	e4177de0	[change] Organize trainer files a bit better (#3538)	5 years ago
Andrew Cohen	573b1f6d	Merge branch 'master' into soccer-fives	5 years ago
Anupam Bhatnagar	f4dbedcf	removed extraneous logging imports and loggers	5 years ago
GitHub	86141eee	Merge pull request #3560 from Unity-Technologies/new-logger Add timestamps to logs	5 years ago
Anupam Bhatnagar	e8e0078e	first commit	5 years ago
Anupam Bhatnagar	07b15ae7	[skip-ci] small refactors	5 years ago
GitHub	e3af96ca	Merge branch 'master' into develop-demo-load-seek	5 years ago
GitHub	873ba7fd	[bug-fix] Fix stats reporting for reward signals in SAC (#3606)	5 years ago
Andrew Cohen	b1cfa74d	Merge branch 'master' into develop-test-imitation	5 years ago
GitHub	ec278616	Hotfixes for Release 0.15.1 (#3698) * [bug-fix] Increase height of wall in CrawlerStatic (#3650) * [bug-fix] Improve performance for PPO with continuous actions (#3662) * Corrected a typo in a name of a function (#3670) OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document * Add Academy.AutomaticSteppingEnabled to migration (#3666) * Fix editor port in Dockerfile (#3674) * Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com> * Remove vis_encode_type from list of required (#3677) * Update changelog (#3678) * Shorten timeout duration for environment close (#3679) The timeout duration for closing an environment was set to the same duration as the timeout when waiting ...	5 years ago
Andrew Cohen	53bea15c	Merge branch 'master' into soccer-fives	5 years ago
Andrew Cohen	ac261e36	Merge branch 'master' into self-play-mutex	5 years ago
GitHub	29f82921	[bug-fix] Improve performance for PPO with continuous actions (#3662)	5 years ago
GitHub	de3fc4e8	Hotfix memory leak on Python (#3664) * Hotfix memory leak on Python * Fixing * Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done * [bug-fix] Make Python able to deal with 0-step episodes (#3671) * adding some comments Co-authored-by: Ervin T <ervin@unity3d.com>	5 years ago
Andrew Cohen	4c9ac553	Merge branch 'master' into self-play-mutex	5 years ago
Andrew Cohen	93d344ff	simple rl asymm ghost tests	5 years ago
GitHub	4ecd6ad3	Fix how we set logging levels (#3703) * cleanup logging * comments and cleanup * pylint, gym	5 years ago
Andrew Cohen	cd677346	Merge branch 'self-play-mutex' into soccer-2v1	5 years ago
Andrew Cohen	c7a34413	Merge branch 'self-play-mutex' into soccer-2v1	5 years ago
GitHub	bc1fdf07	[refactor] CLI changes (#3705)	5 years ago
Andrew Cohen	59b88be6	Merge branch 'master' into self-play-mutex	5 years ago
GitHub	9cbc3fa2	Asymmetric self-play (#3653)	5 years ago
Ervin Teng	06fa3d39	Merge branch 'master' into develop-sac-apex	5 years ago
Anupam Bhatnagar	50e52d9c	Merge branch 'master' into distributed-training	5 years ago
GitHub	d7ca6b8d	[feature] Add --initialize-from option (#3710)	5 years ago
Andrew Cohen	1b9c643b	Merge branch 'master' into self-play-mutex	5 years ago
Anupam Bhatnagar	001fce2a	first commit	5 years ago
GitHub	43f23ee3	WIP : Changes to the LL-API - Refactor of “done” logic (#3681) * [skip ci] WIP : Modify the base_env.py file * [skip ci] typo * [skip ci] renamed some methods * [skip ci] Incorporated changes from our meeting * [skip ci] everything is broken * [skip ci] everything is broken * [skip ci] formatting * Fixing the gym tests * Fixing bug, C# has an error that needs fixing * Fixing the test * relaxing the threshold of 0.99 to 0.9 * fixing the C# side * formating * Fixed the llapi integratio test * [Increasing steps for testing] * Fixing the python tests * Need __contains__ after all * changing the max_steps in the tests * addressing comments * Making env_manager logic clearer as proposed in the comments * Remove duplicated logic and added back in episode length (#3728) * removing mentions of multi-agent in gym and changed the docstring in base_env.py * Edited the Documentation for the changes to the LLAPI (#3733) * Edite...	5 years ago
Anupam Bhatnagar	9341f7a2	[skip-ci] small refactors	5 years ago
Andrew Cohen	7a7eb324	Merge branch 'master' into internal-policy-ghost	5 years ago
Andrew Cohen	930d6fa3	Merge branch 'self-play-mutex' into soccer-2v1	5 years ago
GitHub	aae58330	Merge branch 'master' into develop-add-inference-examples	5 years ago
Andrew Cohen	b0c506a6	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	933d7b32	Merge branch 'self-play-mutex' into soccer-2v1	5 years ago
Ervin Teng	5e980ec1	Merge branch 'master' into develop-sac-apex	5 years ago
Andrew Cohen	f41695b9	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Arthur Juliani	dc50162d	Add ResNet and distributions	5 years ago
Andrew Cohen	80469267	Merge branch 'internal-policy-ghost' into soccer-2v1	5 years ago
Andrew Cohen	4468280a	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Andrew Cohen	9bec75ee	Merge branch 'master' into soccer-2v1	5 years ago
Arthur Juliani	e166d018	Dynamically construct actor and critic	5 years ago
Arthur Juliani	6879bae4	Initial optimizer port	5 years ago
Arthur Juliani	7c3bd376	Refactoring policy and optimizer	5 years ago
Arthur Juliani	2e51260a	Resolving a few bugs	5 years ago
Andrew Cohen	185d4b35	Merge branch 'soccer-2v1' into asymm-envs	5 years ago
Arthur Juliani	b997f214	Share more code between tf and torch policies	5 years ago
Arthur Juliani	947f0d32	Slightly closer to running model	5 years ago
Arthur Juliani	3c82bf59	Training runs, but doesn’t actually work	5 years ago
Arthur Juliani	8c6f4696	Fix a couple additional bugs	5 years ago
Arthur Juliani	4a50444f	Support discrete actions as well	5 years ago
Arthur Juliani	a11a79e4	Continuous and discrete now train	5 years ago
GitHub	232519e4	[refactor] Move output artifacts to a single results/ folder (#3829)	5 years ago
Arthur Juliani	82688e5c	GRU in-progress and dynamic cnns	5 years ago
Arthur Juliani	1736559f	Combine actor and critic classes. Initial export.	5 years ago
Arthur Juliani	ca887743	Support tf and pytorch alongside one another	5 years ago
Arthur Juliani	9835d26c	Prepare model for onnx export	5 years ago
Chris Elion	68b68396	Merge remote-tracking branch 'origin/master' into release_1_to_master	5 years ago
GitHub	d2bc86c8	Release 2 cherry pick (#3971) * [bug-fix] Fix issue with initialize not resetting step count (#3962) * Develop better error message for #3953 (#3963) * Making the error for wrong number of agents raise consistently * Better error message for inputs of wrong dimensions * Fix #3932, stop the editor from going into a loop when a prefab is selected. (#3949) * Minor doc updates to release * add unit tests and fix exceptions (#3930) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com>	5 years ago
vincentpierre	c34dd5b6	Merge branch 'master' into develop-gym-wrapper	5 years ago
Arthur Juliani	89ad3020	Merge remote-tracking branch 'origin/master' into develop-add-fire # Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py	5 years ago
Arthur Juliani	be7e55e1	Use LSTM and fix a few merge errors	5 years ago
Arthur Juliani	b7be7f04	Fix bug in probs calculation	5 years ago
Arthur Juliani	3eef9d78	Optimize np -> tensor operations	5 years ago
Arthur Juliani	c02e75d6	Time action sample function	5 years ago
Arthur Juliani	039f545a	Small performance improvement during inference	5 years ago
Ervin Teng	565f92ef	Seems to speed it up	5 years ago
Christopher Goy	ba80b292	format files with pre-commit.	4 years ago
Ervin Teng	2fae31e6	Remove another if statement	5 years ago
GitHub	f7373172	Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 update verified brach with barracuda 1.0.2	4 years ago
Ervin Teng	72180f9b	Experiment with JIT compiler	5 years ago
GitHub	abbc6424	[bug-fix] Fix issue with initialize not resetting step count (#3962)	5 years ago
vincentpierre	6ddfe74f	Merge branch 'master' into develop-gym-wrapper	5 years ago
Arthur Juliani	28e095e0	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 years ago
Ervin Teng	f214836a	Changes for speed test	4 years ago
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936)	5 years ago
GitHub	335cff3e	[versioning] Save ML-Agents version in checkpoints and check on load (#4035)	5 years ago
Andrew Cohen	e7750fc9	Merge branch 'master' into develop-sampler-refactor	5 years ago
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 years ago
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	8a49e8e0	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087)	4 years ago
Anupam Bhatnagar	4afd8f92	first commit	4 years ago
Arthur Juliani	9724c9ac	Merge master	4 years ago
Arthur Juliani	46874cc7	ONNX exporting	4 years ago
yanchaosun	c2d6f5c0	basic implementation	4 years ago
yanchaosun	a9c788d7	new model	4 years ago
Arthur Juliani	5d33aca7	Remove double setting	4 years ago
GitHub	0d80d87a	Fix for discrete actions (#4181)	4 years ago
Anupam Bhatnagar	24d5f881	first commit	4 years ago
Ervin Teng	510583d2	Move memory validation to settings	4 years ago
yanchaosun	ac4c80c2	integrate the implementation and hyperparameters	4 years ago
Ervin Teng	68169434	Fix discrete actions and GridWorld	4 years ago
GitHub	cde8bd29	Convert List[np.ndarray] to np.ndarray before using torch.as_tensor (#4183) Big speedup in visual obs	4 years ago
GitHub	bb675bf4	Merge pull request #4134 from Unity-Technologies/develop-removebrainnamepolicy [refactor] Remove references to brain_name in policy	4 years ago
yanchaosun	f0881a94	fix commands for cloud training	4 years ago
GitHub	05a11c96	Develop add fire exp framework (#4213) * Experiment branch for comparing torch * Updates and merging ervin changes * improvements on experiment_torch.py * Better printing of results * preliminary gpu experiment * Testing gpu * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * _ * _ * _ * _ * _ * _ * _ * _ * Attempt at gpu on tf. Does not work * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * Fixing learn.py	4 years ago
yanchaosun	05a96355	remove slim package	4 years ago
GitHub	45154f52	Pytorch port of SAC (#4219)	4 years ago
yanchaosun	ad95032b	transfer path	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
GitHub	69579611	[refactor] Refactor Actor and Critic classes (#4287)	4 years ago
Ruo-Ping Dong	6feec58a	add Saver class (only TF working)	4 years ago
Ervin Teng	bd97532d	Add normalizer update context	4 years ago
Ruo-Ping Dong	9449d711	fix onnx save path and output_name	4 years ago
yanchaosun	a80915a8	yaml update	4 years ago
Andrew Cohen	f74d301a	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 years ago
Ruo-Ping Dong	6d67f857	move tf and add torch model serialization	4 years ago
yanchaosun	666c8ba9	new cloud training change	4 years ago
Ruo-Ping Dong	01e60921	add sac checkpoint	4 years ago
vincentpierre	599d7e9f	Merging master	4 years ago
yanchaosun	5eccb4c9	new transfer test for cloud	4 years ago
HH	7afa1761	Merge branch 'master' into hh/develop/ragdoll-updates	4 years ago
yanchaosun	858b97ec	bug fix	4 years ago
Ruo-Ping Dong	4e87b422	move checkpoint_path logic to saver	4 years ago
yanchaosun	da87eae9	predict model fix	4 years ago
Ervin Teng	884c97ce	Fix policy memory storinig	4 years ago
yanchaosun	d1e8d344	with swish activation	4 years ago
Ruo-Ping Dong	71fe4df6	fix formatting and test	4 years ago
Ruo-Ping Dong	b4713baa	small improvements	4 years ago
Ruo-Ping Dong	79d89158	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 years ago
GitHub	3bcb029b	[refactor] Remove BrainParameters from Python code (#4138)	4 years ago
Ruo-Ping Dong	e06812aa	fix tests	4 years ago
GitHub	839eb2cb	Develop model transfer test (#4214) * test env, and code integration * delete results	4 years ago
yanchaosun	7e3216ae	simple env test	4 years ago
yanchaosun	cdaaa318	bisim	4 years ago
yanchaosun	3d0d359c	bisimulation draft	4 years ago
yanchaosun	1fdbfe65	no normalization	4 years ago
yanchaosun	5a778ca3	fix normalization	4 years ago
yanchaosun	a212fef9	new bisim implementation	4 years ago
HH	0fdac847	Merge branch 'master' into hh/develop/crawler-ragdoll-updates	4 years ago
yanchaosun	0e2f6e19	small fix	4 years ago
yanchaosun	ec929746	minor update	4 years ago
GitHub	84440f05	Convert checkpoints to .NN (#4127) This change adds an export to .nn for each checkpoint generated by RLTrainer and adds a NNCheckpointManager to track the generated checkpoints and final model in training_status.json. Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>	4 years ago
Andrew Cohen	d0133066	working	4 years ago
yanchaosun	9bc90956	fix bug with bisimulation	4 years ago
Andrew Cohen	b6bf1860	fix bisim metric	4 years ago
yanchaosun	f8b91faa	try to fix the bisim metric	4 years ago
yanchaosun	ce36349b	some changes	4 years ago
Andrew Cohen	1b17ae56	add tanh activ	4 years ago
yanchaosun	7508a130	small fix	4 years ago
Arthur Juliani	6bee0fd1	Merge master	4 years ago
yanchaosun	caeffa3e	add two envs	4 years ago
Andrew Cohen	5fa28f5f	merge YC changes	4 years ago
yanchaosun	28355444	bisim fix, disable stop gradient	4 years ago
yanchaosun	3246570c	added action encoder, and flags related with action training/transferring; set model_schedule as a changable hyperparameter	4 years ago
GitHub	9f041970	Develop bisim action encoder, incorporate related hyperparameter settings (#4253)	4 years ago
GitHub	1f5eb9da	add pyupgrade to pre-commit and run (#4239)	4 years ago
GitHub	129f9ddc	[MLA-427] make pyupgrade convert f-strings too (#4244) * make pyupgrade convert f-strings too	4 years ago
yanchaosun	80bad241	init sac transfer, and added action encoder to bisim; configs for crawler	4 years ago
yanchaosun	a505cb16	new config	4 years ago
yanchaosun	b991096b	update target encoder soft copy	4 years ago
Andrew Cohen	d8c123a0	Merge branch 'master' into sensitivity	4 years ago
Andrew Cohen	02df39ab	ignore precommit	4 years ago
Andrew Cohen	0c7db26a	target encoder	4 years ago
yanchaosun	b74294bf	target encoders and new forward loss	4 years ago
Andrew Cohen	fa35292c	write hist to tb	4 years ago
yanchaosun	0c468084	sac transfer implementation; disable action encoder	4 years ago
GitHub	1b098c9a	Refactor TFPolicy and Policy (#4254) * Refactor TFPolicy and Policy	4 years ago
yanchaosun	0a1a30d3	sac update	4 years ago
GitHub	380fef57	[refactor] Move TF-specific files to tf/ folder (#4266)	4 years ago
GitHub	beb5aca5	[refactor] Make classes except Optimizer framework agnostic (#4268)	4 years ago
yanchaosun	00bb821c	fix sac transfer problems	4 years ago
Andrew Cohen	302e8e77	no action encoder	4 years ago
Andrew Cohen	06e4356c	Merge branch 'master' into sensitivity	4 years ago
yanchaosun	2b67d1a6	fix crawler config	4 years ago
Andrew Cohen	9d7ed6cc	op buffer	4 years ago
Andrew Cohen	9c012d6a	no op buffer no acen	4 years ago
Arthur Juliani	1a123641	Merge remote-tracking branch 'origin/master' into r5-master	4 years ago
Andrew Cohen	2dec257c	no encoder for single task	4 years ago
Andrew Cohen	18ff42a6	use mean of first trajectory to initialize the normalizer	4 years ago
yanchaosun	6df774ed	update: separate model train as an option	4 years ago
Andrew Cohen	5878b952	remove blank line	4 years ago
Andrew Cohen	ce9bcefe	cleaned up initialization of variance/mean	4 years ago
Ruo-Ping Dong	95858e25	update saver interface and add tests	4 years ago
GitHub	9dc1d99e	Initialize normalizer with mean/variance from first trajectory (#4299) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
GitHub	ab8e5afa	Release 6 fix nan (#4343) * test initalize steps to 100 * use mean of first trajectory to initialize the normalizer * remove blank line * update changelog * cleaned up initialization of variance/mean * large normalization obs unit test * add --upgrade to pip to get newer downloader (#4338) * Fix format of the changelog for validation. (#4340) Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Chris Goy <christopherg@unity3d.com>	4 years ago
GitHub	f374f87a	[add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324)	4 years ago
Anupam Bhatnagar	87bdf353	[skip ci] save model on worker zero only	4 years ago
Anupam Bhatnagar	d3e8f124	removing horovod from tf policy	4 years ago
Anupam Bhatnagar	abc1220f	Merge branch 'master' into global-variables	4 years ago
Andrew Cohen	2cd0de04	action enc	4 years ago
Chris Elion	d2133d83	comments and cleanup	4 years ago
HH	8eaddb61	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 years ago
Andrew Cohen	12f3786c	Revert "action enc" This reverts commit 9acb5f846a77a47d14694cb1271e2a370f395b6a.	4 years ago
yanchaosun	3ce88589	1 layer everything	4 years ago
Ruo-Ping Dong	59cc1a9f	Merge branch 'develop-add-fire' into develop-add-fire-checkpoint	4 years ago
GitHub	76deba36	Merge pull request #4334 from Unity-Technologies/global-variables Adding rank to ml-agents	4 years ago
GitHub	25dc8c3d	Add Saver Class to handle all save/load/checkpoint/export work (#4323)	4 years ago
Ervin Teng	cb0085a7	Memory size abstraction and fixes	4 years ago
Ervin Teng	d65a9326	Merge branch 'master' into develop-add-fire-mm3	4 years ago
Ruo-Ping Dong	d57aa9ab	Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint	4 years ago
Andrew Cohen	463db9e8	backprop enc single task	4 years ago
GitHub	bd6bcd2f	Merge master and add Saver class for save/load checkpoints	4 years ago
Andrew Cohen	12eda929	try reload all	4 years ago
Ervin Teng	42e25b25	Merge branch 'develop-add-fire' into develop-add-fire-memoryclass	4 years ago
GitHub	8985a040	Removing the experiment script from add fire (#4373) * Removing the experiment script * Removing the script	4 years ago
Andrew Cohen	b822283f	merge add fire	4 years ago
Ervin Teng	6e946dba	Policy bugfixes and policy tests	4 years ago
Ervin Teng	9ae22c61	Fix SeparateActorCritic export	4 years ago
Christopher Goy	5a233353	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 years ago
GitHub	03eac72c	[add-fire] Add tests and fix issues with Policy (#4372)	4 years ago
Andrew Cohen	a65d08c7	ghost trainer tests	4 years ago
Ervin Teng	116303f1	Typing for torch policy	4 years ago
GitHub	49545ce1	Pytorch ghost trainer (#4370)	4 years ago
Andrew Cohen	71f9c241	fix tf policy for ghosts	4 years ago
GitHub	6a1d993f	[add-fire] Memory class abstraction (#4375)	4 years ago
GitHub	2332bc32	Add fire to test_simple_rl.py (#4378) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ervin T <ervin@unity3d.com>	4 years ago
yanchaosun	3762358d	fix action stop gradient	4 years ago
vincentpierre	349cee77	Fix of the test for multi visual input	4 years ago
Andrew Cohen	20083987	move tf policy comment	4 years ago
Ervin Teng	a04e68a4	Merge branch 'develop-add-fire' into develop-add-fire-memoryclass	4 years ago
Andrew Cohen	effdec13	return copy of state_dict	4 years ago
yanchaosun	3ed56471	remove bi-forward-loss	4 years ago
Ervin Teng	8ff8c401	Merge branch 'develop-add-fire' into develop-add-fire-export	4 years ago
yanchaosun	c5d9e376	add bi-forward-loss back	4 years ago
GitHub	1955af9e	[feature] Add experimental PyTorch support (#4335) * Begin porting work * Add ResNet and distributions * Dynamically construct actor and critic * Initial optimizer port * Refactoring policy and optimizer * Resolving a few bugs * Share more code between tf and torch policies * Slightly closer to running model * Training runs, but doesn’t actually work * Fix a couple additional bugs * Add conditional sigma for distribution * Fix normalization * Support discrete actions as well * Continuous and discrete now train * Mulkti-discrete now working * Visual observations now train as well * GRU in-progress and dynamic cnns * Fix for memories * Remove unused arg * Combine actor and critic classes. Initial export. * Support tf and pytorch alongside one another * Prepare model for onnx export * Use LSTM and fix a few merge errors * Fix bug in probs calculation * Optimize np -> tensor operations * Time action sample funct...	4 years ago
yanchaosun	2e927257	separate policy net	4 years ago
vincentpierre	108fac9a	Replace torch.detach().cpu().numpy() with a utils method	4 years ago
HH	d9962254	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 years ago
Anupam Bhatnagar	5e8aa485	renaming file from globals.py to global_values.py	4 years ago
Anupam Bhatnagar	71c301bc	minor fixes	4 years ago
Anupam Bhatnagar	1f60979f	[skip ci] change self.rank to global_values.get_rank()	4 years ago
Anupam Bhatnagar	f4f1a8d9	merge master into trainer-plugin branch	4 years ago
yanchaosun	1ce53c55	discrete action	4 years ago
GitHub	498934f9	Replace torch.detach().cpu().numpy() with a utils method (#4406) * Replace torch.detach().cpu().numpy() with a utils method * Using item() in place of to_numpy() * more use of item() and additional tests	4 years ago
Ruo-Ping Dong	27fb4270	brain_name to behavior_name	4 years ago
GitHub	bfda9576	Replace brain_name with behavior_name (#4419) brain_name -> behavior_name some prob -> log_prob in comments rename files optimizer -> optimizer_tf for tensorflow	4 years ago
Ruo-Ping Dong	fd1dc3a6	Merge branch 'master' into develop-torch-omp	4 years ago
Ruo-Ping Dong	f5dee9d1	jit for continuous control	4 years ago
GitHub	6f534366	Add torch_utils class, auto-detect CUDA availability (#4403) * Add torch_utils * Use torch from torch_utils * Add torch to banned modules in CI * Better import error handling * Fix flake8 errors * Address comments * Move networks to GPU if enabled * Switch to torch_utils * More flake8 problems * Move reward providers to GPU/CPU * Remove anothere set default tensor * Fix banned import in test	4 years ago
Ervin Teng	fdc887a1	Some experimental stuff	4 years ago
Ervin Teng	f59f35ea	Remove stuff in policy	4 years ago
Andrew Cohen	3997b14b	Merge branch 'master' into develop-hybrid-actions	4 years ago
Ervin Teng	3e771cbb	Permute visual obs outside of network	4 years ago
Ervin Teng	77c810fb	Fix SAC and make utility method	4 years ago
Ervin Teng	7754ad7b	Don't run value during inference	4 years ago
Ervin Teng	b6095151	Execute critic with LSTM	4 years ago
GitHub	4e4ad7b0	Don't run value during policy evaluate, optimized soft update function (#4501) * Don't run value during inference * Execute critic with LSTM * Address comments * Unformat * Optimized soft update * Move soft update to model utils * Add test for soft update	4 years ago
Ervin Teng	f9ff3efe	Merge branch 'develop-policyonly' into develop-sac-targetq	4 years ago
Andrew Cohen	643c8e58	ppo extended	4 years ago
Andrew Cohen	db37db34	fixing errors	4 years ago
Andrew Cohen	44c9879e	action models	4 years ago
Andrew Cohen	c494bfcc	trains successfully	4 years ago
GitHub	badca342	Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540)	4 years ago
Andrew Cohen	190d8e4d	action model as a singleton	4 years ago
Ervin Teng	8dec4771	Add hybrid actions to SAC	4 years ago
Ervin Teng	be159ad3	Make entropy reporting same as TF	4 years ago
Ervin Teng	81342148	Revert "Add hybrid actions to SAC" This reverts commit a759b36a51df4f8f1fd296f9f148269f0f026e42.	4 years ago
Andrew Cohen	fc3027ac	tf tests except gail pass	4 years ago
Andrew Cohen	e5f14400	Merge branch 'master' into develop-hybrid-actions-singleton	4 years ago
GitHub	e4db5dc5	ActionSpec and ActionBuffer (#4578)	4 years ago
Andrew Cohen	7827ca06	add ActionSpec; test_simple_rl torch passes	4 years ago
Andrew Cohen	eaecb59e	torch utils to and from buffer	4 years ago
Andrew Cohen	8013e544	ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member)	4 years ago
Andrew Cohen	f654df34	fixing tensorflow tests	4 years ago
GitHub	e0ef30a5	[bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538) * Proper dimensions for entropy, sum before bonus in PPO * Make entropy reporting same as TF * Always use separate critic * Revert to shared * Remove unneeded extra line * Change entropy shape in test * Change another entropy shape * Add entropy summing to evaluate_actions * Add notes about torch.abs(policy_loss)	4 years ago
GitHub	cb8e4d25	Add ActionSpec (#4586) Co-authored-by: Ervin T <ervin@unity3d.com>	4 years ago
Andrew Cohen	9689cf2c	remove _action_ from function names	4 years ago
vincentpierre	a3a9a56b	Merge branch 'exp-multi-head-attention' into exp-bullet-hell	4 years ago
Ruo-Ping Dong	9e08be87	Merge branch 'master' into release_9_branch_merge	4 years ago
Andrew Cohen	6cf54bf2	remove self.action_spec from policy/bc	4 years ago
GitHub	b853e5ba	Action buffer (#4612) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	3c96a3a2	Action Model (#4580) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704)	4 years ago
GitHub	87a7ccf8	use int64 steps, check for NaN actions (#4607) * use int64 steps * check for NaN actions Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>	4 years ago
GitHub	23800f33	Merge branch 'master' into develop-action-spec	4 years ago
GitHub	85a7c0f7	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649)	4 years ago
Ervin Teng	0548057d	Use real clipping (as in TF)	4 years ago
GitHub	733bffbf	use int64 steps, check for NaN actions (#4607) (#4654) * use int64 steps * check for NaN actions Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
Andrew Cohen	b6d10456	removed action_spec.size	4 years ago
GitHub	8175d558	[bug-fix] Fix BC module + action clipping (#4667)	4 years ago
GitHub	2a8c6800	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) (#4662)	4 years ago
Ruo-Ping Dong	953cb6bb	Merge branch 'master' into develop-windows-delay	4 years ago
Andrew Cohen	1bbe492c	fixed tests/ -> single validate_action func	4 years ago
Ruo-Ping Dong	ee5313e4	Merge branch 'master' into develop-windows-delay	4 years ago
Ervin Teng	78f88c15	Add clip to export and make optional in policy	4 years ago
GitHub	f0ed3a38	Cherry-pick BC fixes to Release 10 (#4668)	4 years ago
Andrew Cohen	3f771e61	add ActionBuffers and utils	4 years ago
Ervin Teng	7a0ebfbd	Pretty broken	4 years ago
Ervin Teng	95bdbba3	Less broken PPO	4 years ago
vincentpierre	b863af57	Removing TensorFlow Trainers	4 years ago
Ervin Teng	98948c59	Skip critic when given empty memory array	4 years ago
Ervin Teng	4158629e	Properly feed in None rather than empty arrays	4 years ago
Andrew Cohen	d8dd5e44	Merge branch 'master' into develop-action-buffer	4 years ago
Andrew Cohen	bd917c9c	action buffer passes continuous	4 years ago
Andrew Cohen	b36fcf16	discrete runs/cont passes	4 years ago
Andrew Cohen	ad951493	debugging discrete	4 years ago
Andrew Cohen	fcf6471e	2d discrete passes	4 years ago
Ervin Teng	fdaa8c3d	Merge branch 'develop-unified-obs' into develop-centralizedcritic	4 years ago
GitHub	990f801a	Develop hybrid action staging (#4702) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
vincentpierre	735fcd52	[WIP] Refactor trainers to use list of obs rather than vec and vis obs	4 years ago
Ervin Teng	6846af21	Multi-input network	4 years ago
Andrew Cohen	85e4db33	bc tests pass	4 years ago
vincentpierre	93ca1409	fixing the tests	4 years ago
vincentpierre	7a5cc9ec	Merge master into develop-rm-tf	4 years ago
Ervin Teng	cb4b7ed3	Some minor tweaks but still broken	4 years ago
vincentpierre	12619155	added some docstrings	4 years ago
vincentpierre	c1587bce	Solving merge conflicts	4 years ago
Andrew Cohen	8172b3d6	test_simple_rl/reward providers pass tf/torch	4 years ago
Andrew Cohen	4ebc6c44	ml-agents-envs pass	4 years ago
GitHub	8ab2e619	update type of evaluate_actions to list tensor (#4747)	4 years ago
Arthur Juliani	0d2f8887	Merge remote-tracking branch 'origin/master' into goal-conditioning # Conflicts: # ml-agents-envs/mlagents_envs/base_env.py # ml-agents-envs/mlagents_envs/rpc_utils.py # ml-agents/mlagents/trainers/tests/mock_brain.py # ml-agents/mlagents/trainers/tests/simple_test_envs.py	4 years ago
GitHub	a0d1c829	Action Docs part2 (#4739) * reduce usage of "vector action" and "action space" * more cleanup * undo GettingStarted change for now * batch size description * Apply suggestions from code review Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 years ago
GitHub	cc6b4564	Multi Directional Walker and Initial Hypernetwork (#4740)	4 years ago
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 years ago
vincentpierre	bc9d3975	merge master	4 years ago
Andrew Cohen	cd73cce2	test_trajectory fixed	4 years ago
GitHub	ad5f878c	[refactor] Remove critic pass during inference (#4743)	4 years ago
GitHub	11687f8d	[cherry-pick] Cherry-pick #4743 into Release 11 (#4756)	4 years ago
GitHub	22658a40	use sensor types to differentiate obs (#4749)	4 years ago
GitHub	903d3afe	Merge pull request #4707 from Unity-Technologies/develop-rm-tf Removing TensorFlow Trainers	4 years ago
vincentpierre	14378aa5	Merging master	4 years ago
Andrew Cohen	97d94a83	fix test_tf_policy	4 years ago
vincentpierre	0c81006d	addressing comments	4 years ago
vincentpierre	8cb050ef	WIP Made initial changes to enale dimension properties and added attention module	4 years ago
Andrew Cohen	498b1ee6	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 years ago
GitHub	c2783e70	remove "discrete nan" check (#4770) (#4775)	4 years ago
GitHub	525a4308	remove "discrete nan" check (#4770)	4 years ago
Andrew Cohen	6174c428	move action model to explicit distributions	4 years ago
vincentpierre	c3699de8	merging master and addressing comments	4 years ago
Andrew Cohen	453a2bba	ActionTuple default is now np.array, not None	4 years ago
GitHub	29d94c7c	Merge pull request #4734 from Unity-Technologies/develop-obs-as-list Refactor trainers to use list of obs rather than vec and vis obs	4 years ago
Andrew Cohen	1d234d1d	bc works	4 years ago
vincentpierre	719c969c	addressing comments. ObservationSpec is no longer a list	4 years ago
vincentpierre	4bba4e8e	Renaming ObservationSpec to SensorSpec	4 years ago
Andrew Cohen	c0d01baf	Merge branch 'master' into merge-release11-master	4 years ago
Andrew Cohen	95566e44	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 years ago
vincentpierre	c5a057d2	renaming obs_spec variables	4 years ago
Andrew Cohen	e81e68de	comms agent and fixed hallway	4 years ago
vincentpierre	44ed3258	Merging master	4 years ago
Andrew Cohen	3457cd3c	save only discrete actions as prev	4 years ago
Andrew Cohen	da5b8420	update make_empty docstring	4 years ago
Andrew Cohen	411b0a19	reuse action dict in torch policy for pre_action	4 years ago
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 years ago
Andrew Cohen	35769b53	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 years ago
Andrew Cohen	17496265	move AgentAction, ActionLogProbs, and ActionFlattener to separate files	4 years ago
Chris Elion	76ebc20c	Merge remote-tracking branch 'origin/master' into r12-to-master	4 years ago
GitHub	458fee17	Merge pull request #4763 from Unity-Technologies/develop-att WIP Made initial changes to enable dimension properties and added attention module	4 years ago
vincentpierre	36cc4665	Removing some vis and vec fields from policy.py	4 years ago
GitHub	9d8a7d6f	Update ml-agents/mlagents/trainers/policy/tf_policy.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
Ervin Teng	330fc1d0	Merge branch 'master' into develop-centralizedcritic-mm	4 years ago
Andrew Cohen	60309d8f	fix torch policy tests	4 years ago
vincentpierre	519c5f47	merging master	4 years ago
Ruo-Ping Dong	8ed14762	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 years ago
Andrew Cohen	7ba10239	remove action spec attribute from policy	4 years ago
vincentpierre	318622a6	removing the extra code to set normalize to false if no observation can be normalized	4 years ago
Andrew Cohen	662fd6b1	added docstrings to action flattener	4 years ago
GitHub	a272bff0	Merge pull request #4803 from Unity-Technologies/develop-remove-vec-vis-fields Remove some vis and vec fields from Policy	4 years ago
GitHub	cc948a41	Policy output actiontuple (#4651)	4 years ago
GitHub	7387a77f	remove pylint (#4836) * remove pylint * remove other pylint disables	4 years ago
Andrew Cohen	886883b3	Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton	4 years ago
Arthur Juliani	0b4b0992	Rename more files	4 years ago
Ervin Teng	aba633b2	Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm	4 years ago
Andrew Cohen	85b18389	fix test tf policy	4 years ago
Ruo-Ping Dong	a7d04be6	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 years ago
Arthur Juliani	0a876b9c	Fix typos	4 years ago
Arthur Juliani	e3de0406	Plurals	4 years ago
Ruo-Ping Dong	180d3e20	Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager	4 years ago
HH	0024a286	merge ervin's new stuff	4 years ago
GitHub	67ad9651	Merge pull request #4825 from Unity-Technologies/sensor-types [WIP] Observation Types	4 years ago
vincentpierre	8660b1c2	merging master	4 years ago
Ervin Teng	457b2630	I think it's running	4 years ago
brccabral	457fb612	Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents	4 years ago
Andrew Cohen	6e1826f8	might be right	4 years ago
vincentpierre	52b011d6	_	4 years ago
vincentpierre	6f3ea7b8	_	4 years ago
Andrew Cohen	a4c336c2	value estimator	4 years ago
Andrew Cohen	9af22d30	use only value funcs	4 years ago
Ervin Teng	3283b6a1	Remove Q-net for perf	4 years ago
Ervin Teng	b6f88d6d	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 years ago
Andrew Cohen	f73b9dba	update policy to not use critic	4 years ago
Andrew Cohen	9b92f5fb	remove commented code	4 years ago
Ervin Teng	0bde7598	Back out trainer changes	4 years ago
Ervin Teng	514873bf	Use correct memories (t-1 instead of t) for training	4 years ago
Ervin Teng	24ee4bd5	Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer	4 years ago
Ervin Teng	7014404a	Add presence check	4 years ago
Andrew Cohen	c74dca9f	add SharedActorCritic	4 years ago
Ervin Teng	f3a2a81f	Merge branch 'develop-fix-lstms' into develop-gru	4 years ago
Andrew Cohen	00b891df	fix sac shared	4 years ago
Ervin Teng	219e773b	Merge branch 'develop-fix-lstms' into develop-critic-op-lstm	4 years ago
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 years ago
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 years ago
GitHub	338af2ec	Move the Critic into the Optimizer (#4939) Co-authored-by: Ervin Teng <ervin@unity3d.com>	4 years ago
HH	4c947151	Merge branch 'main' into hh/develop/dodgeball	4 years ago
Ervin Teng	61781a1a	Merge branch 'main' into develop-agentprocessor-teammanager	4 years ago
Andrew Cohen	9060da06	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer	4 years ago
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 years ago
GitHub	d36a5242	Python Dataflow for Group Manager (#4926) * Make buffer type-agnostic * Edit types of Apped method * Change comment * Collaborative walljump * Make collab env harder * Add group ID * Add collab obs to trajectory * Fix bug; add critic_obs to buffer * Set group ids for some envs * Pretty broken * Less broken PPO * Update SAC, fix PPO batching * Fix SAC interrupted condition and typing * Fix SAC interrupted again * Remove erroneous file * Fix multiple obs * Update curiosity reward provider * Update GAIL and BC * Multi-input network * Some minor tweaks but still broken * Get next critic observations into value estimate * Temporarily disable exporting * Use Vince's ONNX export code * Cleanup * Add walljump collab YAML * Lower max height * Update prefab * Update prefab * Collaborative Hallway * Set num teammates to 2 * Add config and group ids to HallwayCollab * Fix bug with hallway collab * E...	4 years ago
Ervin Teng	fd0dd35c	Merge branch 'main' into develop-coma2-trainer	4 years ago
Ervin Teng	c8137dcd	Merge branch 'main' into develop-superpush-int	4 years ago
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123)	4 years ago
Christopher Goy	921ba4f0	Update v2-staging from main (March 15) (#5123)	4 years ago
Christopher Goy	ebe45056	Merge branch 'main' into release_14_branch-to-main	4 years ago
Chris Elion	970f1d40	Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec	4 years ago
GitHub	2e19759c	Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) * turning some logger.info into logger.debug and remove some logging overhead when not using debug * Addressing comments * Adding to changelog	4 years ago
GitHub	28eb43dd	[bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271) * Manage non-ONNX files with checkpoint manager too * Update tests * Update training status version * Change ticking of status file version	4 years ago
Miguel Alonso Jr	4846cf0f	Merge branch 'main' into develop-api-documentation-update Updating with main.	4 years ago

396 commits (2677d314-546f-4cae-8cef-d6e1f2dd7f5a)