ml-agents

作者	SHA1	备注	提交日期
GitHub	0e0daf47	[add-fire] Merge post-0.19.0 master into add-fire (#4328 )	4 年前
vincentpierre	eb951ca0	fixing typo	4 年前
GitHub	6b193d03	Develop add fire layers (#4321 ) * Layer initialization + swish as a layer * integrating with the existing layers * fixing tests * setting the seed for a test * Using swish and fixing tests	4 年前
GitHub	36613cad	[add-fire] Fix CategoricalDistInstance test and replace `range` with `arange` (#4327 )	4 年前
GitHub	dba529ff	Fix discrete export (#4322 ) Fix discrete export	4 年前
GitHub	7ddfd81f	Added Reward Providers for Torch (#4280 ) * Added Reward Providers for Torch * Use NetworkBody to encode state in the reward providers * Integrating the reward prodiders with ppo and torch * work in progress, integration with PPO. Not training properly Pyramids at the moment * Integration in PPO * Removing duplicate file * Gail and Curiosity working * addressing comments * Enfore float32 for tests * enfore np.float32 in buffer	4 年前
GitHub	3a982317	[add-fire] Add learning rate and beta/epsilon decay to PyTorch (#4318 )	4 年前
GitHub	69d29b86	[add-fire] Halve Gaussian entropy (#4319 ) * Halve entropy * Fix utils test	4 年前
GitHub	38ce37c9	Add components directory and init (#4320 )	4 年前
GitHub	b4749b31	Test fixes on add-fire (#4317 )	4 年前
GitHub	93517833	[feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305 )	4 年前
GitHub	5bcbef8d	[tests] Add tests for core PyTorch files (#4292 )	4 年前
GitHub	d8db1477	[bug-fix] Fix error with discrete probs (#4309 )	4 年前
GitHub	17f03980	[bug-fix] Fix non-LSTM SeparateActorCritic (#4306 )	4 年前
GitHub	69579611	[refactor] Refactor Actor and Critic classes (#4287 )	4 年前
Ruo-Ping Dong	9449d711	fix onnx save path and output_name	4 年前
GitHub	74c99ec8	[refactor] Refactor normalizers and encoders (#4275 ) * Refactor normalizers and encoders * Unify Critic and ValueNetwork * Rename ActionVectorEncoder * Update docstring of create_encoders * Add docstring to UnnormalizedInputEncoder	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	45154f52	Pytorch port of SAC (#4219 )	4 年前
vincentpierre	fd98cddd	reformating experiment_torch.py	4 年前
GitHub	05a11c96	Develop add fire exp framework (#4213 ) * Experiment branch for comparing torch * Updates and merging ervin changes * improvements on experiment_torch.py * Better printing of results * preliminary gpu experiment * Testing gpu * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * _ * _ * _ * _ * _ * _ * _ * _ * Attempt at gpu on tf. Does not work * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * Fixing learn.py	4 年前
GitHub	cde8bd29	Convert List[np.ndarray] to np.ndarray before using torch.as_tensor (#4183 ) Big speedup in visual obs	4 年前
Ervin Teng	0476c599	Remove print statement	4 年前
Ervin Teng	68169434	Fix discrete actions and GridWorld	4 年前
GitHub	0d80d87a	Fix for discrete actions (#4181 )	4 年前
Arthur Juliani	5d33aca7	Remove double setting	4 年前
Arthur Juliani	b6dfb4ac	Fix ResNet	4 年前
Arthur Juliani	6408fd4e	Fix bug in pdf function	4 年前
Arthur Juliani	e14eb72b	Fix some issues with pdf	4 年前
Arthur Juliani	46874cc7	ONNX exporting	5 年前
Arthur Juliani	9724c9ac	Merge master	5 年前
Arthur Juliani	28e095e0	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
Arthur Juliani	039f545a	Small performance improvement during inference	5 年前
Arthur Juliani	c02e75d6	Time action sample function	5 年前
Arthur Juliani	3eef9d78	Optimize np -> tensor operations	5 年前
Arthur Juliani	b7be7f04	Fix bug in probs calculation	5 年前
Arthur Juliani	2b3a6347	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
Arthur Juliani	be7e55e1	Use LSTM and fix a few merge errors	5 年前
Arthur Juliani	89ad3020	Merge remote-tracking branch 'origin/master' into develop-add-fire # Conflicts: # ml-agents/mlagents/trainers/policy/tf_policy.py	5 年前
Arthur Juliani	9835d26c	Prepare model for onnx export	5 年前
Arthur Juliani	ca887743	Support tf and pytorch alongside one another	5 年前
Arthur Juliani	1736559f	Combine actor and critic classes. Initial export.	5 年前
Arthur Juliani	596cc103	Remove unused arg	5 年前
Arthur Juliani	29223931	Fix for memories	5 年前
Arthur Juliani	82688e5c	GRU in-progress and dynamic cnns	5 年前
Arthur Juliani	212e2d1d	Merge remote-tracking branch 'origin/master' into develop-add-fire	5 年前
Arthur Juliani	5f936990	Visual observations now train as well	5 年前
Arthur Juliani	a5b5b109	Mulkti-discrete now working	5 年前
Arthur Juliani	a11a79e4	Continuous and discrete now train	5 年前
Arthur Juliani	4a50444f	Support discrete actions as well	5 年前

1 2 3

124 次代码提交 (0e0daf47-4d1c-46e0-b513-bc2dc4035220)