ml-agents

Author	SHA1	Message	Commit date
Arthur Juliani	dc50162d	Add ResNet and distributions	5 years ago
Arthur Juliani	7c3bd376	Refactoring policy and optimizer	5 years ago
Arthur Juliani	8c6f4696	Fix a couple additional bugs	5 years ago
Arthur Juliani	61d671d8	Add conditional sigma for distribution	5 years ago
Arthur Juliani	4a50444f	Support discrete actions as well	5 years ago
Arthur Juliani	a11a79e4	Continuous and discrete now train	5 years ago
Arthur Juliani	5f936990	Visual observations now train as well	5 years ago
Ervin Teng	2fae31e6	Remove another if statement	5 years ago
Ervin Teng	72180f9b	Experiment with JIT compiler	5 years ago
Arthur Juliani	46874cc7	ONNX exporting	4 years ago
Arthur Juliani	e14eb72b	Fix some issues with pdf	4 years ago
Arthur Juliani	6408fd4e	Fix bug in pdf function	4 years ago
GitHub	0d80d87a	Fix for discrete actions (#4181)	4 years ago
GitHub	45154f52	Pytorch port of SAC (#4219)	4 years ago
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 years ago
GitHub	69579611	[refactor] Refactor Actor and Critic classes (#4287)	4 years ago
GitHub	5bcbef8d	[tests] Add tests for core PyTorch files (#4292)	4 years ago
Ervin Teng	a172fb46	Halve entropy	4 years ago
GitHub	7ddfd81f	Added Reward Providers for Torch (#4280) * Added Reward Providers for Torch * Use NetworkBody to encode state in the reward providers * Integrating the reward prodiders with ppo and torch * work in progress, integration with PPO. Not training properly Pyramids at the moment * Integration in PPO * Removing duplicate file * Gail and Curiosity working * addressing comments * Enfore float32 for tests * enfore np.float32 in buffer	4 years ago
GitHub	dba529ff	Fix discrete export (#4322) Fix discrete export	4 years ago
GitHub	36613cad	[add-fire] Fix CategoricalDistInstance test and replace `range` with `arange` (#4327)	4 years ago
Ervin Teng	6b29a4c9	Fix test and replace range with arange	4 years ago
GitHub	6b193d03	Develop add fire layers (#4321) * Layer initialization + swish as a layer * integrating with the existing layers * fixing tests * setting the seed for a test * Using swish and fixing tests	4 years ago
Ruo-Ping Dong	f5dee9d1	jit for continuous control	4 years ago
Ruo-Ping Dong	ef3be79e	sac	4 years ago
GitHub	6f534366	Add torch_utils class, auto-detect CUDA availability (#4403) * Add torch_utils * Use torch from torch_utils * Add torch to banned modules in CI * Better import error handling * Fix flake8 errors * Address comments * Move networks to GPU if enabled * Switch to torch_utils * More flake8 problems * Move reward providers to GPU/CPU * Remove anothere set default tensor * Fix banned import in test	4 years ago
Andrew Cohen	8294bc59	ignore commit	4 years ago
Andrew Cohen	b0b2e22e	prepare to merge action_out	4 years ago
Andrew Cohen	85602279	add action_out to dist	4 years ago
Andrew Cohen	f0a2c6f2	add action_out abstract method and docstring	4 years ago
Andrew Cohen	7c0aa77b	Merge branch 'develop-actions-out' into develop-hybrid-actions	4 years ago
Andrew Cohen	b6b453a0	rename to exported model output	4 years ago
Andrew Cohen	9cbf6c3f	Merge branch 'develop-actions-out' into develop-hybrid-actions	4 years ago
Andrew Cohen	72cd0d39	separate tensors for disc/cont	4 years ago
Andrew Cohen	643c8e58	ppo extended	4 years ago
Andrew Cohen	db37db34	fixing errors	4 years ago
Andrew Cohen	44c9879e	action models	4 years ago
Ervin Teng	e8431a6d	Proper dimensions for entropy, sum before bonus in PPO	4 years ago
Andrew Cohen	190d8e4d	action model as a singleton	4 years ago
GitHub	e4db5dc5	ActionSpec and ActionBuffer (#4578)	4 years ago
GitHub	a4ba90ec	[bug-fix] Fix ONNX export/Barracuda import for continuous actions (#4608) * Use torch.ones rather than expand * use tf.cat instead of tf.expand for Barracuda * fix error msg handling * handle loading exception Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
Ervin Teng	f00ff21d	Add clipping to Torch	4 years ago
Ervin Teng	7e6cc222	Take mean of continuous entropy	4 years ago
Ervin Teng	89489ae0	Invert divide by 3 in log prob	4 years ago
GitHub	3c96a3a2	Action Model (#4580) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
Ervin Teng	7502a383	Proper dimensions	4 years ago
Ervin Teng	1f305f24	Test commit	4 years ago
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704)	4 years ago
Ervin Teng	66f3d2ad	Merge branch 'develop-contentropy' into develop-torch-clip	4 years ago
Ervin Teng	db6ebff6	Remove clip	4 years ago
GitHub	85a7c0f7	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649)	4 years ago
Ervin Teng	08c8862e	Add epsilon to log	4 years ago
Ervin Teng	eb4f3065	Add another epsilon	4 years ago
GitHub	de27d7a6	[bug-fix] Fix issue where NaNs are outputted by the policy when training Match3 (#4664) * match3 settings * Add epsilon to log * Add another epsilon * Revert match3 configs * NaN-free masking method * Add comment for paper * Add comment for paper Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
Ervin Teng	4f8f21e4	Bigger scale	4 years ago
vincentpierre	90da7426	Fix masking for torch	4 years ago
vincentpierre	e85d8e35	Added a comment and included the change of #4715 for simplicity	4 years ago
GitHub	990f801a	Develop hybrid action staging (#4702) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 years ago
GitHub	cc6b4564	Multi Directional Walker and Initial Hypernetwork (#4740)	4 years ago
Arthur Juliani	79898e06	Use hypernetwork in both places	4 years ago
Arthur Juliani	f36b10d8	Fix typo	4 years ago
GitHub	6f4f6044	Fix model inference issue with Barracuda v1.2.1 (#4766) (#4768) Co-authored-by: Ervin T. <ervin@unity3d.com>	4 years ago
Andrew Cohen	498b1ee6	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 years ago
Andrew Cohen	06f1f254	1:1 and continuous/discrete train	4 years ago
Andrew Cohen	6174c428	move action model to explicit distributions	4 years ago
Andrew Cohen	e81e68de	comms agent and fixed hallway	4 years ago
Andrew Cohen	ca5a5194	soccer comms on the cloud	4 years ago
Andrew Cohen	12828bdc	remove tau from diff for	4 years ago
Andrew Cohen	8543b917	add kickpower bonus	4 years ago
vincentpierre	52b011d6	_	4 years ago
vincentpierre	03c905b2	Fix equation for entropy	4 years ago
vincentpierre	5f9ea5ea	_	4 years ago
vincentpierre	a4b78d53	_	4 years ago
vincentpierre	6f3ea7b8	_	4 years ago
vincentpierre	f101393c	Better bounds for log probabilities	4 years ago
GitHub	3c1e98ca	Update ml-agents/mlagents/trainers/torch/distributions.py	4 years ago
vincentpierre	811319c0	Use clipped gaussian distribution for entropy calculation	4 years ago
GitHub	0ddfbfee	V2 staging new model version (#5080) * Make modelCheck have flavors of error messages * ONNX exporter v3 * Using a better CheckType and a switch statement * Removing unused message * More tests * Use an enum for valid versions and use GetVersion on model directly * Maybe the model export version a static constant in Python * Use static constructor for FailedCheck * Use static constructor for FailedCheck * Modifying the docstrings * renaming LegacyDiscreteActionOutputApplier * removing testing code * better warning message * Nest the CheckTypeEnum into the FailedCheck class * Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs * Adding a line explaining that legacy tensor checks are for versions 1.X only * Modifying the changelog * Exporting all the branches size instead of omly the sum (#5092) * addressing comments * Update com.unity.ml-agents/Runtime/Inference/BarracudaModelParamLoader.cs Co-authored...	4 years ago

78 commits (024bb104-c278-45a6-afc3-552ac446c9a9)