ml-agents

作者	SHA1	备注	提交日期
GitHub	7ddfd81f	Added Reward Providers for Torch (#4280 ) * Added Reward Providers for Torch * Use NetworkBody to encode state in the reward providers * Integrating the reward prodiders with ppo and torch * work in progress, integration with PPO. Not training properly Pyramids at the moment * Integration in PPO * Removing duplicate file * Gail and Curiosity working * addressing comments * Enfore float32 for tests * enfore np.float32 in buffer	4 年前
vincentpierre	108fac9a	Replace torch.detach().cpu().numpy() with a utils method	4 年前
vincentpierre	fdd343b2	more use of item() and additional tests	4 年前
GitHub	12e15e29	Fix on GAIL Torch when using actions (#4407 )	4 年前
GitHub	498934f9	Replace torch.detach().cpu().numpy() with a utils method (#4406 ) * Replace torch.detach().cpu().numpy() with a utils method * Using item() in place of to_numpy() * more use of item() and additional tests	4 年前
Ruo-Ping Dong	fd1dc3a6	Merge branch 'master' into develop-torch-omp	4 年前
GitHub	7b4d0865	[Bug fix] Fix bug in GAIL gradient penalty (#4425 )	4 年前
GitHub	4e93cb6e	[torch] Restructure PyTorch encoders (#4421 ) * Move linear encoding to NetworkBody * moved encoders to processors (#4420) * fix bad merge * Get it running * Replace mentions of visual_encoders * Remove output_size property * Fix tests * Fix some references * Revert test_simple_rl * Fix networks test * Make curiosity test more accomodating * Rename total_input_size * [Bug fix] Fix bug in GAIL gradient penalty (#4425) (#4426) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Up number of steps * Rename to visual_processors and vector_processors Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	6f534366	Add torch_utils class, auto-detect CUDA availability (#4403 ) * Add torch_utils * Use torch from torch_utils * Add torch to banned modules in CI * Better import error handling * Fix flake8 errors * Address comments * Move networks to GPU if enabled * Switch to torch_utils * More flake8 problems * Move reward providers to GPU/CPU * Remove anothere set default tensor * Fix banned import in test	4 年前
GitHub	676f5f7c	[refactor] Refactor GAIL to use new encoder structure (#4433 ) Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Ervin Teng	60eacc0d	Merge branch 'master' into develop-adjust-cpu-settings	4 年前
Andrew Cohen	3997b14b	Merge branch 'master' into develop-hybrid-actions	4 年前
vincentpierre	181bdec0	-	4 年前
GitHub	60b76790	Random Network Distillation for Torch (#4473 ) * initial commit * works with Pyramids * added unit tests and a separate config file * Adding first batch of documentation * adding in the docs that rnd is only for PyTorch * adding newline at the end of the config files * adding some docs * Code comments * no normalization of the reward * Fixing the tests * [skip ci] * [skip ci] Make sure RND will only work for Torch by editing the config file * [skip ci] Additional information in the Documentation * Remove the _has_updated_once flag	4 年前
Andrew Cohen	e5f14400	Merge branch 'master' into develop-hybrid-actions-singleton	4 年前
Andrew Cohen	7827ca06	add ActionSpec; test_simple_rl torch passes	4 年前
Andrew Cohen	6e23bafd	ActionFlattener Refactor	4 年前
GitHub	cb8e4d25	Add ActionSpec (#4586 ) Co-authored-by: Ervin T <ervin@unity3d.com>	4 年前
Andrew Cohen	9689cf2c	remove _action_ from function names	4 年前
GitHub	64e998a2	[bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x (#4610 ) * Use float64 in GAIL tests * Use float32 when converting np arrays by default * Enforce torch 1.7.x or below * Add comment about Windows install * Adjust tests	4 年前
vincentpierre	a3a9a56b	Merge branch 'exp-multi-head-attention' into exp-bullet-hell	4 年前
Ruo-Ping Dong	9e08be87	Merge branch 'master' into release_9_branch_merge	4 年前
GitHub	b853e5ba	Action buffer (#4612 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	0e28dd8f	add static method to create continuous/discrete	4 年前
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704 )	4 年前
GitHub	85a7c0f7	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649 )	4 年前
GitHub	2a8c6800	[bug-fix] Add clipping to PyTorch policy, fix initialization (#4649 ) (#4662 )	4 年前
Ruo-Ping Dong	953cb6bb	Merge branch 'master' into develop-windows-delay	4 年前
Andrew Cohen	afd16cc9	rename make_x to creat_x/remove redundant properties	4 年前
Andrew Cohen	e5cc57f9	rename create random to random action	4 年前
Andrew Cohen	f6355ba9	Merge branch 'develop-action-spec' into develop-action-buffer	4 年前
GitHub	990f801a	Develop hybrid action staging (#4702 ) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
vincentpierre	735fcd52	[WIP] Refactor trainers to use list of obs rather than vec and vis obs	4 年前
Andrew Cohen	85e4db33	bc tests pass	4 年前
Arthur Juliani	b8f22fd7	Update second half of tests	4 年前
vincentpierre	7a5cc9ec	Merge master into develop-rm-tf	4 年前
Andrew Cohen	24fd9b3c	torch reward providers all pass	4 年前
vincentpierre	c1587bce	Solving merge conflicts	4 年前
Andrew Cohen	4ebc6c44	ml-agents-envs pass	4 年前
GitHub	ded1f79b	Merge pull request #4732 from Unity-Technologies/goal-sensors Adds SensorTypes and GoalSensors	4 年前
Andrew Cohen	b5d1c071	Merge branch 'master' into develop-action-buffer	4 年前
Arthur Juliani	0d2f8887	Merge remote-tracking branch 'origin/master' into goal-conditioning # Conflicts: # ml-agents-envs/mlagents_envs/base_env.py # ml-agents-envs/mlagents_envs/rpc_utils.py # ml-agents/mlagents/trainers/tests/mock_brain.py # ml-agents/mlagents/trainers/tests/simple_test_envs.py	4 年前
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 年前
vincentpierre	8cb050ef	WIP Made initial changes to enale dimension properties and added attention module	4 年前
Andrew Cohen	498b1ee6	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 年前
GitHub	a73f7d73	Turn down gain on GAIL discriminator output (#4762 )	4 年前
GitHub	b6bb01b9	Turn down gain on GAIL discriminator output (#4762 ) (#4772 )	4 年前
vincentpierre	c3699de8	merging master and addressing comments	4 年前
GitHub	29d94c7c	Merge pull request #4734 from Unity-Technologies/develop-obs-as-list Refactor trainers to use list of obs rather than vec and vis obs	4 年前
vincentpierre	719c969c	addressing comments. ObservationSpec is no longer a list	4 年前
vincentpierre	4bba4e8e	Renaming ObservationSpec to SensorSpec	4 年前
Andrew Cohen	c0d01baf	Merge branch 'master' into merge-release11-master	4 年前
Andrew Cohen	5f0f7e3a	fix reward provider tests	4 年前
vincentpierre	c5a057d2	renaming obs_spec variables	4 年前
vincentpierre	44ed3258	Merging master	4 年前
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 年前
Andrew Cohen	35769b53	Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton	4 年前
Chris Elion	76ebc20c	Merge remote-tracking branch 'origin/master' into r12-to-master	4 年前
GitHub	458fee17	Merge pull request #4763 from Unity-Technologies/develop-att WIP Made initial changes to enable dimension properties and added attention module	4 年前
Ervin Teng	330fc1d0	Merge branch 'master' into develop-centralizedcritic-mm	4 年前
vincentpierre	519c5f47	merging master	4 年前
Ruo-Ping Dong	8ed14762	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 年前
Arthur Juliani	0b4b0992	Rename more files	4 年前
Arthur Juliani	7c37c759	Fix some mis-renamings	4 年前
Ruo-Ping Dong	180d3e20	Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager	4 年前
HH	0024a286	merge ervin's new stuff	4 年前
GitHub	67ad9651	Merge pull request #4825 from Unity-Technologies/sensor-types [WIP] Observation Types	4 年前
vincentpierre	8660b1c2	merging master	4 年前
brccabral	457fb612	Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents	4 年前
GitHub	64fc7f43	Buffer key enums (#4907 )	4 年前
Ervin Teng	b6f88d6d	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Ruo-Ping Dong	c87bce9e	Merge branch 'master' into develop-base-teammanager	4 年前
vincentpierre	e1b94b8b	Merge branch 'master' into develop-var-len-obs-feature	4 年前
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 年前
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Chris Elion	c3bc8991	cleanup, don't store mask	4 年前
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 年前
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 年前
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
Christopher Goy	921ba4f0	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	ba2af269	[coma2] Make group extrinsic reward part of extrinsic (#5033 ) * Make group extrinsic part of extrinsic * Fix test and init * Fix tests and bug * Add baseline loss to TensorBoard	4 年前
Chris Elion	970f1d40	Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec	4 年前
GitHub	8f35bdd3	POCA trainer (#5005 ) Co-authored-by: Ervin Teng <ervin@unity3d.com> Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com> Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
Andrew Cohen	9e77d7e1	Merge branch 'main' into develop-soccer-groupman	4 年前

1 2

85 次代码提交 (d27ae3fc-de5f-492e-adce-1bba8e7dc778)