GitHub
7ddfd81f
Added Reward Providers for Torch ( #4280 )
* Added Reward Providers for Torch
* Use NetworkBody to encode state in the reward providers
* Integrating the reward prodiders with ppo and torch
* work in progress, integration with PPO. Not training properly Pyramids at the moment
* Integration in PPO
* Removing duplicate file
* Gail and Curiosity working
* addressing comments
* Enfore float32 for tests
* enfore np.float32 in buffer
4 年前
vincentpierre
108fac9a
Replace torch.detach().cpu().numpy() with a utils method
4 年前
vincentpierre
fdd343b2
more use of item() and additional tests
4 年前
GitHub
12e15e29
Fix on GAIL Torch when using actions ( #4407 )
4 年前
GitHub
498934f9
Replace torch.detach().cpu().numpy() with a utils method ( #4406 )
* Replace torch.detach().cpu().numpy() with a utils method
* Using item() in place of to_numpy()
* more use of item() and additional tests
4 年前
Ruo-Ping Dong
fd1dc3a6
Merge branch 'master' into develop-torch-omp
4 年前
GitHub
7b4d0865
[Bug fix] Fix bug in GAIL gradient penalty ( #4425 )
4 年前
GitHub
4e93cb6e
[torch] Restructure PyTorch encoders ( #4421 )
* Move linear encoding to NetworkBody
* moved encoders to processors (#4420 )
* fix bad merge
* Get it running
* Replace mentions of visual_encoders
* Remove output_size property
* Fix tests
* Fix some references
* Revert test_simple_rl
* Fix networks test
* Make curiosity test more accomodating
* Rename total_input_size
* [Bug fix] Fix bug in GAIL gradient penalty (#4425 ) (#4426 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
* Up number of steps
* Rename to visual_processors and vector_processors
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
6f534366
Add torch_utils class, auto-detect CUDA availability ( #4403 )
* Add torch_utils
* Use torch from torch_utils
* Add torch to banned modules in CI
* Better import error handling
* Fix flake8 errors
* Address comments
* Move networks to GPU if enabled
* Switch to torch_utils
* More flake8 problems
* Move reward providers to GPU/CPU
* Remove anothere set default tensor
* Fix banned import in test
4 年前
GitHub
676f5f7c
[refactor] Refactor GAIL to use new encoder structure ( #4433 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
Ervin Teng
60eacc0d
Merge branch 'master' into develop-adjust-cpu-settings
4 年前
Andrew Cohen
3997b14b
Merge branch 'master' into develop-hybrid-actions
4 年前
vincentpierre
181bdec0
-
4 年前
GitHub
60b76790
Random Network Distillation for Torch ( #4473 )
* initial commit
* works with Pyramids
* added unit tests and a separate config file
* Adding first batch of documentation
* adding in the docs that rnd is only for PyTorch
* adding newline at the end of the config files
* adding some docs
* Code comments
* no normalization of the reward
* Fixing the tests
* [skip ci]
* [skip ci] Make sure RND will only work for Torch by editing the config file
* [skip ci] Additional information in the Documentation
* Remove the _has_updated_once flag
4 年前
Andrew Cohen
e5f14400
Merge branch 'master' into develop-hybrid-actions-singleton
4 年前
Andrew Cohen
7827ca06
add ActionSpec; test_simple_rl torch passes
4 年前
Andrew Cohen
6e23bafd
ActionFlattener Refactor
4 年前
GitHub
cb8e4d25
Add ActionSpec ( #4586 )
Co-authored-by: Ervin T <ervin@unity3d.com>
4 年前
Andrew Cohen
9689cf2c
remove *_action_* from function names
4 年前
GitHub
64e998a2
[bug-fix] Use float64 when converting np.ndarray to torch.tensor, cap Torch version to 1.7.x ( #4610 )
* Use float64 in GAIL tests
* Use float32 when converting np arrays by default
* Enforce torch 1.7.x or below
* Add comment about Windows install
* Adjust tests
4 年前
vincentpierre
a3a9a56b
Merge branch 'exp-multi-head-attention' into exp-bullet-hell
4 年前
Ruo-Ping Dong
9e08be87
Merge branch 'master' into release_9_branch_merge
4 年前
GitHub
b853e5ba
Action buffer ( #4612 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
Andrew Cohen
0e28dd8f
add static method to create continuous/discrete
4 年前
GitHub
88d3ec3e
Merge master into hybrid actions staging branch ( #4704 )
4 年前
GitHub
85a7c0f7
[bug-fix] Add clipping to PyTorch policy, fix initialization ( #4649 )
4 年前
GitHub
2a8c6800
[bug-fix] Add clipping to PyTorch policy, fix initialization ( #4649 ) ( #4662 )
4 年前
Ruo-Ping Dong
953cb6bb
Merge branch 'master' into develop-windows-delay
4 年前
Andrew Cohen
afd16cc9
rename make_x to creat_x/remove redundant properties
4 年前
Andrew Cohen
e5cc57f9
rename create random to random action
4 年前
Andrew Cohen
f6355ba9
Merge branch 'develop-action-spec' into develop-action-buffer
4 年前
GitHub
990f801a
Develop hybrid action staging ( #4702 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
vincentpierre
735fcd52
[WIP] Refactor trainers to use list of obs rather than vec and vis obs
4 年前
Andrew Cohen
85e4db33
bc tests pass
4 年前
Arthur Juliani
b8f22fd7
Update second half of tests
4 年前
vincentpierre
7a5cc9ec
Merge master into develop-rm-tf
4 年前
Andrew Cohen
24fd9b3c
torch reward providers all pass
4 年前
vincentpierre
c1587bce
Solving merge conflicts
4 年前
Andrew Cohen
4ebc6c44
ml-agents-envs pass
4 年前
GitHub
ded1f79b
Merge pull request #4732 from Unity-Technologies/goal-sensors
Adds SensorTypes and GoalSensors
4 年前
Andrew Cohen
b5d1c071
Merge branch 'master' into develop-action-buffer
4 年前
Arthur Juliani
0d2f8887
Merge remote-tracking branch 'origin/master' into goal-conditioning
# Conflicts:
# ml-agents-envs/mlagents_envs/base_env.py
# ml-agents-envs/mlagents_envs/rpc_utils.py
# ml-agents/mlagents/trainers/tests/mock_brain.py
# ml-agents/mlagents/trainers/tests/simple_test_envs.py
4 年前
Ervin Teng
25dfd883
Merge branch 'master' into develop-centralizedcritic
4 年前
vincentpierre
8cb050ef
WIP Made initial changes to enale dimension properties and added attention module
4 年前
Andrew Cohen
498b1ee6
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
GitHub
a73f7d73
Turn down gain on GAIL discriminator output ( #4762 )
4 年前
GitHub
b6bb01b9
Turn down gain on GAIL discriminator output ( #4762 ) ( #4772 )
4 年前
vincentpierre
c3699de8
merging master and addressing comments
4 年前
GitHub
29d94c7c
Merge pull request #4734 from Unity-Technologies/develop-obs-as-list
Refactor trainers to use list of obs rather than vec and vis obs
4 年前
vincentpierre
719c969c
addressing comments. ObservationSpec is no longer a list
4 年前
vincentpierre
4bba4e8e
Renaming ObservationSpec to SensorSpec
4 年前
Andrew Cohen
c0d01baf
Merge branch 'master' into merge-release11-master
4 年前
Andrew Cohen
5f0f7e3a
fix reward provider tests
4 年前
vincentpierre
c5a057d2
renaming obs_spec variables
4 年前
vincentpierre
44ed3258
Merging master
4 年前
vincentpierre
449712b0
renaming sensor_spec to sensor_specS
4 年前
Andrew Cohen
35769b53
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
Chris Elion
76ebc20c
Merge remote-tracking branch 'origin/master' into r12-to-master
4 年前
GitHub
458fee17
Merge pull request #4763 from Unity-Technologies/develop-att
WIP Made initial changes to enable dimension properties and added attention module
4 年前
Ervin Teng
330fc1d0
Merge branch 'master' into develop-centralizedcritic-mm
4 年前
vincentpierre
519c5f47
merging master
4 年前
Ruo-Ping Dong
8ed14762
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Arthur Juliani
0b4b0992
Rename more files
4 年前
Arthur Juliani
7c37c759
Fix some mis-renamings
4 年前
Ruo-Ping Dong
180d3e20
Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager
4 年前
HH
0024a286
merge ervin's new stuff
4 年前
GitHub
67ad9651
Merge pull request #4825 from Unity-Technologies/sensor-types
[WIP] Observation Types
4 年前
vincentpierre
8660b1c2
merging master
4 年前
brccabral
457fb612
Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents
4 年前
GitHub
64fc7f43
Buffer key enums ( #4907 )
4 年前
Ervin Teng
b6f88d6d
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Ruo-Ping Dong
c87bce9e
Merge branch 'master' into develop-base-teammanager
4 年前
vincentpierre
e1b94b8b
Merge branch 'master' into develop-var-len-obs-feature
4 年前
Chris Elion
e4f51ca7
Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider
4 年前
Ervin Teng
d4438878
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Chris Elion
c3bc8991
cleanup, don't store mask
4 年前
Ervin Teng
e46a86ad
Merge branch 'master' into develop-superpush-int
4 年前
HH
15d512f9
Merge branch 'master' into hh/develop/dodgeball
4 年前
Arthur Juliani
06c147f8
Merge remote-tracking branch 'origin/main' into goal-conditioning-new
# Conflicts:
# Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity
# Project/ProjectSettings/TagManager.asset
# com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs
# com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs
# ml-agents/mlagents/trainers/torch/networks.py
# ml-agents/mlagents/trainers/torch/utils.py
4 年前
GitHub
f16ce486
Update v2-staging from main (March 15) ( #5123 )
4 年前
Christopher Goy
921ba4f0
Update v2-staging from main (March 15) ( #5123 )
4 年前
GitHub
ba2af269
[coma2] Make group extrinsic reward part of extrinsic ( #5033 )
* Make group extrinsic part of extrinsic
* Fix test and init
* Fix tests and bug
* Add baseline loss to TensorBoard
4 年前
Chris Elion
970f1d40
Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec
4 年前
GitHub
8f35bdd3
POCA trainer ( #5005 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
Andrew Cohen
9e77d7e1
Merge branch 'main' into develop-soccer-groupman
4 年前