GitHub
6b255790
Behavioral Cloning Pytorch ( #4293 )
4 年前
Ervin Teng
3387f56d
Fix BC module
4 年前
GitHub
6a1d993f
[add-fire] Memory class abstraction ( #4375 )
4 年前
Ervin Teng
8ff8c401
Merge branch 'develop-add-fire' into develop-add-fire-export
4 年前
GitHub
1955af9e
[feature] Add experimental PyTorch support ( #4335 )
* Begin porting work
* Add ResNet and distributions
* Dynamically construct actor and critic
* Initial optimizer port
* Refactoring policy and optimizer
* Resolving a few bugs
* Share more code between tf and torch policies
* Slightly closer to running model
* Training runs, but doesn’t actually work
* Fix a couple additional bugs
* Add conditional sigma for distribution
* Fix normalization
* Support discrete actions as well
* Continuous and discrete now train
* Mulkti-discrete now working
* Visual observations now train as well
* GRU in-progress and dynamic cnns
* Fix for memories
* Remove unused arg
* Combine actor and critic classes. Initial export.
* Support tf and pytorch alongside one another
* Prepare model for onnx export
* Use LSTM and fix a few merge errors
* Fix bug in probs calculation
* Optimize np -> tensor operations
* Time action sample funct...
4 年前
vincentpierre
108fac9a
Replace torch.detach().cpu().numpy() with a utils method
4 年前
vincentpierre
31750e97
Using item() in place of to_numpy()
4 年前
GitHub
498934f9
Replace torch.detach().cpu().numpy() with a utils method ( #4406 )
* Replace torch.detach().cpu().numpy() with a utils method
* Using item() in place of to_numpy()
* more use of item() and additional tests
4 年前
Ruo-Ping Dong
fd1dc3a6
Merge branch 'master' into develop-torch-omp
4 年前
GitHub
4e93cb6e
[torch] Restructure PyTorch encoders ( #4421 )
* Move linear encoding to NetworkBody
* moved encoders to processors (#4420 )
* fix bad merge
* Get it running
* Replace mentions of visual_encoders
* Remove output_size property
* Fix tests
* Fix some references
* Revert test_simple_rl
* Fix networks test
* Make curiosity test more accomodating
* Rename total_input_size
* [Bug fix] Fix bug in GAIL gradient penalty (#4425 ) (#4426 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
* Up number of steps
* Rename to visual_processors and vector_processors
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
6f534366
Add torch_utils class, auto-detect CUDA availability ( #4403 )
* Add torch_utils
* Use torch from torch_utils
* Add torch to banned modules in CI
* Better import error handling
* Fix flake8 errors
* Address comments
* Move networks to GPU if enabled
* Switch to torch_utils
* More flake8 problems
* Move reward providers to GPU/CPU
* Remove anothere set default tensor
* Fix banned import in test
4 年前
Andrew Cohen
3997b14b
Merge branch 'master' into develop-hybrid-actions
4 年前
Ervin Teng
43c41d66
Fix BC and Reward Signals
4 年前
Ervin Teng
7754ad7b
Don't run value during inference
4 年前
vincentpierre
181bdec0
-
4 年前
GitHub
4e4ad7b0
Don't run value during policy evaluate, optimized soft update function ( #4501 )
* Don't run value during inference
* Execute critic with LSTM
* Address comments
* Unformat
* Optimized soft update
* Move soft update to model utils
* Add test for soft update
4 年前
Ervin Teng
f9ff3efe
Merge branch 'develop-policyonly' into develop-sac-targetq
4 年前
Andrew Cohen
e5f14400
Merge branch 'master' into develop-hybrid-actions-singleton
4 年前
Andrew Cohen
8013e544
ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member)
4 年前
GitHub
cb8e4d25
Add ActionSpec ( #4586 )
Co-authored-by: Ervin T <ervin@unity3d.com>
4 年前
Andrew Cohen
9689cf2c
remove *_action_* from function names
4 年前
vincentpierre
a3a9a56b
Merge branch 'exp-multi-head-attention' into exp-bullet-hell
4 年前
Ruo-Ping Dong
9e08be87
Merge branch 'master' into release_9_branch_merge
4 年前
Andrew Cohen
6cf54bf2
remove self.action_spec from policy/bc
4 年前
GitHub
b853e5ba
Action buffer ( #4612 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
3c96a3a2
Action Model ( #4580 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
88d3ec3e
Merge master into hybrid actions staging branch ( #4704 )
4 年前
GitHub
8175d558
[bug-fix] Fix BC module + action clipping ( #4667 )
4 年前
Ruo-Ping Dong
ee5313e4
Merge branch 'master' into develop-windows-delay
4 年前
GitHub
f0ed3a38
Cherry-pick BC fixes to Release 10 ( #4668 )
4 年前
Ervin Teng
4158629e
Properly feed in None rather than empty arrays
4 年前
Ervin Teng
79a3051e
Update GAIL and BC
4 年前
Ervin Teng
fdaa8c3d
Merge branch 'develop-unified-obs' into develop-centralizedcritic
4 年前
GitHub
990f801a
Develop hybrid action staging ( #4702 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
Andrew Cohen
85e4db33
bc tests pass
4 年前
vincentpierre
93ca1409
fixing the tests
4 年前
vincentpierre
7a5cc9ec
Merge master into develop-rm-tf
4 年前
Andrew Cohen
24fd9b3c
torch reward providers all pass
4 年前
vincentpierre
c1587bce
Solving merge conflicts
4 年前
Arthur Juliani
0d2f8887
Merge remote-tracking branch 'origin/master' into goal-conditioning
# Conflicts:
# ml-agents-envs/mlagents_envs/base_env.py
# ml-agents-envs/mlagents_envs/rpc_utils.py
# ml-agents/mlagents/trainers/tests/mock_brain.py
# ml-agents/mlagents/trainers/tests/simple_test_envs.py
4 年前
Ervin Teng
25dfd883
Merge branch 'master' into develop-centralizedcritic
4 年前
vincentpierre
0c81006d
addressing comments
4 年前
Andrew Cohen
a545859e
fix torch test policy
4 年前
Andrew Cohen
498b1ee6
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
vincentpierre
c3699de8
merging master and addressing comments
4 年前
GitHub
29d94c7c
Merge pull request #4734 from Unity-Technologies/develop-obs-as-list
Refactor trainers to use list of obs rather than vec and vis obs
4 年前
Andrew Cohen
1d234d1d
bc works
4 年前
Andrew Cohen
7973b46c
remove print bc
4 年前
Andrew Cohen
c0d01baf
Merge branch 'master' into merge-release11-master
4 年前
vincentpierre
44ed3258
Merging master
4 年前
vincentpierre
449712b0
renaming sensor_spec to sensor_specS
4 年前
Andrew Cohen
17496265
move AgentAction, ActionLogProbs, and ActionFlattener to separate files
4 年前
Chris Elion
76ebc20c
Merge remote-tracking branch 'origin/master' into r12-to-master
4 年前
GitHub
458fee17
Merge pull request #4763 from Unity-Technologies/develop-att
WIP Made initial changes to enable dimension properties and added attention module
4 年前
Ervin Teng
330fc1d0
Merge branch 'master' into develop-centralizedcritic-mm
4 年前
vincentpierre
519c5f47
merging master
4 年前
Ruo-Ping Dong
8ed14762
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Andrew Cohen
7ba10239
remove action spec attribute from policy
4 年前
Andrew Cohen
886883b3
Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton
4 年前
Arthur Juliani
0b4b0992
Rename more files
4 年前
Ruo-Ping Dong
a7d04be6
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Arthur Juliani
0a876b9c
Fix typos
4 年前
Ruo-Ping Dong
180d3e20
Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager
4 年前
HH
0024a286
merge ervin's new stuff
4 年前
GitHub
67ad9651
Merge pull request #4825 from Unity-Technologies/sensor-types
[WIP] Observation Types
4 年前
vincentpierre
8660b1c2
merging master
4 年前
brccabral
457fb612
Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents
4 年前
vincentpierre
6f3ea7b8
_
4 年前
GitHub
64fc7f43
Buffer key enums ( #4907 )
4 年前
Ervin Teng
b6f88d6d
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Andrew Cohen
6bd396ee
add critic to optimizer, ppo runs
4 年前
Ervin Teng
0bde7598
Back out trainer changes
4 年前
Ruo-Ping Dong
c87bce9e
Merge branch 'master' into develop-base-teammanager
4 年前
vincentpierre
e1b94b8b
Merge branch 'master' into develop-var-len-obs-feature
4 年前
Chris Elion
e4f51ca7
Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider
4 年前
Ervin Teng
d4438878
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Chris Elion
c3bc8991
cleanup, don't store mask
4 年前
Ervin Teng
e46a86ad
Merge branch 'master' into develop-superpush-int
4 年前
HH
15d512f9
Merge branch 'master' into hh/develop/dodgeball
4 年前
GitHub
338af2ec
Move the Critic into the Optimizer ( #4939 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
4 年前
HH
4c947151
Merge branch 'main' into hh/develop/dodgeball
4 年前
Ervin Teng
61781a1a
Merge branch 'main' into develop-agentprocessor-teammanager
4 年前
Arthur Juliani
06c147f8
Merge remote-tracking branch 'origin/main' into goal-conditioning-new
# Conflicts:
# Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity
# Project/ProjectSettings/TagManager.asset
# com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs
# com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs
# ml-agents/mlagents/trainers/torch/networks.py
# ml-agents/mlagents/trainers/torch/utils.py
4 年前
Ervin Teng
c8137dcd
Merge branch 'main' into develop-superpush-int
4 年前
Christopher Goy
ebe45056
Merge branch 'main' into release_14_branch-to-main
4 年前
vincentpierre
983982ee
Removing misleading learning rate
3 年前