GitHub
328353bc
Torch : Saving/Loading of the reward providers ( #4405 )
* Saving the reward providers
* adding tests
* Moved the tests around
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
4 年前
Ruo-Ping Dong
88eff042
Merge branch 'master' into develop-saver-name
4 年前
Ruo-Ping Dong
56feb8af
update test_saver_reward_providers.py
4 年前
GitHub
48f217b9
Rename Saver to ModelSaver ( #4402 )
Rename Saver to ModelSaver to avoid confusion with tf.Saver
4 年前
Ruo-Ping Dong
fd1dc3a6
Merge branch 'master' into develop-torch-omp
4 年前
GitHub
6f534366
Add torch_utils class, auto-detect CUDA availability ( #4403 )
* Add torch_utils
* Use torch from torch_utils
* Add torch to banned modules in CI
* Better import error handling
* Fix flake8 errors
* Address comments
* Move networks to GPU if enabled
* Switch to torch_utils
* More flake8 problems
* Move reward providers to GPU/CPU
* Remove anothere set default tensor
* Fix banned import in test
4 年前
Andrew Cohen
3997b14b
Merge branch 'master' into develop-hybrid-actions
4 年前
Ervin Teng
7754ad7b
Don't run value during inference
4 年前
GitHub
4e4ad7b0
Don't run value during policy evaluate, optimized soft update function ( #4501 )
* Don't run value during inference
* Execute critic with LSTM
* Address comments
* Unformat
* Optimized soft update
* Move soft update to model utils
* Add test for soft update
4 年前
Ervin Teng
f9ff3efe
Merge branch 'develop-policyonly' into develop-sac-targetq
4 年前
GitHub
400e14cb
[Bug-fix] RND would not be saved correctly. Added tests ( #4514 )
4 年前
HH
a3bf96fd
Merge branch 'master' into hh/develop/gridsensor-tests
4 年前
Andrew Cohen
e5f14400
Merge branch 'master' into develop-hybrid-actions-singleton
4 年前
GitHub
b853e5ba
Action buffer ( #4612 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
8175d558
[bug-fix] Fix BC module + action clipping ( #4667 )
4 年前
Ruo-Ping Dong
ee5313e4
Merge branch 'master' into develop-windows-delay
4 年前
GitHub
f0ed3a38
Cherry-pick BC fixes to Release 10 ( #4668 )
4 年前
GitHub
990f801a
Develop hybrid action staging ( #4702 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
vincentpierre
93ca1409
fixing the tests
4 年前
vincentpierre
7a5cc9ec
Merge master into develop-rm-tf
4 年前
vincentpierre
c1587bce
Solving merge conflicts
4 年前
Arthur Juliani
0d2f8887
Merge remote-tracking branch 'origin/master' into goal-conditioning
# Conflicts:
# ml-agents-envs/mlagents_envs/base_env.py
# ml-agents-envs/mlagents_envs/rpc_utils.py
# ml-agents/mlagents/trainers/tests/mock_brain.py
# ml-agents/mlagents/trainers/tests/simple_test_envs.py
4 年前
Ervin Teng
25dfd883
Merge branch 'master' into develop-centralizedcritic
4 年前
vincentpierre
1a1070b1
forgot a file
4 年前
Andrew Cohen
a545859e
fix torch test policy
4 年前
Andrew Cohen
498b1ee6
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
vincentpierre
c3699de8
merging master and addressing comments
4 年前
GitHub
29d94c7c
Merge pull request #4734 from Unity-Technologies/develop-obs-as-list
Refactor trainers to use list of obs rather than vec and vis obs
4 年前
Andrew Cohen
c0d01baf
Merge branch 'master' into merge-release11-master
4 年前
vincentpierre
44ed3258
Merging master
4 年前
Chris Elion
76ebc20c
Merge remote-tracking branch 'origin/master' into r12-to-master
4 年前
Ervin Teng
330fc1d0
Merge branch 'master' into develop-centralizedcritic-mm
4 年前
Ruo-Ping Dong
8ed14762
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Ruo-Ping Dong
180d3e20
Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager
4 年前
HH
0024a286
merge ervin's new stuff
4 年前
GitHub
d7f549f9
Run pytest on GPU ( #4865 )
* make tests device-friendly
* mark all tests in test_simple_rl
4 年前
Arthur Juliani
ff70c5c4
Merge branch 'master' into goal-conditioning-new
4 年前
GitHub
2fb87e4f
Merge branch 'master' into reward-dist
4 年前
Ervin Teng
b6f88d6d
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Andrew Cohen
6828713c
fix saver test
4 年前
Ervin Teng
24ee4bd5
Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer
4 年前
Ervin Teng
e46a86ad
Merge branch 'master' into develop-superpush-int
4 年前
HH
15d512f9
Merge branch 'master' into hh/develop/dodgeball
4 年前
GitHub
338af2ec
Move the Critic into the Optimizer ( #4939 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
4 年前
HH
4c947151
Merge branch 'main' into hh/develop/dodgeball
4 年前
Ervin Teng
61781a1a
Merge branch 'main' into develop-agentprocessor-teammanager
4 年前
Arthur Juliani
06c147f8
Merge remote-tracking branch 'origin/main' into goal-conditioning-new
# Conflicts:
# Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity
# Project/ProjectSettings/TagManager.asset
# com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs
# com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs
# ml-agents/mlagents/trainers/torch/networks.py
# ml-agents/mlagents/trainers/torch/utils.py
4 年前
Ervin Teng
c8137dcd
Merge branch 'main' into develop-superpush-int
4 年前
GitHub
f16ce486
Update v2-staging from main (March 15) ( #5123 )
4 年前
Christopher Goy
921ba4f0
Update v2-staging from main (March 15) ( #5123 )
4 年前
Christopher Goy
ebe45056
Merge branch 'main' into release_14_branch-to-main
4 年前
GitHub
fc5d0a3f
[bug-fix] Fix save/restore critic, add test ( #5062 )
* Fix save/restore critic, add test
* Rename module for PPO
* Use correct policy in test
4 年前
Chris Elion
970f1d40
Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec
4 年前
Ervin Teng
1f026c70
Merge branch 'main' into develop-superpush-branch-cleanup
4 年前
Ervin Teng
ce872033
Revert "Merge branch 'main' into develop-superpush-branch-cleanup"
This reverts commit 5bea802525381f931a5e0f8b8778fe27a12f03af, reversing
changes made to cee3524e85161e13689d95f66bc6bff994d2cdfd.
4 年前
GitHub
8f35bdd3
POCA trainer ( #5005 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
Andrew Cohen
9e77d7e1
Merge branch 'main' into develop-soccer-groupman
4 年前
Ervin Teng
bed4bf36
Load individual elements if state dict load fails ( #5213 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>
(cherry picked from commit ac4f43cf18b98d0fc7063b9b831e07429f7ea39e)
4 年前
Andrew Cohen
42105f23
add load different reward tests
4 年前
Andrew Cohen
98dcb548
test convolutions can be loaded properly
4 年前
Andrew Cohen
2e5b1352
add check that layers still have different dimensions
4 年前
GitHub
cb1f5462
Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
f3d586bc
Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver.py
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
9dfe6c7f
Load individual elements if state dict load fails ( #5213 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ervin T. <ervin@unity3d.com>
4 年前