GitHub
e4177de0
[change] Organize trainer files a bit better ( #3538 )
5 年前
Andrew Cohen
573b1f6d
Merge branch 'master' into soccer-fives
5 年前
Anupam Bhatnagar
f4dbedcf
removed extraneous logging imports and loggers
5 年前
GitHub
86141eee
Merge pull request #3560 from Unity-Technologies/new-logger
Add timestamps to logs
5 年前
Anupam Bhatnagar
e8e0078e
first commit
5 年前
Anupam Bhatnagar
07b15ae7
[skip-ci] small refactors
5 年前
GitHub
e3af96ca
Merge branch 'master' into develop-demo-load-seek
5 年前
GitHub
873ba7fd
[bug-fix] Fix stats reporting for reward signals in SAC ( #3606 )
5 年前
Andrew Cohen
b1cfa74d
Merge branch 'master' into develop-test-imitation
5 年前
GitHub
ec278616
Hotfixes for Release 0.15.1 ( #3698 )
* [bug-fix] Increase height of wall in CrawlerStatic (#3650 )
* [bug-fix] Improve performance for PPO with continuous actions (#3662 )
* Corrected a typo in a name of a function (#3670 )
OnEpsiodeBegin was corrected to OnEpisodeBegin in Migrating.md document
* Add Academy.AutomaticSteppingEnabled to migration (#3666 )
* Fix editor port in Dockerfile (#3674 )
* Hotfix memory leak on Python (#3664 )
* Hotfix memory leak on Python
* Fixing
* Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done
* [bug-fix] Make Python able to deal with 0-step episodes (#3671 )
* adding some comments
Co-authored-by: Ervin T <ervin@unity3d.com>
* Remove vis_encode_type from list of required (#3677 )
* Update changelog (#3678 )
* Shorten timeout duration for environment close (#3679 )
The timeout duration for closing an environment was set to the
same duration as the timeout when waiting ...
5 年前
Andrew Cohen
53bea15c
Merge branch 'master' into soccer-fives
5 年前
Andrew Cohen
ac261e36
Merge branch 'master' into self-play-mutex
5 年前
GitHub
29f82921
[bug-fix] Improve performance for PPO with continuous actions ( #3662 )
5 年前
GitHub
de3fc4e8
Hotfix memory leak on Python ( #3664 )
* Hotfix memory leak on Python
* Fixing
* Fixing a bug in the heuristic policy. A decision should not be requested when the agent is done
* [bug-fix] Make Python able to deal with 0-step episodes (#3671 )
* adding some comments
Co-authored-by: Ervin T <ervin@unity3d.com>
5 年前
Andrew Cohen
4c9ac553
Merge branch 'master' into self-play-mutex
5 年前
Andrew Cohen
93d344ff
simple rl asymm ghost tests
5 年前
GitHub
4ecd6ad3
Fix how we set logging levels ( #3703 )
* cleanup logging
* comments and cleanup
* pylint, gym
5 年前
Andrew Cohen
cd677346
Merge branch 'self-play-mutex' into soccer-2v1
5 年前
Andrew Cohen
c7a34413
Merge branch 'self-play-mutex' into soccer-2v1
5 年前
GitHub
bc1fdf07
[refactor] CLI changes ( #3705 )
5 年前
Andrew Cohen
59b88be6
Merge branch 'master' into self-play-mutex
5 年前
GitHub
9cbc3fa2
Asymmetric self-play ( #3653 )
5 年前
Ervin Teng
06fa3d39
Merge branch 'master' into develop-sac-apex
5 年前
Anupam Bhatnagar
50e52d9c
Merge branch 'master' into distributed-training
5 年前
GitHub
d7ca6b8d
[feature] Add --initialize-from option ( #3710 )
5 年前
Andrew Cohen
1b9c643b
Merge branch 'master' into self-play-mutex
5 年前
Anupam Bhatnagar
001fce2a
first commit
5 年前
GitHub
43f23ee3
WIP : Changes to the LL-API - Refactor of “done” logic ( #3681 )
* [skip ci] WIP : Modify the base_env.py file
* [skip ci] typo
* [skip ci] renamed some methods
* [skip ci] Incorporated changes from our meeting
* [skip ci] everything is broken
* [skip ci] everything is broken
* [skip ci] formatting
* Fixing the gym tests
* Fixing bug, C# has an error that needs fixing
* Fixing the test
* relaxing the threshold of 0.99 to 0.9
* fixing the C# side
* formating
* Fixed the llapi integratio test
* [Increasing steps for testing]
* Fixing the python tests
* Need __contains__ after all
* changing the max_steps in the tests
* addressing comments
* Making env_manager logic clearer as proposed in the comments
* Remove duplicated logic and added back in episode length (#3728 )
* removing mentions of multi-agent in gym and changed the docstring in base_env.py
* Edited the Documentation for the changes to the LLAPI (#3733 )
* Edite...
5 年前
Anupam Bhatnagar
9341f7a2
[skip-ci] small refactors
5 年前
Andrew Cohen
7a7eb324
Merge branch 'master' into internal-policy-ghost
5 年前
Andrew Cohen
930d6fa3
Merge branch 'self-play-mutex' into soccer-2v1
5 年前
GitHub
aae58330
Merge branch 'master' into develop-add-inference-examples
5 年前
Andrew Cohen
b0c506a6
Merge branch 'soccer-2v1' into asymm-envs
5 年前
Andrew Cohen
933d7b32
Merge branch 'self-play-mutex' into soccer-2v1
5 年前
Ervin Teng
5e980ec1
Merge branch 'master' into develop-sac-apex
5 年前
Andrew Cohen
f41695b9
Merge branch 'soccer-2v1' into asymm-envs
5 年前
Arthur Juliani
dc50162d
Add ResNet and distributions
5 年前
Andrew Cohen
80469267
Merge branch 'internal-policy-ghost' into soccer-2v1
5 年前
Andrew Cohen
4468280a
Merge branch 'soccer-2v1' into asymm-envs
5 年前
Andrew Cohen
9bec75ee
Merge branch 'master' into soccer-2v1
5 年前
Arthur Juliani
e166d018
Dynamically construct actor and critic
5 年前
Arthur Juliani
6879bae4
Initial optimizer port
5 年前
Arthur Juliani
7c3bd376
Refactoring policy and optimizer
5 年前
Arthur Juliani
2e51260a
Resolving a few bugs
5 年前
Andrew Cohen
185d4b35
Merge branch 'soccer-2v1' into asymm-envs
5 年前
Arthur Juliani
b997f214
Share more code between tf and torch policies
5 年前
Arthur Juliani
947f0d32
Slightly closer to running model
5 年前
Arthur Juliani
3c82bf59
Training runs, but doesn’t actually work
5 年前
Arthur Juliani
8c6f4696
Fix a couple additional bugs
5 年前
Arthur Juliani
4a50444f
Support discrete actions as well
5 年前
Arthur Juliani
a11a79e4
Continuous and discrete now train
5 年前
GitHub
232519e4
[refactor] Move output artifacts to a single results/ folder ( #3829 )
5 年前
Arthur Juliani
82688e5c
GRU in-progress and dynamic cnns
5 年前
Arthur Juliani
1736559f
Combine actor and critic classes. Initial export.
5 年前
Arthur Juliani
ca887743
Support tf and pytorch alongside one another
5 年前
Arthur Juliani
9835d26c
Prepare model for onnx export
5 年前
Chris Elion
68b68396
Merge remote-tracking branch 'origin/master' into release_1_to_master
5 年前
GitHub
d2bc86c8
Release 2 cherry pick ( #3971 )
* [bug-fix] Fix issue with initialize not resetting step count (#3962 )
* Develop better error message for #3953 (#3963 )
* Making the error for wrong number of agents raise consistently
* Better error message for inputs of wrong dimensions
* Fix #3932 , stop the editor from going into a loop when a prefab is selected. (#3949 )
* Minor doc updates to release
* add unit tests and fix exceptions (#3930 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Chris Goy <christopherg@unity3d.com>
5 年前
vincentpierre
c34dd5b6
Merge branch 'master' into develop-gym-wrapper
5 年前
Arthur Juliani
89ad3020
Merge remote-tracking branch 'origin/master' into develop-add-fire
# Conflicts:
# ml-agents/mlagents/trainers/policy/tf_policy.py
5 年前
Arthur Juliani
be7e55e1
Use LSTM and fix a few merge errors
5 年前
Arthur Juliani
b7be7f04
Fix bug in probs calculation
5 年前
Arthur Juliani
3eef9d78
Optimize np -> tensor operations
5 年前
Arthur Juliani
c02e75d6
Time action sample function
5 年前
Arthur Juliani
039f545a
Small performance improvement during inference
5 年前
Ervin Teng
565f92ef
Seems to speed it up
5 年前
Christopher Goy
ba80b292
format files with pre-commit.
4 年前
Ervin Teng
2fae31e6
Remove another if statement
5 年前
GitHub
f7373172
Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2
update verified brach with barracuda 1.0.2
4 年前
Ervin Teng
72180f9b
Experiment with JIT compiler
5 年前
GitHub
abbc6424
[bug-fix] Fix issue with initialize not resetting step count ( #3962 )
5 年前
vincentpierre
6ddfe74f
Merge branch 'master' into develop-gym-wrapper
5 年前
Arthur Juliani
28e095e0
Merge remote-tracking branch 'origin/master' into develop-add-fire
5 年前
Ervin Teng
f214836a
Changes for speed test
5 年前
GitHub
e92b4f88
[refactor] Structure configuration files into classes ( #3936 )
5 年前
GitHub
335cff3e
[versioning] Save ML-Agents version in checkpoints and check on load ( #4035 )
5 年前
Andrew Cohen
e7750fc9
Merge branch 'master' into develop-sampler-refactor
5 年前
Andrew Cohen
34ecc7e6
Merge branch 'master' into asymm-envs
5 年前
GitHub
a1c63c4b
Release 3 Cherry-pick bug-fixes and doc changes from master ( #4102 )
* [bug-fix] Fix regression in --initialize-from feature (#4086 )
* Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085 )
* [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087 )
* Reverting bug introduced in #4071 (#4101 )
Co-authored-by: Scott <Scott.m.jordan91@gmail.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
5 年前
GitHub
8a49e8e0
[refactor] Remove nonfunctional `output_path` option from TrainerSettings ( #4087 )
5 年前
Anupam Bhatnagar
4afd8f92
first commit
5 年前
Arthur Juliani
9724c9ac
Merge master
5 年前
Arthur Juliani
46874cc7
ONNX exporting
5 年前
yanchaosun
c2d6f5c0
basic implementation
5 年前
yanchaosun
a9c788d7
new model
5 年前
Arthur Juliani
5d33aca7
Remove double setting
5 年前
GitHub
0d80d87a
Fix for discrete actions ( #4181 )
5 年前
Anupam Bhatnagar
24d5f881
first commit
5 年前
Ervin Teng
510583d2
Move memory validation to settings
5 年前
yanchaosun
ac4c80c2
integrate the implementation and hyperparameters
5 年前
Ervin Teng
68169434
Fix discrete actions and GridWorld
5 年前
GitHub
cde8bd29
Convert List[np.ndarray] to np.ndarray before using torch.as_tensor ( #4183 )
Big speedup in visual obs
5 年前
GitHub
bb675bf4
Merge pull request #4134 from Unity-Technologies/develop-removebrainnamepolicy
[refactor] Remove references to brain_name in policy
5 年前
yanchaosun
f0881a94
fix commands for cloud training
5 年前
GitHub
05a11c96
Develop add fire exp framework ( #4213 )
* Experiment branch for comparing torch
* Updates and merging ervin changes
* improvements on experiment_torch.py
* Better printing of results
* preliminary gpu experiment
* Testing gpu
* Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two
* Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two
* _
* _
* _
* _
* _
* _
* _
* _
* Attempt at gpu on tf. Does not work
* _
* _
* _
* _
* _
* _
* _
* _
* _
* _
* _
* Fixing learn.py
5 年前
yanchaosun
05a96355
remove slim package
5 年前
GitHub
45154f52
Pytorch port of SAC ( #4219 )
5 年前
yanchaosun
ad95032b
transfer path
5 年前
GitHub
a28e2767
Update add-fire to latest master, including Policy refactor ( #4263 )
* Update Dockerfile
* Separate send environment data from reset (#4128 )
* Fixed a typo on ML-Agents-Overview.md (#4130 )
Fixed redundant "to" word from the sentence since it is probably a typo in document.
* Updated the badge’s link to point to the newest doc version
* Replaced all of the doc to release_3_doc
* Fix 3DBall and 3DBallHard SAC regressions (#4132 )
* Move memory validation to settings
* Update docs
* Add settings test
* Update to release_3 in installation.md (#4144 )
* rename to SideChannelManager +backcompat (#4137 )
* Remove comment about logo with --help (#4148 )
* [bugfix] Make FoodCollector heuristic playable (#4147 )
* Make FoodCollector heuristic playable
* Update changelog
* script to check for old release links and references (#4153 )
* Remove package validation suite from Project (#4146 )
* RayPerceptionSensor: handle empty and invalid tags (#4155...
5 年前
GitHub
69579611
[refactor] Refactor Actor and Critic classes ( #4287 )
4 年前
Ruo-Ping Dong
6feec58a
add Saver class (only TF working)
4 年前
Ervin Teng
bd97532d
Add normalizer update context
5 年前
Ruo-Ping Dong
9449d711
fix onnx save path and output_name
5 年前
yanchaosun
a80915a8
yaml update
5 年前
Andrew Cohen
f74d301a
Merge branch 'develop-add-fire' into develop-add-fire-bc
4 年前
Ruo-Ping Dong
6d67f857
move tf and add torch model serialization
4 年前
yanchaosun
666c8ba9
new cloud training change
5 年前
Ruo-Ping Dong
01e60921
add sac checkpoint
4 年前
vincentpierre
599d7e9f
Merging master
5 年前
yanchaosun
5eccb4c9
new transfer test for cloud
5 年前
HH
7afa1761
Merge branch 'master' into hh/develop/ragdoll-updates
5 年前
yanchaosun
858b97ec
bug fix
5 年前
Ruo-Ping Dong
4e87b422
move checkpoint_path logic to saver
4 年前
yanchaosun
da87eae9
predict model fix
5 年前
Ervin Teng
884c97ce
Fix policy memory storinig
4 年前
yanchaosun
d1e8d344
with swish activation
5 年前
Ruo-Ping Dong
71fe4df6
fix formatting and test
4 年前
Ruo-Ping Dong
b4713baa
small improvements
4 年前
Ruo-Ping Dong
79d89158
Merge branch 'develop-add-fire' into develop-add-fire-checkpoint
4 年前
GitHub
3bcb029b
[refactor] Remove BrainParameters from Python code ( #4138 )
5 年前
Ruo-Ping Dong
e06812aa
fix tests
4 年前
GitHub
839eb2cb
Develop model transfer test ( #4214 )
* test env, and code integration
* delete results
5 年前
yanchaosun
7e3216ae
simple env test
5 年前
yanchaosun
cdaaa318
bisim
5 年前
yanchaosun
3d0d359c
bisimulation draft
5 年前
yanchaosun
1fdbfe65
no normalization
5 年前
yanchaosun
5a778ca3
fix normalization
5 年前
yanchaosun
a212fef9
new bisim implementation
5 年前
HH
0fdac847
Merge branch 'master' into hh/develop/crawler-ragdoll-updates
5 年前
yanchaosun
0e2f6e19
small fix
5 年前
yanchaosun
ec929746
minor update
5 年前
GitHub
84440f05
Convert checkpoints to .NN ( #4127 )
This change adds an export to .nn for each checkpoint generated by
RLTrainer and adds a NNCheckpointManager to track the generated
checkpoints and final model in training_status.json.
Co-authored-by: Jonathan Harper <jharper+moar@unity3d.com>
5 年前
Andrew Cohen
d0133066
working
5 年前
yanchaosun
9bc90956
fix bug with bisimulation
5 年前
Andrew Cohen
b6bf1860
fix bisim metric
5 年前
yanchaosun
f8b91faa
try to fix the bisim metric
5 年前
yanchaosun
ce36349b
some changes
5 年前
Andrew Cohen
1b17ae56
add tanh activ
5 年前
yanchaosun
7508a130
small fix
5 年前
Arthur Juliani
6bee0fd1
Merge master
5 年前
yanchaosun
caeffa3e
add two envs
5 年前
Andrew Cohen
5fa28f5f
merge YC changes
5 年前
yanchaosun
28355444
bisim fix, disable stop gradient
5 年前
yanchaosun
3246570c
added action encoder, and flags related with action training/transferring; set model_schedule as a changable hyperparameter
5 年前
GitHub
9f041970
Develop bisim action encoder, incorporate related hyperparameter settings ( #4253 )
5 年前
GitHub
1f5eb9da
add pyupgrade to pre-commit and run ( #4239 )
5 年前
GitHub
129f9ddc
[MLA-427] make pyupgrade convert f-strings too ( #4244 )
* make pyupgrade convert f-strings too
5 年前
yanchaosun
80bad241
init sac transfer, and added action encoder to bisim; configs for crawler
5 年前
yanchaosun
a505cb16
new config
5 年前
yanchaosun
b991096b
update target encoder soft copy
5 年前
Andrew Cohen
d8c123a0
Merge branch 'master' into sensitivity
5 年前
Andrew Cohen
02df39ab
ignore precommit
5 年前
Andrew Cohen
0c7db26a
target encoder
5 年前
yanchaosun
b74294bf
target encoders and new forward loss
5 年前
Andrew Cohen
fa35292c
write hist to tb
5 年前
yanchaosun
0c468084
sac transfer implementation; disable action encoder
5 年前
GitHub
1b098c9a
Refactor TFPolicy and Policy ( #4254 )
* Refactor TFPolicy and Policy
5 年前
yanchaosun
0a1a30d3
sac update
5 年前
GitHub
380fef57
[refactor] Move TF-specific files to tf/ folder ( #4266 )
5 年前
GitHub
beb5aca5
[refactor] Make classes except Optimizer framework agnostic ( #4268 )
5 年前
yanchaosun
00bb821c
fix sac transfer problems
5 年前
Andrew Cohen
302e8e77
no action encoder
5 年前
Andrew Cohen
06e4356c
Merge branch 'master' into sensitivity
5 年前
yanchaosun
2b67d1a6
fix crawler config
5 年前
Andrew Cohen
9d7ed6cc
op buffer
5 年前
Andrew Cohen
9c012d6a
no op buffer no acen
5 年前
Arthur Juliani
1a123641
Merge remote-tracking branch 'origin/master' into r5-master
5 年前
Andrew Cohen
2dec257c
no encoder for single task
5 年前
Andrew Cohen
18ff42a6
use mean of first trajectory to initialize the normalizer
4 年前
yanchaosun
6df774ed
update: separate model train as an option
5 年前
Andrew Cohen
5878b952
remove blank line
4 年前
Andrew Cohen
ce9bcefe
cleaned up initialization of variance/mean
4 年前
Ruo-Ping Dong
95858e25
update saver interface and add tests
4 年前
GitHub
9dc1d99e
Initialize normalizer with mean/variance from first trajectory ( #4299 )
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
GitHub
ab8e5afa
Release 6 fix nan ( #4343 )
* test initalize steps to 100
* use mean of first trajectory to initialize the normalizer
* remove blank line
* update changelog
* cleaned up initialization of variance/mean
* large normalization obs unit test
* add --upgrade to pip to get newer downloader (#4338 )
* Fix format of the changelog for validation. (#4340 )
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Chris Goy <christopherg@unity3d.com>
4 年前
GitHub
f374f87a
[add-fire] Add LSTM to SAC, LSTM fixes and initializations ( #4324 )
4 年前
Anupam Bhatnagar
87bdf353
[skip ci] save model on worker zero only
4 年前
Anupam Bhatnagar
d3e8f124
removing horovod from tf policy
4 年前
Anupam Bhatnagar
abc1220f
Merge branch 'master' into global-variables
4 年前
Andrew Cohen
2cd0de04
action enc
5 年前
Chris Elion
d2133d83
comments and cleanup
4 年前
HH
8eaddb61
Merge branch 'master' into hh/develop/loco-walker-variable-speed
4 年前
Andrew Cohen
12f3786c
Revert "action enc"
This reverts commit 9acb5f846a77a47d14694cb1271e2a370f395b6a.
5 年前
yanchaosun
3ce88589
1 layer everything
5 年前
Ruo-Ping Dong
59cc1a9f
Merge branch 'develop-add-fire' into develop-add-fire-checkpoint
4 年前
GitHub
76deba36
Merge pull request #4334 from Unity-Technologies/global-variables
Adding rank to ml-agents
4 年前
GitHub
25dc8c3d
Add Saver Class to handle all save/load/checkpoint/export work ( #4323 )
4 年前
Ervin Teng
cb0085a7
Memory size abstraction and fixes
4 年前
Ervin Teng
d65a9326
Merge branch 'master' into develop-add-fire-mm3
4 年前
Ruo-Ping Dong
d57aa9ab
Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint
4 年前
Andrew Cohen
463db9e8
backprop enc single task
5 年前
GitHub
bd6bcd2f
Merge master and add Saver class for save/load checkpoints
4 年前
Andrew Cohen
12eda929
try reload all
5 年前
Ervin Teng
42e25b25
Merge branch 'develop-add-fire' into develop-add-fire-memoryclass
4 年前
GitHub
8985a040
Removing the experiment script from add fire ( #4373 )
* Removing the experiment script
* Removing the script
4 年前
Andrew Cohen
b822283f
merge add fire
4 年前
Ervin Teng
6e946dba
Policy bugfixes and policy tests
4 年前
Ervin Teng
9ae22c61
Fix SeparateActorCritic export
4 年前
Christopher Goy
5a233353
Merge remote-tracking branch 'origin/master' into release_6-to-master
4 年前
GitHub
03eac72c
[add-fire] Add tests and fix issues with Policy ( #4372 )
4 年前
Andrew Cohen
a65d08c7
ghost trainer tests
4 年前
Ervin Teng
116303f1
Typing for torch policy
4 年前
GitHub
49545ce1
Pytorch ghost trainer ( #4370 )
4 年前
Andrew Cohen
71f9c241
fix tf policy for ghosts
4 年前
GitHub
6a1d993f
[add-fire] Memory class abstraction ( #4375 )
4 年前
GitHub
2332bc32
Add fire to test_simple_rl.py ( #4378 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ervin T <ervin@unity3d.com>
4 年前
yanchaosun
3762358d
fix action stop gradient
5 年前
vincentpierre
349cee77
Fix of the test for multi visual input
4 年前
Andrew Cohen
20083987
move tf policy comment
4 年前
Ervin Teng
a04e68a4
Merge branch 'develop-add-fire' into develop-add-fire-memoryclass
4 年前
Andrew Cohen
effdec13
return copy of state_dict
4 年前
yanchaosun
3ed56471
remove bi-forward-loss
5 年前
Ervin Teng
8ff8c401
Merge branch 'develop-add-fire' into develop-add-fire-export
4 年前
yanchaosun
c5d9e376
add bi-forward-loss back
5 年前
GitHub
1955af9e
[feature] Add experimental PyTorch support ( #4335 )
* Begin porting work
* Add ResNet and distributions
* Dynamically construct actor and critic
* Initial optimizer port
* Refactoring policy and optimizer
* Resolving a few bugs
* Share more code between tf and torch policies
* Slightly closer to running model
* Training runs, but doesn’t actually work
* Fix a couple additional bugs
* Add conditional sigma for distribution
* Fix normalization
* Support discrete actions as well
* Continuous and discrete now train
* Mulkti-discrete now working
* Visual observations now train as well
* GRU in-progress and dynamic cnns
* Fix for memories
* Remove unused arg
* Combine actor and critic classes. Initial export.
* Support tf and pytorch alongside one another
* Prepare model for onnx export
* Use LSTM and fix a few merge errors
* Fix bug in probs calculation
* Optimize np -> tensor operations
* Time action sample funct...
4 年前
yanchaosun
2e927257
separate policy net
4 年前
vincentpierre
108fac9a
Replace torch.detach().cpu().numpy() with a utils method
4 年前
HH
d9962254
Merge branch 'master' into hh/develop/loco-walker-variable-speed
4 年前
Anupam Bhatnagar
5e8aa485
renaming file from globals.py to global_values.py
4 年前
Anupam Bhatnagar
71c301bc
minor fixes
4 年前
Anupam Bhatnagar
1f60979f
[skip ci] change self.rank to global_values.get_rank()
4 年前
Anupam Bhatnagar
f4f1a8d9
merge master into trainer-plugin branch
4 年前
yanchaosun
1ce53c55
discrete action
4 年前
GitHub
498934f9
Replace torch.detach().cpu().numpy() with a utils method ( #4406 )
* Replace torch.detach().cpu().numpy() with a utils method
* Using item() in place of to_numpy()
* more use of item() and additional tests
4 年前
Ruo-Ping Dong
27fb4270
brain_name to behavior_name
4 年前
GitHub
bfda9576
Replace brain_name with behavior_name ( #4419 )
brain_name -> behavior_name
some prob -> log_prob in comments
rename files optimizer -> optimizer_tf for tensorflow
4 年前
Ruo-Ping Dong
fd1dc3a6
Merge branch 'master' into develop-torch-omp
4 年前
Ruo-Ping Dong
f5dee9d1
jit for continuous control
4 年前
GitHub
6f534366
Add torch_utils class, auto-detect CUDA availability ( #4403 )
* Add torch_utils
* Use torch from torch_utils
* Add torch to banned modules in CI
* Better import error handling
* Fix flake8 errors
* Address comments
* Move networks to GPU if enabled
* Switch to torch_utils
* More flake8 problems
* Move reward providers to GPU/CPU
* Remove anothere set default tensor
* Fix banned import in test
4 年前
Ervin Teng
fdc887a1
Some experimental stuff
4 年前
Ervin Teng
f59f35ea
Remove stuff in policy
4 年前
Andrew Cohen
3997b14b
Merge branch 'master' into develop-hybrid-actions
4 年前
Ervin Teng
3e771cbb
Permute visual obs outside of network
4 年前
Ervin Teng
77c810fb
Fix SAC and make utility method
4 年前
Ervin Teng
7754ad7b
Don't run value during inference
4 年前
Ervin Teng
b6095151
Execute critic with LSTM
4 年前
GitHub
4e4ad7b0
Don't run value during policy evaluate, optimized soft update function ( #4501 )
* Don't run value during inference
* Execute critic with LSTM
* Address comments
* Unformat
* Optimized soft update
* Move soft update to model utils
* Add test for soft update
4 年前
Ervin Teng
f9ff3efe
Merge branch 'develop-policyonly' into develop-sac-targetq
4 年前
Andrew Cohen
643c8e58
ppo extended
4 年前
Andrew Cohen
db37db34
fixing errors
4 年前
Andrew Cohen
44c9879e
action models
4 年前
Andrew Cohen
c494bfcc
trains successfully
4 年前
GitHub
badca342
Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX ( #4540 )
4 年前
Andrew Cohen
190d8e4d
action model as a singleton
4 年前
Ervin Teng
8dec4771
Add hybrid actions to SAC
4 年前
Ervin Teng
be159ad3
Make entropy reporting same as TF
4 年前
Ervin Teng
81342148
Revert "Add hybrid actions to SAC"
This reverts commit a759b36a51df4f8f1fd296f9f148269f0f026e42.
4 年前
Andrew Cohen
fc3027ac
tf tests except gail pass
4 年前
Andrew Cohen
e5f14400
Merge branch 'master' into develop-hybrid-actions-singleton
4 年前
GitHub
e4db5dc5
ActionSpec and ActionBuffer ( #4578 )
4 年前
Andrew Cohen
7827ca06
add ActionSpec; test_simple_rl torch passes
4 年前
Andrew Cohen
eaecb59e
torch utils to and from buffer
4 年前
Andrew Cohen
8013e544
ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member)
4 年前
Andrew Cohen
f654df34
fixing tensorflow tests
4 年前
GitHub
e0ef30a5
[bug-fix] Change entropy computation and loss reporting in Torch to match TF ( #4538 )
* Proper dimensions for entropy, sum before bonus in PPO
* Make entropy reporting same as TF
* Always use separate critic
* Revert to shared
* Remove unneeded extra line
* Change entropy shape in test
* Change another entropy shape
* Add entropy summing to evaluate_actions
* Add notes about torch.abs(policy_loss)
4 年前
GitHub
cb8e4d25
Add ActionSpec ( #4586 )
Co-authored-by: Ervin T <ervin@unity3d.com>
4 年前
Andrew Cohen
9689cf2c
remove *_action_* from function names
4 年前
vincentpierre
a3a9a56b
Merge branch 'exp-multi-head-attention' into exp-bullet-hell
4 年前
Ruo-Ping Dong
9e08be87
Merge branch 'master' into release_9_branch_merge
4 年前
Andrew Cohen
6cf54bf2
remove self.action_spec from policy/bc
4 年前
GitHub
b853e5ba
Action buffer ( #4612 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
3c96a3a2
Action Model ( #4580 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
88d3ec3e
Merge master into hybrid actions staging branch ( #4704 )
4 年前
GitHub
87a7ccf8
use int64 steps, check for NaN actions ( #4607 )
* use int64 steps
* check for NaN actions
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
4 年前
GitHub
23800f33
Merge branch 'master' into develop-action-spec
4 年前
GitHub
85a7c0f7
[bug-fix] Add clipping to PyTorch policy, fix initialization ( #4649 )
4 年前
Ervin Teng
0548057d
Use real clipping (as in TF)
4 年前
GitHub
733bffbf
use int64 steps, check for NaN actions ( #4607 ) ( #4654 )
* use int64 steps
* check for NaN actions
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
Andrew Cohen
b6d10456
removed action_spec.size
4 年前
GitHub
8175d558
[bug-fix] Fix BC module + action clipping ( #4667 )
4 年前
GitHub
2a8c6800
[bug-fix] Add clipping to PyTorch policy, fix initialization ( #4649 ) ( #4662 )
4 年前
Ruo-Ping Dong
953cb6bb
Merge branch 'master' into develop-windows-delay
4 年前
Andrew Cohen
1bbe492c
fixed tests/ -> single validate_action func
4 年前
Ruo-Ping Dong
ee5313e4
Merge branch 'master' into develop-windows-delay
4 年前
Ervin Teng
78f88c15
Add clip to export and make optional in policy
4 年前
GitHub
f0ed3a38
Cherry-pick BC fixes to Release 10 ( #4668 )
4 年前
Andrew Cohen
3f771e61
add ActionBuffers and utils
4 年前
Ervin Teng
7a0ebfbd
Pretty broken
4 年前
Ervin Teng
95bdbba3
Less broken PPO
4 年前
vincentpierre
b863af57
Removing TensorFlow Trainers
4 年前
Ervin Teng
98948c59
Skip critic when given empty memory array
4 年前
Ervin Teng
4158629e
Properly feed in None rather than empty arrays
4 年前
Andrew Cohen
d8dd5e44
Merge branch 'master' into develop-action-buffer
4 年前
Andrew Cohen
bd917c9c
action buffer passes continuous
4 年前
Andrew Cohen
b36fcf16
discrete runs/cont passes
4 年前
Andrew Cohen
ad951493
debugging discrete
4 年前
Andrew Cohen
fcf6471e
2d discrete passes
4 年前
Ervin Teng
fdaa8c3d
Merge branch 'develop-unified-obs' into develop-centralizedcritic
4 年前
GitHub
990f801a
Develop hybrid action staging ( #4702 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
Co-authored-by: Ruo-Ping Dong <ruoping.dong@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
vincentpierre
735fcd52
[WIP] Refactor trainers to use list of obs rather than vec and vis obs
4 年前
Ervin Teng
6846af21
Multi-input network
4 年前
Andrew Cohen
85e4db33
bc tests pass
4 年前
vincentpierre
93ca1409
fixing the tests
4 年前
vincentpierre
7a5cc9ec
Merge master into develop-rm-tf
4 年前
Ervin Teng
cb4b7ed3
Some minor tweaks but still broken
4 年前
vincentpierre
12619155
added some docstrings
4 年前
vincentpierre
c1587bce
Solving merge conflicts
4 年前
Andrew Cohen
8172b3d6
test_simple_rl/reward providers pass tf/torch
4 年前
Andrew Cohen
4ebc6c44
ml-agents-envs pass
4 年前
GitHub
8ab2e619
update type of evaluate_actions to list tensor ( #4747 )
4 年前
Arthur Juliani
0d2f8887
Merge remote-tracking branch 'origin/master' into goal-conditioning
# Conflicts:
# ml-agents-envs/mlagents_envs/base_env.py
# ml-agents-envs/mlagents_envs/rpc_utils.py
# ml-agents/mlagents/trainers/tests/mock_brain.py
# ml-agents/mlagents/trainers/tests/simple_test_envs.py
4 年前
GitHub
a0d1c829
Action Docs part2 ( #4739 )
* reduce usage of "vector action" and "action space"
* more cleanup
* undo GettingStarted change for now
* batch size description
* Apply suggestions from code review
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
4 年前
GitHub
cc6b4564
Multi Directional Walker and Initial Hypernetwork ( #4740 )
4 年前
Ervin Teng
25dfd883
Merge branch 'master' into develop-centralizedcritic
4 年前
vincentpierre
bc9d3975
merge master
4 年前
Andrew Cohen
cd73cce2
test_trajectory fixed
4 年前
GitHub
ad5f878c
[refactor] Remove critic pass during inference ( #4743 )
4 年前
GitHub
11687f8d
[cherry-pick] Cherry-pick #4743 into Release 11 ( #4756 )
4 年前
GitHub
22658a40
use sensor types to differentiate obs ( #4749 )
4 年前
GitHub
903d3afe
Merge pull request #4707 from Unity-Technologies/develop-rm-tf
Removing TensorFlow Trainers
4 年前
vincentpierre
14378aa5
Merging master
4 年前
Andrew Cohen
97d94a83
fix test_tf_policy
4 年前
vincentpierre
0c81006d
addressing comments
4 年前
vincentpierre
8cb050ef
WIP Made initial changes to enale dimension properties and added attention module
4 年前
Andrew Cohen
498b1ee6
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
GitHub
c2783e70
remove "discrete nan" check ( #4770 ) ( #4775 )
4 年前
GitHub
525a4308
remove "discrete nan" check ( #4770 )
4 年前
Andrew Cohen
6174c428
move action model to explicit distributions
4 年前
vincentpierre
c3699de8
merging master and addressing comments
4 年前
Andrew Cohen
453a2bba
ActionTuple default is now np.array, not None
4 年前
GitHub
29d94c7c
Merge pull request #4734 from Unity-Technologies/develop-obs-as-list
Refactor trainers to use list of obs rather than vec and vis obs
4 年前
Andrew Cohen
1d234d1d
bc works
4 年前
vincentpierre
719c969c
addressing comments. ObservationSpec is no longer a list
4 年前
vincentpierre
4bba4e8e
Renaming ObservationSpec to SensorSpec
4 年前
Andrew Cohen
c0d01baf
Merge branch 'master' into merge-release11-master
4 年前
Andrew Cohen
95566e44
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
vincentpierre
c5a057d2
renaming obs_spec variables
4 年前
Andrew Cohen
e81e68de
comms agent and fixed hallway
4 年前
vincentpierre
44ed3258
Merging master
4 年前
Andrew Cohen
3457cd3c
save only discrete actions as prev
4 年前
Andrew Cohen
da5b8420
update make_empty docstring
4 年前
Andrew Cohen
411b0a19
reuse action dict in torch policy for pre_action
4 年前
vincentpierre
449712b0
renaming sensor_spec to sensor_specS
4 年前
Andrew Cohen
35769b53
Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton
4 年前
Andrew Cohen
17496265
move AgentAction, ActionLogProbs, and ActionFlattener to separate files
4 年前
Chris Elion
76ebc20c
Merge remote-tracking branch 'origin/master' into r12-to-master
4 年前
GitHub
458fee17
Merge pull request #4763 from Unity-Technologies/develop-att
WIP Made initial changes to enable dimension properties and added attention module
4 年前
vincentpierre
36cc4665
Removing some vis and vec fields from policy.py
4 年前
GitHub
9d8a7d6f
Update ml-agents/mlagents/trainers/policy/tf_policy.py
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
Ervin Teng
330fc1d0
Merge branch 'master' into develop-centralizedcritic-mm
4 年前
Andrew Cohen
60309d8f
fix torch policy tests
4 年前
vincentpierre
519c5f47
merging master
4 年前
Ruo-Ping Dong
8ed14762
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Andrew Cohen
7ba10239
remove action spec attribute from policy
4 年前
vincentpierre
318622a6
removing the extra code to set normalize to false if no observation can be normalized
4 年前
Andrew Cohen
662fd6b1
added docstrings to action flattener
4 年前
GitHub
a272bff0
Merge pull request #4803 from Unity-Technologies/develop-remove-vec-vis-fields
Remove some vis and vec fields from Policy
4 年前
GitHub
cc948a41
Policy output actiontuple ( #4651 )
4 年前
GitHub
7387a77f
remove pylint ( #4836 )
* remove pylint
* remove other pylint disables
4 年前
Andrew Cohen
886883b3
Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton
4 年前
Arthur Juliani
0b4b0992
Rename more files
4 年前
Ervin Teng
aba633b2
Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm
4 年前
Andrew Cohen
85b18389
fix test tf policy
4 年前
Ruo-Ping Dong
a7d04be6
Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp
4 年前
Arthur Juliani
0a876b9c
Fix typos
4 年前
Arthur Juliani
e3de0406
Plurals
4 年前
Ruo-Ping Dong
180d3e20
Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager
4 年前
HH
0024a286
merge ervin's new stuff
4 年前
GitHub
67ad9651
Merge pull request #4825 from Unity-Technologies/sensor-types
[WIP] Observation Types
4 年前
vincentpierre
8660b1c2
merging master
4 年前
Ervin Teng
457b2630
I think it's running
4 年前
brccabral
457fb612
Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents
4 年前
Andrew Cohen
6e1826f8
might be right
4 年前
vincentpierre
52b011d6
_
4 年前
vincentpierre
6f3ea7b8
_
4 年前
Andrew Cohen
a4c336c2
value estimator
4 年前
Andrew Cohen
9af22d30
use only value funcs
4 年前
Ervin Teng
3283b6a1
Remove Q-net for perf
4 年前
Ervin Teng
b6f88d6d
Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager
4 年前
Andrew Cohen
f73b9dba
update policy to not use critic
4 年前
Andrew Cohen
9b92f5fb
remove commented code
4 年前
Ervin Teng
0bde7598
Back out trainer changes
4 年前
Ervin Teng
514873bf
Use correct memories (t-1 instead of t) for training
4 年前
Ervin Teng
24ee4bd5
Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer
4 年前
Ervin Teng
7014404a
Add presence check
4 年前
Andrew Cohen
c74dca9f
add SharedActorCritic
4 年前
Ervin Teng
f3a2a81f
Merge branch 'develop-fix-lstms' into develop-gru
4 年前
Andrew Cohen
00b891df
fix sac shared
4 年前
Ervin Teng
219e773b
Merge branch 'develop-fix-lstms' into develop-critic-op-lstm
4 年前
Ervin Teng
e46a86ad
Merge branch 'master' into develop-superpush-int
4 年前
HH
15d512f9
Merge branch 'master' into hh/develop/dodgeball
4 年前
GitHub
338af2ec
Move the Critic into the Optimizer ( #4939 )
Co-authored-by: Ervin Teng <ervin@unity3d.com>
4 年前
HH
4c947151
Merge branch 'main' into hh/develop/dodgeball
4 年前
Ervin Teng
61781a1a
Merge branch 'main' into develop-agentprocessor-teammanager
4 年前
Andrew Cohen
9060da06
Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer
4 年前
Arthur Juliani
06c147f8
Merge remote-tracking branch 'origin/main' into goal-conditioning-new
# Conflicts:
# Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab
# Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity
# Project/ProjectSettings/TagManager.asset
# com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs
# com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs
# ml-agents/mlagents/trainers/torch/networks.py
# ml-agents/mlagents/trainers/torch/utils.py
4 年前
GitHub
d36a5242
Python Dataflow for Group Manager ( #4926 )
* Make buffer type-agnostic
* Edit types of Apped method
* Change comment
* Collaborative walljump
* Make collab env harder
* Add group ID
* Add collab obs to trajectory
* Fix bug; add critic_obs to buffer
* Set group ids for some envs
* Pretty broken
* Less broken PPO
* Update SAC, fix PPO batching
* Fix SAC interrupted condition and typing
* Fix SAC interrupted again
* Remove erroneous file
* Fix multiple obs
* Update curiosity reward provider
* Update GAIL and BC
* Multi-input network
* Some minor tweaks but still broken
* Get next critic observations into value estimate
* Temporarily disable exporting
* Use Vince's ONNX export code
* Cleanup
* Add walljump collab YAML
* Lower max height
* Update prefab
* Update prefab
* Collaborative Hallway
* Set num teammates to 2
* Add config and group ids to HallwayCollab
* Fix bug with hallway collab
* E...
4 年前
Ervin Teng
fd0dd35c
Merge branch 'main' into develop-coma2-trainer
4 年前
Ervin Teng
c8137dcd
Merge branch 'main' into develop-superpush-int
4 年前
GitHub
f16ce486
Update v2-staging from main (March 15) ( #5123 )
4 年前
Christopher Goy
921ba4f0
Update v2-staging from main (March 15) ( #5123 )
4 年前
Christopher Goy
ebe45056
Merge branch 'main' into release_14_branch-to-main
4 年前
Chris Elion
970f1d40
Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec
4 年前
GitHub
2e19759c
Turning some logger.info into logger.debug and remove some logging overhead when not using debug ( #5211 )
* turning some logger.info into logger.debug and remove some logging overhead when not using debug
* Addressing comments
* Adding to changelog
4 年前
GitHub
28eb43dd
[bug-fix] Delete .pt checkpoints past keep-checkpoints ( #5271 )
* Manage non-ONNX files with checkpoint manager too
* Update tests
* Update training status version
* Change ticking of status file version
4 年前
Miguel Alonso Jr
4846cf0f
Merge branch 'main' into develop-api-documentation-update
Updating with main.
4 年前