GitHub
c145e75b
Split Policy and Optimizer, common Policy for PPO and SAC ( #3345 )
5 年前
GitHub
97a1d4b1
[change] Remove the action_holder placeholder from the policy. ( #3492 )
5 年前
GitHub
7d954797
[change] Separate action outputs into OutputDistributions object ( #3514 )
5 年前
GitHub
e4177de0
[change] Organize trainer files a bit better ( #3538 )
5 年前
GitHub
870338b4
[bug-fix] Fix issue with more than one continuous actions ( #3547 )
5 年前
Anupam Bhatnagar
f4dbedcf
removed extraneous logging imports and loggers
5 年前
Anupam Bhatnagar
e8e0078e
first commit
5 年前
GitHub
94de596b
[change] Remove concatenate in discrete action probabilities to improve inference performance ( #3598 )
5 年前
Anupam Bhatnagar
001fce2a
first commit
5 年前
Arthur Juliani
8c6f4696
Fix a couple additional bugs
5 年前
Arthur Juliani
ca887743
Support tf and pytorch alongside one another
5 年前
Andrew Cohen
4a3ad193
Add constant decay to beta and epsilon
5 年前
Arthur Juliani
2b3a6347
Merge remote-tracking branch 'origin/master' into develop-add-fire
5 年前
Christopher Goy
ba80b292
format files with pre-commit.
4 年前
Andrew Cohen
1e50c76e
calculating gradient norms
5 年前
Andrew Cohen
0e965a4d
sensitivity
5 年前
Andrew Cohen
c1f91b5a
slightly nicer output
5 年前
Andrew Cohen
23b84dea
ignoring commit checks but write to csv
5 年前
Andrew Cohen
61aa9915
write to csv
5 年前
Andrew Cohen
d794964f
constant beta
5 年前
Andrew Cohen
13c2a209
added opp, decay eps removed
5 年前
GitHub
e92b4f88
[refactor] Structure configuration files into classes ( #3936 )
5 年前
Andrew Cohen
50e4585f
fixed beta
5 年前
Arthur Juliani
9724c9ac
Merge master
4 年前
GitHub
a28e2767
Update add-fire to latest master, including Policy refactor ( #4263 )
* Update Dockerfile
* Separate send environment data from reset (#4128 )
* Fixed a typo on ML-Agents-Overview.md (#4130 )
Fixed redundant "to" word from the sentence since it is probably a typo in document.
* Updated the badge’s link to point to the newest doc version
* Replaced all of the doc to release_3_doc
* Fix 3DBall and 3DBallHard SAC regressions (#4132 )
* Move memory validation to settings
* Update docs
* Add settings test
* Update to release_3 in installation.md (#4144 )
* rename to SideChannelManager +backcompat (#4137 )
* Remove comment about logo with --help (#4148 )
* [bugfix] Make FoodCollector heuristic playable (#4147 )
* Make FoodCollector heuristic playable
* Update changelog
* script to check for old release links and references (#4153 )
* Remove package validation suite from Project (#4146 )
* RayPerceptionSensor: handle empty and invalid tags (#4155...
4 年前
Ruo-Ping Dong
6feec58a
add Saver class (only TF working)
4 年前
GitHub
93517833
[feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed ( #4305 )
4 年前
GitHub
129f9ddc
[MLA-427] make pyupgrade convert f-strings too ( #4244 )
* make pyupgrade convert f-strings too
4 年前
Andrew Cohen
d8c123a0
Merge branch 'master' into sensitivity
4 年前
Andrew Cohen
02df39ab
ignore precommit
4 年前
Andrew Cohen
fa35292c
write hist to tb
4 年前
GitHub
380fef57
[refactor] Move TF-specific files to tf/ folder ( #4266 )
4 年前
Andrew Cohen
06e4356c
Merge branch 'master' into sensitivity
4 年前
GitHub
3f44a0bc
cleanup around AdamOptimizer ( #4333 )
* cleanup around AdamOptimizer
* methods to creat Optimizer instances
4 年前
Ruo-Ping Dong
d3eb6c46
Merge branch 'develop-add-fire' into develop-add-fire-checkpoint
4 年前
GitHub
25dc8c3d
Add Saver Class to handle all save/load/checkpoint/export work ( #4323 )
4 年前
Ervin Teng
d65a9326
Merge branch 'master' into develop-add-fire-mm3
4 年前
GitHub
b853e5ba
Action buffer ( #4612 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
3c96a3a2
Action Model ( #4580 )
Co-authored-by: Ervin T <ervin@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
88d3ec3e
Merge master into hybrid actions staging branch ( #4704 )
4 年前
Andrew Cohen
8172b3d6
test_simple_rl/reward providers pass tf/torch
4 年前
Andrew Cohen
cd73cce2
test_trajectory fixed
4 年前
GitHub
d2d46103
Remove print from ppo tf opti
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前