GitHub
6a81a2f4
Add Soft Actor-Critic as trainer option ( #2341 )
* Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml
* Add documentation for SAC and tweak PPO documentation to reference the new pages.
* Add tests for SAC, change simple_rl test to run both PPO and SAC.
5 年前
GitHub
3df585d9
Fix issue where SAC encoder type is always simple ( #2548 )
5 年前
GitHub
3683cc1c
Enable learning rate decay to be disabled ( #2567 )
5 年前
Jonathan Harper
3fc14963
EXPERIMENTAL horovod support
5 年前
Jonathan Harper
47893e9c
minor tweaks
5 年前
Ervin Teng
35669d27
Fix SAC + LSTM Barracuda inference ( #2698 )
5 年前
Chris Elion
43e23941
rough pass at tf2 support, needs cleanup
5 年前
Chris Elion
806c77e4
centralize tensorflow imports
5 年前
Chris Elion
8da16bdb
move compat functions
5 年前
Chris Elion
254c7d86
Merge remote-tracking branch 'origin/develop' into try-tf2-support
5 年前
GitHub
c6c01a03
Enable pylint and fix a few things ( #2767 )
* enable pylint, disable some messages and fix a few
* SAC memories in init
5 年前
GitHub
4da157fe
more pylint fixes ( #2842 )
5 年前
Chris Elion
fca51de8
Merge remote-tracking branch 'origin/develop' into try-tf2-support
5 年前
Chris Elion
73a346cb
cleanup
5 年前
GitHub
f57b7ac6
Allow usage with tensorflow 2.0.0 (via tf.compat.v1) ( #2665 )
5 年前
GitHub
99981937
fix errors from new flake8-comprehensions ( #2917 )
5 年前
GitHub
42bea858
Improve mypy coverage by adding --namespace-packages ( #3049 )
5 年前
GitHub
1fa07edb
Remove Standalone Offline BC Training ( #2969 )
5 年前
GitHub
e536c09c
Remove unused tf.placeholder ( #3138 )
5 年前
GitHub
7fbf6b1d
add flake8-bugbear ( #3137 )
* unused loop variables
* change loop variable
5 年前
GitHub
f058b18c
Replace BrainInfos with BatchedStepResult ( #3207 )
5 年前