GitHub
9e2c30ee
Made the _check_environment_trains test a little easier to pass so the test will not randomly fail ( #2520 )
5 years ago
GitHub
12d57671
Changing Training-RewardSignals.md --> Reward-Signals.md ( #2525 )
5 years ago
Anupam Bhatnagar
d1b99bda
more small edits
5 years ago
GitHub
d80812be
Merge pull request #2526 from Unity-Technologies/develop-update-offline-bc
Update the offline_bc_config path
5 years ago
Anupam Bhatnagar
efe16491
added cloud training unsupported comment
5 years ago
Yuan Gao
0c42db82
Update the offline_bc_config path
5 years ago
GitHub
876aca1e
Use numpy for random sample in buffer ( #2524 )
5 years ago
Anupam Bhatnagar
baf25046
small edits
5 years ago
GitHub
36528481
Merge pull request #2522 from Unity-Technologies/develop-cleanupconfig
Clean up SAC config
5 years ago
Anupam Bhatnagar
2cd2048b
changes reflecting comments on github
5 years ago
GitHub
6f67cf40
unit test - don't use global random generator ( #2521 )
* unit test - don't use global random generator
* Update test_simple_rl.py
5 years ago
Anupam Bhatnagar
cc933115
adding colon
5 years ago
GitHub
7ec3d7ad
Merge pull request #2516 from Unity-Technologies/master
Merge 0.9.3 changes to develop
5 years ago
Ervin Teng
b1bfb9e8
Delete VisualBanana
5 years ago
Anupam Bhatnagar
fddede25
first commit
5 years ago
Jonathan Harper
2f083c8a
Renamed "StepInfo" to "EnvironmentStep"
This change was requested for clarity during the async EnvManager
PR. It's a simple rename of the StepInfo class.
5 years ago
Ervin T
6fb5b63c
Fix Baselines gym_unity example to work with the latest Baselines ( #2489 )
* This addresses #1835 . Baselines expects single environments used with their ppo2 algorithm to be wrapped in a DummyVecEnv. The old readme did not instruct the reader to do so and the code failed to run with the latest version of baselines. This imports the correct function from baselines and fixes the make_unity_env function described in the readme.
* added line to gym-unity/README.md to note the version of baselines the examples were tested with
5 years ago
GitHub
6a81a2f4
Add Soft Actor-Critic as trainer option ( #2341 )
* Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml
* Add documentation for SAC and tweak PPO documentation to reference the new pages.
* Add tests for SAC, change simple_rl test to run both PPO and SAC.
5 years ago
GitHub
25926795
initialize trainer step count ( #2498 )
* initialize trainer step count
* remove step init from RLTrainer
5 years ago
Ervin T
06d9678c
Minor fix to link to GAIL reward signal doc ( #2435 )
5 years ago
GitHub
4bb97e25
Fix bug with construct_curr_info ( #2490 )
* Fix bug with construct_curr_info
* Add more tests
5 years ago
Ervin Teng
28ef8983
Add 2 visual obs test
5 years ago
Ervin Teng
aca81efb
Add more tests
5 years ago
Ervin Teng
e0da93d1
Fix bug with construct_curr_info and test
5 years ago
GitHub
bf375235
Change update buffer to float32 instead of float64 ( #2461 )
- Reduces memory usage of buffer.
5 years ago
GitHub
3c1f4dbb
python coverage: specify dirs, exclude test files ( #2473 )
* specify dirs, exclude test files
* update comments
* html coverage in CI artifacts
* add destination
* ignore coverage files
* check gym-unity too
5 years ago
Jeffrey Shih
df64b64a
Fixed typo in Training-Imitation-Learning.md ( #2485 )
5 years ago
GitHub
ec7fd11d
Merge pull request #2451 from DanAmador/patch-1
Fixed small typo in documentation.
5 years ago
GitHub
cf9e67fb
Merge pull request #2470 from Unity-Technologies/release-0.9.2
Release 0.9.2 to develop
5 years ago
GitHub
c13828ce
Merge pull request #2474 from Unity-Technologies/release-0.9.2-flake8-fix
Fixed the flake8
5 years ago
Yuan Gao
33404e1b
Fixed the flake8
5 years ago
GitHub
df0196f9
Merge pull request #2472 from Unity-Technologies/release-0.9.2-multi-gpu-doc
Added the doc for multi-gpu
5 years ago
Yuan Gao
b9210f4c
Updated the comment for --multi-gpu option.
5 years ago
Yuan Gao
66205c3e
Added the doc for multi-gpu
5 years ago
GitHub
be66102d
Merge pull request #2471 from Unity-Technologies/setup.py-h5py-version
More flexibility on the h5py version
5 years ago
GitHub
0afd58fc
More flexibility on the h5py version
5 years ago
Yuan Gao
0c492fb7
Updated the model
5 years ago
Yuan Gao
f33830bc
Updated the python packages version to 0.9.2
5 years ago
GitHub
261ee0b6
Merge pull request #2457 from Unity-Technologies/hh/fix-training-NaN-errors-crawler
Fix NaN training errors for crawler
5 years ago
Hunter
83703d20
moved look rotation logic to avoid potential NaN LookRotation
5 years ago
GitHub
1b7045bf
Merge pull request #2448 from Unity-Technologies/hh/fix-broken-crawler-prefabs
fixed broken crawler prefabs
5 years ago
Hunter
45fe60db
fixed broken prefabs
5 years ago
GitHub
3880fd3a
Update development release version to 0.10.0.dev0 ( #2443 )
In order for downstream packages to make use of the latest
pre-release features, we can pre-release versions of our packages.
For packages ending in `devN` pip will not install that package
version by default. This change manually updates our package version
to a development version with the idea that we can manually perform
development versions with the potential for future automated / nightly
dev releases.
5 years ago
GitHub
43696d60
Fix bug in add_rewards_output and add test ( #2442 )
5 years ago
GitHub
689765d6
Modification of reward signals and rl_trainer for SAC ( #2433 )
* Adds evaluate_batch to reward signals. Evaluates on minibatch rather than on BrainInfo.
* Changes the way reward signal results are reported in rl_trainer so that we get the pure, unprocessed environment reward separate from the reward signals.
* Moves end_episode to rl_trainer
* Fixed bug with BCModule with RNN
5 years ago
Arthur Juliani
fa46be7f
Merge branch 'RunSwimFlyRich-master' into develop
5 years ago
GitHub
4abe89bc
Only call get_action on brains with policies ( #2437 )
5 years ago
GitHub
bd7eb286
Update reward signals in parallel with policy ( #2362 )
5 years ago
Jonathan Harper
e333abf8
Fixing compile error
Variable "model" is undefined.
5 years ago
GitHub
4472838e
Merge pull request #2421 from Unity-Technologies/hotfix-v0.9.1
Hotfix v0.9.1 - develop
5 years ago