GitHub
beb5eb30
[bug-fix] Fixes for Torch SAC and tests ( #4408 )
* Fixes for Torch SAC and tests
* FIx recurrent sac test
* Properly update normalization for SAC-continuous
* Fix issue with log ent coef reporting in SAC Torch
4 年前
GitHub
4e6d46cc
[tests] Add tests for Torch PPO ( #4429 )
4 年前
GitHub
4e93cb6e
[torch] Restructure PyTorch encoders ( #4421 )
* Move linear encoding to NetworkBody
* moved encoders to processors (#4420 )
* fix bad merge
* Get it running
* Replace mentions of visual_encoders
* Remove output_size property
* Fix tests
* Fix some references
* Revert test_simple_rl
* Fix networks test
* Make curiosity test more accomodating
* Rename total_input_size
* [Bug fix] Fix bug in GAIL gradient penalty (#4425 ) (#4426 )
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
* Up number of steps
* Rename to visual_processors and vector_processors
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com>
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前
GitHub
cc10cd82
Worm Ragdoll & Env Updates ( #4413 )
* add worm updates
* add rewman
* cp
* normalize rewards
* only cookie
* try 20M. Add3.5Mnn file
* reduce strength to 3000spring
* facing reward troubleshooting
* Update WormAgent.cs
* troubleshoot nan
* try product of rewards
* train 5M steps
* try end episode on target touch
* fix joint obsv
* use 7M steps
* added nn file for observation joint fix. looks great
* don't end episode
* remove old code
* refactor to patterns used in walker & crawler
* add auto-setup code
* reformat
* use head vel
* remove unneeded observ. update prefabs
* update static scenes
* keeps rolling. added debug. try 5 m/s
* gate the facing reward based on angle tolerance
* added 10ms_angle30rew_nn files
* use fromto rot
* use 7M steps
* add new trained files. cleanup code and prefabs
* use avgvel. add code comments
* remove unused method
* add more comments
* Update Learning-E...
4 年前
GitHub
4eb47e2f
[docs] Update 'Record Demonstrations' documentation ( #4432 )
* [docs] Update 'Record Demonstrations' documentation
Updates a screenshot and documentation to include the newer `Num Steps To Record` field.
4 年前
GitHub
582859b6
New Crawler Variable Speed Scenes ( #4382 )
* init
* updating prefabs
* spawn a target
* add brains
* update static prefabs
* enable enhanced determinism
* reset manifest
* add nn files. update to 15M steps
* update prefabs
* increase max speed to 15
* add new local model for 15 speed
* update prefabs
* add configs
* update configs/prefabs
* cleanup
* added final nn models
* add new demos and do more cleanup.
* add meta files
* add RigidbodySensor
* update prefab. about to retrain
* remove body pen
* add fixed crawler & retrained nn file, new demos
* train 10M steps
* Update Crawler Docs
* more prefab cleanup
* add meta files
* Update Project/Assets/ML-Agents/Examples/Crawler/Scripts/CrawlerAgent.cs
Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
* remove unused prefab
* update comment
* add summary tags
* cleanup and add more comments
* remove unused prefab
* Update P...
4 年前
GitHub
06f788a4
Fix a few out-of-date things in CONTRIBUTING.md ( #4428 )
* Fix a few out-of-date things in CONTRIBUTING.md
* Update CONTRIBUTING.md
4 年前
GitHub
7b4d0865
[Bug fix] Fix bug in GAIL gradient penalty ( #4425 )
4 年前
GitHub
a117c932
Grid Sensor ( #4399 )
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
GitHub
6db9520c
enable enhanceddeterminism in physics settings ( #4423 )
4 年前
GitHub
bfda9576
Replace brain_name with behavior_name ( #4419 )
brain_name -> behavior_name
some prob -> log_prob in comments
rename files optimizer -> optimizer_tf for tensorflow
4 年前
Ervin Teng
1dca75d8
Move linear encoding to NetworkBody
4 年前
Ruo-Ping Dong
27fb4270
brain_name to behavior_name
4 年前
GitHub
0e590a49
[Feature] use extra_requirement to add torch as optional dependency to mlagents ( #4417 )
* Feature use extra_requirement to add torch as optional dependency to mlagents
- todo: documentation
* Update ml-agents/setup.py
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
Co-authored-by: Chris Elion <chris.elion@unity3d.com>
4 年前
GitHub
1076d275
Remove unused methods in trainer_controller.py ( #4418 )
4 年前
GitHub
3379734a
Run pytest on GitHub Actions ( #4416 )
4 年前
GitHub
498934f9
Replace torch.detach().cpu().numpy() with a utils method ( #4406 )
* Replace torch.detach().cpu().numpy() with a utils method
* Using item() in place of to_numpy()
* more use of item() and additional tests
4 年前
GitHub
4775e23c
ignore code coverage and api scrape output ( #4412 )
4 年前
GitHub
ffcb00d2
Update Background-Machine-Learning.md ( #4415 )
Added missing word in text.
4 年前
GitHub
9bf1c03f
precommit github action ( #4410 )
4 年前
GitHub
12e15e29
Fix on GAIL Torch when using actions ( #4407 )
4 年前
GitHub
d08bad06
Increase min supported tensorflow to 1.14.0 ( #4411 )
4 年前
GitHub
48f217b9
Rename Saver to ModelSaver ( #4402 )
Rename Saver to ModelSaver to avoid confusion with tf.Saver
4 年前
Ruo-Ping Dong
56feb8af
update test_saver_reward_providers.py
4 年前
Ruo-Ping Dong
88eff042
Merge branch 'master' into develop-saver-name
4 年前
Ruo-Ping Dong
e60c7038
Merge branch 'master' into develop-saver-name
4 年前
GitHub
328353bc
Torch : Saving/Loading of the reward providers ( #4405 )
* Saving the reward providers
* adding tests
* Moved the tests around
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
* Update ml-agents/mlagents/trainers/tests/torch/saver/test_saver_reward_providers.py
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
Co-authored-by: Ruo-Ping (Rachel) Dong <ruoping.dong@unity3d.com>
4 年前
Ruo-Ping Dong
07e82899
update torch saver test
4 年前
GitHub
38e9387b
Fix NNCheckpointManager for Torch
Fix NNCheckpointManager for Torch
4 年前
Ruo-Ping Dong
a74c904a
Merge branch 'master' into develop-saver-name
4 年前
GitHub
df685184
Make --torch use torch even without config ( #4400 )
* Make --torch use torch even without config
* Update ml-agents/mlagents/trainers/trainer_util.py
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
* Update ml-agents/mlagents/trainers/trainer_util.py
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
* Update ml-agents/mlagents/trainers/trainer_util.py
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
* renaming use_torch to force_torch
Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>
4 年前
GitHub
70197342
Add torch saver test
Add torch saver test
4 年前
Ruo-Ping Dong
f2a8c421
add torch saver test
4 年前
Ruo-Ping Dong
09c22679
fix NNCheckpointManager for Torch
4 年前
Ruo-Ping Dong
c47ffc20
Rename saver
4 年前
GitHub
a79aa854
[ci] Shorten max steps for strikergoalie ( #4394 )
4 年前
GitHub
1955af9e
[feature] Add experimental PyTorch support ( #4335 )
* Begin porting work
* Add ResNet and distributions
* Dynamically construct actor and critic
* Initial optimizer port
* Refactoring policy and optimizer
* Resolving a few bugs
* Share more code between tf and torch policies
* Slightly closer to running model
* Training runs, but doesn’t actually work
* Fix a couple additional bugs
* Add conditional sigma for distribution
* Fix normalization
* Support discrete actions as well
* Continuous and discrete now train
* Mulkti-discrete now working
* Visual observations now train as well
* GRU in-progress and dynamic cnns
* Fix for memories
* Remove unused arg
* Combine actor and critic classes. Initial export.
* Support tf and pytorch alongside one another
* Prepare model for onnx export
* Use LSTM and fix a few merge errors
* Fix bug in probs calculation
* Optimize np -> tensor operations
* Time action sample funct...
4 年前
GitHub
ea3944fc
DemonstrationRecorder: Add numStepsToRecord feature. ( #4381 )
* add numSteps property and update recorder script
* add changes based on PR suggestions
* make zero default. update logic
* updated comments & tooltip
4 年前
GitHub
a796b0f1
Merge pull request #4387 from Unity-Technologies/master-pin-xdist
Pin xdist version.
4 年前
Christopher Goy
c448eb65
Pin xdist version on verfided branch.
4 年前
GitHub
b337e62b
add build-in module dependencies ( #4384 )
* add build-in module dependencies
* changelog
4 年前
GitHub
abfadb3d
Reduce max steps for striker vs. goalie ( #4377 )
4 年前
GitHub
31919e08
[MLA-1267] Account for actuators in training and inference. ( #4371 )
4 年前
GitHub
5457ff38
Update README.md ( #4376 )
Adjusted readme to reflect new release
4 年前
GitHub
9cc4e0b2
update demonstration meta files ( #4367 )
4 年前
GitHub
99ccb7b9
Merge pull request #4369 from Unity-Technologies/release_6-to-master
Merge release 6 back into master
4 年前
GitHub
e7916b08
add pre-commit hook for dotnet-format ( #4362 )
4 年前
GitHub
cf570f8c
Update barracuda in the hopes that our burst crashes go away. ( #4359 ) ( #4365 )
4 年前
GitHub
8bbcfc28
fix deleted observations ( #4366 )
4 年前
GitHub
bb9417f7
Update example environments to use the Actuator API ( #4363 )
4 年前