Ervin Teng
5ab2563b
Fixes for recurrent
5 years ago
Ervin Teng
c7632aa7
Fix some bugs for visual obs
5 years ago
Ervin Teng
3697e616
Convert BC (warning) might be broken
5 years ago
Ervin Teng
336ca456
Kill the ProcessingBuffer
5 years ago
Ervin Teng
c9116ed2
Move some common logic to buffer class
5 years ago
Ervin Teng
f2b3cd7f
Remove dead code
5 years ago
Ervin Teng
28eba789
Migrate SAC
5 years ago
Ervin Teng
2f82a550
Remove epsilon
5 years ago
Ervin Teng
88b1123a
Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor
5 years ago
Ervin Teng
76abf968
Add back max_step logic
5 years ago
Ervin Teng
8b3b9e6c
Move trajectory and related functions to trajectory.py
5 years ago
Ervin Teng
f94365a2
No longer using ProcessingBuffer for PPO
5 years ago
Ervin Teng
e0e57188
Clean up some stuff
5 years ago
Ervin Teng
9c5fdd31
Stats reporting is working
5 years ago
Ervin Teng
a97ffb47
Attempt reward reporting
5 years ago
Ervin Teng
c2b729a6
Fix memory leak
5 years ago
Ervin Teng
9e661f0c
Looks like it's training
5 years ago
Ervin Teng
2c9376bc
Convert to trajectory
5 years ago
Ervin Teng
f008dac0
Use ProcessingBuffer in AgentProcessor
5 years ago
Ervin Teng
34f9577c
Merge branch 'develop' into develop-agentprocessor
5 years ago
Ervin Teng
1e36028d
Runs but doesn't do anything yet
5 years ago
Ervin Teng
17dca3ce
Another nonworking commit
5 years ago
Ervin Teng
02b5e1ef
Revert buffer for now
5 years ago
Ervin Teng
3434352a
Non-working commit
5 years ago
Ervin Teng
73000a6b
Merge branch 'develop' into develop-splitbuffer
5 years ago
Ervin Teng
fd0647a6
Rename append_update_buffer to append_to_update_buffer
5 years ago
Ervin Teng
c2d216ca
Add type hints to Buffer
5 years ago
Ervin Teng
c5b23f46
Remove MANIFEST file
5 years ago
Ervin Teng
a80b47d1
Fix demo loader and remaining tests
5 years ago
Ervin Teng
29cdf77a
Fix RL tests
5 years ago
Ervin Teng
9053610f
Fix buffer tests and truncate
5 years ago
Ervin Teng
e5459c49
buffer split for SAC
5 years ago
Ervin Teng
df5ee7bf
Split buffer into two buffers (PPO works)
5 years ago
GitHub
e6cace92
add options to set version on files (#2954)
* add options to set version on files
* fix precommit
5 years ago
GitHub
bc5bf388
Convert most other scenes to RayPerception sensor (#2916)
5 years ago
GitHub
e2eef3c4
Clean up env logging on initialization (#2950)
5 years ago
Chris Elion
e2e76c51
Develop barracuda 0.3.x (#2952)
* Removed Barracuda as drop-in library, added Barracuda package dependency
* Removed Google Protobuf library as now it comes with Barracuda package
* List<T>.Length seems to be extension, which is not available in .NET coming with Unity 2017, switched to .Count
5 years ago
GitHub
28dbf4c5
Allow --version argument in mlagents-learn (#2942)
* allow --version argument in mlagents-learn
* Develop version print add strings (#2945)
* add __version__ to libs
* more version info
* use actual version
5 years ago
Ervin T
5ff3ad8f
Update Learning-Environment-Create-New.md (#2925)
Added a missing instruction in the Final Editor Setup section.
5 years ago
GitHub
8a15cac5
Making the behavior type and enum (#2937)
* Making the behavior type and enum
* CamelCase
5 years ago
Andrew Cohen
e96b80db
receives brain_name and identifier on python side
5 years ago
GitHub
69d1a033
Develop remove past action communication (#2913)
* Modifying the .proto files
* attempt 1 at refactoring Python
* works for ppo hallway
* changing the documentation
* now works with both sac and ppo both training and inference
* Need to fix the tests
* TODOs :
- Fix the demonstration recorder
- Fix the demonstration loader
- verify the intrinsic reward signals work
- Fix the tests on Python
- Fix the C# tests
* Regenerating the protos
* fix proto typo
* protos and modifying the C# demo recorder
* modified the demo loader
* Demos are loading
* IMPORTANT : THESE ARE THE FILES USED FOR CONVERSION FROM OLD TO NEW FORMAT
* Modified all the demo files
* Fixing all the tests
* fixing ci
* addressing comments
* removing reference to memories in the ll-api
5 years ago
Andrew Cohen
c257e053
set team id in prefab
5 years ago
GitHub
38d39e38
disable tensorflow warnings by default (#2931)
5 years ago
Andrew Cohen
184af227
splitting brain params into brain name and identifiers
5 years ago
GitHub
8426501b
Fix width and height in visual observations (#2919)
* swap h/w in sensor
* change texture to non-square, retrain model
* get dimensions from RenderTexture
5 years ago
Andrew Cohen
518ce6f8
added team id and identifier concat to behavior parameters
5 years ago
GitHub
99981937
fix errors from new flake8-comprehensions (#2917)
5 years ago
GitHub
c0453ae1
Merge pull request #2912 from Unity-Technologies/develop-allbraininfo
Bubbled up indexing of AllBrainInfo to trainer controller from trainers
5 years ago
Andrew Cohen
b11f04ea
Fixed test code by creating brain_name variable instead of hardcoding
5 years ago