Ervin Teng
|
ea49396a
|
Add docstring
|
5 年前 |
Ervin Teng
|
6242b67d
|
Add way to check if trajectory is done or max_reached
|
5 年前 |
Ervin Teng
|
fdf9aea7
|
Make conversion methods part of NamedTuples
|
5 年前 |
Ervin Teng
|
47f8fa7a
|
Fix some import errors
|
5 年前 |
Ervin Teng
|
c330f6f6
|
Merge branch 'master' into develop-agentprocessor
|
5 年前 |
Ervin Teng
|
9d1eff12
|
Fix one more np float32 issue
|
5 年前 |
Ervin Teng
|
77aea4cd
|
Fix np float32 errors
|
5 年前 |
Ervin Teng
|
2b811fc8
|
Properly report value estimates and episode length
|
5 年前 |
Ervin Teng
|
83126bb2
|
Fix PPO value tests
|
5 年前 |
Ervin Teng
|
43c0acfb
|
Fix test again
|
5 年前 |
Ervin Teng
|
77ff4822
|
Add back next_obs
|
5 年前 |
Ervin Teng
|
324d217b
|
Move agent_id to Trajectory
|
5 年前 |
Ervin Teng
|
97d66e71
|
Remove BootstrapExperience
|
5 年前 |
Ervin Teng
|
27c2a55b
|
Lots of test fixes
|
5 年前 |
Ervin Teng
|
38ff674e
|
Fix BC and tests
|
5 年前 |
Ervin Teng
|
3449b551
|
Add test for trajectory
|
5 年前 |
Ervin Teng
|
62d609f8
|
Fix some of the tests
|
5 年前 |
Ervin Teng
|
40bbe173
|
Better decoupling for agent processor
|
5 年前 |
Ervin Teng
|
5ab2563b
|
Fixes for recurrent
|
5 年前 |
Ervin Teng
|
c7632aa7
|
Fix some bugs for visual obs
|
5 年前 |
Ervin Teng
|
3697e616
|
Convert BC (warning) might be broken
|
5 年前 |
Ervin Teng
|
336ca456
|
Kill the ProcessingBuffer
|
5 年前 |
Ervin Teng
|
c9116ed2
|
Move some common logic to buffer class
|
5 年前 |
Ervin Teng
|
f2b3cd7f
|
Remove dead code
|
5 年前 |
Ervin Teng
|
28eba789
|
Migrate SAC
|
5 年前 |
Ervin Teng
|
2f82a550
|
Remove epsilon
|
5 年前 |
Ervin Teng
|
88b1123a
|
Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor
|
5 年前 |
Ervin Teng
|
76abf968
|
Add back max_step logic
|
5 年前 |
Ervin Teng
|
8b3b9e6c
|
Move trajectory and related functions to trajectory.py
|
5 年前 |
Ervin Teng
|
f94365a2
|
No longer using ProcessingBuffer for PPO
|
5 年前 |
Ervin Teng
|
e0e57188
|
Clean up some stuff
|
5 年前 |
Ervin Teng
|
9c5fdd31
|
Stats reporting is working
|
5 年前 |
Ervin Teng
|
a97ffb47
|
Attempt reward reporting
|
5 年前 |
Ervin Teng
|
c2b729a6
|
Fix memory leak
|
5 年前 |
Ervin Teng
|
9e661f0c
|
Looks like it's training
|
5 年前 |
Ervin Teng
|
2c9376bc
|
Convert to trajectory
|
5 年前 |
Ervin Teng
|
f008dac0
|
Use ProcessingBuffer in AgentProcessor
|
5 年前 |
Ervin Teng
|
34f9577c
|
Merge branch 'develop' into develop-agentprocessor
|
5 年前 |
Ervin Teng
|
1e36028d
|
Runs but doesn't do anything yet
|
5 年前 |
Ervin Teng
|
17dca3ce
|
Another nonworking commit
|
5 年前 |
Ervin Teng
|
02b5e1ef
|
Revert buffer for now
|
5 年前 |
Ervin Teng
|
3434352a
|
Non-working commit
|
5 年前 |
Ervin Teng
|
73000a6b
|
Merge branch 'develop' into develop-splitbuffer
|
5 年前 |
Ervin Teng
|
fd0647a6
|
Rename append_update_buffer to append_to_update_buffer
|
5 年前 |
Ervin Teng
|
c2d216ca
|
Add type hints to Buffer
|
5 年前 |
Ervin Teng
|
c5b23f46
|
Remove MANIFEST file
|
5 年前 |
Ervin Teng
|
a80b47d1
|
Fix demo loader and remaining tests
|
5 年前 |
Ervin Teng
|
29cdf77a
|
Fix RL tests
|
5 年前 |
Ervin Teng
|
9053610f
|
Fix buffer tests and truncate
|
5 年前 |
Ervin Teng
|
e5459c49
|
buffer split for SAC
|
5 年前 |