89 commits (77aea4cd-17fe-405b-9943-bdffa1ba2be8)

Author SHA1 Message Date
Ervin Teng 77aea4cd Fix np float32 errors 5 years ago
Ervin Teng 2b811fc8 Properly report value estimates and episode length 5 years ago
Ervin Teng 83126bb2 Fix PPO value tests 5 years ago
Ervin Teng 43c0acfb Fix test again 5 years ago
Ervin Teng 77ff4822 Add back next_obs 5 years ago
Ervin Teng 324d217b Move agent_id to Trajectory 5 years ago
Ervin Teng 97d66e71 Remove BootstrapExperience 5 years ago
Ervin Teng 27c2a55b Lots of test fixes 5 years ago
Ervin Teng 38ff674e Fix BC and tests 5 years ago
Ervin Teng 3449b551 Add test for trajectory 5 years ago
Ervin Teng 62d609f8 Fix some of the tests 5 years ago
Ervin Teng 40bbe173 Better decoupling for agent processor 5 years ago
Ervin Teng 5ab2563b Fixes for recurrent 5 years ago
Ervin Teng c7632aa7 Fix some bugs for visual obs 5 years ago
Ervin Teng 3697e616 Convert BC (warning) might be broken 5 years ago
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 years ago
Ervin Teng c9116ed2 Move some common logic to buffer class 5 years ago
Ervin Teng f2b3cd7f Remove dead code 5 years ago
Ervin Teng 28eba789 Migrate SAC 5 years ago
Ervin Teng 2f82a550 Remove epsilon 5 years ago
Ervin Teng 88b1123a Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor 5 years ago
Ervin Teng 76abf968 Add back max_step logic 5 years ago
Ervin Teng 8b3b9e6c Move trajectory and related functions to trajectory.py 5 years ago
Ervin Teng f94365a2 No longer using ProcessingBuffer for PPO 5 years ago
Ervin Teng e0e57188 Clean up some stuff 5 years ago
Ervin Teng 9c5fdd31 Stats reporting is working 5 years ago
Ervin Teng a97ffb47 Attempt reward reporting 5 years ago
Ervin Teng c2b729a6 Fix memory leak 5 years ago
Ervin Teng 9e661f0c Looks like it's training 5 years ago
Ervin Teng 2c9376bc Convert to trajectory 5 years ago
Ervin Teng f008dac0 Use ProcessingBuffer in AgentProcessor 5 years ago
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 years ago
Ervin Teng 1e36028d Runs but doesn't do anything yet 5 years ago
Ervin Teng 17dca3ce Another nonworking commit 5 years ago
Ervin Teng 02b5e1ef Revert buffer for now 5 years ago
Ervin Teng 3434352a Non-working commit 5 years ago
Ervin Teng 73000a6b Merge branch 'develop' into develop-splitbuffer 5 years ago
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 years ago
Ervin Teng c2d216ca Add type hints to Buffer 5 years ago
Ervin Teng c5b23f46 Remove MANIFEST file 5 years ago
Ervin Teng a80b47d1 Fix demo loader and remaining tests 5 years ago
Ervin Teng 29cdf77a Fix RL tests 5 years ago
Ervin Teng 9053610f Fix buffer tests and truncate 5 years ago
Ervin Teng e5459c49 buffer split for SAC 5 years ago
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 years ago
GitHub e6cace92 add options to set version on files (#2954) 5 years ago
GitHub bc5bf388 Convert most other scenes to RayPerception sensor (#2916) 5 years ago
GitHub e2eef3c4 Clean up env logging on initialization (#2950) 5 years ago
Chris Elion e2e76c51 Develop barracuda 0.3.x (#2952) 5 years ago
GitHub 28dbf4c5 Allow --version argument in mlagents-learn (#2942) 5 years ago