97 commits (bad47dad-b8c2-4967-b0ec-e1efdf027a98)

Author SHA1 Message Commit date
Ervin Teng bad47dad Allow None max steps 5 years ago
Ervin Teng e2b2f4be Address AgentProcessor comments 5 years ago
Ervin Teng ea49396a Add docstring 5 years ago
Ervin Teng 6242b67d Add way to check if trajectory is done or max_reached 5 years ago
Ervin Teng fdf9aea7 Make conversion methods part of NamedTuples 5 years ago
Ervin Teng 47f8fa7a Fix some import errors 5 years ago
Ervin Teng c330f6f6 Merge branch 'master' into develop-agentprocessor 5 years ago
Ervin Teng 9d1eff12 Fix one more np float32 issue 5 years ago
Ervin Teng 77aea4cd Fix np float32 errors 5 years ago
Ervin Teng 2b811fc8 Properly report value estimates and episode length 5 years ago
Ervin Teng 83126bb2 Fix PPO value tests 5 years ago
Ervin Teng 43c0acfb Fix test again 5 years ago
Ervin Teng 77ff4822 Add back next_obs 5 years ago
Ervin Teng 324d217b Move agent_id to Trajectory 5 years ago
Ervin Teng 97d66e71 Remove BootstrapExperience 5 years ago
Ervin Teng 27c2a55b Lots of test fixes 5 years ago
Ervin Teng 38ff674e Fix BC and tests 5 years ago
Ervin Teng 3449b551 Add test for trajectory 5 years ago
Ervin Teng 62d609f8 Fix some of the tests 5 years ago
Ervin Teng 40bbe173 Better decoupling for agent processor 5 years ago
Ervin Teng 5ab2563b Fixes for recurrent 5 years ago
Ervin Teng c7632aa7 Fix some bugs for visual obs 5 years ago
Ervin Teng 3697e616 Convert BC (warning) might be broken 5 years ago
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 years ago
Ervin Teng c9116ed2 Move some common logic to buffer class 5 years ago
Ervin Teng f2b3cd7f Remove dead code 5 years ago
Ervin Teng 28eba789 Migrate SAC 5 years ago
Ervin Teng 2f82a550 Remove epsilon 5 years ago
Ervin Teng 88b1123a Merge branch 'master' of github.com:Unity-Technologies/ml-agents into develop-agentprocessor 5 years ago
Ervin Teng 76abf968 Add back max_step logic 5 years ago
Ervin Teng 8b3b9e6c Move trajectory and related functions to trajectory.py 5 years ago
Ervin Teng f94365a2 No longer using ProcessingBuffer for PPO 5 years ago
Ervin Teng e0e57188 Clean up some stuff 5 years ago
Ervin Teng 9c5fdd31 Stats reporting is working 5 years ago
Ervin Teng a97ffb47 Attempt reward reporting 5 years ago
Ervin Teng c2b729a6 Fix memory leak 5 years ago
Ervin Teng 9e661f0c Looks like it's training 5 years ago
Ervin Teng 2c9376bc Convert to trajectory 5 years ago
Ervin Teng f008dac0 Use ProcessingBuffer in AgentProcessor 5 years ago
Ervin Teng 34f9577c Merge branch 'develop' into develop-agentprocessor 5 years ago
Ervin Teng 1e36028d Runs but doesn't do anything yet 5 years ago
Ervin Teng 17dca3ce Another nonworking commit 5 years ago
Ervin Teng 02b5e1ef Revert buffer for now 5 years ago
Ervin Teng 3434352a Non-working commit 5 years ago
Ervin Teng 73000a6b Merge branch 'develop' into develop-splitbuffer 5 years ago
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 years ago
Ervin Teng c2d216ca Add type hints to Buffer 5 years ago
Ervin Teng c5b23f46 Remove MANIFEST file 5 years ago
Ervin Teng a80b47d1 Fix demo loader and remaining tests 5 years ago
Ervin Teng 29cdf77a Fix RL tests 5 years ago