63 commits (e2e62cb9-9803-4bd9-886a-b237e504173f)

Author SHA1 Message Commit date
Arthur Juliani de700c3a Multi Brain Training and Recurrent state encoder (#166) 7 years ago
GitHub 51621334 State Stacking & Banan Environment (#262) 7 years ago
GitHub 36d58cee Add Seeding, MaxStepReached, and Bootstrapping fix (#303) 7 years ago
GitHub f134016b On Demand Decision (#308) 6 years ago
GitHub 69481d2d Imitation Learning Helper (#371) 6 years ago
Arthur Juliani 5d402be9 Minor Optimizations (#836) 6 years ago
GitHub 0c417c55 Release v0.5 (#1202) 6 years ago
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 years ago
GitHub 3c9603d6 Demonstration Recorder (#1240) 6 years ago
GitHub b6c97cb6 Fix for divide-by-zero error with Discrete Actions (#1520) 6 years ago
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 5 years ago
Chris Elion bb7773c1 add flake8 to precommit 5 years ago
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 years ago
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 years ago
GitHub bf375235 Change update buffer to float32 instead of float64 (#2461) 5 years ago
GitHub 876aca1e Use numpy for random sample in buffer (#2524) 5 years ago
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 years ago
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 years ago
Ervin Teng 9053610f Fix buffer tests and truncate 5 years ago
Ervin Teng c2d216ca Add type hints to Buffer 5 years ago
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 years ago
Ervin Teng 3434352a Non-working commit 5 years ago
GitHub 652488d9 check for numpy float64 (#2948) 5 years ago
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 years ago
Ervin Teng c9116ed2 Move some common logic to buffer class 5 years ago
Chris Elion fdc810ff move (first pass) 5 years ago
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 years ago
GitHub e2ae7124 pass file mode to h5py.File() (#3165) 5 years ago
GitHub 1f9d04f2 Fix clear update buffer when trainer stops training, add test (#3422) 4 years ago
Christopher Goy ba80b292 format files with pre-commit. 4 years ago
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 years ago
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 years ago
GitHub 215b35c6 [refactor] Optimize buffer sample_minibatch (#4508) 4 years ago
GitHub b853e5ba Action buffer (#4612) 4 years ago
Ervin Teng 184f27c6 Make buffer type-agnostic 4 years ago
Ervin Teng f5c180bc Edit types of Apped method 4 years ago
Ervin Teng 1a79cf5f Change comment 4 years ago
Ervin Teng 95bdbba3 Less broken PPO 4 years ago
Ervin Teng 56dcd75a Get next critic observations into value estimate 4 years ago
Andrew Cohen e1fad8a4 buffer error 4 years ago
GitHub 64fc7f43 Buffer key enums (#4907) 3 years ago
Ervin Teng 3d0abb03 Make buffer typing neater 3 years ago
Ervin Teng ae7643b8 Proper critic memories for PPO 3 years ago
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 3 years ago
Ervin Teng c2883f5b Pad from back of trajectory 3 years ago
Chris Elion c3bc8991 cleanup, don't store mask 3 years ago
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 3 years ago
Ervin Teng 9bc88c41 Running COMA (not sure if learning) 3 years ago
Ervin Teng 2f209c12 Buffer fixes 3 years ago
Ervin Teng b3958a8d Buffer fixes 3 years ago
Ervin Teng be45d8c0 Move padding method to AgentBufferField 3 years ago
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 3 years ago
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 3 years ago
Ervin Teng 50ab983e Fix slicing typing and string printing in AgentBufferField 3 years ago
Ervin Teng bc3d3a95 Fix slicing typing and string printing in AgentBufferField 3 years ago
Andrew Cohen 8f799687 ignoring precommit, grabbing baseline/critic mems from buffer in trainer 3 years ago
GitHub af36ef3b [bug-fix] Fix typo (#5035) 3 years ago
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 3 years ago
GitHub 47db8ce1 [bug-fix] Fix padding for List entries in buffer (#5046) 3 years ago
Ervin Teng 8902c058 Merge branch 'main' into develop-coma2-trainer 3 years ago
Ervin Teng c108da4a [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 3 years ago
GitHub b9cab453 [perf] Optimizations for performance (#5192) 3 years ago
GitHub c5589b59 [bug-fix] Fix POCA LSTM, pad sequences in the back (#5206) 3 years ago