94 次代码提交 (1ac30864-48b3-4e29-911f-2ca4eb4a8f80)

作者 SHA1 备注 提交日期
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
GitHub bed7debf Fix issue with different decision intervals for different brains (#3181) 5 年前
GitHub d985dded Merge branch 'master' into merge-release-0.13.0 5 年前
Andrew Cohen 4c260917 fix flake merge conflicts with master 5 年前
GitHub 4c241a80 Only send previous action and current BrainInfo (#3187) 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
GitHub 56a67403 Fix lost trajectories when they are produced faster than they are consumed (#3233) 5 年前
GitHub a64e7850 Fix issue with BatchedStepResult with no agents (#3240) 5 年前
GitHub ca96b293 Move advance() logic for environment manager out of trainer_controller (#3234) 5 年前
GitHub 590559e7 Make the Agent reset immediately after Done (#3291) 5 年前
Ervin Teng 7bbd91ad Change logic to fix memory leak 4 年前
GitHub 3939ca52 Change AgentProcessor logic to fix memory leak (#3383) 4 年前
GitHub f20a27e0 Clear agent processor properly on episode reset (#3437) 4 年前
Ervin Teng ff607162 Move learning rate reporting 4 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 4 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 4 年前
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 4 年前
Ervin Teng 3deb8e30 Make trainer in separate threads 4 年前
GitHub de3fc4e8 Hotfix memory leak on Python (#3664) 4 年前
GitHub 11c518a3 Stats SideChannel (for custom TensorBoard metrics) (#3660) 4 年前
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 4 年前
Ervin Teng 971e4b2d Don't block when disabling threading 4 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 4 年前
Ervin Teng 817aab95 Update steps_per_update documentation 4 年前
GitHub 83ac520a Merge 0.15.1 to master (#3755) 4 年前
Ervin Teng f6fcf512 Clean up interface for AP 4 年前
Ervin Teng 81f78aec Make fields properties 4 年前
Ervin Teng f29b17a9 Don't block one policy queue 4 年前
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 4 年前
Ervin Teng d1fed8ae Remove empty_queue interface 4 年前
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 4 年前
Ervin Teng e5fbfc35 Remove params from get_nowait 4 年前
Ervin Teng 392fcb4e Fix stall in ghost trainer non-threaded 4 年前
GitHub 048d66fa Update comment on time horizon in agent processor (#3842) 4 年前
GitHub 4641038e Renaming max_step to interrupted in TermialStep(s) (#3908) 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
GitHub 20f1386a Don't drop multiple stats from the same step (#4236) 4 年前
Scott Jordan d695c044 initial addition of active learning (incomplete) 4 年前
Scott Jordan 56745026 Initial commit of running active learning code 4 年前
Scott Jordan 78f8a9a2 Updated task manager 4 年前
Scott Jordan 87969325 added histogram recorded, fixed active learning bug 4 年前
Andrew Cohen 9c2be310 commenting action pre continuous 4 年前
Andrew Cohen eaecb59e torch utils to and from buffer 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
vincentpierre e14e1c4d Improvements and new tests 4 年前
Ervin Teng 3b15cc32 Multiprocessing but Stats are quite broken 4 年前
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 年前
Ervin Teng 15c463cf Add collab obs to trajectory 4 年前
Ervin Teng f479ce83 Fix bug; add critic_obs to buffer 4 年前
Andrew Cohen bd917c9c action buffer passes continuous 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
Ervin Teng 56dcd75a Get next critic observations into value estimate 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
Andrew Cohen 3c65b964 fixed recurrent prev_action issue 4 年前
Andrew Cohen e9cb1066 agent processor tests 4 年前
Ruo-Ping Dong fbfdc05b send and process team manager id 4 年前
Ruo-Ping Dong 413246c2 remove print 4 年前
vincentpierre f7a4a31f [Experiment] Bullet hell 4 年前
GitHub 8a40c58a Added SUM as aggregation type for custom statistics (#4816) 4 年前
GitHub 7387a77f remove pylint (#4836) 4 年前
Andrew Cohen 231328ea remove warning prints 4 年前
Ervin Teng aba633b2 Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 年前
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 年前
GitHub 70220f95 Team manager prototype (#4850) 4 年前
Ervin Teng a7e368b8 Fix AgentProcessor for TeamManager 4 年前
Ervin Teng fdf97d99 Add team reward to buffer 4 年前
Ervin Teng 92fc78a5 Use new trajectory 3 年前
Ruo-Ping Dong 910da750 change teammanager id from string to int 3 年前
Ervin Teng 65b866b0 Actions added but untested 4 年前
Ruo-Ping Dong fb4a3bd2 fix grouping for int id 3 年前
Ruo-Ping Dong 34a67a8e fix passing manager id to trainer 3 年前
Ruo-Ping Dong e470fa12 make global manager id 3 年前
Ruo-Ping Dong d7ade5c3 update agent processor to use group id 3 年前
Ervin Teng 30db9ef4 AgentProcessor fixes 3 年前
Ervin Teng 514873bf Use correct memories (t-1 instead of t) for training 3 年前
Ervin Teng eb13a14a Renaming fest 3 年前
Ervin Teng a6b4917a Use NamedTuples instead of attrs classes 3 年前
Ervin Teng a9116382 Bug fixes 3 年前
Ervin Teng 4aee6787 more renaming 3 年前
Ervin Teng a25bb4d4 Global group ids 3 年前
Ervin Teng ae659ac4 Addressed some comments 3 年前
Ervin Teng ffdfd8ff Address some comments 3 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 3 年前
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 3 年前
GitHub 2933f235 Fix the reporting of histogram stats and adding a test (#5410) 3 年前