77 次代码提交 (soccer-fives/multiagent)

作者 SHA1 备注 提交日期
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 6 年前
GitHub 9c50abcf GAIL and Pretraining (#2118) 5 年前
GitHub 78c0c202 fix mock_brain (#2377) 5 年前
GitHub b498c19d Fix BCTrainer increment_steps (#2384) 5 年前
GitHub d7ebaae1 Return list instead of np array for make_mini_batch() (#2371) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
Ervin Teng 072d2ef8 Merge latest develop 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub b73fa378 Add more extensive tests for BC trainer (#2506) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
Ervin Teng e826f4bb Bugfix for LSTM+BC (#2679) 5 年前
GitHub 68965c7b Use a class for camera res, not dict (#2656) 5 年前
GitHub 24ba9d58 Develop deprecate broadcasting (#2669) 5 年前
GitHub e6240c7a Bugfix for LSTM+BC (#2679) 5 年前
GitHub 619465e1 Fix crash when SAC is used with Curiosity and Continuous Actions (#2740) 5 年前
GitHub 0892ef2c [WIP] ISensor interface and use for visual observations (#2731) 5 年前
GitHub ccb7eab4 Remove {text,custom} {action,observations} (#2839) 5 年前
GitHub e6f549dc [MLA-12] update protobuf for vector observations (#2862) 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng 29cdf77a Fix RL tests 5 年前
Ervin Teng 3a4fa244 Switch to tanh squash in PPO 5 年前
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng eb4a04a5 Merge branch 'master' into develop-tanhsquash 5 年前
GitHub 3b4b0d55 Remove random normal epsilon (#3039) 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 年前
Ervin Teng 27c2a55b Lots of test fixes 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
GitHub 7fbf6b1d add flake8-bugbear (#3137) 5 年前
GitHub 29c91b14 update flake8 plugin version and fix warnings (#3180) 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 48b39b80 Fix ghost trainer and all tests 5 年前
Ervin Teng 7b0f700b Add test for deletion calls 5 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
GitHub 870338b4 [bug-fix] Fix issue with more than one continuous actions (#3547) 5 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 年前
GitHub 4641038e Renaming max_step to interrupted in TermialStep(s) (#3908) 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Andrew Cohen 4b094d25 large normalization obs unit test 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
Andrew Cohen 590adc01 make_fake_trajectory/step take ActionSpec arg 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
Andrew Cohen 0e28dd8f add static method to create continuous/discrete 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
Andrew Cohen b6d10456 removed action_spec.size 4 年前
Arthur Juliani b8f22fd7 Update second half of tests 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 年前
GitHub a0d1c829 Action Docs part2 (#4739) 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
Andrew Cohen 3457cd3c save only discrete actions as prev 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 886883b3 Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Arthur Juliani 7c37c759 Fix some mis-renamings 4 年前
Arthur Juliani 0a876b9c Fix typos 4 年前
Arthur Juliani e3de0406 Plurals 4 年前
GitHub 64fc7f43 Buffer key enums (#4907) 4 年前
Ruo-Ping Dong 471a2e82 fix tests 4 年前
Ruo-Ping Dong c87bce9e Merge branch 'master' into develop-base-teammanager 4 年前
Ervin Teng e112ede0 Fix mock brain 4 年前
Ervin Teng 44073593 Test for team obs in agentprocessor 4 年前
Ervin Teng a81512c9 Test for group and add team reward 4 年前
Ervin Teng d4438878 Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 5d517c5e clean ups 4 年前
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 4 年前
Ervin Teng fd0dd35c Merge branch 'main' into develop-coma2-trainer 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前