63 次代码提交 (09b37fb2-6063-411c-8038-4cd572fc9d0b)

作者 SHA1 备注 提交日期
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
Ervin Teng e0da93d1 Fix bug with construct_curr_info and test 5 年前
Ervin Teng aca81efb Add more tests 5 年前
Ervin Teng 28ef8983 Add 2 visual obs test 5 年前
GitHub 5d3e05d1 Fix "memory leak" during inference (#2722) 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
Ervin Teng 29cdf77a Fix RL tests 5 年前
Ervin Teng fd0647a6 Rename append_update_buffer to append_to_update_buffer 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Andrew Cohen ef2dfd4c adjusting tests to expect trainer.add_policy to be called 5 年前
Ervin Teng 336ca456 Kill the ProcessingBuffer 5 年前
Ervin Teng 62d609f8 Fix some of the tests 5 年前
Jonathan Harper 9f166f9e Update tests to support pytest 5.x 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Ervin Teng 48793ec1 Fix test 5 年前
Ervin Teng c48ddcf2 Fix pre-commit error 5 年前
GitHub 1f9d04f2 Fix clear update buffer when trainer stops training, add test (#3422) 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 年前
Ervin Teng f29b17a9 Don't block one policy queue 5 年前
Ervin Teng 99ce4b59 Improve tests 5 年前
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 年前
Jonathan Harper 80127232 Convert checkpoints to .nn format 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
Ruo-Ping Dong e06812aa fix tests 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong c47ffc20 Rename saver 4 年前
Ruo-Ping Dong 09c22679 fix NNCheckpointManager for Torch 4 年前
Ruo-Ping Dong e60c7038 Merge branch 'master' into develop-saver-name 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 590adc01 make_fake_trajectory/step take ActionSpec arg 4 年前
Andrew Cohen 0e28dd8f add static method to create continuous/discrete 4 年前
vincentpierre b863af57 Removing TensorFlow Trainers 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
GitHub 8a40c58a Added SUM as aggregation type for custom statistics (#4816) 4 年前
GitHub 14129a08 [MLA-470] Barracuda + TF cleanup (#4837) 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Arthur Juliani 7c37c759 Fix some mis-renamings 4 年前
Arthur Juliani e3de0406 Plurals 4 年前
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 年前
GitHub 8387e252 [release] Fix rl trainer warning (#5144) 4 年前
Ervin Teng d1c24251 [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 年前
GitHub 28eb43dd [bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271) 4 年前