126 次代码提交 (4671cf17-7b01-4722-935e-99b514a6ebbe)

作者 SHA1 备注 提交日期
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
GitHub 689765d6 Modification of reward signals and rl_trainer for SAC (#2433) 5 年前
GitHub f628d18b initialize trainer step count (#2498) (#2505) 5 年前
Ervin Teng e0da93d1 Fix bug with construct_curr_info and test 5 年前
GitHub 25926795 initialize trainer step count (#2498) 5 年前
Ervin Teng 4cb340b5 Fix crash when next_info is empty and using recurrent 5 年前
GitHub b7e12a37 Fix crash in construct_curr_info when next_info doesn't have any agents (#2549) 5 年前
GitHub 67d754c5 Fix flake8 import warnings (#2584) 5 年前
GitHub 5d3e05d1 Fix "memory leak" during inference (#2722) 5 年前
GitHub 0fe5adc2 Develop remove memories (#2795) 5 年前
GitHub 4da157fe more pylint fixes (#2842) 5 年前
GitHub ccb7eab4 Remove {text,custom} {action,observations} (#2839) 5 年前
Andrew Cohen 13fe9cf8 Bubbled up indexing of AllBrainInfo to trainer controller from trainers 5 年前
GitHub 69d1a033 Develop remove past action communication (#2913) 5 年前
Ervin Teng df5ee7bf Split buffer into two buffers (PPO works) 5 年前
GitHub 652488d9 check for numpy float64 (#2948) 5 年前
GitHub 213cd68d Split Buffer into processing and update buffers (#2964) 5 年前
Ervin Teng 9e661f0c Looks like it's training 5 年前
Ervin Teng 9c5fdd31 Stats reporting is working 5 年前
Ervin Teng f94365a2 No longer using ProcessingBuffer for PPO 5 年前
Ervin Teng 8b3b9e6c Move trajectory and related functions to trajectory.py 5 年前
Andrew Cohen 8578b0b7 add_policy and create_policy separated 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 年前
Ervin Teng 62d609f8 Fix some of the tests 5 年前
Ervin Teng 27c2a55b Lots of test fixes 5 年前
Andrew Cohen d1edbf43 add_policy and create_policy separated 5 年前
Ervin Teng 2b811fc8 Properly report value estimates and episode length 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
GitHub 0b5b1b01 Develop magic string + trajectory (#3122) 5 年前
Andrew Cohen 082789ea Merge branch 'master' into develop-magic-string 5 年前
Ervin Teng e577d5ea Fix some mypy issues and remove unused code 5 年前
GitHub bec2e8f0 Add Trajectory/Policy Queues, move Trainer logic to advance() (#3113) 5 年前
Andrew Cohen fc485077 fixed more ci problems/removed self.policies 5 年前
Ervin Teng db743971 Move private methods out of trainer, simplify interface 5 年前
GitHub 45010af3 Add stats reporter class and re-enable missing stats (#3076) 5 年前
Ervin Teng b3a4e641 Remove some vestigial code 5 年前
Ervin Teng 48793ec1 Fix test 5 年前
Ervin Teng cd74e51b More progress 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
Ervin Teng 164732a9 Move optimizer creation to Trainer, fix some of the reward signals 5 年前
Ervin Teng cbfbff2c Split optimizer and TFOptimizer 5 年前
Ervin Teng 4d94e180 Move optimizer to common folder 5 年前
GitHub 1f9d04f2 Fix clear update buffer when trainer stops training, add test (#3422) 5 年前
Ervin Teng 5ef902bf Merge branch 'master' into develop-splitpolicyoptimizer 5 年前
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
GitHub 6709a9bf [change] Clean up trainer interface, clean up GhostTrainer stats (#3634) 5 年前
Ervin Teng 3deb8e30 Make trainer in separate threads 5 年前
Ervin Teng 93351d30 Fix comments 5 年前
Ervin Teng ed06f37c Ability to disable threading 5 年前
Ervin Teng 971e4b2d Don't block when disabling threading 5 年前
Ervin Teng f29b17a9 Don't block one policy queue 5 年前
Anupam Bhatnagar ac80ec82 [skip ci] increment steps on training 5 年前
Anupam Bhatnagar d49ceecc [skip ci] moving summary writer to update_policy 5 年前
Anupam Bhatnagar 95ba923d [skip ci] fix first summary statement output 5 年前
Anupam Bhatnagar 63abbe71 [skip ci] moving summary writer to update_policy 5 年前
Anupam Bhatnagar 45bac63e [skip ci] more fixes 5 年前
Ervin Teng d1fed8ae Remove empty_queue interface 5 年前
Ervin Teng e90ef688 Revert to get_nowait method in AgentManagerQueue 5 年前
Anupam Bhatnagar 9d7dd3b6 [skip ci] moving step increment to trainer from environment for sac 5 年前
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 5 年前
Ervin Teng 6fa7ad0b Avoid stall when multiple brains are present 5 年前
Ervin Teng 744db929 Adjust yield timeout 5 年前
GitHub ccd40ce7 [bug-fix] Bugfixes for Threaded Trainers (#3817) 5 年前
Arthur Juliani 212e2d1d Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub 09853e13 [refactor] Move checkpoint saving into trainer (#4034) 4 年前
PSankalp Patro 45c4ea36 Save checkpoint files as .nn files in checkpoint directory 4 年前
Anupam Bhatnagar 4afd8f92 first commit 4 年前
Anupam Bhatnagar 0aedad7c fixing should_still_train call in rl_trainer.py 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
Anupam Bhatnagar 24d5f881 first commit 4 年前
GitHub 45154f52 Pytorch port of SAC (#4219) 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Ruo-Ping Dong 6feec58a add Saver class (only TF working) 4 年前
GitHub 93517833 [feature] Fix TF tests, add --torch CLI option, allow run TF without torch installed (#4305) 4 年前
Ruo-Ping Dong 6d67f857 move tf and add torch model serialization 4 年前
Ruo-Ping Dong bdb2ba93 small improvements 4 年前
GitHub 7ddfd81f Added Reward Providers for Torch (#4280) 4 年前
Ruo-Ping Dong 3b729a82 small improvements 4 年前
Ruo-Ping Dong 4e87b422 move checkpoint_path logic to saver 4 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
GitHub 0e0daf47 [add-fire] Merge post-0.19.0 master into add-fire (#4328) 4 年前
Ruo-Ping Dong b4713baa small improvements 4 年前
Ruo-Ping Dong 09a741c8 small improvement 4 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 4 年前
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 4 年前
GitHub beb5aca5 [refactor] Make classes except Optimizer framework agnostic (#4268) 4 年前
GitHub 8128defb Don't save model twice, copy instead (#4302) 4 年前
Ruo-Ping Dong d3eb6c46 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
Ervin Teng 0ba67eb6 Fix ONNX import for continuous 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
GitHub 8985a040 Removing the experiment script from add fire (#4373) 4 年前
Andrew Cohen a65d08c7 ghost trainer tests 4 年前
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 年前
Ruo-Ping Dong c47ffc20 Rename saver 4 年前
Ruo-Ping Dong 09c22679 fix NNCheckpointManager for Torch 4 年前
Ruo-Ping Dong e60c7038 Merge branch 'master' into develop-saver-name 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
vincentpierre 6b6d4c38 _ 4 年前
vincentpierre 6cbe892f _ 4 年前
vincentpierre 8be52c38 - 4 年前
vincentpierre c10da7ef - 4 年前
vincentpierre 29f08b2e - 4 年前
vincentpierre 170f47a5 - 4 年前
vincentpierre a8137478 - 4 年前
vincentpierre f49aa8c7 - 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
GitHub c188781b [life improvement] Moving Python files around (#4531) 4 年前
GitHub a690af74 [refactor] Make PyTorch the default and TensorFlow optional (#4517) 4 年前
Ervin Teng 3b15cc32 Multiprocessing but Stats are quite broken 4 年前
vincentpierre b863af57 Removing TensorFlow Trainers 4 年前
GitHub 7387a77f remove pylint (#4836) 4 年前
Arthur Juliani 9e2f0814 Add histogram aggregation type 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
GitHub 62314056 Fix ghost curriculum and make steps private (#5098) 4 年前
GitHub 63169e2c [cherry-pick] Fix group rewards for POCA, add warning for non-POCA trainers (#5120) 4 年前
GitHub 8387e252 [release] Fix rl trainer warning (#5144) 4 年前
Ervin Teng d1c24251 [bug-fix] When agent isn't training, don't clear update buffer (#5205) 4 年前
GitHub 28eb43dd [bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271) 4 年前
GitHub ed69fd2b collecting latest step as a stat (#5264) 4 年前