94 次代码提交 (20ae24dc-b21e-4343-903d-dd617110d143)

作者 SHA1 备注 提交日期
GitHub 8317a659 Behavioral Cloning & Trainers Reorg (#328) 7 年前
GitHub e11dae1d Python Testing & Image Inference Improvements (#353) 7 年前
Arthur Juliani b8a4f5f1 Add Hallway envronment to validate LSTM models 7 年前
Arthur Juliani c3644f56 Buffer fix for properly masking gradients 7 年前
GitHub f134016b On Demand Decision (#308) 7 年前
GitHub a7c9096f [Semantics] Modified the placeholder names (#381) 7 年前
GitHub 848b8a58 Fix PPO regression (#434) 7 年前
GitHub 237b41f9 Hotfix 0.3.0c (#618) 7 年前
GitHub 3b866e9f Use Clipped Gaussian (#649) 7 年前
GitHub c17937ef Curiosity Driven Exploration & Pyramids Environments (#739) 7 年前
Arthur Juliani 5d402be9 Minor Optimizations (#836) 7 年前
GitHub 282d5bd4 Fix Pytests (#843) 7 年前
GitHub a720e370 Fix bug and update tests (#850) 7 年前
GitHub 47fc38ab Additional Tests & Bug Fixes (#854) 7 年前
GitHub 6df07946 Fix for Discrete observations + Curiosity (#866) 7 年前
Arthur Juliani 5e48766d Remove discrete observations 7 年前
Arthur Juliani b46b8708 Rename function 7 年前
Arthur Juliani 12d52cb0 Replace tanh on cc models w/ swish 7 年前
GitHub e50ac7ae Merge branch 'develop' into hotfix-0 7 年前
Arthur Juliani 3659bbcd Develop multi discrete (#1022) 6 年前
Deric Pang 634280a6 Fixed imports, all tests are passing. 6 年前
GitHub ded0d8c7 Develop action masking (#1080) 6 年前
GitHub 2e489abc Normalization of the probabilities after masking (#1123) 6 年前
Deric Pang cdb41480 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
Deric Pang d4ca94a1 Merge remote-tracking branch 'upstream/develop' into develop-flat-code-restructure 6 年前
GitHub fbf92810 Refactor Trainers to use Policy (#1098) 6 年前
GitHub 10d2a19d Release v0.5 (Develop) (#1203) 6 年前
GitHub 2d4b4209 Use single scope declaration for models (#1160) 6 年前
GitHub ab6eb8dc Fix TF Nan bug (#1178) 6 年前
GitHub 6c354d16 New Learning Brain (#1303) 6 年前
vincentpierre 03a8b7ed fix discrete curiosity 6 年前
vincentpierre eb4e23a7 making masked actions impossible instead of improbable 6 年前
GitHub 249e86a4 Ticked API : (#1696) 6 年前
Ervin T b30f4c90 Split `mlagents` into two packages (#1812) 6 年前
eshvk ef8009d9 Python code reformat via [`black`](https://github.com/ambv/black). 6 年前
GitHub 4ac79742 Refactor reward signals into separate class (#2144) 6 年前
GitHub b05c9ac1 Add environment manager for parallel environments (#2209) 5 年前
GitHub d80d5852 add some types to the reward signals (#2215) 5 年前
GitHub be4292fb Add different types of visual encoder (nature cnn/resnet) 5 年前
GitHub 6225317d refactor vis_encoder_type and add to doc 5 年前
GitHub a9fe719c Add Multi-GPU implementation for PPO (#2288) 5 年前
GitHub 7b69bd14 Refactor Trainer and Model (#2360) 5 年前
GitHub 3683cc1c Enable learning rate decay to be disabled (#2567) 5 年前
GitHub 4980b904 Cleanup visual obs setup (#2647) 5 年前
GitHub 68965c7b Use a class for camera res, not dict (#2656) 5 年前
Chris Elion 43e23941 rough pass at tf2 support, needs cleanup 5 年前
Chris Elion 806c77e4 centralize tensorflow imports 5 年前
Chris Elion 8da16bdb move compat functions 5 年前
GitHub 4da157fe more pylint fixes (#2842) 5 年前
GitHub e6f549dc [MLA-12] update protobuf for vector observations (#2862) 5 年前
Chris Elion fca51de8 Merge remote-tracking branch 'origin/develop' into try-tf2-support 5 年前
Chris Elion 73a346cb cleanup 5 年前
GitHub f57b7ac6 Allow usage with tensorflow 2.0.0 (via tf.compat.v1) (#2665) 5 年前
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub 42bea858 Improve mypy coverage by adding --namespace-packages (#3049) 5 年前
GitHub 2fd305e7 Move add_experiences out of trainer, add Trajectories (#3067) 5 年前
GitHub 3de3c1f1 check min size for visual encoders (#3112) 5 年前
Ervin Teng 69e7eeac Normalize based on number of elements 5 年前
Ervin Teng 0046ea2d Add comment 5 年前
Ervin Teng 0040dc7f New way to update mean and var 5 年前
Ervin Teng f80b1d12 Use running norm and std 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
Ervin Teng 03c750a7 Move some functionality to optimizer 5 年前
Ervin Teng cd74e51b More progress 5 年前
Ervin Teng 2373cae8 Move methods into common optimizer 5 年前
Ervin Teng 9ad99eb6 Combined model and policy for PPO 5 年前
Ervin Teng 28f7608f Clean up value head creation 5 年前
Ervin Teng 6bbcf2d7 Add typing to value head creator 5 年前
Ervin Teng 08cb91de Remove __init__ for LearningModel static class 5 年前
Ervin Teng a6e28cf4 Fix for visual obs 5 年前
Ervin Teng 7004604d Used NamedTuple for create normalization tensors 5 年前
Ervin Teng a990e5e8 Add docstrings for model 5 年前
GitHub c145e75b Split Policy and Optimizer, common Policy for PPO and SAC (#3345) 5 年前
Ervin Teng 53c25fb1 Move one-hot out of policy and remove selected_actions 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
GitHub 94de596b [change] Remove concatenate in discrete action probabilities to improve inference performance (#3598) 5 年前
Arthur Juliani 8c6f4696 Fix a couple additional bugs 5 年前
Andrew Cohen 4a3ad193 Add constant decay to beta and epsilon 5 年前
GitHub c5b94ca6 Use LR schedule for beta and epsilon (#3940) 5 年前
Arthur Juliani 2b3a6347 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
yanchaosun ac4c80c2 integrate the implementation and hyperparameters 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 4 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
Andrew Cohen 41216d7a test initalize steps to 100 4 年前
yanchaosun 36f36750 target critic for ppo 4 年前
Andrew Cohen 18ff42a6 use mean of first trajectory to initialize the normalizer 4 年前
Andrew Cohen ce9bcefe cleaned up initialization of variance/mean 4 年前
Andrew Cohen 4b094d25 large normalization obs unit test 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
GitHub bf6506fc [feature] Add small CNN for grids 5x5 and up (#4434) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前