396 次代码提交 (0968daa8-51d2-433d-a6e4-e3dd0f33392a)

作者 SHA1 备注 提交日期
GitHub e4177de0 [change] Organize trainer files a bit better (#3538) 5 年前
Andrew Cohen 573b1f6d Merge branch 'master' into soccer-fives 5 年前
Anupam Bhatnagar f4dbedcf removed extraneous logging imports and loggers 5 年前
GitHub 86141eee Merge pull request #3560 from Unity-Technologies/new-logger 5 年前
Anupam Bhatnagar e8e0078e first commit 5 年前
Anupam Bhatnagar 07b15ae7 [skip-ci] small refactors 5 年前
GitHub e3af96ca Merge branch 'master' into develop-demo-load-seek 5 年前
GitHub 873ba7fd [bug-fix] Fix stats reporting for reward signals in SAC (#3606) 5 年前
Andrew Cohen b1cfa74d Merge branch 'master' into develop-test-imitation 5 年前
GitHub ec278616 Hotfixes for Release 0.15.1 (#3698) 5 年前
Andrew Cohen 53bea15c Merge branch 'master' into soccer-fives 5 年前
Andrew Cohen ac261e36 Merge branch 'master' into self-play-mutex 5 年前
GitHub 29f82921 [bug-fix] Improve performance for PPO with continuous actions (#3662) 5 年前
GitHub de3fc4e8 Hotfix memory leak on Python (#3664) 5 年前
Andrew Cohen 4c9ac553 Merge branch 'master' into self-play-mutex 5 年前
Andrew Cohen 93d344ff simple rl asymm ghost tests 5 年前
GitHub 4ecd6ad3 Fix how we set logging levels (#3703) 5 年前
Andrew Cohen cd677346 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Andrew Cohen c7a34413 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub bc1fdf07 [refactor] CLI changes (#3705) 5 年前
Andrew Cohen 59b88be6 Merge branch 'master' into self-play-mutex 5 年前
GitHub 9cbc3fa2 Asymmetric self-play (#3653) 5 年前
Ervin Teng 06fa3d39 Merge branch 'master' into develop-sac-apex 5 年前
Anupam Bhatnagar 50e52d9c Merge branch 'master' into distributed-training 5 年前
GitHub d7ca6b8d [feature] Add --initialize-from option (#3710) 5 年前
Andrew Cohen 1b9c643b Merge branch 'master' into self-play-mutex 5 年前
Anupam Bhatnagar 001fce2a first commit 5 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 年前
Anupam Bhatnagar 9341f7a2 [skip-ci] small refactors 5 年前
Andrew Cohen 7a7eb324 Merge branch 'master' into internal-policy-ghost 5 年前
Andrew Cohen 930d6fa3 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
GitHub aae58330 Merge branch 'master' into develop-add-inference-examples 5 年前
Andrew Cohen b0c506a6 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 933d7b32 Merge branch 'self-play-mutex' into soccer-2v1 5 年前
Ervin Teng 5e980ec1 Merge branch 'master' into develop-sac-apex 5 年前
Andrew Cohen f41695b9 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Arthur Juliani dc50162d Add ResNet and distributions 5 年前
Andrew Cohen 80469267 Merge branch 'internal-policy-ghost' into soccer-2v1 5 年前
Andrew Cohen 4468280a Merge branch 'soccer-2v1' into asymm-envs 5 年前
Andrew Cohen 9bec75ee Merge branch 'master' into soccer-2v1 5 年前
Arthur Juliani e166d018 Dynamically construct actor and critic 5 年前
Arthur Juliani 6879bae4 Initial optimizer port 5 年前
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 5 年前
Arthur Juliani 2e51260a Resolving a few bugs 5 年前
Andrew Cohen 185d4b35 Merge branch 'soccer-2v1' into asymm-envs 5 年前
Arthur Juliani b997f214 Share more code between tf and torch policies 5 年前
Arthur Juliani 947f0d32 Slightly closer to running model 5 年前
Arthur Juliani 3c82bf59 Training runs, but doesn’t actually work 5 年前
Arthur Juliani 8c6f4696 Fix a couple additional bugs 5 年前
Arthur Juliani 4a50444f Support discrete actions as well 5 年前
Arthur Juliani a11a79e4 Continuous and discrete now train 5 年前
GitHub 232519e4 [refactor] Move output artifacts to a single results/ folder (#3829) 5 年前
Arthur Juliani 82688e5c GRU in-progress and dynamic cnns 5 年前
Arthur Juliani 1736559f Combine actor and critic classes. Initial export. 5 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 5 年前
Arthur Juliani 9835d26c Prepare model for onnx export 5 年前
Chris Elion 68b68396 Merge remote-tracking branch 'origin/master' into release_1_to_master 5 年前
GitHub d2bc86c8 Release 2 cherry pick (#3971) 5 年前
vincentpierre c34dd5b6 Merge branch 'master' into develop-gym-wrapper 5 年前
Arthur Juliani 89ad3020 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Arthur Juliani be7e55e1 Use LSTM and fix a few merge errors 5 年前
Arthur Juliani b7be7f04 Fix bug in probs calculation 5 年前
Arthur Juliani 3eef9d78 Optimize np -> tensor operations 5 年前
Arthur Juliani c02e75d6 Time action sample function 5 年前
Arthur Juliani 039f545a Small performance improvement during inference 5 年前
Ervin Teng 565f92ef Seems to speed it up 5 年前
Christopher Goy ba80b292 format files with pre-commit. 4 年前
Ervin Teng 2fae31e6 Remove another if statement 5 年前
GitHub f7373172 Merge pull request #4385 from Unity-Technologies/release_2_verified-barracuda-1.0.2 4 年前
Ervin Teng 72180f9b Experiment with JIT compiler 5 年前
GitHub abbc6424 [bug-fix] Fix issue with initialize not resetting step count (#3962) 5 年前
vincentpierre 6ddfe74f Merge branch 'master' into develop-gym-wrapper 5 年前
Arthur Juliani 28e095e0 Merge remote-tracking branch 'origin/master' into develop-add-fire 5 年前
Ervin Teng f214836a Changes for speed test 5 年前
GitHub e92b4f88 [refactor] Structure configuration files into classes (#3936) 5 年前
GitHub 335cff3e [versioning] Save ML-Agents version in checkpoints and check on load (#4035) 5 年前
Andrew Cohen e7750fc9 Merge branch 'master' into develop-sampler-refactor 5 年前
Andrew Cohen 34ecc7e6 Merge branch 'master' into asymm-envs 5 年前
GitHub a1c63c4b Release 3 Cherry-pick bug-fixes and doc changes from master (#4102) 5 年前
GitHub 8a49e8e0 [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) 5 年前
Anupam Bhatnagar 4afd8f92 first commit 5 年前
Arthur Juliani 9724c9ac Merge master 5 年前
Arthur Juliani 46874cc7 ONNX exporting 5 年前
yanchaosun c2d6f5c0 basic implementation 5 年前
yanchaosun a9c788d7 new model 5 年前
Arthur Juliani 5d33aca7 Remove double setting 5 年前
GitHub 0d80d87a Fix for discrete actions (#4181) 5 年前
Anupam Bhatnagar 24d5f881 first commit 5 年前
Ervin Teng 510583d2 Move memory validation to settings 5 年前
yanchaosun ac4c80c2 integrate the implementation and hyperparameters 5 年前
Ervin Teng 68169434 Fix discrete actions and GridWorld 5 年前
GitHub cde8bd29 Convert List[np.ndarray] to np.ndarray before using torch.as_tensor (#4183) 5 年前
GitHub bb675bf4 Merge pull request #4134 from Unity-Technologies/develop-removebrainnamepolicy 5 年前
yanchaosun f0881a94 fix commands for cloud training 5 年前
GitHub 05a11c96 Develop add fire exp framework (#4213) 5 年前
yanchaosun 05a96355 remove slim package 5 年前
GitHub 45154f52 Pytorch port of SAC (#4219) 4 年前
yanchaosun ad95032b transfer path 5 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
GitHub 69579611 [refactor] Refactor Actor and Critic classes (#4287) 4 年前
Ruo-Ping Dong 6feec58a add Saver class (only TF working) 4 年前
Ervin Teng bd97532d Add normalizer update context 4 年前
Ruo-Ping Dong 9449d711 fix onnx save path and output_name 4 年前
yanchaosun a80915a8 yaml update 5 年前
Andrew Cohen f74d301a Merge branch 'develop-add-fire' into develop-add-fire-bc 4 年前
Ruo-Ping Dong 6d67f857 move tf and add torch model serialization 4 年前
yanchaosun 666c8ba9 new cloud training change 5 年前
Ruo-Ping Dong 01e60921 add sac checkpoint 4 年前
vincentpierre 599d7e9f Merging master 5 年前
yanchaosun 5eccb4c9 new transfer test for cloud 5 年前
HH 7afa1761 Merge branch 'master' into hh/develop/ragdoll-updates 5 年前
yanchaosun 858b97ec bug fix 5 年前
Ruo-Ping Dong 4e87b422 move checkpoint_path logic to saver 4 年前
yanchaosun da87eae9 predict model fix 5 年前
Ervin Teng 884c97ce Fix policy memory storinig 4 年前
yanchaosun d1e8d344 with swish activation 5 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
Ruo-Ping Dong b4713baa small improvements 4 年前
Ruo-Ping Dong 79d89158 Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
GitHub 3bcb029b [refactor] Remove BrainParameters from Python code (#4138) 5 年前
Ruo-Ping Dong e06812aa fix tests 4 年前
GitHub 839eb2cb Develop model transfer test (#4214) 5 年前
yanchaosun 7e3216ae simple env test 5 年前
yanchaosun cdaaa318 bisim 5 年前
yanchaosun 3d0d359c bisimulation draft 5 年前
yanchaosun 1fdbfe65 no normalization 5 年前
yanchaosun 5a778ca3 fix normalization 5 年前
yanchaosun a212fef9 new bisim implementation 5 年前
HH 0fdac847 Merge branch 'master' into hh/develop/crawler-ragdoll-updates 5 年前
yanchaosun 0e2f6e19 small fix 5 年前
yanchaosun ec929746 minor update 5 年前
GitHub 84440f05 Convert checkpoints to .NN (#4127) 5 年前
Andrew Cohen d0133066 working 5 年前
yanchaosun 9bc90956 fix bug with bisimulation 5 年前
Andrew Cohen b6bf1860 fix bisim metric 5 年前
yanchaosun f8b91faa try to fix the bisim metric 5 年前
yanchaosun ce36349b some changes 5 年前
Andrew Cohen 1b17ae56 add tanh activ 5 年前
yanchaosun 7508a130 small fix 5 年前
Arthur Juliani 6bee0fd1 Merge master 5 年前
yanchaosun caeffa3e add two envs 5 年前
Andrew Cohen 5fa28f5f merge YC changes 5 年前
yanchaosun 28355444 bisim fix, disable stop gradient 5 年前
yanchaosun 3246570c added action encoder, and flags related with action training/transferring; set model_schedule as a changable hyperparameter 5 年前
GitHub 9f041970 Develop bisim action encoder, incorporate related hyperparameter settings (#4253) 5 年前
GitHub 1f5eb9da add pyupgrade to pre-commit and run (#4239) 5 年前
GitHub 129f9ddc [MLA-427] make pyupgrade convert f-strings too (#4244) 5 年前
yanchaosun 80bad241 init sac transfer, and added action encoder to bisim; configs for crawler 4 年前
yanchaosun a505cb16 new config 4 年前
yanchaosun b991096b update target encoder soft copy 4 年前
Andrew Cohen d8c123a0 Merge branch 'master' into sensitivity 4 年前
Andrew Cohen 02df39ab ignore precommit 4 年前
Andrew Cohen 0c7db26a target encoder 4 年前
yanchaosun b74294bf target encoders and new forward loss 4 年前
Andrew Cohen fa35292c write hist to tb 4 年前
yanchaosun 0c468084 sac transfer implementation; disable action encoder 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
yanchaosun 0a1a30d3 sac update 4 年前
GitHub 380fef57 [refactor] Move TF-specific files to tf/ folder (#4266) 4 年前
GitHub beb5aca5 [refactor] Make classes except Optimizer framework agnostic (#4268) 4 年前
yanchaosun 00bb821c fix sac transfer problems 4 年前
Andrew Cohen 302e8e77 no action encoder 4 年前
Andrew Cohen 06e4356c Merge branch 'master' into sensitivity 4 年前
yanchaosun 2b67d1a6 fix crawler config 4 年前
Andrew Cohen 9d7ed6cc op buffer 4 年前
Andrew Cohen 9c012d6a no op buffer no acen 4 年前
Arthur Juliani 1a123641 Merge remote-tracking branch 'origin/master' into r5-master 4 年前
Andrew Cohen 2dec257c no encoder for single task 4 年前
Andrew Cohen 18ff42a6 use mean of first trajectory to initialize the normalizer 4 年前
yanchaosun 6df774ed update: separate model train as an option 4 年前
Andrew Cohen 5878b952 remove blank line 4 年前
Andrew Cohen ce9bcefe cleaned up initialization of variance/mean 4 年前
Ruo-Ping Dong 95858e25 update saver interface and add tests 4 年前
GitHub 9dc1d99e Initialize normalizer with mean/variance from first trajectory (#4299) 4 年前
GitHub ab8e5afa Release 6 fix nan (#4343) 4 年前
GitHub f374f87a [add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324) 4 年前
Anupam Bhatnagar 87bdf353 [skip ci] save model on worker zero only 4 年前
Anupam Bhatnagar d3e8f124 removing horovod from tf policy 4 年前
Anupam Bhatnagar abc1220f Merge branch 'master' into global-variables 4 年前
Andrew Cohen 2cd0de04 action enc 4 年前
Chris Elion d2133d83 comments and cleanup 4 年前
HH 8eaddb61 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Andrew Cohen 12f3786c Revert "action enc" 4 年前
yanchaosun 3ce88589 1 layer everything 4 年前
Ruo-Ping Dong 59cc1a9f Merge branch 'develop-add-fire' into develop-add-fire-checkpoint 4 年前
GitHub 76deba36 Merge pull request #4334 from Unity-Technologies/global-variables 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Ervin Teng cb0085a7 Memory size abstraction and fixes 4 年前
Ervin Teng d65a9326 Merge branch 'master' into develop-add-fire-mm3 4 年前
Ruo-Ping Dong d57aa9ab Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint 4 年前
Andrew Cohen 463db9e8 backprop enc single task 4 年前
GitHub bd6bcd2f Merge master and add Saver class for save/load checkpoints 4 年前
Andrew Cohen 12eda929 try reload all 4 年前
Ervin Teng 42e25b25 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
GitHub 8985a040 Removing the experiment script from add fire (#4373) 4 年前
Andrew Cohen b822283f merge add fire 4 年前
Ervin Teng 6e946dba Policy bugfixes and policy tests 4 年前
Ervin Teng 9ae22c61 Fix SeparateActorCritic export 4 年前
Christopher Goy 5a233353 Merge remote-tracking branch 'origin/master' into release_6-to-master 4 年前
GitHub 03eac72c [add-fire] Add tests and fix issues with Policy (#4372) 4 年前
Andrew Cohen a65d08c7 ghost trainer tests 4 年前
Ervin Teng 116303f1 Typing for torch policy 4 年前
GitHub 49545ce1 Pytorch ghost trainer (#4370) 4 年前
Andrew Cohen 71f9c241 fix tf policy for ghosts 4 年前
GitHub 6a1d993f [add-fire] Memory class abstraction (#4375) 4 年前
GitHub 2332bc32 Add fire to test_simple_rl.py (#4378) 4 年前
yanchaosun 3762358d fix action stop gradient 4 年前
vincentpierre 349cee77 Fix of the test for multi visual input 4 年前
Andrew Cohen 20083987 move tf policy comment 4 年前
Ervin Teng a04e68a4 Merge branch 'develop-add-fire' into develop-add-fire-memoryclass 4 年前
Andrew Cohen effdec13 return copy of state_dict 4 年前
yanchaosun 3ed56471 remove bi-forward-loss 4 年前
Ervin Teng 8ff8c401 Merge branch 'develop-add-fire' into develop-add-fire-export 4 年前
yanchaosun c5d9e376 add bi-forward-loss back 4 年前
GitHub 1955af9e [feature] Add experimental PyTorch support (#4335) 4 年前
yanchaosun 2e927257 separate policy net 4 年前
vincentpierre 108fac9a Replace torch.detach().cpu().numpy() with a utils method 4 年前
HH d9962254 Merge branch 'master' into hh/develop/loco-walker-variable-speed 4 年前
Anupam Bhatnagar 5e8aa485 renaming file from globals.py to global_values.py 4 年前
Anupam Bhatnagar 71c301bc minor fixes 4 年前
Anupam Bhatnagar 1f60979f [skip ci] change self.rank to global_values.get_rank() 4 年前
Anupam Bhatnagar f4f1a8d9 merge master into trainer-plugin branch 4 年前
yanchaosun 1ce53c55 discrete action 4 年前
GitHub 498934f9 Replace torch.detach().cpu().numpy() with a utils method (#4406) 4 年前
Ruo-Ping Dong 27fb4270 brain_name to behavior_name 4 年前
GitHub bfda9576 Replace brain_name with behavior_name (#4419) 4 年前
Ruo-Ping Dong fd1dc3a6 Merge branch 'master' into develop-torch-omp 4 年前
Ruo-Ping Dong f5dee9d1 jit for continuous control 4 年前
GitHub 6f534366 Add torch_utils class, auto-detect CUDA availability (#4403) 4 年前
Ervin Teng fdc887a1 Some experimental stuff 4 年前
Ervin Teng f59f35ea Remove stuff in policy 4 年前
Andrew Cohen 3997b14b Merge branch 'master' into develop-hybrid-actions 4 年前
Ervin Teng 3e771cbb Permute visual obs outside of network 4 年前
Ervin Teng 77c810fb Fix SAC and make utility method 4 年前
Ervin Teng 7754ad7b Don't run value during inference 4 年前
Ervin Teng b6095151 Execute critic with LSTM 4 年前
GitHub 4e4ad7b0 Don't run value during policy evaluate, optimized soft update function (#4501) 4 年前
Ervin Teng f9ff3efe Merge branch 'develop-policyonly' into develop-sac-targetq 4 年前
Andrew Cohen 643c8e58 ppo extended 4 年前
Andrew Cohen db37db34 fixing errors 4 年前
Andrew Cohen 44c9879e action models 4 年前
Andrew Cohen c494bfcc trains successfully 4 年前
GitHub badca342 Rename NNCheckpoint to ModelCheckpoint as Model can be NN or ONNX (#4540) 4 年前
Andrew Cohen 190d8e4d action model as a singleton 4 年前
Ervin Teng 8dec4771 Add hybrid actions to SAC 4 年前
Ervin Teng be159ad3 Make entropy reporting same as TF 4 年前
Ervin Teng 81342148 Revert "Add hybrid actions to SAC" 4 年前
Andrew Cohen fc3027ac tf tests except gail pass 4 年前
Andrew Cohen e5f14400 Merge branch 'master' into develop-hybrid-actions-singleton 4 年前
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 年前
Andrew Cohen 7827ca06 add ActionSpec; test_simple_rl torch passes 4 年前
Andrew Cohen eaecb59e torch utils to and from buffer 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
Andrew Cohen f654df34 fixing tensorflow tests 4 年前
GitHub e0ef30a5 [bug-fix] Change entropy computation and loss reporting in Torch to match TF (#4538) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
vincentpierre a3a9a56b Merge branch 'exp-multi-head-attention' into exp-bullet-hell 4 年前
Ruo-Ping Dong 9e08be87 Merge branch 'master' into release_9_branch_merge 4 年前
Andrew Cohen 6cf54bf2 remove self.action_spec from policy/bc 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
GitHub 87a7ccf8 use int64 steps, check for NaN actions (#4607) 4 年前
GitHub 23800f33 Merge branch 'master' into develop-action-spec 4 年前
GitHub 85a7c0f7 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) 4 年前
Ervin Teng 0548057d Use real clipping (as in TF) 4 年前
GitHub 733bffbf use int64 steps, check for NaN actions (#4607) (#4654) 4 年前
Andrew Cohen b6d10456 removed action_spec.size 4 年前
GitHub 8175d558 [bug-fix] Fix BC module + action clipping (#4667) 4 年前
GitHub 2a8c6800 [bug-fix] Add clipping to PyTorch policy, fix initialization (#4649) (#4662) 4 年前
Ruo-Ping Dong 953cb6bb Merge branch 'master' into develop-windows-delay 4 年前
Andrew Cohen 1bbe492c fixed tests/ -> single validate_action func 4 年前
Ruo-Ping Dong ee5313e4 Merge branch 'master' into develop-windows-delay 4 年前
Ervin Teng 78f88c15 Add clip to export and make optional in policy 4 年前
GitHub f0ed3a38 Cherry-pick BC fixes to Release 10 (#4668) 4 年前
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 年前
Ervin Teng 7a0ebfbd Pretty broken 4 年前
Ervin Teng 95bdbba3 Less broken PPO 4 年前
vincentpierre b863af57 Removing TensorFlow Trainers 4 年前
Ervin Teng 98948c59 Skip critic when given empty memory array 4 年前
Ervin Teng 4158629e Properly feed in None rather than empty arrays 4 年前
Andrew Cohen d8dd5e44 Merge branch 'master' into develop-action-buffer 4 年前
Andrew Cohen bd917c9c action buffer passes continuous 4 年前
Andrew Cohen b36fcf16 discrete runs/cont passes 4 年前
Andrew Cohen ad951493 debugging discrete 4 年前
Andrew Cohen fcf6471e 2d discrete passes 4 年前
Ervin Teng fdaa8c3d Merge branch 'develop-unified-obs' into develop-centralizedcritic 4 年前
GitHub 990f801a Develop hybrid action staging (#4702) 4 年前
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 年前
Ervin Teng 6846af21 Multi-input network 4 年前
Andrew Cohen 85e4db33 bc tests pass 4 年前
vincentpierre 93ca1409 fixing the tests 4 年前
vincentpierre 7a5cc9ec Merge master into develop-rm-tf 4 年前
Ervin Teng cb4b7ed3 Some minor tweaks but still broken 4 年前
vincentpierre 12619155 added some docstrings 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Andrew Cohen 8172b3d6 test_simple_rl/reward providers pass tf/torch 4 年前
Andrew Cohen 4ebc6c44 ml-agents-envs pass 4 年前
GitHub 8ab2e619 update type of evaluate_actions to list tensor (#4747) 4 年前
Arthur Juliani 0d2f8887 Merge remote-tracking branch 'origin/master' into goal-conditioning 4 年前
GitHub a0d1c829 Action Docs part2 (#4739) 4 年前
GitHub cc6b4564 Multi Directional Walker and Initial Hypernetwork (#4740) 4 年前
Ervin Teng 25dfd883 Merge branch 'master' into develop-centralizedcritic 4 年前
vincentpierre bc9d3975 merge master 4 年前
Andrew Cohen cd73cce2 test_trajectory fixed 4 年前
GitHub ad5f878c [refactor] Remove critic pass during inference (#4743) 4 年前
GitHub 11687f8d [cherry-pick] Cherry-pick #4743 into Release 11 (#4756) 4 年前
GitHub 22658a40 use sensor types to differentiate obs (#4749) 4 年前
GitHub 903d3afe Merge pull request #4707 from Unity-Technologies/develop-rm-tf 4 年前
vincentpierre 14378aa5 Merging master 4 年前
Andrew Cohen 97d94a83 fix test_tf_policy 4 年前
vincentpierre 0c81006d addressing comments 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub c2783e70 remove "discrete nan" check (#4770) (#4775) 4 年前
GitHub 525a4308 remove "discrete nan" check (#4770) 4 年前
Andrew Cohen 6174c428 move action model to explicit distributions 4 年前
vincentpierre c3699de8 merging master and addressing comments 4 年前
Andrew Cohen 453a2bba ActionTuple default is now np.array, not None 4 年前
GitHub 29d94c7c Merge pull request #4734 from Unity-Technologies/develop-obs-as-list 4 年前
Andrew Cohen 1d234d1d bc works 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen c0d01baf Merge branch 'master' into merge-release11-master 4 年前
Andrew Cohen 95566e44 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
Andrew Cohen e81e68de comms agent and fixed hallway 4 年前
vincentpierre 44ed3258 Merging master 4 年前
Andrew Cohen 3457cd3c save only discrete actions as prev 4 年前
Andrew Cohen da5b8420 update make_empty docstring 4 年前
Andrew Cohen 411b0a19 reuse action dict in torch policy for pre_action 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
Andrew Cohen 17496265 move AgentAction, ActionLogProbs, and ActionFlattener to separate files 4 年前
Chris Elion 76ebc20c Merge remote-tracking branch 'origin/master' into r12-to-master 4 年前
GitHub 458fee17 Merge pull request #4763 from Unity-Technologies/develop-att 4 年前
vincentpierre 36cc4665 Removing some vis and vec fields from policy.py 4 年前
GitHub 9d8a7d6f Update ml-agents/mlagents/trainers/policy/tf_policy.py 4 年前
Ervin Teng 330fc1d0 Merge branch 'master' into develop-centralizedcritic-mm 4 年前
Andrew Cohen 60309d8f fix torch policy tests 4 年前
vincentpierre 519c5f47 merging master 4 年前
Ruo-Ping Dong 8ed14762 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Andrew Cohen 7ba10239 remove action spec attribute from policy 4 年前
vincentpierre 318622a6 removing the extra code to set normalize to false if no observation can be normalized 4 年前
Andrew Cohen 662fd6b1 added docstrings to action flattener 4 年前
GitHub a272bff0 Merge pull request #4803 from Unity-Technologies/develop-remove-vec-vis-fields 4 年前
GitHub cc948a41 Policy output actiontuple (#4651) 4 年前
GitHub 7387a77f remove pylint (#4836) 4 年前
Andrew Cohen 886883b3 Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton 4 年前
Arthur Juliani 0b4b0992 Rename more files 4 年前
Ervin Teng aba633b2 Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm 4 年前
Andrew Cohen 85b18389 fix test tf policy 4 年前
Ruo-Ping Dong a7d04be6 Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp 4 年前
Arthur Juliani 0a876b9c Fix typos 4 年前
Arthur Juliani e3de0406 Plurals 4 年前
Ruo-Ping Dong 180d3e20 Merge branch 'develop-centralizedcritic-mm' into develop-cc-teammanager 4 年前
HH 0024a286 merge ervin's new stuff 4 年前
GitHub 67ad9651 Merge pull request #4825 from Unity-Technologies/sensor-types 4 年前
vincentpierre 8660b1c2 merging master 4 年前
Ervin Teng 457b2630 I think it's running 4 年前
brccabral 457fb612 Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents 4 年前
Andrew Cohen 6e1826f8 might be right 4 年前
vincentpierre 52b011d6 _ 4 年前
vincentpierre 6f3ea7b8 _ 4 年前
Andrew Cohen a4c336c2 value estimator 4 年前
Andrew Cohen 9af22d30 use only value funcs 4 年前
Ervin Teng 3283b6a1 Remove Q-net for perf 4 年前
Ervin Teng b6f88d6d Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen f73b9dba update policy to not use critic 4 年前
Andrew Cohen 9b92f5fb remove commented code 4 年前
Ervin Teng 0bde7598 Back out trainer changes 4 年前
Ervin Teng 514873bf Use correct memories (t-1 instead of t) for training 4 年前
Ervin Teng 24ee4bd5 Merge remote-tracking branch 'origin/develop-critic-optimizer' into develop-critic-optimizer 4 年前
Ervin Teng 7014404a Add presence check 4 年前
Andrew Cohen c74dca9f add SharedActorCritic 4 年前
Ervin Teng f3a2a81f Merge branch 'develop-fix-lstms' into develop-gru 4 年前
Andrew Cohen 00b891df fix sac shared 4 年前
Ervin Teng 219e773b Merge branch 'develop-fix-lstms' into develop-critic-op-lstm 4 年前
Ervin Teng e46a86ad Merge branch 'master' into develop-superpush-int 4 年前
HH 15d512f9 Merge branch 'master' into hh/develop/dodgeball 4 年前
GitHub 338af2ec Move the Critic into the Optimizer (#4939) 4 年前
HH 4c947151 Merge branch 'main' into hh/develop/dodgeball 4 年前
Ervin Teng 61781a1a Merge branch 'main' into develop-agentprocessor-teammanager 4 年前
Andrew Cohen 9060da06 Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer 4 年前
Arthur Juliani 06c147f8 Merge remote-tracking branch 'origin/main' into goal-conditioning-new 4 年前
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 4 年前
Ervin Teng fd0dd35c Merge branch 'main' into develop-coma2-trainer 4 年前
Ervin Teng c8137dcd Merge branch 'main' into develop-superpush-int 4 年前
GitHub f16ce486 Update v2-staging from main (March 15) (#5123) 4 年前
Christopher Goy 921ba4f0 Update v2-staging from main (March 15) (#5123) 4 年前
Christopher Goy ebe45056 Merge branch 'main' into release_14_branch-to-main 4 年前
Chris Elion 970f1d40 Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec 4 年前
GitHub 2e19759c Turning some logger.info into logger.debug and remove some logging overhead when not using debug (#5211) 4 年前
GitHub 28eb43dd [bug-fix] Delete .pt checkpoints past keep-checkpoints (#5271) 4 年前
Miguel Alonso Jr 4846cf0f Merge branch 'main' into develop-api-documentation-update 4 年前