60 次代码提交 (19ee6e5e-2a16-4754-b9af-e4faeb2818da)

作者 SHA1 备注 提交日期
GitHub 36048cb6 Moving Env Manager to Trainers (#3062) The Env Manager is only used by the trainer codebase. The entry point to interact with an environment is UnityEnvironment. 5 年前
GitHub f058b18c Replace BrainInfos with BatchedStepResult (#3207) 5 年前
GitHub 43f23ee3 WIP : Changes to the LL-API - Refactor of “done” logic (#3681) 5 年前
Arthur Juliani 7c3bd376 Refactoring policy and optimizer 4 年前
Arthur Juliani b997f214 Share more code between tf and torch policies 4 年前
Arthur Juliani 1736559f Combine actor and critic classes. Initial export. 4 年前
Arthur Juliani ca887743 Support tf and pytorch alongside one another 4 年前
Arthur Juliani be7e55e1 Use LSTM and fix a few merge errors 4 年前
Arthur Juliani 9724c9ac Merge master 4 年前
GitHub 0d80d87a Fix for discrete actions (#4181) 4 年前
GitHub a28e2767 Update add-fire to latest master, including Policy refactor (#4263) 4 年前
Ruo-Ping Dong 6feec58a add Saver class (only TF working) 4 年前
Ruo-Ping Dong 6d67f857 move tf and add torch model serialization 4 年前
Ruo-Ping Dong 71fe4df6 fix formatting and test 4 年前
GitHub 1b098c9a Refactor TFPolicy and Policy (#4254) 4 年前
GitHub 25dc8c3d Add Saver Class to handle all save/load/checkpoint/export work (#4323) 4 年前
Andrew Cohen 643c8e58 ppo extended 4 年前
Andrew Cohen db37db34 fixing errors 4 年前
Andrew Cohen fc3027ac tf tests except gail pass 4 年前
GitHub e4db5dc5 ActionSpec and ActionBuffer (#4578) 4 年前
Andrew Cohen eaecb59e torch utils to and from buffer 4 年前
Andrew Cohen 8013e544 ignoring Instance of 'AbstractContextManager' has no 'enter_context' member (no-member) 4 年前
GitHub cb8e4d25 Add ActionSpec (#4586) 4 年前
Andrew Cohen 9689cf2c remove *_action_* from function names 4 年前
Andrew Cohen 6cf54bf2 remove self.action_spec from policy/bc 4 年前
GitHub b853e5ba Action buffer (#4612) 4 年前
GitHub 3c96a3a2 Action Model (#4580) 4 年前
GitHub 88d3ec3e Merge master into hybrid actions staging branch (#4704) 4 年前
GitHub 87a7ccf8 use int64 steps, check for NaN actions (#4607) 4 年前
Andrew Cohen b6d10456 removed action_spec.size 4 年前
Andrew Cohen 3f771e61 add ActionBuffers and utils 4 年前
Andrew Cohen d8dd5e44 Merge branch 'master' into develop-action-buffer 4 年前
Andrew Cohen bd917c9c action buffer passes continuous 4 年前
vincentpierre 735fcd52 [WIP] Refactor trainers to use list of obs rather than vec and vis obs 4 年前
vincentpierre c1587bce Solving merge conflicts 4 年前
Andrew Cohen 4ebc6c44 ml-agents-envs pass 4 年前
GitHub 22658a40 use sensor types to differentiate obs (#4749) 4 年前
vincentpierre 8cb050ef WIP Made initial changes to enale dimension properties and added attention module 4 年前
Andrew Cohen 498b1ee6 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
GitHub c2783e70 remove "discrete nan" check (#4770) (#4775) 4 年前
vincentpierre c3699de8 merging master and addressing comments 4 年前
Andrew Cohen 453a2bba ActionTuple default is now np.array, not None 4 年前
Andrew Cohen 1d234d1d bc works 4 年前
vincentpierre 719c969c addressing comments. ObservationSpec is no longer a list 4 年前
vincentpierre 4bba4e8e Renaming ObservationSpec to SensorSpec 4 年前
Andrew Cohen 95566e44 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre c5a057d2 renaming obs_spec variables 4 年前
vincentpierre 44ed3258 Merging master 4 年前
Andrew Cohen 3457cd3c save only discrete actions as prev 4 年前
vincentpierre 449712b0 renaming sensor_spec to sensor_specS 4 年前
Andrew Cohen 35769b53 Merge branch 'develop-action-buffer' into develop-hybrid-actions-singleton 4 年前
vincentpierre 36cc4665 Removing some vis and vec fields from policy.py 4 年前
vincentpierre 519c5f47 merging master 4 年前
Andrew Cohen 7ba10239 remove action spec attribute from policy 4 年前
vincentpierre 318622a6 removing the extra code to set normalize to false if no observation can be normalized 4 年前
GitHub cc948a41 Policy output actiontuple (#4651) 4 年前
vincentpierre 52b011d6 _ 4 年前
Ervin Teng 514873bf Use correct memories (t-1 instead of t) for training 4 年前
Ervin Teng 7014404a Add presence check 4 年前
GitHub d36a5242 Python Dataflow for Group Manager (#4926) 4 年前