Update ml-agents/mlagents/trainers/torch/utils.py

Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>
4 年前 · 559609a2
--- a/ml-agents/mlagents/trainers/torch/utils.py
+++ b/ml-agents/mlagents/trainers/torch/utils.py
    discrete log probs of individual actions as well as all the log probs for an entire branch.
    Utility functions provide numpy <=> tensor conversions to be used by the optimizers.
    :param continuous_tensor: Torch tensor corresponding to log probs of continuous actions
-    :param discrete_list: List of Torch tensors each corresponding to log probs of discrete actions
+    :param discrete_list: List of Torch tensors each corresponding to log probs of the discrete actions that were sampled.
-    a discrete action branch
+    a discrete action branch, even the discrete actions that were not sampled. all_discrete_list is a list of Tensors,
+    each Tensor corresponds to one discrete branch log probabilities. 
    """

    continuous_tensor: torch.Tensor