浏览代码
Hybrid FoodCollector (#4746) (#4765)
Hybrid FoodCollector (#4746) (#4765)
* use continuous action for moving and discrete for shooting * update models/release_11_branch
GitHub
4 年前
当前提交
0ffad9aa
共有 19 个文件被更改,包括 2120 次插入 和 2006 次删除
-
1001Project/Assets/ML-Agents/Examples/FoodCollector/Demos/ExpertFood.demo
-
62Project/Assets/ML-Agents/Examples/FoodCollector/Prefabs/FoodCollectorArea.prefab
-
52Project/Assets/ML-Agents/Examples/FoodCollector/Prefabs/GridFoodCollectorArea.prefab
-
32Project/Assets/ML-Agents/Examples/FoodCollector/Prefabs/VisualFoodCollectorArea.prefab
-
72Project/Assets/ML-Agents/Examples/FoodCollector/Scripts/FoodCollectorAgent.cs
-
3com.unity.ml-agents/CHANGELOG.md
-
14com.unity.ml-agents/Editor/DemonstrationDrawer.cs
-
19com.unity.ml-agents/Runtime/Communicator/GrpcExtensions.cs
-
7com.unity.ml-agents/Runtime/Policies/BrainParameters.cs
-
9docs/Learning-Environment-Examples.md
-
31ml-agents/mlagents/trainers/demo_loader.py
-
467Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/FoodCollector.onnx
-
14Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/FoodCollector.onnx.meta
-
1001Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/GridFoodCollector.onnx
-
14Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/GridFoodCollector.onnx.meta
-
11Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/FoodCollector.nn.meta
-
305Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/FoodCollector.nn
-
1001Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/GridFoodCollector.nn
-
11Project/Assets/ML-Agents/Examples/FoodCollector/TFModels/GridFoodCollector.nn.meta
1001
Project/Assets/ML-Agents/Examples/FoodCollector/Demos/ExpertFood.demo
文件差异内容过多而无法显示
查看文件
文件差异内容过多而无法显示
查看文件
|
|||
pytorch1.7:�� |
|||
@ |
|||
vector_observation23Concat_0"Concat* |
|||
axis���������� |
|||
� |
|||
23 |
|||
/network_body.linear_encoder.seq_layers.0.weight |
|||
-network_body.linear_encoder.seq_layers.0.bias24Gemm_1"Gemm* |
|||
alpha �?�* |
|||
beta �?�* |
|||
transB� |
|||
|
|||
2425 Sigmoid_2"Sigmoid |
|||
|
|||
24 |
|||
2526Mul_3"Mul |
|||
� |
|||
26 |
|||
/network_body.linear_encoder.seq_layers.2.weight |
|||
-network_body.linear_encoder.seq_layers.2.bias27Gemm_4"Gemm* |
|||
alpha �?�* |
|||
beta �?�* |
|||
transB� |
|||
|
|||
2728 Sigmoid_5"Sigmoid |
|||
|
|||
27 |
|||
2829Mul_6"Mul |
|||
� |
|||
29 |
|||
/action_model._continuous_distribution.mu.weight |
|||
-action_model._continuous_distribution.mu.bias30Gemm_7"Gemm* |
|||
alpha �?�* |
|||
beta �?�* |
|||
transB� |
|||
|
|||
5632Exp_8"Exp |
|||
K |
|||
action_masks33Slice_9"Slice* |
|||
axes@�* |
|||
ends@�* |
|||
starts@ � |
|||
� |
|||
29 |
|||
5action_model._discrete_distribution.branches.0.weight |
|||
3action_model._discrete_distribution.branches.0.bias34Gemm_10"Gemm* |
|||
alpha �?�* |
|||
beta �?�* |
|||
transB� |
|||
135Constant_11"Constant* |
|||
value*J ��� |
|||
|
|||
33 |
|||
3536Mul_12"Mul |
|||
137Constant_13"Constant* |
|||
value*J �?� |
|||
|
|||
36 |
|||
3738Add_14"Add |
|||
|
|||
34 |
|||
3339Mul_15"Mul |
|||
140Constant_16"Constant* |
|||
value*J ��L� |
|||
|
|||
38 |
|||
4041Mul_17"Mul |
|||
|
|||
39 |
|||
4142Sub_18"Sub |
|||
* |
|||
4243 |
|||
Softmax_19"Softmax* |
|||
axis� |
|||
= |
|||
3044RandomNormalLike_20"RandomNormalLike* |
|||
dtype� |
|||
|
|||
44 |
|||
3245Mul_21"Mul |
|||
|
|||
30 |
|||
4546Add_22"Add |
|||
5 |
|||
4647Clip_23"Clip* |
|||
max @@�* |
|||
min @�� |
|||
) |
|||
47 |
|||
57continuous_actionsDiv_24"Div |
|||
151Constant_25"Constant* |
|||
value*J���3� |
|||
|
|||
43 |
|||
5152Add_26"Add |
|||
|
|||
5253Log_27"Log |
|||
6 |
|||
53discrete_actions Concat_28"Concat* |
|||
axis� |
|||
<memory_sizeConstant_29"Constant* |
|||
value* |
|||
J �torch-jit-export*B56JX����cý "н*B57J @@*AB-action_model._continuous_distribution.mu.biasJ3=אG���*��B/action_model._continuous_distribution.mu.weightJ�X �V:=!�����;@D��w{=�m ��F���y��Q����d=�����M<�F,=a���$���q���]��<@��=�����ւ�<�z=�<Q$=�$j<GE��E��=w���w=��4=�i<~9Fd�=o<�;�]5=?��=��<_Y�3�P=�滚��<�H�<�X�=
���6�������-�+�.=Q����f� |
|||
[˼�4b�M9h���ƻR����9�<H�<��9�4���@�;�`�n��=@U�C�d�����V�ѽ�? =�tA=�q�=T#��M����Cg��U�=A�#=(�l��p9<��k=�%�<�%�=��g�{�G�c����T��L�?�<N����&8�2ĽW��=e#Ľ�MK�灗�c,��S���1o]�D�=&Iν���-J��b/k;�}�� S=��=�i�;Ds�<��4�j"��͓�D F=T����]�<+��<�G-<Y�n�X=5��fp=��k��7�<qY���<�ME�e�1�*�b=?��±ϽZ?�<�ډ��2�=|�=��H�f}B=a�ƽ��A<D�*�*a�=]�����<x����@��N��=��/���W��<�ّ�ٌ-���,�l��[�<¦\=����!��k�Ṭ�!<�"b<���<�����c�_�� ��=�yܼCn���E�h࠽��=8"�<����|�=;l�=L�ż��bB�=�ၼ��[��*/�˕��MP �l�����9}�����½_���L=��j=�l��_�|=�S������W��=�q������ټ�Ȁ=LHo�l5R��!˼�v�;��ҽ�ü��4=�C�=@���=$R=T0�;�`����M=��<+�Q�=��BI��/�65U=�=��o=�(��S_<��1�EIr�lǶ�̣p��i
<�3����=䫜=!V�;�F�ɞ�=[s�f���Q��b�;��3�U�)=,��;��<؋=��<;;=t���%���|F=�W�=�s�=���b�e��ح���)�ٮ%=^��� |
|||
M=��Ҽ�j�c]���P���Z=M�����y=�7D����<������j=��C=�c�=E 꾽 �5=���=��i=�/1= �5�1V^=�n =�� �R������<��N=�"�=��L��x=R`�;k�%��/�<Sj7=�qY<|���G/R=tn=t�<�}�<d�C<�v��j�L=~�ʽ$�o����<�b�=� |
|||
y<�Ҋ=T�5;����o�~<t =�*=�?=�7B=S�8=_�;1�<r@S��C��a�A=�����-=�=�ڮ:v�b�kjN==�W�����p��fp=��%0=rr�<N�;�Y�;�+<��7<��V�h�3=0�K��Wq�9Gֻ�n�=����_`�<-��;:,�;$ڋ��F,=�L��K���u���i��=� �= |
|||
�"=��k<R <�C��. |
|||
:�+�=���=b���߲-=n��<0R��0��=�A���|�=��3��\9=ԎK�F2�<�$��Z��v/�����(>��4wm=���a�-��+��=\�N=<Hh=���=A��Z=�;�<*CB3action_model._discrete_distribution.branches.0.biasJ��˻���;*��B5action_model._discrete_distribution.branches.0.weightJ����4=���L��� |
|||
�� �;��Ҽ{약>Ii�Vւ�Mf5=����%��<�ں<ɂ&�,+8�<�-b:�
=�#=fb><�%=p.=~��<?�=�ǎ;�X�<y��; �x<�$=8 �<��=�+=T��=ؖq�Nz=��<Y�7=�K=�^�M�C=k� ���^����<&�4=�X�<��K�[�� z�D=?�T� +�<P���jb�<�� =O``< jH���� |
|||
���'��mR;/��;?��ZO�=��;$l�r�����l�!;[=��4==���<T�@�3;�;�2N=��"=�A��.μ�G=�*ۼ`�*=����X���պ� =�p9����<��8���:=�m����<���Gt#�z</��<v�;(�_;��<9�R�ny�P��<�n��NYL<v��<J4�<tj=��� |
|||
�3=1^��b�껹��<^~<��=S��<^��;SK�����;�uw�����J�!����3��q�<�-O=e�G={��V�3=�=4r�=��Y�;f�=��<'1�<�|�1��<2?|=��;<�]=��@�OSQ���м6��.�;~��;U\�!�P���:����������ů(��r��V�������K7��y;�� |
|||
��G���06�N���±Ҽ��G���;<�+ �vE��a��9�Bht<{e)���<g�o=%� �4�%�& ����=�Q=���<%��vP��0�;�)4<�{@�C|I���%�IR`��/;��<�� =���:��<g�Y<�>��d����$o=���=�1=7ը�����W���֦����<O��:yG8�Y |
|||
(��Z=sQ=ˤ�+kj<(u����<� |
|||
=�G���p��,�:�����:��%� |
|||
��=uy"�Z��=-�{=ZhE��S��ͅd;d�+��a*�!t=���=7����<&qT����j�*��F༡��<o�*��"�<GG;�F���һ>!b��G��ƿ�JN�<�ʌ9n#�;,�<
L=�*�;.�U=�o��f1�����"�<q����;**Bcontinuous_action_output_shapeJ @@*(Bdiscrete_action_output_shapeJ @*��B-network_body.linear_encoder.seq_layers.0.biasJ�7J7<��4=x�=���<��]=N#!=��U=�D�=硒<Iό=![�=���=I=+��h�<f&<�/p=i\�=cw=�i=K����g�<��W=�$�;/� |
|||
��&<��(���s�c�G=-3=�8O<q׀�@=�=�m��;?��=���W�A=V�=�;�=@�7��=H�<&iF=��-��g�����;To�;8�R�갣=g� =)d1=@��=�@<=��.< |
|||
|