Vincent Gao
a32ff913
make the Ball3DDecision code's comment less than 80 characters in length
7 years ago
eshvk
78906771
[docs] Rework hyperparameter wordings and alternative to PPO jupyter notebook patches
7 years ago
GitHub
a7c9096f
[Semantics] Modified the placeholder names (#381)
7 years ago
Vincent Gao
ba0ecf24
fixed other tabs and spaces
7 years ago
eshvk
9082f15c
[docs] Provide alternative to PPO jupyter notebook
7 years ago
GitHub
e0d5b1b0
Fix for when not using teacher helper (#379)
* Fix for when not using teacher helper
* Rename expert to teacher throughout
7 years ago
Vincent Gao
1c2c800a
Fixed all the // without a space after it
7 years ago
eshvk
e33a083f
[docs] Update PPO hyperparameter wordings
7 years ago
GitHub
430a5486
[Semantics] renaming StateType to SpaceType (#382)
7 years ago
Vincent Gao
38bd3e40
replaced all the tabs with 4 spaces in the project
7 years ago
GitHub
26a1ed87
Merge pull request #380 from Unity-Technologies/dev-reacher-cleanup
Add comments to Reacher and re-train model w/o epsilon needed
7 years ago
Vincent Gao
e0b60e56
removed the tab
7 years ago
GitHub
0838c2bc
Merge pull request #378 from Unity-Technologies/docs/semantics-internal-brain
Docs/semantics internal brain
7 years ago
vincentpierre
0d2a5031
[Documentation] cleaned repo and additional semantic changes
7 years ago
Vincent Gao
4a23c5cf
clean up the code in Ball3DDecision
7 years ago
vincentpierre
6c55017e
[FixingPytests] Added the new Semantic and modified the pytest
7 years ago
Arthur Juliani
22d931c0
Add comments to Reacher and re-train model w/ epsilon needed
7 years ago
Vincent Gao
933317be
modified comments
7 years ago
GitHub
d4cd72d8
[timeBetweenDecisions] Reimplementation of waitTime for GridWorld and… (#368)
* [timeBetweenDecisions] Reimplementation of waitTime for GridWorld and Basic
* [EnvironmentModification] Changed the gridworld TimeBetweenDecisionAtInference
7 years ago
GitHub
a809630f
Add config for crawler, and change crawler scene (#376)
* Add config for crawler, and change crawler scene
* Changed number of crawlers in scene to 12
* Changed Max-steps for crawlers to 5000
* Newer hyperparameters and newly trained crawler model
* Clean up crawler code, and improve efficiency
7 years ago
GitHub
171e551b
[Semantics] Modified some fields of the CoreInternalBrain (#373)
7 years ago
GitHub
dcf58f75
Feature/previous text action (#375)
* [Previous Text Actions] Renamed previous_action to previous_vector_action
added previous_text_action to the BrainInfo
* [Semantics] Carried the modifications to the semantics of previous_vector_action to the trainers
7 years ago
GitHub
06fa6616
Docs/new semantics (#370)
* [Semantics] Modified the semantics for the documentation
* [Semantics] Updated the images
* [Semantics] Made further changes to the docs based on the comments received
7 years ago
GitHub
704aab24
[AcademyFirstReset] Changed the first reset logic of the academy to be consistent between training and inference (#369)
7 years ago
GitHub
69481d2d
Imitation Learning Helper (#371)
* Add helper class for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".
7 years ago
GitHub
2b66e6fb
Merge pull request #372 from Unity-Technologies/feature/unityfilesasbinaries
[git] Use .gitattributes to treat all Unity assets as binaries
7 years ago
vincentpierre
edb5ccdb
[git] Use .gitattributes to treat all Unity assets as binaries
7 years ago
GitHub
f134016b
On Demand Decision (#308)
* On Demand Decision : Use RequestDecision and RequestAction
* New Agent Inspector : Use it to set On Demand Decision
* New BrainParameters interface
* LSTM memory size is now set in python
* New C# API
* Semantic Changes
* Replaced RunMDP
* New Bouncer Environment to test On Demand Decision
7 years ago
GitHub
99103b29
Use `curr_brain_info`
7 years ago
GitHub
2bba53b8
Merge pull request #367 from Unity-Technologies/feature/LSTM2
Hallway & LSTM Improvements
7 years ago
GitHub
f8d27dc5
Merge branch 'development-0.3' into feature/LSTM2
7 years ago
Arthur Juliani
c3644f56
Buffer fix for properly masking gradients
7 years ago
GitHub
9ad4182e
Merge pull request #366 from Unity-Technologies/feature/cleanup
[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.
7 years ago
eshvk
030ac5c5
[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.
7 years ago
Arthur Juliani
b8a4f5f1
Add Hallway environment to validate LSTM models
7 years ago
Arthur Juliani
85ae912d
Dev docs (#361)
New documentation structure and content.
7 years ago
GitHub
a3c7b426
Merge pull request #357 from Unity-Technologies/feature/containerization
Feature/containerization
7 years ago
eshvk
64b6abf6
[Containerization] Docs clean up
7 years ago
eshvk
fbb1a3d2
[containerization] Added screenshot of Docker Build Settings
7 years ago
eshvk
2d85a873
[containerization] Use image name rather than tag name.
7 years ago
eshvk
44a16f6b
Merge branch 'feature/containerization' of https://github.com/Unity-Technologies/ml-agents into feature/containerization
7 years ago
eshvk
218887c6
[Containerization] Minor fixes
7 years ago
eshvk
75a14ac8
[Hotfix] Upgrade Tensorflow to 1.4.0
7 years ago
eshvk
6c1b6fe5
[Containerization] Minor fixes
7 years ago
eshvk
e4ef7ea3
[containerization] updated docs per Vince and Yuan's comments
7 years ago
eshvk
b4bad6bb
[Hotfix] Upgrade Tensorflow to 1.4.0
7 years ago
eshvk
9345614c
[cleanup] Use debug mode for some log messages
7 years ago
eshvk
6a19ae80
[containerization] updated docs per Vince and Yuan's comments
7 years ago
Arthur Juliani
cbe42506
More text changes
7 years ago
eshvk
403e4aef
[cleanup] Use debug mode for some log messages
7 years ago