ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	982fab41	Initial commit	7 年前
vincentpierre	cde3c8f7	formating and added documentation	7 年前
Arthur Juliani	cfceb9f4	Fix timestep for PPO.ipynb	7 年前
GitHub	410f8709	Merge pull request #4 from Unity-Technologies/ppo-timestep Fix timestep for PPO.ipynb	7 年前
vincentpierre	c4745ba7	fix on the socket timeout error on windows due to the use of signal.SIGALRM	7 年前
vincentpierre	bddfb85e	changed the connection to non-blocking	7 年前
GitHub	a14c4a4f	Merge pull request #9 from Unity-Technologies/socket-timeout-fix fix on the socket timeout error	7 年前
Alexander Scheurer	3152c971	--keep-checkpoints=<n> option for ppo.py	7 年前
GitHub	64037ccb	Merge pull request #16 from ASPePeX/keep-checkpoints --keep-checkpoints=<n> option for ppo.py	7 年前
Arthur Juliani	2133d9cb	Remove scipy from requirements	7 年前
Arthur Juliani	71591043	PPO additions and warnings * Add linear decay to learning rate for PPO * Add warning/exception for unsupported brain configurations w/ PPO	7 年前
vincentpierre	e36b8bf0	added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards	7 年前
Arthur Juliani	0c7debb9	Change tensorboard command to work on Windows	7 年前
vincentpierre	7118a209	bug fix : The environment only requests actions from external brains when unique	7 年前
vincentpierre	e191fbef	added warning in case no brins are set to external	7 年前
vincentpierre	65df8ae9	fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step	7 年前
vincentpierre	0df8326e	minor fixes	7 年前
GitHub	15fea1be	Update README.md	7 年前
GitHub	daf205da	Merge pull request #35 from Unity-Technologies/fix-docs Move Wiki to Docs directory	7 年前
GitHub	aee5d336	Fix discrete state (#33 ) * made BrainParameters a class to set default values Modified the error message if the state is discrete * Add discrete state support to PPO and provide discrete state example environment * Add flexibility to continuous control as well * Finish PPO flexible model generation implementation * Fix formatting * Support color observations * Add best practices document * bug fix for non square observations * Update Readme.md * Remove scipy dependency * Add installation doc	7 年前
vincentpierre	3f85bb56	Merge branch 'master' into dev-broadcast	7 年前
Arthur Juliani	cd3bfb87	Added worker-id flag and pass through to enviroment in order to more easily manage multiple running simulations. (#40 )	7 年前
Arthur Juliani	adac2683	Fix for multi-agent with observations	7 年前
Arthur Juliani	c190eb22	Randomize ppo training batch	7 年前
vincentpierre	431fc43c	Merge branch 'master' of https://github.com/Unity-Technologies/ml-agents into dev-broadcast	7 年前
GitHub	4b7e0d4b	Make clear meaning of <env_name>	7 年前
vincentpierre	5cae720d	modified Environment to send a specific error when no external brains are in the environment	7 年前
vincentpierre	ac910514	initial commit of the curriculum with broadcast. Improved the Unity python handshake	7 年前
vincentpierre	360984c4	curriculum.json params must have 4 entries	7 年前
vincentpierre	e8429059	bug fix for python3	7 年前
vincentpierre	250eb8e1	better checking of the format of the curriculum file	7 年前
vincentpierre	d421a300	updated the tests of unityagents	7 年前
vincentpierre	c16e0ac3	modified the socket to receive states and images of any size	7 年前
Arthur Juliani	b6ce30bf	Add curriculum support to PPO	7 年前
Arthur Juliani	e6696ed3	Don't print	7 年前
Arthur Juliani	4a11c005	Add curriculum code to notebook and simplify	7 年前
Arthur Juliani	06d9bbec	Log lesson in TensorBoard	7 年前
vincentpierre	3b00302a	merging dev-broadcast-curriculum	7 年前
vincentpierre	22db3d64	added the modified files from dev-cooperative-env	7 年前
vincentpierre	2b8353b2	porting the changes on ppo.py and removing AgentMonitor.cs	7 年前
vincentpierre	d71ee998	changes on the ppo.py	7 年前
vincentpierre	6e950cd3	Can now switch inference configuration on/off in the editor. Reintroduced the broadcast feature for the non-External brains. Introduced the API number to check the compatibility between Unity and Python.	7 年前
Arthur Juliani	d1b81a32	Add push curriculum	7 年前
Arthur Juliani	51f23cd2	0.2 Update * added broadcast to the player and heuristic brain. Allows the python API to record actions taken along with the states and rewards * removed the broadcast checkbox Added a Handshake method for the communicator The academy will try to handshake regardless of the brains present Player and Heuristic brains will send their information through the communicator but will not receive commands * bug fix : The environment only requests actions from external brains when unique * added warning in case no brins are set to external * fix on the instanciation of coreBrains, fix on the conversion of actions to arrays in the BrainInfo received from step * default discrete action is now 0 bug fix for discrete broadcast action (the action size should be one in Agents.cs) modified Tennis so that the default action is no action modified the TemplateDecsion.cs to ensure non null values are sent from Decide() and MakeMemory() * minor fixes * need to convert the s...	7 年前
Arthur Juliani	b56259f6	Fix cumulative reward (Unity) and Nan reward (python) bugs	7 年前
Arthur Juliani	5ef4be55	Fix curriculum smoothing, and use reward for push curriculum	7 年前
Arthur Juliani	216888ee	Fixed to give lesson index parameter when start up (#179 ) * fixed to give lesson parameter when start up * applied to PPO.ipynb and modified ppo.py a bit	7 年前
vincentpierre	cd1feef6	minor fix to the Notebook	7 年前
GitHub	59a2bbe0	Improve memory management (#180 ) * More efficiently allocate memory when sending states * Code clean-up * Additional changes * More GC reduction * Remove state list initialization from example environments * Use built-in json tool to serialize state message * Remove commented code * Use more efficient CompareTag * Comments before code * Use type inference where appropriate	7 年前
vincentpierre	7b534423	updated tests	7 年前
vincentpierre	ebaf5268	ignoring the Packages folder that is created in unity-environment for Unity version 2017.3 Print a message if someone tries to lauch ppo with the load flag but an invalid run-path	7 年前
vincentpierre	053c3739	Launching the environment with absolute path. Need testing on Windows and Linux	7 年前
vincentpierre	22bfd276	simplifications on launching from absolute path, bug fix : closing the environment when the file_name was wrong.	7 年前
GitHub	00534390	Refactored GridWorld (#225 ) Greatly simplified GridWorld code. It now also only uses a visual observation rather than state vector in order to demonstrate learning purely from a visual input.	7 年前
Arthur Juliani	2a0e9e6f	Fixed issue with unity environment not being found on MacOS (#236 ) If the internal executable can't be found, it will look for any file in the folder and run it.	7 年前
Arthur Juliani	9ded88f3	Provide support with incompatible API	7 年前
Arthur Juliani	75ea16ff	Add comments and alphabetize flags	7 年前
Arthur Juliani	de700c3a	Multi Brain Training and Recurrent state encoder (#166 ) * `learn.py` is now main script for training brains. * Simultaneous multi-brain training is now possible. * `ghost-trainer` allows for proper training in adversarial scenarios. * `imitation-trainer` provides a basic implementation of real-time behavioral cloning. * All trainer hyperparameters now exist in `.yaml` files. * `PPO.ipynb` removed. * LSTM model added. * More dynamic buffer class to handle greater variety of scenarios.	7 年前
Arthur Juliani	adedd491	Initial support for multiple observations (#256 ) * Initial support for multiple observations * Fix PPO for continuous control	7 年前
vincentpierre	a54e459c	partial fix on the lstm The recurrent encoding now happens at the end	7 年前
Arthur Juliani	5b8822a0	Bug fix multiple observations	7 年前
Arthur Juliani	fc1b8a1b	Fix academy reset out of order	7 年前
GitHub	51621334	State Stacking & Banan Environment (#262 ) * Add support for stacking past n states to allow network to learn temporal dependencies. * Add Banana Collector environment for demonstrating partially observable multi-agent environments. * Add 3DBall Hard which lacks velocity information in state representation. Used as test for LSTM and state-stacking features. * Rework Tennis environment to be continuous control and trainable in 100k steps.	7 年前
Arthur Juliani	ce2ce437	Added growth parameter to stop failing with allocation under windows for #277 (#278 )	7 年前
vincentpierre	d01bd6c2	the bytes file will besaved under the name of the environment, not its path	7 年前
vincentpierre	b7f787f6	bug fix on range of observations	7 年前
Arthur Juliani	57a9ed38	Require tensorflow 1.4.1 (#315 ) * Require tensorflow 1.4.1 * modified the python/README.md	7 年前
vincentpierre	62089508	Modified the tests	7 年前
Arthur Juliani	7bf0c888	trainer will raise an error if the memory of the brain is set wrong (#273 )	7 年前
Arthur Juliani	2d6254c3	Require TensoFlow 1.4.0 (#326 )	7 年前
vincentpierre	db3cb9df	Merge branch 'development' into dev-logfile	7 年前
Arthur Juliani	3b8755d2	fixes on imitation trainer, now works with demo (#274 )	7 年前
Arthur Juliani	98cebd82	Fix typo "leaning_rate" (#324 )	7 年前
Arthur Juliani	54652c69	dev-logParam (#135 ) * added the method write text to trainer so it is easy to write log the hyperparameters as a dictionary. Note: needs tensorflow version r1.2 or above * added message if impossible to write text summary in Tensorboard	7 年前
vincentpierre	539c081f	modified the python side to read the logfile path from the academy parameters	7 年前
Arthur Juliani	94c20ef0	Curriculum documentation and improved Area code	7 年前
vincentpierre	5e1d05af	added the logfile_path property to the environment class. Give a link to the logfile when the timeout error is launched. Note: still need testing on windows	7 年前
GitHub	faa53e35	Fix observations on PPO trainer (#340 ) * Fix observations on PPO trainer * tested and fixed the fix	7 年前
vincentpierre	34b6e786	made the UnityTimeOutException that reads into the logfile when available	7 年前
GitHub	f8a8b112	Move epsilon generation into graph (#283 )	7 年前
vincentpierre	50f91f66	use logging instead of print Replaced the print statements with logging statements in the exception.py file Uses the same logger as the environment one named the logger unityagents	7 年前
GitHub	36d58cee	Add Seeding, MaxStepReached, and Bootstrapping fix (#303 ) * Add ability to seed learning (numpy, tensorflow, and Unity) with `--seed` flag. * Add `maxStepReached` flag to Agents and Academy. * Change way value bootstrapping works in PPO to take advantage of timeouts. * Default size of GridWorld changed to 5x5 in order to validate bootstrapping changes.	7 年前
vincentpierre	1bbaf0dd	added test in test_unityagents.py for curriculum class	7 年前
GitHub	e676017b	Reorganize learn.py (#302 ) Split learn.py into learn.py as command-line wrapper, and trainer_controller.py as core trainer/env logic.	7 年前
Arthur Juliani	6ad7f010	Fix for discrete control image observations	7 年前
Arthur Juliani	4418421a	Rename variables in imitation trainer	7 年前
Arthur Juliani	c42eff57	Misc fixes	7 年前
GitHub	d1cf3030	Merge pull request #309 from Unity-Technologies/dev-imitation Miscellaneous Fixes	7 年前
GitHub	8317a659	Behavioral Cloning & Trainers Reorg (#328 ) * Implement behavioral cloning for cc/dc, fc/rnn, state/observations. * Re-organize folder structure in anticipation of unitytrainers as a package. * Create demo environment BananaImitation to validate behavioral cloning. * Fixes #336	7 年前
vincentpierre	d8f74dc9	If reset does not take either config or progress, no information is logged. Bug fix : Environment handles invalid configurations better	7 年前
vincentpierre	41ab078d	replaced actions with previous_actions in the BrainInfo object	7 年前
Arthur Juliani	c21a391d	Various bug fixed and changes * Adjust demo curricula * Fix training buffer reset bug * Make wall height a float * Add pertained models for Area env	7 年前
GitHub	e11dae1d	Python Testing & Image Inference Improvements (#353 ) * Reorganized python tests into separate folder, and make individiual test files for different (sub) modules. * Add tests for trainer_controller, PPO, and behavioral cloning. More to come soon. * Minor bug fixes discovered while writing tests. * Reworked GirdWorld to reset much faster. * Cleaned ObservationToTex and reworked GetObservationMatrixList to be 3x faster.	7 年前
Arthur Juliani	9d26767d	Instantiate training buffer with trainer	7 年前
GitHub	0277039d	Fix Basic Environment & Discrete States (#356 ) * Fix Basic environment to properly reflect number of states. * Fix discrete states when using stacked states. * Add trained model for Basic environment.	7 年前
eshvk	23981dbf	[containerization] CPU based containerization to support all environments that don't use observations	7 年前
eshvk	403e4aef	[cleanup] Use debug mode for some log messages	7 年前
eshvk	9345614c	[cleanup] Use debug mode for some log messages	7 年前
eshvk	b4bad6bb	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
eshvk	75a14ac8	[Hotfix] Upgrade Tensorflow to 1.4.0	7 年前
GitHub	a3c7b426	Merge pull request #357 from Unity-Technologies/feature/containerization Feature/containerization	7 年前
Arthur Juliani	85ae912d	Dev docs (#361 ) New documentation structure and content.	7 年前
Arthur Juliani	b8a4f5f1	Add Hallway envronment to validate LSTM models	7 年前
eshvk	030ac5c5	[cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
GitHub	989dea4a	Merge pull request #132 from Unity-Technologies/dev-logfile Dev logfile	7 年前
GitHub	9ad4182e	Merge pull request #366 from Unity-Technologies/feature/cleanup [cleanup] Add a new type hint to call a dictionary of BrainInfo objects as an AllBrainInfo. Propagate this hint to all methods. Some pep8 cleanups.	7 年前
Arthur Juliani	c3644f56	Buffer fix for properly masking gradients	7 年前
GitHub	f8d27dc5	Merge branch 'development-0.3' into feature/LSTM2	7 年前
vincentpierre	eaf0745f	fix on the test script Error was due to the absence of logpath in the dummy handshake message	7 年前
GitHub	2bba53b8	Merge pull request #367 from Unity-Technologies/feature/LSTM2 Hallway & LSTM Improvements	7 年前
GitHub	99103b29	Use `curr_brain_info`	7 年前
Arthur Juliani	827dca28	Fix typo in model vars	7 年前
GitHub	f134016b	On Demand Decision (#308 ) * On Demand Decision : Use RequestDecision and RequestAction * New Agent Inspector : Use it to set On Demand Decision * New BrainParameters interface * LSTM memory size is now set in python * New C# API * Semantic Changes * Replaced RunMDP * New Bouncer Environment to test On Demand Dscision	7 年前
vincentpierre	36481ff2	removed the monitor display when training Increased the timeout time to 30 seconds to be consistent with start of application	7 年前
GitHub	69481d2d	Imitation Learning Helper (#371 ) * Add helper class to for Imitation Learning teacher. Allows for clearing buffer "C" and toggling adding info to the buffer "R".	7 年前
Arthur Juliani	b4838a0f	Version 0.2	7 年前
Arthur Juliani	1bf46a85	Add flags for normalization and variable layers	7 年前
Arthur Juliani	6c1c8220	Python2 fix	7 年前
GitHub	dcf58f75	Feature/previous text action (#375 ) * [Previous Text Actions] Renamed previous_action to previous_vector_action added previous_text_action to the BrainInfo * [Semantics] Carried the modifications to the semantics of previous_vector_action to the trainers	7 年前
GitHub	a809630f	Add config for crawler, and change crawler scene (#376 ) * Add config for crawler, and change crawler scene * Changed number of crawlers in scene to 12 * Changed Max-steps for crawlers to 5000 * Newer hyperparameters and newly trained crawler model * Clean up crawler code, and improve efficency	7 年前
Arthur Juliani	22d931c0	Add comments to Reacher and re-train model w/ epsilon needed	7 年前
vincentpierre	6c55017e	[FixingPytests] Added the new Semantic and modified the pytest	7 年前
GitHub	0838c2bc	Merge pull request #378 from Unity-Technologies/docs/semantics-internal-brain Docs/semantics internal brain	7 年前
GitHub	26a1ed87	Merge pull request #380 from Unity-Technologies/dev-reacher-cleanup Add comments to Reacher and re-train model w/o epsilon needed	7 年前
GitHub	e0d5b1b0	Fix for when not using teacher helper (#379 ) * Fix for when not using teacher helper * Rename expert to teacher throughout	7 年前
GitHub	a7c9096f	[Semantics] Modified the placeholder names (#381 )	7 年前
Vincent Gao	02df3b34	resolved conflicts	7 年前
GitHub	cfc6bdc8	[Fix] The environment logs information about itself when lauched. (#395 )	7 年前
Vincent Gao	621ba3af	clarify the python docs and learn.py help message	7 年前
Vincent Gao	6806c801	resolved comments	7 年前
Vincent Gao	2f373c5a	fixed the learn.py with a better way	7 年前
GitHub	0c6aaa1e	Merge pull request #400 from Unity-Technologies/docs/python-api clarify the python docs and learn.py help message	7 年前
Vincent Gao	1bc43933	Merge branch 'development-0.3' into hotfix/issue#333	7 年前
GitHub	5bdef358	[Fix] Must take mean of entropy to avoid errors what number of agents change during training (#407 )	7 年前
Marwan Mattar	ba6911c3	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Editor/MLAgentsEditModeTest.cs # unity-environment/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs # unity-environment/Assets/ML-Agents/Scripts/Academy.cs	7 年前
GitHub	41d32aca	[Bouncer Environment] Now in 3D (#408 ) * [New Bouncer] Revamped the Bouncer to be in 3D * [Bouncer Configuration file] Added the BouncerBrain configuration * [Documentation] Added the Bouncer tot he documentation page * [Fixes] Fixed lines too long and the documentation typo * Slight adjustments to bouncer environment * Don't default to internal brain on bouncer	7 年前
GitHub	bb82e25d	Revamped Push Block (#404 ) * Adds new revamped Push Block environment. * Adds "Shared Assets" folder to Examples sub-directory.	7 年前
Marwan Mattar	bab02a21	Merge branch 'development-0.3' into dev-api-doc-academy # Conflicts: # unity-environment/Assets/ML-Agents/Scripts/Brain.cs	7 年前
GitHub	9ca530cd	Soccer Twos Environment (#420 ) * Add Soccer Twos environment, along with training parameters, embedded model, and relevant documentation.	7 年前
Marwan Mattar	095632d6	Added reference to Basics in Jupyter installation - Added consistent naming to the 3D Balance Ball environment - Minor fixes to the Basics notebook	7 年前
Marwan Mattar	20ce0286	Cleared notebook output.	7 年前
Joe Ward	86474d7a	Merge remote-tracking branch 'origin/development-0.3' into docs-training-brains-etc	7 年前
GitHub	c83b0e7d	Merge pull request #435 from Unity-Technologies/docs/installation Added reference to Basics in Jupyter installation	7 年前
GitHub	848b8a58	Fix PPO regression (#434 ) * Fix PPO regression	7 年前
GitHub	f19739cb	Update API version in anticipation of v0.3 release (#437 ) * Update API version in anticipation of v0.3 release * Use _version_ across both Unity/Python	7 年前
GitHub	4a7481a1	RayPercpetion, Push Block, and misc environment changes (#432 ) RayPerception moved to a component that is now used by Banana, Soccer, Hallway, and Push Block. Converted Push Block to use RayPerception for local perception and retrained model. Re-worked Hallway to be more extensible.	7 年前
GitHub	16e04ee0	[BugFix] Updated the apiNumber in the pytests (#449 )	7 年前
GitHub	d8c09831	Feature/new wall jump (#446 ) * [New Environment] Added the WallJump and its configuration * [Documentation] Added the WallJump doc * [Fixes] Now uses switch and added comment	7 年前
Marwan Mattar	72a71a08	Merge branch 'development-0.3' into dev-api-doc-decision	7 年前
Joe Ward	9163a54a	resolved merge conflict with dev-0.3 branch	7 年前
Vincent Gao	9066f399	updated the comments	7 年前
Vincent Gao	0df2f777	Used the comment's sentence	7 年前
GitHub	95366dc2	Merge pull request #460 from Unity-Technologies/docs/comment-change updated the comments	7 年前
GitHub	dede2f80	Modify setup file (#486 )	7 年前
Marwan Mattar	06cc85cc	Merge branch 'development-0.3' into docs/random-fixes # Conflicts: # docs/Learning-Environment-Create-New.md	7 年前
GitHub	68692f8f	Remove unused configs (#489 )	7 年前
vincentpierre	e5a59e9b	[Refactor] renamed is_continuous to is_continuous_action and added is_continuous_observation to decrease confusion	7 年前
GitHub	6dd3c284	Hotfix 0.3.0b (#519 ) * Fixes internal brain for Banana Imitation. * Fixes Discrete Control training for Imitation Learning. * Fixes Visual Observations in internal brain with non-square inputs.	7 年前
GitHub	a6385cbf	Merge pull request #536 from Unity-Technologies/master Bring develop to v0.3.0b	7 年前
eshvk	2d2eb64b	[containers] Enables container support for scenes that use visual observations	7 年前
Marwan Mattar	ffb4ffee	Added simple check to Python version in notebook. - Including a few minor comments.	7 年前
GitHub	74064891	Merge pull request #520 from Unity-Technologies/feature-trainer-ppo-is-continuous Feature trainer ppo is continuous	7 年前
GitHub	e43c069e	Merge pull request #547 from Unity-Technologies/develop-feature-docker-improvements [containers] Enables container support for scenes that use visual obsvervations	7 年前
GitHub	02b189d4	Merge pull request #568 from Unity-Technologies/develop-improve-jupyter-notebook Added simple check to Python version in notebook.	7 年前
GitHub	237b41f9	Hotfix 0.3.0c (#618 ) Fixes the following issues: * Missing component reference in BananaRL environment. * Neural Network for multiple visual observations was not properly generated. * Episode time-out value estimate bootstrapping used incorrect observation as input.	7 年前
GitHub	78d411f6	Merge pull request #619 from Unity-Technologies/develop Release v0.3.1	7 年前
GitHub	1a449e98	Hotfix 0.3.1b (#637 ) * [Fix] Use the stored agent info instead of the previous agent info when bootstraping the value * [Bug Fix] Addressed #643 * [Added Line Break]	7 年前
vincentpierre	076c8744	Report means instead of totals for losses (#580 ) * Report means instead of totals for losses. * Report absolute loss for policy.	7 年前
GitHub	b2675216	Hotfix 0.3.1b (#656 ) * [Fix] Use the stored agent info instead of the previous agent info when bootstraping the value * [Bug Fix] Addressed #643 * [Added Line Break]	7 年前
GitHub	755be43e	[Cold Fix] Making the episode length and mean reward more accurate for the first episode (#657 )	7 年前
GitHub	3b866e9f	Use Clipped Gaussian (#649 ) This PR makes the following changes: * Moves clipping of continuous control model into model itself. Output is now always [-1, 1]. * Internal model values are now clipped between [-3, 3] before being rescaled to [-1, 1] for output. * This improves training performance by providing a wider range of values within which the pdf of the gaussian can fall. Output of [-1, 1] is used to be more environment-creator friendly. * Fixes issue where epsilon was erroneously being used to reconstruct old probabilities during PPO update, leading to reduced learning performance. * Introduce ScaleAction() function within python to easily rescale values from [-1, 1] to arbitrary range. * Re-train all CC models using improved algorithm. All performance levels are equal or improved. In the case of Crawler, improvement is drastic. * Update documentation appropriately. * Made miscellaneous minor code style and optimization improvements within environments.	7 年前
Arthur Juliani	9477eaa9	Develop fix cumulative reward (#725 ) * [Cold Fix] Split the way cummulative rewards and episode length are counted The reward is appended at each step to the cummulative reward The episode count is ONLY incremented when d_t+1 is false	7 年前
GitHub	9594f3d8	Walker Environment (#720 ) * Add `Walker` example environment and documentation.	7 年前
GitHub	38098a12	[Fixed BC with LSTM] (#766 ) Fixes the issue raised by @hsaikia in #552 Added the memory_size variable to the BC model Added memory_size and recurrent_out to the output nodes of the graph when using BC with LSTM	7 年前
Arthur Juliani	ce5e2dba	[Added Ascii art on learn.py] (#727 ) * [Added Ascii art on learn.py] Note : This is by far the best feature of 0.4	7 年前
Arthur Juliani	0264de49	[Update Curriculum for WallJump] Updating the curriculum for WallJump (#774 )	7 年前
GitHub	ffcf8c9c	Newer Ascii Art (#780 ) Replaced UNITY ML AGENTS with the unity logo	7 年前
GitHub	bdeb506c	TensorFlowSharp 1.7 upgrade package (#746 ) * some random change so that I can create this PR * docs update for TensorFlowSharp new version * changed the links to the new unitypackage file * resolved conflicts, updated the pictures for CUDA 9.0 * fixed a typo * resolved arthur's comment * blurred the usernames * modified the AWS doc * resolved Vince's comment	7 年前
GitHub	7914387f	Develop communicator redesign (#638 ) * [containers] Enables container support for scenes that use visual observations * [Initial Commit] Works only with simple balance ball * [Optimiztion] Store the academy in the brainBatcher as a temporary measure * [Modifications] Made it work from the editor as a prototype * [Made socket communicator and reimplmented all functionalities] * [Forgotten file] removed .meta file * [Forgot the meta file] * [Metafile] deleted metafile * [Comments] Removed dead code * [Comments] Added some descriptions * [Bug Fix] Multi brain scenario * [improved AgentInfo converter] * [Optimization] Remove VectorObs since StackedVectorObs is present in the AgentInfo protobuf object * [Timeout] Implemented a timeout for the rpc communicator in Unity * [Libraries] Added the C# Protobuf and Grpc libraries * [Requirements] Added protobuf 3.5.2 to the requirements * [Code Formating] Removed dead code and split some lines ...	7 年前
GitHub	702d98c6	[Fix] The summary writer is now implemented in the abtract trainer class. (#806 ) Summary writer now displays {}: Step: {}. No episode was completed since last summary. when there was no completed episodes	7 年前
GitHub	c17937ef	Curiosity Driven Exploration & Pyramids Environments (#739 ) * Adds implementation of Curiosity-driven Exploration by Self-supervised Prediction (https://arxiv.org/abs/1705.05363) to PPO trainer. * To enable, set use_curiosity flag to true in hyperparameter file. * Includes refactor of unitytrainers model code to accommodate new feature. * Adds new Pyramids environment (w/ documentation). Environment contains sparse reward, and can only be solved using PPO+Curiosity.	7 年前
GitHub	9ab98584	Additional Environment Variations (#791 ) * Add Visual (Camera) and Imitation Learning variations to example environments	7 年前
Arthur Juliani	5abb001b	[Add curiosity_enc_size: 128 to the trainer_config.yaml] (#826 )	7 年前
vincentpierre	a22c0f65	[fixing encoding_size]	7 年前
Arthur Juliani	d7338050	Enable concurrent sessions	6 年前
vincentpierre	3c2283e8	[fix tennis]	6 年前
vincentpierre	85b844cc	[Better version of the fix]	6 年前
GitHub	678e5dab	Merge pull request #837 from Unity-Technologies/develop-fix-tennis [fix tennis]	6 年前
eshvk	680b0767	[Imitation Learning] Minor fix to make sure that step increment loads from the last saved global step if the model is being trained after loading	6 年前
GitHub	e195b495	Merge pull request #838 from Unity-Technologies/develop-bc [Imitation Learning] Minor fix to make sure that step increment loads from the last saved global step if the model is being trained after loading	6 年前
Arthur Juliani	5d402be9	Minor Optimizations (#836 )	6 年前
GitHub	282d5bd4	Fix Pytests (#843 )	6 年前
GitHub	8526dcfc	Fix for visual observations (#847 )	6 年前
GitHub	0f65e272	[Addresses #842 ] (#849 ) In the case the agent is done imediately after spawning, its stats are empty because the stats need at least 2 successive experieces to create the stats. By specifying the default value of 0, the error does no longer appear	6 年前
GitHub	a720e370	Fix bug and update tests (#850 )	6 年前
GitHub	c9c9e147	Revamp Crawler & Walker (#841 ) * Revamps agent code for walker and crawler environments to use shared JointDriveController system. * Crawler has been reworked to be very cute. * Crawler & Walker environments have been reworked to be visually consistent. * Added Dynamic Crawler scene. * All scenes re-trained and new models added. * Documentation changes.	6 年前
GitHub	47fc38ab	Additional Tests & Bug Fixes (#854 ) * Add tests and fix for sparse tensor warning * Rename mock communicator parameter * Test longer sequences * Curiosity tests and bug fixes	6 年前
GitHub	6e6e8d96	Fix for CC models w/ RNN and Curiosity (#860 )	6 年前
vincentg	3c4cb523	some hack to make windows save the model when do ctrl+c	6 年前
GitHub	75218e58	Several final improvement to docs, scene and configs. (#871 ) * Added missing declaration to docs sample code. * Added pretrained model as default graph in Internal brain of Tennis scene * Disabled PlayerBrain in Tennis by default. * Removed accidental config.	6 年前
GitHub	b5722dc9	Fix for visual observation w/ curiosity (#873 )	6 年前
GitHub	9156737e	Merge pull request #876 from Unity-Technologies/release-windows-save-model-fix some hack to make windows save the model when do ctrl+c	6 年前
vincentpierre	4c6439d5	[Attempted fix]	6 年前
GitHub	6df07946	Fix for Discrete observations + Curiosity (#866 )	6 年前
GitHub	dda6ad8b	Replaced message printed in Python and in documentation. (#881 )	6 年前
GitHub	68d6170f	Error message when using ODD and Curiosity (#883 ) * Remove extra bouncer brain hyperparameters * Add error when using curiosity+odd	6 年前
GitHub	bf858cd6	Merge pull request #884 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
GitHub	4b3c6c9f	Merge pull request #885 from Unity-Technologies/release-v0.4 Release v0.4	6 年前
Arthur Juliani	7b03597f	Update setup version to v0.4	6 年前
Arthur Juliani	5e48766d	Remove discrete observations	6 年前
GitHub	3eac018a	Merge pull request #889 from Unity-Technologies/hotfix-setup-version Update setup version to v0.4	6 年前
Arthur Juliani	b46b8708	Rename function	6 年前
Arthur Juliani	12d52cb0	Replace tanh on cc models w/ swish	6 年前
Arthur Juliani	96e599e1	New proto files	6 年前
Arthur Juliani	8088d94a	Change lambda	6 年前
GitHub	b6fe0bca	Merge pull request #906 from Unity-Technologies/develop-no-discrete-obs Remove Discrete Observations	6 年前
Arthur Juliani	195ac934	Merge branch 'develop' into develop-runs # Conflicts: # python/learn.py # python/unitytrainers/trainer.py	6 年前
vincentpierre	e47cec56	[Initial Commit]	6 年前
Arthur Juliani	fad0da30	Log run-id in console	6 年前
GitHub	1626587d	Merge pull request #901 from Unity-Technologies/hotfix-swish Replace CC `tanh` activation with `swish`	6 年前
Arthur Juliani	11b50054	Replace Ray with multiprocess	6 年前
Arthur Juliani	fa65ee61	Fix bug in grpc logic	6 年前
unityjeffrey	0d67f311	changed ml agents to ml-agents	6 年前
Arthur Juliani	e5202092	Remove empty line	6 年前
unityjeffrey	19fb437a	changed to Unity ML-Agents Toolkit (english)	6 年前
unityjeffrey	6ed6b8d6	updated ml-agents to ml-agents toolkit where appropriate	6 年前
GitHub	7b9a2905	Merge pull request #916 from Unity-Technologies/hotfix-trademarkupdate update for trademark and consistency of ml-agents	6 年前
Arthur Juliani	9701c3db	Merge branch 'hotfix-0' into release-v0.4-fix-curiosity-odd # Conflicts: # python/unitytrainers/ppo/trainer.py	6 年前
Arthur Juliani	6b359062	Fix for visual-only imitation learning	6 年前
Arthur Juliani	0c6411c2	Use switch between old and new behavior	6 年前
GitHub	7b497341	Merge pull request #936 from Unity-Technologies/hotfix-visual-imitation Fix for visual-only imitation learning	6 年前
Arthur Juliani	1bfbf67a	Simplify approach	6 年前
Arthur Juliani	cfb7cfef	Code clean-up	6 年前
Arthur Juliani	083cbff5	Add to docstring	6 年前
Arthur Juliani	c31f63b5	Fix typo	6 年前
GitHub	3b5af6b2	Merge pull request #937 from Unity-Technologies/release-v0.4-fix-curiosity-odd Hotfix - Curiosity & ODD	6 年前
GitHub	f155d661	Merge pull request #908 from Unity-Technologies/hotfix-0 Release v0.4.0a	6 年前
GitHub	e50ac7ae	Merge branch 'develop' into hotfix-0	6 年前
GitHub	b36e6a2e	Merge pull request #946 from Unity-Technologies/hotfix-0 v0.4.0a into Develop	6 年前
vincentpierre	c104d31d	[Hotfix] Made the Pipe of the grpc communicator an instance property	6 年前
Deric Pang	8380f2f2	Moved curriculum code out of environment code.	6 年前
Deric Pang	9b37b410	Removed test references to vector_observation_space_type.	6 年前
GitHub	dcd4b4f9	Merge pull request #967 from dericp/develop-fix-python-tests Fixed Python tests.	6 年前
Deric Pang	e580e544	Removing commented out code.	6 年前
Deric Pang	ae944381	Removing print statements.	6 年前
Deric Pang	6eb10797	Removed test references to vector_observation_space_type.	6 年前
GitHub	8d79581f	Merge pull request #1001 from Unity-Technologies/hotfix-grpc-multiprocessing [Hotfix] Made the Pipe of the grpc communicator an instance property	6 年前
Deric Pang	db031b07	Updating tests for refactored curriculum learning.	6 年前
Deric Pang	798c8bf9	Removing print statements.	6 年前
GitHub	59f74e07	Merge pull request #1002 from Unity-Technologies/hotfix-0.4b Hotfix 0.4.0b	6 年前
Deric Pang	7963f8ac	Merge remote-tracking branch 'upstream/develop' into develop-curriculum-learning-refactor	6 年前
Deric Pang	134548ac	Updating tests for refactored curriculum.	6 年前
Deric Pang	d85038aa	Removing some trailing spaces.	6 年前
GitHub	2d715dc5	Revert "Release v0.5 (#1202 )" (#1221 ) This reverts commit 983c4029cb435fc7ad27a796e79a1d59904e53e5.	6 年前
Deric Pang	eb251008	Removing unnecessary import.	6 年前
Deric Pang	cd7c854c	Created exception module for unitytrainers.	6 年前
GitHub	34035176	Merge pull request #968 from dericp/develop-curriculum-learning-refactor Curriculum learning moved from environment to trainer.	6 年前
GitHub	4e73f770	Merge branch 'develop' into hotfix-0.4b	6 年前
GitHub	a912e039	Merge pull request #1005 from Unity-Technologies/hotfix-0.4b Hotfix 0.4b	6 年前
Arthur Juliani	1eb701af	Merge remote-tracking branch 'origin/develop' into develop-value-estimates-ppo	6 年前
Arthur Juliani	f52d5a92	Merge remote-tracking branch 'origin/develop' into develop-runs	6 年前
Arthur Juliani	43e40b8c	Add protobuf files for value estimate	6 年前
Arthur Juliani	3b916dd9	Add exception for in-edtior training	6 年前
GitHub	1e21c143	Merge pull request #934 from Unity-Technologies/develop-value-estimates-ppo Develop value estimates ppo	6 年前
Arthur Juliani	ffe365dc	Add white space	6 年前
GitHub	ef3025e6	Merge pull request #1004 from Unity-Technologies/develop-runs Enable multiple runs in learn.py	6 年前
GitHub	e60272f2	New error when using In Editor Training with a non-zero worker-id (#1012 )	6 年前
GitHub	7d0990cf	Fix MultiBrain bug that was introduced with the value estimates (#1018 )	6 年前
Deric Pang	de128fa1	Refactoring Curriculum tests and code. - Curriculum tests are now separate from other trainers. - Property setter is now used in Curriculum.	6 年前
Deric Pang	c6617b70	Multi-curriculum support added. - New school module maps brains to curriculums.	6 年前
Deric Pang	c754e9db	Curriculum tests updated to match develop branch.	6 年前
Deric Pang	9ea00ab6	Changing curricula to match reworked curriculum.	6 年前
Deric Pang	c88c7e42	Fixing bugs, updating tests. - Added more unit tests for school module. - Fixed bugs found during testing with PushBlock env.	6 年前
Deric Pang	10ab5965	Finished testing School. Added documentation.	6 年前
Deric Pang	06eb8037	Renaming School to MetaCurriculum.	6 年前
Deric Pang	aaab8c50	Fix iteration over brains_to_curriculums.	6 年前
Deric Pang	645cd074	Moving push curriculum.	6 年前
Deric Pang	4b92071b	Fixing line lengths in test_meta_curriculum.py.	6 年前
Deric Pang	db6fa4ba	Removing commented line.	6 年前
Deric Pang	e678e691	Addressing Vince's offline comments. - Warning logged if two curriculums attempt to reset the same parameter. - Error is raised when a curriculum file is not named to match a brain.	6 年前
Deric Pang	361d56b9	Curriculums now hold the brain name.	6 年前
Deric Pang	ca54fc4f	Adding back import that was accidentally removed.	6 年前
Deric Pang	ff4ce695	Updated logging in trainer. - The logger in trainer.py is now unitytrainers. This makes it easier to differentiate it from unityagents logs.	6 年前
Deric Pang	9d9c91e4	Fixed TensorBoard lesson logging.	6 年前
Deric Pang	70308432	Adding space in metacurriculum error message.	6 年前
Deric Pang	4429077f	Improving MetaCurriculum initialization. - Raises MetaCurriculumError when curriculum_folder is not a folder. - Removed the ability to set curriculum_folder to None. trainer_controller.py has been refactored to not depend on this functionality which will make curriculums more stable.	6 年前
Deric Pang	23740545	Changing warning message to log.warning.	6 年前
GitHub	322d2bbe	Merge pull request #1003 from dericp/develop-curriculum-learning-rework Curriculum learning now supports multiple brains.	6 年前
Deric Pang	822d329a	Fixing bug when no curriculum folder is passed. - The old Curriculum object would accept None as a location for the curriculum. If the location was None, it would return default values as its config and lesson number. - The new MetaCurriculum does not accept None as a location for the curriculum folder. This was done to remove unnecessary edge case functionality from curriculums. - None checks have been added into trainer_controller. In the future, it should be possible to better refactor trainer_controller so that these None checks can be removed. This is preferable to hard-coding default behavior into MetaCurriculum objects when a metacurriculum would not even be in place.	6 年前
GitHub	73ecb4fe	Merge pull request #1035 from dericp/develop-fix-no-curriculum-case Fixing bug when no curriculum folder is passed.	6 年前
vincentpierre	7f74131d	Nan Rewards converted to 0 and throwing a warning	6 年前
GitHub	5efa9d4e	Merge pull request #1045 from Unity-Technologies/develop-unityagents-nan-reward Nan Rewards converted to 0 and throwing a warning	6 年前
Deric Pang	30c4f2d7	Splitting up unitytrainers tests.	6 年前
Arthur Juliani	52865022	[Fix bug 1040] (#1062 )	6 年前
Deric Pang	032446de	Trainer controller lines wrapped.	6 年前
Arthur Juliani	708e2bb9	Check NaN in observations (#1063 ) * Check NaN in observations * Replace math with np	6 年前
Deric Pang	bb8e74f9	Helper func for incrementing lessons and resetting.	6 年前
Arthur Juliani	9e8049f0	Will now print summaries even when not training or when training is over (#1020 ) * [Initial Commit] * [Addressed comments] * [Now using global step to write the summaries]	6 年前
GitHub	9538d699	Move seed randomization to learn.py (#1071 ) * Move seed randomization to learn.py * Remove print statement	6 年前
Deric Pang	6eba6940	Merge remote-tracking branch 'upstream/develop' into develop-trainer-controller-cleanup	6 年前
GitHub	514cd757	Merge pull request #1058 from dericp/develop-trainer-controller-cleanup Fixing trainer controller line lengths and splitting unitytrainers tests.	6 年前
Arthur Juliani	3659bbcd	Develop multi discrete (#1022 ) Replace discrete control with multi-discrete control.	6 年前
GitHub	c600a706	Optional gym wrapper (#1007 ) Adds optional gym wrapper UnityEnv to use as python interfaces to Unity environments.	6 年前
Arthur Juliani	fee02a84	Attempted fix for #1059 (#1089 )	6 年前
Arthur Juliani	567ad3f0	fix Unity-Technologies/ml-agents#1041 (#1102 )	6 年前
GitHub	2edaf342	Clean up learn.py (#1106 )	6 年前
Arthur Juliani	17224292	Fix for Curiosity with ODD (#1107 ) This branch addresses the issue referenced in #1059	6 年前
GitHub	ded0d8c7	Develop action masking (#1080 ) * [Initial Commit] Modified the model.py file and the ppo/trainer.py file to use masked actions * Preliminary modifications to the python side of the code to enable action masking * Preliminary modifications to the C# side of the code to enable action masking * Preliminary modifications to the communication side of the code to enable action masking * Implemented action masking for BC Note : The actions of the teacher are not masked * More error messages for the action masking * fix pytests * Added Documentation * Address comment * Addressed Comments on docs * Addressed second comment on docs * Addressed comments for the python side of the code * Created the action masker and associated unit tests * Addressed comments on the C# side * Addressed the comment regarding action_masking_name * Addressed the comments	6 年前
GitHub	9ba493ef	Fixing develop after merging action masking (#1114 ) Ran into problems due to inacurate merging of develop into the branch	6 年前
GitHub	d0158b01	Update visual hyperparameters (#1118 )	6 年前
GitHub	106d562d	Fix for Windows (#1120 ) addresses #1113	6 年前
GitHub	2e489abc	Normalization of the probabilities after masking (#1123 ) * python/unitytrainers/bc/models.py * Updated BC to reflect the changes	6 年前

... 3 4 5 6 7

312 次代码提交 (d4a2df66-0aad-4dd3-9b4f-c53e7192f508)