ml-agents

作者	SHA1	备注	提交日期
GitHub	36ed3c16	Fix issue exporting graph with multi-GPU (#2573 ) Our multi-GPU training had a regression such that freezing the graph was broken. This change fixes that issue by making a few changes: * Removes the top level "tower" variable scope added by multi-GPU so that the output nodes have correct names * Removes the use of "freeze_graph" and replaces it with our own similar functionality. * Adds the "auto reuse" to network layers which require them	5 年前
GitHub	d64a01e1	Added option to use environment arguments in learn (#2594 ) * Added option to use environment arguments in learn * hook into argparse * add example to readme	5 年前
GitHub	82bf38ef	TensorFlowSharp is no more (#2590 ) * TensorFlowSharp is no more * Removed old documents	5 年前
GitHub	0d48a352	Use argparse for arg parsing (#2586 ) * encapsulate commandline args * fix tests * add tests on cmdline parsing * cleanup * remove docopt * simplify --slow	5 年前
GitHub	11e13518	Merge pull request #2580 from Unity-Technologies/develop-removeUnitySDKlog Remove UnitySDK.log file	5 年前
GitHub	e1d93a0e	Allow mypy to reject incomplete defs for mlagents-envs (#2585 ) This wasn't working before because of several remaining partially defined function definitions.	5 年前
GitHub	67d754c5	Fix flake8 import warnings (#2584 ) We have been ignoring unused imports and star imports via flake8. These are both bad practice and grow over time without automated checking. This commit attempts to fix all existing import errors and add back the corresponding flake8 checks.	5 年前
GitHub	832e4a47	Normalize observations when adding experiences (#2556 ) * Normalize observations when adding experiences This change moves normalization of vector observations into the trainer's "add_experiences" interface. Prior to this change, normalization occurred at inference time. This was somewhat confusing since usually executing a forward pass shouldn't have side-effects which would change the training step. Also, in a asynchronous or distributed setting where we copy the neural network weights from a trainer to a remote actor / inference worker we'd end up with training issues because of the weights being different on the trainer than the workers.	5 年前
GitHub	babe9e2f	Develop remove academy done (#2519 ) * Initial Commit * Remove the Academy Done flag from the protobuf definitions * remove global_done in the environment * Removed irrelevant unitTests * Remove the max_step from the Academy inspector * Removed global_done from the python scripts * Modified and removed some tests * This actually does not break either curriculum nor generalization training * Replace global_done with reserved. Addressing Chris Elion's comment regarding the deprecation of the global_done field. We will use a reserved field to make sure the global done does not get replaced in the future causing errors. * Removed unused fake brain * Tested that the first call to step was the same as a reset call * black formating * Added documentation changes * Editing the migrating doc * Addressing comments on the Migrating doc * Addressing comments : - Removing dead code - Resolving forgotten merged conflicts - Editing documentations...	5 年前
Andrew Cohen	06dcbaae	fixed formatting	5 年前
GitHub	2a5da881	check for potentially bad env variables (#2540 )	5 年前
Andrew Cohen	86c598bb	Removed writing to UnitySDK.log from Academy/Changed UnityTimeOutException to no longer read from UnitySDK.log	5 年前
GitHub	d21be895	Develop allow python 3.7 (#2544 ) * relax versions, add python 3.7 to CI * add workflows * try paramaterized circleci build, disable slow test * fix workflow * fix (?) pyversion * set job name, fix pip freeze output * test_requirements.txt * fix install * fix paths (again) - should use pushd popd instead * use pushd and popd * sort deps, restore unit test, cleanup CI * relax versions more * clean up versions in docs * test older libs for 3.6, newer for 3.7 * pip: progress bar off * fix gym-unity pip install * try cat'ing setups for checksum * dont use fallback (temporarily) * dont turn off progress bar before upgrading pip * PR feedback * add parameter descriptions in CI config	5 年前
GitHub	5a2e60b6	[coding conventions] Revert NNModelImporter rename and remove NonAlloc Collider check suggestion from rider. (#2571 )	5 年前
GitHub	3683cc1c	Enable learning rate decay to be disabled (#2567 )	5 年前
GitHub	f01dd1c1	[coding conventions] Change c# code to be compliant with Unity coding conventions. (#2555 )	5 年前
GitHub	eaf8ca35	Add clearer message for bad permissions (#2539 )	5 年前
GitHub	b7e12a37	Fix crash in construct_curr_info when next_info doesn't have any agents (#2549 ) Fixes #1687	5 年前
GitHub	88b917b3	[format] Format code whitespace with Unity Formatter. (#2550 )	5 年前
Ervin Teng	38b872af	Revert "Fix crash when next_info is empty and using recurrent" This reverts commit 2107fdc587183db80706fc25c1bec574f6c7cb57.	5 年前
Ervin Teng	4cb340b5	Fix crash when next_info is empty and using recurrent	5 年前
GitHub	3df585d9	Fix issue where SAC encoder type is always simple (#2548 )	5 年前
GitHub	c8796488	Markdown link check in CI (#2543 ) * check using xargs * fix broken BC link * install npm, run precommit before unit tests * try to install npm * try a node image build * add workflow * don't use precommit on node run * sudo make me a sandwich * pass config arg * revert CI order change * retry precommit * sudo apt-get * sudo npm * make sure fails on bad link * cleanup and refix link	5 年前
GitHub	9b004c54	Fixes missing camera resolution info in demos (#2523 )	5 年前
GitHub	7720db33	Fix run_id typing in trainer.py (#2537 )	5 年前
GitHub	9358fd4f	[memory] Fix for tensors not being disposed of. (#2541 ) * [memory] Fix for tensors not being disposed of. * Fix member name.	5 年前
GitHub	5d2a6a69	Add a note on Custom messages about needing trainer changes (#2534 ) * Add a note on Custom messages about needing trainer changes * move to top (aside doesn't render as expected) * Don't blockquote * Update wording	5 年前
GitHub	69613a01	Reducing complexity on a number of classes. (#2480 ) Only cosmetic and readability improvements. No functional changes were intended. Utilities.cs - Fixed comments across file - Made class static - Removed unnecessary imports - Removed unused method arguments - Renamed variables as appropriate to make usage clearer - In AddRangeNoAlloc, disabled (by comment) Rider’s suggestion to revert to use of built-in Range field (Fixed) - In TextureToTensorProxy, swapped order of first two arguments to be more in-line with convention of input, output UtilitiesTests.cs - Removed unnecessary imports - Simplified array creation commands GeneratorImp.cs - Rider automatically deleted spaces on empty lines - Changed call to TextureToTensorProxy to mirror new argument ordering * Clean-up to UnityAgentsException.cs - Removed unnecessary imports - Fixed comment warning - Fixed method header * Improvements to Startup.cs - Created const for SCENE_NAME field - Fixed strin...	5 年前
GitHub	0390c78b	Fix determinism in unit test (#2530 ) * initialize random instance correctly * restore threshold (I hope)	5 年前
GitHub	9e2c30ee	Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520 )	5 年前
GitHub	12d57671	Changing Training-RewardSignals.md --> Reward-Signals.md (#2525 )	5 年前
Anupam Bhatnagar	d1b99bda	more small edits	5 年前
GitHub	d80812be	Merge pull request #2526 from Unity-Technologies/develop-update-offline-bc Update the offline_bc_config path	5 年前
Anupam Bhatnagar	efe16491	added cloud training unsupported comment	5 年前
Yuan Gao	0c42db82	Update the offline_bc_config path	5 年前
GitHub	876aca1e	Use numpy for random sample in buffer (#2524 )	5 年前
Anupam Bhatnagar	baf25046	small edits	5 年前
GitHub	36528481	Merge pull request #2522 from Unity-Technologies/develop-cleanupconfig Clean up SAC config	5 年前
Anupam Bhatnagar	2cd2048b	changes reflecting comments on github	5 年前
GitHub	6f67cf40	unit test - don't use global random generator (#2521 ) * unit test - don't use global random generator * Update test_simple_rl.py	5 年前
Ervin Teng	b1bfb9e8	Delete VisualBanana	5 年前
Anupam Bhatnagar	cc933115	adding colon	5 年前
GitHub	7ec3d7ad	Merge pull request #2516 from Unity-Technologies/master Merege 0.9.3 changes to develop	5 年前
Anupam Bhatnagar	fddede25	first commit	5 年前
Jonathan Harper	2f083c8a	Renamed "StepInfo" to "EnvironmentStep" This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.	5 年前
Ervin T	6fb5b63c	Fix Baselines gym_unity example to work with the latest Baselines (#2489 ) * This addresses #1835. Baselines expects single environments used with their ppo2 algorithm to be wrapped in a DummyVecEnv. The old readme did not instruct the reader to do so and the code failed to run with the latest version of baselines. This imports the correct function from baselines and fixes the make_unity_env function described in the readme. * added line to gym-unity/README.md to note the version of baselines the examples were tested with	5 年前
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341 ) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 年前
GitHub	25926795	initialize trainer step count (#2498 ) * initialize trainer step count * remove step init from RLTrainer	5 年前
Ervin T	06d9678c	Minor fix to link to GAIL reward signal doc (#2435 )	5 年前
GitHub	4bb97e25	Fix bug with construct_curr_info (#2490 ) * Fix bug with construct_curr_info * Add more tests	5 年前

1 2

80 次代码提交 (36ed3c16-ee11-421b-b3a7-c2b5af3cd81d)