ml-agents

Author	SHA1	Message	Commit date
GitHub	0390c78b	Fix determinism in unit test (#2530) * initialize random instance correctly * restore threshold (I hope)	5 years ago
Anupam Bhatnagar	097b1b86	removing docker and windows docs	5 years ago
GitHub	9e2c30ee	Made the _check_environment_trains test a little more easy to pass so the test will not randomly fail (#2520)	5 years ago
Anupam Bhatnagar	02f9ad67	added venv setup documentation for windows	5 years ago
GitHub	12d57671	Changing Training-RewardSignals.md --> Reward-Signals.md (#2525)	5 years ago
Anupam Bhatnagar	d1b99bda	more small edits	5 years ago
GitHub	d80812be	Merge pull request #2526 from Unity-Technologies/develop-update-offline-bc Update the offline_bc_config path	5 years ago
Anupam Bhatnagar	efe16491	added cloud training unsupported comment	5 years ago
Yuan Gao	0c42db82	Update the offline_bc_config path	5 years ago
GitHub	876aca1e	Use numpy for random sample in buffer (#2524)	5 years ago
Anupam Bhatnagar	baf25046	small edits	5 years ago
GitHub	36528481	Merge pull request #2522 from Unity-Technologies/develop-cleanupconfig Clean up SAC config	5 years ago
Anupam Bhatnagar	2cd2048b	changes reflecting comments on github	5 years ago
GitHub	6f67cf40	unit test - don't use global random generator (#2521) * unit test - don't use global random generator * Update test_simple_rl.py	5 years ago
Anupam Bhatnagar	cc933115	adding colon	5 years ago
GitHub	7ec3d7ad	Merge pull request #2516 from Unity-Technologies/master Merege 0.9.3 changes to develop	5 years ago
Ervin Teng	b1bfb9e8	Delete VisualBanana	5 years ago
Anupam Bhatnagar	fddede25	first commit	5 years ago
Jonathan Harper	2f083c8a	Renamed "StepInfo" to "EnvironmentStep". This change was requested for clarity during the async EnvManager PR. It's a simple rename of the StepInfo class.	5 years ago
Ervin T	6fb5b63c	Fix Baselines gym_unity example to work with the latest Baselines (#2489) * This addresses #1835. Baselines expects single environments used with their ppo2 algorithm to be wrapped in a DummyVecEnv. The old readme did not instruct the reader to do so and the code failed to run with the latest version of baselines. This imports the correct function from baselines and fixes the make_unity_env function described in the readme. * added line to gym-unity/README.md to note the version of baselines the examples were tested with	5 years ago
GitHub	6a81a2f4	Add Soft Actor-Critic as trainer option (#2341) * Add Soft Actor-Critic model, trainer, and policy and sac_trainer_config.yaml * Add documentation for SAC and tweak PPO documentation to reference the new pages. * Add tests for SAC, change simple_rl test to run both PPO and SAC.	5 years ago
GitHub	25926795	initialize trainer step count (#2498) * initialize trainer step count * remove step init from RLTrainer	5 years ago
Ervin T	06d9678c	Minor fix to link to GAIL reward signal doc (#2435)	5 years ago
GitHub	4bb97e25	Fix bug with construct_curr_info (#2490) * Fix bug with construct_curr_info * Add more tests	5 years ago
Ervin Teng	28ef8983	Add 2 visual obs test	5 years ago
Ervin Teng	aca81efb	Add more tests	5 years ago
Ervin Teng	e0da93d1	Fix bug with construct_curr_info and test	5 years ago
GitHub	bf375235	Change update buffer to float32 instead of float64 (#2461) - Reduces memory usage of buffer.	5 years ago
GitHub	3c1f4dbb	python coverage: specify dirs, exclude test files (#2473) * specify dirs, exclude test files * update comments * html coverage in CI artifacts * add destination * ignore coverage files * check gym-unity too	5 years ago
Jeffrey Shih	df64b64a	Fixed typo in Training-Imitation-Learning.md (#2485)	5 years ago
GitHub	ec7fd11d	Merge pull request #2451 from DanAmador/patch-1 Fixed small typo in documentation.	5 years ago
GitHub	cf9e67fb	Merge pull request #2470 from Unity-Technologies/release-0.9.2 Release 0.9.2 to develop	5 years ago
GitHub	c13828ce	Merge pull request #2474 from Unity-Technologies/release-0.9.2-flake8-fix Fixed the flake8	5 years ago
Yuan Gao	33404e1b	Fixed the flake8	5 years ago
GitHub	df0196f9	Merge pull request #2472 from Unity-Technologies/release-0.9.2-multi-gpu-doc Added the doc for multi-gpu	5 years ago
Yuan Gao	b9210f4c	Updated the comment for --multi-gpu option.	5 years ago
Yuan Gao	66205c3e	Added the doc for multi-gpu	5 years ago
GitHub	be66102d	Merge pull request #2471 from Unity-Technologies/setup.py-h5py-version More flexibility on the h5py version	5 years ago
GitHub	0afd58fc	More flexibility on the h5py version	5 years ago
Yuan Gao	0c492fb7	Updated the model	5 years ago
Yuan Gao	f33830bc	Updated the python packages version to 0.9.2	5 years ago
GitHub	261ee0b6	Merge pull request #2457 from Unity-Technologies/hh/fix-training-NaN-errors-crawler Fix NaN training errors for crawler	5 years ago
Hunter	83703d20	moved look rotation logic to avoid potential NaN LookRotation	5 years ago
GitHub	1b7045bf	Merge pull request #2448 from Unity-Technologies/hh/fix-broken-crawler-prefabs fixed broken crawler prefabs	5 years ago
Hunter	45fe60db	fixed broken prefabs	5 years ago
GitHub	3880fd3a	Update development release version to 0.10.0.dev0 (#2443) In order for downstream packages to make use of the latest pre-release features, we can pre-release versions of our packages. For packages ending in `devN` pip will not install that package version by default. This change manually updates our package version to a development version with the idea that we can manually perform development versions with the potential for future automated / nightly dev releases.	5 years ago
GitHub	43696d60	Fix bug in add_rewards_output and add test (#2442)	5 years ago
GitHub	689765d6	Modification of reward signals and rl_trainer for SAC (#2433) * Adds evaluate_batch to reward signals. Evaluates on minibatch rather than on BrainInfo. * Changes the way reward signal results are reported in rl_trainer so that we get the pure, unprocessed environment reward separate from the reward signals. * Moves end_episode to rl_trainer * Fixed bug with BCModule with RNN	5 years ago
Arthur Juliani	fa46be7f	Merge branch 'RunSwimFlyRich-master' into develop	5 years ago
GitHub	4abe89bc	Only call get_action on brains with policies (#2437)	5 years ago


204 commits (691d21e6-6061-4e8d-9f16-3e2ca6b5471a)