ml-agents

作者	SHA1	备注	提交日期
GitHub	f86fc81d	[refactor] Move configuration files to single YAML file (#3791 )	5 年前
Hunter-Unity	cb8eec30	Create WalkerDynamic.yaml	5 年前
GitHub	c0d96ecd	Increase 3DBall generalization sampling interval (#3995 ) * increase sampling interval to 2000 * bring it up to 5000	5 年前
Ervin Teng	f214836a	Changes for speed test	4 年前
Hunter-Unity	da6d25c9	updated walker dynamic demo file. cleanup	5 年前
Hunter-Unity	c9821f85	100M steps	5 年前
Hunter-Unity	f4c8f344	2e7 steps	5 年前
GitHub	e92b4f88	[refactor] Structure configuration files into classes (#3936 )	4 年前
Hunter-Unity	85958dad	try 8x mem for cloud	5 年前
Hunter-Unity	b06dd988	8x batch size for cloud test	5 年前
Hunter-Unity	6b92b01a	epoch 10	5 年前
Andrew Cohen	a0dc8789	test new sampling method	4 年前
Hunter-Unity	e032db74	hyptest	5 年前
Hunter-Unity	f17b1075	increase timescale for cloudtraining	5 年前
Hunter-Unity	769dbec5	cp	4 年前
Hunter-Unity	b3bf1418	try new cluster	4 年前
Hunter-Unity	a3f7b980	cp	4 年前
Andrew Cohen	4464ca46	ignoring commit checks	4 年前
Hunter-Unity	aca47e1f	200k buff cloud	4 年前
Andrew Cohen	91217b0d	use settings.py to check PR config	4 年前
Chris Elion	20b5a157	update scenes and get them training	4 年前
Hunter-Unity	32feefee	update configs	4 年前
GitHub	91f199cd	Self play hyperparameter improvements (#4063 )	4 年前
GitHub	101a8e00	Add Dynamic Walker. Improved Ragdoll Stability/Performance (#4037 ) * about to implement orientation cube * oCube spawining works. ready to train * working. about to try com * ready for training * add random rot on episode start * feet now alternate but runs backwards * still running with right leg in front * increased joint strength to 40k * removed texture example * reduced maxAngVel, enabled enhanced determinism, cont spec * rebuilt walker ragdoll to scale 1 * rebuilt ragdoll ready * update walker pair prefab * fixed bp heirarchy * added trained model, renamed scene, usecollisioncallbacks * updated dynamic platforms * added dynamic walker tf file. max speed 5 * DynamicWalker working. has working nn file * collect local rotations * added new dynamic nn file * hip facing reward * Create WalkerDynamic.yaml * fix hip rotation * about to clean up code * added dirIndicator and orentCubeGizmo * clean up * clea...	4 年前
HH	de87c750	Create WalkerDynamic.yaml	4 年前
Andrew Cohen	c0f7052b	Merge branch 'master' into develop-sampler-refactor	4 年前
Andrew Cohen	34ecc7e6	Merge branch 'master' into asymm-envs	5 年前
Andrew Cohen	33458d24	running cubewar	5 年前
HH	2d2844bd	updated walker dynamic demo file. cleanup	4 年前
GitHub	a1c63c4b	Release 3 Cherry-pick bug-fixes and doc changes from master (#4102 ) * [bug-fix] Fix regression in --initialize-from feature (#4086) * Fixed text in GettingStarted page specifying the logdir for tensorboard. Before it was in a directory summaries which no longer existed. Results are now saved to the results dir. (#4085) * [refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087) * Reverting bug introduced in #4071 (#4101) Co-authored-by: Scott <Scott.m.jordan91@gmail.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	8a49e8e0	[refactor] Remove nonfunctional `output_path` option from TrainerSettings (#4087 )	4 年前
Andrew Cohen	34f3ac64	updated cube war	5 年前
HH	8f463c55	100M steps	4 年前
Andrew Cohen	81cc5f69	reduce epsilon tennis ppo	5 年前
Anupam Bhatnagar	4afd8f92	first commit	4 年前
Andrew Cohen	45293f01	larger batch size	5 年前
HH	999fc7ab	2e7 steps	4 年前
Andrew Cohen	c68e865b	opp	5 年前
Andrew Cohen	03eef40b	constrain x tennis	5 年前
HH	ef3be52c	try 8x mem for cloud	4 年前
HH	25d7ba5e	8x batch size for cloud test	4 年前
Andrew Cohen	0c17dc1b	cannot hit scenery tennis	5 年前
HH	2cce3bbe	epoch 10	4 年前
HH	90c7d05f	hyptest	4 年前
Andrew Cohen	31a5b2ee	4096 batch	5 年前
GitHub	5b0a5b9b	Moving domain randomization to C# (#4065 )	4 年前
Andrew Cohen	71d7c24b	0.0 latest model	5 年前
Arthur Juliani	9724c9ac	Merge master	4 年前
HH	f7dd600f	increase timescale for cloudtraining	4 年前
Andrew Cohen	346a90ba	move agent back	5 年前
HH	65b80abb	cp	4 年前
HH	fa937cb9	try new cluster	4 年前
yanchaosun	c2d6f5c0	basic implementation	4 年前
HH	ba835a22	cp	4 年前
HH	ad2e63d6	200k buff cloud	4 年前
Andrew Cohen	8c0b3548	reduce batch size Tennis	4 年前
HH	48d78ac7	update configs	4 年前
Anupam Bhatnagar	24d5f881	first commit	4 年前
HH	a121795d	Merge branch 'hh/develop/dynamic-walker' of https://github.com/Unity-Technologies/ml-agents into hh/develop/dynamic-walker	4 年前
HH	ced14d9d	update configs to new class format	4 年前
Andrew Cohen	1f305f23	no latest model	5 年前
Jonathan Harper	4e7a1170	Adding training configs	4 年前
Jonathan Harper	7656f419	More experimentation	4 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
Andrew Cohen	4839e040	try team change zero	5 年前
Andrew Cohen	48f02b61	int as team change	5 年前
Andrew Cohen	12bc2143	large window	5 年前
Andrew Cohen	4f03be74	window 30	5 年前
Andrew Cohen	5efa1e92	time hor	5 年前
Andrew Cohen	68c6d513	reduce time hor	5 年前
vincentpierre	599d7e9f	Merging master	4 年前
HH	7afa1761	Merge branch 'master' into hh/develop/ragdoll-updates	4 年前
GitHub	1308b344	[CI] Better hyperparameters for Pyramids-SAC, WalkerStatic-SAC, and Reacher-PPO (#4154 )	4 年前
GitHub	8b913a96	Add TargetController/OrientationCubeController Components & Bugfix (#4157 ) * added Target and OCube controllers. updated crawler envs * update walker prefab * add refs to prefab * Update Crawler.prefab * update platform, ragdoll, ocube prefabs * reformat file * reformat files * fix behavior name * add final retrained crawler and walker nn files * collect hip ocube rot in world space * update crawler observations and update prefabs * change to 20M steps * update crwl prefab to 142 observ * update obsvs to 241. add expvel reward * change walkspeed to 3 * add new crawler and walker nn files * adjust rewards * enable other pairs * add RewardManager * cleanup about to do final training * cleanup add nn files for increased facing rew reduced height rew * try no facing rew * add vel only policy, try dy target * inc torq on cube * added dynamic cube nn. gonna try 40M steps * add 40M step test, more cleanup * ch...	4 年前
HH	84430eec	update config to match master	4 年前
GitHub	d42e82a8	Fix 3DBall PPO hard regression (#4133 )	4 年前
GitHub	8eefdcd3	Refactor of Curriculum and parameter sampling (#4160 ) * Introduced the Constant Parameter Sampler that will be useful later as samplers and floats can be used interchangeably * Refactored the settings.py to refect the new format of the config.yaml * First working version * Added the unit tests * Update to Upgrade for Updates * fixing the tests * Upgraded the config files * Fixes * Additional error catching * addressing some comments * Making the code nicer with cattr * Added and registered an unstructure hook for PrameterRandomization * Updating C# Walljump * Adding comments * Add test for settings export (#4164) * Add test for settings export * Update ml-agents/mlagents/trainers/tests/test_settings.py Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Including environment parameters for the test for settings export * First documentation up...	4 年前
HH	0fdac847	Merge branch 'master' into hh/develop/crawler-ragdoll-updates	4 年前
HH	9e6edb6c	try new reward falloff	4 年前
HH	c3c83920	cleanup	4 年前
Andrew Cohen	d8c123a0	Merge branch 'master' into sensitivity	4 年前
HH	e2217a9a	new curve	4 年前
Ruo-Ping Dong	262f38ea	add basketball example	4 年前
HH	00cb4c89	add WalkerStaticVariableSpeedScene and PPO config	4 年前
HH	7c63197e	start dynamic cleanup and more debug for NaNs	4 年前
HH	977287dd	add all scenes	4 年前
HH	b88434f8	increase to 30M	4 年前
HH	8eaddb61	Merge branch 'master' into hh/develop/loco-walker-variable-speed	4 年前
HH	c038362c	use all bp for avg vel	4 年前
GitHub	b51347ac	New Variable Speed Walker Environments (#4301 ) * init * Add reward manager and hurryUpReward * fix hurry reward/ add awful first training * Turn off head height and hurry rew * changed max speed to 15. added small hh rew * add NaN check for reward manager. start vel penalty * add bpVel pen * add new BPVelPen nn file * remove outdated nn file * add randomize speed bool * try rewad product * change coeff to 1 * try avg vel of all bp for reward * move outside loop * try linear inverselerp for vel * add avg rew matchspeed15 nn file. looks much better * save scene * no hand penalty, random walk speed * fix inverse lerp * try new reward falloff * cleanup * added new nn file. don't allow hand contact * update obsv * remove hh rew. add trained no-hh model * add new nn file * new curve * add new models. try no reset * add hh rew * clamp hh * zero rewards if ground contact * switch to approved with movi...	4 年前
HH	1bbd76fe	update prefabs	4 年前
Ervin Teng	d65a9326	Merge branch 'master' into develop-add-fire-mm3	4 年前
Ruo-Ping Dong	d57aa9ab	Merge branch 'develop-add-fire-mm3' into develop-add-fire-checkpoint	4 年前
Ervin Teng	7032fe82	Reduce max steps for striker vs. goalie	4 年前
HH	ef62939e	updating prefabs	4 年前
GitHub	bd6bcd2f	Merge master and add Saver class for save/load checkpoints	4 年前
Ervin Teng	42e25b25	Merge branch 'develop-add-fire' into develop-add-fire-memoryclass	4 年前
Andrew Cohen	b822283f	merge add fire	4 年前
Christopher Goy	5a233353	Merge remote-tracking branch 'origin/master' into release_6-to-master	4 年前
Andrew Cohen	5f7a7e44	revert tennis config	4 年前
GitHub	abfadb3d	Reduce max steps for striker vs. goalie (#4377 )	4 年前
HH	7e7743d1	update static prefabs	4 年前
Ervin Teng	6455654b	Shorten max steps for strikergoalie	4 年前
HH	e3b1c5cf	add nn files. update to 15M steps	4 年前
GitHub	a79aa854	[ci] Shorten max steps for strikergoalie (#4394 )	4 年前
vincentpierre	ba7eb360	Merge branch 'master' into develop-torch-save-rp	4 年前
HH	5bedaef6	add configs	4 年前
HH	f0a12c70	update configs/prefabs	4 年前
HH	a9d9ea4c	Merge branch 'master' into hh/develop/loco-crawler-variable-speed	4 年前
Scott Jordan	3d98516d	incorporated task parameter channel branch added the ability to set task parameters from python	4 年前
Anupam Bhatnagar	f4f1a8d9	merge master into trainer-plugin branch	4 年前
Scott Jordan	56745026	Initial commit of running active learning code Active learning code is running on walker variable speed. Needs to be tested to see if it is working.	4 年前
Scott Jordan	78f8a9a2	Updated task manager active learning is no optional and defaults to uniform sampling of tasks. Renamed ActiveLearningTaskManager to just TaskManager	4 年前
vincentpierre	0dd5effa	DO NOT MERGE	4 年前
vincentpierre	7cfb763d	[DO NOT MERGE]	4 年前
vincentpierre	9b8924a6	-	4 年前
Scott Jordan	e33168d6	Added comments and new yaml files for variable speed walker	4 年前
vincentpierre	e2e62cb9	-	4 年前
GitHub	a117c932	Grid Sensor (#4399 ) Co-authored-by: Chris Elion <chris.elion@unity3d.com>	4 年前
vincentpierre	3b8a8971	no threading	4 年前
GitHub	582859b6	New Crawler Variable Speed Scenes (#4382 ) * init * updating prefabs * spawn a target * add brains * update static prefabs * enable enhanced determinism * reset manifest * add nn files. update to 15M steps * update prefabs * increase max speed to 15 * add new local model for 15 speed * update prefabs * add configs * update configs/prefabs * cleanup * added final nn models * add new demos and do more cleanup. * add meta files * add RigidbodySensor * update prefab. about to retrain * remove body pen * add fixed crawler & retrained nn file, new demos * train 10M steps * Update Crawler Docs * more prefab cleanup * add meta files * Update Project/Assets/ML-Agents/Examples/Crawler/Scripts/CrawlerAgent.cs Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * remove unused prefab * update comment * add summary tags * cleanup and add more comments * remove unused prefab * Update P...	4 年前
GitHub	cc10cd82	Worm Ragdoll & Env Updates (#4413 ) * add worm updates * add rewman * cp * normalize rewards * only cookie * try 20M. Add3.5Mnn file * reduce strength to 3000spring * facing reward troubleshooting * Update WormAgent.cs * troubleshoot nan * try product of rewards * train 5M steps * try end episode on target touch * fix joint obsv * use 7M steps * added nn file for observation joint fix. looks great * don't end episode * remove old code * refactor to patterns used in walker & crawler * add auto-setup code * reformat * use head vel * remove unneeded observ. update prefabs * update static scenes * keeps rolling. added debug. try 5 m/s * gate the facing reward based on angle tolerance * added 10ms_angle30rew_nn files * use fromto rot * use 7M steps * add new trained files. cleanup code and prefabs * use avgvel. add code comments * remove unused method * add more comments * Update Learning-E...	4 年前
Andrew Cohen	3997b14b	Merge branch 'master' into develop-hybrid-actions	4 年前
Ervin Teng	d4beb937	Make 3dball longer	4 年前
GitHub	60b76790	Random Network Distillation for Torch (#4473 ) * initial commit * works with Pyramids * added unit tests and a separate config file * Adding first batch of documentation * adding in the docs that rnd is only for PyTorch * adding newline at the end of the config files * adding some docs * Code comments * no normalization of the reward * Fixing the tests * [skip ci] * [skip ci] Make sure RND will only work for Torch by editing the config file * [skip ci] Additional information in the Documentation * Remove the _has_updated_once flag	4 年前
HH	0d42b277	train combo. added nn files.	4 年前
HH	d02c90f6	added more variants	4 年前
HH	1912e47a	Dynamic Sensor Benchmarks In	4 年前
GitHub	9e1a28c2	Add vector flag of agent's frozen state to VisualFoodCollector (#4511 ) VisualFoodCollector is now an example environment of using a mix of visual and vector observation and is able to train with default config file. Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	b33e310f	Add Visual3DBall scene (#4513 ) * Add Visual3DBall scene which use visual observations with stacking	4 年前
Andrew Cohen	e5f14400	Merge branch 'master' into develop-hybrid-actions-singleton	4 年前
Andrew Cohen	2f870407	bullet hell game	4 年前
Ervin Teng	56196761	hyperparameteers and tweaks	4 年前
GitHub	90a9d214	Match3 example (#4515 )	4 年前
Ervin Teng	89489ae0	Invert divide by 3 in log prob	4 年前
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704 )	4 年前
Ervin Teng	7bec1df2	Better hyperparams	4 年前
HH	281e0be1	added sensors & controls UI	4 年前
Chris Elion	8cf87ed6	match3 settings	4 年前
Ervin Teng	e1378efc	Merge commit '6d729a0a2b2ba1fc946720cdb7871c9be3e38d45' into develop-fix-nan	4 年前
Ervin Teng	4c49f181	Change num envs	4 年前
vincentpierre	e14e1c4d	Improvements and new tests	4 年前
Andrew Cohen	d62f6b0a	modify bullet/attn	4 年前
GitHub	edc2ae2f	[bug-fix] Disable threading for self-play envs (#4679 )	4 年前
Ervin Teng	ce7d34a3	Revert "Invert divide by 3 in log prob" This reverts commit a708af66e740f19df5082b4b4e152a566c703385.	4 年前
GitHub	63704803	[bug-fix] Disable threading for self-play envs (#4679 ) (#4681 )	4 年前
Andrew Cohen	ef8f70e8	Add WalljumpPushblock env	4 年前
Ervin Teng	5130c9b3	Add walljump collab YAML	4 年前
GitHub	cc6b4564	Multi Directional Walker and Initial Hypernetwork (#4740 )	4 年前
Ervin Teng	d816513e	Add config and group ids to HallwayCollab	4 年前
Andrew Cohen	8a95b0bb	rays and disc	4 年前
Andrew Cohen	5b2e704f	updated heuristic	4 年前
Andrew Cohen	5bbe796b	update soccer raycasts	4 年前
Andrew Cohen	34420044	fix trainer c and soccer config	4 年前
Andrew Cohen	ca5a5194	soccer comms on the cloud	4 年前
Andrew Cohen	12828bdc	remove tau from diff for	4 年前
HH	16acb693	update max steps and add config	4 年前
HH	fce83c8a	try curiosity	4 年前
HH	9d17392a	about to merge in master	4 年前
HH	dd1fbd8a	update config to train 5M steps	4 年前
Andrew Cohen	c183040a	update soccer scene	4 年前
vincentpierre	f7a4a31f	[Experiment] Bullet hell	4 年前
Andrew Cohen	f57875e0	layer norm	4 年前
Andrew Cohen	6fae089e	bullet config	4 年前
Andrew Cohen	a6294e38	run bullet on cloud	4 年前
HH	5c5539af	add zomb scene	4 年前
HH	fd7d9c4a	add trained models	4 年前
HH	a738d235	add new env scene	4 年前
Andrew Cohen	32d77b5e	Merge branch 'develop-hybrid-action-staging' into develop-hybrid-actions-singleton	4 年前
Andrew Cohen	e2506856	sequence env	4 年前
Andrew Cohen	bedf9886	update sequencer env	4 年前
Andrew Cohen	9effa1b5	update sorter yaml	4 年前
Ruo-Ping Dong	a7d04be6	Merge branch 'develop-hybrid-actions-singleton' into develop-hybrid-actions-csharp	4 年前
HH	a29ce02c	train 4 env	4 年前
Ruo-Ping Dong	224d2087	add team reward	4 年前
Ervin Teng	384bfaac	Add configuration yaml for pushblockcollab	4 年前
Andrew Cohen	fecddfed	refactored sequence env	4 年前
Andrew Cohen	3a4aa513	COMAA runs	4 年前
Andrew Cohen	5741f8f6	no target net	4 年前
Arthur Juliani	1cf97635	Additional conditional experiments	4 年前
Andrew Cohen	a4c336c2	value estimator	4 年前
Arthur Juliani	d2526ce2	Modify CrawlerDynamic	4 年前
Andrew Cohen	2792cc87	update coma config	4 年前
Andrew Cohen	6c6d54b0	cubewars config	4 年前
Andrew Cohen	bd341f7f	no target, increase lambda	4 年前
Andrew Cohen	00e3c5c5	fix config	4 年前
GitHub	8cf3b93b	Merge pull request #4741 from Unity-Technologies/walljump-pushblock Add WalljumpPushblock env	4 年前
Arthur Juliani	759fd2b5	PushJump modifications	4 年前
Andrew Cohen	e997a5fc	cloud config	4 年前
Arthur Juliani	b84b4880	Add GoalNav environment	4 年前
Andrew Cohen	fce842aa	adding zombie to coma2 brnch	4 年前
Andrew Cohen	b0bf7817	clipping values and updated zombie	4 年前
Andrew Cohen	da4f4ae8	update configs	4 年前
vincentpierre	8dd003e6	-	4 年前
Andrew Cohen	869a2811	update zombie config	4 年前
Andrew Cohen	2047ab1f	cubewars config	4 年前
vincentpierre	48bd37ee	-	4 年前
Ervin Teng	e9e80149	Change names of behaviors	4 年前
Andrew Cohen	e1061302	config	4 年前
Ervin Teng	f4f559da	Remove a bunch of stuff from envs	4 年前
Ervin Teng	844b5955	Remove a bunch of extra files	4 年前
Ervin Teng	985c80d7	Remove remaining files	4 年前
GitHub	ed28d1ba	[MLA-1768] retrain Match3 scene (#4943 ) * improved settings and move to default_settings * update models	4 年前
vincentpierre	fdf21dbd	addressing some of the comments	4 年前
GitHub	307d7cd2	Merge pull request #4912 from Unity-Technologies/develop-var-len-obs-feature-refactor-model-loader-checks Develop var len obs feature refactor model loader checks	4 年前
vincentpierre	695c02fd	[skip ci] Attempting new config	4 年前
vincentpierre	272097ed	new curriculum	4 年前
vincentpierre	9f51d91a	New curriculum, new model	4 年前
Christopher Goy	9cadfa7a	Merge master -> release_13_branch-to-master	4 年前
GitHub	332e9b8b	Merge pull request #4909 from Unity-Technologies/develop-var-len-obs-feature Develop var len obs feature	4 年前
Ruo-Ping Dong	b5da488d	Merge branch 'master' into develop-base-teammanager	4 年前
Andrew Cohen	dc8e8494	Merge branch 'master' into develop-critic-optimizer	4 年前
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	4 年前
Ervin Teng	93a59971	Merge branch 'develop-critic-optimizer' into develop-critic-op-lstm	4 年前
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
vincentpierre	3499a645	-	4 年前
GitHub	4d5545c8	Set ignore done=False in GAIL (#4971 )	4 年前
Ervin Teng	f409c40c	Merge branch 'master' into develop-agentprocessor-teammanager	4 年前
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 年前
HH	15d512f9	Merge branch 'master' into hh/develop/dodgeball	4 年前
Ervin Teng	08db7c2f	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer-mm	4 年前
vincentpierre	8f729b75	Fixing the number of layers in the config of PyramidsRND	4 年前
GitHub	5ce1083b	Merge pull request #5006 from Unity-Technologies/fix-num-layers-rnd-pyramids Fixing the number of layers in the config of PyramidsRND	4 年前
Christopher Goy	747e2228	Merge branch 'master' into release_13_branch-to-master	4 年前
GitHub	ccca1309	Merge pull request #5007 from Unity-Technologies/release_13_branch-to-master Release 13 branch to master	4 年前
Ervin Teng	4b159789	Add PushBlockCollab config and fix some stuff	4 年前
Chris Elion	f5bf6e08	simple TicTacToe example	4 年前
HH	4c947151	Merge branch 'main' into hh/develop/dodgeball	4 年前
Ervin Teng	61781a1a	Merge branch 'main' into develop-agentprocessor-teammanager	4 年前
Andrew Cohen	9060da06	Merge branch 'develop-agentprocessor-teammanager' into develop-coma2-trainer	4 年前
HH	1f8aa5c3	add simple training scene	4 年前
Arthur Juliani	06c147f8	Merge remote-tracking branch 'origin/main' into goal-conditioning-new # Conflicts: # Project/Assets/ML-Agents/Examples/Crawler/Prefabs/CrawlerBase.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Prefabs/Area.prefab # Project/Assets/ML-Agents/Examples/GridWorld/Scenes/GridWorld.unity # Project/ProjectSettings/TagManager.asset # com.unity.ml-agents/Runtime/Sensors/CameraSensor.cs # com.unity.ml-agents/Runtime/Sensors/VectorSensor.cs # ml-agents/mlagents/trainers/torch/networks.py # ml-agents/mlagents/trainers/torch/utils.py	4 年前
Ervin Teng	c8137dcd	Merge branch 'main' into develop-superpush-int	4 年前
GitHub	85f8b40b	Removing some scenes (#4997 ) * Removing some scenes, All the Static and all the non variable speed environments. Also removed Bouncer, PushBlock, WallJump and reacher. Removed a bunch of visual environements as well. Removed 3DBallHard and FoodCollector (kept Visual and Grid FoodCollector) * readding 3DBallHard * readding pushblock and walljump * Removing tennis * removing mentions of removed environments * removing unused images * Renaming Crawler demos * renaming some demo files * removing and modifying some config files * new examples image? * removing Bouncer from build list * replacing the Bouncer environment with Match3 for llapi tests * Typo in yamato test	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
Ervin Teng	d9cbae07	Dodgeball config update	4 年前
Christopher Goy	921ba4f0	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	ba2af269	[coma2] Make group extrinsic reward part of extrinsic (#5033 ) * Make group extrinsic part of extrinsic * Fix test and init * Fix tests and bug * Add baseline loss to TensorBoard	4 年前
Ervin Teng	f45afff3	Different YAML settings	4 年前
Ervin Teng	d5aee550	Add num_envs for cloud run	4 年前
Christopher Goy	ebe45056	Merge branch 'main' into release_14_branch-to-main	4 年前
Ervin Teng	8902c058	Merge branch 'main' into develop-coma2-trainer	4 年前
Chris Elion	970f1d40	Merge remote-tracking branch 'origin/v2-staging' into MLA-1634-ObservationSpec	4 年前
Ervin Teng	1f026c70	Merge branch 'main' into develop-superpush-branch-cleanup	4 年前
Ervin Teng	8263eb52	Backup more changes	4 年前
Ervin Teng	ce872033	Revert "Merge branch 'main' into develop-superpush-branch-cleanup" This reverts commit 5bea802525381f931a5e0f8b8778fe27a12f03af, reversing changes made to cee3524e85161e13689d95f66bc6bff994d2cdfd.	4 年前
Ervin Teng	8ef2c390	Merge branch 'develop-superpush-branch-cleanup' into develop-pushcollabonly	4 年前
GitHub	d015ef17	[environment] Push Block Collaborative (#5090 ) * Add pushblock collab * Make SimpleMultiAgentGroup public * Remove GoalDetectTrigger * Remove GDT meta file * Remove some comments * Add training configuration * Rename behavior * Add to docs * Change the reward structure in docs * Add back GoalDetectTrigger Co-authored-by: HH <brandonh@unity3d.com>	4 年前
Andrew Cohen	9e77d7e1	Merge branch 'main' into develop-soccer-groupman	4 年前
GitHub	62aa3d47	Move PushBlockCollab config to poca directory (#5097 )	4 年前
Ervin Teng	09e7e805	[cherry-pick] Move PushBlockCollab config to poca directory (#5097 )	4 年前
Andrew Cohen	d95d8d92	soccer fours, agent prefabs	4 年前
Andrew Cohen	9176247c	Merge branch 'main' into develop-soccer-groupman-mod	4 年前
GitHub	6895ba50	Integrate Group Manager to soccer/retrain with POCA (#5115 )	4 年前
Andrew Cohen	25be5ff7	increase beta	4 年前
GitHub	d2ee2e6f	[cherry-pick] Integrate Group Manager to soccer/retrain with POCA (#5115 ) (#5121 ) * Integrate Group Manager to soccer/retrain with POCA (#5115) * Add Soccer env to changelog Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	fe1d3e26	Fix GridFoodCollector yaml (#5134 )	4 年前
GitHub	6eef8929	Fix GridFoodCollector yaml (#5134 ) (#5136 ) Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com>	4 年前
GitHub	43147c1a	Remove env settings from Sorter (#5146 )	4 年前
GitHub	65cd8dab	Remove env settings from Sorter (#5145 )	4 年前
Ervin Teng	c108da4a	[bug-fix] Fix POCA LSTM, pad sequences in the back (#5206 ) * Pad buffer at the end * Fix padding in optimizer value estimate * Fix additional bugs and POCA * Fix groupmate obs, add tests * Update changelog * Improve tests * Address comments * Fix poca test * Fix buffer test * Increase entropy for Hallway * Add EOF newline * Fix Behavior Name * Address comments (cherry picked from commit 2ce6810846ba9268e4fb5fb082fa54e90414c980)	4 年前
vincentpierre	42a3732c	Code improvements	4 年前
Andrew Cohen	18be47e8	Merge branch 'main' into develop-soccer-groupman-mod	4 年前
vincentpierre	7fa8b242	Code improvements	4 年前
GitHub	2980ade0	Goal conditioning grid world : Example of goal conditioning (#5193 ) * Aded the Goal conditioned GridWorld to replace regular gridworld * adding missing files * Code improvements * Documentation change on gridworld * resolving conflicts * new model * Addressing comments * comments and renames * Update docs/Learning-Environment-Examples.md Co-authored-by: Ervin T. <ervin@unity3d.com> * adding reference to gridworld in docs about goal signal Co-authored-by: Chris Elion <chris.elion@unity3d.com> Co-authored-by: Ervin T. <ervin@unity3d.com>	4 年前
GitHub	c5589b59	[bug-fix] Fix POCA LSTM, pad sequences in the back (#5206 ) * Pad buffer at the end * Fix padding in optimizer value estimate * Fix additional bugs and POCA * Fix groupmate obs, add tests * Update changelog * Improve tests * Address comments * Fix poca test * Fix buffer test * Increase entropy for Hallway * Add EOF newline * Fix Behavior Name * Address comments	4 年前
GitHub	45e75e01	[config] Disable `threading` by default (#5221 ) * Remove threading as default * New description * Remove threaded option from YAML configs * Remove from Match3	4 年前
vincentpierre	4e14879d	Updating the barracuda 1.4.0 (#5291 ) Initial commit second commit. The no-extrinsic was trained without the log reward (reward = prob) while the new one is (reward = log_prob - log_prior) A few results, it looks like Walker-diverse-r05-bigger.onnx is doing something Modified pushblock using next state and action. Did not help Fixing bug that had 9 diversity settings instead of 8 removing results	4 年前
vincentpierre	47fa1682	-	4 年前
vincentpierre	8450b154	-	4 年前
Scott	130512b4	fixed episode length modification issue.	3 年前
Scott	97990611	Added decision frequency and evaluation metric	3 年前

... 2 3 4 5 6

272 次代码提交 (0ffad9aa-c8f8-446d-9f9c-afcf83c1c3f0)