ml-agents

Author	SHA1	Message	Commit date
GitHub	2fd305e7	Move add_experiences out of trainer, add Trajectories (#3067)	5 years ago
GitHub	0b5b1b01	Develop magic string + trajectory (#3122) * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * added team id and identifier concat to behavior parameters * splitting brain params into brain name and identifiers * set team id in prefab * recieves brain_name and identifier on python side * rebased with develop * Correctly calls concatBehaviorIdentifiers * trainer_controller expects name_behavior_ids * add_policy and create_policy separated * adjusting tests to expect trainer.add_policy to be called * fixing tests * fixed naming ...	5 years ago
GitHub	f058b18c	Replace BrainInfos with BatchedStepResult (#3207)	5 years ago
GitHub	4641038e	Renaming max_step to interrupted in TermialStep(s) (#3908)	4 years ago
GitHub	b853e5ba	Action buffer (#4612) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	3c96a3a2	Action Model (#4580) Co-authored-by: Ervin T <ervin@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 years ago
GitHub	88d3ec3e	Merge master into hybrid actions staging branch (#4704)	4 years ago
Andrew Cohen	3f771e61	add ActionBuffers and utils	4 years ago
Andrew Cohen	653de147	fix AgentExperience typing	4 years ago
Ervin Teng	7a0ebfbd	Pretty broken	4 years ago
Ervin Teng	15c463cf	Add collab obs to trajectory	4 years ago
Ervin Teng	f479ce83	Fix bug; add critic_obs to buffer	4 years ago
Andrew Cohen	bd917c9c	action buffer passes continuous	4 years ago
Andrew Cohen	b36fcf16	discrete runs/cont passes	4 years ago
Ervin Teng	fdaa8c3d	Merge branch 'develop-unified-obs' into develop-centralizedcritic	4 years ago
vincentpierre	735fcd52	[WIP] Refactor trainers to use list of obs rather than vec and vis obs	4 years ago
vincentpierre	93ca1409	fixing the tests	4 years ago
vincentpierre	12619155	added some docstrings	4 years ago
Ervin Teng	56dcd75a	Get next critic observations into value estimate	4 years ago
vincentpierre	c1587bce	Solving merge conflicts	4 years ago
Andrew Cohen	8172b3d6	test_simple_rl/reward providers pass tf/torch	4 years ago
GitHub	cc6b4564	Multi Directional Walker and Initial Hypernetwork (#4740)	4 years ago
Ervin Teng	25dfd883	Merge branch 'master' into develop-centralizedcritic	4 years ago
Andrew Cohen	cd73cce2	test_trajectory fixed	4 years ago
GitHub	22658a40	use sensor types to differentiate obs (#4749)	4 years ago
vincentpierre	0c81006d	addressing comments	4 years ago
Andrew Cohen	5ec3fb98	fix action mask in trajectory	4 years ago
Andrew Cohen	e81e68de	comms agent and fixed hallway	4 years ago
Andrew Cohen	8071beb6	remove unused line in traj	4 years ago
Andrew Cohen	ca5a5194	soccer comms on the cloud	4 years ago
Andrew Cohen	12828bdc	remove tau from diff for	4 years ago
Ervin Teng	330fc1d0	Merge branch 'master' into develop-centralizedcritic-mm	4 years ago
Ervin Teng	9c3da1b6	New buffer layout, TeamObsUtil, pad dead agents	4 years ago
Ervin Teng	eab7e42a	Use NaNs to get masks for attention	4 years ago
Ervin Teng	fdf97d99	Add team reward to buffer	4 years ago
Ervin Teng	92fc78a5	Use new trajectory	3 years ago
Ervin Teng	65b866b0	Actions added but untested	4 years ago
Ervin Teng	0919a32d	Add next action and next team obs	4 years ago
Andrew Cohen	3a4aa513	COMAA runs	4 years ago
Andrew Cohen	feb38012	add lambda return and target network	4 years ago
Chris Elion	dbf1c946	WIP	3 years ago
Andrew Cohen	45dd7401	move from average to sum of rewards	3 years ago
GitHub	64fc7f43	Buffer key enums (#4907)	3 years ago
Ervin Teng	b21094f1	Use reward sum	3 years ago
Ervin Teng	eb13a14a	Renaming fest	3 years ago
Ervin Teng	a6b4917a	Use NamedTuples instead of attrs classes	3 years ago
Ervin Teng	a81512c9	Test for group and add team reward	3 years ago
Chris Elion	e4f51ca7	Merge remote-tracking branch 'origin/master' into MLA-1734-demo-provider	3 years ago
Ervin Teng	d4438878	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	3 years ago
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	3 years ago
Ervin Teng	be45d8c0	Move padding method to AgentBufferField	3 years ago
GitHub	d36a5242	Python Dataflow for Group Manager (#4926) * Make buffer type-agnostic * Edit types of Apped method * Change comment * Collaborative walljump * Make collab env harder * Add group ID * Add collab obs to trajectory * Fix bug; add critic_obs to buffer * Set group ids for some envs * Pretty broken * Less broken PPO * Update SAC, fix PPO batching * Fix SAC interrupted condition and typing * Fix SAC interrupted again * Remove erroneous file * Fix multiple obs * Update curiosity reward provider * Update GAIL and BC * Multi-input network * Some minor tweaks but still broken * Get next critic observations into value estimate * Temporarily disable exporting * Use Vince's ONNX export code * Cleanup * Add walljump collab YAML * Lower max height * Update prefab * Update prefab * Collaborative Hallway * Set num teammates to 2 * Add config and group ids to HallwayCollab * Fix bug with hallway collab * E...	3 years ago
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123)	3 years ago
Ervin Teng	a9fb37aa	Fix reporting of group rewards, CLI print of group	3 years ago
GitHub	b9cab453	[perf] Optimizations for performance (#5192) * Lazy init the buffer when sampling * Update references rather than copy data * Don't create unneeded numpy arrays * Remove self[key] from loop	3 years ago

55 commits (c4c67218-5ea3-45b6-a22d-3091c20fa411)