ml-agents

作者	SHA1	备注	提交日期
Arthur Juliani	6879bae4	Initial optimizer port	5 年前
Arthur Juliani	7c3bd376	Refactoring policy and optimizer	5 年前
Arthur Juliani	2e51260a	Resolving a few bugs	5 年前
Arthur Juliani	947f0d32	Slightly closer to running model	5 年前
Arthur Juliani	3c82bf59	Training runs, but doesn’t actually work	5 年前
Arthur Juliani	8c6f4696	Fix a couple additional bugs	5 年前
Arthur Juliani	61d671d8	Add conditional sigma for distribution	5 年前
Arthur Juliani	a5b5b109	Mulkti-discrete now working	5 年前
Arthur Juliani	5f936990	Visual observations now train as well	5 年前
Arthur Juliani	1736559f	Combine actor and critic classes. Initial export.	5 年前
Arthur Juliani	be7e55e1	Use LSTM and fix a few merge errors	5 年前
Arthur Juliani	3eef9d78	Optimize np -> tensor operations	5 年前
Ervin Teng	72180f9b	Experiment with JIT compiler	5 年前
Arthur Juliani	9724c9ac	Merge master	5 年前
GitHub	cde8bd29	Convert List[np.ndarray] to np.ndarray before using torch.as_tensor (#4183 ) Big speedup in visual obs	5 年前
GitHub	05a11c96	Develop add fire exp framework (#4213 ) * Experiment branch for comparing torch * Updates and merging ervin changes * improvements on experiment_torch.py * Better printing of results * preliminary gpu experiment * Testing gpu * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * Prepare to see a lot of commits, because I like my IDE and I am testing on a server and I am using git to sync the two * _ * _ * _ * _ * _ * _ * _ * _ * Attempt at gpu on tf. Does not work * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * _ * Fixing learn.py	5 年前
GitHub	a28e2767	Update add-fire to latest master, including Policy refactor (#4263 ) * Update Dockerfile * Separate send environment data from reset (#4128) * Fixed a typo on ML-Agents-Overview.md (#4130) Fixed redundant "to" word from the sentence since it is probably a typo in document. * Updated the badge’s link to point to the newest doc version * Replaced all of the doc to release_3_doc * Fix 3DBall and 3DBallHard SAC regressions (#4132) * Move memory validation to settings * Update docs * Add settings test * Update to release_3 in installation.md (#4144) * rename to SideChannelManager +backcompat (#4137) * Remove comment about logo with --help (#4148) * [bugfix] Make FoodCollector heuristic playable (#4147) * Make FoodCollector heuristic playable * Update changelog * script to check for old release links and references (#4153) * Remove package validation suite from Project (#4146) * RayPerceptionSensor: handle empty and invalid tags (#4155...	4 年前
GitHub	69579611	[refactor] Refactor Actor and Critic classes (#4287 )	4 年前
Andrew Cohen	ccb492dc	ignore precommit/first bc commit	4 年前
Andrew Cohen	84ea84a6	bc loss for both continuous and disc	4 年前
Andrew Cohen	f74d301a	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
Andrew Cohen	22a0cabc	changed path to torch bc module	4 年前
GitHub	7ddfd81f	Added Reward Providers for Torch (#4280 ) * Added Reward Providers for Torch * Use NetworkBody to encode state in the reward providers * Integrating the reward prodiders with ppo and torch * work in progress, integration with PPO. Not training properly Pyramids at the moment * Integration in PPO * Removing duplicate file * Gail and Curiosity working * addressing comments * Enfore float32 for tests * enfore np.float32 in buffer	4 年前
Andrew Cohen	598826fe	Merge branch 'develop-add-fire' into develop-add-fire-bc	4 年前
GitHub	6b255790	Behavioral Cloning Pytorch (#4293 )	4 年前
GitHub	f374f87a	[add-fire] Add LSTM to SAC, LSTM fixes and initializations (#4324 )	4 年前
Ervin Teng	f4da3592	Add memories and sequence length to critic_pass	4 年前
Ervin Teng	fa0d3cb6	Fix next_obs in get_trajectory_value_estimates	4 年前
vincentpierre	108fac9a	Replace torch.detach().cpu().numpy() with a utils method	4 年前
GitHub	4e93cb6e	[torch] Restructure PyTorch encoders (#4421 ) * Move linear encoding to NetworkBody * moved encoders to processors (#4420) * fix bad merge * Get it running * Replace mentions of visual_encoders * Remove output_size property * Fix tests * Fix some references * Revert test_simple_rl * Fix networks test * Make curiosity test more accomodating * Rename total_input_size * [Bug fix] Fix bug in GAIL gradient penalty (#4425) (#4426) Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com> * Up number of steps * Rename to visual_processors and vector_processors Co-authored-by: andrewcoh <54679309+andrewcoh@users.noreply.github.com> Co-authored-by: Andrew Cohen <andrew.cohen@unity3d.com> Co-authored-by: Vincent-Pierre BERGES <vincentpierre@unity3d.com>	4 年前
GitHub	6f534366	Add torch_utils class, auto-detect CUDA availability (#4403 ) * Add torch_utils * Use torch from torch_utils * Add torch to banned modules in CI * Better import error handling * Fix flake8 errors * Address comments * Move networks to GPU if enabled * Switch to torch_utils * More flake8 problems * Move reward providers to GPU/CPU * Remove anothere set default tensor * Fix banned import in test	4 年前
Ervin Teng	3e771cbb	Permute visual obs outside of network	4 年前
Ervin Teng	77c810fb	Fix SAC and make utility method	4 年前
vincentpierre	d3d4eb90	Trainer with attention	4 年前
Ervin Teng	95bdbba3	Less broken PPO	4 年前
Ervin Teng	5a5bd515	Fix multiple obs	4 年前
vincentpierre	735fcd52	[WIP] Refactor trainers to use list of obs rather than vec and vis obs	4 年前
Ervin Teng	cb4b7ed3	Some minor tweaks but still broken	4 年前
Ervin Teng	56dcd75a	Get next critic observations into value estimate	4 年前
GitHub	cc6b4564	Multi Directional Walker and Initial Hypernetwork (#4740 )	4 年前
GitHub	22658a40	use sensor types to differentiate obs (#4749 )	4 年前
vincentpierre	44ed3258	Merging master	4 年前
vincentpierre	449712b0	renaming sensor_spec to sensor_specS	4 年前
Ervin Teng	330fc1d0	Merge branch 'master' into develop-centralizedcritic-mm	4 年前
Ervin Teng	ad439fb6	Additional changes	4 年前
Ervin Teng	d02a1033	Some more fixes	4 年前
GitHub	7387a77f	remove pylint (#4836 ) * remove pylint * remove other pylint disables	4 年前
Arthur Juliani	0b4b0992	Rename more files	4 年前
Ervin Teng	aba633b2	Merge branch 'develop-attention-refactor' into develop-centralizedcritic-mm	4 年前
Arthur Juliani	0a876b9c	Fix typos	4 年前
Ervin Teng	9c3da1b6	New buffer layout, TeamObsUtil, pad dead agents	4 年前
GitHub	67ad9651	Merge pull request #4825 from Unity-Technologies/sensor-types [WIP] Observation Types	4 年前
Ervin Teng	6b8b3db3	Try subtract marginalized value	4 年前
Ervin Teng	092ea232	Some more progress - still broken	4 年前
Ervin Teng	457b2630	I think it's running	4 年前
Andrew Cohen	6e1826f8	might be right	4 年前
Andrew Cohen	1511588d	forcing this to work	4 年前
Andrew Cohen	e1fad8a4	buffer error	4 年前
Andrew Cohen	feb38012	add lambda return and target network	4 年前
Andrew Cohen	a4c336c2	value estimator	4 年前
Andrew Cohen	fce842aa	adding zombie to coma2 brnch	4 年前
Andrew Cohen	7f491ae7	cloud run with coma2 of held out zombie test env	4 年前
Andrew Cohen	9af22d30	use only value funcs	4 年前
Andrew Cohen	95253b47	ntegrate teammate dones	4 年前
Andrew Cohen	687f411b	try again on cloud	4 年前
Andrew Cohen	f9ff3fef	shared baseline and v	4 年前
Ervin Teng	3283b6a1	Remove Q-net for perf	4 年前
Ervin Teng	b6f88d6d	Merge branch 'develop-base-teammanager' into develop-agentprocessor-teammanager	4 年前
Andrew Cohen	6bd396ee	add critic to optimizer, ppo runs	4 年前
Andrew Cohen	3aec18a1	fix precommit errors	4 年前
Andrew Cohen	8efdeeb0	make critic a property	4 年前
Ervin Teng	514873bf	Use correct memories (t-1 instead of t) for training	4 年前
Ervin Teng	219e773b	Merge branch 'develop-fix-lstms' into develop-critic-op-lstm	4 年前
Ervin Teng	ae7643b8	Proper critic memories for PPO	4 年前
Ervin Teng	2b0dd850	Still somewhat broken but cleaner	4 年前
Ervin Teng	64839237	Fix indexing issue	4 年前
Ervin Teng	21e9785a	Fix padding issues	4 年前
Ervin Teng	8d834f0b	Fix more indexing bugs	4 年前
Ervin Teng	4fc0f93e	Code cleanup	4 年前
Ervin Teng	6a573ebf	Code cleanup	4 年前
Ervin Teng	f3cec983	Append the right memories	4 年前
Ervin Teng	a9666a0b	Don't pad when not needed	4 年前
Ervin Teng	c2883f5b	Pad from back of trajectory	4 年前
Ervin Teng	e46a86ad	Merge branch 'master' into develop-superpush-int	4 年前
GitHub	338af2ec	Move the Critic into the Optimizer (#4939 ) Co-authored-by: Ervin Teng <ervin@unity3d.com>	4 年前
GitHub	c1d19e89	Fix gpu pytests (#5019 ) * Move tensors to cpu before converting it to numpy	4 年前
Andrew Cohen	131fa328	inital evaluate_by_seq, does not run	4 年前
Andrew Cohen	67beef88	finished evaluate_by_seq, does not run	4 年前
Andrew Cohen	8f799687	ignoring precommit, grabbing baseline/critic mems from buffer in trainer	4 年前
GitHub	f16ce486	Update v2-staging from main (March 15) (#5123 )	4 年前
GitHub	ba2af269	[coma2] Make group extrinsic reward part of extrinsic (#5033 ) * Make group extrinsic part of extrinsic * Fix test and init * Fix tests and bug * Add baseline loss to TensorBoard	4 年前
GitHub	d24b0966	[bug-fix] Fix memory leak when using LSTMs (#5048 ) * Detach memory before storing * Add test * Evaluate with no_grad	4 年前
Ervin Teng	c108da4a	[bug-fix] Fix POCA LSTM, pad sequences in the back (#5206 ) * Pad buffer at the end * Fix padding in optimizer value estimate * Fix additional bugs and POCA * Fix groupmate obs, add tests * Update changelog * Improve tests * Address comments * Fix poca test * Fix buffer test * Increase entropy for Hallway * Add EOF newline * Fix Behavior Name * Address comments (cherry picked from commit 2ce6810846ba9268e4fb5fb082fa54e90414c980)	4 年前
Ervin Teng	d461a66a	Fix padding in optimizer value estimate	4 年前
Ervin Teng	81b74634	Fix additional bugs and POCA	4 年前
Ervin Teng	9fd4a81e	Address comments	4 年前

1 2

96 次代码提交 (0968daa8-51d2-433d-a6e4-e3dd0f33392a)