ml-agents

目录树: db058ef1

作者	SHA1	备注	提交日期
GitHub	3c9603d6	Demonstration Recorder (#1240 )	6 年前
GitHub	48578199	Fix brain name bug in offline bc (#1395 )	6 年前
vincentpierre	d1cb6ce0	Fix on the bc_offline_training using deep copies	6 年前
eshvk	cc9bdf17	Added logging per Brain of time to update policy, time elapsed during training, time to collect experiences, buffer length, average return	6 年前
eshvk	ef8009d9	Python code reformat via [`black`](https://github.com/ambv/black ). Features: - Reformat code via black. - Adding circleci configurations. - Add contribution guidelines. Steps to reproduce: - `pip install black` - `black <source code directory>`	6 年前
GitHub	a4d5b2d3	Doc/comment cleanup - Fix some occurrences of 'the the' (#2119 )	6 年前
GitHub	7b69bd14	Refactor Trainer and Model (#2360 ) - Move common functions to trainer.py, model.pyfromppo/trainer.py, ppo/policy.pyandppo/model.py' - Introduce RLTrainer class and move most of add_experiences and some common reward signal code there. PPO and SAC will inherit from this, not so much BC Trainer. - Add methods to Buffer to enable sampling, truncating, and save/loading. - Add scoping to create encoders in model.py	5 年前
Chris Elion	3cb1755e	When checking for the compatibility of the expert brain with the policy brain, we will remove the action descriptions from the dictionary of things we need to compare. This is to prevent the case where a user has different descriptions for his actions but still wants to train a brain using expert demonstrations. (#2517 )	5 年前

8 次代码提交 (db058ef1-c1e5-4f5f-9b78-82919cd9be8a)