浏览代码

added to mig doc/address comments

/develop/cubewars
Andrew Cohen 4 年前
当前提交
573f80cd
共有 3 个文件被更改,包括 3 次插入2 次删除
  1. 1
      docs/Migrating.md
  2. 2
      ml-agents/mlagents/trainers/behavior_id_utils.py
  3. 2
      ml-agents/mlagents/trainers/ghost/controller.py

1
docs/Migrating.md


### Important changes
* The `--load` and `--train` command-line flags have been deprecated and replaced with `--resume` and `--inference`.
* Running with the same `--run-id` twice will now throw an error.
* The `play_against_current_self_ratio` self-play trainer hyperparameter has been renamed to `play_against_latest_model_ratio`
### Steps to Migrate
* Replace the `--load` flag with `--resume` when calling `mlagents-learn`, and don't use the `--train` flag as training

2
ml-agents/mlagents/trainers/behavior_id_utils.py


class BehaviorIdentifiers(NamedTuple):
"""
BehaviorIdentifiers is a named tuple if the identifiers that uniquely distinguish
BehaviorIdentifiers is a named tuple of the identifiers that uniquely distinguish
an agent encountered in the trainer_controller. The named tuple consists of the
fully qualified behavior name, the name of the brain name (corresponds to trainer
in the trainer controller) and the team id. In the future, this can be extended

2
ml-agents/mlagents/trainers/ghost/controller.py


logger = get_logger(__name__)
class GhostController(object):
class GhostController:
"""
GhostController contains a queue of team ids. GhostTrainers subscribe to the GhostController and query
it to get the current learning team. The GhostController cycles through team ids every 'swap_interval'

正在加载...
取消
保存