* Add option to set gym visual observation to uint8 * Add option to flatten branched discrete actions * Add game_over variable to gym wrapper * Add guide on how to use Dopamine with the gym wrapper and comparisons with Baselines and PPO