The disadvantage of this method is that it repeatedly selects the action that produced the best result at the start and never tries the other actions. This happens because too little data has been collected on the other actions for their value to be estimated reliably.
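The effect can be illustrated with a minimal sketch. It assumes the method in question is a purely greedy selection rule over a fixed set of actions. The function name `run_greedy`, the reward values, and the use of deterministic rewards are assumptions made for illustration and do not come from the text.

```python
def run_greedy(true_rewards, steps):
    """Always pick the action with the highest estimated value so far."""
    n_actions = len(true_rewards)
    estimates = [0.0] * n_actions   # running mean reward per action
    counts = [0] * n_actions        # how often each action was selected

    for _ in range(steps):
        # Greedy choice: highest current estimate; ties go to the lowest index.
        action = max(range(n_actions), key=lambda a: estimates[a])
        # Hypothetical environment: each action returns a fixed reward.
        reward = true_rewards[action]
        counts[action] += 1
        # Incremental update of the mean reward for the chosen action.
        estimates[action] += (reward - estimates[action]) / counts[action]

    return counts, estimates


counts, estimates = run_greedy(true_rewards=[0.5, 1.0, 0.8], steps=100)
print(counts)     # [100, 0, 0]
print(estimates)  # [0.5, 0.0, 0.0]
```

In this sketch the first action is tried first, its estimate rises above the untouched zero estimates of the others, and it is then chosen on every later step. The second action, which would pay more, is never sampled, so no data about it is ever gathered.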