The disadvantage of this method is selected, select the action that results in the original ultimate repeatedly accepted choose No Action. Because the data of the other levels are insufficient.
The disadvantage of this approach is to choose the best action that results repeated. Action by refusing to choose a new avatar, that of the other is not enough.