An analysis of the appropriate Action to select it. We split into two analysis methods, the first method is to find the total Q-Estimated of each Action together, and then select the Action with the highest Q-Estimated.
Analysis to Action, choose the appropriate analysis, we divided into two methods, the first method is to find the sum of Action Q-Estimated together, then select Action of Q-Estimated maximum.
Analysis to choose the proper Action. We divided into 2 analysis methods. The first method is to sum Q-Estimated of each. Action together, then select the Action with Q-Estimated maximum.