To select an appropriate Action, we divide the analysis into two methods. The first method combines the Q-Estimated values of each Action and then selects the Action with the maximum combined Q-Estimated value.
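The first method can be sketched roughly as follows. This is a minimal illustration under assumptions, not the implementation described in the text: the source does not say how the Q-Estimated values are combined, so summation is assumed here, and the function name `select_action`, the dictionary layout, and the action labels are hypothetical.

```python
from typing import Dict, List


def select_action(q_estimates: List[Dict[str, float]]) -> str:
    """Combine Q-Estimated values per Action and return the best Action.

    Assumption: each entry of `q_estimates` maps an Action name to one
    Q-Estimated value, and the values are combined by summation.
    """
    if not q_estimates:
        raise ValueError("at least one set of Q-Estimated values is required")

    combined: Dict[str, float] = {}
    for estimate in q_estimates:
        for action, q_value in estimate.items():
            # Accumulate the Q-Estimated values belonging to the same Action.
            combined[action] = combined.get(action, 0.0) + q_value

    # Pick the Action whose combined Q-Estimated value is largest.
    # On a tie, the Action that was seen first is returned.
    return max(combined, key=combined.get)


# Hypothetical example with two sources of Q-Estimated values.
estimates = [
    {"left": 1.0, "right": 0.5},
    {"left": 0.25, "right": 1.5},
]
best_action = select_action(estimates)
```

If the combination is meant to be an average or a weighted sum rather than a plain sum, only the accumulation line needs to change; the final maximum selection stays the same.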