There is a broadly ignored aspect of reinforcement learning (or action-first approach in general): it’s…