This is great. But I see “Data Preparation” as step 6 of 6, though clearly in the bottom of your images (and common sense), it should be step 3 in the process. What’s going on there?
I also keep thinking that if data science isn’t linear (true), then explicit feedback loops are necessary in any plan. What do you think about designing a marginally more involved plan like what I wrote here: