Eight guidelines that will help you execute your data science initiatives with excellence

Dialexa, an IBM Company
back to the napkin
Published in
2 min readJan 16, 2020

By Dave Jimenez, VP of Strategy for Dialexa

We are fortunate to be able to work on meaningful data science engagements with our customers. As we reflect on those engagements from 2019, we want to share some of the lessons learned to further improve how we approach data science in the future.

We live in an agile world with respect to how we typically execute on technology initiatives — nothing surprising to those of us in the technology building business. With that said, data science work can be hard to measure, causing breakdowns in both traditional and agile processes. We thought quite a lot about this dichotomy because we feel strongly that core tenants like transparency and the ability to pivot are very critically important when working on data science projects. These eight guidelines will help you execute with excellence.

  1. Liberally use time-boxed spikes for exploring ideas and tracking open-ended work .
  2. Always build naive baseline models first, and then incrementally improve and revise them, adding complexity as you go.
  3. Demonstrate your progress just like you would any other software initiative, even if the results are underperforming .
  4. There is a tendency with data science engagements to have a somewhat irrational focus on just the numbers. Remember that the point of these initiatives is to drive to an outcome, so telling the complete story about how the work is leading toward the outcome is as important as the numbers that show it.
  5. Kickoff every major data-driven feature with robot empathy maps to help set guidelines about how non-deterministic, intelligent algorithms should work.
  6. Have frequent (at least daily) check-ins, desk checks, or collaboration sessions focused on the longer tasks to promote healthy ideation and to prevent frequently check-in (daily) on longer tasks to prevent those pesky rabbit-hole scenarios that can present themselves.
  7. Track experiments and knowledge in a common repository and share them as you go and certainly during retrospectives.
  8. Allocate time liberally based on the level of uncertainty of the task at hand. Uncertainty is natural with data science initiatives. Embrace it, consider the first seven guidelines, and plan accordingly.

Learn more at dialexa.com.

--

--

Dialexa, an IBM Company
back to the napkin

Digital product engineering firm working with today's most innovative companies to build game-changing products.