This Week in Data Preparation (Feb 14, 2020)

Nikolaos Konstantinou
The Data Value Factory
3 min read · Feb 14, 2020

In this week’s blog post: (i) a report by Cognilytica, (ii) a survey from Okta, (iii) an announcement from Qlik, (iv) an interview with a Gartner VP, (v) a note on the technical side of data preparation, and (vi) a list of the 13 top “Integration Platform as a Service” vendors for 2020.

Image by Markus Spiske from Pixabay

In a report, Cognilytica estimates, among other things, that (a) the market for data preparation solutions relevant to AI and machine learning was over $1.5B in 2019 and will grow to $3.5B by the end of 2024, and (b) data preparation and engineering tasks account for over 80% of the time consumed in most AI and machine learning projects.

A new survey by Okta reveals that data management tools have rocketed to the top of the applications most used by businesses. The fastest-growing application among Okta’s customers is Snowflake: the cloud data warehousing platform soared 273 percent year-over-year in 2019. Ming Wu, VP of data and analytics at Okta, said: “The rapid rise of Snowflake, along with the acquisitions of Looker and Tableau, highlights the growing popularity and importance of data applications in the enterprise. Artificial intelligence and machine learning have helped drive this growth, but we’re also seeing trends in data cataloguing, sharing, democratisation and compliance.”

Qlik announces an expanded partnership with Snowflake. “Our customers want to accelerate their modernization efforts by utilizing highly performant and robust solutions to replicate data into Snowflake,” said Colleen Kapase, Snowflake VP of WW Partners and Alliances. “With Qlik’s real-time data integration capabilities, customers will realize an immediate benefit to easily bringing that data directly into Snowflake. We are excited about Qlik joining our partner connect program, bringing new capabilities for customers to modernize to Snowflake.”

In this article, Tech Wire Asia catches up with distinguished speaker and Gartner Managing VP Ian Bertram to learn about some of the themes expected to be discussed at the upcoming 2020 Gartner Data & Analytics Summit. According to Bertram, most organizations hope to lean on natural language processing to augment their AI and ML algorithms and to take on most of the heavy lifting in data preparation.

This article discusses the difference between data cleansing and data enrichment. ThinkDataWorks’ Nikko Torres comments: “Think of data cleansing like the work of a prep cook: separate the good from the bad; wash off the dirt; measure; cut to size; then put the pieces in labeled containers. Data enrichment, on the other hand, is the chef’s touch: combining different ingredients to create a dish that’s greater than the sum of its individual components”.
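To make the prep cook versus chef distinction concrete, here is a minimal Python sketch. It is not taken from the article: the order records, the postcode-to-region lookup, and the function names `cleanse` and `enrich` are all hypothetical and invented for illustration. Cleansing only tidies and filters the data that is already there, while enrichment combines it with a second source.

```python
# Hypothetical records and lookup table, invented purely for illustration.
raw_orders = [
    {"customer": "  Alice ", "postcode": "m1 1aa", "amount": "12.50"},
    {"customer": "Bob", "postcode": "M2 2BB", "amount": "n/a"},  # unusable amount
    {"customer": "  Alice ", "postcode": "m1 1aa", "amount": "12.50"},  # duplicate
]
region_by_postcode = {"M1 1AA": "Manchester Central", "M2 2BB": "Manchester North"}


def cleanse(records):
    """Prep-cook work: trim, normalise, drop unusable rows and duplicates."""
    seen = set()
    cleaned = []
    for record in records:
        try:
            amount = float(record["amount"])  # "measure"
        except ValueError:
            continue  # "separate the good from the bad"
        row = (
            record["customer"].strip(),          # "wash off the dirt"
            record["postcode"].strip().upper(),  # "cut to size"
            amount,
        )
        if row in seen:
            continue  # skip exact duplicates
        seen.add(row)
        # "put the pieces in labeled containers"
        cleaned.append({"customer": row[0], "postcode": row[1], "amount": row[2]})
    return cleaned


def enrich(records, lookup):
    """Chef's touch: combine the cleaned rows with a second data source."""
    return [{**record, "region": lookup.get(record["postcode"])} for record in records]


cleaned = cleanse(raw_orders)
enriched = enrich(cleaned, region_by_postcode)
print(enriched)
```

In this sketch the cleansing step never adds information; it only removes or standardises what is present. The enrichment step is the one that adds a new attribute (`region`) that no single input record contained.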

In this article, the Solutions Review editors list the 13 top “Integration Platform as a Service” vendors for 2020, via a meta-analysis of real user sentiment through the web’s most trusted business software review sites.

Interested in “hands-off” data preparation? Try Data Preparer for free, schedule a discussion, follow on Twitter, or reach out by email.
