This Week in Data Preparation (August 31, 2020)

Nikolaos Konstantinou
The Data Value Factory
4 min readAug 31, 2020

This weekly post with news items from the data preparation market is brought to you by The Data Value Factory, the company offering Data Preparer.

14 links in this week’s post: 1 tutorial on data preparation, 1 survey report (on DataOps, by 451 Research), 7 articles (on data science teams, mergers and acquisitions, data warehousing, data products, etc., by Deloitte, Exonar, Pyramart, Diyotta, General Motors, and Hitachi ABB Power Grids), 2 company announcements (by Mayo Clinic, and CData Software), and 3 capital raise announcements (by Narrative, Dataiku, and Grid).

This week in data preparation — A weekly post by The Data Value Factory, with news items from the data preparation market.
The Data Value Factory — This Week in Data Preparation. August 2020 Image by Gerd Altmann from Pixabay.

An Extensive Step By Step Guide for Data Preparation. Exploratory data analysis, data preprocessing, data splitting and more, by Terence Shin.

Maximize DataOps ROI With Three Key Technologies. The 451 Research study confirmed that processes and technology will evolve across the board, but there are some buckets that will see more investment — at least at the beginning of the journey. According to respondents, the top three buckets for investment include: analytics and self-service data access (40%), data virtualization (37%) and data preparation (32%). And data preparation — one of the practices that few people think about — impacts every person in an organization, says Madhup Mishra, Hitachi Vantara’s Director of Product Marketing. “We see that engineers can spend more than 50% of their time simply preparing data. It’s not clean. It’s got gaps. It needs to be tagged. You have to prepare data before you can start to use it successfully,” Mishra explains.

What makes up a strong data science team structure? “Instead of saying, ‘Let’s just get the data scientists, and let’s just build a data team,’ it has to tie to business challenge, [such as] process optimization, cost savings, new product lines or what your competitors are doing,” said Beena Ammanath, executive director of the Deloitte AI Institute.

The hidden dangers of data in the M&A process. Gareth Tranter, head of customer success at Exonar, discusses the hidden dangers that data can bring to the mergers and acquisitions (M&A) process.

Automated data warehousing divides data control between IT and business groups. “When it comes to data usage, smart public- and private-sector organizations are taking a cue from the nations’ founding fathers and structuring hierarchy, control and autonomy in their data’s governance.” says Dan Bruns, founder and president of Pyramart.

When In Doubt, Follow Your Data And Build Your Data Kitchen. “Believe it or not, there are many correlations between cooking your perfect recipe and getting that perfect analytical dataset that drives actions.”, says Sanjay Vyas, CEO and Co-founder at Diyotta, a modern data integration company.

Chief Digital Officer vs. Chief Data Officer: Defining the Scope for Digital. “In the end, we need our chief data officers to focus on data fueled apps and data products. And whether it is a chief digital officer or CMO or other chief officer, they should focus on the number of components needed to cross the digital divide. Making data great and analytics work is a big agenda all by itself.” says Myles Suer, the 9th leading influencer of CIOs, according to LeadTail.

A Fresh Look At Data Management. Robert Welborn is the Director, Data & Data Science for General Motors. As he sees it, global corporate enterprise has- for the most part, seen data as a luxury, “we’ll get to reporting when we get to reporting we’ll we will, we’ll upsell, we’ll monetize.”

How to Truly Leverage Data for Asset Performance Management. In this special guest feature, Bryan Friehauf, EVP and GM of Enterprise Software Solutions at Hitachi ABB Power Grids, discusses how to leverage data for Asset Performance Management (APM).

Mayo Clinic leans on Google Cloud for AI success. Mayo Clinic CIO Cris Ross said the Google Cloud partnership was critical to its AI success.

CData Software Appoints New Chief Revenue Officer (CRO), Hugh Raiford. “As we continue to expand our product portfolio and support a rapidly growing roster of enterprise customers, we are dedicated to delivering the highest level of service through every customer interaction,” said Amit Sharma, CEO of CData Software. Hugh Raiford said, “Combining unrivaled data source integration with exceptional engineering and performance, CData has built an impressive portfolio of data connectivity solutions and components with unparalleled depth and breadth in the market.”

Narrative Wins Funds, Launches Data Marketplace. Data acquisition and distribution software company Narrative has raised $8.5m in Series A funding, to support the launch of ‘a new category: Data Streaming’, replacing the data broker industry model. As part of this the firm has launched a Data Streams Marketplace. The round was led by G20 Ventures, with additional funding from existing investors Glasswing Ventures, NathCapital, Revel Partners, Tuhave Venture Partners and XSeed Capital. Founder and CEO Nick Jordan comments: “We are excited to have the backing and expertise from G20 Ventures, and all of our investors as we scale up the business.”

AI development unicorn Dataiku picks up $100M in fresh funding. The investment was led by the growth equity firm Stripes, and included Alphabet Inc.’s CapitalG startup investment arm, Battery Ventures, Iconiq Capital and others.

Analytics startup Grid raises $12 million in funding. Grid, founded in 2018 and based in Reykjavik, Iceland, previously raised $3.5 million in seed funding in March 2019 and this latest round represents its Series A financing, led by New Enterprise Associates. BlueYard Capital, Slack Fund and Acequia Capital are among the other investors. The new financing will enable Grid to move out of the open beta testing stage and go to market within the next few months, according to founder and CEO Hjalmar Gislason.

The Data Value Factory. A week’s worth of manual data preparation in minutes.
A week’s worth of manual data preparation in minutes.

Thank you for reading our weekly post with news items from the data preparation market. Have you tried Data Preparer?

--

--