This Week in Data Preparation (November 30, 2020)

Nikolaos Konstantinou
The Data Value Factory
5 min readNov 30, 2020

This weekly post with news items from the data preparation market is brought to you by The Data Value Factory, the company offering Data Preparer.

14 links in this week’s post:

  • 6 articles (on data wrangling, automation, data warehouses, AI, and insights from data, by Trifacta, University of Chicago’s Center for Data and Computing, Metis, Coegil, DotData, Kinetica, Supahands, Cyft, Nuqleous, and Exasol),
  • 7 company updates (by Ascend.io, Calibo, Snowflake, Callahan, Big Blue, Confluent, Infinia ML, DXC, and ThoughtSpot), and
  • 1 capital raise announcement (by Hasty).
This week in data preparation — A weekly post by The Data Value Factory, with news items from the data preparation market.
The Data Value Factory — This Week in Data Preparation. November 2020 Image by Pete Linforth from Pixabay.

Articles

4 Pros Share Their Go-To Data Wrangling Techniques. Connor Carreras, Director of adoption and enablement at Trifacta, Daniel Grzenda, Data scientist at the University of Chicago’s Center for Data and Computing, Javed Ahmed, Senior data scientist at Metis, and Michael Guadarrama, Founder and CEO at Coegil, share tips to keep in mind — and common pitfalls to avoid — when wrangling messy source data into model-ready shape.

Automation: A data scientist’s new best friend? Founder and CEO of DotData, Ryohei Fujimaki, explains how automation can help the data science industry become more efficient.

Thank Your Data Engineers With A Streaming Data Warehouse. “The streaming data warehouse simplifies the data engineering experience by fusing all the necessary capabilities for modern analytical applications into a single solution.” writes Andrew Wooler, Global Marketing Manager at Kinetica, in this article for Forbes.

In the glamorous new AI world, it pays to do the tedious work: Startup Stories. “Looking at the machine learning process as a whole, data labeling is an important part of the puzzle and simultaneously one of the most time-consuming and laborious tasks to be done when developing a model,” said Supahands co-founder and COO Susian Yeap.

Debunking Top 4 Myths of Artificial Intelligence. Leonard D’Avolio, co-founder of Cyft and an assistant professor at Harvard Medical School and Brigham & Women’s Hospital, pinpointed various misconceptions around AI. He argued that it’s not a magic bullet and needs discernment for desired results. “It’s a tool, not a sentient being,” he said.

87 Percent of US Retailers Race to Achieve Faster Data-Driven Insights to Support Online Sales. “To succeed in retail today you have to have your pulse on local consumer preferences and demand curve changes, be able to respond to shifts in distribution channels, optimize your supply chain, and collaborate with your partners,” said Paul Sims, co-founder and CEO, Nuqleous. “The pandemic has shown that retailers that adapt and scale their business based on real-time data insights will continue to thrive,” said Rishi Diwan, chief product officer (CPO) at Exasol.

Company Updates

Ascend.io grows program for partners building data pipelines. Ascend.io CEO and founder Sean Knapp said the company signed up several enterprise architecture advisory and data consultancy partners to its channel roster.

Calibo LLC Launches, Unveils Integrated Business Digital Platform to Accelerate Digital Business. Raj Vattikuti, Executive Chairman of Altimetrik, founded and will serve as CEO of Calibo, and the two companies will work hand-in-glove.

Snowflake announces updates geared towards data mobilisation. Snowflake cofounder and president of products Benoit Dageville says, “Data is central to how we run our lives, businesses, and institutions. Many of today’s organisations still struggle to mobilise all of their data in service of their enterprise.” Snowflake senior vice president of product Christian Kleinerman says, “Snowflake’s platform enables organisations to leverage the power of the Data Cloud regardless of which supported public cloud they use, or where an organisation’s data or users are located.”

How Callahan Improved Media Impact by 90% By Automating its Cloud Data Warehouse. Zack Pike, VP Data Strategy & Marketing Analytics at Callahan, explains that his team uses Fivetran to ingest marketing data like Facebook Ads, Google Analytics or Marketo, while Cloud Dataprep by Trifacta is perfect for messier data.

Big Blue Taps Into Streaming Data with Confluent Connection. “In every single business, there is a constant stream of real-time events occurring unnoticed,” writes Savio Rodrigues, the vice president of application platform and integration OM for IBM, in a blog post. Confluent CEO Jay Kreps says the partnership lays the groundwork for helping enterprise to build real-time applications that make use of all of their data, even if the systems weren’t designed for event streaming.

Machine learning startup Infinia ML lands big partner for cloud project. “Infinia ML has a proven ability to bring machine learning out of the lab and into the real world, and its capabilities align well with the ‘new DXC’, which is focused on our customers and our people,” said Vinod Bagal, executive vice president, DXC in a statement. “With its deep expertise, customer-focused offerings, and global scale, DXC has the ability to reshape entire industries through applied machine learning,” said Infinia ML’s Chief Scientist Larry Carin.

ThoughtSpot democratizes data access, bringing fast insights for fast action. “The thing that stands in the way of delivering value for customers is almost always not technology, not product, and not even quality of data,” said Sudheesh Nair, chief executive officer of ThoughtSpot Inc. “It is lack of courage, lack of vision, lack of ability to empathize with your customers and truly see what can we do to make their lives better where data-driven insights might be a part of it. “

Capital Raise Announcements

Berlin-based AI startup Hasty raises $3.7 million to help label computer vision data. “There are over 750,000 machine learning practitioners today who are working on vision AI topics and spending the majority of their time managing data rather than building and tuning neural networks,” said Tristan Rouillard, co-founder, and CEO of Hasty. “This represents a $30 billion dollar waste that we aim to tackle head-on.”

The Data Value Factory. A week’s worth of manual data preparation in minutes.
A week’s worth of manual data preparation in minutes.

Thank you for reading our weekly post with news items from the data preparation market. Have you tried Data Preparer?

--

--