This Week in Data Preparation (October 19, 2020)

Nikolaos Konstantinou
The Data Value Factory
6 min readOct 19, 2020

This weekly post with news items from the data preparation market is brought to you by The Data Value Factory, the company offering Data Preparer.

14 links in this week’s post: 4 articles (on data science, data analytics, data wrangling and reproducibility, data management, and data literacy, by ACIA, Trifacta, IHS Markit, and Rivery), 2 company updates (by Isima, and InsideView), 5 partnership announcements (by KNIME and H2O.ai, Boomi and Aible, BigPanda and Datadog, Ascend.io and Qubole, Alation and Dataiku), 2 acquisitions (Idera acquires Qubole, Focus Technology acquires IntegritD), and 1 capital raise announcement (by Datavant).

This week in data preparation — A weekly post by The Data Value Factory, with news items from the data preparation market.
The Data Value Factory — This Week in Data Preparation. October 2020 Image by Tumisu from Pixabay.

Articles

Data scientist versus data analyst: What’s the difference? Siliconrepublic.com spoke to two employees — senior data analyst Beatrice Russell and data scientist Dylan Butler — at Aon’s Centre for Innovation and Analytics (ACIA). Both Russell and Butler draw on their data expertise to inform software solutions for the insurance industry, but they spend their days on different types of work.

Joe Hellerstein on Data Wrangling and Reproducibility. It is well known that machine learning is powered by data. Unfortunately, the raw data that we would like to use to train models is often created and stored in such a way that it is not machine consumable. As part of determined.ai podcast series, Craig and Ameet recently had a conversation with Joe Hellerstein, a computer science professor at UC Berkeley, a leading researcher in the databases community, and co-founder of the data wrangling company Trifacta.

Digitalization: Understanding the Role of Data Management. “The wave of digitalization has been building within the maritime and shipping industry for a number of years. Now, with the COVID-19 pandemic underlining just how important data and technology are for understanding supply chain transparency and predictability, it is clear digitalization is a priority for those who have not previously focused on it.” said Jamie Penniman, Vice President, Head of Business, EDM at IHS Markit.

Why Data Literacy Should Be Taught In High Schools. Itamar Ben Hemo is CEO and Co-Founder of Rivery, a platform that empowers businesses to unlock the possibilities hidden within their data.

Company Updates

Can Isima Be the Nutanix of Data Management? “At the end of day, we’re still in the trough of disillusionment around machine learning, deep learning, because it’s so damn expensive and time consuming and the quality of the people you need to get the data organized and right and curated and clean [is so high],” says Darshan Rawal, CEO and co-founder of Isima. “We’re automating that piece. ….Give us one Python-trained engineer and two-to-three weeks, and we can get you into production. Full stop.”

InsideView cleans up data update. InsideView has revealed that its Sales Intelligence platform has completed over 4 billion API transactions using its open API integration platform. Heidi Tucker, VP Global Alliances at InsideView, commented: “Data brings life and context to B2B applications, enhancing and enriching functionality with real companies and contacts.

Partnership Announcements

KNIME and H2O.ai Accelerate and Simplify End-to-end Data Science Automation. “We have been using KNIME and H2O Driverless AI for years, and we are very excited about this new integration and the automation and simplification that it will bring to our data science workflow,” said Alejandro Lopez, data science leader of Vision Banco. “H2O Driverless AI users can now get an integrated data access and preparation platform with KNIME. This allows seamless operationalization and continuous learning demanded by our customers adapting at the speed of change today,” said Sri Ambati, CEO and founder of H2O.ai. “The integration of Driverless AI offers KNIME users a strong, additional option to automate machine learning out of the box with a huge range of powerful algorithms. We believe that flexibility of choice brings most value to our users and customers, and H2O is a great addition to the mix,” said Michael Berthold, CEO and co-founder of KNIME.

Boomi Partners with Aible to Equip Business Users with AI Insights. “AI is critical to the future of business and its effectiveness depends on having quality data to power intelligent insights. Together with Aible, Boomi will empower customers to unlock the value of their data without having to write a single line of code,” said Ed Macosky, head of product at Boomi. “This partnership with Boomi is expected to equip users with the insights they need to optimize sales, maximize profits, combat resource constraints, and drive smarter business decisions,” said Arijit Sengupta, Founder and CEO at Aible.

BigPanda and Datadog Form Partnership for Integration and Go-to-Market. “Event correlation and automation is now simpler on Datadog’s platform,” said Ilan Rabinovitch, Vice President for Product and Community, Datadog. “There is great synergy between BigPanda and Datadog because we are both modern platforms built for the modern enterprise and the technology stacks that go along with that,” said Elik Eizenberg, co-founder and Chief Technology Officer for BigPanda.

Ascend.io and Qubole Partnership Enables Data Pipelines with 95% Less Code. “The partnership between Ascend.io and Qubole will serve as a game changer for customers as they look to unlock their next level of data maturity. Customers will receive an end-to-end solution to simplify data engineering and reduce the time it takes to extract valuable business insights from real-time, large-scale analytics,” said Mike Leone, senior analyst at Enterprise Strategy Group. “Until now, data lake ETL required months of dedicated data engineering resources to build and deploy data pipelines. Even then, scarce data engineering talent was bogged down maintaining brittle code, managing changes in data payloads, and tuning pipelines,” said Sean Knapp, CEO and founder of Ascend.io. “In response to the tremendous increase in demand for machine learning, streaming analytics, data exploration, and more, data pipelines have emerged as the standard technique for data movement in and out of the data lake,” said Dave Lassiter, VP global cloud partnerships and alliances at Qubole.

Alation Partners With Dataiku to Accelerate and Democratize Data-Driven Insights. “We are constantly identifying ways to help our customers reduce the complexity they experience working with data. Data scientists spend the majority of their time searching for data and data science models, and we increase their productivity by rapidly connecting them with the data they need,” said Kiran Narsu, Vice President, Business Development, Alation. “We are excited to partner with Alation, as this marks the first industry partnership that allows data scientists to catalog and govern models directly within a machine learning platform,” said Florian Douetteau, CEO, Dataiku. “One of our strategic goals is to better understand the classification of analytics, similar to how we understand our core control data, and the partnership with Alation and Dataiku enables us to do just that,” said Jon Tudor, Director, Data & Analytics, GE Aviation.

Acquisitions

Qubole is Latest Acquisition Target. “Companies generate and store both structured and unstructured data at unprecedented levels,” Idera CEO Randy Jacops said in announcing the acquisition. “Qubole’s reputation as the leading cross-platform solution focused on unstructured data is a fantastic addition to Idera’s Database Tools division.

Focus Technology Acquires Data Engineering Firm IntegritD. “Data is at the center of digital modernization, and this acquisition builds on our ability to expertly help our customers store, secure and manage data across their data centers and in the cloud,” said Doug Alexander, CEO, Focus Technology. “We are eager to combine our deep cloud and data engineering expertise with the senior-level engineering expertise of the Focus team to more profoundly help customers overcome obstacles and uncover valuable opportunities through data,” said Steve DiPietro, co-founder, IntegritD.

Capital Raise Announcement

Datavant brings in $40M to power health data exchange across providers, life sciences. “Datavant’s mission is to connect the world’s health data to improve patient outcomes. The fragmentation of health data across institutions holds back every part of medical research and patient care,” Travis May, founder and CEO of Datavant, said in a statement.

The Data Value Factory. A week’s worth of manual data preparation in minutes.
A week’s worth of manual data preparation in minutes.

Thank you for reading our weekly post with news items from the data preparation market. Have you tried Data Preparer?

--

--