This Week in Data Preparation (Feb 7, 2020)

Nikolaos Konstantinou
The Data Value Factory
Feb 7, 2020

This has been an exciting week here at The Data Value Factory, as we announced the release of Data Preparer, the first hands-off data wrangling software product to market! Hooray!

In a nutshell, this blog post features: three new software releases, the launch of an ML & AI community by Dataiku, four discussions with data industry thought leaders, the key takeaways from a Gartner report on data integration, three announcements by Dresner, Stardog and Qlik, respectively, and finally, an opinion article on digital finance transformation.

You will also notice that the summary card image changed to what will be this February’s card image; a new one will be selected monthly.

Image by Markus Spiske from Pixabay

This week, we at The Data Value Factory officially launched Data Preparer: the first hands-off data wrangling software product to market. Prof. Norman Paton, co-founder at The Data Value Factory, explains: “The emphasis is on refining the description of what is needed, rather than on pinning down how it should be produced. The system will use all available evidence to automatically clean and integrate the data, minimising human intervention and enabling data preparation at scale”. The company will be very happy to work with early adopters to explore how the approach can be applied in different applications. Free trial and paid versions of the software are available for download.

erwin, Inc. announced the availability of a new version of the erwin Data Intelligence Suite (erwin DI). erwin DI synchronizes data management and data governance processes in an automated flow so data assets are stored in a central data catalog and then made accessible and understandable within a business context via role-based views. “The erwin DI Suite harmonizes IT-focused data management with business-led data governance so every stakeholder has access to relevant data to do their jobs,” explains Adam Famularo, erwin’s CEO.

LabKey releases sample management software with data integration and workflow features. The LabKey Sample Manager software addresses common challenges in laboratory sample management, including creating samples, discerning sample lineage, assigning and tracking sample processing tasks through workflow features, and unifying samples with their related experiment data through data integration features. “Having participated in the Sample Manager Product Advisory Council and seeing the development of the application from idea to fruition, Sample Manager has been thoughtfully developed and has the potential to positively impact the collaboration, efficiency and throughput of many laboratories”, says Mike MacCoss, Professor of Genome Sciences at the University of Washington.

Dataiku announces the launch of the Dataiku community to bring together professionals in data science, machine learning and AI. “At Dataiku, our mission is to democratize data science and to create human-centric Enterprise AI solutions that power the future of business,” said Florian Douetteau, CEO at Dataiku.

In this Q&A article, HULFT’s CTO, Rin Nagaike, discusses information overload, data-workflow orchestration and speeding up the creation of a bill of materials.

In this special guest feature in the insideHPC blog, Dave Fellinger, Data Management Technologist at the iRODS Consortium, writes about the secure federation capabilities of iRODS, and the change in the way that we think of data locality.

This ZDNet article covers Getting DataOps Right, the latest report by Andy Palmer and a team of co-authors, published by O’Reilly. The report recommends 5 steps to building a healthy DataOps ecosystem that paves the way to a data-driven enterprise.

Ibrahim Surani, the CEO of Astera Software, a data management software provider founded in 1990 that caters to several Fortune 500 companies, discusses why data access and integration are key to enterprise success.

Key takeaways from Gartner’s report Critical Capabilities for Data Integration Tools 2019. The study highlights 16 vendors Gartner considers most significant in this software sector and evaluates them against 10 critical capabilities and 6 use cases prevalent in the space (optimized analytics, master data management, data consistency between operational applications, inter-enterprise data acquisition, data services orchestration, and data migration and consolidation). The editors at the Solutions Review magazine have read the report, available here, and pulled out three key takeaways.

Dresner Advisory Services announced the winners of its 2019 Technology Innovation Awards. Topics in the 2019 Wisdom of Crowds thematic research include Big Data Analytics, Cloud Computing + BI, Data Catalog, Data Preparation, Data Science + Machine Learning, Embedded BI, IoT Intelligence®, Location Intelligence, Sales Planning, and Self-Service BI. Each report examines current deployment trends, user intentions, and industry capabilities.

Stardog, the Enterprise Knowledge Graph platform, joins the Cloud Information Model (CIM) consortium, which is developing open standards for data exchange to simplify data integration and accelerate innovation.

Qlik announced the rebranding of Attunity’s products under the Qlik brand, following its acquisition of the data integration provider in February 2019. The merger has enabled Qlik to leverage Attunity’s partner network to expand its presence in the data management software category.

This blog post is the second in a series of three blog posts that outlines how a digital finance transformation enables efficient operations, smart data exploitation, strategic cloud usage, seamless user experience and an expanded role for finance. The blog post focuses on the benefits of efficient operations and smart data exploitation.
