What does big data look like?

Michael Moreno
Cloudera
Published in
5 min readNov 7, 2016

“Big data is not easy.” We’ve all heard this time and time again yet organizations are racing for data driven solutions to help with cyber security, recommendation engines, customer 360, genome sequencing, IoT, and many other big data use cases. Yet when you engage with your IT support or data analyst team, you keep hearing how under-resourced they are to support your projects. What if there are pre-existing solutions that might work today? What if you are able to analyze data with existing software products that integrate with Cloudera Enterprise? What if you could set up visualizations to an existing Apache Impala (incubating) data store in under 10 minutes?

Over the past quarter, we have posted a number of partner demos on the Cloudera YouTube channel that illustrate different ways to analyze, ingest, and process different forms of data. On top of the video demos themselves, some even have links to hands-on demo environments that can provide you a test drive of their software without having to purchase or download anything.

Here is a summary of these demos, so you can find a solution that fits your big data needs.

Zoomdata Demo — Data Sharpening with Apache Impala (incubating)

This demo shows how to do data sharpening by using Zoomdata with Apache Impala (incubating). Learn how to sort, pivot, and get complete visualizations of data captured in Impala.

Check out the hands-on test drive here.

Syncsort Demo — Mainframe Data Access with Hadoop

This demo shows you how to migrate data from mainframes into Cloudera Enterprise. Syncsort DMX can be particularly useful for end users who aren’t familiar with mainframe systems but need to get valuable data into HDFS to run analytics.

Trifacta Demo — Customer Behavior Use Case

In this demo, you will learn how to use Trifacta to wrangle data from different data sources and create automated pipelines for future data preparation. You will be presented with a use case where retailers are trying to understand how the weather affects retail sales. Check out the hands-on test drive here.

Paxata Demo — Retail Solution Use Case

This demo highlights how Paxata is able to help users quickly gain faster insights from their data in Apache Hadoop. It compares customer data from purchases, loyal customer demographics, and external survey data so marketing can pick the right social media channel to reach interested customers. Check out the hands-on test drive here.

Cask Demo — Using CDAP with Cloudera for Data Flows, Ingestion, and Governance

This demo shows how to manage different data pipelines using Cask CDAP and Cloudera Manager. This is a great solution for organizations looking to have greater control of their data ingestions process. Check out the hands-on test drive here.

Informatica Demo — Create Better Upsell and Cross-sell Initiatives with Customer Prospects

This demo shows how you can use Informatica Big Data Manager, using Cloudera Navigator, to get valuable customer insights that allow marketing organizations to improve up-sell and cross-sell opportunities.

Qlik Demo — Using Big Data Analytics to Monitor Zika

This demo, developed by Bardess Group, shows how people can use big data analytics to monitor the spread of Zika virus with Qlik and Cloudera. The use case is a combination of near real-time analytics with visualizations that can allow healthcare, travel, and government organizations to make critical, life saving, decisions.

StreamSets Demo — Connected Car with StreamSets Data Collector

This demo highlights how you can use the StreamSets Data Collector for building big data ingest pipelines with Apache Hadoop. The use case presents a connected car simulation that shows, in real-time, traffic issues that could be avoided by drivers. Check out the hands-on test drive here.

Paxata Demo — Preparing Data for Health Care Cost Analytics

This demo shows how you can use Cloudera and Paxata to integrate and cleanse disparate insurance claim data to uncover potential savings for payers and patients.

Pentaho Demo — Transactional Fraud Detection

In this demo you will learn how to use Pentaho Data Integration with Cloudera Enterprise to address transaction fraud at a financial services organization.

Arcadia Data Demo — Security with Role-Based Access Control

Learn how Arcadia Data’s role-based access control provides security to Apache Hadoop by using Sentry with Cloudera Manager. This example can help BI systems avoid fragmented role definitions and policies.

For more information about Cloudera software partners, check out the partner solution page.

--

--

Michael Moreno
Cloudera

Father / Husband / Marketing Leader / Interests — Marketing Automation, AI, Customer Experience / Hobbies - Guitar, Surfing, Cycling / Opinions are mine