Big Data Analytics Pipeline using the Hadoop Ecosystem

Learn and implement the Hadoop Ecosystem to drive Big Data Analytics

Prathamesh Nimkar
1 min readMar 29, 2020
Big Data Analytics Pipeline

The above image is the pipeline for Big Data analytics using the Hadoop Ecosystem. Let’s learn about their architectures and build upon it using a practical real-life project in the aviation domain. If you’re new to Big Data, I would suggest to go through the below in order (by number).

Data Ingestion

Sqoop

Flume

Data Storage

❶ HDFS — Comprehensive Guide, HDFS Commands, HDFS Erasure Coding

❼ HBase

Data Processing

MapReduce

❽ Spark

Data Analysis

Pig

❻ Hive

Data Exploration

❾ Hue

Data Visualization

❿ Tableau

--

--