The Modern Data Stack: An Overview
Published in
1 min readNov 15, 2021
This week, the team at Inclined is leading a training on the components of the modern data stack, based upon the following infographic:
While I don’t have time in this blog post to describe all of these tools in detail, I wanted to share some of the technologies we’re exited about in this space:
ETL (Extract, Transform, Load) Tools:
- Segment
- Stitch
- FiveTran
- AirByte (open source)
- Apache Airflow (open source)
Data Warehouses, Lakes, & Lakehouses:
- Amazon RedShift
- Google BigQuery
- Snowflake
- Panoply
- Delta Lake on Databricks
- Apache Hive
Graph Databases & Analysis
Customer Data Platforms:
- Segment Personas
- mParticle
- RudderStack (open source)
Data Transformation Tools:
- Data Build Tools (DBT)
Business Intelligence Tools:
Data Catalog & Event Discovery, Documentation, & Governance Tools
More on all these tools (and others!) soon. What are you getting excited about in this space?