The State of the Data Infrastructure Landscape in 2022 and Beyond

Key trends to expect in the data infrastructure domain in 2022 and beyond.

The decade from 2010 to 2020


The decade from 2020 to 2030

The Modern Data Stack

The Modern Data Stack (MDS) is a radically new approach to data integration that saves engineering time, allowing engineers and analysts to pursue higher-value activities.

MDS components at a minimum

Analytics engineering and dbt

Analytics engineers provide clean data sets to end users, modeling data in a way that empowers end users to answer their questions. While data analysts spend their time analyzing data, analytics engineers spend their time transforming, testing, deploying, and documenting data. Analytics engineers apply software engineering best practices like version control and continuous integration to the analytics code base.

The analytics engineer sits between the data engineer and the data analyst.
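The transform-and-test workflow described above can be sketched in a few lines. This is a minimal illustration using Python's built-in sqlite3 module, not dbt itself: the table names (raw_orders, clean_orders), the columns, and the helpers check_not_null and check_unique are all hypothetical, chosen only to show the idea of building a clean model and then asserting properties of it.

```python
import sqlite3

def build_clean_orders(conn):
    """Transform raw rows into a clean, analysis-ready table (hypothetical model)."""
    conn.executescript("""
        DROP TABLE IF EXISTS clean_orders;
        CREATE TABLE clean_orders AS
        SELECT DISTINCT
            order_id,
            LOWER(TRIM(customer_email)) AS customer_email,
            amount_cents / 100.0 AS amount
        FROM raw_orders
        WHERE order_id IS NOT NULL;
    """)

def check_not_null(conn, table, column):
    """Data test: passes when no row has NULL in the column.
    Identifiers are interpolated directly, so pass trusted names only."""
    sql = f"SELECT COUNT(*) FROM {table} WHERE {column} IS NULL"
    return conn.execute(sql).fetchone()[0] == 0

def check_unique(conn, table, column):
    """Data test: passes when no value of the column appears more than once."""
    sql = (
        f"SELECT COUNT(*) FROM ("
        f"SELECT {column} FROM {table} GROUP BY {column} HAVING COUNT(*) > 1)"
    )
    return conn.execute(sql).fetchone()[0] == 0

# Illustrative raw data with the usual problems: padding, mixed case,
# a duplicate row, and a row with a missing key.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE raw_orders (order_id INTEGER, customer_email TEXT, amount_cents INTEGER)"
)
conn.executemany(
    "INSERT INTO raw_orders VALUES (?, ?, ?)",
    [
        (1, " Ann@Example.com ", 1250),
        (1, "ann@example.com", 1250),
        (2, "bob@example.com", 1000),
        (None, "ghost@example.com", 500),
    ],
)

build_clean_orders(conn)
tests_passed = (
    check_not_null(conn, "clean_orders", "order_id")
    and check_unique(conn, "clean_orders", "order_id")
)
```

Keeping the transformation and its tests in ordinary source files is what makes version control and continuous integration applicable: a CI job can rebuild the model and fail the build when a check returns False.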

Streaming databases/low-latency OLAP databases

Incrementally updated materialized view engines
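To make the idea concrete, here is a minimal sketch of incremental view maintenance, assuming the view is a per-key SUM and COUNT. The class IncrementalSumView and its methods are hypothetical and do not correspond to any particular engine; real systems handle joins, ordering, and persistence that this sketch ignores. The point is only that each change adjusts the stored result instead of triggering a full recomputation.

```python
from collections import defaultdict

class IncrementalSumView:
    """Maintains SUM(amount) and COUNT(*) per key as changes arrive."""

    def __init__(self):
        self._sum = defaultdict(int)
        self._count = defaultdict(int)

    def apply(self, key, amount, sign=1):
        """Apply one change: sign=+1 for an insert, sign=-1 for a delete."""
        self._sum[key] += sign * amount
        self._count[key] += sign
        if self._count[key] == 0:
            # No rows left for this key, so drop it from the view.
            self._sum.pop(key)
            self._count.pop(key)

    def read(self):
        """Return the current view contents as {key: (sum, count)}."""
        return {k: (self._sum[k], self._count[k]) for k in self._count}

def recompute_from_scratch(rows):
    """Reference implementation: rebuild the same view from all live rows."""
    result = {}
    for key, amount in rows:
        s, c = result.get(key, (0, 0))
        result[key] = (s + amount, c + 1)
    return result

view = IncrementalSumView()
changes = [("a", 10, 1), ("b", 5, 1), ("a", 7, 1), ("a", 10, -1), ("b", 5, -1)]
for key, amount, sign in changes:
    view.apply(key, amount, sign)
```

After these changes only the row ("a", 7) is still live, so the incrementally maintained view and a full recomputation over the live rows should agree.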

Real-time OLAP databases with scatter-gather query execution
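Scatter-gather execution can be sketched as follows: a coordinator sends the same query to every data segment in parallel (scatter), each segment returns a small partial aggregate, and the coordinator merges the partials into the final answer (gather). The segment layout, the function names, and the use of threads in place of separate server nodes are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor

# Each inner list stands in for a data segment held by a different node.
SEGMENTS = [
    [("us", 3), ("de", 5)],
    [("us", 7), ("fr", 2)],
    [("de", 1), ("us", 4)],
]

def scan_segment(segment, country):
    """Runs on one node: partial SUM and COUNT of clicks for one country."""
    total = 0
    count = 0
    for row_country, clicks in segment:
        if row_country == country:
            total += clicks
            count += 1
    return total, count

def scatter_gather(segments, country):
    """Coordinator: fan the query out, then merge the partial aggregates."""
    with ThreadPoolExecutor(max_workers=len(segments)) as pool:
        partials = list(pool.map(lambda seg: scan_segment(seg, country), segments))
    total = sum(p[0] for p in partials)
    count = sum(p[1] for p in partials)
    # AVG is derived from merged SUM and COUNT; averaging per-segment
    # averages would give the wrong answer when segments differ in size.
    avg = total / count if count else None
    return {"sum": total, "count": count, "avg": avg}

result = scatter_gather(SEGMENTS, "us")
```

Because every segment is scanned concurrently and only small partial results travel back to the coordinator, query latency is governed by the slowest segment rather than by the total data volume.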

Classification of streaming databases

Metadata management and data catalogs

A data catalog creates and maintains an inventory of data assets through the discovery, description, and organization of distributed datasets. The data catalog provides context to enable data stewards, data/business analysts, data engineers, data scientists, and other line of business (LOB) data consumers to find and understand relevant datasets to extract business value.
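As a rough sketch of what such an inventory might look like in code, the example below registers datasets with a description, owner, and tags, and then lets a consumer search across them. DatasetEntry, DataCatalog, and every dataset name and location shown are hypothetical; production catalogs typically also harvest metadata automatically and track lineage, which is out of scope here.

```python
from dataclasses import dataclass, field

@dataclass
class DatasetEntry:
    """Metadata describing one data asset (illustrative fields only)."""
    name: str
    location: str
    owner: str
    description: str = ""
    tags: set = field(default_factory=set)

class DataCatalog:
    """An in-memory inventory of dataset metadata."""

    def __init__(self):
        self._entries = {}

    def register(self, entry):
        """Add or update the entry for a dataset (discovery)."""
        self._entries[entry.name] = entry

    def search(self, term):
        """Return names of datasets whose name, description, or tags match."""
        term = term.lower()
        matches = []
        for entry in self._entries.values():
            haystack = [entry.name.lower(), entry.description.lower()]
            haystack.extend(tag.lower() for tag in entry.tags)
            if any(term in text for text in haystack):
                matches.append(entry.name)
        return sorted(matches)

catalog = DataCatalog()
catalog.register(DatasetEntry(
    name="orders_daily",
    location="warehouse.sales.orders_daily",
    owner="analytics-eng",
    description="One row per order, refreshed daily",
    tags={"sales", "finance"},
))
catalog.register(DatasetEntry(
    name="web_events",
    location="lake/events/web/",
    owner="platform",
    description="Raw clickstream events",
    tags={"marketing"},
))

sales_datasets = catalog.search("sales")
```

A search for "sales" finds orders_daily through its tag, even though the word does not appear in the dataset's name or description, which is the kind of context a catalog adds on top of raw storage.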

Data lakehouses and open data architecture


Data platform as a service (dPaaS)




