Data Engineering Digest #12 (May 2020)

Maycon Viana Bordin
data.plumbers
Published in
17 min readJun 19, 2020
Photo by Simon Migaj from Pexels

New Tools

Spark 3.0

Data Engineering Role

Courses & Training

Podcasts & Presentations

Real Data Architectures

Data Culture

Data Lake

Data Governance

DataOps

Data Formats

Delta Lake

Apache Avro

Apache Parquet

Data Pipelines

Data Quality Tools

Data Processing

Apache Spark

Apache Hive

Apache Hadoop

Presto

Apache Drill

Apache Sqoop

Stream Processing

Apache Flink

Apache Spark

Apache Flume

Clustering & Resources

Change Data Capture

Debezium

Storage

Apache HDFS

Messaging

Apache Kafka

Apache Pulsar

Workflow Management

Apache Airflow

Machine Learning Workflow

Cloud Providers

AWS

Google Cloud

Azure

Databases

NoSQL

In-Memory & Data Grid

Relational

Modern Data Warehouses

--

--