Vu Trinh – Medium

Vu Trinh

Pinned

Vu Trinh
in
The Deep Hub

Procella — The query engine at YouTube

Everything at once

Jun 29

Procella — The query engine at YouTube

Jun 29

Pinned

Vu Trinh
in
Data Engineer Things

The Architecture of Apache Druid

When Hadoop can solve every problem

Jun 15

The Architecture of Apache Druid

Jun 15

Pinned

Vu Trinh
in
The Deep Hub

How does LinkedIn process 4 Trillion Events every day?

Key insights on how LinkedIn leverages Apache Beam for real-time processing

Jun 10

How does LinkedIn process 4 Trillion Events every day?

Jun 10

Pinned

Vu Trinh
in
The Deep Hub

All you need to know about the Google File System

How did Google build its large-scale file system?

May 12

All you need to know about the Google File System

May 12

Pinned

Vu Trinh
in
Data Engineer Things

How does Uber build real-time infrastructure to handle petabytes of data every day?

All insights from the paper: Real-time data infrastructure at Uber

Mar 23

How does Uber build real-time infrastructure to handle petabytes of data every day?

Mar 23

Vu Trinh
in
Data Engineer Things

Apache Kafka — Producer

The clients who write

3d ago

Apache Kafka — Producer

3d ago

Vu Trinh
in
Data Engineer Things

Apache Kafka — Important Designs

Filesystem, Zero-copy, and Batching

Jul 13

Apache Kafka — Important Designs

Jul 13

Vu Trinh
in
Data Engineer Things

Apache Kafka — Overview

The terminology and the architecture.

Jul 6

Apache Kafka — Overview

Jul 6

Vu Trinh
in
Data Engineer Things

How does Uber handle petabytes of Spark shuffle data every day?

The Remote External Service (RSS)

Jun 22

How does Uber handle petabytes of Spark shuffle data every day?

Jun 22

Vu Trinh
in
Data Engineer Things

Everything you need to know about MapReduce

All the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from Google

Jun 1

Everything you need to know about MapReduce

Jun 1

Vu Trinh

Vu Trinh

🚀 My newsletter vutr.substack.com 🚀 Subscribe for weekly writing, mainly about OLAP databases and other data engineering topics.

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams