Open in app

Sign in

Medium Logo
Write

Sign in

Mastodon
Vu Trinh
Vu Trinh

27K followers

Home

Lists

About

Pinned
Data Engineer Things

Published in

Data Engineer Things

Deep dive into the challenges of building Kafka on top of S3.

It’s really tough

May 8
2
Deep dive into the challenges of building Kafka on top of S3.
Deep dive into the challenges of building Kafka on top of S3.
May 8
2
Pinned
Data Engineer Things

Published in

Data Engineer Things

Bufstream: Stream Kafka Messages to Iceberg Tables in Minutes

8x cheaper than Kafka + native support for data quality and seamless transformation of Kafka topics into Iceberg tables.

Mar 27
Bufstream: Stream Kafka Messages to Iceberg Tables in Minutes
Bufstream: Stream Kafka Messages to Iceberg Tables in Minutes
Mar 27
Pinned
Data Engineer Things

Published in

Data Engineer Things

Bauplan: Operate your lakehouse with zero infrastructure

FaaS data pipelines on S3

Mar 20
Bauplan: Operate your lakehouse with zero infrastructure
Bauplan: Operate your lakehouse with zero infrastructure
Mar 20
Pinned
Data Engineer Things

Published in

Data Engineer Things

I spent 8 hours learning Parquet. Here’s what I discovered

I finally sat down and learned about it.

Aug 24, 2024
23
I spent 8 hours learning Parquet. Here’s what I discovered
I spent 8 hours learning Parquet. Here’s what I discovered
Aug 24, 2024
23
Pinned
Data Engineer Things

Published in

Data Engineer Things

How does Uber build real-time infrastructure to handle petabytes of data every day?

All insights from the paper: Real-time data infrastructure at Uber

Mar 23, 2024
21
How does Uber build real-time infrastructure to handle petabytes of data every day?
How does Uber build real-time infrastructure to handle petabytes of data every day?
Mar 23, 2024
21
Data Engineer Things

Published in

Data Engineer Things

How is Databricks’ Spark different from Open-Source Spark?

Why don’t they just use the open-sourced Apache Spark?

3d ago
How is Databricks’ Spark different from Open-Source Spark?
How is Databricks’ Spark different from Open-Source Spark?
3d ago
Data Engineer Things

Published in

Data Engineer Things

How did Airbnb build their semantic layer?

Minerva, the Airbnb metric platform

May 1
2
How did Airbnb build their semantic layer?
How did Airbnb build their semantic layer?
May 1
2
Data Engineer Things

Published in

Data Engineer Things

Let’s use Orchestra to build an end-to-end data pipeline in 10 minutes

Spoiler: You don’t have to manage the infrastructure.

Apr 24
Let’s use Orchestra to build an end-to-end data pipeline in 10 minutes
Let’s use Orchestra to build an end-to-end data pipeline in 10 minutes
Apr 24
Data Engineer Things

Published in

Data Engineer Things

Why is dbt So Popular?

The motivation behind dbt and why it’s becoming a transformation standard(?)

Apr 17
18
Why is dbt So Popular?
Why is dbt So Popular?
Apr 17
18
Data Engineer Things

Published in

Data Engineer Things

Why Walmart Chose Apache Hudi for Their Lakehouse

What can we learn

Apr 10
5
Why Walmart Chose Apache Hudi for Their Lakehouse
Why Walmart Chose Apache Hudi for Their Lakehouse
Apr 10
5
Vu Trinh

Vu Trinh

27K followers

Moving out to 👉 vutr.substack.com

Following
  • Sanjeet Shukla

    Sanjeet Shukla

  • George Zefkilis

    George Zefkilis

  • Ciro Greco

    Ciro Greco

  • Roger Martin

    Roger Martin

  • Jano le Roux

    Jano le Roux

See all (73)

Help

Status

About

Careers

Press

Blog

Privacy

Rules

Terms

Text to speech