PinnedVu TrinhinThe Deep HubHow do we run Kafka 100% on the object storage?Let’s see how AutoMQ makes this dream come true.Aug 271Aug 271
PinnedVu TrinhinData Engineer ThingsI spent 8 hours learning Parquet. Here’s what I discoveredI finally sat down and learned about it.Aug 249Aug 249
PinnedVu TrinhinData Engineer ThingsHow does Uber build real-time infrastructure to handle petabytes of data every day?All insights from the paper: Real-time data infrastructure at UberMar 2318Mar 2318
Vu TrinhI spent 5 hours learning how Google manages terabytes of metadata for BigQuery.How Google manages metadata at a large scale.5h ago5h ago
Vu TrinhUber’s Big Data Revolution: From MySQL to Hadoop and BeyondVolume: 100+ PB Data, Latency: Minutes3d ago13d ago1
Vu TrinhinData Engineer ThingsI spent 6 hours learning how Apache Spark plans the execution for us.Catalyst, Adaptive Query Execution, and how Airbnb leverages Spark 3.6d ago26d ago2
Vu TrinhinData Engineer ThingsI spent 7 hours diving deep into Apache IcebergThe more details on how everything worksAug 311Aug 311
Vu TrinhinData Engineer ThingsHow did Discord evolve to handle trillions of data pointsFrom in-house solutions to the modern data stackAug 20Aug 20