Pinned | Vu Trinh in The Deep Hub | How do we run Kafka 100% on the object storage? | Let’s see how AutoMQ makes this dream come true. | Aug 27
Pinned | Vu Trinh in Data Engineer Things | I spent 8 hours learning Parquet. Here’s what I discovered | I finally sat down and learned about it. | Aug 24
Pinned | Vu Trinh in Data Engineer Things | How does Uber build real-time infrastructure to handle petabytes of data every day? | All insights from the paper: Real-time data infrastructure at Uber | Mar 23
Vu Trinh in Data Engineer Things | I spent 6 hours learning how Apache Spark plans the execution for us. | Catalyst, Adaptive Query Execution, and how Airbnb leverages Spark 3. | 2d ago
Vu Trinh in Data Engineer Things | I spent 7 hours diving deep into Apache Iceberg | The more details on how everything works | Aug 31
Vu Trinh in Data Engineer Things | How did Discord evolve to handle trillions of data points | From in-house solutions to the modern data stack | Aug 20
Vu Trinh in Data Engineer Things | How did Facebook design their Real-Time Processing ecosystem | Hundreds of GBs per Second | Aug 17
Vu Trinh in Data Engineer Things | How Did LinkedIn Handle 7 Trillion Messages Daily With Apache Kafka? | Was adding more machines enough? | Aug 14