Pinned | Vu Trinh in The Deep Hub | How do we run Kafka 100% on the object storage? | Let’s see how AutoMQ makes this dream come true. | Aug 27 | 4
Pinned | Vu Trinh in Data Engineer Things | I spent 8 hours learning Parquet. Here’s what I discovered | I finally sat down and learned about it. | Aug 24 | 15
Pinned | Vu Trinh in Data Engineer Things | How does Uber build real-time infrastructure to handle petabytes of data every day? | All insights from the paper: Real-time data infrastructure at Uber | Mar 23 | 19
Vu Trinh | I spent 8 hours researching WarpStream | Rewriting Kafka protocol in Go and running 100% on object storage | 1d ago | 1
Vu Trinh in Data Engineer Things | I spent 8 hours diving deep into Snowflake (again) | Virtual Warehouse, Intermediate Storage, Cache, and Remote Storage | Sep 28 | 1
Vu Trinh in Google Cloud - Community | I spent 5 hours learning how Google lets us build a Lakehouse. | The Google Cloud BigLake | Sep 24
Vu Trinh in Data Engineer Things | I spent 5 hours learning how ClickHouse built their internal data warehouse. | 19 data sources and a total of 470 TB of compressed data. | Sep 21 | 1
Vu Trinh in Data Engineer Things | I spent 5 hours learning how Google manages terabytes of metadata for BigQuery. | How Google manages metadata at a large scale. | Sep 17
Vu Trinh in Data Engineer Things | Uber’s Big Data Revolution: From MySQL to Hadoop and Beyond | Volume: 100+ PB Data, Latency: Minutes | Sep 14 | 1
Vu Trinh in Data Engineer Things | I spent 6 hours learning how Apache Spark plans the execution for us. | Catalyst, Adaptive Query Execution, and how Airbnb leverages Spark 3. | Sep 11 | 2