Pinned | Vu Trinh in Data Engineer Things | The Architecture of Apache Druid | When Hadoop can solve every problem | Jun 151
Pinned | Vu Trinh in The Deep Hub | How does LinkedIn process 4 Trillion Events every day? | Key insights on how LinkedIn leverages Apache Beam for real-time processing | Jun 104
Pinned | Vu Trinh in The Deep Hub | All you need to know about the Google File System | How did Google build its large-scale file system? | May 126
Pinned | Vu Trinh in Data Engineer Things | How does Uber build real-time infrastructure to handle petabytes of data every day? | All insights from the paper: Real-time data infrastructure at Uber | Mar 2316
Vu Trinh | How does Uber handle petabytes of Spark shuffle data every day? | The Remote External Service (RSS) | 5d ago
Vu Trinh in Data Engineer Things | Everything you need to know about MapReduce | All the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from Google | Jun 13
Vu Trinh in Data Engineer Things | How Twitter processes 4 billion events in real-time daily | From Lambda to Kappa | May 253
Vu Trinh in Data Engineer Things | The Hadoop Distributed File System | Everything you need to know about the HDFS | May 25
Vu Trinh in Data Engineer Things | I spent 5 hours understanding more about the Delta Lake table format | All insights from the paper: Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores | May 42
Vu Trinh | GroupBy #33: Data Gateway - A Platform for Growing and Protecting the Data Tier at Netflix, The… | Plus: Solving RevenueCat's data ingestion challenges into Snowflake, From ZooKeeper to KRaft: How the Kafka migration works | May 3