PinnedVu TrinhinData Engineer ThingsThe Architecture of Apache DruidWhen Hadoop can solve every problemJun 151Jun 151
PinnedVu TrinhinThe Deep HubHow does LinkedIn process 4 Trillion Events every day?Key insights on how LinkedIn leverages Apache Beam for real-time processingJun 104Jun 104
PinnedVu TrinhinThe Deep HubAll you need to know about the Google File SystemHow did Google build its large-scale file system?May 125May 125
PinnedVu TrinhinData Engineer ThingsHow does Uber build real-time infrastructure to handle petabytes of data every day?All insights from the paper: Real-time data infrastructure at UberMar 2317Mar 2317
Vu TrinhinData Engineer ThingsApache Kafka — OverviewThe terminology and the architecture.Jul 63Jul 63
Vu TrinhinData Engineer ThingsHow does Uber handle petabytes of Spark shuffle data every day?The Remote External Service (RSS)Jun 22Jun 22
Vu TrinhinData Engineer ThingsEverything you need to know about MapReduceAll the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from GoogleJun 13Jun 13
Vu TrinhinData Engineer ThingsHow Twitter processes 4 billion events in real-time dailyFrom Lambda to KappaMay 253May 253