PinnedVu TrinhinData Engineer ThingsThe Architecture of Apache DruidWhen Hadoop can solve every problemJun 151Jun 151
PinnedVu TrinhinThe Deep HubHow does LinkedIn process 4 Trillion Events every day?Key insights on how LinkedIn leverages Apache Beam for real-time processingJun 104Jun 104
PinnedVu TrinhinThe Deep HubAll you need to know about the Google File SystemHow did Google build its large-scale file system?May 125May 125
PinnedVu TrinhinData Engineer ThingsHow does Uber build real-time infrastructure to handle petabytes of data every day?All insights from the paper: Real-time data infrastructure at UberMar 2318Mar 2318
Vu TrinhinData Engineer ThingsApache Kafka — Important DesignsFilesystem, Zero-copy, and BatchingJul 132Jul 132
Vu TrinhinData Engineer ThingsApache Kafka — OverviewThe terminology and the architecture.Jul 64Jul 64
Vu TrinhinData Engineer ThingsHow does Uber handle petabytes of Spark shuffle data every day?The Remote External Service (RSS)Jun 221Jun 221
Vu TrinhinData Engineer ThingsEverything you need to know about MapReduceAll the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from GoogleJun 13Jun 13