💡Mike ShakhomirovinTowards Data ScienceAdvanced Data ModellingData model layers, environments, tests and data quality explained8h ago
Vu TrinhinData Engineer ThingsApache Kafka — Important DesignsFilesystem, Zero-copy, and BatchingJul 131Jul 131
💡Mike ShakhomirovinTowards Data ScienceAdvanced Data ModellingData model layers, environments, tests and data quality explained8h ago
Vu TrinhinData Engineer ThingsApache Kafka — Important DesignsFilesystem, Zero-copy, and BatchingJul 131
Vu TrinhinData Engineer ThingsHow Twitter processes 4 billion events in real-time dailyFrom Lambda to KappaMay 254
ShanojApache Hive 101: MSCK Repair TableThe MSCK REPAIR TABLE command in Hive is used to update the metadata in the Hive metastore to reflect the current state of the partitions…2h ago
Vu TrinhinData Engineer ThingsEverything you need to know about MapReduceAll the key insights from the paper MapReduce: Simplified Data Processing on Large Clusters from GoogleJun 13