Zhidong KeinSalesforce EngineeringEngagement Activity Delta LakeUnlike data in our other data lakes, engagement activity is mutable and the mutation ratio is high, which creates a huge challenge for us.Sep 23, 20202Sep 23, 20202
Zhidong KeinSalesforce EngineeringGuaranteed Data Delivery System for Distributed ServicesWe need our data pipeline to be reliable, with as hight a fault tolerance as possible, to satisfy our commitment to customer trust.Jun 30, 20201Jun 30, 20201
Zhidong KeinSalesforce EngineeringOur Journey to Optimal Job Sizes for Apache SparkUsing Spark to compact data lake with metadata store and dynamic job scheduler at scaleAug 1, 2019Aug 1, 2019