SimonaZhangOne tutorial project covers Airflow, EC2, S3, ETLToday I followed a really a good tutorial providing a grasp of multiple concepts, such as EC2, data transformation, S3, and especially…Aug 10Aug 10
SimonaZhangSalting technique to solve data skewnessWhen working with large datasets in Spark, distributing data across multiple partitions helps in parallel processing, utilizing the…May 2May 2
SimonaZhangAzure Data Factory — Project on Covid19 (Note-1)Azure Data Factory OverviewJan 27, 20221Jan 27, 20221
SimonaZhangMeasures of Dispersion ReviewMeasures of dispersion can be important tools for understanding your marketing data. In marketing analytics, we use these tools to describe…Jan 17, 2022Jan 17, 2022
SimonaZhangAttribution Models Reference Guide(Notes from the Marketing Analytics course on Coursera)Jan 17, 2022Jan 17, 2022
SimonaZhangSetting up Spark Clusters with AWSOverview of the Set up of a Spark ClusterJan 15, 2022Jan 15, 2022