Lalitha Mohanasundaram | 🌟 AWS Glue vs. EMR: A Comparative Analysis 🌟 | AWS Glue and Elastic MapReduce (EMR) are both powerful tools offered by Amazon Web Services (AWS) for performing Extract, Transform, and… | 6d ago
Sachin Kala Sidhardhan | AWS Glue vs EMR vs EMR Serverless: A Comparison | AWS Glue, Amazon EMR (Elastic MapReduce), and EMR Serverless are all services offered by Amazon Web Services (AWS) for data processing and… | Aug 22, 2023
Apache DolphinScheduler in The Deep Hub | How to Integrate Apache DolphinScheduler with AWS EMR&Redshift | In this article, we will share the practice of integrating DolphinScheduler with AWS’s EMR and Redshift. | Aug 15
Akashdeep Gupta | PyDeequ — Testing Data Quality at Scale | This blog post will cover the different components of PyDeequ and how to use PyDeequ to test data quality in depth. | Dec 24, 2023
Life-is-short--so--enjoy-it | Hudi: EMR on EKS: Prefer to use OSS Hudi over AWS EMR Hudi | Upgrading to Hudi 0.14.1 revealed the benefits of using OSS bundles over AWS EMR’s version due to transparency issues and patch… | Aug 5
Konstantin Mogilevskii in Dev Genius | Run Apache Spark 3.5.1 workloads 4.5 times faster with Amazon EMR runtime for Apache Spark | The Amazon EMR runtime for Apache Spark is a performance-optimized runtime that is 100% API compatible with open source Apache Spark. It… | Aug 2
Ramakrishna Sanikommu | Unleash the Power of GPUs: Implementing Apache Spark on NVIDIA in AWS EMR | In the fast-paced world of big data, processing power is paramount. Enter the potent duo of Apache Spark and NVIDIA GPUs, ready to… | Jan 8