AWS Glue, Lambda, S3, EMR, Athena and IAM, by Anup Chakole (3d ago). Here’s a detailed explanation of AWS Glue, AWS Lambda, S3, EMR, Athena and IAM, their use cases, and how they can be integrated, especially…
AWS EMR vs. Databricks: Choosing the Right Data Platform, by ZIRU (Jun 30). In the world of big data, AWS EMR (Amazon Elastic MapReduce) and Databricks are two powerful platforms that help businesses process and…
A. Understanding EMR to Databricks Migration: AI Assisted Inventory Mapping, by THE BRICK LEARNING (6d ago). Migrating from legacy systems like Amazon EMR to a modern data platform such as Databricks is a transformative endeavor that promises…
PyDeequ — Testing Data Quality at Scale, by Akashdeep Gupta (Dec 24, 2023). This blog post will cover the different components of PyDeequ and how to use PyDeequ to test data quality in depth.
Creating Delta Tables in AWS S3 Using AWS EMR, by Soumyajitchatterjee (Dec 1). Hey everyone! Recently, at the startup I work for, I was tasked with creating Delta Tables. While we considered using third-party tools…
Building a batch ETL pipeline using Airflow, Spark, EMR, and Snowflake, by AbdoulKaled in AWS Tip (Jun 11). This project will join the hourly_ridership (60M records) and wifi_location (300 records) datasets based on a column and calculate the total…
AWS EMR Full Course: A Comprehensive Guide to Big Data Processing, by Mayurkumar Surani (Nov 15).
AWS Glue vs EMR vs EMR Serverless: A Comparison, by Sachin Kala Sidhardhan (Aug 22, 2023). AWS Glue, Amazon EMR (Elastic MapReduce), and EMR Serverless are all services offered by Amazon Web Services (AWS) for data processing and…