PinnedDinesh Kumar ArjunaninDev GeniusNested JSON Shredding using AWS AthenaStructured data formats plays the backbone of many data platforms, however semi-structured and unstructured data formats were increasingly…May 19May 19
Dinesh Kumar ArjunanStreaming ETL — Replication with AWS DMS, AWS MSK and Kafka-PythonApache Kafka have become the most prominent tool for streaming applications, and it has been adopted by 80% of the organisations globally…Aug 5Aug 5
Dinesh Kumar ArjunanOptimzing PySpark JDBC Read Performance — Using Glue, Lambda and Step FunctionGreetings..! Hope you all were doing good..!Jul 20Jul 20
Dinesh Kumar ArjunaninAWS TipCross Account S3 Reads Using AWS Glue SparkToday organizations have their central datalake as their main data collection center, where they cleanse, validate and store all their…Jul 4Jul 4
Dinesh Kumar ArjunanHow I tuned a day LongRunning Glue Job to complete in 40 minutesPyspark Optimization Techniques in GlueJun 22Jun 22
Dinesh Kumar ArjunanGlitche noticed when creating AWS Resources using AWS CDKAWS CDK — Cloud Development Kit is a powerful tool that helps us to define/deploy AWS resources like lambda, S3, SNS, RDS, DMS, Glue, etc…Jun 1Jun 1