Kalpan Shah · Deploy Spark Using Docker · Spark Deploy for Data Engineering with all packages · Jan 29, 2023
Kalpan Shah in Plumbers Of Data Science · Spark Chapter 12 Spark with Apache Kafka · Spark Structured Streaming with Apache Kafka · Aug 4, 2023
Kalpan Shah in Dev Genius · Traditional Datawarehouse vs Lakehouse · Detailed understanding of Lakehouse and Datawarehouse with Lakehouse implementation guide (using Delta/HUDI/Iceberg formats) · May 29, 2023
Kalpan Shah in Plumbers Of Data Science · Delta Lake with Python (delta-rs) · Delta tables Read, Write, History check, and vacuum using Python (Without Apache Spark) · May 21, 2023
Kalpan Shah in FAUN — Developer Community 🐾 · Delta Lake: An Introduction to a High-Performance Data Management System · End-to-End Lakehouse Implementation using Delta Lake · May 1, 2023
Kalpan Shah in Plumbers Of Data Science · Spark ETL Chapter 11 with Lakehouse Optimization · Delta table Optimization (ZORDER and Compaction) · Apr 5, 2023
Kalpan Shah in Plumbers Of Data Science · Spark ETL Chapter 10 with Lakehouse · Spark ETL with Delta Lake, Apache Iceberg, and Apache Hudi · Apr 4, 2023
Kalpan Shah in Level Up Coding · Spark ETL Chapter 9 with Lakehouse | Apache Iceberg · Mar 26, 2023
Kalpan Shah in Plumbers Of Data Science · Spark ETL Chapter 8 with Lakehouse | Apache HUDI · Mar 24, 2023
Kalpan Shah in Geek Culture · Spark ETL Chapter 7 with Lakehouse | Delta Lake · Mar 19, 2023