Sushil KumarThe Big Debate : Databricks vs SnowflakeIn the realm of cloud data warehousing, two prominent contenders have emerged as frontrunners: Databricks and Snowflake. Both platforms…Dec 31, 20236Dec 31, 20236
Sushil KumarinPython in Plain EnglishPolicy-Based Access Control (PBAC): What It Is and Why You Need It in Your Modern Data Lakehouse ?My recent project involved implementing PBAC in a Databricks environment. It was a new challenge for me, but I’m excited to share what I…Oct 17, 2023Oct 17, 2023
Sushil KumarinPython in Plain EnglishSay Goodbye to Pandas : Introducing PolarsA Lightning-Fast DataFrame Library for PythonAug 7, 20231Aug 7, 20231
Sushil KumarinPython in Plain EnglishMaximizing Spark Performance: Minimizing Shuffle Overhead“Shuffle the cards, not the problems.” — AnonymousJul 23, 20232Jul 23, 20232
Sushil KumarinPython in Plain EnglishA Deep Dive into Apache Spark Join StrategiesJoin operations are frequently used in big data analytics to merge two data sets, represented as tables or DataFrames, based on a common…Jul 20, 20232Jul 20, 20232
Sushil KumarinPython in Plain EnglishUnlocking Performance: Exploring Delta Engine Optimizations in Databricks - (Part 1/3)Databricks, with its Delta Engine, offers a suite of powerful optimizations that can significantly enhance the performance of your data…Jun 4, 2023Jun 4, 2023
Sushil KumarSpark Ignited: Unleashing the Performance Beast within Apache SparkIntroduction: Apache Spark has revolutionized big data processing, empowering businesses to extract valuable insights from vast datasets…Jun 3, 2023Jun 3, 2023