Sushil KumarThe Big Debate : Databricks vs SnowflakeIn the realm of cloud data warehousing, two prominent contenders have emerged as frontrunners: Databricks and Snowflake. Both platforms…3 min read·Dec 31, 2023--6--6
Sushil KumarPolicy-Based Access Control (PBAC): What It Is and Why You Need It in Your Modern Data Lakehouse ?My recent project involved implementing PBAC in a Databricks environment. It was a new challenge for me, but I’m excited to share what I…5 min read·Oct 17, 2023----
Sushil KumarSay Goodbye to Pandas : Introducing PolarsA Lightning-Fast DataFrame Library for Python6 min read·Aug 7, 2023--1--1
Sushil KumarMaximizing Spark Performance: Minimizing Shuffle Overhead“Shuffle the cards, not the problems.” — Anonymous5 min read·Jul 23, 2023--2--2
Sushil KumarA Deep Dive into Apache Spark Join StrategiesJoin operations are frequently used in big data analytics to merge two data sets, represented as tables or DataFrames, based on a common…6 min read·Jul 20, 2023--1--1
Sushil KumarUnlocking Performance: Exploring Delta Engine Optimizations in Databricks - (Part 1/3)Databricks, with its Delta Engine, offers a suite of powerful optimizations that can significantly enhance the performance of your data…3 min read·Jun 4, 2023----
Sushil KumarSpark Ignited: Unleashing the Performance Beast within Apache SparkIntroduction: Apache Spark has revolutionized big data processing, empowering businesses to extract valuable insights from vast datasets…3 min read·Jun 3, 2023----