Ashish Garg · Single user cluster vs shared cluster — Databricks · Across companies in the data industry, one of the common questions that comes up is about cluster types and their usage, usually when companies… · Jul 28
Ashish Garg · Databricks SQL — Cricket points table problem · A cricket team’s standing is given in the left table in the diagram below. The expectation is to identify the win/loss count of each team… · May 30
Ashish Garg · How to add external location in Unity Catalog? · This article shows, pictorially, how to add external locations to Unity Catalog in the Databricks UI. · May 30
Ashish Garg · How to download files from Databricks? · This is tricky, and you can do it in two ways. Both methods are shown here with examples. · May 28
Ashish Garg · Broadcasting — Python vs PySpark · Broadcasting is a common term used in PySpark as well as in Python. So what’s the difference between the two, or three (2 in PySpark… · May 21
Ashish Garg · Error Logging in PySpark · Logging runtime errors is critical for troubleshooting the problems that can occur at runtime in various environments. Thus a… · May 16
Ashish Garg · ML/AI — Classifier evaluations · In the world of ML/AI, think of a classifier model as the trusty sidekick for all sorts of cool tasks. It’s like the Swiss Army knife for… · May 16
Ashish Garg · Python Generator Function (Yield) · Generator functions are a special kind of function that doesn’t return the whole result at once but yields values one at a time. Confused? Don’t… · May 3
Ashish Garg · Azure Data Factory — Primer! · Let’s dive into Azure Data Factory by looking at its UI to understand the basic options and controls available for the user. · Apr 25
Ashish Garg · SQL — Understand Self Joining · This is a quick note on the self join, written because my subordinates asked me for an article on self joining to get clear… · Apr 23