Shubhodaya Hampiholi – Medium

Shubhodaya Hampiholi

Shubhodaya Hampiholi

Monitor costs using System Catalog tables in Databricks

Cloud costs can grow significantly if there are no proper monitoring mechanisms in place to ensure timely action on ill-performing jobs…

Oct 6

Monitor costs using System Catalog tables in Databricks

Oct 6

Shubhodaya Hampiholi

Why Writes in Cassandra are fast ?

In this article we will deep dive into the steps involved as part of a write operation in Cassandra and the features which enable it to…

Oct 5

Why Writes in Cassandra are fast ?

Oct 5

Shubhodaya Hampiholi

What is Databricks LakeFlow ?

A unified, intelligent solution for data engineering

Jul 2

What is Databricks LakeFlow ?

Jul 2

Shubhodaya Hampiholi

Implement Stream Data Processing using Databricks Autoloader and continuous workflow.

Introduction: This article provides an end-to-end guide to implement a continuous streaming data intake and processing workflow using…

Mar 20

Implement Stream Data Processing using Databricks Autoloader and continuous workflow.

Mar 20

Shubhodaya Hampiholi

Comprehensive Guide on Databricks Performance Optimization

As part of this article I have tried to cover various Spark and Databricks performance optimization strategies. This article is to provide…

Jan 16

Comprehensive Guide on Databricks Performance Optimization

Jan 16

Shubhodaya Hampiholi

Streaming Data Ingestion with Databricks Auto Loader

Use case: As part of out Data Ingestion framework we wanted to adapt to a robust, scalable and reusable ingestion mechanism which can cater…

Nov 27, 2023

Streaming Data Ingestion with Databricks Auto Loader

Nov 27, 2023

Shubhodaya Hampiholi

Data Cataloging using PyApacheAtlas and Microsoft Purview

Use case: As part of our Data landscape, we wanted to have an unified and centralized capability which would allow searching for a…

Oct 23, 2023

Data Cataloging using PyApacheAtlas and Microsoft Purview

Oct 23, 2023

Shubhodaya Hampiholi

Data Reconciliation using Apache Spark on Azure Databricks

Use case: As part of Platform Modernization and migration from Azure Gen1 to Gen2, we wanted to have a data reconciliation tool which would…

Oct 11, 2023

Data Reconciliation using Apache Spark on Azure Databricks

Oct 11, 2023

Shubhodaya Hampiholi

Configuration driven Data Lifecycle Management policies for Azure Storage Accounts

Use case: We wanted to have a configuration driven data lifecycle management policy framework which could be used to apply different…

Oct 5, 2023

Configuration driven Data Lifecycle Management policies for Azure Storage Accounts

Oct 5, 2023

Shubhodaya Hampiholi

Shubhodaya Hampiholi

Principal Data Engineer at Haleon.

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams