Pinned | Ritam Mukherjee in Towards Data Engineering | Building End-to-End Customer Insights Pipeline by Integrating Multiple Data Sources in Spark With… | My articles are open to everyone; non-members can read the full article by clicking this link. | 6d ago
Pinned | Ritam Mukherjee in Towards Data Engineering | SQL Interview Questions That Every Candidate Should Know | Oct 27
Pinned | Ritam Mukherjee in Towards Data Engineering | Many Orgs Are Moving from Cassandra to ScyllaDB. But Why? | In recent years, companies have been making a quiet but significant move: switching from Apache Cassandra to ScyllaDB. If you’re wondering… | Oct 12
Pinned | Ritam Mukherjee in Towards Data Engineering | Parquet Is Good for OLAP but Not for OLTP Use Cases. But Why? | Sep 28
Pinned | Ritam Mukherjee in Towards Data Engineering | Data Skew in Spark: Using Salting While Avoiding Common Mistakes | Oct 5
Ritam Mukherjee in Towards Data Engineering | Scaling Apache Spark: Understanding Cluster Utilisation with a 50-Node Setup | In this article, we will explore how resource management impacts performance in Apache Spark. We will use a 50-node Spark cluster setup to… | Sep 23