PinnedRubihaliFacade of Knowledge — An illusion of understandingDaniel J. Boorstin, an American historian aptly noted, “The greatest enemy of knowledge is not ignorance, it is the illusion of knowledge.”Jun 26Jun 26
PinnedRubihaliInterleaving — An underrated Experimentation TechniqueIntroductionApr 20, 2023Apr 20, 2023
RubihaliUnveiling Spark’s Tungsten Execution Engine: A Deep Dive into Advanced OptimizationsTungsten is a set of low level optimizations introduced in Spark 1.4 that significantly enhance memory management, execution efficiency and…1d ago1d ago
RubihaliCache() in PySpark — A misconceptionUnderstanding the nuances between transformations and actions in pyspark is crucial for optimizing data processing workflows. One common…Jul 51Jul 51
RubihaliAsynchronous World — Async Context ManagersWe will deep dive into Python’s async context manager — What are they and how they are used.Jun 241Jun 241
RubihaliKubernetes Architectural BitsLets go through with quick sneak peak of kubernetes architecture. Before that, lets revisit official definition of kubernetes.Jun 11Jun 11
RubihaliDocker Architecture OverviewWe all know what docker is and we do use it in our cloud native applications but do we know whats the internal structure of docker and how…May 28May 28
RubihaliUnderstanding the Ecosystem: Containers, Dockers,Kubernetes and Pods ExplainedUnderstanding the key differences between Docker, containers, pods, and Kubernetes is essential for anyone involved in modern software…May 26May 26
RubihaliSpark Mastery — Let’s explore catalyst optimizer in spark.How does Catalyst Optimizer work in spark ? What role does catalyst Optimizer play? How are query plans optimized in spark?May 20May 20
RubihaliSpark Mastery — Lets explore data skewness interview question with coding snippetWe will go through most asked questions in spark interviews along with coding examples in this series, Have Categorized them from most…May 19May 19