Anand SatheeshHandling Large number of Small Files in Big Data FrameworksRecently I came across an interesting programming challenge, I am ingesting change data capture(CDC) data from a Relational Database into a…3d ago3d ago
Anand SatheeshTop 20 Hands-On Apache Kafka Interview Questions with Detailed AnswersApache Kafka has become a critical tool in modern data architectures due to its ability to handle large-scale, real-time data streams with…Sep 27Sep 27
Anand SatheeshDeriving value from LLMs using RAGLarge Language Models (LLMs) are advanced artificial intelligence models designed to understand, generate, and manipulate human language…Jul 17Jul 17
Anand SatheeshApache Spark Commonly seen errors in production and their solutions.Apache Spark is a powerful tool for big data processing, it uses distributed data processing in memory to reduce the execution time…Jul 1Jul 1
Anand SatheeshSpark’s Dynamic Resource Allocation, does it really help?Apache Spark, a powerful distributed computing framework, excels in processing large-scale data workloads efficiently. One of its key…Jun 30Jun 30
Anand SatheeshTest-Driven Development (TDD): Explanation, Advantages, and ExamplesIntroduction to Test-Driven Development (TDD)Jun 24Jun 24
Anand SatheeshDatabase Dilemmas: Cracking the Code to Your Perfect Database MatchChoosing the right database for your application is a pivotal decision that can significantly influence your projects’s success. With a…Jun 17Jun 17
Anand SatheeshFlask vs FastAPI for microservicesRecently I came across an interesting design dilemma, to build a few microservices using python, which would be the best framework to use?Jun 13Jun 13
Anand SatheeshSetting up a Kafka pipeline to ingest a stream of stock market dataThis project covers the following topics:May 30May 30
Anand SatheeshA Practical Guide to Solve LeetCode ProblemsIf you landed on this article, chances are you are in the 95% (totally made up stats btw) of readers who are trying to make a job switch to…May 30May 30