Rahul Vaid, in Dev Genius. Efficient Unique Count Calculation in ClickHouse : a comparitive analysis. Recently, I was tasked with calculating the unique count of a column in ClickHouse. This post will walk you through various methods for… (Jul 29)
Rahul Vaid. Implementing Bloom filters in Golang. A Bloom filter is a space-efficient probabilistic data structure used to test whether an element is a member of a set. (Jul 15)
Rahul Vaid. Bloom filters : An Introduction. While signing up on a website, you may have seen the message "username is already taken". Considering the website might have… (Jan 17)
Rahul Vaid. Kafka go : Handling arbitrary JSON data at scale. In this article we will discuss how to write a Kafka producer and consumer in Golang. The producer and consumer will deal with arbitrary… (Dec 28, 2023)
Rahul Vaid. Reduce the size of conda based docker images. If you are a data scientist or a data engineer, you have probably used conda at least once in your career. With the massive adoption… (Sep 25, 2022)
Rahul Vaid. Reading and Writing data to Azure Blob Storage Using Pyspark. Azure Blob Storage is a managed cloud storage service for storing large amounts of unstructured data. It is a secure, scalable and highly… (Jul 8, 2020)