Photo by Mathew Schwartz on Unsplash

HyperLogLog: A Simple Estimation of the Number of Unique Elements in a Large Data Set

Prof Bill Buchanan OBE FRSE

We are increasingly working with large datasets, where we often need to build hash tables from the data elements that we have. But how do we count the number of data elements that are unique? Well, first we have to parse our input data into data elements. As a simple example, in Python, we can just take a string, and then parse it for…
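As a rough illustration of the exact-counting baseline, here is a minimal sketch in Python. The preview above is cut off before it says how the string is parsed, so the tokenization used here (lowercased, whitespace-separated words) is purely an assumption, and `count_unique` and `sample` are hypothetical names chosen for this example.

```python
def count_unique(text: str) -> int:
    """Return the exact number of distinct elements in a string.

    Assumption: an "element" is a lowercased, whitespace-separated token.
    The original text does not specify the parsing rule.
    """
    elements = text.lower().split()  # parse the input into data elements
    return len(set(elements))        # a set keeps one copy of each element


sample = "the quick brown fox jumps over the lazy dog the end"
print(count_unique(sample))
```

A set gives an exact answer, but it has to store every distinct element, so its memory use grows with the number of unique items. That cost is what motivates estimating the count instead, which is the approach HyperLogLog takes.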
