Analytics Vidhya
Published in

Analytics Vidhya

Get Hive count in seconds

To get an accurate count of the amount of data, when you can’t have any less or one more, COUNT(*) is the only way. But there are times when you don't need an exact number, but you need a rough estimate of the table size, for example, to understand that the table is not empty, or to roughly estimate the size of the data to be migrated. There is a faster way than COUNT(*) for such tasks. We can use Hive statistics.

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Kirill Bobrov

Kirill Bobrov

211 Followers

helping robots conquer the earth and trying not to increase entropy using Python, Big Data, ML. Linkedin @luminousmen. Check out my blog — luminousmen.com