Avoid partition skew on BigQuery

Antonio Cachuan
3 min readDec 11, 2023

I hope this end of year finds you planning your next year's data resolutions. For me I wanted to return to writing, It’s been a while since my last article so get ready for regular BigQuery articles 😊. Let’s go!

What is partition skew?

When you group data from a specific column one value could occur more often than any other value so the partition may crash the slot that processes the oversized partition.

Following the definition the next image resumes the idea, in this case, imagine you are running a query…

--

--

Antonio Cachuan

Google Cloud Professional Data Engineer (2x GCP). When code meets data, success is assured 🧡. Happy to share code and ideas 💡 linkedin.com/in/antoniocachuan/