Photo by Jonathan Farber on Unsplash

Hive Optimization — Quick Refresher

Amit Singh Rathore
Nov 25, 2020 · 4 min read

Partitioning & Bucketing
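
Partitioning splits a table into directories by column value so that queries filtering on that column read only the matching directories, while bucketing hashes rows into a fixed number of files per partition. A minimal sketch follows, assuming hypothetical tables `sales_raw` and `sales_part`; the column names and the bucket count of 32 are illustrative, not taken from the article.

```sql
-- Hypothetical table: partition on a low-cardinality column,
-- bucket on a high-cardinality column that is often used in joins.
CREATE TABLE sales_part (
  order_id    BIGINT,
  customer_id BIGINT,
  amount      DECIMAL(10,2)
)
PARTITIONED BY (order_date STRING)
CLUSTERED BY (customer_id) INTO 32 BUCKETS
STORED AS ORC;

-- Let Hive derive the partition value from the data itself.
SET hive.exec.dynamic.partition = true;
SET hive.exec.dynamic.partition.mode = nonstrict;
-- On Hive releases before 2.0, bucketing on insert also needs:
-- SET hive.enforce.bucketing = true;

-- The partition column must come last in the SELECT list.
INSERT OVERWRITE TABLE sales_part PARTITION (order_date)
SELECT order_id, customer_id, amount, order_date
FROM sales_raw;
```

A query such as `SELECT ... FROM sales_part WHERE order_date = '2020-11-25'` would then scan a single partition directory rather than the whole table.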

File format
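
Columnar formats such as ORC and Parquet usually scan faster than plain text because Hive can read only the columns a query needs and skip data using built-in statistics. As a rough illustration, assuming a hypothetical text-backed table `events_text`, a table can be rewritten as compressed ORC like this:

```sql
-- events_text and events_orc are hypothetical table names.
CREATE TABLE events_orc
STORED AS ORC
TBLPROPERTIES ("orc.compress" = "SNAPPY")
AS
SELECT * FROM events_text;
```

`ZLIB` is the other commonly used value for `orc.compress`; it typically trades more CPU for smaller files.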

Execution engine
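
The engine is chosen per session. A short sketch, assuming Tez is installed on the cluster:

```sql
-- Valid values are mr (classic MapReduce), tez, and spark.
SET hive.execution.engine = tez;
```

Tez runs a query as a single DAG instead of a chain of MapReduce jobs, which generally avoids writing intermediate results to HDFS between stages.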

Image from https://docs.cloudera.com/

Vectorization
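
Vectorized execution processes rows in batches (1024 at a time by default) instead of one at a time, which cuts per-row overhead for scans, filters, and aggregations. A minimal sketch of the session settings:

```sql
-- Vectorize the map side of the query.
SET hive.vectorized.execution.enabled = true;
-- Vectorize the reduce side as well, where the engine supports it.
SET hive.vectorized.execution.reduce.enabled = true;
```

Vectorization was originally limited to ORC-backed tables, so it is worth checking the `EXPLAIN` output on your Hive version to confirm that a given query is actually vectorized.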

Predicate pushdown
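
With this optimization, filters are evaluated as close to the data as possible, ideally inside the storage reader, so that rows failing the `WHERE` clause are never materialized. A sketch of the relevant settings, assuming an ORC table:

```sql
-- Push filter predicates below joins and other operators in the plan.
SET hive.optimize.ppd = true;
-- Hand the predicates to the storage handler / file reader.
SET hive.optimize.ppd.storage = true;
-- Let ORC use its min/max indexes to skip stripes and row groups.
SET hive.optimize.index.filter = true;
```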

Use of a single scan
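
When several outputs are derived from the same source table, Hive's multi-insert syntax reads the source once instead of once per output. An illustrative sketch, assuming hypothetical tables `page_views`, `views_us`, and `views_in` with matching schemas:

```sql
-- One pass over page_views feeds both target tables.
FROM page_views pv
INSERT OVERWRITE TABLE views_us
  SELECT pv.* WHERE pv.country = 'US'
INSERT OVERWRITE TABLE views_in
  SELECT pv.* WHERE pv.country = 'IN';
```

Written as two separate `INSERT ... SELECT` statements, the same work would scan `page_views` twice.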

Enable compression of intermediate data
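
Compressing the data shuffled between stages reduces disk and network I/O at the cost of some CPU. A minimal sketch, assuming the Snappy codec is available on the cluster:

```sql
-- Compress files produced between the stages of a multi-stage query.
SET hive.exec.compress.intermediate = true;
SET hive.intermediate.compression.codec = org.apache.hadoop.io.compress.SnappyCodec;
SET hive.intermediate.compression.type = BLOCK;
```

A fast codec such as Snappy is the usual choice here, since intermediate data is written once and read once.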

Join Optimizations
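
One common join optimization is the map-side join: when one side of the join is small enough to fit in memory, Hive can broadcast it to every mapper and skip the shuffle. A sketch of the settings, with an illustrative size threshold:

```sql
-- Let Hive convert a common join to a map join automatically.
SET hive.auto.convert.join = true;
SET hive.auto.convert.join.noconditionaltask = true;
-- Combined size in bytes of the small tables that may be held in memory;
-- the value here is only an example and should be tuned per cluster.
SET hive.auto.convert.join.noconditionaltask.size = 10000000;
```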

SMB Join
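
A sort-merge-bucket (SMB) join applies when both tables are bucketed and sorted on the join key with compatible bucket counts, so matching buckets can be merged directly without a shuffle. The sketch below uses hypothetical tables `orders_b` and `customers_b`; the bucket count of 16 is illustrative.

```sql
-- Both sides are bucketed AND sorted on the join key.
CREATE TABLE orders_b (order_id BIGINT, customer_id BIGINT, amount DOUBLE)
CLUSTERED BY (customer_id) SORTED BY (customer_id) INTO 16 BUCKETS
STORED AS ORC;

CREATE TABLE customers_b (customer_id BIGINT, name STRING)
CLUSTERED BY (customer_id) SORTED BY (customer_id) INTO 16 BUCKETS
STORED AS ORC;

SET hive.auto.convert.sortmerge.join = true;
SET hive.optimize.bucketmapjoin = true;
SET hive.optimize.bucketmapjoin.sortedmerge = true;

SELECT o.order_id, c.name
FROM orders_b o
JOIN customers_b c ON o.customer_id = c.customer_id;
```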

SKEWED TABLE
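
If a few values of a column account for a large share of the rows, declaring them at table creation lets Hive store those values separately from the rest. A minimal sketch, assuming a hypothetical table `clicks_skewed` in which `user_id` values 0 and 1 are the heavy hitters:

```sql
CREATE TABLE clicks_skewed (
  user_id BIGINT,
  url     STRING
)
SKEWED BY (user_id) ON (0, 1)
-- Optional: place each skewed value in its own directory (list bucketing).
STORED AS DIRECTORIES;
```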

SKEWED JOIN
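
For joins where a handful of keys dominate, Hive can set those keys aside at run time and join them in a follow-up map join, so a single reducer is not left with most of the work. A sketch of the settings, with an illustrative threshold:

```sql
-- Enable run-time skew handling for joins.
SET hive.optimize.skewjoin = true;
-- A key is treated as skewed once it has more than this many rows.
SET hive.skewjoin.key = 100000;
```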

Cost based optimization
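
The cost-based optimizer relies on table and column statistics to choose join order and join strategy, so it helps only if those statistics have been collected. A minimal sketch, assuming a hypothetical unpartitioned table `orders`:

```sql
SET hive.cbo.enable = true;
-- Answer simple queries such as COUNT(*) from stored statistics.
SET hive.compute.query.using.stats = true;
SET hive.stats.fetch.column.stats = true;

-- Collect table-level and then column-level statistics.
ANALYZE TABLE orders COMPUTE STATISTICS;
ANALYZE TABLE orders COMPUTE STATISTICS FOR COLUMNS;
```

Statistics go stale as data changes, so the `ANALYZE` statements are typically rerun after large loads.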

The Startup
