Fighting with Apache Phoenix. Secondary Indexes

Vlad Krava
Apr 14 · 5 min read

Long Story Short

Setup

USER_DATA_150K / USER_DATA_15M / USER_DATA_30M Tables Structure

Performance Overview

SELECT REC_KEY, REC_STATE, REC_CREATE_DATE, REC_EMAIL, REC_ FIRST_NAME, REC_LAST_NAME FROM TEST.USER_DATA_XXX WHERE REC_KEY = 'xxx'
Performance Metrics — Retreive by Primary Key
SELECT REC_KEY, REC_STATE, REC_CREATE_DATE, REC_EMAIL, REC_ FIRST_NAME, REC_LAST_NAME FROM TEST.USER_DATA_XXX WHERE REC_LAST_NAME = 'xxx' ORDER BY REC_MODIFY_DATE
Performance Metrics — Filtering and Sorting by Table Fields

Phoenix’s Global Indexes

Global Indexes. Problems Overview

Important Notes

$ hadoop fs -du -h hdfs://{PATH_TO_HBASE}/data/data/{SCHEMA}/{INDEX}

Conclusion

Vlad Krava

Written by

Engineering technologist, full-time lucky man https://twitter.com/vkrava4

Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch
Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore
Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade