Anuj Goyal
Oct 20, 2015 · 1 min read

When we push data to the Elasticsearch, we define index based on the timestamp in the event. We have defined fix number of shards for each type of the data depending on its volume. We mostly query latest data from Elasticsearch so mostly that index is in the memory. Sometime we do query more than one indexes and Elasticsearch is able to handle that well.

You are right. Redshift is not great for online querying, and that’s why we use Redshift only for business intelligence - which is offline reporting and analytics.

Anuj Goyal

Written by

Data & Machine Learning at IFTTT

More From Medium

Welcome to a place where words matter. On Medium, smart voices and original ideas take center stage - with no ads in sight. Watch
Follow all the topics you care about, and we’ll deliver the best stories for you to your homepage and inbox. Explore
Get unlimited access to the best stories on Medium — and support writers while you’re at it. Just $5/month. Upgrade