Prefer Unions over Or in Spark JoinsAn anti-pattern in Spark SQL is the use of an `or` in a `join`. This might look intuitive, but has some performance issues.Oct 11, 2019Oct 11, 2019
Published inLogistimo Engineering BlogDynamo vs Cassandra : Systems Design of NoSQL DatabasesState-of-the-art distributed databases represent a distillation of years of research in distributed systems. The concepts underlying any…Oct 2, 2018Oct 2, 2018
A Simple Dichotomy for Modeling Data-Intensive SystemsTL; DR: This post presents a simple, almost trivial, mental-model to help think about data-intensive systems.Aug 18, 2018Aug 18, 2018
Published inLogistimo Engineering BlogUnderstanding Apache Spark on YARNTL; DR Apache Spark is a lot to digest; running it on YARN even more so. I hope this post helps a bit.Jul 24, 2018Jul 24, 2018
An Early Employee’s Field Guide to Workplace ArgumentsConflicts are common in an early-stage startup. This post lists mental models to prevent, judge, diffuse & take leverage of conflicts.Feb 3, 2018Feb 3, 2018
Published inLogistimo Engineering BlogMultiple Parameter Lists in Scala27 Jan 2018Jan 27, 2018Jan 27, 2018