Mainstream OLAP data analysis engines, such as Redshift and Spark, use distributed architecture to distribute computing tasks to various working nodes. They utilize large clusters of machines to improve data processing throughput. However, if we only need to analyze and process a few gigabytes of data, it would be overkill…