Dask — Parallelism for Analytics at Scale
Dask is one of the wonderful tools that exist in the Python ecosystem which allows the scaling of data workloads for datasets that typically do not fit in memory in a ‘typical’ workstation. I will be listing why I find it useful and why it works so well in scaling the existing Python packages.

Dask at its heart is a parallel computing library for Python. While there are other parallel…