Published in Analytics Vidhya

Dask — Parallelism for Analytics at Scale

Dask is one of the wonderful tools in the Python ecosystem. It lets you scale data workloads to datasets that do not fit in memory on a typical workstation. I will list why I find it useful and why it works so well for scaling existing Python packages.
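To make the idea of scaling a workload concrete, here is a minimal sketch using `dask.delayed`. The `chunk_sum` helper, the chunk size, and the toy workload are assumptions made up for this illustration and are not taken from the article; a real workload would more likely read files or database partitions.

```python
from dask import delayed


@delayed
def chunk_sum(start, stop):
    # Hypothetical helper: handle one chunk of the work on its own,
    # so the full range is never materialised as a single list.
    return sum(range(start, stop))


# Build a lazy task graph: four independent chunk tasks plus one
# task that combines them. Nothing is computed at this point.
chunk_size = 250_000
parts = [chunk_sum(i, i + chunk_size) for i in range(0, 1_000_000, chunk_size)]
total = delayed(sum)(parts)

# .compute() hands the graph to a Dask scheduler, which is free to
# run the independent chunk tasks in parallel.
result = total.compute()
print(result)
```

Building the graph is cheap because each call only records a task. The work happens when `.compute()` is called, which is what allows Dask to plan the execution across chunks rather than loading everything at once.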

Dask at its heart is a parallel computing library for Python. While there are other parallel…




Nuzhi Meyen

Co-founder of Helios P2P. Sri Lankan. Interested in Finance, Advanced Analytics, BI, Data Visualization, Computer Science, Statistics, and Design Thinking.
