Member-only story
Spark vs Pandas, part 3 — Scala vs Python
Why programming languages matter
In this third installment of the series “Pandas vs Spark” we will have a closer look at the programming languages and the implications of choosing one.
Originally I wanted to write a single article for a fair comparison of Pandas and Spark, but it continued to grow until I decided to split this up. This is the second part of the small series.
- Spark vs Pandas, part 1 — Pandas
- Spark vs Pandas, part 2 — Spark
- Spark vs Pandas, part 3 — Programming Languages
- Spark vs Pandas, part 4 — Shootout and Recommendation
What to Expect
This third part of the series will focus on the programming languages Scala and Python. Spark itself is written in Scala with bindings for Python while Pandas is available only for Python.
Why Programming Languages matter
Of course programming languages play an important role, although their relevance is often misunderstood. Having the right programming language in your CV may eventually be one of the deciding factors for getting a specific job or project. This is a good example where the relevance of programming languages might be…