Apache Spark is a cluster computing framework for fast processing of Hadoop data. Spark applications can be written in Scala