Today’s cluster computing arena spark is getting used for its fast and scalable application model. while comparing spark with traditional map-reduce, it provides In-memory computing which is 10x faster and provides real-time data processing with Spark streams. RDDs Spark provides a distributed collection object which is immutable and called Resilient…