Ramesh Ganesan

May 31, 2019

Basic usage of Spark RDDs and DataFrames.

In today’s cluster computing arena, Spark is widely used for its fast and scalable application model. Compared with traditional MapReduce, it provides in-memory computing, which can be up to 10x faster, and real-time data processing with Spark Streaming. RDDs: Spark provides an immutable, distributed collection object called a Resilient…

Big Data

4 min read

Ramesh Ganesan

Data Engineering enthusiast | Big Data | Python | SQL

