Harshita Madan
Spark: RDDs V/S DataFrames V/S DataSets
There are 3 different APIs available in Spark for data processing.
Mar 16, 2022

Harshita Madan
Apache Spark: Largest Open Source Project in Data Processing
Apache Spark is an in-memory, general-purpose compute engine. It is a plug-and-play engine that needs two things to work with:
Mar 8, 2022

Harshita Madan
Can Apache Spark replace Hadoop?
It is wrong to assume that Apache Spark can replace Hadoop. Spark is an in-memory, general-purpose compute engine, whereas Hadoop is the…
Feb 22, 2022

Harshita Madan
What is Big Data?
Data that can be characterized by the 3 V’s is termed Big Data. These 3 V’s are Volume, Variety, and Velocity. Let’s see what these terms mean:
Feb 10, 2022