Manisha GuptaFile Formats used in Big Data WorldFile Types- Brief Introduction of Parquet, Avro, ORCAug 11, 2019Aug 11, 2019
Manisha GuptaSpark: Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing1. Existing cluster computing frameworks lack abstractions for leveraging distributed shared memory. This makes them inefficient for two…Aug 11, 2019Aug 11, 2019