Meenakshi Sundaram Sekar: HDFS (Hadoop Distributed File System) Architecture in a nutshell. The Hadoop Distributed File System (HDFS) is Hadoop’s storage platform. Though Hadoop can interact with multiple different filesystems… (May 9, 2018)
Meenakshi Sundaram Sekar: YARN — Resource scheduler/Cluster Manager for Hadoop platform — in a nutshell. YARN schedules and orchestrates applications and tasks on the Hadoop platform. When tasks to be run need data from HDFS, YARN will attempt to… (Apr 22, 2018)
Meenakshi Sundaram Sekar: Spark Applications Running on Yarn — in a nutshell. YARN (Yet Another Resource Negotiator) is Hadoop’s popular resource scheduling cluster manager. In this post, let’s see how a Spark… (Apr 22, 2018)
Meenakshi Sundaram Sekar: Anatomy of a Spark Application — in a nutshell. A Spark application contains several components, all of which exist whether you are running Spark on a single machine or across a cluster of… (Apr 20, 2018)
Meenakshi Sundaram Sekar: Hadoop Architecture — Design Considerations — in a nutshell. 1. Hadoop is a platform for storing big data (petabytes of information) by distributing the data across the machines in the Hadoop… (Apr 8, 2018)