Gangadhar KadamAutomating Spark Jobs with Oozie Spark ActionIf you use Apache Spark as part of a complex workflow with multiple processing steps, triggers, and interdependencies, consider using…Sep 10, 20183Sep 10, 20183
Gangadhar KadamBeneath RDD(Resilient Distributed Dataset) in Apache Sparkis the primary data abstraction in Apache Spark and the core of Spark that we often refer to as “Spark Core”.Sep 3, 2018Sep 3, 2018