Robin JEANSimple script to display the data distribution on HDFS filesThe purpose of this article is to present a simple Python script (Python3) to display the data distribution for any HDFS file and…Feb 19, 2019Feb 19, 2019
Robin JEANinBig Data on Amazon Elastic MapReduceRun a Spark job within Amazon EMR in 15 minutesThis tutorial is for Spark developper’s who don’t have any knowledge on Amazon Web Services and want to learn an easy and quick way to run…Jan 9, 20185Jan 9, 20185