Spinning Up a Free Hadoop Cluster: Step by Step
Austin Ouyang
2628

Thank Austin for the nice note. I wrote a Jupyter notebook that covers these steps and enables however many nodes you want without going through the hacking of terminal group input. You can spin this up entirely within your Jupyter notebook, but you do need to install boto3 and awscli, and setting up the aws_access_key_id and aws_secret_access_key through *your terminal* (‘pip install awscli’ then ‘aws configure’). (A similar notebook setting up Spark is also there, but I did not finish the port-forwarding part so that you can run your Jupyter notebook harnessing the cluster). Link: https://github.com/ddu1/Hadoop-spark-setup/blob/master/Setup_Hadoop_Jupyter.ipynb

Show your support

Clapping shows how much you appreciated Daping Du’s story.