Quickstart PySpark with Anaconda on AWS

PySpark and Anaconda on AWS.
pip install boto3
import boto3
import botocore
import yaml
import time
import logging
s3://<Name of Bucket to be created on S3>/bootstrap_actions.sh
s3://<Name of Bucket to be created on S3>/pyspark_quick_setup.sh
python emr_loader.py
Output of the EMR Jumpstart Script.

--

--

CTO & Co-Founder @ Priceloop (https://priceloop.ai/). Ex-Pivotal, Ex-idealo, Ex-Axel-Springer. More on @datitran.

Love podcasts or audiobooks? Learn on the go with our new app.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store