How to Create Tables and Query Data with Redshift Spectrum From S3

A hands-on tutorial on Redshift Spectrum

George Pipis
Geek Culture

--

In this tutorial, we will show you how to create several tables in Redshift Spectrum from data stored in S3. Finally, we will perform queries on the tables that we have created. Note that Redshift Spectrum is similar to Athena, since both services are for running SQL queries on S3 data.

Redshift Service

The first thing that we need to do is to go to Amazon Redshift and create a cluster. In my case, the Redshift cluster is running.

S3 Bucket

We will need to get the data from S3. For this example, we have used the following bucket but we provide you the data which are in json format.

--

--

George Pipis
Geek Culture

Sr. Director, Data Scientist @ Persado | Co-founder of the Data Science blog: https://predictivehacks.com/