Hands-on with the Snowflake Data Marketplace, Wunderman, and COVID-19

Learn how to use the Snowflake Data Marketplace with a real example.

Photo by NASA on Unsplash

The Snowflake Data Marketplace is quickly becoming one of my favorite things about working at Snowflake. With the marketplace, I have live, instant access to some of the world's best datasets to give my data deeper context and meaning.

I had the pleasure of recently presenting a virtual hands-on lab (recording) with Snowflake using Starschema’s COVID-19 dataset (link) and Wunderman Thompson’s COVID-19 Risk, Readiness and Recovery dataset (link) built on the backbone of their fabulous Amerilink Insights dataset. I highly recommend both datasets to drive a deeper understanding of our collective pandemic context. The best part is they are both free to play with, free! I love me some free data…let’s geek out!

Getting and querying data from the Snowflake Data Marketplace is dead simple and built on a powerful premise. After 3 clicks of the mouse, I have data ready for analysis and always in sync with Starschema’s or Wundermand’s latest updates. This means I have instant access to a single source of truth on COVID-19 cases reported by public health authorities worldwide, along with key economic drivers to enhance my data.

Getting Started

First, you will need a Snowflake account if you want to run these queries. Snowflake offers free trials (link) to get started, even with these powerful datasets. If you have an existing account, you must be the ACCOUNTADMIN to follow along. Grab a full copy of the SQL file here on GitHub (https://github.com/BigDataDave1/SnowflakeDataMarketplaceSQL/blob/main/SnowflakeDataMarketplaceVHOL_WundermanSep2020.SQL).

Login to your Snowflake account and switch your context to ACCOUNTADMIN.

Browse to the Data Marketplace.

Get the Starschema COVID-19 set by browsing to the Health category or searching.

Get the data and assign to roles that need access; keep the name as STARSCHEMA_COVID19 or the later queries will not work.

Starschema COVID-19 Data Set Queries

Now we are ready to start querying the data, please go ahead and create a new worksheet and paste this code for your own use. Be sure to set the session variables to a meaningful context, or these queries will not run. Grab a full copy of the SQL file ( here).

Snowsight Output of the Last Query in the JHU Query Block

Safegraph US Census Data & Neighborhood Insights Data Set

Now you are ready to augment the JHU data with populations. Thankfully, Safegraph has a free version of their US Census Data & Neighborhood Insights dataset. Get the Safegraph data from the Demographics category, get the database, assign to roles that need access, change the name to SAFEGRAPH_SHARE or the later queries will not work.

Safegraph Queries

Snowsight Output of the Last Query in the Safegraph Query Block

Weathersource Weather & Climate Data for BI Data Set

Now you are ready to augment the JHU data with the weather further. Weathersource has a free version of their US Weather & Climate for BI dataset. Be sure to get that specific dataset. Get the Weathersource data from the Weather category, get the database, and assign to roles that need access; change the name to WEATHERSOURCE_PARTNER_WS_ONPOINT_WEATHER_DATA_SHARE or the later queries will not work.

Weathersource Queries

Snowsight Output of the Last Query in the Weathersource Query Block

Wunderman COVID-19 Risk, Readiness & Recovery

Now you are ready to augment the JHU data with the amazing Wunderman data fully. Get the Wunderman Amerilink Insights data from the Marketing section, get the database, assign to roles that need access, change the name to WUNDERMAN or the later queries will not work.

The free, ready to query data set on the marketplace is just a random 1% sample of the overall data from the Wunderman data set. Request the “Personalized” version of the data set from the Marketplace to enter into a conversation with Wunderman Thompson about acquiring the unmasked, full data set for your needs.

Wunderman Queries

Snowsight Output of the Last Query in the Wunderman Query Block

Conclusion

As you can see for yourself, the Snowflake Data Marketplace is an amazing place where data enthusiasts are coming together to further enhance data as actionable and meaningful to your narrative. With just clicks of a mouse, we could query 4 different data sources with COVID-19 cases, weather, population, and demographic insights across the globe. I highly encourage playing around in the marketplace. To learn more about the Snowflake Data Marketplace beyond the data sets described in this article, please visit https://www.snowflake.com/data-marketplace/. Happy (Data Marketplace) Querying!

Links & Resources

To learn more about this and other topics, check out a few helpful resources:

Originally published at http://bigdatadave.com on September 11, 2020.

--

--

David A Spezia
Snowflake Builders Blog: Data Engineers, App Developers, AI/ML, & Data Science

David spends his time working with Web-Scale Data Applications built on Snowflake with the largest tenants on the Snowflake Data Cloud.