Project Journal, Week 5
Welcome to our DATA360 team blog! This blog will follow our investigation into interesting aspects of crime in Chicago.
We started our project by brainstorming topics we would like to dig into. KD was interested in crime in general, so we decided to investigate crime. On top of that, Chicago is known to many people as a city of crime and violence. With that in mind, we agreed on ‘Crime in Chicago’ as our topic.
Data Mining
For the first week, we found some related and interesting datasets on Kaggle and from other sources.
Crimes in Chicago
The first dataset we found interesting is Crimes in Chicago, an enormous BigQuery dataset which consists of crime data from 2001 to 2017. This dataset contains more than 6,000,000 rows of incident data (yes, 6 million). We are not yet sure to what extent we can use this dataset, but we are sure that we can do many great things with it. We figured we can merge other datasets with this one to tell interesting stories.
Other Datasets
Other datasets we found include the temperature at Chicago’s Midway Airport from 2000 to 2019. We found something interesting involving Crimes in Chicago and Midway Airport Temperature, which we will share in our next blog post. Additionally, we gathered Chicago’s Gasoline Price from 2000 to 2019 and Chicago’s Unemployment Rate from 1990 to 2019. We have not done any analysis with those two datasets yet, but we expect to find some interesting correlations once we do.
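For readers curious what merging a daily dataset with the crime data could look like, here is a minimal sketch in Python with pandas. It is only an illustration under assumptions, not the analysis we ran: the function name and the column names ("Date" for the incident timestamp, "date" and "temp_avg" for the temperature table) are hypothetical and may not match the real files.

```python
import pandas as pd


def daily_crime_vs_temperature(crimes: pd.DataFrame, temps: pd.DataFrame) -> pd.DataFrame:
    """Count incidents per day and attach that day's temperature.

    Assumed (hypothetical) columns:
      crimes["Date"]     - timestamp of each incident
      temps["date"]      - calendar day of each temperature reading
      temps["temp_avg"]  - average temperature for that day
    """
    # Reduce each incident timestamp to its calendar day.
    days = pd.to_datetime(crimes["Date"]).dt.normalize().rename("date")

    # One row per day with the number of incidents on that day.
    daily = crimes.groupby(days).size().reset_index(name="incidents")

    # Make sure the temperature table uses the same day-level key.
    temps = temps.assign(date=pd.to_datetime(temps["date"]).dt.normalize())

    # Inner join: keep only days present in both tables.
    merged = daily.merge(temps, on="date", how="inner")
    return merged.sort_values("date").reset_index(drop=True)
```

With a table like this, something such as merged["incidents"].corr(merged["temp_avg"]) would give a first rough look at whether the two move together. The same pattern should carry over to the gasoline price and unemployment data, though those are likely monthly rather than daily, so the crime counts would need to be grouped by month instead.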
That’s all we have for this week’s blog! There will be more, but we will save the fun for next time. We hope you enjoyed this post. Please comment below with what you found interesting or any suggestions you have for our project!