Homework #2

Three ideas that come to mind for midterm beat

  • What are the statistics of vegan soul food places in NYC?
  • What are the statistics of black and brown folk getting harassed/ pulled over by the cops after the stop and frisk laws were considered unconstitutional?
  • what are the statistics of black-owned farmers market stands in NYC?

The thing that surprised me about the “Cleaner, Smarter Spreadsheets Start with Structure” article was how important it is to have a data dictionary and what the definition of what that is. It is a “file that explains what each column header in your spreadsheet is, along with information on the source of the data.” I didn’t realize that you needed a field name, meaning, format and more! The second article “Basic Steps in Working with Data” was also super interesting in the data dictionary department as well. I did not realize how powerful data could be until I read that example of the Miami Herald printing that DUI story that put all the judges in a tizzy. One little oversight (which in their case, was the 0’s for community service) can bring down a whole article, or even a judge’s whole career into questioning.

When I did the NYC data, I chose to compare it by Region (NYC, NYS, & Non-NYC), and by Index crimes (Murder, Rape, Agg. Assault, & Robbery).

But when I did the Data chart on excel online, things did not turn out the way I wanted them to turn out.

The data did not make sense at all to me, so I had to chart it out by myself for it to actually convey what I was trying to convey.

There are many type of ways you can interpret this chart, but I did it from the highest number (murder:red , rape:pink, robbery:green, agg. assault:yellow) for each place (nys, nyc, non-nyc) of all time. But you can also compare each date, each statistic, both, lowest, you can choose one date to compare, etc.

All in all, this was great to experience!