IBM Data Science Capstone Final Project
Find the most suitable location to open a new North & South Indian restaurant in New York, USA


Introduction
This final project explores the best locations for Indian restaurants throughout the city of New York. Food Business/Entrepreneur News stated that worldwide Indian Food sales were up for the second year in a row with the United States holding the largest market (Donley, 2018). New York is a major metropolitan area with more than 8.4 million (Quick Facts, 2018) people living within city limits. Most of the Indian immigration into the United States occurred during the late 19th and early 20th century with over two million immigrants between 1900 and 1910. With almost 600,000 Manhattan inhabitants reporting Indian ancestry, the need to find and enjoy Indian cuisine is on the rise. This report explores which neighborhoods and boroughs of New York City have the most as well as the best Indian restaurants. Additionally, I will attempt to answer the questions “Where should I open an Indian Restaurant?” and “Where should I stay If I want great Indian food?”
Data
In order to answer the above questions, data on New York City neighborhoods, boroughs to include boundaries, latitude, longitude, restaurants, and restaurant ratings and tips are required.
New York City data containing the neighborhoods and boroughs, latitudes, and longitudes will be obtained from the data source: https://cocl.us/new_york_dataset
New York City data containing neighborhood boundaries will be obtained from the data source: https://data.cityofnewyork.us/City-Government/Borough-Boundaries/tqmj-j8zm
All data related to locations and quality of Italian restaurants will be obtained via the FourSquare API utilized via the Request library in Python.
Methodology
• Data will be collected from https://cocl.us/new_york_dataset and cleaned and processed into a dataframe.
• FourSquare is used to locate all venues and then filtered by Indian restaurants. Ratings, tips, and likes by users will be counted and added to the dataframe.
• Data will be sorted based on rankings
• Finally, the data be will be visually assessed using graphing from various Python libraries.
Problem Statement
- What is/are the best location(s) for Indian cuisine in New York City?
- In what Neighborhood and/or borough should I open an Indian restaurant to have the best chance of being successful?
- Where would I go to New York City to have the best Indian food?
Observations
Map of New York
You can now see Join, Labels and Level_labels columns as the last three ones in the below table. You can also see a clustered map boroughs of New York Indian Restaurant in the below.

Number of Indian restaurants in Borough

Results / Conclusion
Bronx and Manhattan have the best rated Indian restaurants on average. The Bronx has the highest number of Indian restaurants per borough. However, of note, Baychester, City Island, and co-op city, etc. has the neighborhood in all of NYC with the same Indian Restaurants. Despite Manhattan having the least number of neighborhoods as a comparison to the Bronx but, it has the most Indian restaurants. Based on this information, I would state that Manhattan and Bronx are the best locations for Indian cuisine in NYC. To have the best shot of success, I would open an Indian restaurant in the Bronx. The Bronx has multiple neighborhoods with average ratings exceeding 8.0 of a scale of 1.0 to 10.0 and has the least number of Indian restaurants making competition easier than in other boroughs. As a final note, all of the above analyses is depended on the adequacy and accuracy of Four Square data. A more comprehensive analysis and future work would need to incorporate data from other external databases.
. References:
[1] New York Dataset — Wikipedia
[2] New York City data containing neighborhood boundaries will be obtained from the data source: https://data.cityofnewyork.us/City-Government/Borough-Boundaries/tqmj-j8zm
[3] Foursquare API
