Build Your Own Dataset With Beautiful Soup

Dr. Monica
The Startup
Published in
3 min readJul 12, 2020

--

In this world of the Internet, the amount of data that is surrounded by you is like a vast ocean for any field of research or personal interest. To effectively harvest this data, you’ll need to develop the skillset for web scraping. With Web scraping, we can build our own dataset as per our requirements for further analysis. Using this technique only, Amazon’s rating, Netflix review, IMDB ratings, and many other datasets are prepared and analyzed.

I learned recently Web Scrapping and then suddenly I got an exciting idea if I can apply to scrape the Medium data to find the list of publications and do further analysis of it.

Also, there’s a key point to remember here:

The things that we learned from examples or toy implementation is always different for real-time examples.

Let’s start without any delay.

This blog speaks about the code for web scraping multiple pages using Beautiful Soup. The website I have scrapped to build the dataset for this analysis is “https://toppub.xyz/publications”. To implement this…

--

--

Dr. Monica
The Startup

Research aspirant in Machine learning and Data Science. Aspirant to blog about life and it’s experience