At the time of writing, I collected the data with an automatic scraper plus some manual work. Soon afterwards, however, I wrote a thorough tutorial showing how to scrape and clean data from Metacritic and IMDB using Python: https://www.dataquest.io/blog/web-scraping-beautifulsoup/
I have recently started to work for Dataquest, so my recommendation can’t help but be a bit biased.
As I’ve been learning Data Science for close to a year now, this is what worked for me: Dataquest’s data scientist path + using extra-Dataquest resources for studying statistics + working a…
Thanks for the quick reply, David!
Clarifying Class Central’s purpose (free video-based courses only from universities) cleared up a lot of my doubts. I understand the need to justify the one-off with DC’s two courses, but, just to be clear, I didn’t and don’t have any problem with those courses being listed. My only issue…
LATER EDIT: I’ve recently started to work for Dataquest.io, but when I posted the reply below I wasn’t. Everything below remains unchanged.
Congratulations on your new job, David! I’ve been keeping an eye on your Medium posts ever since I first read your article about your DS curriculum, which I found helpful and inspiring.
It really depends on the distribution of all movies before they get filtered.
If the initial distribution is normal, then it is likely to become slightly skewed to the right after the quality filtering.
If the initial distribution is slightly skewed to the left, then it may become normal after…
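The first case above can be checked with a quick simulation. This is only a minimal sketch under stated assumptions: I assume that "quality filtering" means dropping every movie rated below some cutoff, and the mean, standard deviation, cutoff, and the `simulate_filter` helper are all made-up illustrative choices, not values taken from real rating data.

```python
import random
import statistics


def skewness(values):
    """Population skewness: the mean of the cubed standardized deviations."""
    mean = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return sum(((v - mean) / sd) ** 3 for v in values) / len(values)


def simulate_filter(n=100_000, mean=6.0, sd=1.5, cutoff=4.5, seed=0):
    """Draw normally distributed ratings, then keep only those >= cutoff.

    All parameters are hypothetical; the cutoff stands in for whatever
    quality filter removes the weakest movies.
    """
    rng = random.Random(seed)
    ratings = [rng.gauss(mean, sd) for _ in range(n)]
    kept = [r for r in ratings if r >= cutoff]
    return skewness(ratings), skewness(kept)


before, after = simulate_filter()
print(f"skewness before filtering: {before:.3f}")
print(f"skewness after filtering:  {after:.3f}")
```

Under these assumptions, the skewness before filtering should sit close to zero and the skewness after filtering should be positive, because cutting off the lower tail leaves the longer tail on the right.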
I really like the mathematical modelling you did to transform IMDB’s distribution.
However, I don’t agree with applying it to this particular case.
The main reason is that you’d have to apply it to the other variables as well if you want to compare them fairly, and this would cause skews into…
That is a really nice suggestion. I will try to see how I can actually do that when I have the time (I’ve just started working on another project).
For me, the tomatometer was quite problematic for two reasons:
Hey Andrew, it’s really great to see you have given some serious thought to my article. Thanks a lot for taking the time to share your thoughts!
I will try to address your issues one by one:
I. The demographics of those who rate movies