I get it — You are tired of searching for datasets online for your machine learning project or maybe for analyzing a popular Twitter trend.
Today we will learn how to generate your own custom dataset from Twitter by using hashtag search. According to internetlivestats.com, every second, on average, around 6,000 tweets are tweeted which corresponds to over 350,000 tweets sent per minute and 500 million tweets per day. This makes Twitter an excellent place to get data for your projects as Tweets are an accurate representation of today’s natural language on social media.
Today we will learn to automate Lead Generation/Email Crawling with a simple python script.
Want to skip the post and see the good stuff directly? Here is the Github repo
Lead Generation is a very Lucrative business and people earn a ton of money just by finding emails to their client.
Let’s see what our end product will look like so that I won’t waste your time in case you don’t find this interesting.
Our crawler will visit each and every sub-page of the provided website and look for emails and then save…
I get it — you’re tired of waiting for your program to download images. It takes hours sometimes when I have to download thousands of images, and you can’t possibly keep waiting for your program to finish downloading these dumb images. You have a lot of important stuff to do.
Let’s build a simple image-downloader script that’ll read a text file and download all the images listed nicely in a folder super fast.
This is what we’ll be building at the end.