Fig. 1 — Scraping with Python — A gentle introduction to web data extraction (Image by author)

A Gentle Introduction to Web Data Extraction | Scraping With 🐍

Data is everywhere. Data is on the web. But what is the web? And how to extract data from it?

Thibaud Lamothe 🤠
Published in
8 min readFeb 3, 2021

--

This post is the first of the series “Scraping with Python 🐍 ” where I intend to explain and teach from basic to most advanced scraping concepts.

Introduction

You want to collect data from the web but don’t know where to start? You followed the Scrapy startprojecttutorial but your spider is still not crawling the web? You’ve never even heard of the word crawling ? Great, this series of articles is for you: you’re about to discover the amazing world of scraping!

  • Today, in Part 1, we’ll discover the underlying principles of scraping to the web.
  • Part 2 will be an explanation of how to programmatically extract data from a single web page.
  • Part 3 will be an introduction to Scrapy and how to run a project within this wonderful Python framework.
  • Part 4 will be a use case on a real-life website. A sort of zero to hero to grab product data from one of the biggest platforms of the world.

How does the web work?

--

--

Thibaud Lamothe 🤠
Geek Culture

Head of Data @ Iroko, from Paris | Visit my website www.etomal.com | Unlock unlimited content medium.etomal.com/membership | 🤠