Have a personal or library account? Click to login
Python Web Scraping Cover

Python Web Scraping

Successfully scrape data from any website with the power of Python

Paid access
|Sep 2025
Product purchase options

Key Features

    Book Description

    What you will learn

    • Extract data from web pages with simple Python programming
    • Build a threaded crawler to process web pages in parallel
    • Follow links to crawl a website
    • Download cache to reduce bandwidth
    • Use multiple threads and processes to scrape faster
    • Learn how to parse JavaScriptdependent websites
    • Interact with forms and sessions
    • Solve CAPTCHAs on protected web pages
    • Discover how to track the state of a crawl

    Who this book is for

    This book is aimed at developers who want to use web scraping for legitimate purposes. Prior programming experience with Python would be useful but not essential. Anyone with general knowledge of programming languages should be able to pick up the book and understand the principals involved.

    Table of Contents

    1. Introduction to Web Scraping
    2. Scraping the data
    3. Caching the html
    4. Concurrent downloading
    5. Dynamic content
    6. Working with forms
    7. Cracking CAPTCHA
    8. Tracking with Scrapy
    9. Overview
    PDF ISBN: 978-1-78216-437-1
    Publisher: Packt Publishing Limited
    Copyright owner: © 2015 Packt Publishing Limited
    Publication date: 2025
    Language: English
    Pages: 174