Part I. Building scrapers: Your first web scraper -- Advanced HTML parsing -- Starting to crawl -- Using APIs -- Storing data -- Reading documents. Part II. Advanced scraping: Cleaning your dirty data -- Reading and writing natural languages -- Crawling through forms and logins -- Scraping JavaScript -- Image processing and text recognition -- Avoiding scraping traps -- Testing your website with scrapers -- Testing your website with scrapers -- Scraping remotely -- Python at a glance -- The internet at a glance -- The legalities and ethics of web scraping
Summary
Learn web scraping and crawling techniques to access data from any web source in any format. Teaches basic web scraping mechanics, but also delves into more advanced topics, such as analyzing raw data or using scrapers for frontend website testing
Notes
Includes index
Online resource; title from PDF title page (EBSCO, viewed June 19, 2015)