Grab - Site Scraping Framework

Grab is site scraping framework. Grab could be used for:

  • website data mining
  • work with network API
  • automation of actions performed on websites i.e. creation of profile on some site

Grab consists of following parts:

  • Grab interface for creating network requests and working with results of these requests. This interface is good for simple scripts where is no need in multithreading.
  • Grab::Spider interface which allows to develop complex multithreaded asynchronous site scrapers. This interface has two main benefits:
  1. It restrict you spider to have clean structure
  2. It allows to perform multiple concurrent requests without big CPU/memory consumption

Grab User Guide

Grab is powerful python site scraping framework which allows to build both simple scripts and complex multithreaded website spiders.

Translations

Useful Links

This Page

Fork me on GitHub