This document contains code for a web scraping application. It uses Sinatra to create an API endpoint that stores scraped URL and data. It also includes a Greasemonkey user script that posts scraped data using AJAX calls and redirects to the processed URL returned from the API. The user script scrapes Google Image search pages and sends the data to the Sinatra API for storage and processing.
1 of 27
Download to read offline
More Related Content
Introduce of the parallel distributed Crawler with scraping Dynamic HTML