created on 23 Jun 2008, by Syndication, read more…
An API for scraping the Internet via cURL, HTMLTidy, and SimpleXML. Makes scraping sites, parsing XML, and following hyperlinks a snap. Can be extended to implement proxy rotation, delayed hits, useragent rotation, etc.