For the past two/three months (on and off) I have been developing a web spider framework that would bring together many web scraping concepts and methodologies and combine them in one library (package). That framework is Scrappy which is a play on the term (scraper happy or happy scraper), not to be misunderstood as a port of Scrapy the Python Scraper Framework.
For the sake of time I will just highlight some of the features that I think are innovative and add value.
♥ Scrappy can be used as DSL or OOP (even interchangeably). e.g.