Web scraping news
The Latest Scrapings
The latest web scraping news and incidents from all over the world, commented on by scraping experts from Sentor Managed Security Services.
2008-07-15 - How CAPTCHA got trashed
Computerworld has an article about how CAPTCHA got trashed. It mentions products used to beat Craigslist anti-spam mechanism and how Cragslist try to fight back against spammers using phone verification.
2008-07-06 - Scraping case in Irish court
The Sunday Business Post reports that Ryanair is taking legal measures against Bravofly arguing that screen scraping violates the terms and conditions of their website. Read more about this and other legal scraping information.
2008-04-14 - More news about broken captchas
The Register has an article about British researchers showing that the MSN captchas are crackable. Using a normal home computer they manage to read them with a high success rate generally destroying their whole purpose. This is especially interesting in the light of the reports from google that spammers have been able to create massive amounts of email accounts on their gmail service.
2008-04-07 - Legal issues with web scraping
Article about the legal aspects of screen scraping. Before moving into the legal area the article goes through defining screen scraping and a couple of the more common countermeasures.
2008-03-18 - Captchas does not stop scraping
Labor cost varies a lot around the globe and this shows how a simple image test is not enough to stop scraping. Humans in low cost countries are hired to break CAPTCHA images designed to protect free services from automatic signups.
2008-02-04 - Scraping and data theft
SC Magazine has an article about scraping and data theft. Some comments on the article.
2007-12-20 - Article about scraping in Wired
A lengthy article in Wired about scraping, Should Web Giants Let Startups Use the Information They Have About You? The article is focused around web 2.0 and scraping showing both sides of it, the smaller start-ups that try to build services around someone else's data and the large existing services working on preventing them from scraping.
2007-12-17 - Facebook Sues Porn Site for scraping
The popular social networking website Facebook has filed a lawsuit against a Canadian company for "unauthorized attempts to access and harvest proprietary information" - Scraping that is.
On more than 200,000 occasions during a two-week period it is said that the Canadian porn company Istra Holdings harvested information from the Facebook site with the help of automated spidering and scraping tools. Read more about the the scraping lawsuit at NewsFactor.

Risk assessment




