26
MAR
2014

Web Scraping – Definition, Detection and Prevention

Web Scraping is the act of stealing data secretly from the web. It has become the most common threat to millions of online businesses today. The vast importance of data and the criticality to access it for multiple business purposes has inspired people to adopt various legal or...
22
JAN
2014
scraper-enemy

Know your enemy and learn how to prevent screen scraping

When protecting against screen scraping it is useful to understand whom you are up against.  This article will go through the common methods people and companies use to scrape data. Many scrapers are not aware that they are breaking the terms of use of a website.  A carefully...
15
JAN
2014
captcha-tests

Will a CAPTCHA test stop scraping?

Yes and No. CAPTCHA tests can be highly effective in the right place if the data is not too valuable for scrapers. There are two main ways of circumventing CAPTCHA tests, by using OCR (optical character recognition) software or to use labor in low cost countries to manually solve...
13
JAN
2014
how_to_block_ip_address

How can I block an IP from accessing my site?

Generally the hard part of stopping screen scrapers is not placing a block on them, but rather finding them in the first place. Once you have identified a scraper, it is essential to place the block as quickly as possible to stop the activity from the current source. When...
24
NOV
2011

Ryanair continue to work hard to prevent scraping

According to this article Ryanair stopped using captchas: http://www.travelweekly.co.uk/Articles/2020/11/21/38831/comment-what-now… The article makes a few bald statements of how dependant ryanair is on the travelagents. Ryanairs answer to this seems to have been to move...
29
SEP
2009

Malicious screen scraping: Worth preventing?

A number of recent cases have heightened the profile of screen scraping. But why should firms now be looking out for screen scraping issues that could affect them? Is it that much of a problem that it deserves a business’ attention? According to TJ McIntyre, a lecturer in...
24
JUL
2009

Cultuzz gets rid of screen scraping

Cultuzz Digital Media has announced that it is to no longer use screen scraping and its interfaces are now XML-based. The company asserted that screen scraping is not as reliable as an XML interface, which may scare off those looking to use screen scraping in a malicious manner....