• START
  • SCRAPING
    • SCRAPING DEFINED
    • SCRAPER BOTS
    • SCRAPING THREATS
    • SECTORS AT RISK
  • SERVICES
  • CLIENTS
  • ABOUT
  • RESOURCES
    • SCRAPING NEWS
    • CASE STUDIES
    • THE SCRAPING THREAT REPORT 2015
  • CONTACT

Scraping defined (also data scraping, web scraping or screen scraping)

What is scraping and why should you bother?

The growing phenomenon of systematic data theft known as “scraping” is definitely something to be taken seriously by online businesses who maintain large public databases on their web sites. As you read this article, you will learn more about why scraping can be a serious threat to your company.

Scraping (also web scraping, screen scraping or data scraping) is what you do when you copy large amounts of data from a web site – manually or with a script or program.

Scraping can sometimes be benevolent and totally acceptable like for example the search engine robots that index the web. You definitely want those to spider your site so that your customers can find you! On this web site however we will focus on malicious scraping which is carried out for commercial gain – and how to prevent it.

/ Martin Zetterlund, Anti Scraping Expert at Sentor Managed Security Services

 


What is malicious scraping?

Malicious scraping is systematic theft of intellectual property in the form of data accessible on a web site. This can be illustrated using an online directory as an example. They publish intellectual property online, names, addresses, and business information.

It is free for all to use the information as long as they comply with the term and conditions of the site. Unfortunately, scrapers do not care about terms and conditions, and will abuse the service by systematically downloading large amounts of data for personal gain.

The online directory looses control over its data which they have invested time and money to gather, maintain, and make available as a part of their service offering. In a worst case scenario, they wake up to a new competitor that is able to offer the same data as them.


Why it hurts your business

If a scrapers downloads all the data from your database and adds it to a competing site, you may suddenly have competition offering the same exact service.

Adding insult to injury, the company doing the scraping will not have the same overhead expenses as you, and can offer similar services at a significant discount to your target groups.

We see entire businesses threatened by out of control scraping every day. Many of these companies do not understand their exposure to the threat of scraping until it is too late.

Do  you understand the impact of scraping on your business today?

More about the Threats


Who is threatened by scraping?

All online businesses that share information on their websites are threatened by scraping. Examples of these include:

  • Online directories
  • Airlines
  • Betting
  • B2B-portals
  • Online property
  • Insurance

More about Sectors at Risk

Want to know more about ScrapeSentry?

More about ScrapeSentry! Back to Scraping Wiki!

Stop scraping and bad bots with ScrapeSentry

ScrapeSentry is a complete combination of technology, behavioral analysis, expertise and most importantly 24/7 human moderation. Find out how ScrapeSentry can secure your business!
Read more!

Find more information

  • Scraping wiki
    • Scraping
      • Scraping defined
      • Scraping techniques
      • How to detect scrapers
      • Web scraping in the ticketing industry
      • How can web scraping be harmful for Real Estate Portals?
      • Where do scraping attacks origin
      • Online retailers threatened by price scraping
    • Bots
      • Different types of bots
      • Web bots and how they affect businesses
      • Click Fraud Bots
      • Spam bots
      • Good bots – bringing your website crucial benefits
      • How much is legitimate web traffic
    • CAPTCHA
      • All you need to know about CAPTCHA
      • CAPTCHA and userability
      • Will a CAPTCHA test stop scraping
      • Common methods and tools to break CAPTCHA
    • How to stop scraping/bots
      • Things to consider when preventing scraping
      • Block an IP address from accessing my site
      • Common methods used to prevent scraping
    • Content protection
      • How to keep track of online content
      • Protecting personal data from scraping
      • Protect online content
      • Duplicate content is a problem
    • Legal implications
      • Is scraping legal?
      • Web scraping: legal or illegal?
      • Scraping and the Computer Fraud and abuse act

ScrapeSentry - The Anti Scraping Service

We offer guaranteed detection and scraping prevention in near real-time. A combination of technology, behavioral analysis, expertise and most importantly 24/7 human moderation.
More about ScrapeSentry!

The Scraping Threat Report 2015

The Scraping Threat Report 2014 is a report based on data from the world's largest database for scraping related activity. The report shows an huge increase in scraping related activity.
Download the report!

Recent Articles in our Newsroom

Distil Networks Acquires Sentor ScrapeSentry to Add 24/7 Security Operations Center and Expert Team of Analysts

January 13, 2020

When Reservation Bots Steals Your Favorite Table

November 10, 2020

Data Scraping – Terms & Conditions

September 07, 2020

How Python Is Used to Scrape Websites

September 02, 2020

Price Scraping a Growing Threat to Ecommerce Sites

August 20, 2020

Contact Distil Networks:

[email protected]
US: (866) 598-6787
UK: +44 203 3184751
EU: +46 8 545 333 50

East Coast Headquarters:

4501 North Fairfax Drive
Suite 120
Arlington, VA 22203

West Coast Headquarters:

49 Stevenson St.
Suite 200
San Francisco, CA 94105

European Headquarters:

Björn Trädgårdsgränd 1
116 21 Stockholm
Sweden
Copyright © 2016 Distil Networks. All rights reserved.