• START
  • SCRAPING
    • SCRAPING DEFINED
    • SCRAPER BOTS
    • SCRAPING THREATS
    • SECTORS AT RISK
  • SERVICES
  • CLIENTS
  • ABOUT
  • RESOURCES
    • SCRAPING NEWS
    • SCRAPESENTRY THREAT REPORT 2014
    • CASE STUDIES
    • SCRAPING FAQ
    • SCRAPING TERMINOLOGY
  • CONTACT

Blog Post

31
JUL
2009

Data scraping ‘can be simple or very hard’

Tags : data theft, how it works, scraping
Posted By : Martin Zetterlund
Comments : Off

For those people that carry out screen scraping, it can either be a simple or particularly difficult task, it has been suggested. This depends on how complex the source is, according to Martin Streicher, writing for Linux Magazine.

The tools for carrying out scraping activity are mainly the same, whatever the task is, Mr Streicher explained. He admitted to scraping himself, noting that he had probably scraped tens of sites in the past for purposes such as aggregating and analysing sales data.

There are a number of tasks that those looking to carry out scraping activities will need to take on, he explained. The first step they will have to take will be the identification of content they are interested in, then moving on to finding those sites that have the desired information, Mr Streicher asserted. Scrapers will then need to determine if the data on the site is accessible and the find or create tools to collect pages and extract data, he added.

People that do carry out scraping activities may run into trouble, however, as recently highlighted by Ryanair’s announcement that it has lodged proceedings in the High Court in Dublin against Travelviva AG, a German screen scraping ticket tout. The airline has claimed that Travelviva has been carrying out unauthorised screen scraping as well as reselling Ryanair’s flights with unjustified mark-ups. Ryanair said that it is planning to carry out more actions against other European unauthorised screen scrapers in the coming weeks.

“Ryanair is determined to continue its crusade against screen scraping ticket-tout websites until the last screen scraper stops overcharging unsuspecting consumers and breaching Ryanair’s copyright and terms of use of www.ryanair.com,” said Ryanair’s Daniel de Carvalho.

“We are confident that unauthorised screen scraping and overcharging of consumers will eventually be outlawed throughout Europe, to the benefit of consumers and legitimate businesses,” he added.

 

Others also read

  • Screen scraping: The basicsScreen scraping: The basics
  • Protecting personal data from screen scraping
  • Google Launches Google Scraper Report Form

Social Share

  • google-share

Need to Talk to an Expert?

We have helped several companies in various sectors since 2006. Are you afraid that your business is at risk? Then you should talk to one of our anti scraping experts. We operate with integrity and respect your confidentiality.
Contact us today!

Recent Articles

scraping_problems_in_ticketing

The Scraping Problem in Ticketing - View Slideshow

April 09, 2020
scraper_report_tool

Google Launches Google Scraper Report Form

April 07, 2020
scraper_bots_linkedin

Competitor Used Scraper Bots in Order to Copy Linkedin Profiles

April 04, 2020
scrapers_in_ticketing

Ticketmaster Sues Notorious Ticket Scraper Higs

March 28, 2020

Web Scraping – Definition, Detection and Prevention

March 26, 2020

Head Office:

Sentor MSS AB
Björns Trädgårdsgränd 1
116 21 Stockholm
Sweden
Phone:+46 8 545 333 00

UK Office:

Sentor MSS UK
35-37 Blackstock Road
London N4 2JF, UK
UK Phone: +44 77 69 75 63 77
USA/Canada Toll Free: 1-800-351-1691

Latest News

scraping_problems_in_ticketing

The Scraping Problem in Ticketing - View Slideshow

April 09, 2020
scraper_report_tool

Google Launches Google Scraper Report Form

April 07, 2020
Copyright © 2014 ScrapeSentry. All rights reserved