A few years ago, scraper sites were a menace to the search engines. Scraper sites have no original content at all. Instead, they run crawlers similar to the ones Google uses. A scraper searches Google for a particular keyword phrase and records the URLs that appear in the top 1,000 results for that phrase. It then crawls most of those URLs and fetches a block of content from each page. If a scraper crawls 500 URLs and takes 200 words from each, it ends up with 500 × 200 = 100,000 words, enough for a sizeable website, in a matter of hours.
Scrapers mainly target Wikipedia and other content sites that have a lot of rich content. Once all the content has been collected, the scraper site uses content generation software to create hundreds of pages from it. The result is a website with hundreds of pages of content, all stolen from other websites.
Because this process is so easy, thousands of scraper sites started appearing in search engines. Most of them were run by affiliates or MFA (Made for AdSense) spammers. They would create a scraper site in a few hours, stuff it with affiliate links, AdSense, or other ads, and go live. When Google or another search engine indexed these sites, it ranked their pages for the keywords found in the scraped content, and the traffic the sites received was funneled to the ads on those pages. Search engine manipulators and spammers earned huge sums of money running these sites, and many software products came onto the market just for running them.
The hard work of many website owners was stolen by these scraper sites, and in some cases a scraper site outranked the original content it had copied. This may have been because scraper sites carried a greater diversity of content than small websites with only 5 to 10 pages. Webmasters and site owners complained in forums and through spam reports to the search engines. For many years, scraper sites were easy to spot in search results.
Today scraper sites are rarely successful. They lost their value especially after the Panda update, which evaluates the quality of both the article and the site. Scraper sites were an important form of search engine manipulation, and one that the search engines found difficult to eradicate. They still appear in search results for other countries but are mostly filtered out of US results.
Images |
Image of a site with excessive links and scraped content.
Screen Shots |
Working Examples |
The image below shows an example of a scraper site.
The screenshot below is an extract from an ezine article about position reconciliation. The highlighted text in the article is used by a scraper site to add content to its own pages and improve its ranking.
The highlighted webpage in the Google search results is a scraper site that uses content from the ezine articles webpage. The site is actually a Chinese site, but it has scraped content from different websites to obtain higher search engine rankings.
The images below show the scraped content on the Chinese site. In the second image, the phrase scraped from the ezine article is also highlighted.
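The kind of match highlighted in this example can also be checked with a few lines of code. The Python sketch below is only an illustration and is not how any search engine detects scraped content. It breaks two texts into overlapping five-word sequences and reports what fraction of the suspected page's sequences also occur in the original article. The function names, the five-word window, and both sample texts are assumptions made up for this sketch.

```python
import re


def shingles(text, n=5):
    """Return the set of n-word sequences (shingles) in text, lowercased."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def copied_fraction(original, suspect, n=5):
    """Fraction of the suspect page's shingles that also occur in the original."""
    suspect_shingles = shingles(suspect, n)
    if not suspect_shingles:
        # Text shorter than n words has no shingles to compare.
        return 0.0
    shared = suspect_shingles & shingles(original, n)
    return len(shared) / len(suspect_shingles)


# Hypothetical sample texts, invented for illustration only.
original = ("Position reconciliation compares the positions held in two "
            "sets of records to find breaks between them.")
suspect = ("Cheap deals here. Position reconciliation compares the positions "
           "held in two sets of records. Click our ads.")

print(round(copied_fraction(original, suspect), 2))
```

A value close to 1 means nearly every run of words on the suspected page also appears in the original, while a value close to 0 means the two texts share almost nothing. A short window such as five words is a judgment call: smaller windows flag common phrases, larger ones miss lightly edited copies.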