Duplicate content is a hotly debated issue when it comes to how it affects your web-site ranking. And it’s become an even bigger issue over time as spammers and other malicious Internet users have taken to the practice of content scraping, or scraping the content from a web site to use on their own with only minor changes to the appearance, not to the content itself.
Content scraping has become such a problem that search engines now look for duplicate copy, even when it’s hidden behind a link like the Similar Pages that Google uses for related content. If they find it, your site may be lowered in the rankings or even delisted completely.
Still, the duplicate-copy issue isn’t as simple as it may seem. Some people think there’s too much worry about it, whereas others insist the problem needs to be addressed. And both are right to some degree. Let me explain.
First, you need to understand that not all duplicate content is the same kind. You need to appreciate some differences.
-
Reprints:This is duplicate content published on multiple sites with the permission of the copyright holder. These are the articles that you or others create and then distribute to create links back to your site or to sites that are relevant to the content of yours. Reprints are not bad duplicate content, but they can get your site thrown into the realm of Similar Pages, which means they’ll be buried behind other results.
-
Site Mirroring: This is the kind of duplication that can cause one or more of your sites to be delisted from a search engine. Site mirroring is literally keeping exact copies of your web site in two different places on the Internet. Web sites used to practice site mirroring all the time as a way to avoid downtime when one site crashed. These days, server capabilities are such that site mirroring isn’t as necessary as it once was, and search engines now “dis-include” mirrored content because of the spamming implications it can have. Spammers have been known to mirror sites to create a false Internet for the purpose of stealing user names, passwords, account numbers, and other personal information.
Read the rest of this entry »
Tags: duplicate content




