Scatter/Gather thoughts

by Johan Petersson

rel="nofollow" links

The major search engines Google, Yahoo, and MSN Search as well as several blog vendors/communities like Six Apart have announced support for a link attribute specifically meant to combat comment spamming.

The idea is to add rel attributes with the value nofollow to untrusted, user-submitted links (such as the links in blog comments or guestbooks), for example:

<a href="http://www.example.com/" rel="nofollow">example</a>

Search engines are then to ignore such links in page rank calculations, reducing the incentive for comment spamming. Similar mechanisms have been proposed by many web authors, particularly bloggers. Unlike some previous proposals, this is valid HTML and XHTML markup: rel attributes are already allowed in anchors (the a element).

I'd like to share some observations about this initiative:

In Google's Quality Guidelines the first basic principle is make pages for users, not for search engines. This is sound advice for several reasons, but perhaps most importantly it is because the search engine exist to assist users in finding content. In order to do so a search engine's model of the content needs to match that of the users, which is impossible if the search engine spider is unable to view pages like human visitors do. The rel="nofollow" initiative violates that principle.

I'm open to describing relationship semantics with rel, but links always imply some kind of relationship; to break the relationship, remove the link. A link is not necessarily an endorsement, and page relevancy calculations need to take this fact into account. But littering the web with explicit instructions to ignore certain links isn't productive. It does not solve anything in the short term, nor does it help search engines in the long term.

21 January, 2005