Bottom stain Coffee stains
Darren Ravens

A Purple pot of golf at the end of the Search rainbow

by Darren Ravens

2006/08/22

Recently, Quirk, in conjunction with Rafiq, decided to run a little test using an obscure search term. Two domains were created and set up with identical web content, optimised for the surprisingly unpopular term Purple Pot of Golf. I’m quite shocked that some niche marketing guru hasn’t jumped on the monetisation potential of this sure-fire traffic driver ;-)  I guess that’s the Internet for you; awash with ripe opportunities.

Anyway, the aim of the experiment was to test whether search engines rank pages higher when their content is placed above their navigation.

So two sites were set up on separate domains (hosted on the same server), with just one key difference in the websites’ structure;

-          The first one, hosted at http://stokes.za.net/  and with a page title Purple Pot Of Golf | Rafiq, has its navigation positioned before the content – let’s call it the Stokes Page from here on

-          The second, titled Purple Pot Of Golf | Lyle and hosted at http://ybc.za.net/, has navigation positioned after the content - we’ll call this one the YBC Page

With all other factors being equal, we hoped to isolate this single variable and determine its effects on ranking.

Both pages were given a link from the Quirk homepage, titled test 1 (Stokes Page) and test 2 (YBC Page) – effectively this meant there’d be another variable but one of the links would have to be first so we saw no way around it. So, the moment of truth. How would the big G rank them?

A search for the keyphrase, (we first just typed in the phrase and later repeated the search using quotation marks) ranks the YBC Page first, damning the Stokes Page to supplemental hell. Of course we expected that one of them would have to get flagged as duplicate and the question was really just which one it’d be.

Checking the cached times we see that the YBC page got indexed at 17 Aug 2006 12:20:28 GMT while the Stokes page got cached at 17 Aug 2006 00:13:44 GMT. -  So while the Stokes page got indexed fully twelve hours earlier, it’s still considered to be the duplicator!

Interesting indeed.

Then, we checked what Yahoo! would do.

For the same search, Yahoo! ranks the Stokes page first and considers the YBC Page the duplicate. We don’t have the exact times of indexing but my assumption would be that the spiders found them in the same order as Google’s spiders. Running the search again using quotation marks, we’re surprised to find that now only the YBC page ranks, with the Stokes page supplemental.

These are just our initial findings, and we’re still investigating to be able to draw proper conclusions but at this point what I find interesting is:

-         Google appears to favour the YBC page, which has the navigation after the content.

-         This might suggest it considers the page more relevant

-         Using quotations to force a search for the entire string yields pretty much the same result

-         It takes Google twelve hours to go from the test1 link to the test 2 link…unless it came back an indexed the YBC page a second time and ignored the Stokes page.

-         On Yahoo! the Stokes page is favoured.

-         I’d speculate that this is because that page was indexed first

-         Searching the exact string on Yahoo! finds the YBC page again

-         Perhaps this suggests something about how Yahoo!’s duplicate content filter works

In hindsight, one variable we’d overlooked relates to the titles of the pages. The YBC page uses “Lyle” in the title. Since Lyle is an employee of Quirk, his name appears elsewhere on the site and this may mean the link to that page gets a boost of relevancy over the Stokes page.

We’re working to find any other variables that we may have missed and will hopefully be able to draw some solid conclusions from all of this. As always, we’d love to hear what you think about it.

Keep watching this blog for updates…

Comments

Interesting experiment. Reminds me of something similar Mike Davidson did a while back... http://www.mikeindustries.com/blog/archive/2006/01/the-roundabout-seo-test

Posted by JBagley on 2006/08/23

TYFYC Jason

Posted by rafiq on 2006/08/23

wow, i would have thought it'd be the other way around. is what I've been teached all this time, links first, content after. will play around on client sites, see if differences are noticed...

Posted by adrian kirsten on 2007/03/18

Does duplicate content still count? There are rumours that this is no longer a real issue? that the web is full of duplicate content and that the sites that do publish duplicate content are no longer penalised.

Posted by Paulina on 2008/10/13

@Paulina , thanks for your comment.

In the past 2 years, since this post was published, the duplicate content issue has evolved.

This SEOMoz post www.seomoz.org/blog/the-illustrated-guide-to-duplicate-content-in-the-search-engines explains it fairly simply.

Generally:

* Google will treat duplicate content which is an exact (or almost exact) replica of any previous indexed content piece, as duplicate content.

* Usually, the content piece which is published first gets recognised or indexed by Google... as the originator piece.

* However, we have seen instances where a site with a high page rank (PR) is designated/deemed the originator of the content, by virtue of its perceived authority in the eyes of the search engines - even though it may not have been the first to publish the content.

* There is open debate about whether Google "penalizes" sites which consistently or solely publish duplicate content.

We have seen instances where sites are penalized, not penalized and/or left unaffected (i.e. indexed) when publishing duplicate content. It seems that Google assesses any SERP penalties for duplicate content according to contextual criteria...and sites need to monitor if they are affected by this.

Naturally, there are exceptions - and Google will evolve its algorithm.

Posted by Suzan Gray on 2008/10/13

Thanks for the fast response. Lets maybe add content. There are several newspaper sites all sharing some stories (exact duplicates). Do we have to stop this? It seems that they are all being indexed (type in title you get a listing of story on most domains in various orders). Could we do better if we stopped this?

IOL is thinking about finding a new SEO provider to help us with redevelopment and we need someone that understands the publisher environment.

Posted by Paulina on 2008/10/13

Make a comment

To prevent GottaQuirk from becoming spam central, we block the use of certain words like porn, sex etc. We apologise for any inconvenience, but can't spend our lives deleting messages left by spammy friends.

Captcha
 
Pencil

eMarketing News

Subscribe to our fortnightly newsletter which is packed with interesting eMarketing news, views and other quirky titbits.

Subscribe

RSS

RSS to Email

Get our latest blog posts delivered straight to your inbox.

Follow us on Twitter

Follow us on Twitter


What's on offer?

Afrigator