previous: Blogs vs SEO: Problems with Wordpress and other Blog Software
next: Stopping Blog Comment Abuse and Spam
SEO Principles to Bear in Mind when Setting up a Blog
- using snippets,
- planning your content carefully, and
- proper use of the date archive or tags as navigation systems.
It is of primary importance that Google (or any search engine for that matter) find each article in your blog on one url only.
I would also recommend that it see this article belonging to as few categories / tags as possible.
Snippets
Snippets are an introduction to each article. These (instead of the article) are what should be displayed on the blog's home page and any category pages.
You can use the first paragraph or two of the main article as the snippet, but I prefer to have a unique snippet that is a quick summary of (or better still, a teaser for) the article.
By using snippets we guarantee that the main article text appears on one page only. This will go a long way to solving duplicate content issues, and should also ensure that anyone minded to link to the article is clear which page they need to link to.
Categories vs Tags
Basically the distinction I use is this: categories do not share content, tags do. In other words, an article could have multiple tags, but will only belong to one category. Therefore, even if you use snippets, the more tags that are used per article, the more likely it is that there will be duplicate or near-duplicate pages in the site.
The point is to plan your content and how it will be organised. Because you can tag an article after you've written it, many people get lazy with their classification systems. Instead, spend some time thinking about what you will be writing about and draw up a system to organise it in one consistent manner.
Depending on your niche, there might be grey areas when it comes to content classification, and tags are preferable to strict and exclusive categories. Or you might want to use tags as an 'assisted search facility', giving people an instant visual clue to what your blog is about.
Or let's say you spot a new trend or buzzword in your industry, write a post about it, create a new tag for this and then go through your old blog posts and tag the appropriate ones, so people can instantly find relevant content on your site organised under this new term.
All of these would be great uses of tagging. Unfortunately, this isn't how I see tagging used on most sites - in fact I've seen lots of blogs with nearly as many tags as they have posts. That's not only bad for SEO, it's just downright confusing.
If you're going to use tags instead of categories, then I personally would recommend inserting
<meta name="robots" content="noindex,follow">
onto the tag pages. Note that this is 'noindex', 'follow' (NOT 'nofollow'). This allows Google to crawl links on the pages, count anchor text and flow PageRank through them to your articles.
Using Categories
If you use categories, each article belongs to one category only, and has only one possible url path. This means that you can use url-rewriting to create virtual folders and a site hierarchy, so instead of urls like this:
www.somesite.com/blog/article.php?category_id=5&item_id=23
we could first swap the ID numbers for text identifiers:
www.somesite.com/blog/article.php?category=prime-ministers&item=tony-blair
which can then be rewritten to: www.somesite.com/blog/prime-ministers/tony-blair
The second is more user-friendly (people don't trust cryptic urls) and better for SEO. Furthermore, because there is one path to each article we can also use 'breadcrumb' navigation to help people see where they are in the site, i.e.
You are here: Home > Blog > Prime Ministers > Tony Blair
Using Tags
An article belongs to every tag it has. If we use the tag in the url path we will end up with duplicate pages. Using our 'Tony Blair' example, and supposing the article was tagged 'labour politicians', 'election candidates' and 'prime ministers', we'd have these pages:
www.somesite.com/blog/labour-politicians/tony-blair
www.somesite.com/blog/election-candidates/tony-blair
www.somesite.com/blog/prime-ministers/tony-blair
This is obviously no good as we've created three versions of the same article. We can fix this in two ways:
1) Abandon the Folder Hierarchy, 'noindex' the tag pages
Each article sits only in the blog folder i.e. www.somesite.com/blog/tony-blair. We lose our folder hierarchy, but Google may still associate the item with the paths it takes to reach it (i.e. via the tag pages). I'd recommend 'noindexing' the tag pages as explained above.
2) Use the Google Canonical Tag
The rel=canonical tag was developed by Google as a fix for problems like this.
Where you have pages that are very similar or practically identical, you decide which is your 'main' page and which are the 'dupe' pages. Then you place a tag in each page to let Google know your choice. So if we wanted Google to see
www.somesite.com/blog/labour-politicians/tony-blair
as the main page we'd place this tag:
<link rel="canonical" href="http://www.somesite.com/blog/labour-politicians/tony-blair">
into the page. Google says that it will then effectively amalgamate the three pages it sees; it will only index the main page, and no PageRank will be lost.
But what about the breadcrumb navigation?
The canonical fix is an easy one to implement without being a programmer, but fixing the breadcrumb navigation to display only the indexable click path to the document will require programming knowledge. I've seen cookies suggested as a means of controlling the breadcrumb navigation (in other words, show the hierarchy that was used to get to the page) but I don't like this. First, it doesn't help people who end up on the page without hitting a tag page first, and second cookies don't work with Google. I would prefer to show either one path or all paths to the item in the breadcrumb navigation i.e.
You are here: Home > Blog > Prime Ministers > Tony Blair (see also: 'Labour Politicians', 'Election Candidates')
Again, this will require programming knowledge to implement.
Different snippets depending on the tag?
If you're only using a few tags per article, then you could consider write more than one snippet per article. This will
- eliminate duplicate content (in fact you create extra content because you have more category / tag pages) and you can make them all indexable
- enable you to angle each snippet to the subject of the tag page
- send more PageRank to each article
The Date Archive
This is a feature on most blogs, but I don't think it is always required. The important thing is to think about the purpose of the blog.
If you intend the blog only to be used as a simple 'news' section and don't plan to update often, then it could be that a date archive would be your only system of navigation.
If you plan to use it more often and for different purposes, then categories or tags would be recommended. But do you need a date archive too? Ask yourself this: is it important for people to be able to see what you wrote on your blog in December 2009 - or will they only care what you have written about a specific subject?
If there is an element of 'current events' to the blog then a calendar would be useful. If however the content in the blog is 'evergreen', then I personally would deem a calendar less useful or even unnecessary.
If a date archive is deemed to be important enough to keep in the site, then the simplest way to deal with it is to insert
<meta name="robots" content="noindex,follow">
on all of the pages in the date archive. As stated above, This will stop Google from indexing them but allow PageRank to flow through them to the articles and the rest of the site, and allow Google to count the anchor text used to link to the articles.
Is that all?
Yes - and no. So far we've covered your content and how to present it to Google in the best way. In the next and last section of this article I'm going to talk about other people's content on your site - comments on your blog. If it's on your site, it is effectively your content because you are responsible to Google for it.
previous: Blogs vs SEO: Problems with Wordpress and other Blog Software
next: Stopping Blog Comment Abuse and Spam
