Information Architecture for SEO

INTERACTIVE STRATEGY - 55, AV. MONT-ROYAL O., SUITE 999, MONTRÉAL (QC) H2T 2S6 T 514.524.7149 NVISOLUTIONS.COM

Search Engine Optimization:Indexation and Link Juice

Indexation and Link Juice

INTERACTIVE STRATEGY – NVISOLUTIONS.COM

The Holistic SEO Recipe:

• One Part Marketing• One Part Editorial• One Part Webmaster• One Part SEO education

Don’t expect to find it in one person!

Put together your best team worker from each group and have them ALL learn SEO then co-ordinate on its implementation.

Marketing


:: Keywords to Fuel Site Expansion

Editorial


:: Page and Link Placement

Webmasters


:: Link Tech, Sitemaps, Publishing, etc...

EVERYBODY NEEDS TO GET THE SEO BASICS AND SEE

THE SEO BIG PICTURE

What is it they all need to ‘get’?


• That search engines ‘crawl’ sites by following HTML and other links

• That the quality and quantity of links pointing to a page = Link Popularity

• Pages Need Link Popularity (or juice) to get indexed and rank

• YOU get to control which pages on your site get indexed, and which ones get link juice.

• A page being indexed by search engines is separate from that page’s ability to accumulate or pass on link juice.

Link Juice and Indexation Cheat Sheet:


Tag/Command

Indexation Link Juice

Robots.txt(file in root of site)

EG: Disallow: /news/pdf-copies/

Stops pages or directories from appearing in Search Engine indexes – (except ‘uncrawled references’ )

Pages ‘blocked’ by robots.txt can still accumulate and pass link-juice

Block what you don’t want indexed:

Session IDs / dupe URLs: Disallow: *partner=* Entire directories: Disallow: /news/pdf-copies/ Internal SERPs: Disallow: *car-search-query=*External affiliate links: Disallow: *GO.cgi*

If excluded pages have external links, expect 'URL-only' listings: No title, snippet, size or cache. For no listing at all, allow bots & use Meta noindex

On-Page Meta No-Index(<head> of page)

EG: <meta name=“robots” content=“noindex”>

Stops the page from appearing in Search Engine indexes entirely

The page can still accumulate and pass link-juice

You may want to use Meta Noindex if: you can’t alter your robots.txt, or if robots.txt standard is not flexible enough, or if you don’t want URL listings



Rel=nofollow(in an <a href> link)

EG: <a href=“http://non-trusted-site.com” rel=“nofollow”>

Stops spiders from following a specific link. They don’t crawl or discover through nofollow links.

Stops Link Juice from flowing through a specific link

Can be used from one domain to another when a link does not imply trust.

Lots of controversy recently on its use within a domain to ‘sculpt PR flow’

301 Redirection(many types of implementation)

EG: redirect /old/page.php /new_page.php [301, permanent]

Spiders follow redirect and discover new pages

Search Engines transfer link juice from old pages to new pages

If any URLs change, this is the best way to shift link juice from old to new.301s are the only way to transfer link juice from one domain to another.

Tag/Command


On-Page Meta No-follow(<head> of page)

EG: <meta name=“robots” content=“nofollow”>

Stops spiders from following the links on the page (which may still get indexed via other links)

The pages can still accumulate link-juice (and rank), but can’t pass it on

You may wish to nofollow an entire page like a list of paid sponsors.



Tag/Command


Canonicalization tag(<head> of page)

EG: <canonical =“/proper/product/page.php“>

Spiders go to referred page like a 301 redirect . Does not work across domains.

Search Engines transfer link juice from variation pages to real page

• A new approach to both indexation control and link juice control• Supported by The Big Three: Google, Yahoo, MSN• May be cheaper than redoing your entire site from scratch, for now• May see faster results than redoing your entire site• May become a maintenance nightmare• Can turn out to be as or more complex than doing it right from scratch

Javascript Link EG: <div onclick="document.location.href='http://www.domain.com/'">

Google tries to crawl and index if URL is easily to access – in onclick or href

If crawlable, Google will try to pass link juice

• Before rel=nofollow, many SEOs used uncrawlable JS links for sculpting• May not carry as much weight, and should not be used as main navigation

Two Free ToolsGoogle Webmaster Central:

• Identify Crawl problems (spider data over time!)• Find duplicate titles and meta descriptions• Quickly identify 404 issues• List your pages by internal or external links• Manage and find errors in sitemaps• Test your robots.txt file against specific URLs• Basic domain canonicalization (www to non-www)

Xenu Link Sleuth:• Find broken links (sort by status)• Find duplicate title tags (sort by title)• Find heavy pages (sort by size)• Find pages too many click from home (sort by level)• Find pages with too few internal links (sort by In links)• Find images without ALT text (sort by type, scan title

field)• Test non-canonical URLs (from a text file) and view

status• Find outgoing links to broken pages or expired content• Find bot traps (like open ended calendars)



Dos and Don’tsDo:

• Define your IA and determine canonical URLs for hub pages across all major categories, with expansion ability

• Use breadcrumb style navigation – Put all new content in it!(EG: Home > Kitchen > Major Appliances > Stoves)

• Include relevant category-specific navigation at each level

• Make interlinking mandatory! Include in-content links to similar pages around the site, and give links from other pages

• Keep updated HTML and XML sitemaps for all new content

• Learn all the ways to control indexation and link juice flow

Don’t:• Let the same content appear on more than one URL• Just throw content up without linking to it, or linking

from it• Spread your link juice thin over pages don’t have

unique content• Leave open ended page scripts like calendars• Don’t archive poorly without respect to your IA• Return server headers other than 404 for error pages • Think you can fix link juice distribution issues with

robots.txt


Thank You!

Keep up with NVI:

Website - NVIsolutions.comNVI Blog EN - NVIsolutions.com/blogNVI Blog FR - GO-Referencement.org

E-mail me: [email protected] me on Twitter @NaoiseOsborne


Information Architecture for SEO

Technology

link juice control

linkjuice block

specific link

link popularity pages

link tech

link placement interactive

scratch javascript link

entire page