INTERACTIVE STRATEGY - 55, AV. MONT-ROYAL O., SUITE 999, MONTRÉAL (QC) H2T 2S6 T 514.524.7149 NVISOLUTIONS.COM Search Engine Optimization: Indexation and Link Juice
Information Architecture for SEO

May 19, 2015

A look at Information Architecture for SEO. A presentation given by Naoise Osborne (NVI) at Search Engine Strategies (SES) Toronto on June 9, 2009.
Transcript
Page 1: Information Architecture for SEO

INTERACTIVE STRATEGY - 55, AV. MONT-ROYAL O., SUITE 999, MONTRÉAL (QC) H2T 2S6 T 514.524.7149 NVISOLUTIONS.COM

Search Engine Optimization: Indexation and Link Juice

Page 2: Information Architecture for SEO

Indexation and Link Juice

INTERACTIVE STRATEGY – NVISOLUTIONS.COM

The Holistic SEO Recipe:

• One Part Marketing
• One Part Editorial
• One Part Webmaster
• One Part SEO education

Don’t expect to find it in one person!

Bring together your best team player from each group, have them ALL learn SEO, then have them coordinate on its implementation.

Page 3: Information Architecture for SEO

Marketing

:: Keywords to Fuel Site Expansion

Page 4: Information Architecture for SEO

Editorial

:: Page and Link Placement

Page 5: Information Architecture for SEO

Webmasters

:: Link Tech, Sitemaps, Publishing, etc...

Page 6: Information Architecture for SEO

EVERYBODY NEEDS TO GET THE SEO BASICS AND SEE THE SEO BIG PICTURE

Page 7: Information Architecture for SEO

What is it they all need to ‘get’?

• That search engines ‘crawl’ sites by following HTML and other links

• That the quality and quantity of links pointing to a page = Link Popularity

• That pages need link popularity (or ‘juice’) to get indexed and to rank

• That YOU get to control which pages on your site get indexed, and which ones get link juice

• That a page being indexed by search engines is separate from that page’s ability to accumulate or pass on link juice
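
One way to build intuition for how link juice accumulates is a toy PageRank-style iteration. The sketch below is an illustration only: the slides do not describe any search engine's actual formula, and the function name `link_juice`, the damping value of 0.85, and the sample URLs are all assumptions made for this example.

```python
def link_juice(links, damping=0.85, iterations=50):
    """Toy PageRank-style scores for a dict mapping page -> list of linked pages."""
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)

    # Start with the same score on every page.
    score = {page: 1.0 / len(pages) for page in pages}
    for _ in range(iterations):
        # Every page keeps a small baseline share each round.
        new_score = {page: (1.0 - damping) / len(pages) for page in pages}
        for page in pages:
            # A page with no outgoing links shares its score with all pages.
            targets = links.get(page) or pages
            share = damping * score[page] / len(targets)
            for target in targets:
                new_score[target] += share
        score = new_score
    return score


# Hypothetical four-page site: every page links back to the home page.
SAMPLE_SITE = {
    "/": ["/a.html", "/b.html"],
    "/a.html": ["/"],
    "/b.html": ["/", "/a.html"],
    "/c.html": ["/"],  # nothing links to this page
}

if __name__ == "__main__":
    for page, value in sorted(link_juice(SAMPLE_SITE).items()):
        print(f"{page}: {value:.3f}")
```

In this sample the home page ends up with the highest score because every other page links to it, while the page with no incoming links keeps only the baseline share.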

Page 8: Information Architecture for SEO

Link Juice and Indexation Cheat Sheet:

Tag/Command: Robots.txt (file in root of site)
EG: Disallow: /news/pdf-copies/
Indexation: Stops spiders from crawling pages or directories, which keeps them out of search engine indexes (except as ‘uncrawled references’).
Link Juice: Pages ‘blocked’ by robots.txt can still accumulate and pass link juice.
Notes: Block what you don’t want indexed:
• Session IDs / dupe URLs: Disallow: *partner=*
• Entire directories: Disallow: /news/pdf-copies/
• Internal SERPs: Disallow: *car-search-query=*
• External affiliate links: Disallow: *GO.cgi*
If excluded pages have external links, expect ‘URL-only’ listings: no title, snippet, size or cache. For no listing at all, allow bots and use Meta noindex.

Tag/Command: On-Page Meta No-Index (<head> of page)
EG: <meta name="robots" content="noindex">
Indexation: Stops the page from appearing in search engine indexes entirely.
Link Juice: The page can still accumulate and pass link juice.
Notes: You may want to use Meta Noindex if you can’t alter your robots.txt, if the robots.txt standard is not flexible enough, or if you don’t want URL-only listings.
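
The two controls on this slide can be checked with Python's standard library, as in the minimal sketch below. It is not part of the original deck: the helper names `is_crawl_allowed` and `has_noindex` and the example.com URLs are hypothetical. Only the plain directory rule from the cheat sheet is tested, because `urllib.robotparser` does not implement the wildcard patterns shown above.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser

# Directory rule copied from the cheat sheet. Wildcard rules are left out
# because urllib.robotparser only does simple prefix matching.
ROBOTS_TXT = """\
User-agent: *
Disallow: /news/pdf-copies/
"""


class MetaRobotsParser(HTMLParser):
    """Collects the directives of every <meta name="robots"> tag on a page."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            self.directives.update(d.strip().lower() for d in content.split(","))


def is_crawl_allowed(robots_txt, url, agent="*"):
    """Return True if the given robots.txt text lets `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def has_noindex(html):
    """Return True if the page carries a meta robots noindex directive."""
    parser = MetaRobotsParser()
    parser.feed(html)
    return "noindex" in parser.directives


if __name__ == "__main__":
    print(is_crawl_allowed(ROBOTS_TXT, "http://example.com/news/pdf-copies/a.pdf"))
    print(has_noindex('<head><meta name="robots" content="noindex"></head>'))
```

Note that a spider can only see a meta noindex tag on a page it is allowed to crawl, which is why the slide says to allow bots when you want no listing at all.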

Page 9: Information Architecture for SEO

Link Juice and Indexation Cheat Sheet:

Tag/Command: Rel=nofollow (in an <a href> link)
EG: <a href="http://non-trusted-site.com" rel="nofollow">
Indexation: Stops spiders from following a specific link. They don’t crawl or discover pages through nofollow links.
Link Juice: Stops link juice from flowing through a specific link.
Notes:
• Can be used from one domain to another when a link does not imply trust.
• Lots of controversy recently on its use within a domain to ‘sculpt PR flow’.

Tag/Command: 301 Redirection (many types of implementation)
EG (Apache mod_alias): Redirect 301 /old/page.php /new_page.php
Indexation: Spiders follow the redirect and discover new pages.
Link Juice: Search engines transfer link juice from old pages to new pages.
Notes:
• If any URLs change, this is the best way to shift link juice from old to new.
• 301s are the only way to transfer link juice from one domain to another.

Tag/Command: On-Page Meta No-follow (<head> of page)
EG: <meta name="robots" content="nofollow">
Indexation: Stops spiders from following the links on the page (the linked pages may still get indexed via other links).
Link Juice: The page can still accumulate link juice (and rank), but can’t pass it on.
Notes: You may wish to nofollow an entire page, such as a list of paid sponsors.
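
To audit which links on a page are marked nofollow, either link by link or for the whole page, something like the sketch below could be used. It relies only on Python's standard `html.parser`; the class and function names are hypothetical, and it reports how the page is marked up, not how any particular search engine will treat it.

```python
from html.parser import HTMLParser


class LinkAudit(HTMLParser):
    """Records each <a href> with its rel=nofollow flag, plus any page-level nofollow."""

    def __init__(self):
        super().__init__()
        self.page_nofollow = False
        self.links = []  # list of (href, link_has_nofollow)

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            tokens = [t.strip().lower() for t in (attr.get("content") or "").split(",")]
            if "nofollow" in tokens:
                self.page_nofollow = True
        elif tag == "a" and attr.get("href"):
            rel_tokens = (attr.get("rel") or "").lower().split()
            self.links.append((attr["href"], "nofollow" in rel_tokens))


def audit_links(html):
    """Return (followed, nofollowed) lists of hrefs found in the HTML."""
    parser = LinkAudit()
    parser.feed(html)
    followed, nofollowed = [], []
    for href, link_nofollow in parser.links:
        # A page-level meta nofollow applies to every link on the page.
        if parser.page_nofollow or link_nofollow:
            nofollowed.append(href)
        else:
            followed.append(href)
    return followed, nofollowed


if __name__ == "__main__":
    sample = (
        '<a href="/kitchen/">Kitchen</a>'
        '<a href="http://non-trusted-site.com" rel="nofollow">Sponsor</a>'
    )
    print(audit_links(sample))
```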

Page 10: Information Architecture for SEO

Link Juice and Indexation Cheat Sheet:

Tag/Command: Canonicalization tag (<head> of page)
EG: <link rel="canonical" href="/proper/product/page.php">
Indexation: Spiders go to the referred page, as with a 301 redirect. Does not work across domains (as of June 2009).
Link Juice: Search engines transfer link juice from variation pages to the real page.
Notes:
• A new approach to both indexation control and link juice control
• Supported by The Big Three: Google, Yahoo, MSN
• May be cheaper than redoing your entire site from scratch, for now
• May see faster results than redoing your entire site
• May become a maintenance nightmare
• Can turn out to be as complex as, or more complex than, doing it right from scratch

Tag/Command: Javascript Link
EG: <div onclick="document.location.href='http://www.domain.com/'">
Indexation: Google tries to crawl and index the link if the URL is easy to access, in an onclick or href.
Link Juice: If crawlable, Google will try to pass link juice.
Notes:
• Before rel=nofollow, many SEOs used uncrawlable JS links for sculpting
• May not carry as much weight, and should not be used as main navigation
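
As an illustration of how a canonical tag maps URL variations back to one real page, the sketch below reads the tag from a page and resolves it against the page's own URL. The names `CanonicalParser` and `canonical_url` and the example.com address are assumptions for this example, and only Python's standard library is used.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class CanonicalParser(HTMLParser):
    """Remembers the href of the first <link rel="canonical"> tag."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        rel_tokens = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel_tokens and self.canonical is None:
            self.canonical = attr.get("href")


def canonical_url(page_url, html):
    """Return the absolute canonical URL, or page_url if no tag is present."""
    parser = CanonicalParser()
    parser.feed(html)
    if parser.canonical is None:
        return page_url
    # Relative canonical hrefs are resolved against the page's own URL.
    return urljoin(page_url, parser.canonical)


if __name__ == "__main__":
    page = "http://www.example.com/proper/product/page.php?partner=abc"
    markup = '<head><link rel="canonical" href="/proper/product/page.php"></head>'
    print(canonical_url(page, markup))
```

Running a check like this over a crawl makes it easier to spot variation pages that lack the tag or point at the wrong target, which is where the maintenance burden mentioned above tends to show up.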

Page 11: Information Architecture for SEO

Two Free Tools

Google Webmaster Central:
• Identify crawl problems (spider data over time!)
• Find duplicate titles and meta descriptions
• Quickly identify 404 issues
• List your pages by internal or external links
• Manage and find errors in sitemaps
• Test your robots.txt file against specific URLs
• Basic domain canonicalization (www to non-www)

Xenu Link Sleuth:
• Find broken links (sort by status)
• Find duplicate title tags (sort by title)
• Find heavy pages (sort by size)
• Find pages too many clicks from home (sort by level)
• Find pages with too few internal links (sort by In links)
• Find images without ALT text (sort by type, scan title field)
• Test non-canonical URLs (from a text file) and view status
• Find outgoing links to broken pages or expired content
• Find bot traps (like open-ended calendars)
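
Two of the checks listed above, click depth from the home page ("level") and the number of internal links pointing at a page ("In links"), are simple to compute once you have a crawl of your own site. The sketch below assumes the crawl is already available as a dict mapping each URL to the URLs it links to; the function names and the sample site are hypothetical and are not taken from either tool.

```python
from collections import deque


def click_depths(links, home):
    """Breadth-first search: minimum number of clicks from `home` to each page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths  # pages missing from the result are unreachable from home


def inlink_counts(links):
    """Count how many distinct other pages link to each page."""
    counts = {page: 0 for page in links}
    for page, targets in links.items():
        for target in set(targets):
            if target != page:  # ignore self-links
                counts[target] = counts.get(target, 0) + 1
    return counts


# Hypothetical crawl result, using the breadcrumb example from the deck.
SITE = {
    "/": ["/kitchen/", "/sitemap.html"],
    "/kitchen/": ["/", "/kitchen/major-appliances/"],
    "/kitchen/major-appliances/": ["/", "/kitchen/", "/kitchen/major-appliances/stoves/"],
    "/kitchen/major-appliances/stoves/": ["/"],
    "/sitemap.html": ["/"],
    "/orphan.html": ["/"],
}

if __name__ == "__main__":
    print(click_depths(SITE, "/"))
    print(inlink_counts(SITE))
```

In the sample, /orphan.html never appears in the depth map and has zero in-links, which is the kind of page that is unlikely to be discovered by a spider following links.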

Page 12: Information Architecture for SEO

Page 13: Information Architecture for SEO

Dos and Don’ts

Do:
• Define your IA and determine canonical URLs for hub pages across all major categories, with room for expansion
• Use breadcrumb-style navigation and put all new content in it! (EG: Home > Kitchen > Major Appliances > Stoves)
• Include relevant category-specific navigation at each level
• Make interlinking mandatory! Include in-content links to similar pages around the site, and give each page links from other pages
• Keep updated HTML and XML sitemaps for all new content
• Learn all the ways to control indexation and link juice flow

Don’t:
• Let the same content appear on more than one URL
• Just throw content up without linking to it, or linking from it
• Spread your link juice thin over pages that don’t have unique content
• Leave open-ended page scripts like calendars
• Archive poorly, without respect to your IA
• Return server headers other than 404 for error pages
• Think you can fix link juice distribution issues with robots.txt
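
For the first "Don’t", a rough way to spot the same content on more than one URL is to group already-fetched page bodies by a hash of their normalized text. This is a minimal sketch under stated assumptions: the `find_duplicate_content` name and the sample URLs are hypothetical, and exact hashing only catches copies that are identical after whitespace and case are normalized, not near-duplicates.

```python
import hashlib
from collections import defaultdict


def find_duplicate_content(pages):
    """Group URLs whose body text is identical after normalization.

    `pages` maps each URL to its extracted body text.
    """
    by_hash = defaultdict(list)
    for url, body in pages.items():
        # Collapse runs of whitespace and ignore case before hashing.
        normalized = " ".join(body.split()).lower()
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        by_hash[digest].append(url)
    return [sorted(urls) for urls in by_hash.values() if len(urls) > 1]


if __name__ == "__main__":
    sample_pages = {
        "/stoves.php": "Gas  stoves for every kitchen",
        "/stoves.php?partner=abc": "gas stoves for every kitchen",
        "/fridges.php": "Fridges for every kitchen",
    }
    print(find_duplicate_content(sample_pages))
```

Each group returned is a candidate for one of the fixes from the cheat sheet: a 301 redirect, a canonical tag, or a robots.txt rule for the variant URLs.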

Page 14: Information Architecture for SEO

Thank You!

Keep up with NVI:

Website - NVIsolutions.com
NVI Blog EN - NVIsolutions.com/blog
NVI Blog FR - GO-Referencement.org

E-mail me: [email protected]
Follow me on Twitter @NaoiseOsborne
