Top Banner
Site Search Analytics in a Nutshell Louis Rosenfeld [email protected] @louisrosenfeld Webdagane 10 September 2013
85

Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Jan 28, 2015

Download

Technology

webdagene

 
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Site Search Analytics in a Nutshell

Louis Rosenfeld

[email protected] • @louisrosenfeld

Webdagane • 10 September 2013

Page 2: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Hello, my name is Lou

www.louisrosenfeld.com | www.rosenfeldmedia.com

Page 3: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Let’s look at the data

Page 4: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

No, let’s look at the real dataCritical elements in bold: IP address, time/date stamp, query, and # of

results:

XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ud=1&site=AllSites&ie=UTF-8&client=www&oe=UTF-8&proxystylesheet=www&q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02

XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ie=UTF-8&client=www&q=license+plate&ud=1&site=AllSites&spell=1&oe=UTF-8&proxystylesheet=www&ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16

Page 5: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

No, let’s look at the real dataCritical elements in bold: IP address, time/date stamp, query, and # of

results:

XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ud=1&site=AllSites&ie=UTF-8&client=www&oe=UTF-8&proxystylesheet=www&q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02

XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ie=UTF-8&client=www&q=license+plate&ud=1&site=AllSites&spell=1&oe=UTF-8&proxystylesheet=www&ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16

What are users searching?

Page 6: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

No, let’s look at the real dataCritical elements in bold: IP address, time/date stamp, query, and # of

results:

XXX.XXX.X.104 - - [10/Jul/2006:10:25:46 -0800] "GET /search?access=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ud=1&site=AllSites&ie=UTF-8&client=www&oe=UTF-8&proxystylesheet=www&q=lincense+plate&ip=XXX.XXX.X.104 HTTP/1.1" 200 971 0 0.02

XXX.XXX.X.104 - - [10/Jul/2006:10:25:48 -0800] "GET /searchaccess=p&entqr=0&output=xml_no_dtd&sort=date%3AD%3AL%3Ad1&ie=UTF-8&client=www&q=license+plate&ud=1&site=AllSites&spell=1&oe=UTF-8&proxystylesheet=www&ip=XXX.XXX.X.104 HTTP/1.1" 200 8283 146 0.16

What are users searching?

How often are users failing?

Page 7: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

SSA is semantically rich data, and...

Page 8: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

SSA is semantically rich data, and...

Queries sorted by frequency

Page 9: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

...what users want--in their own words

Page 10: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

Page 11: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

Not all queries are distributed equally

Page 12: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

Page 13: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

Nor do they diminish gradually

Page 14: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

Page 15: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

A little goes a long wayA handful of queries/tasks/ways to navigate/features/ documents meet the needs of your most important audiences

80/20 rule isn’t quite accurate

Page 16: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

(and the tail is quite long)

Page 17: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

(and the tail is quite long)

Page 18: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

(and the tail is quite long)

Page 19: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

(and the tail is quite long)

Page 20: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

(and the tail is quite long)The Long Tail is

much longer than you’d suspect

Page 21: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

The Zipf Distribution, textually

Page 22: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Some things you can do with SSA

1.Make it harder to get lost in deep content2.Make search smarter3.Reduce jargon4.Learn how your audiences differ5.Know when to publish what6.Own and enjoy your failures7.Avoid disaster8.Predict the future

Page 23: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#1Make it harder to get lost

Page 24: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Start with basic SSA data: queries and query frequency

Percent: volume of search activity for a unique query during a particular time period

Cumulative Percent: running sum of percentages

Page 25: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Tease out common content types

Page 26: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Tease out common content types

Page 27: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Tease out common content types

Took an hour to...• Analyze top 50 queries (20% of all search activity)

• Ask and iterate: “what kind of content would users be looking for when they searched these terms?”

• Add cumulative percentages

Result: prioritized list of potential content types#1) application: 11.77%

#2) reference: 10.5% #3) instructions: 8.6%

#4) main/navigation pages: 5.91%

#5) contact info: 5.79%

#6) news/announcements: 4.27%

Page 28: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Clear content types lead to better contextual navigation

artist descriptions

album reviews

album pages

artist biosdiscography

TV listings

Page 29: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#2Make search smarter

Page 30: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Clear content types improve search performance

Page 31: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Clear content types improve search performance

Page 32: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Clear content types improve search performance

Content objects related to products

Page 33: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Clear content types improve search performance

Content objects related to products

Raw search results

Page 34: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Contextualizing “advanced” features

Page 35: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

Page 36: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

search session patterns1. solar energy2. how solar energy works

Page 37: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

search session patterns1. solar energy2. how solar energy works

search session patterns1. solar energy2. energy

Page 38: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

search session patterns1. solar energy2. how solar energy works

search session patterns1. solar energy2. energy

search session patterns1. solar energy2. solar energy charts

Page 39: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

search session patterns1. solar energy2. how solar energy works

search session patterns1. solar energy2. energy

search session patterns1. solar energy2. solar energy charts

search session patterns1. solar energy2. explain solar energy

Page 40: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Session data suggest progression and context

search session patterns1. solar energy2. how solar energy works

search session patterns1. solar energy2. energy

search session patterns1. solar energy2. solar energy charts

search session patterns1. solar energy2. explain solar energy

search session patterns1. solar energy2. solar energy news

Page 41: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Recognizing proper nouns, dates, and unique ID#s

Page 42: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#3Reduce jargon

Page 43: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Saving the brand by killing jargon at a community collegeJargon related to online education: FlexEd, COD,

College on Demand

Marketing’s solution: expensive campaign to educate public (via posters, brochures)

The Numbers (from SSA):

Result: content relabeled, money saved

query rank query#22 online*#101 COD#259 College on Demand#389 FlexTrack

* “online” part of 213 queries

Page 44: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#4Learn how your audiences differ

Page 45: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Who cares about what?

Page 46: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Who cares about what?

Page 47: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Who cares about what?

Page 48: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Who cares about what?

Page 49: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Why analyze queries by audience?

Fortify your personas with dataLearn about differences between audiences

• Open University “Enquirers”: 16 of 25 queries are for subjects not taught at OU

• Open University Students: search for course codes, topics dealing with completing program

Determine what’s commonly important to all audiences (these queries better work well)

Page 50: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#5Know when to publish what

Page 51: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)
Page 52: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Interest in the football team:

going...

Page 53: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Interest in the football team:

going...

...going...

Page 54: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Interest in the football team:

going...

...going...

gone

Page 55: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Interest in the football team:

going...

...going...

gone

Time to study!

Page 56: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)
Page 57: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Before Tax Day

Page 58: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)
Page 59: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

After Tax Day

Page 60: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#6Own and enjoy your failures

Page 61: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Failed navigation?Examining unexpected searching

Look for places searches happen beyond main page

What’s going on?

• Navigational failure?

• Content failure?

• Something else?

Page 62: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Where navigation is failing (“Professional Resources” page)

Do users and AIGA mean different things by “Professional Resources”?

Page 63: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Comparing what users findand what they want

Page 64: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Comparing what users findand what they want

Page 65: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Failed business goals?Developing custom metrics

Netflix asks

1. Which movies most frequently searched? (query count)

2. Which of them most frequently clicked through? (MDP views)

3. Which of them least frequently added to queue? (queue adds)

Page 66: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Failed business goals?Developing custom metrics

Netflix asks

1. Which movies most frequently searched? (query count)

2. Which of them most frequently clicked through? (MDP views)

3. Which of them least frequently added to queue? (queue adds)

Page 67: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Failed business goals?Developing custom metrics

Netflix asks

1. Which movies most frequently searched? (query count)

2. Which of them most frequently clicked through? (MDP views)

3. Which of them least frequently added to queue? (queue adds)

Page 68: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#7Avoid disasters

Page 69: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

The new and improved search engine that wasn’t

Vanguard used SSA to help benchmark existing search engine’s performance and help select new engine

New search engine “performed” poorlyBut IT needed

convincing to delay launch

Information Architect &

Dev Team Meeting

Search seems to have a few

problems… Nah

.

Where’s the

proof?

You can’t tell

for sure.

Page 70: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

What to do? Test performance of common queries

“Before and after” testing using two sets of metrics1.Relevance: how reliably the search engine

returns the best matches first2.Precision: proportion of relevant results

clustered at the top of the list

Page 71: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Old engine (target) and new compared

Note: low relevance and high precision scores are optimal

More on Vanguard case study: http://bit.ly/D3B8c

Page 72: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Old engine (target) and new compared

Note: low relevance and high precision scores are optimal

More on Vanguard case study: http://bit.ly/D3B8c

uh-oh

Page 73: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Old engine (target) and new compared

Note: low relevance and high precision scores are optimal

More on Vanguard case study: http://bit.ly/D3B8c

uh-oh better

Page 74: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

#8Predict the future

Page 75: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Shaping the Financial Times’ editorial agendaFT compares these

• Spiking queries for proper nouns (i.e., people and companies)

• Recent editorial coverage of people and companies

Discrepancy? • Breaking story?!

• Let the editors know!Seed your

Page 76: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Can SSA bring us together?

Page 77: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Lou’s TABLE OF OVERGENERALIZED

DICHOTOMIESWeb Analytics User Experience

What they analyze Users' behaviors (what's happening)

Users' intentions and motives (why those things happen)

What methods they employ

Quantitative methods to determine what's happening

Qualitative methods for explaining why things happen

What they're trying to achieve

Helps the organization meet goals (expressed as KPI)

Helps users achieve goals (expressed as tasks or topics of interest)

How they use data Measure performance (goal-driven analysis)

Uncover patterns and surprises (emergent analysis)

What kind of data they use

Statistical data ("real" data in large volumes, full of errors)

Descriptive data (in small volumes, generated in lab environment, full of errors)

Page 78: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)
Page 79: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)
Page 80: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Lands End and SKUs

Page 81: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Lands End and SKUs

SKU: # 39072-2AH1

Page 82: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Use SSA to start work on a site report card

Page 83: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Use SSA to start work on a site report card

SSA helps determine common information needs

Page 84: Louis Rosenfeld: Nettstedssøk i et nøtteskall (Webdagene 2013)

Read this

Search Analytics for Your Site: Conversations with Your Customers by Louis Rosenfeld (Rosenfeld Media, 2011)

www.rosenfeldmedia.com

Use code WEBDAGENE2013

for 20% off allRosenfeld Media books