Top Banner
Recommendations Improve the Search Experience Innovations in Activity Data workshop 04 July2011 Richard Nurse http://www.open.ac.uk/bl ogs/rise http://www.flickr.com/photos/jm3/2779185414/sizes/ m/in/photostream/
63

Rise presentation for workshop 2011 07-04

Nov 17, 2014

Download

Education

Richard Nurse

Presentation by JISC RISE project at the Innovations in Activity Data event for academic libraries at the Open University 4 July 2011
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Rise presentation for workshop 2011 07-04

Recommendations Improve the Search Experience Innovations in Activity Data workshop04 July2011

Richard Nurse

http://www.open.ac.uk/blogs/rise

http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/

Page 2: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Can you use search data to

make recommendations?

Are recommendations

useful for Discovery systems?

http://www.flickr.com/photos/mag3737/1419690363/sizes/m/in/photostream/

Recommendations Improve the Search Experience?

Page 3: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

JISC funded project

February – July 2011

One of eight projects [list at http://bit.ly/gwCmNS]

http://www.flickr.com/photos/mag3737/3069729100/sizes/m/in/photostream/

JISC Activity Data Programme

Page 4: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Usage data

Attention data

http://www.flickr.com/photos/mag3737/2326898219/sizes/m/in/photostream/

What is Activity Data?

Page 5: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/zerimski/5215633183/sizes/z/in/photostream/

"Every day I wake up and ask, 'how can I flow data better, manage data better, analyse data better?"

Rollin Ford, the CIO of Wal-Mart

So what’s the point of this activity data stuff then?

Page 6: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources

http://www.flickr.com/photos/xq311z/2468769929/sizes/m/in/photostream/

Library activity data

Page 7: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/neilwykes/134162792/sizes/z/in/photostream/

OU environment

Page 8: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources

xx

http://www.flickr.com/photos/stefz/7913287/sizes/m/in/photostream/

Library activity data

Page 9: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/julietteculver/4731004168/in/photostream

OU environment

Page 10: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resources

xx

x x

http://www.flickr.com/photos/cassidy/352549326/sizes/m/in/photostream/

Library activity data

Page 11: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Loans Holds Computer bookings

Library access

e-resourcesx

x xx

http://www.flickr.com/photos/lexnger/116314355/sizes/m/in/photostream

Library activity data

Page 12: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Ebsco Discovery Solution

SFX knowledge base and OpenURL link resolver

EZProxy remote user authentication

Athens DA authentication built into local (SAMS) login system

http://www.flickr.com/photos/nataliesap/3553982299/sizes/m/in/photostream/

OU systems environment

Page 13: Rise presentation for workshop 2011 07-04

Scope of the project

Page 14: Rise presentation for workshop 2011 07-04

So what about collecting more data?

http://www.open.ac.uk/blogs/rise http://library.open.ac.uk/rise

Page 15: Rise presentation for workshop 2011 07-04

So what about collecting more data?

http://www.open.ac.uk/blogs/rise

Page 16: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Page 17: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Page 19: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Page 20: Rise presentation for workshop 2011 07-04

E-journals E-journal articles E-books

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/smallritual/5393527886/sizes/m/in/photostream/

What resources are involved?

Page 21: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

EZProxy

SFX

EDS

VLE

website

http://www.flickr.com/photos/cdevers/2665335157/sizes/m/in/photostream/

bookmarklet

What data is RISE using?

Page 22: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

• Remote host • Date/Time• Oucu• Request• Status• Size of response• Referrer• User agent• Session

http://www.flickr.com/photos/vincentgallegos/5123100365/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 23: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

\"0\"|||\"137.108.143.168\"|||20110115235421|||\“nn1234\"|||\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\“|||302|||0|||\http://library.open.ac.uk/\|||\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"http://www.flickr.com/photos/smohundro/2449517861/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 24: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

\"0\"|||\"137.108.143.168\"|||20110115235421|||\“nn1234\"|||\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\“|||302|||0|||\http://library.open.ac.uk/\|||\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"

date and time

http://www.flickr.com/photos/adactio/225402453/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 25: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

\"0\"|||\"137.108.143.168\"|||20110115235421|||\“nn1234\"|||\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\“|||302|||0|||\http://library.open.ac.uk/\|||\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"

User name

http://www.flickr.com/photos/dlytle/422738735/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 26: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

\"0\"|||\"137.108.143.168\"|||20110115235421|||\“nn1234\"|||\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\“|||302|||0|||\http://library.open.ac.uk/\|||\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"

Request

http://www.flickr.com/photos/spoinknet/35410171/sizes/m/in/photostream/

So what is in the EZProxy logs?

Page 27: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vanderwal/279135451/sizes/m/in/photostream/

Page 28: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

RISE database

Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

user type | course code(s)

EZProxy

CIRCE

http://www.flickr.com/photos/dw/4950924376/sizes/m/in/photostream/

Page 29: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vanderwal/279135451/sizes/m/in/photostream/

Page 30: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/clankennedy/3022286303/sizes/m/in/photostream/

ISSNs DOI

Article information

Subject terms

But what isn’t there?

Page 31: Rise presentation for workshop 2011 07-04

People on course ‘A’ viewed resource ‘B’ People who looked at resource ‘C’ also looked at resource ‘D’

What can the data tell us?

http://www.open.ac.uk/blogs/rise

Course recommendation

Relationship recommendation

Which are the most popular resources, subjects

http://www.flickr.com/photos/vblibrary/5190554053/sizes/m/in/photostream/

Page 32: Rise presentation for workshop 2011 07-04

So what about collecting more data?

http://www.open.ac.uk/blogs/rise

Page 33: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

RISE database

http://www.flickr.com/photos/vblibrary/5052421946/sizes/m/in/photostream/

Page 34: Rise presentation for workshop 2011 07-04

Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

user type | course code(s)

Searches in RISE

EZProxy

CIRCE

RISE

So how do you improve your data?

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/suttonhoo22/5004250051/sizes/m/in/photostream/

Page 35: Rise presentation for workshop 2011 07-04

People on course ‘A’ viewed resource ‘B’

People who looked at resource ‘C’ also looked at resource ‘D’

People who searched for subject ‘E’ looked at resource ‘F’

What can the data tell us?

http://www.open.ac.uk/blogs/rise

Course recommendation

Search recommendation

Relationship recommendation

http://www.flickr.com/photos/vblibrary/4581698063/sizes/m/in/photostream/

People are looking at resources on this subjectSubject data

This resource is being used by people studying this course

Resource management

Page 36: Rise presentation for workshop 2011 07-04

So how do you get a recommendation?

http://www.open.ac.uk/blogs/rise http://www.flickr.com/photos/will-lion/2442088335/sizes/m/in/photostream/

Page 37: Rise presentation for workshop 2011 07-04

So how do you get a recommendation?

http://www.open.ac.uk/blogs/rise

User A Course A123

Resource BRV=14

Views

User C Course A123

Resource BRV=15

Recommended +1 Resource BRV=16

Views +1

User C Course A123

Resource BRV=17

Rate Useful +1

User C Course A123

Resource BRV=14

Rate Not Useful -2

Resource BRV=15

Views +1

Page 38: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Interface usage

Page views

Browser 7,462

Gadget 855

Page 39: Rise presentation for workshop 2011 07-04

Added a privacy policy to RISE, EDS and SFX interfaces

Provided an opt-out feature

http://www.open.ac.uk/blogs/rise

Privacy and opt-out URL

http://library.open.ac.uk/rise/?page=privacy

Data Protection and privacy

Page 40: Rise presentation for workshop 2011 07-04

Release data

openly

E-Resource accesses

Search terms

Course subjects

http://www.open.ac.uk/blogs/rise

Open Data

http://www.flickr.com/photos/narisa/2720873442/sizes/m/in/photostream/

Page 41: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Anonymization

Ensure compliance with Data Protection requirements

Get agreement to release data

Page 42: Rise presentation for workshop 2011 07-04

Remove the user name

Remove all records for courses with less than x

students

Replace the course code with a generic subject

http://www.open.ac.uk/blogs/rise

Anonymization

Page 43: Rise presentation for workshop 2011 07-04

Data formats and standards

XML

KE Usage Statistics standard

OpenURL

CSV

MOSAIC

Linked Data

http://www.flickr.com/photos/mararie/4121128381/sizes/m/in/photostream/

Page 44: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

“That recommender systems can enhance the student

experience in new generation e-resource discovery services”

http://www.flickr.com/photos/mag3737/3318791086/sizes/m/in/photostream/

Hypothesis

Page 45: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Review of web analytics

Face to Face interviews

Online Survey

http://www.flickr.com/photos/8lettersuk/148685757/sizes/m/in/photostream/

Evaluation

Page 46: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Survey results 1

Not useful17%

Quite useful17%

Very useful30%Not used

4%

Not sure9%

Not applicable22%

These resources may be related to others you've viewed recently

Page 47: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Survey results 2

Not useful22%

Slightly useful9%

Quite useful9%Very useful

17%Not used9%

Not applicable35%

People on your course(s) viewed

Page 48: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Survey results 3

Not useful17%

Quite useful13%

Very useful30%

Not used17%

Not applicable22%

People using similar search terms often viewed

Page 49: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Survey results 4

Not relevant35%

Slightly relevant

13%

Quite relevant30%

Very relevant17%

Not used4%

How relevant where the recommendations?

Page 50: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Face to Face evaluation

http://www.flickr.com/photos/ryanhealy/3729881896/sizes/m/in/photostream/

UndergraduatesLike ratings and reviews from other

students

‘other people’s experiences valuable’

Which module studied?

How high a mark?

Postgraduates

Citation as a recommendation

Wary of provenance

Feed to module website

Want synonyms

Trust repository

Page 51: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Should we have a recommender system?

“I think it would be a very good useful feature. It would be definitely very very useful” postgraduate Maths student

“So it would be interesting to see what other people are looking at. Yes, I would definitely use that because my limited knowledge of the library might mean that other people were using slightly different ways of searching and getting different results.” undergraduate English Literature student

I have just had a go, it was good with suggested papers that I had already found (which shows potential in my view) through Google.

http://www.flickr.com/photos/earlg/337743409/sizes/m/in/photostream/

Page 52: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Should we have a recommender system?

“I'm afraid my first reaction is to be a bit sceptical - it presumably doesn't tell you if fellow students found the information/article useful or relevant to what they were looking for.  I would hate to waste time following unproductive links laid down by others who might be failing students or think that any "lazy" students might develop poor practice by relying on what others had looked at.  It sounds like a good idea but I think caution needs to be exercised. ”

http://www.flickr.com/photos/rob-sinclair/2189457309/sizes/m/in/photostream/

Page 53: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Why they prefer course-related recommendations

“I can’t be bothered with knowing what everybody else is interested in. I take a really operational view you know, I’m on here, I want to get the references for this particular piece of work, and those are the people that are most likely to be doing a similar thing that I can use.” H800 student

“I suppose if I wasn’t so sure on an assignment it would perhaps be quite useful to see what other people were looking at to know if I was thinking along the right lines.” - Undergrad literature student

Page 54: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Suggestions for improvement

“Maybe include a date. It would be interesting to know when a resource was last looked at” Postgraduate political philosophy student

“If somebody used similar search but three years ago, is that going to carry the same weight?” Postgraduate maths student

Include course drop-down choice. “I would be looking at that and saying “which of my courses does it refer to?”

Page 55: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Rating the recommendations

8 out of ten so far would be happy to rate the recommendation

Most people understood why we were asking them to rate it.

Page 56: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Recommendations usage

Search

40%

Course

36%

Relationship24%

Page 57: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Recommendations usage

1 2 3 40

100

200

300

400

500

600

700

800

900

1000

People using similar search terms often viewed

http://www.flickr.com/photos/antrover/5810373016/sizes/m/in/photostream/

Page 58: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Recommendations usage

1 2 3 40

100

200

300

400

500

600

700

People on your course(s) viewed

http://www.flickr.com/photos/exitfestival/5835349579/sizes/m/in/photostream/

Page 59: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Recommendations usage

1 2 3 40

50

100

150

200

250

300

350

400

450

500

These resources may be related to others you've viewed recently

http://www.flickr.com/photos/oldton_tim/479685673/sizes/m/in/photostream/

Page 60: Rise presentation for workshop 2011 07-04

http://www.open.ac.uk/blogs/rise

Recommendations usage

1 2 3 40

200

400

600

800

1000

1200

1400

1600

1800

2000

Relationship

Course

Search

http://www.flickr.com/photos/smin/613285324/sizes/m/in/photostream/

Page 61: Rise presentation for workshop 2011 07-04

•EZProxy data

•Use other data sources

•Search terms

•Need more data!

•Users like recommendations ‘in principle’

•Recommendations provenance

•Interest in the search tools

•Quality of recommendations isn’t high

•Limited use of it so far

http://www.open.ac.uk/blogs/rise

Interim findings

Page 62: Rise presentation for workshop 2011 07-04

Release of code via Google Code

Release of data

Complete evaluation work

Final blog posts and write ups

Dissemination

Still to do

Page 63: Rise presentation for workshop 2011 07-04

Recommendations Improve the Search ExperienceInnovations in Activity Data workshop04 July2011

Richard Nurse

http://www.open.ac.uk/blogs/rise

http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/