RISE presentation for workshop, 2011-07-04

Post on 17-Nov-2014


DESCRIPTION

Presentation by the JISC RISE project at the Innovations in Activity Data event for academic libraries at the Open University, 4 July 2011

Transcript

Recommendations Improve the Search Experience

Innovations in Activity Data workshop, 4 July 2011

Richard Nurse

http://www.open.ac.uk/blogs/rise

http://www.flickr.com/photos/jm3/2779185414/sizes/m/in/photostream/

Can you use search data to make recommendations?

Are recommendations useful for Discovery systems?

http://www.flickr.com/photos/mag3737/1419690363/sizes/m/in/photostream/

Recommendations Improve the Search Experience?


JISC funded project

February – July 2011

One of eight projects [list at http://bit.ly/gwCmNS]

http://www.flickr.com/photos/mag3737/3069729100/sizes/m/in/photostream/

JISC Activity Data Programme


Usage data

Attention data

http://www.flickr.com/photos/mag3737/2326898219/sizes/m/in/photostream/

What is Activity Data?

http://www.flickr.com/photos/zerimski/5215633183/sizes/z/in/photostream/

"Every day I wake up and ask, 'how can I flow data better, manage data better, analyse data better?'"

Rollin Ford, the CIO of Wal-Mart

So what’s the point of this activity data stuff then?

Loans, Holds, Computer bookings, Library access, e-resources

http://www.flickr.com/photos/xq311z/2468769929/sizes/m/in/photostream/

Library activity data

http://www.flickr.com/photos/neilwykes/134162792/sizes/z/in/photostream/

OU environment

Library activity data [the same list is repeated over several slides, with items progressively marked "x"]: Loans, Holds, Computer bookings, Library access, e-resources

http://www.flickr.com/photos/stefz/7913287/sizes/m/in/photostream/

http://www.flickr.com/photos/julietteculver/4731004168/in/photostream

http://www.flickr.com/photos/cassidy/352549326/sizes/m/in/photostream/

http://www.flickr.com/photos/lexnger/116314355/sizes/m/in/photostream

EBSCO Discovery Service (EDS)

SFX knowledge base and OpenURL link resolver

EZProxy remote user authentication

Athens DA authentication built into local (SAMS) login system

http://www.flickr.com/photos/nataliesap/3553982299/sizes/m/in/photostream/

OU systems environment

Scope of the project

So what about collecting more data?

http://library.open.ac.uk/rise

E-journals, E-journal articles, E-books

http://www.flickr.com/photos/smallritual/5393527886/sizes/m/in/photostream/

What resources are involved?


EZProxy

SFX

EDS

VLE

website

http://www.flickr.com/photos/cdevers/2665335157/sizes/m/in/photostream/

bookmarklet

What data is RISE using?

• Remote host
• Date/Time
• OUCU
• Request
• Status
• Size of response
• Referrer
• User agent
• Session

http://www.flickr.com/photos/vincentgallegos/5123100365/sizes/m/in/photostream/

So what is in the EZProxy logs?

\"0\"|||\"137.108.143.168\"|||20110115235421|||\"nn1234\"|||\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\"|||302|||0|||\"http://library.open.ac.uk/\"|||\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"

http://www.flickr.com/photos/smohundro/2449517861/sizes/m/in/photostream/

So what is in the EZProxy logs?

Date and time: 20110115235421

http://www.flickr.com/photos/adactio/225402453/sizes/m/in/photostream/

So what is in the EZProxy logs?

User name: nn1234

http://www.flickr.com/photos/dlytle/422738735/sizes/m/in/photostream/

So what is in the EZProxy logs?

Request: GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1

http://www.flickr.com/photos/spoinknet/35410171/sizes/m/in/photostream/

So what is in the EZProxy logs?
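The sample log line can be split into its fields with a few lines of Python. This is only a sketch, not the RISE project's own code: it assumes the `|||` separator and backslash-escaped quotes shown on the slide, uses a copy of the sample line with its quote marks normalised, and gives the unexplained leading `0` the hypothetical name `leading_field`, since the slide lists nine field names but the line carries ten values. The field names and the `search_terms` key are likewise illustrative.

```python
from datetime import datetime
from urllib.parse import parse_qs, urlsplit

# Field names are assumptions based on the slide's list, plus one unnamed leading value.
FIELD_NAMES = [
    "leading_field", "remote_host", "timestamp", "oucu", "request",
    "status", "size", "referrer", "user_agent", "session",
]

# The sample line from the slide, with quote marks normalised.
SAMPLE_LINE = (
    r'\"0\"|||\"137.108.143.168\"|||20110115235421|||\"nn1234\"|||'
    r'\"GET http://libezproxy.open.ac.uk:80/connect?Session=st3ShtizgtrS7tU5'
    r'&url=http://search.ebscohost.com/login.aspx?direct=true&site=edslive'
    r'&scope=site&type=0&cli0=FT&clv0=Y&cli1=FT1&clv1=Y&authtype=ip'
    r'&group=VCStud&bquery=War%20Against%20the%20Panthers HTTP/1.1\"|||'
    r'302|||0|||\"http://library.open.ac.uk/\"|||'
    r'\"Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.2.13) Gecko/20101206 '
    r'Ubuntu/10.10 (maverick) Firefox/3.6.13\"|||\"t3ShtizgtrS7tU5\"'
)


def parse_line(line):
    """Split one log line into a dict of named, typed fields."""
    # Remove the surrounding backslash-escaped quotes from each value.
    values = [part.strip().strip('\\"') for part in line.split("|||")]
    if len(values) != len(FIELD_NAMES):
        raise ValueError(f"expected {len(FIELD_NAMES)} fields, got {len(values)}")
    record = dict(zip(FIELD_NAMES, values))
    record["timestamp"] = datetime.strptime(record["timestamp"], "%Y%m%d%H%M%S")
    record["status"] = int(record["status"])
    record["size"] = int(record["size"])
    # The request is "METHOD URL PROTOCOL"; the search terms sit in the bquery parameter.
    _method, url, _protocol = record["request"].split(" ")
    query = parse_qs(urlsplit(url).query)
    record["search_terms"] = query.get("bquery", [None])[0]
    return record


record = parse_line(SAMPLE_LINE)
print(record["timestamp"], record["oucu"], record["search_terms"])
```

Pulling the search terms out of the request is what later allows a search to be tied to the resources a user went on to view.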


RISE database

http://www.flickr.com/photos/vanderwal/279135451/sizes/m/in/photostream/

RISE database

EZProxy: Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

CIRCE: user type | course code(s)

http://www.flickr.com/photos/dw/4950924376/sizes/m/in/photostream/
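Combining the two sources amounts to a lookup on the user name. The following is a minimal sketch under stated assumptions: `STUDENT_RECORDS` is a hypothetical in-memory stand-in for whatever CIRCE supplies, and the key names (`oucu`, `user_type`, `course_codes`) are illustrative rather than taken from the RISE database schema.

```python
# Hypothetical stand-in for the user type and course codes held in CIRCE.
STUDENT_RECORDS = {
    "nn1234": {"user_type": "student", "course_codes": ["A123"]},
}


def enrich(log_record, student_records):
    """Return a copy of a parsed log record with user type and course codes added."""
    extra = student_records.get(log_record["oucu"], {})
    return {
        **log_record,
        "user_type": extra.get("user_type"),  # None when the user is not found
        "course_codes": list(extra.get("course_codes", [])),
    }


known = enrich({"oucu": "nn1234", "status": 302}, STUDENT_RECORDS)
unknown = enrich({"oucu": "zz9999", "status": 200}, STUDENT_RECORDS)
print(known["course_codes"], unknown["user_type"])
```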


http://www.flickr.com/photos/clankennedy/3022286303/sizes/m/in/photostream/

ISSNs DOI

Article information

Subject terms

But what isn’t there?

People on course ‘A’ viewed resource ‘B’

People who looked at resource ‘C’ also looked at resource ‘D’

What can the data tell us?


Course recommendation

Relationship recommendation

Which are the most popular resources, subjects

http://www.flickr.com/photos/vblibrary/5190554053/sizes/m/in/photostream/

So what about collecting more data?


RISE database

http://www.flickr.com/photos/vblibrary/5052421946/sizes/m/in/photostream/

EZProxy: Remote host | Date/Time | Oucu | request | status | size of response | referrer | user agent | session

CIRCE: user type | course code(s)

RISE: Searches in RISE

So how do you improve your data?

http://www.flickr.com/photos/suttonhoo22/5004250051/sizes/m/in/photostream/

People on course ‘A’ viewed resource ‘B’

People who looked at resource ‘C’ also looked at resource ‘D’

People who searched for subject ‘E’ looked at resource ‘F’

What can the data tell us?


Course recommendation

Search recommendation

Relationship recommendation

http://www.flickr.com/photos/vblibrary/4581698063/sizes/m/in/photostream/
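The three recommendation types can each be expressed as a simple count over viewing events. The sketch below illustrates the idea only and does not claim to be the algorithm RISE used: the event tuples, the resource identifiers and the function names are all hypothetical, and ranking by raw view count is an assumption.

```python
from collections import Counter, defaultdict

# Hypothetical events: (user, course, search term, resource viewed).
EVENTS = [
    ("u1", "A123", "panthers", "res_B"),
    ("u2", "A123", "panthers", "res_B"),
    ("u2", "A123", "panthers", "res_D"),
    ("u3", "B456", "civil rights", "res_D"),
    ("u3", "B456", "civil rights", "res_F"),
    ("u1", "A123", "civil rights", "res_F"),
]


def course_recommendations(events, course):
    """People on course 'A' viewed resource 'B'."""
    counts = Counter(res for _, c, _, res in events if c == course)
    return [res for res, _ in counts.most_common()]


def relationship_recommendations(events, resource):
    """People who looked at resource 'C' also looked at resource 'D'."""
    viewed_by_user = defaultdict(set)
    for user, _, _, res in events:
        viewed_by_user[user].add(res)
    counts = Counter()
    for resources in viewed_by_user.values():
        if resource in resources:
            counts.update(resources - {resource})
    return [res for res, _ in counts.most_common()]


def search_recommendations(events, term):
    """People who searched for subject 'E' looked at resource 'F'."""
    counts = Counter(res for _, _, t, res in events if t == term)
    return [res for res, _ in counts.most_common()]


print(course_recommendations(EVENTS, "A123"))
print(relationship_recommendations(EVENTS, "res_B"))
print(search_recommendations(EVENTS, "civil rights"))
```

The search-based variant is only possible once search terms are captured alongside the views, which is the extra data the RISE interface collects.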

People are looking at resources on this subject

Subject data

This resource is being used by people studying this course

Resource management

So how do you get a recommendation?

http://www.flickr.com/photos/will-lion/2442088335/sizes/m/in/photostream/

User A, Course A123

Resource B, RV=14

Views

User C, Course A123

Resource B, RV=15

Recommended +1: Resource B, RV=16

Views +1

User C, Course A123

Resource B, RV=17

Rate Useful +1

User C, Course A123

Resource B, RV=14

Rate Not Useful -2

Resource B, RV=15

Views +1
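One reading of the diagram above is that each resource carries a running RV score that goes up by 1 for a view, a recommendation or a "useful" rating and down by 2 for a "not useful" rating. The sketch below encodes that reading; the event names and the `apply_events` function are hypothetical, and the adjustments are taken from the slide's numbers rather than from the RISE code.

```python
# Score adjustments as read from the slide; the event names are illustrative.
RV_DELTAS = {
    "view": 1,
    "recommended": 1,
    "rate_useful": 1,
    "rate_not_useful": -2,
}


def apply_events(rv, events):
    """Return the RV score after applying a sequence of events in order."""
    for event in events:
        rv += RV_DELTAS[event]
    return rv


# Path 1 on the slide: 14 -> view -> 15 -> recommended -> 16 -> rated useful -> 17
useful_path = apply_events(14, ["view", "recommended", "rate_useful"])

# Path 2 on the slide: 16 -> rated not useful -> 14 -> view -> 15
not_useful_path = apply_events(16, ["rate_not_useful", "view"])

print(useful_path, not_useful_path)
```

Weighting a negative rating more heavily than a positive one means a single "not useful" cancels two earlier views or ratings.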


Interface usage

Page views

Browser 7,462

Gadget 855

Added a privacy policy to RISE, EDS and SFX interfaces

Provided an opt-out feature


Privacy and opt-out URL

http://library.open.ac.uk/rise/?page=privacy

Data Protection and privacy

Release data openly

E-Resource accesses

Search terms

Course subjects


Open Data

http://www.flickr.com/photos/narisa/2720873442/sizes/m/in/photostream/


Anonymization

Ensure compliance with Data Protection requirements

Get agreement to release data

Remove the user name

Remove all records for courses with fewer than x students

Replace the course code with a generic subject
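The three data-handling steps (remove the user name, drop small courses, generalise the course code) could be sketched as follows. This is an illustration under assumptions, not the project's release procedure: the record keys, the `subject_for_course` mapping and the choice to count students from the records themselves (rather than from enrolment data) are all hypothetical.

```python
from collections import defaultdict


def anonymize(records, subject_for_course, min_students):
    """Apply the three anonymization steps to a list of usage records."""
    # Count distinct users per course (assumption: course size is derived from the data).
    students = defaultdict(set)
    for record in records:
        students[record["course"]].add(record["oucu"])

    released = []
    for record in records:
        # Remove all records for courses with fewer than min_students students.
        if len(students[record["course"]]) < min_students:
            continue
        # Remove the user name and the course code.
        anon = {k: v for k, v in record.items() if k not in ("oucu", "course")}
        # Replace the course code with a generic subject.
        anon["subject"] = subject_for_course.get(record["course"], "unknown")
        released.append(anon)
    return released


RECORDS = [
    {"oucu": "u1", "course": "A123", "resource": "res_B"},
    {"oucu": "u2", "course": "A123", "resource": "res_D"},
    {"oucu": "u3", "course": "Z999", "resource": "res_F"},
]
released = anonymize(RECORDS, {"A123": "Arts"}, min_students=2)
print(released)
```

The threshold x matters because a course with very few students can make an individual identifiable even after the user name is removed.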


Data formats and standards

XML

KE Usage Statistics standard

OpenURL

CSV

MOSAIC

Linked Data

http://www.flickr.com/photos/mararie/4121128381/sizes/m/in/photostream/

“That recommender systems can enhance the student experience in new generation e-resource discovery services”

http://www.flickr.com/photos/mag3737/3318791086/sizes/m/in/photostream/

Hypothesis


Review of web analytics

Face to Face interviews

Online Survey

http://www.flickr.com/photos/8lettersuk/148685757/sizes/m/in/photostream/

Evaluation

Survey results 1: These resources may be related to others you've viewed recently

Not useful: 17%
Quite useful: 17%
Very useful: 30%
Not used: 4%
Not sure: 9%
Not applicable: 22%

Survey results 2: People on your course(s) viewed

Not useful: 22%
Slightly useful: 9%
Quite useful: 9%
Very useful: 17%
Not used: 9%
Not applicable: 35%

Survey results 3: People using similar search terms often viewed

Not useful: 17%
Quite useful: 13%
Very useful: 30%
Not used: 17%
Not applicable: 22%

Survey results 4: How relevant were the recommendations?

Not relevant: 35%
Slightly relevant: 13%
Quite relevant: 30%
Very relevant: 17%
Not used: 4%


Face to Face evaluation

http://www.flickr.com/photos/ryanhealy/3729881896/sizes/m/in/photostream/

Undergraduates

Like ratings and reviews from other students

‘other people’s experiences valuable’

Which module studied?

How high a mark?

Postgraduates

Citation as a recommendation

Wary of provenance

Feed to module website

Want synonyms

Trust repository


Should we have a recommender system?

“I think it would be a very good useful feature. It would be definitely very very useful” postgraduate Maths student

“So it would be interesting to see what other people are looking at. Yes, I would definitely use that because my limited knowledge of the library might mean that other people were using slightly different ways of searching and getting different results.” undergraduate English Literature student

I have just had a go, it was good with suggested papers that I had already found (which shows potential in my view) through Google.

http://www.flickr.com/photos/earlg/337743409/sizes/m/in/photostream/


Should we have a recommender system?

“I'm afraid my first reaction is to be a bit sceptical - it presumably doesn't tell you if fellow students found the information/article useful or relevant to what they were looking for. I would hate to waste time following unproductive links laid down by others who might be failing students or think that any "lazy" students might develop poor practice by relying on what others had looked at. It sounds like a good idea but I think caution needs to be exercised.”

http://www.flickr.com/photos/rob-sinclair/2189457309/sizes/m/in/photostream/


Why they prefer course-related recommendations

“I can’t be bothered with knowing what everybody else is interested in. I take a really operational view you know, I’m on here, I want to get the references for this particular piece of work, and those are the people that are most likely to be doing a similar thing that I can use.” H800 student

“I suppose if I wasn’t so sure on an assignment it would perhaps be quite useful to see what other people were looking at to know if I was thinking along the right lines.” - Undergrad literature student


Suggestions for improvement

“Maybe include a date. It would be interesting to know when a resource was last looked at” Postgraduate political philosophy student

“If somebody used similar search but three years ago, is that going to carry the same weight?” Postgraduate maths student

Include course drop-down choice. “I would be looking at that and saying ‘which of my courses does it refer to?’”

Rating the recommendations

8 out of 10 so far would be happy to rate the recommendation

Most people understood why we were asking them to rate it.

Recommendations usage

Search: 40%
Course: 36%
Relationship: 24%

Recommendations usage [chart: People using similar search terms often viewed; axis ticks 1 to 4 and 0 to 1000]

http://www.flickr.com/photos/antrover/5810373016/sizes/m/in/photostream/

Recommendations usage [chart: People on your course(s) viewed; axis ticks 1 to 4 and 0 to 700]

http://www.flickr.com/photos/exitfestival/5835349579/sizes/m/in/photostream/

Recommendations usage [chart: These resources may be related to others you've viewed recently; axis ticks 1 to 4 and 0 to 500]

http://www.flickr.com/photos/oldton_tim/479685673/sizes/m/in/photostream/

Recommendations usage [chart with series Relationship, Course and Search; axis ticks 1 to 4 and 0 to 2000]

http://www.flickr.com/photos/smin/613285324/sizes/m/in/photostream/

• EZProxy data
• Use other data sources
• Search terms
• Need more data!
• Users like recommendations ‘in principle’
• Recommendations provenance
• Interest in the search tools
• Quality of recommendations isn’t high
• Limited use of it so far


Interim findings

Release of code via Google Code

Release of data

Complete evaluation work

Final blog posts and write ups

Dissemination

Still to do
