Taking Action: Linked Data for Digital Library Managers Silvia Southwick and Cory Lampert UNLV Digital Collections American Library Association Annual Meeting June 28, 2014 Las Vegas, NV


Dec 31, 2015

Transcript
Page 1

Taking Action: Linked Data for Digital Library Managers

Silvia Southwick and Cory Lampert
UNLV Digital Collections

American Library Association Annual Meeting
June 28, 2014
Las Vegas, NV

Page 2

Agenda

• Motivation
• Environment
• UNLV Linked Data project
• Technologies used for transforming metadata into linked data
• Visualizations of linked data (demos)
• Next steps and Q & A

Page 3

Linked Data Overview

• My collections are already visible through Google, so who cares?
• This is a topic for catalogers
• It’s too technical / complicated / boring

Actually ...
• Linked data is the future of the Web
• Data will no longer be trapped in silos imposed by systems, collections, or records
• Exposed open data presents new opportunities for users

Page 4

What is Linked Data?

• Linked Data refers to a set of best practices for publishing and interlinking data on the Web

• Data needs to be machine-readable

• Linked data (Web of Data) is an expansion of the Web we know (Web of documents)

Page 5

Current Practice

• Data (or metadata) encapsulated in records
• Records contained in collections
• Very few links are created within and/or across collections
• Links have to be manually created
• Existing links do not specify the nature of the relationships among records

This structure hides potential links within and across collections

Page 6

What we can do with linked data

• Free data from silos
• Expose relationships
• Powerful, seamless interlinking of our data
• Users interact with or query data in new ways
• Search results would be more precise
• Data can be easily repurposed

Page 7

Why?
Our data needs an upgrade.

http://5stardata.info/

Page 8

The Linked Open Data Cloud

Page 9

Making the Case for Linked Data

Problem:
– Rich metadata is being lost when adopting a standard that is designed for interoperability (Dublin Core)
– Rationale for adopting linked data is being disseminated, but there is very little practical implementation to serve as reference; no “recipe” or uniform solution
– Evolving beyond records takes resources and requires embracing an exciting but uncertain future

Page 10

Example of a metadata record

Page 11

How can we create linked data?

• Our metadata records are deconstructed into triples (statements) that are machine-readable
• Triples are expressed as: Subject – Predicate – Object
For example:
This book – has creator – Tom Heath
This book – has title – “Linked Data: Evolving the…”
• Subjects, predicates and most objects should have unique identifiers (URIs), creating data that can be used in Web architecture (HTTP)
• These statements are expressed using the Resource Description Framework (RDF)
• Linked data can be queried using SPARQL
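
The deconstruction step above can be sketched in a few lines of Python. This is only an illustration, not the project's actual tooling: the record layout, the field names and the "this-book" label are assumptions made for the example.

```python
# Hypothetical flat metadata record, loosely modelled on the book example.
# The field names and the "this-book" label are illustrative assumptions.
record = {
    "id": "this-book",
    "creator": ["Tom Heath"],
    "genre": ["Book"],
}


def record_to_triples(record):
    """Break one record into (subject, predicate, object) statements."""
    subject = record["id"]
    triples = []
    for field, values in record.items():
        if field == "id":
            continue  # the identifier names the subject; it is not a statement
        for value in values:
            triples.append((subject, f"has {field}", value))
    return triples


triples = record_to_triples(record)
for s, p, o in triples:
    print(f"{s} - {p} - {o}")
```

Each field value becomes its own statement, so a record with several creators or genres simply yields several triples about the same subject.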

Page 12

Expressing metadata as triples

• <this thing> <has creator> <Las Vegas News Bureau>
• <this thing> <has genre> <Photographic print>
• <this thing> <depicts> <Frank Sinatra>
• <this thing> <depicts> <Jack Entratter>
-------------------------------------------------------------------
• <Frank Sinatra> <has profession> <entertainer>
• <Jack Entratter> <has profession> <theatrical producer>
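
To show why separate statements are useful, the sketch below stores the six triples from this slide as plain Python tuples and follows two links in a row (item, then person, then profession). The helper names `match` and `professions_depicted` are made up for this example; a real triple store would answer the same question with a SPARQL query.

```python
# The six statements from the slide, as (subject, predicate, object) tuples.
triples = [
    ("this thing", "has creator", "Las Vegas News Bureau"),
    ("this thing", "has genre", "Photographic print"),
    ("this thing", "depicts", "Frank Sinatra"),
    ("this thing", "depicts", "Jack Entratter"),
    ("Frank Sinatra", "has profession", "entertainer"),
    ("Jack Entratter", "has profession", "theatrical producer"),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def professions_depicted(triples, item):
    """Follow item -depicts-> person -has profession-> profession."""
    result = {}
    for _, _, person in match(triples, s=item, p="depicts"):
        for _, _, profession in match(triples, s=person, p="has profession"):
            result[person] = profession
    return result


depicted = professions_depicted(triples, "this thing")
print(depicted)
```

The statements below the dashed line are about the people, not the photograph, yet the query reaches them because the object of one triple is the subject of another.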

Page 13

Graphic Representation

Page 14

Triples and RDF

– Once we have triples, we need to assign URIs to each subject
– URIs are always used for subjects and may also represent objects

URIs are essential for constructing RDF statements.
These steps take the human-readable graph and make it machine-readable!
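
One possible way to assign URIs is to derive them from the human-readable labels. The sketch below is an assumption about how that could be done, not a description of the UNLV process: `BASE_URI` is a placeholder domain, and `slugify` and `mint_uri` are hypothetical helpers.

```python
import re

# Placeholder base URI; a real project would use a domain it controls.
BASE_URI = "http://example.org/"


def slugify(label):
    """Turn a human-readable label into a URI-safe path segment."""
    slug = re.sub(r"[^A-Za-z0-9]+", "-", label.strip())
    return slug.strip("-")


def mint_uri(entity_type, label):
    """Build a local URI of the form <base>/<type>/<slug>."""
    return f"{BASE_URI}{entity_type}/{slugify(label)}"


agent_uri = mint_uri("Agent", "Las Vegas News Bureau")
print(agent_uri)
```

Where an external authority already publishes a URI for the same person or concept, reusing that URI instead of minting a local one is what creates links to other datasets.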

Page 15

Examples of records

Showgirls Menus

Dreaming the Skyline

Page 16


Page 17

How can I transform textual triples into machine-readable triples?

• We need vocabularies to express our triples
• Even better – a data model with these vocabularies
• Europeana Data Model gives us a framework to help organize, structure, and define which predicates we are going to use
• Adopting an existing model is preferable to creating your own (interoperability)

Page 18


Page 19

Triples with URIs & EDM model predicates

(Local URI)

Page 20

Machine-readable triple

@prefix dc: <http://purl.org/dc/elements/1.1/> .
@prefix edm: <http://www.europeana.eu/schemas/edm/> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> dc:creator <http://digcol7.library.unlv.edu:8890/Agent/Las-Vegas-News-Bureau> .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> foaf:depicts <http://id.loc.gov/authorities/names/n50026395> .

<http://digloc7.library.unlv.edu:8890/ProvidedCHO/sho000071> edm:hasType <http://id.loc.gov/vocabulary/graphicMaterials/tgm007779> .
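
As a rough illustration of what the prefixes do, the Python sketch below expands prefixed names into full URIs and writes each statement in the one-line-per-triple N-Triples style. The three namespace URIs come from this slide; the item and agent URIs under example.org are placeholders, and `expand` and `to_ntriples` are hypothetical helpers rather than part of any RDF library.

```python
# Namespace URIs as declared on the slide.
PREFIXES = {
    "dc": "http://purl.org/dc/elements/1.1/",
    "edm": "http://www.europeana.eu/schemas/edm/",
    "foaf": "http://xmlns.com/foaf/0.1/",
}


def expand(term):
    """Expand a prefixed name such as dc:creator into a full URI."""
    if term.startswith(("http://", "https://")):
        return term  # already a full URI
    prefix, local = term.split(":", 1)
    return PREFIXES[prefix] + local


def to_ntriples(triples):
    """Write each statement as '<s> <p> <o> .' using full URIs only."""
    return "\n".join(
        f"<{expand(s)}> <{expand(p)}> <{expand(o)}> ." for s, p, o in triples
    )


# Placeholder URIs for illustration only.
ITEM = "http://example.org/ProvidedCHO/item-1"
triples = [
    (ITEM, "dc:creator", "http://example.org/Agent/Las-Vegas-News-Bureau"),
    (ITEM, "foaf:depicts", "http://example.org/Agent/Frank-Sinatra"),
]
ntriples = to_ntriples(triples)
print(ntriples)
```

Note that every URI is wrapped in angle brackets; a bare URI without them is not valid in Turtle or N-Triples.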

Page 21

“I’m a digital collections manager”…

• What is known? – lots of THEORY and lots of TECHNICAL information

• What is happening? – a move toward PRACTICE and APPLICATION in libraries by non-programmers

• Is there a “recipe” yet? - No. But, our staff CAN do significant work to prepare for linked data and to understand linked data principles, even if it isn’t realistic to run a parallel process.

Page 22

UNLV Linked Data Project

Goals:
• Study the feasibility of developing a common process that would allow the conversion of our collection records into linked data, preserving their original expressivity and richness

• Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web

Page 23

Actions and Technologies

CONTENTdm: Clean data; Export data
OpenRefine: Import data; Prepare data; Generate triples; Export RDF
Mulgara / Virtuoso: Import data; Publish

Page 24

Phase 1

• Clean data

• Export data

Page 25

Clean / Export Data

Technology: CONTENTdm

• Increase consistency across collections:
– metadata element labels
– use of well-known controlled vocabularies (CVs)
– share local CVs
– etc.

• Export data as spreadsheet
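
Making element labels consistent can be pictured as mapping every label variant to one shared label before the export is processed further. The sketch below assumes a tab-delimited export and invents both the label variants and the sample row; it is not CONTENTdm functionality.

```python
import csv
import io

# Hypothetical label variants (lower-cased) and the shared label for each.
CANONICAL_LABELS = {
    "creator": "Creator",
    "creator(s)": "Creator",
    "subject": "Subject",
    "subjects": "Subject",
    "title": "Title",
}


def normalize_header(header):
    """Map each exported column label to one shared label."""
    return [CANONICAL_LABELS.get(h.strip().lower(), h.strip()) for h in header]


def normalize_export(text, delimiter="\t"):
    """Read a delimited export and return rows keyed by the shared labels."""
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    header, body = normalize_header(rows[0]), rows[1:]
    return [dict(zip(header, row)) for row in body]


# Invented sample export with inconsistent labels.
sample = "title\tCreator(s)\tSubjects\nShowgirl portrait\tLas Vegas News Bureau\tShowgirls\n"
records = normalize_export(sample)
print(records)
```

Unknown labels pass through unchanged, which makes it easy to spot the ones that still need a decision.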

Page 26

Phase 2

• Import to OpenRefine
• Prepare (Reconcile)
• Generate triples
• Export RDF files

Page 27

OpenRefine

• Open source

• It is a server, so it can communicate with other datasets via HTTP

• OpenRefine and its RDF extension should be installed

Screenshots to show some of the functions we have used

Page 28

Import

Page 29

Facets

Page 30

Split multi-value cells

Page 31
Page 32

Facet view for Graphic Elements after splitting
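
The effect of splitting multi-value cells and then faceting can be imitated outside OpenRefine. In the sketch below the field name follows the "Graphic Elements" facet mentioned here, while the rows, the values and the ";" separator are assumptions for illustration.

```python
from collections import Counter

# Invented rows; only the field name comes from the slides.
rows = [
    {"id": "item-1", "Graphic Elements": "feathers; sequins"},
    {"id": "item-2", "Graphic Elements": "sequins"},
    {"id": "item-3", "Graphic Elements": "feathers; rhinestones; sequins"},
]


def split_multi_value(rows, field, separator=";"):
    """Produce one row per value found in a multi-value cell."""
    out = []
    for row in rows:
        for value in row[field].split(separator):
            value = value.strip()
            if value:
                out.append({**row, field: value})
    return out


def facet(rows, field):
    """Count how many rows carry each distinct value of a field."""
    return Counter(row[field] for row in rows)


split_rows = split_multi_value(rows, "Graphic Elements")
counts = facet(split_rows, "Graphic Elements")
print(counts.most_common())
```

Before splitting, each combined string would appear as its own facet value; after splitting, the facet counts individual terms, which is what makes inconsistencies visible.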

Page 33

Reconciliation

Page 34

Specifying Reconciliation service

Page 35

Activating Reconciliation
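
Conceptually, reconciliation matches a local name string against an authority list and returns the identifier of the best candidate. The sketch below imitates that idea with fuzzy string matching from Python's standard library; it does not call OpenRefine or any reconciliation service, and the authority labels and example.org URIs are placeholders.

```python
import difflib

# Hypothetical authority list: preferred label -> placeholder URI.
AUTHORITY = {
    "Sinatra, Frank": "http://example.org/authority/1",
    "Entratter, Jack": "http://example.org/authority/2",
}


def reconcile(name, authority=AUTHORITY, cutoff=0.8):
    """Return (label, URI) of the closest authority label, or None."""
    candidates = difflib.get_close_matches(name, list(authority), n=1, cutoff=cutoff)
    if not candidates:
        return None  # no candidate is similar enough; leave for manual review
    label = candidates[0]
    return label, authority[label]


print(reconcile("Sinatra, Frank"))
print(reconcile("Sinatra, Frnak"))
print(reconcile("Unknown Person"))
```

The cutoff controls how tolerant the match is; values that fall below it are better reviewed by a person than linked automatically.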

Page 36
Page 37
Page 38
Page 39

Creating the Mapping (Skeleton)

Page 40
Page 41
Page 42

Exporting RDF files

Page 43

Actions and Technologies

CONTENTdm: Prepare data; Export data
OpenRefine: Import data; Prepare data; Generate triples; Export RDF
Mulgara / Virtuoso: Import data; Publish; Query

Page 44

Phase 3

• Import data
• Publish
• Query

Page 45

Mulgara Triple Store: Import

Page 46
Page 47

Simple SPARQL Query

SELECT *
WHERE { ?s ?p ?o } LIMIT 100
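
A query like this is normally sent to a triple store's SPARQL endpoint over HTTP, with the query text in a `query` parameter. The sketch below only builds such a request so that it can be inspected without a running server; the endpoint URL is a made-up local address and `build_sparql_request` is a hypothetical helper.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Made-up endpoint address; replace with the endpoint of your own triple store.
ENDPOINT = "http://localhost:8890/sparql"

# The query from the slide: return up to 100 arbitrary triples.
QUERY = "SELECT * WHERE { ?s ?p ?o } LIMIT 100"


def build_sparql_request(endpoint, query):
    """Build (but do not send) an HTTP GET request carrying the query."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_sparql_request(ENDPOINT, QUERY)
print(request.full_url)
# To run it against a live endpoint: urllib.request.urlopen(request)
```

Because every part of the pattern is a variable, the query matches any triple, which makes it a convenient first check that data was loaded at all.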

Page 48
Page 49

Visualization Open Source Tools

• OpenLink Virtuoso Pivot Viewer

• RelFinder

UNLV Linked Data Blog with videos: http://www.library.unlv.edu/linked-data/2014/04/selected-presentations-project.html

Page 50

OpenLink Pivot Viewer

• Good for displaying images

• Selection of images through SPARQL queries

• Allows refinements using facets

• Allows creating dynamic “collections”

Page 51

SPARQL Query

Costume Design Drawings

Showgirls

Page 52

Example of Pivot Viewer

• Video clip: http://youtu.be/-83FTKEkYZ0

Page 53

RelFinder

• Good to show relationships:
– Among people
– Among “things”

• Show type of relationships

Demos:
– African American Experience in Las Vegas (Oral History): http://youtu.be/wKCEl3KXdGk
– Cross collections people relationship: http://youtu.be/co3nlbMkoWE
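
The idea of finding how two entities are related can be illustrated with a shortest-path search over a set of triples. This is a simplified sketch in the spirit of such a tool, not RelFinder's actual algorithm; the photographs "Photo A" and "Photo B" and their statements are invented for the example.

```python
from collections import deque

# Invented statements linking two photographs, two people and an agency.
triples = [
    ("Photo A", "depicts", "Frank Sinatra"),
    ("Photo A", "depicts", "Jack Entratter"),
    ("Photo B", "depicts", "Jack Entratter"),
    ("Photo B", "has creator", "Las Vegas News Bureau"),
]


def find_path(triples, start, goal):
    """Breadth-first search for the shortest chain of statements linking two nodes."""
    neighbors = {}
    for triple in triples:
        s, _, o = triple
        # Links can be followed in either direction.
        neighbors.setdefault(s, []).append((o, triple))
        neighbors.setdefault(o, []).append((s, triple))
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        for nxt, triple in neighbors.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [triple]))
    return None  # the two nodes are not connected


path = find_path(triples, "Frank Sinatra", "Las Vegas News Bureau")
for s, p, o in path:
    print(f"{s} - {p} - {o}")
```

Because each step in the path is a full statement, the result shows not only that two entities are connected but also the type of each relationship along the way.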

Page 54

Examples of RelFinder

• Video clip

Page 55

Next steps for the UNLV project

• Transform all digital collections into linked data (parallel structure)
• Publish our collections metadata as Linked Open Data
• Increase linkage with other datasets
• Design and assess user-friendly interfaces to access and display our data and related data from other datasets
• Produce a cost-benefit analysis to inform future plans for the development of digital collections

Page 56

Our Experience

• Project led, implemented and managed by two busy faculty librarians

• No model to follow; our model was experimentation and research

• With interest and motivation, Linked Open Data is a feasible goal

Page 57

Thank You!

Questions?

Cory Lampert
Head, Digital [email protected]

Silvia Southwick
Digital Collections Metadata [email protected]