Linking Data with sameAs: Challenges and Solutions - Workshop

Post on 16-May-2015

262 Views

Category:

Education

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Feedback from 'Linking Data with sameAs: Challenges and Solutions' 3 hour workshop given at ELAG 2014 in Bath, UK. http://elag2014.org/programme/elag-2014-workshops/stevenson/

Transcript

ELAG 2014 Workshop. Bath, UK. 11–12th June 2014

Adrian Stevenson and Jane StevensonMimas, University of Manchester, UK@adrianstevenson @janestevenson

Linking Data with sameAs: Challenges and Solutions

Linking Lives

• An interface to biographical data, using– the Archives Hub– VIAF– DBPedia– the British National Biography (BNB)– Copac

• http://archiveshub.ac.uk/linkinglives/

3

owl:sameAs

<Archives Hub Person> owl:sameAs <VIAF Person>

<http://data.archiveshub.ac.uk/id/person/nra/webbmarthabeatrice1858-1943socialreformer>

owl:sameAs

<http://viaf.org/viaf/86607236> .

4

http://data.archiveshub.ac.uk/id/person/nra/webbmarthabeatrice1858-1943socialreformerfoaf:familyName + foaf:givenName + hub:dates

“Webb, Martha Beatrice, 1858-1943”

http://viaf.org/viaf/86607236/foaf:name

“Webb, Martha Beatrice, 1858-1943”

5

Matching

• LOD Refine• http://code.zemanta.com/sparkica/download.html

• SILK Framework• http://wifo5-03.informatik.uni-mannheim.de/bizer/

silk/#workbench

6

LOD Refine

7

SILK

Comments on the workshop

• ‘great lead-through on LOD refine’• LOD Refine and Silk seem to be workable tools

for creating sameAs triples that can help matching

• ‘purpose and possibilities of Silk perhaps a little rushed for me’

• ‘made me realize how disconnected my concept of Silk restrictions and Sparql was. This is now fixed. Ta!’

Comments on Linking Lives

• ‘Great to see the British National Biography (BNB) being used’

• Linking Lives project shows the need for more open data!’

• ‘We need robust Sparql endpoints!’

Comments…

• ‘Funny how hard it is to find useful stuff to link to, and how the user is to make sense of it’.

• ‘I feel reconciled!’• ‘Linking = hard work’

Challenges

Identifying entities: • One of the main problems we came up with in

our linked data pilot connecting library catalogue data and theatre performance data was the lack of identifiers for people and works

• String matching on personal names and work titles in legacy heterogenous systems is extremely important

Challenges

• Question is how to match work titles in multiple languages.

top related