Top Banner
ENRICH > LINK > SEARCH The lean approach for advanced search applications over linked data Michiel Hildebrand Semantics Conference Vienna 2015
39

Enrich, Link, Search

Jan 22, 2018

Download

Data & Analytics

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Enrich, Link, Search

ENRICH > LINK > SEARCHThe lean approach for advanced search applications over linked data

Michiel HildebrandSemantics Conference Vienna 2015

Page 2: Enrich, Link, Search

2

Page 3: Enrich, Link, Search

Do you see value in open data?

3

Page 4: Enrich, Link, Search

Do you think that open data could improve

the access to your own data?

4

Page 5: Enrich, Link, Search

Have you integrated open data with your own data?

5

Page 6: Enrich, Link, Search

Have you created an application on top of your

integrated data?

6

Page 7: Enrich, Link, Search

The billion $ Open Data example

7

Page 8: Enrich, Link, Search

Cultural Heritage: advanced access through (Open) Data

multi-lingual

location-based

recommendation

personalization

advanced ranking

analytics

http://www.getty.edu/research/tools/vocabularies/aat/ 8

Page 9: Enrich, Link, Search

multi-lingual

location-based

recommendation

personalization

advanced ranking

analytics

Cultural Heritage: advanced access through (Open) Data

http://www.vistory.nl/9

Page 10: Enrich, Link, Search

Cultural Heritage: advanced access through (Open) Data

multi-lingual

location-based

recommendation

personalization

advanced ranking

analytics

query logs

content-based10

Page 11: Enrich, Link, Search

Cultural Heritage: advanced access through (Open) Data

multi-lingual

location-based

recommendation

personalization

advanced ranking

analytics

http://manovich.net/11

Page 12: Enrich, Link, Search

Historic newsreels and photographs

12

Page 13: Enrich, Link, Search

Demo: Linked Open Images

13

http://link.spinque.com/openbeelden

Page 14: Enrich, Link, Search

Can we build this in a day?

14

Page 15: Enrich, Link, Search

Factory metaphor

PUSH: make to stock

PULL: make to order

Output and efficiency oriented

exact needs of user secondary

User needs oriented

production costly

15

Page 16: Enrich, Link, Search

How can we reduce the time

and cost?

Data factory

PUSH: make to stock

PULL: make to order

16

How good is the data for

your application?

Page 17: Enrich, Link, Search

The lean approach

17

Your data Integrate Access Deploy

API

Enrich

Page 18: Enrich, Link, Search

Open Data Node platform

http://opendatanode.org/

Methodology for publishing Open Data

http://www.comsode.eu/index.php/deliverables/

Moving from one-off to sustainable data publishing

18

http://unifiedviews.eu/

Page 19: Enrich, Link, Search

Key requirements for integration step

Sustainable

Quality control

19

Your data Integrate Access Deploy

API

Enrich

Page 20: Enrich, Link, Search

Integrating historic newsreels with photographs

GTAA thesaurus (SKOS)NIOD subject terms (SKOS)

20

Page 21: Enrich, Link, Search

preferred label

antisemitisme

spionage

amnestie

...

preferred label

antisemitisme

spionage

amnestie

...

NIOD subject terms GTAA thesaurus

preferred label = preferred label

21

Page 22: Enrich, Link, Search

prefered label alternative label

politieagenten agenten

militaire parades parades

optochten parades

prefered label

agenten

parades

NIOD subject termsGTAA thesaurus

Introduces ambiguity

preferred label = alternative label

22

Page 23: Enrich, Link, Search

prefered label

dodenherdenking

hamsteren

NIOD subject terms GTAA thesaurus

Introduces errors

prefered label

dodenherdenkingen

hamsters

singular label = plural label (stemming)

23

Page 24: Enrich, Link, Search

prefered label

dieren

graven

NIOD subject terms

GTAA thesaurus

filter sources

prefered label concept scheme

dieren subject terms

dieren geographical names

graven subject terms

grave geographical names

subject ≠ location (noise)

24

Page 25: Enrich, Link, Search

Other alignment techniques

fuzzy string matching

join matches on multiple attributes

similarity in the hierarchy (skos:broader)

select best candidate (most generic/specific term)

....

25

Page 27: Enrich, Link, Search

Key requirements integration step checked

Quality control• Model link strategy out of (simple) building blocks• Iterative process (trial and error)• Exploration of the source data• Direct access to the results• Evaluate the subsets

Sustainable• Export links and link strategy• Provenance of the process is explicit in the strategy• Rerun after update of datasets

27

Page 28: Enrich, Link, Search

Dutch National Strategy Digital Heritage

28

Page 29: Enrich, Link, Search

CultuurLINK a free service for the cultural heritage domain

29

http://cultuurlink.beeldengeluid.nl/

Page 30: Enrich, Link, Search

Rijksmuseum Amsterdam integrated multilingual vocabularies

http://www.rijksmuseum.nl/nl/collectie/BK-NM-1010 http://www.getty.edu/research/tools/vocabularies/aat/ 30

Page 31: Enrich, Link, Search

Key requirements for access step

31

Your data Integrate Access Deploy

API

Enrich

Model complex access (search)

Combine graph queries and ranking

Page 32: Enrich, Link, Search

Already three types of search in a simple app

32

keyword search location-based search recommendation

Page 33: Enrich, Link, Search

multilingual

location-based

recommendation

personalization

ranking

analyticsProbabilistic Graph Database

Building blocks (SPINQL)

Search by Strategy

Advanced search applications with Spinque

33

Page 34: Enrich, Link, Search

Demo Spinque Search

34

Page 35: Enrich, Link, Search

Key requirements access step checked

Model complex search problems• Search strategy out of (simple) building blocks• No programming required

Combine graph queries and ranking• Integrated triple store and search index• Probabilistic graph database• Building blocks for graph queries• Building blocks for search and ranking

35

Page 36: Enrich, Link, Search

Your data Enrich Link strategy

API

DeploySearch strategy

36

The lean approach

Page 37: Enrich, Link, Search

Breakout

What kind of functionality would you like to provide to your users?

1. What kind of data do you want to make accessible in a richer way?

2. What additional (open) data can you use for this enriched access?

3. What type of (search) functionality is required?

37

Page 38: Enrich, Link, Search

Other applications: Restaurant inspections

38

Page 39: Enrich, Link, Search

Other applications: Community platform

39