Consuming Linked Open Data
Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0
Boris Villazón-Terrazas
Facultad de Informática, Universidad Politécnica de Madrid
Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
http://www.oeg-upm.net
Phone: 34.91.3366605, Fax: 34.91.3524819
@boricles
Slides available at: http://www.slideshare.net/boricles/
Acknowledgements: Alexander de Leon, Filip Wisniewki, Daniel Vila-Suero, Daniel Garijo, Victor Saquicela, Michael Hausenblas, Richard Cyganiak, Sarven Capadisli, Oscar Corcho, Asunción Gómez-Pérez, all OEG members involved in the Linked Data initiatives, and Local Government Management Services Board - Ireland.
Some references
Wood, David (Ed.). Linking Government Data, 2011
Methodological Guidelines for Publishing Government Linked Data
Boris Villazón-Terrazas, Luis M. Vilches, Oscar Corcho, Asunción Gómez-Pérez
Best Practices for Publishing Linked Data
W3C Editor’s Draft – Government Linked Data Working Group
Bernadette Hyland, Boris Villazón-Terrazas, Michael Hausenblas,
It is also possible to reuse and apply an existing license of the government data sources.
Specification
Specification → Modelling → Generation → Linking → Publication → Exploitation
Reuse available vocabularies
• Search for suitable vocabularies
  • Linked Open Vocabularies
• Are there suitable vocabularies?
  • Yes: build the vocabulary by reusing available vocabularies
  • No: …
Modelling
Reuse available non-ontological resources
• Search for suitable non-ontological resources
  • Highly reliable Web sites
  • Domain-related sites
  • Government catalogs
• Are there suitable resources?
  • Yes: build the vocabulary by transforming available resources
  • No: build the vocabulary from scratch
Modelling
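Read together, the two decision diagrams above suggest a fallback chain for the modelling activity. The sketch below is only one reading of them: the first diagram leaves its "No" branch open ("…"), and the code assumes that this branch continues with the search for non-ontological resources. The function name and the returned strings are illustrative, not part of the original guidelines.

```python
from typing import Sequence


def choose_modelling_strategy(vocabularies: Sequence[str],
                              non_ontological_resources: Sequence[str]) -> str:
    """Pick a modelling strategy from what the search steps turned up.

    Assumption: when no suitable vocabulary is found, the next step is to
    look at non-ontological resources (the slide leaves this branch open).
    """
    if vocabularies:
        return "build the vocabulary by reusing available vocabularies"
    if non_ontological_resources:
        return "build the vocabulary by transforming available resources"
    return "build the vocabulary from scratch"


# Hypothetical search results, used only to exercise the three branches.
print(choose_modelling_strategy(["some vocabulary"], []))
print(choose_modelling_strategy([], ["a government catalog"]))
print(choose_modelling_strategy([], []))
```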
Specification → Modelling → Generation → Linking → Publication → Exploitation
Transformation
• Take the data sources selected in the specification activity and transform them to RDF according to the vocabulary created in the modelling activity
• Some tools
  • CSV and spreadsheets: RDF extension of Google Refine, XLWrap, RDF123, NOR2O
  • RDB: D2R Server, ODEMapster, W3C RDB2RDF WG – R2RML
  • XML: GRDDL, ReDeFer
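To make the CSV case concrete, here is a minimal sketch of what such a transformation does, written with the Python standard library only. It does not use any of the tools listed above. The base URI, the URI patterns, the column names and the sample values are all assumptions made for illustration, and every cell is emitted as a plain literal.

```python
import csv
import io
from urllib.parse import quote

BASE = "http://example.org/"  # hypothetical base URI
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def escape_literal(value: str) -> str:
    """Escape the characters that N-Triples requires to be escaped in literals."""
    return (value.replace("\\", "\\\\").replace('"', '\\"')
                 .replace("\n", "\\n").replace("\r", "\\r"))


def csv_to_ntriples(text: str, key_column: str, resource_type: str) -> list:
    """Turn each CSV row into one typed resource plus one triple per non-empty cell.

    Assumes well-formed rows (every row has exactly the header's columns).
    """
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        slug = quote(row[key_column], safe="")
        subject = f"<{BASE}resource/{resource_type}/{slug}>"
        triples.append(f"{subject} <{RDF_TYPE}> <{BASE}ontology/{resource_type}> .")
        for column, value in row.items():
            if column == key_column or not value:
                continue  # the key is already in the URI; empty cells yield no triple
            predicate = f"<{BASE}ontology/{quote(column, safe='')}>"
            triples.append(f'{subject} {predicate} "{escape_literal(value)}" .')
    return triples


# Made-up sample data, not taken from any real source.
sample = "province,year,index\nMadrid,2010,98.5\n"
for line in csv_to_ntriples(sample, "province", "Province"):
    print(line)
```

A real transformation would follow the vocabulary from the modelling activity and assign datatypes to the literals instead of deriving predicates from column names.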
Generation
• A majority of dynamic Web content is backed by relational databases (RDB), and so are many enterprise systems.
• W3C RDB2RDF Working Group
  • R2RML: RDB to RDF Mapping Language - http://www.w3.org/TR/r2rml/
  • Direct Mapping - http://www.w3.org/TR/rdb-direct-mapping/
  • R2RML and Direct Mapping Test Cases - http://www.w3.org/2001/sw/rdb2rdf/test-cases/
  • RDB2RDF Implementation report – http://www.w3.org/2001/sw/rdb2rdf/implementation-report/
Transformation – RDB2RDF
transformation description
transformation engine
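The split between a transformation description and a transformation engine can be illustrated with a toy example. The sketch below is not R2RML syntax and not a conformant implementation of either W3C specification. The mapping is a plain Python dictionary, and the table, column names, URIs and rows are hypothetical.

```python
import sqlite3

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical "transformation description": which table to read, how to
# mint subject URIs, and which predicate each column maps to.
MAPPING = {
    "table": "station",
    "subject_template": "http://example.org/resource/Station/{id}",
    "class": "http://example.org/ontology/Station",
    "columns": {
        "name": "http://www.w3.org/2000/01/rdf-schema#label",
        "province": "http://example.org/ontology/province",
    },
}


def run_mapping(conn, mapping):
    """Toy "transformation engine": apply the description to every row."""
    conn.row_factory = sqlite3.Row
    table = mapping["table"]
    triples = []
    for row in conn.execute(f'SELECT * FROM "{table}"'):
        subject = mapping["subject_template"].format(**dict(row))
        triples.append((subject, RDF_TYPE, mapping["class"]))
        for column, predicate in mapping["columns"].items():
            if row[column] is not None:  # NULL values produce no triple
                triples.append((subject, predicate, str(row[column])))
    return triples


# In-memory database with made-up rows, only to exercise the engine.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE station (id INTEGER PRIMARY KEY, name TEXT, province TEXT)")
conn.execute("INSERT INTO station VALUES (1, 'Station A', 'Madrid')")
conn.execute("INSERT INTO station VALUES (2, 'Station B', NULL)")
for triple in run_mapping(conn, MAPPING):
    print(triple)
```

Keeping the description separate from the engine means the same engine can be reused for another database by changing only the description.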
Specification → Modelling → Generation → Linking → Publication → Exploitation
• Identify suitable data sets as linking targets: http://thedatahub.org
• Discover relationships between data items: Silk Framework, LIMES
• Validate the relationships discovered: sameAs Validator
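The discovery step can be approximated in a few lines to show the idea. The sketch below is a simplified stand-in and does not use the Silk or LIMES APIs. It compares labels with the standard library's `difflib` and proposes `owl:sameAs` candidates above a threshold. The URIs, labels and the 0.9 threshold are assumptions, and the output is a list of candidates that still needs validation.

```python
from difflib import SequenceMatcher

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalise(label: str) -> str:
    """Lower-case the label and collapse whitespace before comparing."""
    return " ".join(label.lower().split())


def discover_links(source: dict, target: dict, threshold: float = 0.9) -> list:
    """Propose one sameAs candidate per source item: its most similar target label.

    source and target map URIs to labels.
    """
    links = []
    for s_uri, s_label in source.items():
        best_uri, best_score = None, 0.0
        for t_uri, t_label in target.items():
            score = SequenceMatcher(None, normalise(s_label), normalise(t_label)).ratio()
            if score > best_score:
                best_uri, best_score = t_uri, score
        if best_uri is not None and best_score >= threshold:
            links.append((s_uri, OWL_SAME_AS, best_uri, round(best_score, 3)))
    return links


# Made-up URIs and labels for illustration.
local = {"http://example.org/resource/Province/Madrid": "Madrid"}
remote = {
    "http://example.com/place/1": "madrid",
    "http://example.com/place/2": "Barcelona",
}
for link in discover_links(local, remote):
    print(link)
```

String similarity alone produces false positives (different places can share a name), which is why a separate validation step follows discovery.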
Effective usage, develop applications that exploit these data
Streaming resources
ToC
• Introduction
• Publishing & Consuming Linked Open Data
• Use cases
  • GeoLinkedData - ES
  • AEMET - ES
  • El Viajero - ES
  • datos.bne.es - ES
  • Service Indicators - IE
• Conclusions and future work
GeoLinkedData
• An open initiative whose aim is to enrich the Web of Data with Spanish geospatial data
• It started off by publishing diverse information sources, such as those of the National Geographic Institute of Spain (IGN) and the Statistical Institute of Spain (INE).
• http://geo.linkeddata.es/
GeoLinkedData – Identification of the data sources
• IGN, National Geographic Institute of Spain: Oracle & MySQL databases; agreement with the IGN
• INE, National Statistic Institute of Spain: data sources available in a public data catalog
Specification
GeoLinkedData – Analysis of the data sources
Specification
[Figure: sample of the data sources; visible labels: Industry Production Index, Province, Year]
GeoLinkedData - URI design
• Base URI http://linkeddata.es/ http://geo.linkeddata.es/
• Gflot
  • Flot is a pure JavaScript plotting library, and Gflot is a GWT adaptation of Flot.
11/02/11
http://code.google.com/p/gflot/
Weather stations
Observations for each station
A particular observation
Specific visualization for datasets based on SSN Ontology – ongoing work
http://www.w3.org/2005/Incubator/ssn/ssnx/ssn
El Viajero – tourism and travelling
• Content is aggregated from different platforms, such as “Suplemento El País”, “Guías Aguilar”, “Canal Viajar” or “Prisa Digital”.
• Heterogeneous content (images, travel guides, posts, videos, news) with different sources and from people with different profiles (journalists, bloggers and normal users)
Modelling
Ontology network
• OPM (1): centered on the description of the evolution of the resource.
• OPM profile (2): OPM extension to our specific domain.
• SIOC (3): describes the social relationships in the platforms, plus posts and blogs.
• MPEG-7 (3): image and video description.
• GEO (3): localization of the resources.
[Figure: ontology network layers: (1) OPM Core; (2) OPM extension to our domain; (3) SIOC, MPEG-7, GEO]
Overview of the architecture
[Figure: architecture diagram]
• Components: Repository; PARSERS (Post Parser, Blog Parser, XML Parser, IPTC Parser); Annotation interface; REST API; OWL Model
• Actors: User/content provider; Application
• Flows: HTTP POST request (insert XML data); HTTP GET request (SPARQL query); insert processed data; store in the repository; receive request; send response; SPARQL request; send/receive RDF response
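As an illustration of the "HTTP GET request (SPARQL query)" flow in the diagram, the sketch below builds such a request with the Python standard library. The endpoint URL and the resource URI are hypothetical, since the slides do not give the address of the REST API. The `query` parameter name comes from the SPARQL Protocol, and the `Accept` header asks for an RDF response. The request is only constructed here, not sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit
from urllib.request import Request

ENDPOINT = "http://example.org/api/sparql"  # hypothetical endpoint


def build_sparql_get(endpoint: str, query: str) -> Request:
    """Build (but do not send) an HTTP GET request carrying a SPARQL query."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/rdf+xml"}, method="GET")


# DESCRIBE returns an RDF graph about the (made-up) resource.
QUERY = "DESCRIBE <http://example.org/resource/Guide/1>"
request = build_sparql_get(ENDPOINT, QUERY)
print(request.get_method(), request.full_url)

# Decoding the URL again shows that the query survives URL encoding intact.
print(parse_qs(urlsplit(request.full_url).query)["query"][0])
```

Passing the request to `urllib.request.urlopen` would send it; that step is left out because the endpoint above does not exist.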
Linking
• SILK has been used to:
  • Link resources to DBpedia through geolocation
  • Link resources to GeoLinkedData through geolocation
• Linking resources to LUF (Linked User Feedback).
  • Guide & travel recommendation.
• Linking travel guides to hotels and restaurants of “Guía Santillana”.
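Linking through geolocation can be pictured as matching each resource to the nearest target within some distance. The sketch below is a stand-in for that idea and does not reproduce Silk's configuration or API. It uses the haversine great-circle distance, and the URIs, coordinates and the 1 km cut-off are assumptions chosen for illustration.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = radians(lat1), radians(lat2)
    dphi = phi2 - phi1
    dlambda = radians(lon2 - lon1)
    a = sin(dphi / 2) ** 2 + cos(phi1) * cos(phi2) * sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(a))


def link_by_geolocation(resources, targets, max_km=1.0):
    """Link each resource to its nearest target if that target lies within max_km.

    resources and targets map URIs to (lat, lon) tuples.
    """
    links = []
    for r_uri, (r_lat, r_lon) in resources.items():
        nearest = min(targets.items(),
                      key=lambda item: haversine_km(r_lat, r_lon, *item[1]),
                      default=None)
        if nearest is None:
            continue  # no targets to link to
        distance = haversine_km(r_lat, r_lon, *nearest[1])
        if distance <= max_km:
            links.append((r_uri, nearest[0], distance))
    return links


# Made-up resources and coordinates.
guides = {"http://example.org/resource/Guide/1": (40.4168, -3.7038)}
places = {
    "http://example.com/place/near": (40.4170, -3.7040),
    "http://example.com/place/far": (41.3874, 2.1686),
}
for link in link_by_geolocation(guides, places):
    print(link)
```

The cut-off trades recall for precision: a larger radius links more resources but also pairs up places that are merely close to each other.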
Exploitation
El Viajero:
• Extension of map4rdf to our domain.
  • New queries for browsing resources
  • Image addition
  • Filtering and time-line plugins
Additional exploitation:
• Resource searcher using the dataset.
• LARKC demo (ISOCO): http://contextmanager.isoco.net/webn1/demolarkc/
http://www.simile-widgets.org/timeline/
Browser
Initial screen
After selecting a type of resource, all of the available resources of that type are shown on the map
Guide Browsing
More images of the guides
Link to the news in “El Viajero”
Pubby frontend
Guide Browsing
More images of the guides
Year filtering
Plugin selection; year selection
Trip Browsing
Trip metadata; itinerary followed in the trip
Timeline
Trip timeline (drawn from its provenance information)
Trip features (price, duration, type, etc.)
Quick search - Author
Reference to locations
Guides
datos.bne.es project
• Joint project between the National Library of Spain (BNE) and the Ontology Engineering Group (OEG)
• Started as a small proof-of-concept project: publishing "Cervantes" datasets as Linked Data
• Evolved into a bigger project: publishing a significant part of the BNE catalogue
• Published in December 2011, with a public announcement at BNE
datos.bne.es: Methodological approach
• Derived from several experiences at OEG: geolinkeddata.es, Met agency, etc. [1]
• Design principle: have more control over the different activities and allow for an iterative, incremental process
Data specification
Modelling
RDF generation
Link generation
[1] Villazón-Terrazas, B. et al., Methodological Guidelines for Publishing Government Linked Data. In D. Wood, ed. Linking Government Data. Springer.