Publishing Linked Data from RDB
Boris Villazón-Terrazas, Oscar Corcho
Facultad de Informática, Universidad Politécnica de Madrid
Campus de Montegancedo sn, 28660 Boadilla del Monte, Madrid
http://www.oeg-upm.net
Phone: 34.91.3366605, Fax: 34.91.3524819
Slides available at: http://www.slideshare.net/boricles/
Acknowledgements: Freddy Priyatna, Jan Schulte, Richard Cyganiak and many others that we may have omitted.
Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0
Linked Open Vocabularies http://labs.mondeca.com/dataset/lov/
• FOAF: Friend of a Friend vocabulary
• BIBO: The Bibliographic Ontology
• GEOP: FAO Geopolitical Ontology
Modelling – NeOn Toolkit
http://neon-toolkit.org/
New Project
New Ontology
Modelling – Creating some elements
Classes
Object Properties
Datatype Properties
Transformation – RDB2RDF
• A majority of dynamic Web content is backed by relational databases (RDB), and so are many enterprise systems.
• W3C RDB2RDF Working Group [1]
• R2RML: RDB to RDF Mapping Language - http://www.w3.org/TR/r2rml/
• Direct Mapping - http://www.w3.org/TR/rdb-direct-mapping/
• R2RML and Direct Mapping Test Cases - http://www.w3.org/2001/sw/rdb2rdf/test-cases/
R2RML and Direct Mapping are in the process of becoming W3C Recommendations.
1 http://www.w3.org/TR/r2rml/
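To make the idea of the Direct Mapping concrete, here is a minimal Python sketch that turns the rows of one table into N-Triples lines. It is a simplification under stated assumptions, not a conforming implementation of the W3C specification: it handles a single-column primary key only, ignores foreign keys, and types only integers. The SQLite backend, the function names, the `People` table and the `http://example.org/` base IRI are all illustrative assumptions that do not come from the slides.

```python
import sqlite3
from urllib.parse import quote

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD_INTEGER = "http://www.w3.org/2001/XMLSchema#integer"


def literal(value):
    """Render a column value as an N-Triples literal (integers typed, the rest plain)."""
    if isinstance(value, int):
        return f'"{value}"^^<{XSD_INTEGER}>'
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def direct_map(conn, table, pk, base):
    """Map every row of `table` to triples, in the spirit of the Direct Mapping.

    Simplified: one primary-key column `pk`, no foreign keys, NULLs skipped.
    """
    cur = conn.execute(f'SELECT * FROM "{table}"')
    columns = [d[0] for d in cur.description]
    table_iri = f"{base}{quote(table)}"
    triples = []
    for row in cur:
        record = dict(zip(columns, row))
        # Row IRI: <base><table>/<pk>=<value>
        subject = f"<{table_iri}/{quote(pk)}={quote(str(record[pk]))}>"
        # Each row is typed with the IRI of its table.
        triples.append(f"{subject} <{RDF_TYPE}> <{table_iri}> .")
        for col, value in record.items():
            if value is None:
                continue  # NULL values produce no triple
            triples.append(f"{subject} <{table_iri}#{quote(col)}> {literal(value)} .")
    return triples


# Hypothetical usage with an in-memory database.
demo = sqlite3.connect(":memory:")
demo.execute('CREATE TABLE "People" (ID INTEGER PRIMARY KEY, fname TEXT)')
demo.execute("INSERT INTO People VALUES (7, 'Bob')")
for line in direct_map(demo, "People", "ID", "http://example.org/"):
    print(line)
```

R2RML covers the cases this sketch leaves out, since it lets the mapping author choose the target vocabulary and IRI patterns instead of deriving them from the schema.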
Transformation – R2O & ODEMapster
• R2O is an extensible, fully declarative language to describe mappings between relational database schemas and ontologies.
• The ODEMapster processor generates RDF instances from relational instances based on the mapping description expressed in the R2O document.
• ODBC Registration:
  • Set up the VIRTUOSO_HOME variable pointing to the virtuoso-opensource directory:
  • cd %VIRTUOSO_HOME%\lib
  • regsvr32 virtodbc.dll
• Creating a Windows Service
  • cd %VIRTUOSO_HOME%\database
  • SET PATH=%PATH%;%VIRTUOSO_HOME%\bin;%VIRTUOSO_HOME%\lib
  • virtuoso-t -? //to verify
  • virtuoso-t +service screate +instance "Instance Name" +configfile virtuoso.ini
  • virtuoso-t +service list //to verify
  • virtuoso-t -I "Instance Name" +service start //start the service
• Dataset: http://research.ull.es/graph/dataset - research.rdf
• Links: http://research.ull.es/graph/links - output.nt
Virtuoso endpoint
• http://localhost:8890/sparql
Virtuoso endpoint
• Now you can play a bit with SPARQL … ;)
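As a starting point for playing with the endpoint, the following Python sketch sends a SELECT query over HTTP using only the standard library. It assumes the endpoint at http://localhost:8890/sparql is running and that it honours the standard SPARQL Protocol `query` parameter and a JSON `Accept` header. The function names are hypothetical, and treating http://research.ull.es/graph/dataset as a named graph is an assumption based on the dataset slide.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "http://localhost:8890/sparql"  # endpoint URL shown on the slide


def build_query_url(endpoint, query):
    """Encode a SPARQL query as a GET request URL (SPARQL Protocol 'query' parameter)."""
    return f"{endpoint}?{urllib.parse.urlencode({'query': query})}"


def run_select(endpoint, query, timeout=10):
    """Run a SELECT query and return the list of result bindings."""
    request = urllib.request.Request(
        build_query_url(endpoint, query),
        headers={"Accept": "application/sparql-results+json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)["results"]["bindings"]


# Assumed graph IRI; adjust it to wherever the data was actually loaded.
SAMPLE_QUERY = """
SELECT ?s ?p ?o
FROM <http://research.ull.es/graph/dataset>
WHERE { ?s ?p ?o }
LIMIT 10
"""
```

Calling `run_select(ENDPOINT, SAMPLE_QUERY)` against a running instance should return up to ten bindings, each a dictionary keyed by the variable names `s`, `p` and `o`.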
Metadata publication – VOiD
• VOiD description
• void.ttl
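The slides do not show the contents of void.ttl, so the following Python sketch only illustrates what a minimal VOiD description might contain. It uses terms that exist in the VOiD and Dublin Core vocabularies (`void:Dataset`, `void:sparqlEndpoint`, `void:dataDump`, `dcterms:title`). The function name, the title and the dump URL are placeholders, and using the graph IRI from the dataset slide as the dataset IRI is an assumption.

```python
def void_description(dataset_uri, title, sparql_endpoint, data_dump):
    """Return a minimal VOiD dataset description serialized as Turtle."""
    # Escape backslashes and double quotes so the title is a valid Turtle string.
    safe_title = title.replace("\\", "\\\\").replace('"', '\\"')
    lines = [
        "@prefix void: <http://rdfs.org/ns/void#> .",
        "@prefix dcterms: <http://purl.org/dc/terms/> .",
        "",
        f"<{dataset_uri}> a void:Dataset ;",
        f'    dcterms:title "{safe_title}" ;',
        f"    void:sparqlEndpoint <{sparql_endpoint}> ;",
        f"    void:dataDump <{data_dump}> .",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical values; replace them with the real dataset details.
print(void_description(
    "http://research.ull.es/graph/dataset",
    "Research dataset",
    "http://localhost:8890/sparql",
    "http://example.org/dumps/research.rdf",
))
```

Writing the returned string to a file named void.ttl gives a description that can be published next to the dataset and referenced when registering it in a catalogue.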
Metadata Publication – CKAN.net / thedatahub.org
Enable effective discovery
• Sindice: the best RDF search engine
Enable effective discovery
• Sitemap Protocol
• Used by web crawlers
• Efficiently find all your content & discover what has been updated
http://sitemaps.org/
A sitemap file contains information regarding one or more URLs on your Web site. The information that is stored there helps search engines better spider your website.
• Run sitemap4rdf, specifying the SPARQL endpoint and the prefix of the URLs to include in the Sitemap
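The sitemap4rdf command line is not shown on the slides, so no invocation is reproduced here. Instead, this Python sketch illustrates the kind of file such a tool produces: a Sitemap Protocol document listing only the resource URLs that start with a given prefix. The function name and the example URLs are hypothetical, and in practice the URL list would come from a query against the SPARQL endpoint rather than from a hard-coded list.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls, prefix, lastmod=None):
    """Return a Sitemap Protocol XML string for the URLs that start with `prefix`."""
    ET.register_namespace("", SITEMAP_NS)  # serialize with a default namespace
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for url in sorted(set(urls)):  # de-duplicate and keep a stable order
        if not url.startswith(prefix):
            continue
        entry = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(entry, f"{{{SITEMAP_NS}}}loc").text = url
        if lastmod is not None:
            ET.SubElement(entry, f"{{{SITEMAP_NS}}}lastmod").text = lastmod
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical resource URLs; only those under the prefix are kept.
print(build_sitemap(
    ["http://example.org/resource/a", "http://example.org/resource/b", "http://other.example/x"],
    prefix="http://example.org/resource/",
    lastmod="2012-01-01",
))
```

Filtering by prefix keeps the sitemap limited to URLs under the publisher's own namespace, which matches the role of the prefix argument described above.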
Submit the sitemap location - Sindice
• http://sindice.com/main/submit
Submit the sitemap location - Google
• https://www.google.com/webmasters/tools/