KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu Institute of Applied Informatics and Formal Description Methods (AIFB) Institute of Applied Informatics and Formal Description Methods (AIFB) A semantically enabled architecture for crowdsourced Linked Data management Elena Simperl, 1 Maribel Acosta , 1 Barry Norton 2 1 Institute AIFB, Karlsruhe Institute of Technology, Germany 2 Ontotext AD, Bulgaria
22
Embed
Crowdsourcing-enabled Linked Data management architecture
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu
Institute of Applied Informatics and Formal Description Methods (AIFB)
Institute of Applied Informatics and Formal Description Methods (AIFB)
A semantically enabled architecture for crowdsourced Linked Data management Elena Simperl,1 Maribel Acosta,1 Barry Norton2
1Institute AIFB, Karlsruhe Institute of Technology, Germany 2Ontotext AD, Bulgaria
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
2 07.06.2012
Background: What is Linked Data?
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
Linked Data: set of best practices to publish and connect structured data on the Web.
URIs to identify entities and concepts in the world HTTP to access and retrieve resources and descriptions of these resources RDF as generic graph-based data model to structure and link data
Taken together Linked Data is said to form a ‘cloud’ of shared references and vocabularies. Query language: SPARQL.
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
5 07.06.2012
1. Motivation
„Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human-readable description of the airport given in the comment“.
This query cannot be optimally answered automatically:
Incorrect/missing classification of entities (e.g. classification as airports instead of commercial airports).
Missing information in data sets (e.g. German labels).
It is not possible to optimally perform subjective operations (e.g. comparisons of pictures or NL comments).
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
User Query: Give me the German names of all commercial airports in Baden-Württemberg, ordered by their most informative description.
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
6 07.06.2012
1. Motivation
„Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human-readable description of the airport given in the comment“.
In order to answer the query as intended:
Classification of airports as commercial airports.
Identity resolution of places (Baden-Württemberg).
Translation of the labels of the airports.
Ordering of the comments by a subjective comparison.
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
7 07.06.2012
1. Motivation
„Retrieve the labels in German of commercial airports located in Baden-Württemberg, ordered by the better human-readable description of the airport given in the comment“.
SPARQL Query: SELECT ?label WHERE { ?x a metar:CommercialHubAirport; rdfs:label ?label; rdfs:comment ?comment . ?x geonames:parentFeature ?z . ?z owl:sameAs <http://dbpedia.org/resource/Baden-Wuerttemberg> . FILTER (LANG(?label) = "de") } ORDER BY CROWD(?comment, "Better description of %x")
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
16 07.06.2012
3. Extensions to VoID and SPARQL
RDF data can be queried using the language SPARQL.
Common SPARQL operators: join, union, optional, filter, order by.
Properties related to general ontology languages such as OWL are treated as extensions of SPARQL operators, and are modeled in our architecture as tasks.
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
19 07.06.2012
4.2. Ordering
Orderings defined via less straightforward built-ins; for instance, the ordering of pictorial representations of entities. SPARQL extension: ORDER BY CROWD Example: Retrieves all airports and their pictures, and the pictures should
be ordered according to the more representative image of the given airport.
SELECT ?airport ?picture WHERE { ?airport a metar:Airport; foaf:depiction ?picture . } ORDER BY CROWD(?picture, "Most representative image for %airport")
CrowdSearch 2012 - A semantically enabled architecture for crowdsourced Linked Data management
{?airport foaf:depiction ?x, ?y} Input:
{{(?x ?y) a rdf:List} UNION {(?y ?x) a rdf:List}} Output: