Linked Data: Data Integration and Semantic Web, Diego Pessoa

Linked Data Integration and semantic web

Dec 16, 2014


Diego Pessoa

 
Transcript
Page 1: Linked Data Integration and semantic web

Linked Data
Data Integration and Semantic Web

Diego Pessoa

Page 2: Linked Data Integration and semantic web

How did we store data?

Page 3: Linked Data Integration and semantic web

Data Islands

Limited to the company

Page 4: Linked Data Integration and semantic web

Databases

Central Access

Distributed

Federated

Page 5: Linked Data Integration and semantic web

Web

Hypertext (Web 1.0)

Social/Collaborative Content (Web 2.0)

Massive data volumes

Page 6: Linked Data Integration and semantic web

Web Data Volume?

Growing at 40% per year

45 ZB ≈ 48,318,382,080 TB (with binary prefixes: 1 ZB = 2^30 TB)

Page 7: Linked Data Integration and semantic web

Does this mean we have a problem?

Page 8: Linked Data Integration and semantic web

Searching the web…

Who are the Brazilian players (including those with dual nationality) in the 2014 World Cup?

Page 9: Linked Data Integration and semantic web

Googling…

54,700,000 results?!?!

Information about just one player

Page 10: Linked Data Integration and semantic web

Let’s try again

81,100,000 results?! (almost 50% more)

WTF?

Page 11: Linked Data Integration and semantic web

Let’s try again

Page 12: Linked Data Integration and semantic web

And now?!?!

Page 13: Linked Data Integration and semantic web

We need data! Machines process data!

How do we solve this? APIs? Mashups?

Page 14: Linked Data Integration and semantic web

Web Challenges…

Increase content structure

Provide semantics to data

Establish links among contents

Publish standardized data

Page 15: Linked Data Integration and semantic web

Web Evolution

Rich data

Vocabularies

Semantics

Page 16: Linked Data Integration and semantic web

Presenting…

“The Semantic Web is the extension of the World Wide Web that enables people to share content beyond the boundaries of applications and websites. It has been described in rather different ways: as a utopic vision, as a web of data, or merely as a natural paradigm shift in our daily use of the Web. Most of all, the Semantic Web has inspired and engaged many people to create innovative semantic technologies and applications.”

semanticweb.org

Page 17: Linked Data Integration and semantic web

Semantic Web

Unique Identifiers (URI)

Data = Resources

Easy sharing!

Page 18: Linked Data Integration and semantic web

Semantic Web
But… how do we represent data on the Web?

Example - Traditional way (tuples):

Student

Id  Name              Former Institution  Birthplace
01  Diego Pessoa      UFPB                Campina Grande/PB
02  Everaldo Netto    FAL                 Palmeiras/PE
03  Gabrielle Karine  UTFPR               Medianeira/PR
04  Marcelo Iury      UFCG                Fortaleza/CE

Page 19: Linked Data Integration and semantic web

Semantic Web
But… how do we represent data on the Web?

Example - Traditional way (tuples):

1) A row: 01 Diego Pessoa UFPB Campina Grande/PB

2) A column: Former Institution (UFPB, FAL, UTFPR, UFCG)

We need something more!

Page 20: Linked Data Integration and semantic web

We need triples!

Subject           Predicate     Object
Gabrielle Karine  Was born in   Medianeira/PR
Diego Pessoa      Studied in    UFPB

Extra links:

Subject           Predicate     Object
Campina Grande    Is in         Paraíba
Gabrielle Karine  Is friend of  Everaldo Netto
FAL               Is in         Alagoas
Maceió            Is part of    Alagoas
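
As a minimal sketch of the idea, the statements on this slide can be written as (subject, predicate, object) tuples in plain Python. The `match` helper is hypothetical, introduced only for illustration; it is not part of any RDF library.

```python
# Each fact is one (subject, predicate, object) triple, taken from the slide.
triples = [
    ("Gabrielle Karine", "was born in", "Medianeira/PR"),
    ("Diego Pessoa", "studied in", "UFPB"),
    ("Campina Grande", "is in", "Paraíba"),
    ("Gabrielle Karine", "is friend of", "Everaldo Netto"),
    ("FAL", "is in", "Alagoas"),
]


def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that agree with every field that is not None."""
    return [
        (s, p, o)
        for (s, p, o) in triples
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    ]


# Everything stated about one subject, with no fixed set of columns.
about_gabrielle = match(triples, subject="Gabrielle Karine")

# Every "is in" statement, whatever the subject is.
locations = match(triples, predicate="is in")
```

Unlike the tuple table, a new kind of fact needs no schema change: it is just one more triple in the list.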

Page 21: Linked Data Integration and semantic web

Triples as Graphs

[Figure: graph with nodes Diego Pessoa, Campina Grande, Paraíba, Brazil, Gabrielle Karine, Everaldo Netto, Alagoas and Maceió; edges labeled "Was born in", "Is in", "Is part of" and "Is friend of"; part of the data comes from DBpedia]

Combining different sources!
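
A rough sketch of what "combining different sources" can look like in code: two triple lists are merged into one adjacency structure and then traversed. Both sources and the `reachable` helper are assumptions made for illustration; the facts mirror the node and edge labels on this slide.

```python
from collections import defaultdict

# Hypothetical "local" source: facts about people.
local_source = [
    ("Diego Pessoa", "was born in", "Campina Grande"),
]

# Hypothetical "external" source (facts of the kind DBpedia holds).
external_source = [
    ("Campina Grande", "is in", "Paraíba"),
    ("Paraíba", "is part of", "Brazil"),
]

# Merging is a union of triples: a shared node name joins the two graphs.
graph = defaultdict(list)
for s, p, o in local_source + external_source:
    graph[s].append((p, o))


def reachable(graph, start):
    """Collect every node reachable from `start` by following edges."""
    seen, stack = [], [start]
    while stack:
        node = stack.pop()
        for _predicate, target in graph.get(node, []):
            if target not in seen:
                seen.append(target)
                stack.append(target)
    return seen


places = reachable(graph, "Diego Pessoa")
```

Neither source alone connects Diego Pessoa to Brazil; the path only exists after the merge.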

Page 22: Linked Data Integration and semantic web

But… how do we identify different resources?

Diego Pessoa (CIn) =? Diego Pessoa (IFPB)

URI (Uniform Resource Identifier)
Ex.: CPF, ISBN, URL

cin.ufpe.br/~derp is same as diegopessoa.com#about

[Figure labels: Web App 1, Web App 2, Web App 3, Web App 4]
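
One possible way for an application to use "is same as" statements is to group URIs that identify the same resource. The sketch below does this with a small union-find structure. The first two URIs come from the slide (the `http://` scheme is an assumption); the `example.org` URIs and all function names are hypothetical.

```python
# "is same as" statements between URIs. The example.org URI is hypothetical.
same_as = [
    ("http://cin.ufpe.br/~derp", "http://diegopessoa.com#about"),
    ("http://diegopessoa.com#about", "http://example.org/people/diego"),
]

parent = {}


def find(uri):
    """Return the representative URI of the group `uri` belongs to."""
    parent.setdefault(uri, uri)
    while parent[uri] != uri:
        parent[uri] = parent[parent[uri]]  # path halving keeps chains short
        uri = parent[uri]
    return uri


def union(a, b):
    """Record that `a` and `b` identify the same resource."""
    parent[find(a)] = find(b)


for a, b in same_as:
    union(a, b)


def same_resource(a, b):
    """True if the two URIs are linked, directly or through other URIs."""
    return find(a) == find(b)
```

Here `same_resource("http://cin.ufpe.br/~derp", "http://example.org/people/diego")` holds even though no single statement links those two URIs directly.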

Page 23: Linked Data Integration and semantic web

Semantic Web Stack

Page 24: Linked Data Integration and semantic web

And how about Linked Data?

“Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods.”

linkeddata.org

“A term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF.”

Wikipedia

Page 25: Linked Data Integration and semantic web

Linked Data Principles

1. Use URIs as names for things.

2. Use HTTP URIs, so that people can look up those names.

3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL).

4. Include links to other URIs, so that they can discover more things.

Tim Berners-Lee. Linked Data - Design Issues, 2006. http://www.w3.org/DesignIssues/LinkedData.html
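
Principles 2 and 3 amount to an ordinary HTTP lookup in which the client states which representation it wants. The following sketch only builds such a request with Python's standard library and does not send it. The `build_lookup` name is hypothetical, and whether a given server answers with Turtle is an assumption about that server.

```python
from urllib.request import Request


def build_lookup(uri, media_type="text/turtle"):
    """Prepare (but do not send) an HTTP GET that asks for an RDF serialization."""
    return Request(uri, headers={"Accept": media_type}, method="GET")


request = build_lookup("http://dbpedia.org/resource/Berlin")
# Sending it would look like: urllib.request.urlopen(request).read()
```

The `Accept` header is what lets the same URI serve HTML to a browser and RDF to a program.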

Page 26: Linked Data Integration and semantic web

LOD Cloud

Page 27: Linked Data Integration and semantic web

Guidelines to publish linked data

1. Proper URI creation

Always use HTTP

Avoid technical details (ex.: cin.ufpe.br:8080/~derp/index.php)

Keep stable and persistent addresses

Feel free to use unique identifiers (ex.: #isbn-number, #cpf)
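
These guidelines can be partly checked by machine. The sketch below flags a non-HTTP scheme, an explicit port and a file extension; `uri_warnings` and its messages are hypothetical, and the slide's example is given an explicit `http://` scheme so that it parses unambiguously.

```python
import posixpath
from urllib.parse import urlsplit


def uri_warnings(uri):
    """Return a list of guideline violations found in `uri`."""
    parts = urlsplit(uri)
    warnings = []
    if parts.scheme not in ("http", "https"):
        warnings.append("not an HTTP URI")
    if parts.port is not None:
        warnings.append("exposes a port number")
    if posixpath.splitext(parts.path)[1]:
        warnings.append("exposes a file extension")
    return warnings


bad = uri_warnings("http://cin.ufpe.br:8080/~derp/index.php")
good = uri_warnings("http://dbpedia.org/resource/Berlin")
```

Stability and persistence cannot be checked this way; they depend on how the publisher maintains the addresses over time.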

Page 28: Linked Data Integration and semantic web

Guidelines to publish linked data

2. Use dereferenceable URIs

Hash URI (Ex.: entity Berlin): http://linkeddata.openlinksw.com/about/Berlin#this

Slash URI (Ex.: entity Berlin): http://dbpedia.org/resource/Berlin
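
The practical difference between the two styles shows up in what the client actually requests: the fragment of a hash URI is stripped before the HTTP request is made, while a slash URI is requested as written. A small sketch using the standard library (the `document_to_fetch` name is hypothetical):

```python
from urllib.parse import urldefrag


def document_to_fetch(uri):
    """Return (URL requested over HTTP, fragment kept on the client side)."""
    url, fragment = urldefrag(uri)
    return url, fragment


hash_uri = "http://linkeddata.openlinksw.com/about/Berlin#this"
slash_uri = "http://dbpedia.org/resource/Berlin"

hash_result = document_to_fetch(hash_uri)
slash_result = document_to_fetch(slash_uri)
```

With a hash URI the entity and the document describing it share one request; with a slash URI the server has to tell them apart itself, commonly by redirecting to a separate document URL.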

Page 29: Linked Data Integration and semantic web

Guidelines to publish linked data

3. RDF link creation

Manual or automatic

External/internal links

Friend of a Friend (FOAF)

Semantically-Interlinked Online Communities (SIOC)

Simple Knowledge Organization System (SKOS)

Description of a Project (DOAP)

Creative Commons (CC)

Dublin Core (DC)
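
As an illustration of RDF links built from shared vocabularies, the sketch below writes three N-Triples lines: a FOAF name, an internal `foaf:knows` link and an external `owl:sameAs` link. The FOAF and OWL namespace URIs are the published ones; every `example.org` URI and the `ntriple` helper are hypothetical, and the literal escaping is deliberately minimal.

```python
FOAF = "http://xmlns.com/foaf/0.1/"
OWL = "http://www.w3.org/2002/07/owl#"


def ntriple(subject, predicate, obj, literal=False):
    """Serialize one triple as an N-Triples line (escapes only \\ and ")."""
    if literal:
        escaped = obj.replace("\\", "\\\\").replace('"', '\\"')
        obj_term = f'"{escaped}"'
    else:
        obj_term = f"<{obj}>"
    return f"<{subject}> <{predicate}> {obj_term} ."


# Hypothetical URIs minted for the people and places in this talk.
gabrielle = "http://example.org/people/gabrielle-karine"
everaldo = "http://example.org/people/everaldo-netto"
berlin = "http://example.org/places/berlin"

lines = [
    ntriple(gabrielle, FOAF + "name", "Gabrielle Karine", literal=True),
    # Internal link: both URIs live in the same dataset.
    ntriple(gabrielle, FOAF + "knows", everaldo),
    # External link: points into another dataset.
    ntriple(berlin, OWL + "sameAs", "http://dbpedia.org/resource/Berlin"),
]
```

Reusing a well-known vocabulary such as FOAF means other applications can interpret the link without knowing anything about the local schema.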

Page 30: Linked Data Integration and semantic web

Guidelines to publish linked data

4. Make additional ways to access the data explicit

Provide a SPARQL endpoint

The Jena framework provides endpoint implementations: Joseki and Fuseki

Formats: XML, JSON, RDF/XML, Turtle, N3, HTML
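
From the client side, a SPARQL endpoint is an HTTP URL that accepts a query and returns results in a negotiated format. The sketch below builds such a request (query passed in the `query` parameter of a GET, JSON results requested) without sending it. The endpoint URL is hypothetical, and `build_sparql_request` is an illustrative name, not part of Jena or any other library.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://example.org/sparql"  # hypothetical endpoint URL

QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name WHERE { ?person foaf:name ?name } LIMIT 10
"""


def build_sparql_request(endpoint, query, accept="application/sparql-results+json"):
    """Prepare (but do not send) a SPARQL query as an HTTP GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": accept})


request = build_sparql_request(ENDPOINT, QUERY)
# Sending it would look like: urllib.request.urlopen(request).read()
```

Changing the `accept` argument is how a client would ask for one of the other result formats, provided the endpoint supports it.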

Page 31: Linked Data Integration and semantic web

Guidelines to publish linked data

5. Standards to publish linked data

Tools for RDF conversion from CSV, XML, relational data, spreadsheets (Ex.: ConvertRDF)

Load the data into a triple store (RDF store)

RDF store publishing: provide an interface to access the Linked Data and a SPARQL endpoint.
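
To make the conversion step concrete, here is a minimal sketch that turns CSV rows into N-Triples lines, one line per cell. It does not reproduce any particular conversion tool: the `example.org` namespace, the URI layout and the `csv_to_ntriples` name are all assumptions, and the sample rows reuse the student table from earlier in the talk.

```python
import csv
import io

BASE = "http://example.org/"  # hypothetical namespace for the generated URIs

CSV_DATA = """id,name,former_institution
01,Diego Pessoa,UFPB
02,Everaldo Netto,FAL
"""


def csv_to_ntriples(text, base=BASE):
    """Map each row to a subject URI and each column to a predicate URI."""
    lines = []
    for row in csv.DictReader(io.StringIO(text)):
        subject = f"<{base}student/{row['id']}>"
        for column, value in row.items():
            if column == "id":
                continue  # the id is already encoded in the subject URI
            predicate = f"<{base}schema/{column}>"
            escaped = value.replace("\\", "\\\\").replace('"', '\\"')
            lines.append(f'{subject} {predicate} "{escaped}" .')
    return lines


ntriples = csv_to_ntriples(CSV_DATA)
```

The resulting lines are the kind of input a triple store would load before exposing the data through an endpoint.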

Page 35: Linked Data Integration and semantic web

Domain-Specific Applications

http://revyu.com (Review anything)

DBpedia Mobile (DBpedia + Revyu + Flickr)

Page 37: Linked Data Integration and semantic web

Research Challenges

User Interfaces and Interaction Paradigms

Application Architectures

Schema Mapping and Data Fusion

Link Maintenance

Licensing

Trust, Quality and Relevance

Privacy

Christian Bizer, Tom Heath and Tim Berners-Lee (2009) Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems, Vol. 5(3), Pages 1-22. DOI: 10.4018/jswis.2009081901

Page 38: Linked Data Integration and semantic web

Linked Data
Data Integration and Semantic Web

Diego Pessoa

Thanks!