Top Banner
Faculté Polytechnique DBPedia Project Pierre Masure Laurent Kalambayi
24

Présentation Web Technology

Jul 07, 2015

Download

Documents

PierreMASURE

DBPedia
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Présentation Web Technology

Faculté Polytechnique

DBPedia Project

Pierre MasureLaurent Kalambayi

Page 2: Présentation Web Technology

Université de Mons

[OBJECTIVE]What can it do?

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

2Pierre Masure | Web technology: DBPedia Project

Page 3: Présentation Web Technology

Université de Mons

Knowledge bases

[CONTEXT]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

3Pierre Masure | Web technology: DBPedia Project

Page 4: Présentation Web Technology

Université de Mons

[TOOL]To extract structured information

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

4Pierre Masure | Web technology: DBPedia Project

Page 5: Présentation Web Technology

Université de Mons

[Who develops it ?] DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

5Pierre Masure | Web technology: DBPedia Project

DBPedia

Universityof Leipzig

Universityof Berlin

OpenlinkSoftware

Page 6: Présentation Web Technology

Université de Mons

[Who uses DBPedia ?] DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

6Pierre Masure | Web technology: DBPedia Project

DPBedia

GeoNames

MusicBrainz

Eurostat

CIA Factbook

Project Gutenberg

Page 7: Présentation Web Technology

Université de Mons

[Status]How big is DBPedia ?

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

7Pierre Masure | Web technology: DBPedia Project

2 180 000 entities

36 000 movies

62 000 music

albums

293 000 places

80 000 people

489 000 links to images

GFDL

Page 8: Présentation Web Technology

Université de Mons

[Who does it work?] DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

8Pierre Masure | Web technology: DBPedia Project

Page 9: Présentation Web Technology

Université de Mons

[Working: which cases?] DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

9Pierre Masure | Web technology: DBPedia Project

Revolutionize Wikipedia Search

Include DBpedia Data in Your Web Page

Mobile and Geographic Applications

Document Classification, Annotation and Social Bookmarking

Multi-Domain Ontology

Nucleus for the Web of Data

Support Wikipedia Authors with Editing Suggestions

Page 10: Présentation Web Technology

Université de Mons

[From brute data to RDF]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

10Pierre Masure | Web technology: DBPedia Project

Wikipedia:

Page 11: Présentation Web Technology

Université de Mons

[RDF: Basics]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

11Pierre Masure | Web technology: DBPedia Project

Resource Description Framework (RDF):

a language for representing information about resources in the World Wide Web

RDF is based on the idea of identifying things using Web identifiers (called Uniform Resource Identifiers, or URIs)

Page 12: Présentation Web Technology

Université de Mons

[RDF uses URIs]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

12Pierre Masure | Web technology: DBPedia Project

RDF uses URIs to identify:• individuals, e.g., Eric Miller, identified by

http://www.w3.org/People/EM/contact#me• kinds of things, e.g., Person, identified by

http://www.w3.org/2000/10/swap/pim/contact#Person• properties of those things, e.g., mailbox, identified by

http://www.w3.org/2000/10/swap/pim/contact#mailbox• values of those properties, e.g. mailto:[email protected]

Page 13: Présentation Web Technology

Université de Mons

[RDF: XML]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

13Pierre Masure | Web technology: DBPedia Project

<?xml version="1.0"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:contact="http://www.w3.org/2000/10/swap/pim/contact#"> <contact:Person rdf:about="http://www.w3.org/People/EM/contact#me"> <contact:fullName>Eric Miller</contact:fullName><contact:mailbox rdf:resource="mailto:[email protected]"/> <contact:personalTitle>Dr.</contact:personalTitle> </contact:Person> </rdf:RDF>

Page 14: Présentation Web Technology

Université de Mons

[Extraction Motor PHP5]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

14

DBPedia Information Extraction Framework:

The framework consists of the interfaces

• Destination, Extractor, Page collection & RDFnode

Plus the essential classes

• Extraction Group, Extraction Job, Extraction Manager, Extraction Result, and RDFtriple.

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 15: Présentation Web Technology

Université de Mons

[Extraction Motor PHP5]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

15

The framework is available from the GNU GPL License.

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 16: Présentation Web Technology

Université de Mons

[Extraction Motor PHP5]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

16

interface Extractor{

/** @return uri */ public function getExtractorID(); public function start($language); /** @return ExtractionResult */ public function extractPage($pageID, $pageTitle, $pageSource); /** @return ExtractionResult */ public function finish();

}

Example: Interface Extractor

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 17: Présentation Web Technology

Université de Mons

[SPARQL requests]to access to the database

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

17

• Access to the info

Requests in SPARQL

• Database

Resource Description Framework

SPARQL Protocol And RDF Query Language

SPARQL allows for a query to consist of triple patterns, conjunctions, disjunctions, and optional patterns…

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 18: Présentation Web Technology

Université de Mons

[SPARQL requests]to access to the database

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

18

PREFIX abc: <http://example.com exampleOntology#>

SELECT ?capital ?country

WHERE {

?x abc:cityname ?capital ;

abc:isCapitalOf ?y .

?y abc:countryname ?country ;

abc:isInContinent abc:Africa .

}

Example of SPARQL query

Returns all country capitals in Africa

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 19: Présentation Web Technology

Université de Mons

[Demo] (1)

19

URI Reference: http://dbpedia.org/page/NameURL Source from Wikipedia:http://en.wikipedia.org/wiki/Name

Laurent KALAMBAYI | Web technology: DBPedia Project

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

Page 20: Présentation Web Technology

Université de Mons

[Demo] (2)

20

Filtering:

http://dbpedia.neofonie.de/browse/

Laurent KALAMBAYI | Web technology: DBPedia Project

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

Page 21: Présentation Web Technology

Université de Mons

[Opinion]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

21

Advantages

DBPedia is a powerful tool

Integration of DBPedia in applications

• new set of super applications !

DBPedia is free

• A lot of web applications will be created !

Disadvantages

Uncontrolled information extraction

Uncontrolled information

Very useful but can be irrelevant

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 22: Présentation Web Technology

Université de Mons

[Opinion]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

22

• Working of a semanticweb subject: DBPedia

• RDF, PHP5, SPARQL notions

• Semantic web gives a new dimension to the web (search,…)

Whatdidwe

learn

Laurent KALAMBAYI | Web technology: DBPedia Project

Page 23: Présentation Web Technology

Université de Mons

[Perspectives]DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

23Laurent KALAMBAYI | Web technology: DBPedia Project

Page 24: Présentation Web Technology

Université de Mons

Thank you for your attention.

DBPEDIA PROJECT

INTRODUCTION

OBJECTIVE

CONTEXT AND MOTIVATION

TOOL

WHO DEVELOPS IT?

WHO USES IT?

HOW BIG IS IT?

WORKING

BASICS

WORKING

TECHNICAL FLAVOURS

RDF: DATABASE

PHP5: INFORMATION

EXTRACTION

SPARQL: ACCESS TO

DATABASE

DEMO

EXAMPLE

CONCLUSION

OPINION

PERSPECTIVES

24Laurent KALAMBAYI | Web technology: DBPedia Project