Faculté Polytechnique
Université de Mons
DBPedia Project
Pierre Masure, Laurent Kalambayi
[OBJECTIVE] What can it do?
DBPEDIA PROJECT
INTRODUCTION
OBJECTIVE
CONTEXT AND MOTIVATION
TOOL
WHO DEVELOPS IT?
WHO USES IT?
HOW BIG IS IT?
WORKING
BASICS
WORKING
TECHNICAL FLAVOURS
RDF: DATABASE
PHP5: INFORMATION EXTRACTION
SPARQL: ACCESS TO DATABASE
DEMO
EXAMPLE
CONCLUSION
OPINION
PERSPECTIVES
Pierre Masure | Web technology: DBPedia Project
[CONTEXT]
Knowledge bases
[TOOL] To extract structured information
[Who develops it?]
DBPedia
University of Leipzig
Free University of Berlin
OpenLink Software
[Who uses DBPedia?]
DBPedia
GeoNames
MusicBrainz
Eurostat
CIA Factbook
Project Gutenberg
…
[Status] How big is DBPedia?
2 180 000 entities
36 000 movies
62 000 music albums
293 000 places
80 000 people
489 000 links to images
…
License: GNU Free Documentation License (GFDL)
[How does it work?]
[Working: which cases?]
Revolutionize Wikipedia Search
Include DBpedia Data in Your Web Page
Mobile and Geographic Applications
Document Classification, Annotation and Social Bookmarking
Multi-Domain Ontology
Nucleus for the Web of Data
Support Wikipedia Authors with Editing Suggestions
[From raw data to RDF]
Wikipedia:
[RDF: Basics]
Resource Description Framework (RDF):
a language for representing information about resources in the World Wide Web
RDF is based on the idea of identifying things using Web identifiers (called Uniform Resource Identifiers, or URIs), and describing resources in terms of simple properties and property values.
[RDF uses URIs]
RDF uses URIs to identify:
• individuals, e.g., Eric Miller, identified by http://www.w3.org/People/EM/contact#me
• kinds of things, e.g., Person, identified by http://www.w3.org/2000/10/swap/pim/contact#Person
• properties of those things, e.g., mailbox, identified by http://www.w3.org/2000/10/swap/pim/contact#mailbox
• values of those properties, e.g., mailto:[email protected]
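As a rough illustration of the list above, an RDF statement can be thought of as a (subject, predicate, object) triple whose parts are URIs or literal values. The sketch below uses plain Python tuples. The `Triple` type and the `describe` helper are hypothetical names, not part of any RDF library, and the mailbox address is a placeholder because the address on the slide is not readable.

```python
from typing import NamedTuple

class Triple(NamedTuple):
    """One RDF statement: subject and predicate are URIs, object is a URI or a literal."""
    subject: str
    predicate: str
    obj: str

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
CONTACT = "http://www.w3.org/2000/10/swap/pim/contact#"
ERIC = "http://www.w3.org/People/EM/contact#me"

triples = [
    Triple(ERIC, RDF_TYPE, CONTACT + "Person"),          # kind of thing
    Triple(ERIC, CONTACT + "fullName", "Eric Miller"),   # literal value
    # Placeholder address (assumption): the value on the slide is not readable.
    Triple(ERIC, CONTACT + "mailbox", "mailto:someone@example.org"),
]

def describe(subject: str, statements: list[Triple]) -> dict[str, str]:
    """Collect the property/value pairs stated about one subject."""
    return {t.predicate: t.obj for t in statements if t.subject == subject}

for predicate, value in describe(ERIC, triples).items():
    print(predicate, "->", value)
```

Every part of a statement except a literal value is itself a URI, which is what lets independent data sets refer to the same individual or property.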
[RDF: XML]
<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:contact="http://www.w3.org/2000/10/swap/pim/contact#">
  <contact:Person rdf:about="http://www.w3.org/People/EM/contact#me">
    <contact:fullName>Eric Miller</contact:fullName>
    <contact:mailbox rdf:resource="mailto:[email protected]"/>
    <contact:personalTitle>Dr.</contact:personalTitle>
  </contact:Person>
</rdf:RDF>
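To make the link between this XML serialization and the underlying statements concrete, here is a minimal sketch that reads such a document with Python's standard `xml.etree.ElementTree` module and lists the triples it encodes. It only handles the flat shape used in this example (a typed node with property children) and is not a general RDF/XML parser. The helper names are hypothetical and the mailbox address is a placeholder.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Same structure as the slide; the mailbox value is a placeholder (assumption).
RDF_XML = """<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:contact="http://www.w3.org/2000/10/swap/pim/contact#">
  <contact:Person rdf:about="http://www.w3.org/People/EM/contact#me">
    <contact:fullName>Eric Miller</contact:fullName>
    <contact:mailbox rdf:resource="mailto:someone@example.org"/>
    <contact:personalTitle>Dr.</contact:personalTitle>
  </contact:Person>
</rdf:RDF>"""

def tag_to_uri(tag):
    """ElementTree writes names as '{namespace}local'; join both parts into one URI."""
    namespace, local = tag[1:].split("}")
    return namespace + local

def simple_triples(xml_text):
    """Return (subject, predicate, object) tuples for typed nodes with property children."""
    root = ET.fromstring(xml_text)
    result = []
    for node in root:
        subject = node.get("{%s}about" % RDF_NS)
        # The element name of the node gives its rdf:type.
        result.append((subject, RDF_NS + "type", tag_to_uri(node.tag)))
        for prop in node:
            # A property value is either a URI (rdf:resource) or literal text.
            value = prop.get("{%s}resource" % RDF_NS) or (prop.text or "").strip()
            result.append((subject, tag_to_uri(prop.tag), value))
    return result

for triple in simple_triples(RDF_XML):
    print(triple)
```

The four printed tuples correspond to the type statement plus the three properties of the `contact:Person` element.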
[Extraction Engine: PHP5]
DBPedia Information Extraction Framework:
The framework consists of the interfaces
• Destination, Extractor, PageCollection and RDFnode
plus the essential classes
• ExtractionGroup, ExtractionJob, ExtractionManager, ExtractionResult and RDFtriple.
Laurent KALAMBAYI | Web technology: DBPedia Project
The framework is available under the GNU GPL license.
interface Extractor {
    /** @return uri */
    public function getExtractorID();

    public function start($language);

    /** @return ExtractionResult */
    public function extractPage($pageID, $pageTitle, $pageSource);

    /** @return ExtractionResult */
    public function finish();
}
Example: Interface Extractor
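For readers more at home in Python, the same contract can be sketched as an abstract base class. This is only an analogy to the PHP interface, not the framework's code: the `LabelExtractor` class, its extractor ID, the use of `rdfs:label` and the resource URI prefix are assumptions made for illustration, and `ExtractionResult` is reduced to a plain list of triples.

```python
from abc import ABC, abstractmethod

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

class Extractor(ABC):
    """Python analogue of the PHP interface (method names in snake_case)."""

    @abstractmethod
    def get_extractor_id(self): ...

    @abstractmethod
    def start(self, language): ...

    @abstractmethod
    def extract_page(self, page_id, page_title, page_source): ...

    @abstractmethod
    def finish(self): ...

class LabelExtractor(Extractor):
    """Hypothetical extractor that emits one label triple per page."""

    def get_extractor_id(self):
        return "http://example.org/extractors/LabelExtractor"  # made-up ID

    def start(self, language):
        self.language = language  # remember which Wikipedia edition is processed

    def extract_page(self, page_id, page_title, page_source):
        subject = "http://dbpedia.org/resource/" + page_id  # assumed URI scheme
        return [(subject, RDFS_LABEL, page_title)]

    def finish(self):
        return []  # nothing left to flush in this toy example

extractor = LabelExtractor()
extractor.start("en")
result = extractor.extract_page("Mons", "Mons", "'''Mons''' is a city in Belgium.")
print(result)
```

The life cycle mirrors the interface: `start` is called once per language, `extract_page` once per article, and `finish` once at the end for any remaining output.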
[SPARQL queries] to access the database
• Access to the information: queries in SPARQL (SPARQL Protocol and RDF Query Language)
• Database: Resource Description Framework (RDF)
SPARQL allows a query to consist of triple patterns, conjunctions, disjunctions, and optional patterns…
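The building blocks named above can be illustrated with a small in-memory matcher. This is a toy sketch, not how a real SPARQL engine is implemented: the data, the `ex:` names and all function names are made up, and terms starting with `?` are treated as variables.

```python
# Toy triple store: (subject, predicate, object) tuples. All data is made up.
TRIPLES = [
    ("ex:Alice", "ex:knows", "ex:Bob"),
    ("ex:Alice", "ex:name", "Alice"),
    ("ex:Bob", "ex:name", "Bob"),
    ("ex:Bob", "ex:mbox", "mailto:bob@example.org"),
]

def match_pattern(pattern, triple, binding):
    """Extend `binding` so that `pattern` matches `triple`; return None on failure."""
    new = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable must keep the value it was bound to earlier.
            if new.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return new

def match_all(patterns, triples, binding=None):
    """Conjunction: every pattern must match, sharing variable bindings."""
    binding = binding or {}
    if not patterns:
        return [binding]
    results = []
    for triple in triples:
        extended = match_pattern(patterns[0], triple, binding)
        if extended is not None:
            results.extend(match_all(patterns[1:], triples, extended))
    return results

def match_optional(required, optional, triples):
    """Optional pattern: keep a solution even when the optional part has no match."""
    out = []
    for binding in match_all(required, triples):
        extended = match_all(optional, triples, binding)
        out.extend(extended if extended else [binding])
    return out

def match_union(left, right, triples):
    """Disjunction: the solutions of either alternative."""
    return match_all(left, triples) + match_all(right, triples)

# Everyone with a name, plus their mailbox when one is known.
for row in match_optional([("?p", "ex:name", "?n")], [("?p", "ex:mbox", "?m")], TRIPLES):
    print(row)
```

In this run Alice is returned without a `?m` binding and Bob with one, which is the behaviour an optional pattern is meant to give.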
PREFIX abc: <http://example.com/exampleOntology#>
SELECT ?capital ?country
WHERE {
  ?x abc:cityname ?capital ;
     abc:isCapitalOf ?y .
  ?y abc:countryname ?country ;
     abc:isInContinent abc:Africa .
}
Example of SPARQL query
Returns all country capitals in Africa
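To see how the shared variables `?x` and `?y` tie the four patterns together, the query can be evaluated by hand on a toy data set. The sketch below hard-codes the join in plain Python. The `ex:` identifiers, the sample triples and the function names are made up for illustration, and a real SPARQL engine would plan and execute the query quite differently.

```python
# Made-up sample data using the property names of the example query.
TRIPLES = [
    ("ex:Cairo", "abc:cityname", "Cairo"),
    ("ex:Cairo", "abc:isCapitalOf", "ex:Egypt"),
    ("ex:Egypt", "abc:countryname", "Egypt"),
    ("ex:Egypt", "abc:isInContinent", "abc:Africa"),
    ("ex:Paris", "abc:cityname", "Paris"),
    ("ex:Paris", "abc:isCapitalOf", "ex:France"),
    ("ex:France", "abc:countryname", "France"),
    ("ex:France", "abc:isInContinent", "abc:Europe"),
    ("ex:Nairobi", "abc:cityname", "Nairobi"),
    ("ex:Nairobi", "abc:isCapitalOf", "ex:Kenya"),
    ("ex:Kenya", "abc:countryname", "Kenya"),
    ("ex:Kenya", "abc:isInContinent", "abc:Africa"),
]

def values(triples, subject, predicate):
    """All objects o such that (subject, predicate, o) is in the data."""
    return [o for s, p, o in triples if s == subject and p == predicate]

def african_capitals(triples):
    """Hand-written evaluation of the four triple patterns of the query."""
    rows = []
    for x, predicate, capital in triples:
        if predicate != "abc:cityname":                   # ?x abc:cityname ?capital
            continue
        for y in values(triples, x, "abc:isCapitalOf"):   # ?x abc:isCapitalOf ?y
            if "abc:Africa" not in values(triples, y, "abc:isInContinent"):
                continue                                  # ?y abc:isInContinent abc:Africa
            for country in values(triples, y, "abc:countryname"):
                rows.append((capital, country))           # ?y abc:countryname ?country
    return rows

print(african_capitals(TRIPLES))  # [('Cairo', 'Egypt'), ('Nairobi', 'Kenya')]
```

Paris is dropped because France is not linked to `abc:Africa`, even though the first three patterns match it.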
[Demo] (1)
URI Reference: http://dbpedia.org/page/Name
URL Source from Wikipedia: http://en.wikipedia.org/wiki/Name
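The correspondence between the two addresses can be expressed as a small helper. This is a minimal sketch that assumes the article name is carried over unchanged, as the two patterns above suggest. The function name is hypothetical and no network request is made.

```python
from urllib.parse import urlsplit

WIKIPEDIA_HOST = "en.wikipedia.org"
WIKIPEDIA_PREFIX = "/wiki/"
DBPEDIA_PAGE = "http://dbpedia.org/page/"

def wikipedia_to_dbpedia(url):
    """Map an English Wikipedia article URL to the matching DBpedia page URL."""
    parts = urlsplit(url)
    if parts.netloc != WIKIPEDIA_HOST or not parts.path.startswith(WIKIPEDIA_PREFIX):
        raise ValueError("not an English Wikipedia article URL: " + url)
    name = parts.path[len(WIKIPEDIA_PREFIX):]  # e.g. "Mons"
    return DBPEDIA_PAGE + name

print(wikipedia_to_dbpedia("http://en.wikipedia.org/wiki/Mons"))
```

For the article on Mons, the helper returns http://dbpedia.org/page/Mons.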
[Demo] (2)
Filtering:
http://dbpedia.neofonie.de/browse/
[Opinion]
Advantages
DBPedia is a powerful tool
Integration of DBPedia in applications
• A new set of powerful applications!
DBPedia is free
• A lot of web applications will be created!
Disadvantages
Uncontrolled information extraction
Uncontrolled information
Very useful but can be irrelevant
What did we learn?
• How a Semantic Web project works: DBPedia
• Notions of RDF, PHP5 and SPARQL
• The Semantic Web gives a new dimension to the web (search, …)
[Perspectives]
Thank you for your attention.