Top Banner
Introduction to Semantic Web Many of the slides of this chapter are from http://www.csd.uoc.gr/~hy566/Han douts.htm
52

Introduction to Semantic Web

Feb 25, 2016

Download

Documents

livi

Introduction to Semantic Web. Many of the slides of this chapter are from http://www.csd.uoc.gr/~hy566/Handouts.htm. What is the Semantic Web (SW)?. WWW: collection of distributed interlinked documents encoded in html Content written in natural language - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introduction to  Semantic Web

Introduction to Semantic Web

Many of the slides of this chapter are from http://www.csd.uoc.gr/~hy566/Handouts.htm

Page 2: Introduction to  Semantic Web

What is the Semantic Web (SW)?

• WWW: collection of distributed interlinked documents encoded in html– Content written in natural language– Computers don’t understand their meaning

• In SW, machine readable annotations are added and web-pages are linked by virtue of similar content– Content of Web-pages is encoded by special vocabularies

called “ontologies”– SW offers new capabilities

Euripides G.M. Petrakis Introduction to Semantic Web 2

Page 3: Introduction to  Semantic Web

Today’s Web

• Most of today’s Web content is suitable for human consumption – Even Web content that is generated automatically

from databases is usually presented without the original structural information found in databases

• Typical Web uses today people’s– seeking and making use of information, searching for

and getting in touch with other people, reviewing catalogs of online stores and ordering products by filling out forms

Euripides G.M. Petrakis Introduction to Semantic Web 3

Page 4: Introduction to  Semantic Web

Keyword-Based Search Engines

• Current Web activities are not particularly well supported by software tools– Except for keyword-based search engines (e.g.,

Google, AltaVista, Yahoo)• The Web would not have been the huge

success it was, were it not for search engines

Euripides G.M. Petrakis Introduction to Semantic Web 4

Page 5: Introduction to  Semantic Web

Problems of Keyword-Based Search Engines

• Low or no recall• Results are highly sensitive to vocabulary • Results are single Web pages • Human involvement is necessary to interpret

and combine results• Results of Web searches are not readily

accessible by other software tools

Euripides G.M. Petrakis Introduction to Semantic Web 5

Page 6: Introduction to  Semantic Web

The Key Problem of Today’s Web

• The meaning of Web content is not machine-accessible: lack of semantics

• It is simply difficult to distinguish the meaning between these two sentences:– I am a professor of computer science.– I am a professor of computer science, you may

think. Well, . . .

Euripides G.M. Petrakis Introduction to Semantic Web 6

Page 7: Introduction to  Semantic Web

The Semantic Web Approach

• Represent Web content in a form that is more easily machine-processable.

• Use intelligent techniques to take advantage of these representations.

• The Semantic Web will gradually evolve out of the existing Web, it is not a competition to the current WWW

Euripides G.M. Petrakis Introduction to Semantic Web 7

Page 8: Introduction to  Semantic Web

Semantic Web Enabled Knowledge Management

• Knowledge will be organized in conceptual spaces according to its meaning

• Automated tools for maintenance and knowledge discovery

• Semantic query answering • Query answering over several documents• Defining who may view certain parts of information

(even parts of documents) will be possible.

Euripides G.M. Petrakis Introduction to Semantic Web 8

Page 9: Introduction to  Semantic Web

The Semantic Web Impact – B2C Electronic Commerce

• A typical scenario: user visits one or several online shops, browses their offers, selects and orders products.

• Ideally humans would visit all, or all major online stores; but too time consuming

• Shopbots are a useful tool

Euripides G.M. Petrakis Introduction to Semantic Web 9

Page 10: Introduction to  Semantic Web

Limitations of Shopbots

• They rely on wrappers: extensive programming required

• Wrappers need to be reprogrammed when an online store changes its outfit

• Wrappers extract information based on textual analysis– Error-prone– Limited information extracted

Euripides G.M. Petrakis Introduction to Semantic Web 10

Page 11: Introduction to  Semantic Web

Semantic Web Enabled B2C Electronic Commerce

• Software agents that can interpret the product information and the terms of service– Pricing and product information, delivery and

privacy policies will be interpreted and compared to the user requirements.

• Information about the reputation of shops • Sophisticated shopping agents will be able to

conduct automated negotiations

Euripides G.M. Petrakis Introduction to Semantic Web 11

Page 12: Introduction to  Semantic Web

The Semantic Web Impact – B2B Electronic Commerce

• Greatest economic promise• Currently relies mostly on EDI (Electronic Data

Interchange)– Isolated technology, understood only by experts– Difficult to program and maintain, error-prone– Each B2B communication requires separate

programming • Web appears to be perfect infrastructure

– But B2B not well supported by Web standards

Euripides G.M. Petrakis Introduction to Semantic Web 12

Page 13: Introduction to  Semantic Web

Semantic Web Enabled B2B Electronic Commerce

• Businesses enter partnerships without much overhead

• Differences in terminology will be resolved using standard abstract domain models

• Data will be interchanged using translation services • Auctioning, negotiations, and drafting contracts will

be carried out automatically (or semi-automatically) by software agents

Euripides G.M. Petrakis Introduction to Semantic Web 13

Page 14: Introduction to  Semantic Web

Semantic Web Technologies

• Explicit Metadata• Ontologies• Logic and Inference• Agents

Euripides G.M. Petrakis Introduction to Semantic Web 14

Page 15: Introduction to  Semantic Web

On HTML

• Web content is currently formatted for human readers rather than programs

• HTML is the predominant language in which Web pages are written (directly or using tools)

• Vocabulary describes presentation

Euripides G.M. Petrakis Introduction to Semantic Web 15

Page 16: Introduction to  Semantic Web

An HTML Example

<h1>Agilitas Physiotherapy Centre</h1> Welcome to the home page of the Agilitas Physiotherapy

Centre. Do you feel pain? Have you had an injury? Let our staff Lisa Davenport, Kelly Townsend (our lovely secretary) and Steve Matthews take care of your body and soul.

<h2>Consultation hours</h2> Mon 11am - 7pm<br> Tue 11am - 7pm<br> Wed 3pm - 7pm<br> Thu 11am - 7pm<br> Fri 11am - 3pm<p>But note that we do not offer consultation during the weeks of

the <a href=". . .">State Of Origin</a> games…

Euripides G.M. Petrakis Introduction to Semantic Web 16

Page 17: Introduction to  Semantic Web

Problems with HTML

• Humans have no problem with this• Machines (software agents) do:

– How distinguish therapists from the secretary, – How determine exact consultation hours – They would have to follow the link to the State Of

Origin games to find when they take place.

Euripides G.M. Petrakis Introduction to Semantic Web 17

Page 18: Introduction to  Semantic Web

A Better Representation

<company><treatmentOffered>Physiotherapy</treatmentOffered><companyName>Agilitas Physiotherapy Centre</companyName><staff>

<therapist>Lisa Davenport</therapist><therapist>Steve Matthews</therapist><secretary>Kelly Townsend</secretary>

</staff></company>

Euripides G.M. Petrakis Introduction to Semantic Web 18

Page 19: Introduction to  Semantic Web

Explicit Metadata

• This representation is far more easily processable by machines

• Metadata: data about data – Metadata capture part of the meaning of data

• Semantic Web does not rely on text-based manipulation, but rather on machine-processable metadata

Euripides G.M. Petrakis Introduction to Semantic Web 19

Page 20: Introduction to  Semantic Web

Ontologies [Jepsen 2009]

• The term ontology originates from philosophy – The study of the nature of existence

• An ontology is an explicit and formal specification of a conceptualization

• A method for representing items of knowledge (e.g., ideas, facts, thinks) in a way that defines the relationships (e.g., part-of, functional) and classifications of concepts within a specified domain of knowledge

Euripides G.M. Petrakis Introduction to Semantic Web 20

Page 21: Introduction to  Semantic Web

Typical Components of Ontologies

• Terms denote important concepts (classes of objects) of the domain – e.g., professors, staff, students, courses, departments

• Relationships between these terms: typically class hierarchies– a class C to be a subclass of another class C' if every

object in C is also included in C' – e.g., all professors are staff members

Euripides G.M. Petrakis Introduction to Semantic Web 21

Page 22: Introduction to  Semantic Web

Further Components of Ontologies

• Properties: – e.g., X teaches Y

• Value restrictions – e.g. , only faculty members can teach courses

• Disjointness statements – e.g. , faculty and general staff are disjoint

• Logical relationships between objects – e.g., every department must include at least 10 faculty

Euripides G.M. Petrakis Introduction to Semantic Web 22

Page 23: Introduction to  Semantic Web

Example of a Class Hierarchy

Euripides G.M. Petrakis Introduction to Semantic Web 23

Page 24: Introduction to  Semantic Web

The Role of Ontologies on the Web

• Ontologies provide a shared understanding of a domain: semantic interoperability– overcome differences in terminology – mappings between ontologies

• Ontologies are useful for the organization and navigation of Web sites

Euripides G.M. Petrakis Introduction to Semantic Web 24

Page 25: Introduction to  Semantic Web

The Role of Ontologies in Web Search

• Ontologies are useful for improving the accuracy of Web searches – search engines can look for pages that refer to a precise

concept in an ontology • Web searches can exploit generalization /

specialization information – If a query fails to find any relevant documents, the search

engine may suggest to the user a more general query– If too many answers are retrieved, the search engine may

suggest to the user some specializations

Euripides G.M. Petrakis Introduction to Semantic Web 25

Page 26: Introduction to  Semantic Web

Web Ontology Languages

• In XML there is no intended meaning associated with the nesting of tags

• There are many ways of representing the same meaning– <course name=“Discrete Math”>

<lecturer>David Billington</lecturer></course>

– <lecturer name=“David Billington”><teaches>Discrete Math</teaches>

</lecturer>– <teachingoffering>

<lecturer>David Billington</lecturer> <course>Discrete Math</course>

</teachingoffering>

Euripides G.M. Petrakis Introduction to Semantic Web 26

Page 27: Introduction to  Semantic Web

RDF Schema

• RDF is a data model for objects and relations between them– Does not provide syntax for defining classes, properties

and cannot related to one another• RDF Schema is a vocabulary description language

– Uses RDF and describes properties and classes of RDF resources

– RDFS vocabulary: http://www.w3.org/2000/01/rdf-schema – Provides semantics for generalization hierarchies

of properties and classes

Euripides G.M. Petrakis Introduction to Semantic Web 27

Page 28: Introduction to  Semantic Web

RDF – RDFS layers

Euripides G.M. Petrakis Introduction to Semantic Web 28

Page 29: Introduction to  Semantic Web

Web Ontology Languages

• OWL: A richer ontology language built on RDF• OWL adds semantics and vocabulary to RDF

and RDFS giving it more power to express:– relations between classes e.g., disjointness– Cardinality e.g. “exactly one”– Characteristics of properties (e.g., symmetry)

Euripides G.M. Petrakis Introduction to Semantic Web 29

Page 30: Introduction to  Semantic Web

Logic and Inference

• Logic is the discipline that studies the principles of reasoning

• Formal languages for expressing knowledge• Well-understood formal semantics

– Declarative knowledge: we describe what holds without caring about how it can be deduced

• Automated reasoners can deduce (infer) conclusions from the given knowledge

Euripides G.M. Petrakis Introduction to Semantic Web 30

Page 31: Introduction to  Semantic Web

An Inference Example

prof(X) faculty(X)faculty(X) staff(X)prof(michael)

We can deduce the following conclusions:faculty(michael)staff(michael)prof(X) staff(X)

Euripides G.M. Petrakis Introduction to Semantic Web 31

Page 32: Introduction to  Semantic Web

Logic versus Ontologies

• The previous example involves knowledge typically found in ontologies– Logic can be used to uncover ontological

knowledge that is implicitly given – It can also help uncover unexpected relationships

and inconsistencies• Logic is more general than ontologies• The more expressive a logic is, the more

computationally expensive it becomes to draw conclusions

Euripides G.M. Petrakis Introduction to Semantic Web 32

Page 33: Introduction to  Semantic Web

Software Agents

• Software agents work autonomously and proactively

• A personal agent on the Semantic Web will:– receive some tasks and preferences from the

person– seek information from Web sources, communicate

with other agents– compare information about user requirements

and preferences, make certain choices– give answers to the user

Euripides G.M. Petrakis Introduction to Semantic Web 33

Page 34: Introduction to  Semantic Web

Semantic Web Agent Technologies

• Metadata – Identify and extract information from Web sources

• Ontologies– Web searches, interpret retrieved information – Communicate with other agents

• Logic– Process retrieved information, draw conclusions

Euripides G.M. Petrakis Introduction to Semantic Web 34

Page 35: Introduction to  Semantic Web

Semantic Web Agent Technologies (2)

• Further technologies (orthogonal to the Semantic Web technologies)– Agent communication languages– Formal representation of beliefs, desires, and

intentions of agents– Creation and maintenance of user models.

Euripides G.M. Petrakis Introduction to Semantic Web 35

Page 36: Introduction to  Semantic Web

A Layered Approach

• The development of the Semantic Web proceeds in steps– Each step building a layer on top of another

Principles:• Downward compatibility • Upward partial understanding

Euripides G.M. Petrakis Introduction to Semantic Web 36

Page 37: Introduction to  Semantic Web

The Semantic Web Layer Tower

Euripides G.M. Petrakis Introduction to Semantic Web 37

Page 38: Introduction to  Semantic Web

Semantic Web Layers

• XML layer– Syntactic basis, allows one to write structured Web documents

• RDF layer– RDF basic data model for facts– RDF Schema simple ontology language

• Ontology layer– More expressive languages than RDF Schema– OWL plus a rule-based language (SWRL)

• Proof layer– Deduction process, representation of proofs, proof validation

• Trust layer– Ensures trustful information and operations

Euripides G.M. Petrakis Introduction to Semantic Web 38

Page 39: Introduction to  Semantic Web

SW Top Layers

• Logic, Proof and Trust layers: research areas and simple application demonstrations are known to exist

• The Logic layer enables the writing of rules• TheProof layer executes the rules and

evaluates together with the Trust layer whether to trust the given proof or not for the application at hand

Euripides G.M. Petrakis Introduction to Semantic Web 39

Page 40: Introduction to  Semantic Web

Semantic Web Example [Golbeck et.al, 2004]

• Consider the example of making a page about a trip to Paris

• The page contains text describing the trip, photos, dates, names, places visited, hotels stayed etc.

• In the existing Web the page is indexed by keywords and perhaps by the links included there

• There is no way for a computer to understand its content and associate it with similar content (trips, persons etc.) elsewhere or draw conclusions about the trip – Date of trip “May 20-25”: cannot infer whether the person was

or was not on travel on the 26th

Euripides G.M. Petrakis Introduction to Semantic Web 40

Page 41: Introduction to  Semantic Web

A Photo

• Name: ParisPhoto 1URL: http://www.example.com/photo1.jpgDate taken: May 25, 2009Location: Parc Du Champ De Mars, ParisPerson in photo: John DoePerson in Photo: Joe Blog

Object in Photo: Eiffel Tower

Euripides G.M. Petrakis Introduction to Semantic Web 41

Page 42: Introduction to  Semantic Web

People in Photo• Name: John Doe

First Name: JohnLast Name: DoeAge: 24

• Name: Joe BlogFirst Name: JoeLast Name: BlogAge: 26

Euripides G.M. Petrakis Introduction to Semantic Web 42

Page 43: Introduction to  Semantic Web

Classes

• SW resources are encoded in terms of classes, properties of classes, instances of classes forming ontologies – IS_A hierarchies and other relationships e.g., part-of, functional,

etc.• SW resources are given unique names and referred to by

their URI (e.g., URL of the owl document describing the trip)– http://www.example.com/parisTrip.owl#ParisPhoto1– http://www.example.com/parisTrip.owl#JohnDoe– http://www.example.com/parisTrip.owl#joeBlog

• This allows authors to make references to them from elsewhere

Euripides G.M. Petrakis Introduction to Semantic Web 43

Page 44: Introduction to  Semantic Web

Encoding Information

• The basic information unit is the triple (subject, predicate, object or value)

• Subject: http://www.example.com/parisTrip.owl#JohnDoePredicate: http://www.example.com/parisTrip.owl#ageValue: 23

• Subject: http://www.example.com/parisTrip.owl#JohnDoePredicate: http://www.example.com/parisTrip#firstNameValue: John

• Subject: http://www.example.com/parisTrip.owl#ParisPhoto1Predicate: http://www.example.com/parisTrip.owl#objectInPhotoobject: http://www.example.com/parisHistoryOntology.owl#EifelTower

Euripides G.M. Petrakis Introduction to Semantic Web 44

Page 45: Introduction to  Semantic Web

RDF graphs• Each triple is a graph with two nodes, • All descriptions are encoded into an RDF graph

• There are also super-classes “photo”, “people” from where photo1 and these persons inherited their properties

Euripides G.M. Petrakis Introduction to Semantic Web 45

photo 1

John Doe JoeBlog

May 26, 2009

Eiffel Tower

Paris

Joe

Blog

24

John

Doe

23

firstname

lastname

age

firstname

lastname

agepersoninphoto personinphoto

Paris.jpg

location filedateobjectinophoto

person

Photo

Page 46: Introduction to  Semantic Web

RDF syntax

• <rdf:RDFxmlns:http://www.w3c.org/2000/01/rdf-schema #>….</rdf:RDF>

• Specified that tags prefixed with rdf: are part of the namespace described at http://www.w3.org/2000....

Euripides G.M. Petrakis Introduction to Semantic Web 46

Page 47: Introduction to  Semantic Web

RDFS - Classes

• Classes are general categories that can be instantiated – <rdfs:Class rdf:ID=“Photo”/>– The class is assigned a unique name

• Classes can also be sub-classed– <rdfs:Class rdf:ID=“Photo”>

<rdfs:subclassof rdf:resource=http://example.com/mediaOntology.rdf#Image/></rdfs:Class>

• Subclasses are transitive: if X is a subclass of Y which is a subclass of Z then X is a subclass of Z

Euripides G.M. Petrakis Introduction to Semantic Web 47

Page 48: Introduction to  Semantic Web

RDF Properties

• Properties describe attributes• All properties in RDF are described as instances of class

rdf:Property• Eg., <rdf:Property rdf:ID=“objectInPhoto”>

<rdfs:domain rdf:resource=“#Photo”/></rdf:Property>

• Declares the property “objectOnPhoto” which can be attached to any class

• In this example the domain of this property is limited to #Photo• Because the Photo class is declared within the same namespace

(file) as the objectInPhoto the reference to it can be abbreviated

Euripides G.M. Petrakis Introduction to Semantic Web 48

Page 49: Introduction to  Semantic Web

RDF Properties (2)

• Similarly to classes and subclasses, properties can become subProperties of a more general property

• The person in photo can be a subset of the object in photo property<rdf:Property rdf:ID=“personInPhoto”><rdfs:subPropertyOf rdf:resource=“#objectInPhoto”/><rdfs:range rdf:resource=“#Person”/></rdf:Property>

• The range restriction limits the type of values

Euripides G.M. Petrakis Introduction to Semantic Web 49

Page 50: Introduction to  Semantic Web

RDF Instances

• Any person name can become instance of the class Person<Person rdf:ID=“JoeBlog”>

<firstName>Joe</firstName><lastName>Blog<lastName>

</Person>

Euripides G.M. Petrakis Introduction to Semantic Web 50

Page 51: Introduction to  Semantic Web

OWL

• The OWL (Web Ontology Languate) is a vocabulary extension of RDFS that adds more expressivity power to RDFS– Since OWL is built on RDF, any RDF graph forms a valid

OWL ontology • OWL adds many new features:

– Relation types between classes (e.g., disjointness)– Cardinality of properties (e.g., exactly one)– Equality, symmetry, inverse, transitive, characteristics

of properties etc

Euripides G.M. Petrakis Introduction to Semantic Web 51

Page 52: Introduction to  Semantic Web

References

• “A Semantic Web Primer”, Grigoris Antoniou, Frank van Harmelen

• http://www.ics.forth.gr/isl/swprimer/• http://www.csd.uoc.gr/~hy566/• www.semanticweb.org• http://www.w3.org/• “Just What is an Ontology Anyway?”, Thomas C.

Jepsen, IT PRO Sept/Oct. 2009• “Organization and Structure of Information using

Semantic Web Technologies”, Golbeck et.al. 2004

Euripides G.M. Petrakis Introduction to Semantic Web 52