Top Banner
Content Based Image Retrieval (CBIR) Michele Nappi, Ph.D [email protected] 089-963334
44

Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

Aug 15, 2018

Download

Documents

duongbao
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

Content Based Image Retrieval

(CBIR)Michele Nappi, Ph.D

[email protected]

089-963334

Page 2: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 2

Overview

• Introduce CBIR

• Applications of CBIR

• Research Prototypes

• Commercial Systems

• Evaluation

Page 3: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 3

Content Based Image Retrieval

• What is CBIR?

– Its purpose is to retrieve, from a database,

(collections) images that are relevant to a

query.

– Finding images which are “similar” to a query.

• Query: The whole or parts of an example image.

• Similarity based on either the whole image or parts

of the image.

Page 4: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 4

Examples of Image Similarity

Page 5: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 5

Examples of Image Similarity (cont.)

Page 6: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 6

Examples of Image Similarity (cont.)

Page 7: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 7

Examples of Image Similarity (cont.)

Page 8: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 8

Issues/Questions

• How is it different from text retrieval?

• Is it difficult? Why?• How do we define/specify a query?

• How do we index and represent a multimedia object?

• What is an architecture of a CBIR system?

Page 9: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 9

Information Retrieval

Documents

Query

Indexing

Representation

Indexing

Matching

Query

Features

Ranked

Result Set

Page 10: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 10

Conceptual view of CBIR

Queries ImagesIndexingIndexed

ImagesSimilarity

Computation

Retrieved

Images

Page 11: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 11

Why is Image Retrieval

Difficult?• Text retrieval

– Word is a natural unit.

– A word has a semantic meaning.

• Image retrieval– No natural unit.

– A pixel has no semantic meaning.

– Parts of objects• Example: Parts of a human body, parts of an animal

• Difficult to automatically segment.

– An object’s image depends on many factors• Viewpoint, illumination, shadows etc.

– Other complications like background.

Page 12: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 12

The Effects of Aging

• Can you guess which person on

the right matches the person on

the left?

– The pictures on the left are

of high school students.

– The pictures on the right

were taken 20 yrs later.

• This is hard.

• If this is hard for people, how

can an image retrieval system

do this?

Page 13: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 13

Who want CBIR technology?• Crime prevention

– Fingerprint and face matching

• Journalism and Publishing

• Stock photo libraries

• Medical diagnosis

• Geographical Information Systems

• Cultural heritage

• Education and training

• Home entertainment (Video on Demand)

• Web searching

Page 14: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 14

What is “Similarity”

• Ultimately user defines “similarity”.

• What is “similar”– Cars of a given model or

all cars?

– Red colored cars?

• Local or Global similarity?– Similarity of parts?

– Similarity of the entire image?

How does one find similarity?

What features?

Metric distance?

Non-metric distance?

Page 15: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 15

What is “Similarity” (cont.)

– False Alarms

• (not qualifyng image) Immagini non

significative inserite nell’insieme

risposta (answer set)

– False Dismissals

• (Qualifyng but not retrieved images)

Immagini significative non inserite

nell’insieme risposta

Page 16: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 16

• An image is likely to be interpretable at more than one level

• An image might satisfy a visual information need by dint of its generic, specific or abstract content

• The delight and frustration of pictorial resources is that a picture can mean different things to different people

Image Features

Page 17: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 17

Image Features (cont.)

• Using Primitive features

– Texture, Colors, Shapes, Spatial

Relationships…

• Using Derived (logical) features

– Object of a given type, person…

• Using Abstract features

– Events, emotional significance

Page 18: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 18

Image Features: Primitive

• Level 1 comprises retrieval by primitive features such as colour, texture, shape or the spatial location of image elements. Examples of such queries might include “find pictures with long thin dark objects in the top left-hand corner”, “find images containing yellow stars arranged in a ring” – or most commonly “find me more pictures that look like this”. This level of retrieval uses features (such as a given shade of yellow) which are both objective, and directly derivable from the images themselves, without the need to refer to any external knowledge base. Its use is largely limited to specialist applications such as trademark registration, identification of drawings in a design archive, or colour matching of fashion accessories.

Page 19: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 19

Image Features: Primitive (cont.)

1. Find all images that

have 20% of purple

and 40% of green

2. Find all images that

have 60% of green

Page 20: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 20

Image Features: Primitive (cont.)

Page 21: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 21

Image Features: Primitive (cont.)

• How can be

implemented

Retrieval using

Color Histograms?

Page 22: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 22

25

50

0

Image Features: Primitive (cont.)

Page 23: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 23

Match Histogram Algorithm

• Euclidean Metric??

– n-dimensional space• Image 1: (C11, C12, C13,……C1n)

• Image 2: (C21, C22, C23,……C2n)

– Huge Computing Time

• DFT + Euclideam Metrics

– Cut off frequencies (keeping first k coefficients)

• k<<n

Page 24: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 24

Match Histogram Algorithm (cont.)

• Th.del Parseval:

– Let X be the

Discrete Fourier

Transform of the

sequence x. Then

we have:

• No False Dismissals

1

0

1

0

22n

i

n

u

ui Xx

lyrespective and of ansformFourier tr are , where

,,

yxYX

YXDyxD

nk and ly,respective

and of ansformFourier tr are , where

,,

yxYX

YXDYXD nk

Page 25: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 25

Parseval Theorem: Application

QUERY Tollerance

or

similarity=P

Space/Time domain:

n-dimensional space

Frequencies Domain:

k-dimensional space

(k<<n)

QUERY Tollerance

or

similarity=P

Query

Included in answer set [Distance from query ≤ P]

Not included in answer set (n-dimensional) or false alarms (k-dimensional)

Page 26: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 26

Perseval Theorem: Application (cont.)

Shape Contour Texture

Page 27: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 27

Image Features: Logical

• Level 2 comprises retrieval by derived (sometimes known as logical)

features, involving some degree of logical inference about the identity of the

objects depicted in the image. It can usefully be divided further into:

– retrieval of objects of a given type (e.g. “find pictures of a double-

decker bus”);

– retrieval of individual objects or persons (“find a picture of the

Eiffel tower”).

• To answer queries at this level, reference to some outside store of

knowledge is normally required – particularly for the more specific queries at

level 2. In the first example above, some prior understanding is necessary to

identify an object as a bus rather than a lorry; in the second example, one

needs the knowledge that a given individual structure has been given the

name “the Eiffel tower”. Search criteria at this level, particularly at level 2,

are usually still reasonably objective.

Page 28: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 28

Image Features: Logical (cont.)

• Find all CT images

that have a symmetric

circular nodule in left

lung, vertically

included between

spine and aorta.

Query

Answer Set

Page 29: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 29

Image Features: Abstract

• Level 3 comprises retrieval by abstract attributes, involving a significant amount of high-level reasoning about the meaning and purpose of the objects or scenes depicted. Again, this level of retrieval can usefully be subdivided into:

– retrieval of named events or types of activity (e.g. “find pictures of Scottish folk dancing”);

– retrieval of pictures with emotional or religious significance (“find a picture depicting suffering”).

• Success in answering queries at this level can require some sophistication on the part of the searcher. Complex reasoning, and often subjective judgement, can be required to make the link between image content and the abstract concepts it is required to illustrate. Queries at this level, though perhaps less common than level 2, are often encountered in both newspaper and art libraries.

Page 30: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 30

Levels of Image Description• Content-independent metadata

– Data which is not directly concerned with image

content, but in some way related to it. (Examples:

author’s name, date, location etc.)

• Data which refers to the visual content of

images

– low intermediate features (colour, texture, shape

etc.) known as content-dependent metadata

– data refers to content semantics; known as

content-descriptive metadata

Page 31: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 31

Image Attributes

• Image attributes like color, texture,

appearance and shape are often

correlated with semantics.

• Examples

– Retrieve pictures of Bill Clinton in a crowd

using similarity by appearance.

– Pictures of red and blue parrots using

color.

– Pictures of gorillas using texture.

Page 32: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 32

Architecture of an image

retrieval system (first-generation)

Database

Annotation

(manual)

offline online

Indexing

Query

by text or SQL, .. Visualisation

of results

Page 33: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 33

User in the Loop

• Much easier to build a semi-automatic system

than a fully automatic system.

• Assume that a person will use the system and

will be able to provide some feedback

– Exploit this.

– Facilitate feedback (e.g.. relevance feedback).

– Good user interfaces.

– Person can easily modify query to get better results.

Page 34: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 34

Architecture of Visual Information

retrieval system – New Generation

Query by text/

Visual example Result Viewer / Visualisation

Search Engine

Indexing

Annotation

(manual)

offline

online

Browsin

g

Feature

Extraction

(automatic –

IP/IR)

Page 35: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 35

Query Paradigms

• QbE

– Query by Example

• User provides as example an image

• QbS

– Query by Sketch

• User draws a sketch of query image using visual

tools

Page 36: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 36

Alternatives to CBIR

technology?• Human memory

• Image browsing

• Keyword indexing

• Classification schemes

Page 37: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 37

Manual Indexing vs CBIR

• Indexing expertise widely available in image libraries

• Can use wide range of text retrieval software

BUT:

• Labour intensive

• Can be subjective and unreliable

• Difficult to capture concept of image similarity

• Feature matching both

objective and automatic

• Query formulation by

visual process

BUT:

• Available features don’t

capture image

semantics

• Features don’t

necessarily match

human similarity

judgements

Manual IndexingCBIR

Page 38: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 38

Measures for CBIR

• Efficiency

– Medium response time for retrieval

• Effectiveness

– Quantity of false alarms and false

dismissals

• Minimize False Alarms

• Minimize False Dismissals

Page 39: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 39

Measures for CBIR

• Recall

– The capability of system to retrieve all

relevant images

• Precision

– The capability of system to retrieve only

relevant images

Page 40: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 40

Measures for CBIR: Normalized Recall

• TOT is the number of images in the collection (IDB size).

• Relevant images, for each query, are ranked 1, 2,…, REL,

where REL is the number of relevant images.

• Ideal Rank (IR)

• Average Rank (AR)

REL

r REL

rIR

1

REL

r

r

REL

RankAR

1

Page 41: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 41

Measures for CBIR: Normalized Recall

• Effectiveness = (AR – IR)

– [0; (TOT-REL)]

• 0 (AR = IR) Perfect Retrieval

• (TOT – REL) Worst Case

• Normalized Recall (NR)

– [0;1]

RELTOT

IRARNR

1

Page 42: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 42

Measures for CBIR

• Recall vs Precision

0

0,1

0,2

0,3

0,4

0,5

0,6

0,7

0,8

0,9

1

0,1

2

0,3

4

0,5

5

0,6

5

0,8

6

Recall

Precis

ion

Effectiveness

Page 43: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 43

CBIR Software

• Three main commercial systems for still

images

– QBIC (IBM)

• http://wwwqbic.almaden.ibm.com/

– VIR Image Engine (Virage)

• http://www.virage.com/online

– VisualRetrievalWare (Excalibur)

• http://www.excalib.com/

Page 44: Content Based Image Retrieval (CBIR) - UNISA · 11/06/2016 M. Nappi 3 Content Based Image Retrieval • What is CBIR? –Its purpose is to retrieve, from a database, (collections)

11/06/2016 M. Nappi 44

Research Directions in CBIR.

• New visual interfaces for image access

– switch back and forth between navigation

and browsing

– query tool locate a set of candidate images

and a good visualisation tool to explore this

set

• New models of standards for

representation of visual content

• Web search and tools

• New methodology for evaluation