Top Banner
BIT 3193 MULTIMEDIA DATABASE CHAPTER 4 : QUERING MULTIMEDIA DATABASES
45

BIT 3193 MULTIMEDIA DATABASE

Mar 16, 2016

Download

Documents

winter

BIT 3193 MULTIMEDIA DATABASE. CHAPTER 4 : QUERING MULTIMEDIA DATABASES. What is QUERY ??. Language expression that describes the data to be retrieved from database. Query. omponents. Data item : as output. Information base : search is to be made. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: BIT 3193  MULTIMEDIA DATABASE

BIT 3193 MULTIMEDIA DATABASE

CHAPTER 4 : QUERING MULTIMEDIA DATABASES

Page 2: BIT 3193  MULTIMEDIA DATABASE

Language expression that describes the data to be retrieved from database.

Page 3: BIT 3193  MULTIMEDIA DATABASE

Data item : as output

Information base : search is to be made

Condition : have to be satisfied for a data item to be selected

Page 4: BIT 3193  MULTIMEDIA DATABASE

Manipulation (editing and modifying data)

Presentation

Analysis (indexing and searching)

Page 5: BIT 3193  MULTIMEDIA DATABASE
Page 6: BIT 3193  MULTIMEDIA DATABASE

• Multimedia queries can be of different types (deal with different media and with different properties)• The process of querying multimedia data is complex and it can approach this in TWO ways:

• WHAT INFORMATION CAN BE RETRIEVED• HOW THE INFORMATION CAN BE RETRIEVED

Page 7: BIT 3193  MULTIMEDIA DATABASE

• There are THREE levels of complexity associated with the “WHAT”:a)Level 1 :

• Retrieval of primitive features such as color, shape, texture, spatial location and object movement.

• Query example:“ find clips of objects flying from top-

right to bottom-left of screen”

Page 8: BIT 3193  MULTIMEDIA DATABASE

b) Level 2 : • Retrieval of logical features related to

the identity of the object within the media.

• Query example:“ find clips of an aeroplane taking off”

Page 9: BIT 3193  MULTIMEDIA DATABASE

c) Level 3 : • Retrieval of abstract attributes

associated with an understanding of the nature or purpose of the object.

• Query example:“ find a picture of nutritional disasters"

Page 10: BIT 3193  MULTIMEDIA DATABASE

• The “ HOW” can also be classified on the basis of whether the information is retrieved by:

• Query on the Content (Content based query)• Query by Example (QBE)• Time Indexed Query• Spatial Query•Application Specific Query

Page 11: BIT 3193  MULTIMEDIA DATABASE

Content BasedQuery

• The content of media information is described by the metadata associated with media objects.

• Hence, these queries have to be processed by:

• accessing directly the metadata• then the media objects

• Example:“Show the details of the movie where a cartoon character says: “I like you”

Page 12: BIT 3193  MULTIMEDIA DATABASE

Query byExample

(QBE)

• Have to be processed by finding a similar object that matches the one in the example.

• The query processor has to identify exactly the characteristics of the example object the user wants to match.

• The similarity matching required by the user can be on texture, color, spatial characteristics or the shape of the object.

Page 13: BIT 3193  MULTIMEDIA DATABASE

Query byExample

(QBE)

• Matching can be exact or partial.• For partial matching, the query processor has to identify the degree of mismatch that can be tolerated.

• Then the query processor has to apply the cluster generation function for the example media object.

• These cluster generating functions map the example object into an m- dimensional feature space.

Page 14: BIT 3193  MULTIMEDIA DATABASE

Query byExample

(QBE)

• Object present within this distance d are retrieved with a certain measure of confidence and are presented as an ordered list.

• Here the distance d is proportional to the degree of mismatch that can be tolerated.

• Example: “ Show me the movie which contains this song: Flying Without Wings”

Page 15: BIT 3193  MULTIMEDIA DATABASE

Time IndexedQuery

• These queries are made on the temporal characteristics of the media objects.

• The temporal characteristics can be stored using segment index trees.

• The query processor has to process the time indexed queries by accessing the index information stored using segment trees or other similar methods.

Page 16: BIT 3193  MULTIMEDIA DATABASE

Time IndexedQuery

• Example : “Show me the movie 30 minutes after its start”

Page 17: BIT 3193  MULTIMEDIA DATABASE

SpatialQuery

• There are made on the spatial characteristics associated with the media objects.

• These spatial characteristics can be generated as metadata information.

• The query processor can access this metadata information to generate the response.

• Example: “ Show me the image where Ali is seen to the left of Abu”

Page 18: BIT 3193  MULTIMEDIA DATABASE

ApplicationSpecificQuery

• Application specific descriptions can be stored as metadata information.

• The query processor can access this information for generating purposes.

• Example : “ Show me the video where the river changes its course”

Page 19: BIT 3193  MULTIMEDIA DATABASE

• Involve single media.• Example : text

TextIndex

TextDatabase

OtherMedia DBs

Response to Query

Text MediaQuery

Figure 4.1 : Processing Single Media Query

Page 20: BIT 3193  MULTIMEDIA DATABASE

• Query Process:• Assuming the existence of metadata for the text information:

• the indexed file is accessed first• and the information is presented to the user

Page 21: BIT 3193  MULTIMEDIA DATABASE

• Involve more than one media.• Example : text and image

TextDatabase

ImageIndex

ImageDatabase

Response to Query

Text &Image MediaQuery

Figure 4.2 : Processing Multiple Media Query

TextIndex

ImageDatabase

TextIndex

TextDatabase

Response to Query

Text &Image MediaQuery

ImageIndex

a) Accessing Text Index First

a) Accessing Image Index First

Page 22: BIT 3193  MULTIMEDIA DATABASE

• Query Process:• can be in TWO different ways:

a) Accessing Text Index First• Select an initial set of documents• This set of documents are examined to determine whether any documents contains the image object specified in the query.• This implies that documents carries the information regarding the contained images.

Page 23: BIT 3193  MULTIMEDIA DATABASE

b) Accessing Image Index First• Select a set images.• Examined to determine whether images are part of any document.• This strategy assumes that the information regarding the containment of images in documents are maintained as a separated information base.

Page 24: BIT 3193  MULTIMEDIA DATABASE
Page 25: BIT 3193  MULTIMEDIA DATABASE

• Access to multimedia information must be quick so that retrieval time is minimal.• Metadata must be stored using appropriate index structure to provide efficient access.• Index structures to be used depend on the media, the metadata and the types of the query.• SQL language is relevant to multimedia data.

Page 26: BIT 3193  MULTIMEDIA DATABASE
Page 27: BIT 3193  MULTIMEDIA DATABASE

Standard language for dealing with relational databases

Page 28: BIT 3193  MULTIMEDIA DATABASE

• developed from an earlier Structured English Query Language, SEQUEL.• with a lot of improvement, SQL become the de facto standard of the database world.• includes the specification for a data dictionary called the information schema.

Page 29: BIT 3193  MULTIMEDIA DATABASE

• Involves TWO process:• create the structure of the table• followed by inserting the data

• example:• Create the structure

CREATE TABLE employee

(employee_no CHAR (4),

employee_name VARCHAR2(30),

salary NUMBER(6,2))

Page 30: BIT 3193  MULTIMEDIA DATABASE

• Insert:INSERT INTO employee

(employee_no CHAR (4),

employee_name VARCHAR2(30),

salary NUMBER(6,2))

VALUES

(‘123B’, ‘Ali Ahmad’, ‘1666.22’)

Page 31: BIT 3193  MULTIMEDIA DATABASE

• Update:UPDATE employee

SET employee_name =‘Abu Ahmad’

WHERE employee_no = ‘123B’

Page 32: BIT 3193  MULTIMEDIA DATABASE

• SQL statements:SELECT (select list)

FROM (table list)

WHERE (search condition)

• Example:SELECT employee_name

FROM employee

WHERE employee_no = ‘123B’

Page 33: BIT 3193  MULTIMEDIA DATABASE
Page 34: BIT 3193  MULTIMEDIA DATABASE

• Text objects consist of words.• Easy to recognize because within the body of text can be delineated by spaces.• Words consist of character strings.

• These features enable to process easily using query language such as SQL.

Page 35: BIT 3193  MULTIMEDIA DATABASE

• However there are some problems with words• meaningless words such as “and, but, the, from, to”

• these are called stop words• deal with them by removing or ignoring them

• synonymy when a word has the same meaning as another word• polysemy when a word has more than one meaning in different context

Page 36: BIT 3193  MULTIMEDIA DATABASE

• Principles of techniques specialized for text:• text retrieval (TR)• Schema-directed extraction (SDE)• query-directed extraction (QDE)

Page 37: BIT 3193  MULTIMEDIA DATABASE

Text Retrieval

• simplest form of processing• documents are returned in the result set if they are considered relevant to the query• method to achieve:

• exact matching• inexact matching• proximity searches• intelligent searches, ect.

Page 38: BIT 3193  MULTIMEDIA DATABASE

• using BOOLEAN queriesSELECT cocoa_name, price

FROM cocoa_list

WHERE region = ‘Ghana’ AND category = ‘ABC’

Page 39: BIT 3193  MULTIMEDIA DATABASE

• the uncertain spelling problem can be solved by using SOUNDEX function

SELECT cocoa_name, SOUNDEX (cocoa_type)

FROM cocoa_list

WHERE SOUNDEX (cocoa_type)=SOUNDEX (‘couverture’)

Coco-type SOUN

-------------------------------

couverture C130

Page 40: BIT 3193  MULTIMEDIA DATABASE

• SOUNDEX codes

Character SOUNDEX Codes

B,F,P,V

C,G,J,K,Q,S,X,Z

D,T

L

M,N

R

1

2

3

4

5

6

Table 4.1 : SOUNDEX codes

Page 41: BIT 3193  MULTIMEDIA DATABASE

• vowel sounds “A,E,H,I,O,U,W,Y” are not assigned a value.

Page 42: BIT 3193  MULTIMEDIA DATABASE

• can be used in both exact and inexact search conditions.• LIKE keyword is used to match a column with a known string or part of a string• example:

SELECT coco_type

FROM coco_list

WHERE coco_type LIKE ‘cou%’

Page 43: BIT 3193  MULTIMEDIA DATABASE

• also use LIKE to search topics within character data• example:

SELECT coco_type

FROM coco_list

WHERE character LIKE ‘%chocolate%’

• This query can be expressed using natural language as “Give me type of each cocoa that has character that is described by a string that contains the substring “chocolate” with any number of characters before or after”

Page 44: BIT 3193  MULTIMEDIA DATABASE

Query-Directed

extraction

• it is useful for semi-structured text documents such as business or scientific domains where documents are laid out to conform to a pattern

Page 45: BIT 3193  MULTIMEDIA DATABASE

Schema-Directed

extraction

• process that manipulates large collections of related text objects such as e-mail

• schema : e-mail {sender, receiver, date, subject, persons, places}