Top Banner
CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak Bhattacharyya.
32

CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Jan 01, 2016

Download

Documents

Sharleen Dorsey
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

CS621 Seminar on: “Intelligent Database Systems”

By:-Dhaval Bonde (MTech1-08305910)-Prasad Gawde (MTech2-07305908)

Under the Guidance of Prof. Pushpak Bhattacharyya.

Page 2: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Outline :

Introduction Current types of Databases Features of Ideal Intelligent DB Evolutionary approach Expert System Case Study : Petrographer Comparison: Traditional DB and IDB Conclusion References

Page 3: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Introduction:Going beyond Relational Concepts

‘Database’ means a ‘repository of data’. It is not just tables and relationships. May contain documents and articles.

Page 4: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Introduction:

“Information Anxiety”

Categories

“Information- Management”

Library DB

No results

“Information”

Library DB

Huge List

Of

Documents

Case 1:

David

This example is inspired from “D. Sleight, ‘Intelligent Databases: Easing Access to Information,’ Michigan State University, Spring 1993.”

Page 5: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Introduction:

“Information”Library

DB

5153 results

Library DB

“Anxiety”3996 results

- Information AND Anxiety

- Information ADJ Anxiety

• Synonyms for Information Anxiety ?

This example is inspired from “D. Sleight, ‘Intelligent Databases: Easing Access to Information,’ Michigan State University, Spring 1993.”

Page 6: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Introduction:

Case 2:

“Information Anxiety”

Library DB

Leah

“Information”

+

“Anxiety”

+

“Information Anxiety”

“Information Anxiety”

Some appropriate

items

+

Synonyms for “Information

Anxiety”

Synonyms not useful

? Justify Selections

•100 common imp words in example doc, and ranked them in order of occurrence

•Compared those words with all docs in DB

•Displayed those with high no. of matches

Rules, words, weightings

This example is inspired from “D. Sleight, ‘Intelligent Databases: Easing Access to Information,’ Michigan State University, Spring 1993.”

Page 7: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Introduction:

• Identifies negative examples (rules, weights)

• Increases weightings of phrases that define “Information Anxiety”

Search AgainRelevant Results!

• Database stores the history of her search

This example is inspired from “D. Sleight, ‘Intelligent Databases: Easing Access to Information,’ Michigan State University, Spring 1993.”

Page 8: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Current Types of Databases:

Full Text Databases: Character string search Will return no documents if the exact string does not match. Many variations have to be tried.

Page 9: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Current Types of Databases:

Indexed Keywords: Each item identified by keywords. Keywords are assigned by the author or database manager. Exact keyword should be known by the user to get correct

documents. May involve lot of guessing.

Page 10: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Current Types of Databases:

Hypertext Links: Links between non-hierarchical but related types of

information. Nebulous searches easier. Links followed may or may not match the user’s thoughts.

Eg. user may click the name of a person mentioned in economics article to know his contribution to economics but may end up getting the person’s biography instead!

Page 11: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

So what is an Intelligent Database?

An intelligent database is a full-text database that employs Artificial Intelligence (AI), interacting with users to ensure that returned items (hits) contain the most relevant information possible.

John Hopkins Institute online glossary / Whatis.com

Page 12: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Features of an Ideal Intelligent DB

Feedback Eg of what the user is searching for and what he is NOT.

Interface Windows showing previous searches.

Help Guiding the user to formulate queries.

Selecting search terms Alternate search terms from an online thesaurus.

Displaying results of a search Relevance of each hit, query words highlighted.

Page 13: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Traditional DB , Intelligent System & IDBs :

Page 14: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

“Evolutionary" approach:

The amalgamation or extension of existing technologies into hybrid forms. There are 3 possible configurations of DB-AI integration:

Extending existing AI system (Loose coupling) DB system and AI system are segregated with an additional component used for relaying the

information between the two. These expert systems automatically formulate the queries and move data between the databases. Transformation processing cost due to segregation of DB and KB. These AI extensions are application specific.

Page 15: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

“Evolutionary" approach:

Extending existing DBMSs (Tight coupling) Deductive databases is the best example of tight coupling. A deductive database system is a database system which can make deductions

(i.e. conclude additional facts) based on rules and facts stored in the (deductive) database.

i.e. it can define relations extensionally (as facts) as well as intentionally (as rules.) in the database.

Ex: Consider an extensional relation “parent” represented as parent(X,Y) where “X” has parent “Y”. Then we can define relations intentionally as,

rule 1: ancestor(X,Y) :- parent(X,Y) rule 2: ancestor(X,Y) :- parent(X,Z), ancestor(Z,Y).

Here rule 1 says , if ‘X’ is a parent of ‘Y’ then ‘X’ is also an ancestor of ‘Y’ and rule 2 says that, if ‘X’ is a parent of ‘Z’ and ‘Z’ is an ancestor of ‘Y’ then ‘X’ is an ancestor of ‘Y’.

Page 16: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

“Evolutionary" approach:

Here relation “ancestor” is defined by rules which says every parent is an ancestor and parent of an ancestor is also an ancestor.

Deductive databases are an extension of relational databases which support more complex data modeling. They are more expressive than relational databases but less expressive than logic programming systems.

The strict approach: The system is built from scratch using total integration of DB technologies and

AI technologies. Very complex and time consuming

Page 17: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Knowledge Graphs:

Knowledge graphs belong to the category of semantic networks. One essential difference between knowledge graphs and semantic networks is the

explicit choice of only a few types of relations. For example, it uses relations such as, CAU : cause-effect relation (eg: unstable market positions cause polarization) AKO : is a kind of (eg: a married man is a kind of man) PAR : is part of (eg: having relations with high status is a part of social capital) Inverse relations are also allowed, such as,

CBY : is caused by HAP : has as part HAK : has a kind

In principle, the composition of a knowledge graph is including concept (tokens and types) and relationship (binary and multivariate relation). Types are labeled points representing generic concepts and token represents an arbitrary instantiations of types. As an example, “Pluto” is a token and “dog” is a type.

Page 18: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Knowledge Graphs: Example

The figure shows a knowledge graph with CAU relation and different concepts in rectangular boxes.

Partial Figure of the figure in “Popping, R. (2003). Knowledge graphs and Network Text Analysis. Social Science Information 42(1): 91-106.”

Page 19: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Expert Systems:

Commonly an expert system would consist of two components: a “knowledge base” (a database of facts and rules), and an “inference engine” (a program that can apply those rules and facts and come up with an

"expert" solution to the question of a novice.)

Example of an expert system is the Himalayan club which we discussed in the class.

Page 20: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Case Study: Petrographer

By considering the symbolic knowledge representation and inferencing strengths of an expert system and the large scale data handling abilities of a relational database (i.e. storage, management and consultation) Petrographer is developed.

The system offers, Sophisticated interface to support rock description (interface) Allows the user to consult large amount of Petrographic data via SQL interface (help) Provides geological interpretation of described petrographic features according to the knowledge

extracted (display of results) Feedback Selecting search term

Page 21: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : Knowledge Modelling

The knowledge is elicited from expert in the form of “cases”.

Knowledge is modeled at 3 levels of abstraction; first from actual domain to knowledge model and from knowledge model to relational model.

Domain knowledge is represented in Object Attribute Value (OAV) structure to preserve most of the semantic contents.

These objects are mapped to a “frame” in symbolic system and also to entities in E-R model.

Thus, these entities are implemented in database.

The figure is taken from the paper “Querying Petrographic Descriptions in an Intelligent Database System” by Cristina Paludo Santos, URI University.

Page 22: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : Knowledge Modelling

Roughly speaking each frame in knowledge model is mapped to an entity in E-R model and each attribute is mapped to a field in E-R model.

The inferential aspect of knowledge is represented by knowledge graphs.

Knowledge graphs capture association between petrographic features and geological interpretations.

Knowledge graphs are also mapped & stored in relational database as a special consultation table.

The figure is taken from the paper “Querying Petrographic Descriptions in an Intelligent Database System” by Cristina Paludo Santos, URI University.

Page 23: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : System Architecture

Modified figure of the figure in the paper “Querying Petrographic Descriptions in an Intelligent Database System” by Cristina Paludo Santos, URI University.

Page 24: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : System Architecture

The “visual interface” allows user to effectively interact with the system.

Users can perform quantitative analysis as well as visual evaluation through various interfaces provided by this module.

Page 25: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : System Architecture

The “symbolic system” is at the heart of the architecture.

Its functions are shared by 3 subsystems. Frame module Inference module Other subsystems

A frame module keeps the schema structure of entire domain. It also maintains the knowledge graphs.

Inference module tries to match several parts of the case submitted by the user against the nodes in the knowledge graph.

Page 26: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : System Architecture

It uses forward chaining for inferencing. The best match is calculated by the

number of user features which match against the graph and the influence of those features in associated conclusion.

The set of subsystems support many aspects of petrographic description task such as statistics and quantitative analysis.

Page 27: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Petrographer : System Architecture

It is a standard RDBMS. The database system stores all

descriptions managed by Petrographer . It manages complete domain

knowledge. The symbolic frame system is

connected to RDBMS module by a standard SQL interface such as ODBC.

Page 28: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Case Study: Petrographer

Similar to the example of Leah discussed in the introduction, Petrographer helps users to formulate the queries by accepting query expressions (ex: attribute=val instead of any SQL like query) from them.

Hence it hides the internal schematic descriptions from users. During the search, if user is not satisfied with the results returned by Petrographer,

he/she can modify the query expressions to get better results and these modified query expressions can be stored in the database for future reference.

Thus Petrographer learns new facts while searching is being performed and these facts can be stored in the database for future references.

Page 29: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Case Study: Petrographer

Advantages Allows users to write “query expression” instead of an SQL query Users need not know complete schema of the database The query expressions generated by the users can be stored in the database for later

use Disadvantage

large overheads in terms of processing speed in periods of high interaction between the various sub-systems, most noticeable during the interpretation process where inferencing is performed.

In multi-user access, the performance degrades.

Page 30: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Traditional Databases Vs IDBs:

Users need to know complete schema for querying

Users must write SQL queries Database system does not help in

searching Lacks semantic value

Stores only facts

Users need not know complete database schema

Users can simply use query expressions Provides help to make searching

effective Semantic information is stored via

Knowledge graphs and other data structures in the database itself

Stores facts and rules

Traditional DB Intelligent DB

Page 31: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

Conclusion:

Intelligent database is basically an “expert system” added as a layer on the top of “traditional database”. This integration leads to an efficient solution where expert systems are required to deal with larger databases.

Intelligent databases are more expressive than the traditional databases. Although IDBs seem to have an edge over traditional databases, we need some

mechanism to reduce their processing overheads.

Page 32: CS621 Seminar on: “Intelligent Database Systems” By: -Dhaval Bonde (MTech1-08305910) -Prasad Gawde (MTech2-07305908) Under the Guidance of Prof. Pushpak.

References:

D. Sleight, “Intelligent Databases: Easing Access to Information,” Michigan State University, Spring 1993.

Lee Jorgensen, “Artificial Intelligence: Intelligent Databases,” akri.org, Jan 2003. [online]. Available: http://www.akri.org/ai/intdata.htm.

Mara Abel et al., “PetroGrapher: a solution in intelligent database system for petrographic analysis,” [online]Available: http://www.inf.ufrgs.br/pos/SemanaAcademica/Semana98/marabel.html

Pamela A. Taylor and Dana L. Wyatt, “Database and artificial intelligence integration: a challenge to academia,”, ACM, SIGCSE Bull, 1992.

Cristina Paludo Santos,URI University “Querying Petrographic Descriptions in an Intelligent Database System” ICAIS,2002

Popping, R. (2003). Knowledge graphs and Network Text Analysis. Social Science Information 42(1): 91-106.