Top Banner
A Framework for Pay-as- you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger
22

A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Dec 19, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

A Framework for Pay-as-you-go Extraction Ontology Based

Information RetrievalAndrew Zitzelberger

Page 2: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Problem

• Keyword search doesn’t work well for high precision

• Domain ontologies take a long time to build

Page 3: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Pay-as-you-goKeyword Search

Basic Data Frames

Derived Attributes

Interconnected Ontologies

Domain Ontologies

Data Frame Hierarchies

Relationship Data Frames

Page 4: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

OSM-O Ontologies

Decidable!

Page 5: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

OSM-EO Ontologies

• OSM-O Ontologies with data frames for object and relationship sets.– Recognition– Linguistic grounding– Understanding

Page 6: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Keyword Search

• Honda 2003 or newer for under 15 grand with under 180K miles on it.

Page 7: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Keyword Search

• Honda -170 Results• Price max of 15 grand 15

– 15,000 works (kind of)

Page 8: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Number Data Frame• Number

– Internal representation:• Double

– External representation:• [1-9]\d*|[1-9]\d{2},\d{3}+|…

– Units• K=1000; [Gg]rand=1000; million=1000000; ...

– Methods:• Greater than:

– (greater than|over|above|more than|>|…)\s+{Number}• Less than:

– (less than|under|below|<|…)\s+{Number}• …

Page 9: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Number Method Extraction

• Honda 2003 or newer for under 15 grand with under 180K miles on it.– (Number < 15000), (Number < 180000), (Number >= 2003)– (2003 <= Number < 15000)

• No change in results. Why?– Dates, Times

• Miles keyword problem

Page 10: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Data Frame Hierarchies

Page 11: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Method Extraction

• Honda 2003 or newer for under 15 grand with under 180K miles on it.– (Year >= 2003), (Price < 15000), (Mileage < 180000)

• Significant result reduction.

Page 12: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Relationship Data Frames

• {CountryName-Make}– {CountryName}\s+(makes|manufactures|…)\s+{Make}

• {Make-CountryName}– {Make}\s+(is\s)?{made in|…)\s+{CountryName}

Page 13: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Domain Ontology

Page 14: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Derived Attributes

• if Make in {JapanMake} then Japan• if Make in {GermanMake} then German• if …• else …

Page 15: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Interconnected Ontology

Page 16: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Interesting Problems

• Resolving matches across disconnected ontologies

• Choosing the extent of an ontology for extraction

• Adding relationship data frames to extraction processing

• How to efficiently choose the context ontologies when the library becomes large

Page 17: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

User Interface

• Traditional text box for search• Radio options:

– Automatic• Run the system and give me what you get

– Feedback• Run the form feed back loop

– Exact• Let me pick/build the ontology/data frames I want

Page 18: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Form Feed Back

• System understanding displayed in a form

• User can modify form for a more structured query

• User can change ontology or append new data frames

Page 19: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Interesting Problems / Contributions

• Representing relationships and derived attributes in the form and ontology editor

• Quick intuitive way to add data frames from global library– Suggestions– Match tests

Page 20: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Architecture

• System starts with keyword search and small personal data frame library

• Can submit to or retrieve from larger global library

Page 21: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

The Goal

Page 22: A Framework for Pay-as-you-go Extraction Ontology Based Information Retrieval Andrew Zitzelberger.

Future Work

• Knowledge Bundles rather than simple IR– Extraction relative to ontology from multiple

sources

• Relationally complete forms