Top Banner
Natural Intelligence - Commonsense Question Answering with Conceptual Graphs Fatih Mehmet Güler and Aysenur Birturk Department of Computer Engineering, METU 06531, Ankara/TURKEY [email protected] [email protected]
25

Natural Intelligence ICCS 2010

Jul 07, 2015

Download

Technology

fmguler

Fatih Mehmet Güler - Natural Intelligence ICCS 2010 paper presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Natural Intelligence ICCS 2010

Natural Intelligence -

Commonsense Question Answering

with Conceptual Graphs

Fatih Mehmet Güler and Aysenur BirturkDepartment of Computer Engineering, METU

06531, Ankara/TURKEY

[email protected]

[email protected]

Page 2: Natural Intelligence ICCS 2010

Motivation

Massive Knowledge found as Natural Language

Text based Question Answering (no tagging)

Open Domain Question Answering

Address Commonsense Reasoning Problem

Linguistically motivated KRR

Intelligence is the accumulation of knowledge

Integrate State of the Art Tools

Ultimate goal: Getting closer to strong AI

Page 3: Natural Intelligence ICCS 2010

Summary of the System

Natural Language is parsed

Utterances are represented using CGs

Concepts and Relation types are mapped to

Cyc equivalent counterparts

Type hierarchies are computed

Knowledge is accumulated

If the input is a question

Search for answer (projection)

Page 4: Natural Intelligence ICCS 2010

Summary of the System (Cont’d)

NI

NLP

CCG

C&C Tools

KRR

CGs

Cogitant

Commonsense

Open Cyc

Page 5: Natural Intelligence ICCS 2010

Background

Combinatory Categorial Grammar (CCG)

C&C Tools

Conceptual Graphs

Cogitant

Open Cyc

Page 6: Natural Intelligence ICCS 2010

Combinatory Categorial Grammar (CCG)

Lexicalized Theory of Grammar based on

Categorial Grammar ( Steedman 2001).

Functions can be applied or composed

Arguments can be picked up or turned into

functors (Type raising)

Easy for Semantic Representations

Small number of semantically transparent

combinatory rules to combine CCG categories.

Assign semantic representations to the lexical entries

Interpret combinatory rules

Page 7: Natural Intelligence ICCS 2010

CCG parse for “Mr. Hyde ate two

lemmons”

Page 8: Natural Intelligence ICCS 2010

CCG Parse for “Susan knows that Bob

likes Fred”

Page 9: Natural Intelligence ICCS 2010

DRS for “Susan knows that Bob likes

Fred”

Page 10: Natural Intelligence ICCS 2010

C&C Tools

Linguistically Motivated Large-Scale NLP with C&C

and Boxer. (Curran, Clark, Bos, 2007)

C&C Parser

POS Tagging, Supertagging

Parsing, Chunking

Named Entity Recognition

Boxer

Uses CCG parser output

Generates DRS Semantic Representations

Freely available for research

http://svn.ask.it.usyd.edu.au/trac/candc/wiki

Page 11: Natural Intelligence ICCS 2010

C&C Tools

Large Scale NLP is possible with C&C and

Boxer

C&C Parser: state of the art parser for CCG

Boxer: Semantic representations in DRS

Page 12: Natural Intelligence ICCS 2010

Open Cyc

Open source version of Cyc system

Cyc: greatest effort to encode Common Sense

knowledge in machine processable way

500.000 concepts 26.000 relations and 5.000.000

assertions

CycL language similar to Lisp

We use Cyc to map parsed words to common sense

counterparts such as person to #$Person

(disambiguation)

Page 13: Natural Intelligence ICCS 2010

Open Cyc (cont’d)

(#$likesAsFriend #$GeorgeWBush #$AlGore)

#$isa, #$genls

(#$isa #$GeorgeWBush

#$UnitedStatesPresident)

(#$genls #$UnitedStatesPresident #$Person)

Page 14: Natural Intelligence ICCS 2010

Cogitant

Library for Conceptual Graph operations

Supports broad CG operations (Genest &

Salvat, 1998)

Graph representation

Conversion from CGIF

Projection checking

Rule application

Page 15: Natural Intelligence ICCS 2010

Natural Intelligence – Commonsense

Question Answering with CGs

Augment Common Sense knowledge

Modular Approach

Separation of Concerns

State of the art tools

Page 16: Natural Intelligence ICCS 2010

Architecture - Modules

Natural Language Processing (C&C Tools are used for implementation) Convert natural language to CGIF

Reasoning (Cogitant library is used for implementation) CG operations

Common Sense (Open Cyc is used for implementation) Common sense mapping

Storage (Conceptual Graphs are stored in a database) Persistence of CGs

Page 17: Natural Intelligence ICCS 2010

System Definition

User enters a sentence from web interface;

This sentence is converted to CGIF using the NLP module;

CGIF is converted to CGs using the reasoning module;

Support is generated to CGs using the common sense module;

Common sense rules gathered from common sense module are applied to CGs using reasoning module;

CGs are merged to the previous ones using reasoning module;

If the input sentence is a question sentence, same operations take place, except the resulting graph is used to query existing CGs using the reasoning service, and if there are projections from this query graph to previous CGs, results are displayed to the user;

CGs are persisted using the storage module.

Page 18: Natural Intelligence ICCS 2010

Common Sense Mapping

Cyc: (prettyString TERM STRING)

Chain up to #$Thing using #$genls relations

Same for relations using #$genlPreds

Relation hierarchies are converted to forward

rules

#$performedBy -> #$temporallyRelated

Page 19: Natural Intelligence ICCS 2010

Sample Concept Hierarchy

#$Place ->

#$EnduringThing-Localized ->

#$Location-Underspecified ->

#$Thing ->

#$SomethingExisting ->

#$Individual ->

#$Thing ^^

#$Trajector-Underspecified ->

#$Location-Underspecified ^^

#$TemporallyExistingThing ->

#$TemporalThing ->

#$Individual ^^

#$SpatialThing-Localized ->

#$TemporallyExistingThing ^^

#$SpatialThing ->

#$Individual ^^

#$Boundary-Underspecified ->

#$Region-Underspecified ->

#$Location-Underspecified ^^

#$Landmark-Underspecified ->

#$Individual ^^

#$Location-Underspecified ^^

#$SpatialThing-NonSituational ->

#$SpatialThing ^^

#$Individual ^^

#$Location-Underspecified ^^

Page 20: Natural Intelligence ICCS 2010

Conversion to Cogitant Support

Convert Cyc hierarcy to Cogitant support

format

Concept Types

Relation Types

Individuals

Rules

Convert assertions to Cogitant graph format

Apply forward rules

Page 21: Natural Intelligence ICCS 2010

Answering Queries

Page 22: Natural Intelligence ICCS 2010

Significance

Sentences like;

What are the intangible things in this situation?

Was Mr. Hyde there while eating the apples?

Does Mr. Hyde exist after eating the apples?

Do the apples exist after Mr. Hyde ate them?

Deep Natural Language Understanding

State of the art tools

Open domain question answering

Page 23: Natural Intelligence ICCS 2010

Difficulties

Open Cyc API is broken

Does not work in Turkish locale (fixes are sent to

maintainers)

Still, provided API sends one IP packet per character, way

too slow over network

Custom socket API is developed and used over TCP

Custom Lisp functions for generalization hierarchy and

concept mapping

Cogitant problematic

Java API is very limited (compared to C++)

Only works over XML files

Page 24: Natural Intelligence ICCS 2010

Conclusion

Central Integrated Common Sense QAS

CCG for Natural Language Processing

Conceptual Graphs for KRR

Cyc for Common Sense

Page 25: Natural Intelligence ICCS 2010

Future Work

Implement Rule Induction

Backward Chaining (Resolution)

Improve NLP module and Common Sense mapping

Probabilistic Reasoning

Question Answering System (QAS) to be used in;

Education (Learning Management Systems)

Semantic Search (Content Management Systems)

Intelligent Help