Scaling Textual Inference to the Web Stefan Schoenmackers, Oren Etzioni, and Daniel S. Weld Presented by Kristine Monteith CS 652 - 5/8/09 1

Dec 19, 2015


Scaling Textual Inference to the Web

Stefan Schoenmackers, Oren Etzioni, and Daniel S. Weld

Presented by Kristine Monteith
CS 652 - 5/8/09


The Problem

Lots of information on the web, but answers to questions aren’t always stated explicitly

Query: “What vegetables help prevent osteoporosis?”

Not going to find “Kale prevents osteoporosis”

Need to infer this from:
  kale is a vegetable
  kale contains calcium
  calcium helps prevent osteoporosis


Overview

HOLMES Architecture (performs textual inference)
Scaling Inference to the Web
Experimental Results
Related Work


The HOLMES Architecture

Information from Knowledge Bases
  e.g. IsHighIn(kale, calcium), Prevents(calcium, osteoporosis)

Inference Rules
  e.g. Prevents(X,Z) :- IsHighIn(X,Y) ^ Prevents(Y,Z)

Queries
  e.g. query(X) :- IS-A(X,vegetable) ^ Prevents(X,osteoporosis)
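
The three inputs above (facts, a rule, and a query) can be illustrated with a minimal Python sketch. It is purely logical and ignores the probabilistic side of HOLMES; the tuple encoding and the function names are assumptions made for illustration, and the IS-A(kale, vegetable) fact is taken from the problem statement earlier in the slides.

```python
# Facts as (relation, arg1, arg2) tuples, following the slide's examples.
facts = {
    ("IS-A", "kale", "vegetable"),
    ("IsHighIn", "kale", "calcium"),
    ("Prevents", "calcium", "osteoporosis"),
}

def apply_prevents_rule(facts):
    """Apply Prevents(X,Z) :- IsHighIn(X,Y) ^ Prevents(Y,Z) until nothing new appears."""
    derived = set(facts)
    changed = True
    while changed:
        changed = False
        # Snapshot both relations so the set can be extended safely below.
        high_in = [(x, y) for (r, x, y) in derived if r == "IsHighIn"]
        prevents = [(y, z) for (r, y, z) in derived if r == "Prevents"]
        for x, y in high_in:
            for y2, z in prevents:
                new_fact = ("Prevents", x, z)
                if y == y2 and new_fact not in derived:
                    derived.add(new_fact)
                    changed = True
    return derived

def query_vegetables_preventing(facts, disease):
    """query(X) :- IS-A(X,vegetable) ^ Prevents(X,disease)."""
    closed = apply_prevents_rule(facts)
    vegetables = {x for (r, x, y) in closed if r == "IS-A" and y == "vegetable"}
    preventers = {x for (r, x, z) in closed if r == "Prevents" and z == disease}
    return sorted(vegetables & preventers)

print(query_vegetables_preventing(facts, "osteoporosis"))  # ['kale']
```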


Partial proof tree (DAG) for the query “What vegetables help prevent osteoporosis?”


Incremental Expansion

Exact probabilistic inference is NP-complete

To deal with this, HOLMES:
  Uses approximate methods (loopy belief propagation)
  Relies on focused queries to keep probabilistic inference manageable
  Creates networks incrementally (searches for additional proof trees and updates the network if there is more time)
  Exploits standard Datalog optimization (e.g. only expands proofs of recently added nodes)
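
One common Datalog optimization with this flavor is semi-naive evaluation, where each round joins only the facts derived in the previous round. The sketch below applies it to a transitive Part-Of relation; whether HOLMES uses exactly this scheme is an assumption, and the function name and data are illustrative.

```python
def transitive_closure_seminaive(edges):
    """Compute the transitive closure, expanding only newly derived pairs each round."""
    closure = set(edges)
    delta = set(edges)  # pairs added in the previous round
    rounds = 0
    while delta:
        rounds += 1
        new_pairs = set()
        for (x, y) in delta:          # expand only recent pairs
            for (y2, z) in edges:     # join against the base relation
                if y == y2 and (x, z) not in closure:
                    new_pairs.add((x, z))
        closure |= new_pairs
        delta = new_pairs
    return closure, rounds

part_of = {("Seattle", "Washington"), ("Washington", "USA")}
closure, rounds = transitive_closure_seminaive(part_of)
print(sorted(closure))
```

Older pairs are never re-joined, so the work per round is proportional to what was just added rather than to everything derived so far.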


Markov Logic Inference Rules

1. Observed relations are likely to be true:
  R(X,Y) :- ObservedInCorpus(X, R, Y)

2. Synonym substitution preserves meaning:
  RTR(X’,Y) :- RTR(X,Y) ^ Synonym(X, X’)
  RTR(X,Y’) :- RTR(X,Y) ^ Synonym(Y, Y’)

3. Generalizations preserve meaning:
  RTR(X’,Y) :- RTR(X,Y) ^ IS-A(X, X’)
  RTR(X,Y’) :- RTR(X,Y) ^ IS-A(Y, Y’)

4. Transitivity of Part Meronyms:
  RTR(X,Y’) :- RTR(X,Y) ^ Part-Of(Y, Y’)
  where RTR matches ‘* in’ (e.g., ‘born in’).
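
Rule 4 is the only rule restricted by the relation's name, which a short sketch can make concrete. The regular expression, the triple encoding, and the Part-Of(Lima, Peru) fact are assumptions chosen for illustration; rule weights and probabilities are left out.

```python
import re

# Relation strings of the form '* in', e.g. 'born in'.
IN_PATTERN = re.compile(r".+ in$")

def apply_part_of_rule(assertions, part_of):
    """RTR(X,Y') :- RTR(X,Y) ^ Part-Of(Y,Y'), fired only when RTR matches '* in'."""
    derived = set()
    for (x, rel, y) in assertions:
        if not IN_PATTERN.match(rel):
            continue  # the rule does not apply to this relation
        for (part, whole) in part_of:
            if part == y:
                derived.add((x, rel, whole))
    return derived

assertions = {
    ("Alberto Fujimori", "born in", "Lima"),
    ("kale", "contains", "calcium"),
}
part_of = {("Lima", "Peru")}

print(apply_part_of_rule(assertions, part_of))
```

The 'contains' assertion is skipped because its relation does not end in ' in', so only the 'born in' assertion is lifted from the city to the country.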


Scaling Inference to the Web

To scale textual inference to the Web, inference time has to grow linearly with the size of the corpus

Assumptions:
  Number of ground assertions |A| grows linearly with size of corpus (true for assertions extracted by TextRunner)
  Size of every proof tree is bounded by some constant m (seems to be true in practice; could be enforced by terminating the search for proof trees at a certain depth)

Need to show that constructing proof trees takes O(|A|) time


Constructing proof trees in O(|A|) time

Using function-free Horn clauses means that logical inference can be done in polynomial time
  Still not good enough to scale to the Web

Need to ensure two more things:
  Number of different types of proofs doesn’t grow too quickly (e.g. a fixed number of rules results in a constant number of first-order search trees)
  Number of tuples participating in each relation doesn’t grow too quickly
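
The two conditions above can be combined into a rough bound. This is only a sketch of the shape of the argument, not the paper's proof: the symbols T (number of proof-tree types) and k (a bound on how many tuples each join step can match) are introduced here for illustration, while |A| and m are the quantities named on the slides.

```latex
% Sketch under stated assumptions: T and k are illustrative symbols.
% T = number of first-order proof-tree types (constant for a fixed rule set),
% k = assumed bound on tuples matched per join step,
% m = bound on proof-tree size, |A| = number of ground assertions.
\[
  \text{work} \;\le\; T \cdot |A| \cdot k^{m} \;=\; O(|A|)
  \qquad \text{when } T,\ k,\ m \text{ do not depend on } |A| .
\]
```

If any of the three factors grew with the corpus, for example a relation whose fan-out k increases with |A|, the bound would no longer be linear.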


Approximately Pseudo-Functional


Experimental Results

Uses two knowledge bases:
  TextRunner (183 million ground assertions from 117 million web pages)
  WordNet (159 thousand manually created IS-A, Part-Of, and Synonym assertions)

Twenty queries in three domains:
  Geography
  Business
  Nutrition


Geography Queries

“Who was born in one of the following countries?”
  Q(X) :- BornIn(X,{country})
  Possible countries: France, Germany, China, Thailand, Kenya, Morocco, Peru, Colombia, Guatemala

Example:
  Ground assertion: BornIn(Alberto Fujimori, Lima)
  Background knowledge: LocatedIn(Lima, Peru)
  New conclusion: BornIn(Alberto Fujimori, Peru)


Business Queries

Which companies are acquiring software companies?
  Q(X) :- Acquired(X, Y) ^ Develops(Y, ‘software’)
  This query tests HOLMES’s ability to scalably join a large number of assertions from multiple pages.

Which companies are headquartered in the USA?
  Q(X) :- HeadquarteredIn(X, ‘USA’) ^ IS-A(X, ‘company’)
  Join on HeadquarteredIn and IS-A
  Transitive inference:
    Seattle is PartOf Washington which is PartOf the USA
    Microsoft IS-A software company which IS-A company
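
The first business query is a two-way join on the shared variable Y. A minimal sketch of answering it with a hash join follows; the company names are made up for illustration, and this is not a description of how HOLMES implements its joins.

```python
from collections import defaultdict

def acquirers_of_software_companies(acquired, develops):
    """Q(X) :- Acquired(X, Y) ^ Develops(Y, 'software'), joined on Y with a hash index."""
    software_makers = {y for (y, product) in develops if product == "software"}
    # Index acquirers by the company they acquired, so each lookup is constant time.
    by_target = defaultdict(set)
    for (x, y) in acquired:
        by_target[y].add(x)
    answers = set()
    for y in software_makers:
        answers |= by_target.get(y, set())
    return sorted(answers)

# Hypothetical assertions, possibly extracted from different pages.
acquired = {("AlphaCo", "ByteSoft"), ("GammaInc", "DeltaFoods")}
develops = {("ByteSoft", "software"), ("DeltaFoods", "snacks")}

print(acquirers_of_software_companies(acquired, develops))  # ['AlphaCo']
```

Building the index takes one pass over Acquired and answering takes one pass over Develops, so the cost stays proportional to the number of assertions rather than to their product.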


Nutrition Queries

“What foods prevent disease?”
  Q(X, {disease}) :- Prevents(X, {disease}) ^ IS-A(X, {food})
  Possible foods: fruit, vegetable, grain
  Possible diseases: anemia, scurvy, or osteoporosis.
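
The query templates across the three domains can be expanded mechanically. The sketch below assumes that each placeholder is instantiated with every listed value and that the two business queries count once each; under that reading the total matches the twenty queries reported in the experimental setup. The string format of the generated queries is illustrative.

```python
from itertools import product

countries = ["France", "Germany", "China", "Thailand", "Kenya",
             "Morocco", "Peru", "Colombia", "Guatemala"]
foods = ["fruit", "vegetable", "grain"]
diseases = ["anemia", "scurvy", "osteoporosis"]

# One geography query per country.
geography = [f"Q(X) :- BornIn(X, '{c}')" for c in countries]

# The two business queries have no placeholders.
business = [
    "Q(X) :- Acquired(X, Y) ^ Develops(Y, 'software')",
    "Q(X) :- HeadquarteredIn(X, 'USA') ^ IS-A(X, 'company')",
]

# One nutrition query per (food, disease) pair.
nutrition = [
    f"Q(X, '{d}') :- Prevents(X, '{d}') ^ IS-A(X, '{f}')"
    for f, d in product(foods, diseases)
]

all_queries = geography + business + nutrition
print(len(geography), len(business), len(nutrition), len(all_queries))  # 9 2 9 20
```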


Effect of Inference on Recall

Baseline: Number of query answers derived from information explicitly stated in the Knowledge Bases (TextRunner and WordNet)

Inference increases the number of query answers by 102% for the Geography domain, and considerably more for the other two domains


Prevalence of APF Relations

Examined 500 binary relations selected randomly from TextRunner’s assertions
  Largest two relations had over 1.25 million unique instances
  52% of the relations had more than 10,000 instances

For each relation, found the smallest value Kmin such that the relation was APF with degree Kmin
  80% of relations were APF with degree less than 496
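
A degree measurement of this kind can be sketched in a few lines. The code assumes a simplified, informal reading of APF: a relation has degree K if most first arguments map to at most K distinct second arguments. The paper's formal definition may differ, and the function name, the `coverage` parameter, and the sample data are all illustrative.

```python
from collections import defaultdict
import math

def smallest_degree_bound(pairs, coverage=1.0):
    """Smallest K such that at least `coverage` of first arguments have <= K distinct second arguments."""
    seconds = defaultdict(set)
    for x, y in pairs:
        seconds[x].add(y)
    degrees = sorted(len(ys) for ys in seconds.values())
    if not degrees:
        return 0
    # Number of first arguments that must fall within the bound.
    needed = max(1, math.ceil(coverage * len(degrees)))
    return degrees[needed - 1]

# Hypothetical 'born in' extractions; person "d" has two conflicting values.
born_in = [("a", "Lima"), ("b", "Paris"), ("c", "Lima"), ("d", "Rome"), ("d", "Roma")]

print(smallest_degree_bound(born_in))        # 2
print(smallest_degree_bound(born_in, 0.75))  # 1
```

Allowing a small fraction of exceptions lowers the bound, which is the intuition behind treating noisy extracted relations as approximately, rather than strictly, bounded.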


Related Work

Van Durme and Schubert (2008)
  Use highly expressive representations (e.g. negation, temporal information)
  HOLMES is less expressive but more scalable

Open-domain Question-Answering Systems
  Attempt to find individual documents or sentences containing the answer
  HOLMES can infer from multiple texts, but is not well suited to answering more abstract or open-ended questions

Statistical Relational Learning
  Techniques for combining logical and probabilistic inference
  HOLMES uses more restrictive inference rules, but again is more scalable


Conclusions

1. We introduce and evaluate the HOLMES system, which leverages KBMC methods in order to scale a class of TI methods to the Web.

2. We define the notion of Approximately Pseudo-Functional (APF) relations and prove that, for APF relations, HOLMES’s inference time increases linearly with the size of the input corpus. We show empirically that APF relations appear to be prevalent in our Web corpus and that HOLMES’s runtime does scale linearly with the size of its input, taking only a few CPU minutes when run over 183 million distinct ground assertions.

3. We present experiments demonstrating that, for a set of queries in the domains of geography, business, and nutrition, HOLMES substantially improves the quality of answers (measured by AuC) relative to a “no inference” baseline.


Questions???