Top Banner
Reconciling facts: how to check the consistency of facts created from web crawling Dr Rob Stacey True Knowledge Ltd.
14

Dr Rob Stacey True Knowledge Ltd.. Open Domain question answering Semantic query language Structured and Unstructured knowledge acquisition >300.

Dec 15, 2015

Download

Documents

Felipe Sherman
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Reconciling facts: how to check the consistency of facts

created from web crawling

Dr Rob StaceyTrue Knowledge Ltd.

Page 2: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

True Knowledge Open Domain question answering Semantic query language Structured and Unstructured

knowledge acquisition >300 million facts 20k+ classes Billions of inferred facts

Page 3: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Answering Questions Who was prime minister of the UK

when Bernie Ecclestone was a teenager?

Page 4: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Answering Questions

Page 5: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Local time questions What is the time in Covent Garden

now?

Page 6: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Local time questions

Page 7: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Is Madonna married?

Page 8: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Answering a question Query processing

Triple representation[london] [is an instance of] [city]

Temporal knowledge represented by “facts about facts” [fact: [“123”]] [applies for timeperiod] [<1970 onwards>]

Richness within entity representation• “parametered” objects

[integer: [“8128”]] [group: [london]; [san francisco]]

* Actually 4 with negative relation

Page 9: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Mining the facts

Achieving 96% accuracy with the freetext of Wikipedia

Page 10: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Reconciliation- What system assessment can do

Accept incoming knowledge Contradict knowledge Make knowledge superfluous

Uses user assessments and scoring to determine which facts are believed

Page 11: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

How assessment works Run a negative version of the query

• [married] ~[applies to] [madonna] If the query is unknown the fact is

new to the knowledge base If the result is no then fact is either

superfluous or an endorsement If the result is yes the there is a

contradiction

Page 12: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Fact already proven The assertion may simply be an

existing fact – if so more weight is added to the truth of that fact

If the fact is different then it is superfluous to the system, though still valid is it removes the need for inference.

Page 13: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Contradiction One of two facts must be wrong

Assessment scoring decides which fact to believe

The loser is contradicted and not believed or used in query processing

Page 14: Dr Rob Stacey True Knowledge Ltd..  Open Domain question answering  Semantic query language  Structured and Unstructured knowledge acquisition  >300.

Thanks &QuestionsDr Rob Stacey - True Knowledge Ltd.