Top Banner
Knowledge Graphs: In Theory and Practice Nitish Aggarwal, IBM Watson, USA, Sumit Bhatia, IBM Research, India Saeedeh Shekarpour, Knoesis Research Centre Ohio, USA Amit Sheth, Knoesis Research Centre Ohio, USA
63

Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Jul 21, 2018

Download

Documents

dinhhanh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

KnowledgeGraphs:InTheoryandPractice

Nitish Aggarwal,IBMWatson,USA,Sumit Bhatia, IBMResearch,IndiaSaeedeh Shekarpour,Knoesis ResearchCentreOhio,USAAmitSheth,Knoesis ResearchCentreOhio,USA

Page 2: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

The material presented in this tutorial represents the personal opinion of the presenters and not of IBM and affiliated organization.

Page 3: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Outlineofthetutorial

Part 1: Knowledge Graph Construction• Introduction• DBpedia: Knowledge extraction• Approaches to extend knowledge graph• Knowledge extraction from scratch

Part 2: Knowledge Graph Analytics• Finding entities of interest• Entity exploration• Upcoming challenges

Page 4: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhatisKnowledgeGraph

“The KnowledgeGraph isa knowledgebase usedby Google toenhanceits searchengine'ssearchresultswith semantic-searchinformationgatheredfromawidevarietyofsources.”

Page 5: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhatisKnowledgeGraph

“The KnowledgeGraph isa knowledgebase usedby Google toenhanceits searchengine'ssearchresultswith semantic-searchinformationgatheredfromawidevarietyofsources.”

“AKnowledgegraph(i)mainlydescribesrealworldentitiesandinterrelations,organizedinagraph(ii)definespossibleclassesandrelationsofentitiesinaschema”(iii)allowspotentiallyinterrelatingarbitraryentitieswitheachother… [Paulheim H.]

“WedefinesaKnowledgeGraphasanRDFgraphconsistsofasetofRDFtripleswhereeachRDFtriple(s,p,o)isanorderedsetoffollowingRDFterm….”[Pujara J.alal.]

Page 6: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhatisKnowledgeGraph

Nosingleformaldefinition…

• Definesrealworldentities

• Providesrelationshipsbetweenthem

Page 7: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhatisKnowledgeGraph

Nosingleformaldefinition…

• Definesrealworldentities

• Providesrelationshipsbetweenthem

• Containsrulesdefinesthroughontologies

• Enablereasoningtoinfernewknowledge

Page 8: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhyKnowledgeGraph

Building an intelligent system that can interact with human, requires knowledge about real world entities.

Page 9: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

WhyKnowledgeGraph

Building an intelligent system that can interact with human, requires knowledge about real world entities.

• Enhance search results.

• Enhance ad sense.

• Help in language understanding.

• Enables knowledge discovery.

Page 10: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Isthereexistingknowledgegraphreadytouseformyapplication?

Page 11: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

GoogleKnowledgeGraphFacebook

EntityGraph

MicrosoftSatori

LinkedInKnowledgeGraph

AmazonProductGraph

Page 12: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,
Page 13: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Page 14: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

TheCityofNewYork,oftencalledNewYorkCity orsimplyNewYork,isthemostpopulouscityintheUnitedStates.

Page 15: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

TheCityofNewYork,oftencalledNewYorkCity orsimplyNewYork,isthemostpopulouscityintheUnitedStates.

<NewYorkCity>,<CityIn><UnitedStates>.

<CityName>,<locatedIn><CountryName>.

Page 16: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

TheCityofNewYork,oftencalledNewYorkCity orsimplyNewYork,isthemostpopulouscityintheUnitedStates.

Page 17: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Page 18: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

<headentity>,<rel>< tailentity>

Page 19: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

<headentity>,<rel>< tailentity>

WikipediaInfobox

Page 20: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,
Page 21: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Page 22: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Page 23: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Page 24: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

DBpedia:Knowledgeextraction

Parsers

Ontology(Classes,properties)

dbr:IBM dbp:foundedBydbr:Charles_Ranlett_Flint

dbr:IBM dbp:foundedBydbr:Charles_Ranlett_Flint

dbr:IBM dbp:foundedBydbr:Charles_Ranlett_Flint

……………

Page 25: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

(Research)problemsinknowledgegraphs

• Incomplete knowledge– Missing entities– Missing relations– Limited entity and relation types

Page 26: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

(Research)problemsinknowledgegraphs

• Incomplete knowledge– Missing entities– Missing relations– Limited entity and relation types

• Incorrect knowledge– Wrong entity label recognition– Wrong entity and relation type– Wrong facts

Page 27: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

(Research)problemsinknowledgegraphs

• Incomplete knowledge– Missing entities– Missing relations– Limited entity and relation types

• Incorrect knowledge– Wrong entity label recognition– Wrong entity and relation type– Wrong facts

• Inconsistency in knowledge– Different labels for same entity– Merging entities with same labels

Page 28: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• Extracting knowledge from Wikipedia tables– Large amount of raw data in form of tables– Tables have some implicit structure/patterns

Page 29: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• Extracting knowledge from Wikipedia tables– Large amount of raw data in form of tables– Tables have some implicit structure/patterns

Wiki:AFC_Ajax containingrelationsbetweenplayers,theirshirtnumber,andcountry

Page 30: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• <Wiki:AFC_Ajax,dbp:rel,Wiki:Andre_Onana>• 80%entitiesinthetablehaverelationdbp:rel withtheWikipediatitleentity

Wiki_AFC_Ajax• Other20%entitiesarelikelytohavethesamerelationshipdbp:rel withWiki_AFC_Ajax

[MunozE.atal.]UsingLinkedDatatoMineRDFfromWikipedia'sTables,WSDM2014

Page 31: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

[MunozE.atal.]UsingLinkedDatatoMineRDFfromWikipedia'sTables,WSDM2014

• Features– Articlefeatures:no.oftables,length– Tablefeatures:no.ofrows,no.ofcolumns– Columnfeatures:no.ofentitiesincolumn,potentialrelations– Cellfeatures:no.ofentitiesinacell,lengthofcell– Manyothers

• Combinesusingclassificationmethod

Prec. Rec. F1

Rule-based 64.23 70.46 67.20

SVM 72.43 75.77 74.06

Logistic 79.62 79.01 79.31

Page 32: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

[MunozE.atal.]UsingLinkedDatatoMineRDFfromWikipedia'sTables,WSDM2014

• Features– Articlefeatures:no.oftables,length– Tablefeatures:no.ofrows,no.ofcolumns– Columnfeatures:no.ofentitiesincolumn,potentialrelations– Cellfeatures:no.ofentitiesinacell,lengthofcell– Manyothers

• Combinesusingclassificationmethod

Prec. Rec. F1

Rule-based 64.23 70.46 67.20

SVM 72.43 75.77 74.06

Logistic 79.62 79.01 79.31

• Rules/heuristicsbasedmethodsmakesmistakes,andhardtocreateoneruleforeveryone.

• Eventhoughcombiningdifferentfeaturesachieves80%accuracy,itintroduces20%noise.

Page 33: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Tabledataislimited,weneedtogobeyond

Page 34: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• Missingentity/literalforarelation– “ChristopherA.WeltyisanAmericancomputerscientist,whoworksat

GoogleResearchinNY”• <dbr:Chris_Welty><employedBy><?>

– "TomCruiseandBradPittappearinInterviewwiththeVampire"• <dbr:Brad_Pitt><?><dbr:Tom_Cruise>

Page 35: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• Missingentity/literalforarelation– “ChristopherA.WeltyisanAmericancomputerscientist,whoworksat

GoogleResearchinNY”• <dbr:Chris_Welty><employedBy><?>

– "TomCruiseandBradPittappearinInterviewwiththeVampire"• <dbr:Brad_Pitt><?><dbr:Tom_Cruise>

• KnowledgeBaseCompletion– Similartolinkpredictioninsocialnetworkbutabitmorechallenging– Needtoidentifyrelationtypeinadditiontobinaryoutput.

Page 36: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Approachestoextendknowledgegraphs

• KnowledgeBaseCompletion– TransE:learntheentityandrelationembeddings byassumingthattranslation

ofentityembeddings correspondtotheirrelationembeddings.[Bordes etat.2013]

– S+R≈T,where<S,R,T>

– TransH:Learndifferententityembeddingfordifferentrelationships[Wangatel.2014]

– TransR:Learnentityandrelationembeddings indifferentspace,followingbytranslationperforminrelationspace.[LinY.atel.2015]

– Manymoremethods [NickelM.atal,2015]

Page 37: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Knowledgebasecompletionapproachesfocusonfindingmissingentities/relations

Page 38: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Needtoaddnewentitiesfromexternalsources

Page 39: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Needtoaddnewentitiesfromexternalsources

• Entityrecognitioninexternaltextresource• ManyNamedEntityRecognitionsystems

• LinkextractedentitytoKGorcreateanewnodeifitdoesnothaveacorrespondingentity

• TAC-KBP(EntityDiscoveryandLinkingtask)[JiH.atel.2016]

Page 40: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

BuildingknowledgegraphsuchasDBpedia requireslotofmanualefforts.

Page 41: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

BuildingknowledgegraphsuchasDBpedia requireslotofmanualefforts.

• Manyapplicationsrequiredomain/dataspecificcustomknowledgegraphs.

• CreatingschemawithclassstructureandconstraintsforeachKGisdifficult.

Page 42: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Howtocreateaknowledgegraphfromunstructuredtext?

Page 43: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

JonathonWatsonworksatIBM.Hehasmorethan50patents,andwonbestinventorawardforhisinvention“NeuralChipbyJonWatsonetal.

Page 44: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

JonathonWatsonworksatIBM.Hehasmorethan50patents,andwonbestinventorawardforhisinvention“NeuralChipbyJonWatsonetal.

Entityextraction

Relationextraction

Noisereduction KG

JonathonWatsonIBMJonWatson

employedBy(JonathonWatson,IBM)JonWatson

JonathonWatson,JonWatson

Page 45: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods

Predefined schema (employedBy, bornOn, BirthPlace …)

Training data

JonathonWatsonworksatIBM.

JonathonWatsonjoinedIBM.

employedBy

employedBy

Page 46: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods

Predefined schema (employedBy, bornOn, BirthPlace …)

Training data Test data

JonathonWatsonworksatIBM.

JonathonWatsonjoinedIBM.

employedBy

employedBy JonathonWatsonismanageratIBM.

?

Page 47: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods

Predefined schema (employedBy, bornOn, BirthPlace …)

Training data Test data

JonathonWatsonworksatIBM.

JonathonWatsonjoinedIBM.

employedBy

employedBy JonathonWatsonismanageratIBM.

employedBy

Page 48: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods

Pros: High accuracy and less noise

Cons: Hard and expensive to build labeled data

Page 49: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods

employedBy (JonWatson,IBM)

affiliated(MichaelDecker,,SMU)

JonWatsonworksatIBM.

JonWatsonbecomesVPatIBM.……….

MichaelDeckerjoinsDataSciencegroupatSMU.

MichaelDeckerwonanationalfundingawardat

SMU.……….

Page 50: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods

employedBy (JonWatson,IBM)

affiliated(MichaelDecker,,SMU)

JonWatsonworksatIBM.

JonWatsonbecomesVPatIBM.……….

MichaelDeckerjoinsDataSciencegroupatSMU.

MichaelDeckerwonanationalfundingawardat

SMU.……….

Trainingsentences

Page 51: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods

Pros: Overcome the effort of labeling data

Cons: Dependency of existing knowledge graph and corresponding . text

Page 52: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods• Unsupervised methods (OpenIE, Universal Schema)

JonathonWatsonworksatIBM.

JonathonWatsonjoinedIBM.

join

(ROOT(S(NP(JonWatson))(VP(VBZworks)(PP(INat)(NPIBM)))

(ROOT(S(NP(JonWatson))(VP(VBDjoined)(NPIBM))

work

Page 53: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods• Unsupervised methods (OpenIE, Universal Schema)

Pros: eliminates the effort of labeling data

Cons: Noisy, large number of relations

Page 54: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction

• Supervised methods• Distantly supervised methods• Unsupervised methods (OpenIE, Universal Schema)

Relation1 Relation2 Relation3

WorksemployerCompany

employedBy….

livesIncurrentCityCountry

….

VicePresidentexecutive

Boardmember

….

Page 55: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextraction(UniversalSchema)

• Clustering using vector similarity• Matrix completion and fill the empty values [YaoL.atel.,

2012]

employeBy affiliated Leaderof

Jon x x

Michael x

Steve x x

Joyce x x x

Page 56: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Entitytypesidentification(UniversalSchema)

• Clustering using vector similarity• Matrix completion and fill the empty values [YaoL.atel.,

2012]

director musician actor

Jon x x

Michael x x

Steve x

Joyce x x

Page 57: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Relationextractionindomain

• Supervised methods – Need domain experts to label the data• Distantly supervised methods – Hard to find corresponding

text• Unsupervised methods (OpenIE, Universal Schema) – Noisy

A 59-year-old African American man with a past medical history of hypertension, benign prostatichypertrophy, type II diabetes mellitus for the past 15 years, and chronic back pain presents to the hospitalwith gross hematuria. The patient states that he noticed blood in his urine last night. The patient also reportsmild, intermittent flank pain. The patient states that his diabetes and blood pressure are well controlled withmedications, and that he has managed his chronic back pain with 2 aspirin per day for the past 4 years. Vitalsigns are Temp- 98.6°F, BP- 124/82 mm/Hg, pulse- 88/min, and RR- 14/min. Blood work is notable for HbA1Cof 6.5%. A pyelogram reveals a ring sign. His current fasting glucose is 140mmol/L.<br /><br />What is themost likely etiology of hematuria in this patient?

Symptom

Page 58: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Knowledgegraphsindomain

• Domain specific entity extraction is more challenging

• Limited relation types

• Less explicit mention of entity and relation types in text

• Creating simple schema requires domain experts

Page 59: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Knowledgegraph- Simple

JonathonWatsonworksatIBM.

MichaelDeckerjoinedIBM.

MichaelDeckerattendsSMU

JonathonWatson

IBM

MichaelDecker

SMU

Page 60: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Knowledgegraph– Simple+Schema

JonathonWatsonworksatIBM.

MichaelDeckerjoinsIBM.

MichaelDeckerattendedSMU

JonathonWatson

IBM

MichaelDecker

SMU

affiliated

affiliatedaffiliated

Page 61: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Knowledgegraph– Simple+Schema+Ontology

JonathonWatsonworksatIBM.

MichaelDeckerjoinedIBM.

MichaelDeckerattendsSMU

JonathonWatson

IBM

MichaelDecker

SMU

affiliated

affiliatedaffiliated

Domain,range,constraint

Page 62: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

Summary

• Simple knowledge graph works for many applications

• Identify the requirement before finding the solution.

• Many knowledge graphs are publically available

Page 63: Knowledge Graphs: In Theory and Practice - Sumit …sumitbhatia.net/papers/KG_Tutorial_CIKM17_part1.pdf · Knowledge Graphs: In Theory and Practice NitishAggarwal, IBM Watson, USA,

https://www.youtube.com/watch?v=kao05ArIiok&feature=youtu.be