RESLVE: Leveraging User Interest to Improve Entity Disambiguation on Short Text
Elizabeth L. Murnane [email protected]
Bernhard Haslhofer [email protected]
Carl Lagoze [email protected]
Jul 09, 2015
A Personalized Approach to Entity Resolution
Agenda
Background • Task Definitions • Challenges & Examples • Attempted Solutions
Approach • Motivations • Modeling a Knowledge Context • Implementation: The RESLVE System
Evaluation • Experiments • Results • Future Work
Social Web
• 10 million pages per day
• 800 million visitors per month
• 7 billion images (twice as many as 4 years ago)
Task Definition
Named Entity Recognition (NER)
• Systematically identifying mentions of entities (e.g., people, places, concepts, ideas)
Named Entity Disambiguation (NED)
• Resolving the intended meaning of ambiguous entities from multiple candidate meanings
Ambiguous Entities
aaahh one more day until finn!!! #cantwait
office holiday party
Beetle
Footage: • Workplace? • TV Show? • US Version? • UK Version?
office holiday party
office, december 3
Episode 4
Challenges & Focus
• Short Length • Sparse Lexical Context • Noisy • Highly personal in nature
Limitations of Extant Research
Tweets severely degrade traditional techniques
• Stanford NER: F1 drops 90% → 46%
• DBPedia Spotlight & Wikipedia Miner: P@1 < 40%
Recent strategies
• Crowd-sourcing (Limitation: dependent on reliable human workers)
• Automated attempts (Limitations: focus on NER not NED; generalizability beyond Twitter?)
Challenges & Focus
• Short Length • Sparse Lexical Context • Noisy • Highly personal in nature
• A user's past content on the same platform is not a feasible background corpus
Task Definition
Our focus: disambiguating any entity detected in users' text-based utterances on the social Web
Exploring a Personalized Solution
• Individual-centric approach to NED
• Incorporates external, user-specific semantic data ("Personal Context")
• Model personal interests with respect to this information
• Determine the user's likely intended meaning of an ambiguous entity based on similarity between potential meanings and interests
RESLVE: Resolving Entity Sense by LeVeraging Edits
Underlying Assumptions
• User has core interests
• User is more likely to mention an entity about a topic relevant to personal interests than to mention a topic of non-interest
• User expresses these interests consistently in content she posts online in multiple communities
• Can use a semantic knowledge base to formally represent these topics of interest
➢ Bridge user identity between the social Web and a knowledge base, K
➢ Model interests using K's organizational scheme
➢ Rank entity senses according to relevance to interests
Qualitative Analysis: Stable Interests
A user's topics of contribution are similar across the Web: same topics, same categories
• On average, 52.4% of entities a user mentions on the social Web (e.g., "Java") have at least 1 candidate sense in the same parent category as a Wikipedia article the same user edited (e.g., "Programming language")
• Extending just 4 parents up the category hierarchy brings this to 100%
Ambiguous YouTube post: office, december 3
Same user's recent Wikipedia edit: <item userid="xxxx" user="xxxx" pageid="31841130" title="The Office (U.S. season 8)"/>
Theoretical Motivations
• Online Contribution: Users produce online content about a key set of personally-interesting topics because it is fulfilling and seen as having a better cost benefit (Harper et al., 2007; Lakhani & von Hippel, 2003; Lerner & Tirole, 2000; Ling et al., 2006; Maslow, 1970)
• Modeling Interests: It is effective to model these topic interests from lexical features of these text-based contributions (Chen et al., 2010; Cosley et al., 2007; Pennacchiotti & Popescu, 2011)
Modeling a Knowledge Context
• Knowledge base, K = (N, E)
• 2 node types: Categories, Topics
[Figure: example knowledge graph with category nodes c1–c4, topic nodes t1–t3, and text descriptions d1–d3]
The Knowledge Graph
• Category nodes: N_Category ⊂ N
  • Unique identifier
  • Semantic relationships with other nodes
• Topic nodes: N_Topic ⊂ N
  • Unique identifier
  • Belongs to one or more categories
  • Associated with a text-based description
User Interest Model
• Editing a description signals interest in the associated topic
• Topic nodes: all topics whose description the user edited
• Category nodes: categories reachable in the knowledge graph from those topics
• Edge weight = inverse of shortest path length
[Table: example topic–category weight matrix over topics t1–t3 and categories c1–c4, with weight 1 for directly linked categories, fractional weights for more distant ones, and 0 for unreachable ones]
• Same representation for candidates
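The edge-weighting rule above (weight = inverse of shortest path length, with 0 for unreachable categories) can be sketched as a small breadth-first search. The adjacency encoding below is an illustrative assumption:

```python
from collections import deque

def edge_weight(graph, topic, category):
    """Weight between a topic and a category: the inverse of the shortest
    path length in the knowledge graph (1 hop -> 1.0, 2 hops -> 0.5, ...),
    and 0.0 if the category is unreachable.
    `graph` maps each node to the set of its parent categories."""
    dist = {topic: 0}
    queue = deque([topic])
    while queue:
        node = queue.popleft()
        for parent in graph.get(node, ()):
            if parent not in dist:
                dist[parent] = dist[node] + 1
                if parent == category:
                    return 1.0 / dist[parent]
                queue.append(parent)
    return 0.0  # unreachable category -> weight 0

# Tiny graph: topic t1 links to category c1, which links up to c2
graph = {"t1": {"c1"}, "c1": {"c2"}}
```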
Instantiating the Model
Candidate knowledge bases: Wikipedia, DBPedia, Freebase
Instantiating on Wikipedia
• Articles and categories effectively represent topics (Syed, 2008)
• Good coverage of even rare entity concepts (Zesch, 2007)
• Compatible with NER toolkits: DBPedia Spotlight, Wikipedia Miner
• Article editing behavior is effective for modeling interests (Cosley, 2007; Lieberman & Lin, 2009; Wattenberg et al., 2007)
Article editing signals topic interest
Editing behaviors indicative of user interest:
• Number of times user edits article: repeatedly editing an article implies greater commitment and interest
• Article's overall edit activity and total number of editors: generally popular and actively edited articles are less discriminative of individual interest and personal relevance
• Time period user edits article: long-term interests are stronger than fleeting, short-term interests
• Type of edit according to revision tag: trivial edits such as vandalism reversion or typo correction are less indicative of interest than thoughtful, effortful edits
• Complexity, completeness, informativeness of edit according to metrics of Information Quality: the type, substantiveness, and overall quality of care a user gives to an edit indicate concern and interest in the topic
Less Meaningful Edits
Ignore irrelevant edits; clean article text:
• Articles with fewer than 100 non-stopwords
• Stem, tokenize, lowercase; remove stopwords, punctuation, non-printable characters
• Trivial edits, i.e., typo correction, vandalism reversion
• Parse Wiki markup to remove article maintenance information
• List pages merely containing widely diverse sets of topics, not all necessarily indicative of what is personally relevant to the user
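A minimal sketch of the cleaning step (lowercase, tokenize, strip punctuation and non-printable characters, remove stopwords). The tiny stopword list is illustrative, and stemming is omitted for brevity; a real pipeline might use NLTK's PorterStemmer:

```python
import re
import string

STOPWORDS = {"the", "a", "an", "of", "on", "and", "to", "in", "is"}  # tiny illustrative list

def clean_text(text):
    """Roughly follow the cleaning pipeline on the slide:
    drop non-printable characters, lowercase, tokenize, remove stopwords."""
    text = "".join(ch for ch in text if ch in string.printable)
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]
```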
Implementation: The RESLVE System
RESLVE (Resolving Entity Sense by LeVeraging Edits) addresses NED by:
I. Connecting social Web + Wikipedia editor identity
II. Modeling topics of interest using article edits
III. Ranking entity candidates by personal relevance
[Figure: RESLVE system architecture. A pre-processor takes user utterances (unstructured short texts); DBPedia Spotlight and Wikipedia Miner detect entities and their candidate meanings ("m"); the username bridges user identity (I) to the user's contributed structured documents, from which a user interest model is built (II); candidates are ranked by personal relevance (III) to output the top-ranked personally-relevant candidates.]
Phase 1: Bridging Web Identities
• Connect the identity of a social media user with a Wikipedia editor
• Simple string matching (Iofciu, 2011; Perito, 2011)
Phase 2: Representing Users and Entities
• Models the user's topics of interest using the bridged Wiki account's editing history
• Compares the similarity of those topics to the topic associated with each candidate sense
• Content-based & knowledge-graph-based similarity
• Weighted vectors used to represent user and candidate sense
Content-based similarity
• Bag-of-Words: titles of articles the user edited; the candidate's article title; words from those articles' pages & category titles
• TF-IDF weighted
• User u: V_content,u ; candidate meaning m: V_content,m
• sim_content(u, m) = cossim(V_content,u, V_content,m)
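A sketch of the content-based similarity under these definitions, using TF-IDF weighted bag-of-words vectors and cosine similarity. The smoothed IDF variant and the toy documents are illustrative assumptions:

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Build TF-IDF weighted bag-of-words vectors (dicts of term -> weight)
    for a list of token lists. Uses smoothed IDF (1 + log(n/df)) so terms
    appearing in every document keep a nonzero weight."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vecs = []
    for doc in docs:
        tf = Counter(doc)
        vecs.append({t: tf[t] * (1.0 + math.log(n / df[t])) for t in tf})
    return vecs

def cossim(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(u[t] * v.get(t, 0.0) for t in u)
    norm = math.sqrt(sum(x * x for x in u.values())) * math.sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0

# V_content,u from the user's edited articles; V_content,m from the candidate
# article (toy token lists for illustration):
user_doc = ["java", "programming", "language", "compiler"]
cand_doc = ["java", "programming", "platform"]
v_u, v_m = tfidf_vectors([user_doc, cand_doc])
sim = cossim(v_u, v_m)
```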
Knowledge-context-based similarity
• Vectors of articles' category IDs
• Weight is based on the distance between the article (topic) and the category in the knowledge graph, so nearer categories weigh more (e.g., "American Television Series" > "Broadcasting")
• User u: V_category,u ; candidate meaning m: V_category,m
• sim_category(u, m) = cossim(V_category,u, V_category,m)
Phase 3: Ranking by Personal Relevance
Output the highest-scoring candidate as the intended meaning by measuring:
sim(u, m) = α · sim_content(u, m) + (1 − α) · sim_category(u, m)
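The combined score and ranking step can be sketched as follows. The value of α and the candidate data are illustrative assumptions (the slide does not fix α):

```python
def reslve_score(sim_content, sim_category, alpha=0.5):
    """Combined personal-relevance score from the slide:
    sim(u, m) = alpha * sim_content(u, m) + (1 - alpha) * sim_category(u, m).
    alpha=0.5 is an illustrative default, not a value fixed by the slide."""
    return alpha * sim_content + (1 - alpha) * sim_category

def rank_candidates(candidates, alpha=0.5):
    """candidates: list of (meaning, sim_content, sim_category) tuples.
    Returns meanings sorted by combined score, best first; the top-ranked
    candidate is output as the user's intended meaning."""
    scored = [(reslve_score(sc, sk, alpha), m) for m, sc, sk in candidates]
    return [m for _, m in sorted(scored, reverse=True)]

# Hypothetical candidate senses for the ambiguous mention "office":
cands = [("The Office (U.S. TV series)", 0.8, 0.9),
         ("Office (workplace)", 0.3, 0.2)]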
Pre-processing & preparation modules
Experiment Data Sample
• Twitter: tweets
• YouTube: video titles, descriptions
• Flickr: photo tags, titles, descriptions
• String-matched usernames of posters to Wikipedia accounts
• Mechanical Turk used to confirm accounts were the same person
For confirmed matches:
• Collected the 100 most recent utterances
• ID, title, page content, categories of edited articles
Experiment: Labeling the Correct Entity Meaning
• 1545 valid ambiguous entities
• Mechanical Turk Categorization Masters
• Average observed agreement across all coders and items = 0.866
• Average Fleiss' Kappa = 0.803
• 918 unanimously labeled ambiguous entities
Dataset Characteristics
Text Length: the longest utterances are still shorter than even the shortest texts from NER task corpora like Reuters-21578 and the Brown Corpus
[Figure: text length distributions for Twitter, YouTube, and Flickr vs. the Reuters and Brown corpora]
High Ambiguity
• NER services have low confidence
• Many potential candidates (2 to 163, avg. 5–6, median 4)
[Figure: NER confidence score distributions for Wikipedia Miner and DBPedia Spotlight]
High Ambiguity
• 91% of utterances contain at least 1 ambiguous entity
• 2/3 of entities detected are ambiguous
• Almost no entities lack at least 2 senses to disambiguate
Performance Metric
• Precision at rank 1 (P@1)
Methods of comparison
• Human annotated gold standard
• RC: Randomly sorted candidates
• PF: Prior frequency
• RU: RESLVE given a random Wikipedia user's interest model
• DS: DBPedia Spotlight
• WM: Wikipedia Miner
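P@1 simply counts how often the top-ranked candidate matches the gold-standard sense; a minimal sketch with hypothetical data:

```python
def precision_at_1(rankings, gold):
    """P@1: fraction of ambiguous entities whose top-ranked candidate
    matches the human-annotated gold-standard sense.
    rankings: entity id -> ranked list of candidate senses (best first);
    gold: entity id -> correct sense."""
    hits = sum(1 for e, ranked in rankings.items() if ranked and ranked[0] == gold[e])
    return hits / len(rankings)

# Hypothetical example: the system gets e1 right and e2 wrong
rankings = {"e1": ["The Office (U.S.)", "Office (workplace)"],
            "e2": ["Beetle (insect)", "Volkswagen Beetle"]}
gold = {"e1": "The Office (U.S.)", "e2": "Volkswagen Beetle"}
```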
Results (P@1)
         Flickr   Twitter   YouTube
RESLVE   0.63     0.76      0.84
RC       0.21     0.32      0.31
PF       0.74     0.69      0.66
RU       0.51     0.71      0.78
WM       0.78     0.58      0.80
DS       0.53     0.67      0.63
Discussion
• Best performance on YouTube texts (the longest) due to content-based sim
• Outperforms on more personal text (e.g., tweets); a random user model is less effective
• Less effective on impersonal text (e.g., photo geo-tags): high prior frequency, so standard methods suffice; users are unlikely to make Wiki edits about personally-unfamiliar topics; the stable-interests assumption breaks down here
Error Cases
• Automated messages: "I uploaded a video on @youtube" → 1945 European Films
• Entities not in the knowledge base: "Peter on the dock"
• Less prolific contributors
Future Work
• Computability: Wikipedia has 5M articles, 700K categories → vector pruning
• User identity & modeling interests
Bridging User Accounts
          # Usernames   Exist on Wikipedia   Matches are same person
Twitter   479           46.1%                47%
YouTube   454           19.6%                48%
Flickr    226           21.7%                71%
Bridging User Accounts
Failure cases:
a. True negatives (no identity in knowledge base)
b. False negatives (same person, different usernames)
c. False positives (string match, but different people)
Possible remedies:
• Collaborative filtering techniques to approximate a user's own interests with the contributions of social connections
• Consider more profile attributes than username
• Use other knowledge bases besides Wikipedia
• Model user interest from additional kinds of participation (e.g., page visits, bookmarking, favoriting)
• Interest drift & time-frame of postings
Summary & Conclusion
• Social Web texts: short & highly personal
• Users post about the same topics across communities (but not always)
• RESLVE models user interest as a personal context with respect to a knowledge base's categorical organization scheme
• A ranking technique compares an entity's potential meanings to the user's interests to determine the intended meaning; language and context independent
• Promising performance gains
• Going forward: such a strategy becomes increasingly necessary, feasible, and effective
Thank You!
Acknowledgements
• Claire Cardie, Dan Cosley, Lillian Lee, Sean Allen, Wenceslaus Lee
• National Science Foundation Graduate Research Fellowship under Grant No. DGE 1144153
• Marie Curie International Outgoing Fellowship within the 7th European Community Framework Programme (PIOF-GA-2009-252206)
• Questions?
Elizabeth L. Murnane [email protected]
Bernhard Haslhofer [email protected]
Carl Lagoze [email protected]