Collaborative Personalized Twitter Search with Topic-Language Models

Jan Vosecky

Kenneth Wai-Ting Leung

Wilfred Ng

Supported by SIGIR Travel Grant

Microblogs

Tweet 1

Tweet 2

User-generated content

– Short length

– Informal language, free-form

– Diverse topics

Very high volume

Information overload

Searching on Twitter

“When you've got 5 minutes to fill, Twitter is a great way to fill 35 minutes”

@mattcutts

Searching for “ipad” on Twitter

Around 50 tweets mentioning “iPad” posted within 1 minute

Personalizing

Twitter Search

Microblog data

• Compared with traditional domains (e.g. web search, news search):

– Explicitly stated user interests

• tweets, conversations, re-tweets

– Social network structure

• following

• Individual user’s data

– Diverse

– Sparse

• User’s social connections

Personalization challenge

Putting all kinds of information into a single user model: inaccurate, noisy

– Diverse

– Sparse

– Diverse friends, topics

– Need to carefully organize friends’ information for it to be useful

Short messages

Few messages

Few social connections

Little search history
Topics

Contributions

Novel User Model structure

Collaborative User

Language modeling IR

Query likelihood model

– Given a query Q and a document D,
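The slide's formula did not survive extraction. As a rough illustration of the standard query likelihood idea (score D by the probability that its language model generates Q), here is a minimal sketch using Jelinek-Mercer smoothing, the variant named among the baselines later in the deck. The function name, the toy documents, and the value of `lam` are assumptions for illustration, not the authors' implementation.

```python
import math
from collections import Counter

def query_log_likelihood(query, doc, collection, lam=0.5):
    """log P(Q|D) under a unigram model with Jelinek-Mercer smoothing.

    query, doc, collection are lists of tokens; lam is the (assumed)
    weight given to the collection language model.
    """
    doc_tf = Counter(doc)
    coll_tf = Counter(collection)
    score = 0.0
    for w in query:
        p_doc = doc_tf[w] / len(doc) if doc else 0.0
        p_coll = coll_tf[w] / len(collection)
        p = (1 - lam) * p_doc + lam * p_coll
        if p == 0.0:
            # Word unseen in the whole collection: the query cannot be generated.
            return float("-inf")
        score += math.log(p)
    return score

# Toy data (hypothetical tweets, already tokenized).
docs = [["new", "ipad", "review"], ["football", "game", "tonight"]]
collection = [w for d in docs for w in d]
scores = [query_log_likelihood(["ipad"], d, collection) for d in docs]
```

With this toy data the first tweet scores higher for the query "ipad", since the second tweet only receives the smoothed collection probability.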

Topic Models

A latent topic in LDA:

“Information Technology”

Google 0.00040

Android 0.00020

Microsoft 0.00010

App 0.00010

Security 0.00009

Email 0.00008

Login 0.00005

Virus 0.00004
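A latent topic is a probability distribution over words, as in the list above. A minimal sketch of how such a topic could feed a document language model is to mix the topic-word probabilities by the document's topic proportions. The topic name "Other", the mixture values, and the function name are assumptions for illustration only.

```python
# Word probabilities copied from the "Information Technology" topic above.
it_topic = {
    "google": 0.00040, "android": 0.00020, "microsoft": 0.00010,
    "app": 0.00010, "security": 0.00009, "email": 0.00008,
    "login": 0.00005, "virus": 0.00004,
}

def word_prob(word, doc_topic_mix, topic_word):
    """P(w|D) = sum over topics k of P(k|D) * P(w|k)."""
    return sum(
        p_k * topic_word[k].get(word, 0.0)
        for k, p_k in doc_topic_mix.items()
    )

# Hypothetical document that is 80% about IT and 20% about something else.
topic_word = {"IT": it_topic, "Other": {}}
doc_topic_mix = {"IT": 0.8, "Other": 0.2}
p_android = word_prob("android", doc_topic_mix, topic_word)
```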

Scope of our approach

• Input to our algorithm:

– Set of n documents returned by Twitter for a given query Q

• Our task:

– Rank the documents according to:

• Query

• User model
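The task above is a re-ranking step over tweets that Twitter has already returned. A minimal sketch of that setup follows, assuming a simple linear mix of a query score and a user-model score. The mixing rule, `alpha`, and both scoring functions are placeholders chosen for illustration; the deck's own scoring is probabilistic and is described on later slides.

```python
def rerank(docs, query_score, user_score, alpha=0.5):
    """Sort docs by a weighted mix of query relevance and user interest."""
    def combined(doc):
        return (1 - alpha) * query_score(doc) + alpha * user_score(doc)
    return sorted(docs, key=combined, reverse=True)

# Toy input: tweets returned for the query "australia".
tweets = [
    "Top 10 restaurants in Australia",
    "iPhones, iPads, and Macs Hacked and Hijacked for Ransom in Australia",
]
it_words = {"iphones", "ipads", "macs", "hacked"}  # hypothetical user interests

def query_score(tweet):
    # Placeholder: 1.0 if the query term occurs in the tweet.
    return 1.0 if "australia" in tweet.lower() else 0.0

def user_score(tweet):
    # Placeholder: fraction of the tweet's words the user is interested in.
    words = tweet.lower().replace(",", "").split()
    return sum(w in it_words for w in words) / len(words)

ranked = rerank(tweets, query_score, user_score)
```

Both tweets match the query equally, so the user score decides the order and the IT-related tweet moves to the top.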

Proposed Framework

At a Glance: Proposed User Model

Individual User Model

Topic weights (W) and per-topic word counts:

Sport: W = 2/5 = 40% (Manchester: 5, Play: 4, Win: 2)

IT: W = 2/5 = 40% (Android: 6, Coding: 2, Java: 2)

Food: W = 1/5 = 20% (Cake: 6, Apple: 5, Oven: 2)

ID  Tweet                                    Time    Topic
1   Manchester playing tonight               1. 1.   Sport
2   Doing some android coding                2. 1.   IT
3   Great game, great win for manchester!    5. 1.   Sport
4   Had a great apple cake with chocolate    6. 1.   Food
5   My java code keeps throwing exceptions   10. 1.  IT
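The topic weights in this example are the share of the user's tweets assigned to each topic. A minimal sketch that recomputes them from the five example tweets is shown below. It assumes each tweet already carries a single topic label, and the function and variable names are illustrative.

```python
from collections import Counter, defaultdict

# The five example tweets with their topic labels.
tweets = [
    ("Manchester playing tonight", "Sport"),
    ("Doing some android coding", "IT"),
    ("Great game, great win for manchester!", "Sport"),
    ("Had a great apple cake with chocolate", "Food"),
    ("My java code keeps throwing exceptions", "IT"),
]

def build_individual_model(labeled_tweets):
    """Return (topic weights, per-topic word counts) for one user."""
    topic_counts = Counter(topic for _, topic in labeled_tweets)
    total = len(labeled_tweets)
    topic_weight = {k: c / total for k, c in topic_counts.items()}

    topic_words = defaultdict(Counter)
    for text, topic in labeled_tweets:
        for token in text.lower().split():
            word = token.strip(",.!?")  # drop surrounding punctuation
            if word:
                topic_words[topic][word] += 1
    return topic_weight, topic_words

topic_weight, topic_words = build_individual_model(tweets)
```

The weights come out as 0.4 for Sport, 0.4 for IT and 0.2 for Food. The word counts computed from these five tweets are smaller than the counts shown in the example, which appear to be illustrative values for a longer tweet history.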

Individual User Model (IM)

Is u interested in word w from topic k?

Is u interested in topic k?

Is word w related to topic k?

Prior prob. of topic k

Recent interest is more important:

From user, from topic model
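The formula for weighting recent interest is not preserved in this transcript. One common way to express the idea is an exponential decay on tweet age, sketched below. The half-life, the function names, and the reading of the example's Time column as day and month are all assumptions, not the authors' stated method.

```python
def decay(age_days, half_life_days=7.0):
    """Weight of a tweet that is age_days old (1.0 for a brand-new tweet)."""
    return 0.5 ** (age_days / half_life_days)

def recency_weighted_interest(ages_by_topic, half_life_days=7.0):
    """Normalize the summed decay weights of each topic's tweets."""
    raw = {
        topic: sum(decay(age, half_life_days) for age in ages)
        for topic, ages in ages_by_topic.items()
    }
    total = sum(raw.values())
    if total == 0:
        return {topic: 0.0 for topic in raw}
    return {topic: value / total for topic, value in raw.items()}

# Ages in days of the five example tweets, measured from the latest one.
ages = {"Sport": [9, 5], "IT": [8, 0], "Food": [4]}
interest = recency_weighted_interest(ages)
```

Without decay, Sport and IT would tie at 40% each. With decay, IT ranks first because the user's most recent tweet is about IT.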

Personalization using IM

Is the Query relevant to topic k?

Is Q related to topic k in general?

Is the User interested in topic k?

Is Q related to the words in topic k that User is interested in?

Is the Document relevant to topic k?

Is D related to topic k in general?

Is the User interested in topic k?

Is D related to the words in topic k that User is interested in?

Prior Document probability

Q = australia

I’m interested in IT and travel

I’ve never tweeted about Australia

Travel

Politics

Business

Top 10 restaurants in Australia

iPhones, iPads, and Macs Hacked and Hijacked for Ransom in Australia - Gotta Be Mobile

Tweet (D):

Q = australia

I’m interested in IT and travel

I have tweeted about IT in Australia

Travel

Politics

Business

Top 10 restaurants in Australia

iPhones, iPads, and Macs Hacked and Hijacked for Ransom in Australia - Gotta Be Mobile

Tweet (D):

Collaborative User Model

Sport Food

Manchester: 5

Play: 4

Win: 2

Cake: 6

Apple: 5

Oven: 2

Friend 1

Manchester: 5

Play: 4

Win: 2

Friend 2

IT Music

Radiohead: 4

Listen: 2

Song: 5

Android: 6

Coding: 2

Java: 2

Friend 3

Manchester: 5

Play: 4

Win: 2

Android: 6

Coding: 2

Java: 2

Radiohead: 4

Listen: 2

Song: 5

Cake: 6

Apple: 5

Oven: 2

Collaborative Model

Collaborative User Model

• Weighted sum of the IMs of the top-n friends

– based on the amount of interactions (re-tweets, mentions, conversations)

• Weight of each friend f:

– wP(f): Popularity of f

– wA(u,f): Affinity of u and f

• Weight of each f’s topic k:

– wB(u,k): Topic bias

– wI(u,f,k): Topic-interaction between u and f
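A minimal sketch of the weighted sum follows. The slides do not say how wP and wA, or wB and wI, are combined, so this sketch takes one precomputed weight per friend and one per friend-topic pair and multiplies them. It also assumes friends are selected by interaction count. All names, weights, and the assignment of topics to friends are illustrative assumptions.

```python
from collections import defaultdict

def collaborative_model(friend_models, interactions, friend_weight,
                        topic_weight, top_n=2):
    """Weighted sum of the individual models of the top-n friends.

    friend_models: {friend: {topic: {word: count}}}
    interactions:  {friend: number of interactions with the user}
    friend_weight: {friend: weight}           (stand-in for wP and wA)
    topic_weight:  {(friend, topic): weight}  (stand-in for wB and wI)
    """
    top_friends = sorted(friend_models, key=lambda f: interactions[f],
                         reverse=True)[:top_n]
    cm = defaultdict(lambda: defaultdict(float))
    for f in top_friends:
        for topic, words in friend_models[f].items():
            w = friend_weight[f] * topic_weight.get((f, topic), 1.0)
            for word, count in words.items():
                cm[topic][word] += w * count
    return cm

sport = {"manchester": 5, "play": 4, "win": 2}
food = {"cake": 6, "apple": 5, "oven": 2}
it = {"android": 6, "coding": 2, "java": 2}
music = {"radiohead": 4, "listen": 2, "song": 5}

friend_models = {
    "friend1": {"Sport": sport, "Food": food},
    "friend2": {"Sport": sport},
    "friend3": {"IT": it, "Music": music},
}
interactions = {"friend1": 12, "friend2": 7, "friend3": 1}   # hypothetical
friend_weight = {"friend1": 0.5, "friend2": 0.3, "friend3": 0.2}
cm = collaborative_model(friend_models, interactions, friend_weight, {})
```

With `top_n=2`, only the two friends the user interacts with most contribute, so Music and IT do not appear in the resulting model.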

Personalization using IM and CM

From user, from topic model, from friends

Dirichlet smoothing

Depends on the amount of user’s tweets
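The combined formula is not preserved here, but the standard form of Dirichlet smoothing shows why the balance depends on the amount of the user's tweets: the user's own counts are blended with a background distribution, and the weight on the user's data is n / (n + mu), where n is the number of observed words. In the sketch below the background stands in for the friends' and topic models. The function name, `mu`, and the toy probabilities are assumptions.

```python
def smoothed_word_prob(word, user_counts, background_prob, mu=10.0):
    """Dirichlet-smoothed P(w) for one topic of one user.

    user_counts: {word: count} from the user's own tweets in this topic
    background_prob: function word -> probability (e.g. from friends' models)
    """
    n = sum(user_counts.values())
    return (user_counts.get(word, 0) + mu * background_prob(word)) / (n + mu)

def background(word):
    # Hypothetical background distribution for the IT topic.
    return {"android": 0.2}.get(word, 0.01)

p_empty = smoothed_word_prob("android", {}, background)
p_sparse = smoothed_word_prob("android", {"android": 1}, background)
p_active = smoothed_word_prob("android", {"android": 60, "java": 40}, background)
```

A user with no tweets falls back entirely on the background estimate, while an active user's own word counts dominate.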

Search User Model (SM)

• Feedback sources: Queries + clicks

• What does a ‘click’ mean?

URL click, re-tweet, favorite

• Feedback from a ‘click’:

– Query-topic: preference for topic k when issuing Q

– Topic-word: preference for words in topic k

– Topic: user’s search bias towards topic k
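The three kinds of feedback can be pictured as three sets of counters updated on each click. The following is a minimal sketch, assuming each clicked tweet has a single known topic. The class, its method names, and the example clicks are hypothetical and not taken from the authors' system.

```python
from collections import Counter, defaultdict

class SearchUserModel:
    """Counts of click feedback for one user."""

    def __init__(self):
        self.query_topic = defaultdict(Counter)  # query -> topic -> clicks
        self.topic_word = defaultdict(Counter)   # topic -> word -> count
        self.topic = Counter()                   # topic -> clicks

    def record_click(self, query, tweet_words, topic):
        """Update all three feedback counters for one clicked tweet."""
        self.query_topic[query][topic] += 1
        self.topic_word[topic].update(tweet_words)
        self.topic[topic] += 1

    def topic_bias(self, topic):
        """Share of all clicks that fell on the given topic."""
        total = sum(self.topic.values())
        return self.topic[topic] / total if total else 0.0

sm = SearchUserModel()
sm.record_click("australia", ["iphones", "hacked", "australia"], "IT")
sm.record_click("android", ["android", "update"], "IT")
sm.record_click("australia", ["restaurants", "australia"], "Travel")
```

After these three clicks the user's search bias towards IT is 2/3, and for the query "australia" the clicks are split evenly between IT and Travel.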

Evaluation

Query log collection

• Evaluation interface

– Submit a query; the interface returns tweets from the Twitter API

– Rate relevant tweets

Datasets

• Controlled user study (Log_CoS)

– 11 users

• In-the-wild user study (Log_IwS)

– 24 users

[Figure: Log_CoS and Log_IwS]

Ranking Results

Baselines:

Query likelihood (J-M smoothing)

Topic model-based IR

Personalized search (User-specific language models)

Collaborative search (Cluster-specific language models)

Collaborative Personalized search

Ranking Results

Average per-user ranking performance after processing i user’s queries

[Figure: Comparison of models. Panels: (a) Log_CoS, (b) Log_IwS]

Query types

[Figure: Performance by query type. Panels: (a) Log_CoS, (b) Log_IwS]

In summary

• Collaborative Personalized Twitter Search

– User’s tweets

– User’s friends’ tweets

– User’s search activity

– Organized around topics

• topic-specific language models

Future work

• Query-dependent personalization strategies

• Selection of an optimal set of friends for the collaborative model

• Integrating spatial and temporal features

Thank You!
