Top Banner
Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, Kentaro Inui Tohoku University 1 SENTIMENT 1: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using Tweets and Matrix Factorization
42

Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Mar 31, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Other Topics You May Also Agree or Disagree:

Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

Akira Sasaki, Kazuaki Hanawa, Naoaki Okazaki, Kentaro InuiTohoku University

1SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

Page 2: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Stance classification• Goal• Classify stances of texts in regard to a specific topic

• Applications• Public opinion survey from SNS data• Predicting voting actions

2SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

Input Output

Text: I fully agree with TPP Topic: TPP Stance: Agree

Page 3: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Difficulty of stance classification

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 3

People often talk about topicswithout explicitly mentioning the topic.

How can we classify stance from such a text?

Input Output

Text: It is better to promote domestic consumption Topic: TPP Stance: Disagree

Input Output

Text: It is better to promote free trade Topic: TPP Stance: Agree

Page 4: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Input Output

Text: It is better to promote domestic consumption Topic: TPP Stance: Disagree

Input Output

Text: It is better to promote free trade Topic: TPP Stance: Agree

who agree with TPP

free trade

revision of copyright law domestic consumption

distribution of pharmaceuticals

also agree with disagree with

knowledge

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 4

Use of inter-topic preferences forstance classification

inter-topic preference

Page 5: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

A relatively simple example

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 5

Topic words and their surrounding wordsprovide strong clues.(Somasundaran&Wiebe,2010),(Mohammad+,2013)

Input Output

Text: I fully agree with TPP Topic: TPP Stance: Agree

※ Although datasets used in this work are in Japanese, we provide examples in English for readability.

Page 6: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Proposal: modeling inter-topic preferences via matrix factorization

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 6

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !Users’ stances for

each topics (user-topic matrix)

Compute users’ densefeature vector and topics’ dense

feature vector via matrix factorization

Complete missing values by

feature vectors

The aim of matrix factorization:

1. capture inter-topic preferences by dense feature vectors2. reveal users’ hidden stances by completion

Page 7: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

The whole architecture

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 7

Corpus (tweets)

Tweets posted by userswho have used pro/con hashtags

A good news. [URL] #TPP反対

TPP ruins the future of our country

A is completely wrong

We should introduce A

to A

Pattern candidates in whichthe users describe topics

Linguistic pro/conpatterns

PatternExtraction

Sort candidatesand select

useful patterns

I support A / A is necessary /Welcome A / We should introduce A

I disagree A / A is completely wrong /A ruins the future of our country

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !

Mine topicpreferences

① Mining Linguistic Patterns of Agreement and Disagreement

② Extracting Instances ofStances

③ Matrix Factorization

Page 8: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

The whole architecture

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 8

Corpus (tweets)

Tweets posted by userswho have used pro/con hashtags

A good news. [URL] #TPP反対

TPP ruins the future of our country

A is completely wrong

We should introduce A

to A

Pattern candidates in whichthe users describe topics

Linguistic pro/conpatterns

PatternExtraction

Sort candidatesand select

useful patterns

I support A / A is necessary /Welcome A / We should introduce A

I disagree A / A is completely wrong /A ruins the future of our country

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !

Mine topicpreferences

① Mining Linguistic Patterns of Agreement and Disagreement

② Extracting Instances ofStances

③ Matrix Factorization

Page 9: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Mining linguistic patternsof agreement/disagreement• Focus on pro/con hashtags such as “#X賛成” or

“#X反対” used by users who have strong stances to topics

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 9

Then extractcon linguistic patterns

from other tweets by this user

#X反対 means“disagree with X”

Corpus (Tweet) Tweets posted by userswho have used pro/con hashtags

A good news. [URL] #TPP反対…

TPP is completely wrongA is completely wrong

We should introduce A

to A

Candidates of linguistic patterns

PatternExtraction

user X

user Y

Page 10: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

The whole architecture

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 10

Corpus (tweets)

Tweets posted by userswho have used pro/con hashtags

A good news. [URL] #TPP反対

TPP ruins the future of our country

A is completely wrong

We should introduce A

to A

Pattern candidates in whichthe users describe topics

Linguistic pro/conpatterns

PatternExtraction

Sort candidatesand select

useful patterns

I support A / A is necessary /Welcome A / We should introduce A

I disagree A / A is completely wrong /A ruins the future of our country

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !

Mine topicpreferences

① Mining Linguistic Patterns of Agreement and Disagreement

② Extracting Instances ofStances

③ Matrix Factorization

Page 11: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

• Sort aforementioned pattern candidates by their frequency, and filter manually

Extracting instances ofstances

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 11

A is completely wrong

We should introduce A

to APattern candidates

Linguistic patterns

I support AA is necessaryWelcome A…

I disagree AA is completely wrongA is silly…

PRO

CON

Manual examination

Page 12: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Corpus (Tweet)

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

User 1

User 2

User 3

User 4

TPP…do

mestic

cons

umpti

on

…I support domestic consumption

TPP is sillyuser 1

• By using linguistic patterns, we create user-topic matrix

Extracting instances ofstances

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 12

!",$ =#((,),+1) − #((, ), −1)#((,),+1) + #((, ), −1)

Number of timesthe user u agree with the topic v

Number of timesthe user u disagree

with the topic v

Each element of the matrix is:

A is completely wrong

We should introduce A

to APattern candidates

Linguistic patterns

I support AA is necessaryWelcome A…

I disagree AA is completely wrongA is silly…

PRO

CON

Manual examination

Page 13: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

The whole architecture

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 13

Corpus (tweets)

Tweets posted by userswho have used pro/con hashtags

A good news. [URL] #TPP反対

TPP ruins the future of our country

A is completely wrong

We should introduce A

to A

Pattern candidates in whichthe users describe topics

Linguistic pro/conpatterns

PatternExtraction

Sort candidatesand select

useful patterns

I support A / A is necessary /Welcome A / We should introduce A

I disagree A / A is completely wrong /A ruins the future of our country

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !

Mine topicpreferences

① Mining Linguistic Patterns of Agreement and Disagreement

② Extracting Instances ofStances

③ Matrix Factorization

Page 14: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Matrix factorization• By minimizing following objective function

• We can complete missing values as follows:

• Based on preliminary experiments, we set parameters as 𝑘 = 100,λ(= 0.1,λ*= 0.1 (refer to the paper for more info)

• We use libmf to solve the optimization problemhttps://github.com/cjlin1/libmf

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 14

r̂u,v ' pu|qv

minP,Q

X

(u,v)2R

(ru,v � pu|qv)

2 + �P ||pu||2 + �Q||qv||2

(u, v) 2 R:declared preference: u column vectors of P (user vector)

�P � 0,�Q � 0 : regularization coefficients: v column vectors of Q (topic vector)

pu 2 Rk

qv 2 Rk

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

1.0 0.3 -1.0 0.2

-1.0 0.2 0.7 -0.2

-0.4 -0.3 1.0 -1.0

0.2 0.5 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Topic

1To

pic 2

Topic

3To

pic 4

User 1

User 2

User 3

User 4

! !"

≈ × =

!" !

Page 15: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Evaluation• Ex1: Determining the dimension parameter 𝒌→ RMSE decreased as the number of dimensions ( ) increased

• Ex2: Predicting missing stances→ 80-94% accuracy on predicting missing stances

• Ex3: Correlation between human judgements→ Moderate correlation

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 15

𝑘

Page 16: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Dataset• Tweet corpus

• about 35 Billion tweets crawled from Feb. 2013 to Sep. 2016• about 7 Million users• retweets are removed

• Collected data• 100 pro patterns and 100 con patterns (manually filtered)• about 25 Million tuples (agreement/disagreement declaration)

corresponding to about 3 Million users and about 5,000 topics

• User-topic matrix• removed users and topics that appeared less than five times• about 10 Million tuples corresponding to about 270,000 users and about

2,300 topics• sparsity = 98.43%

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 16

Page 17: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

• How accurately can user and topic vectors predict missing stances?

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 17

Ex2: Predicting missing stances

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

1.0 -1.0

-1.0

1.0 -1.0

User 1

User 2

User 3

User 4To

pic 1

Topic

2To

pic 3

Topic

4

hide 5% of elements ≈ ×

!" !

=matrix

factorization

1.0 0.3 -1.0 0.2

-1.0 0.2 0.9 -0.2

-0.2 -0.3 1.0 -1.0

0.2 0.8 0.1 -0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

calculate accuracyin regard to hidden elements

Page 18: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Ex2: Predicting missing stances• How accurately can user and topic vectors

predict missing stances?• majority baseline: predict missing values as

majority one of agree/disagree in regard to the topic

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 18

Matrix factorization

Majority baseline

Ourapproachpredictsmissingtopicpreferencesby80– 94%accuracy

Sincepreferencesofvocalusersdeviatedfromthoseoftheaverage

users,majoritybaselinedecreased

MatrixFactorization

MajorityBaseline

Page 19: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

• Are predicted agreements/disagreements by matrix factorization are reasonable?

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 19

Ex2: Predicting missing stances

Ourapproachreasonablypredictsmissingvalues

user sample A

agree with

disagree with

regime changecapital relocation

Abe CabinetOkinawa US military basenuclear weaponsTPP

vote of non-confidence to Cabinetsame-sex partnership ordinancenational people’s government

may also agree with

steamrollering war billworsening dispatch lawsendai nuclear power plantwar bill

predict(matrix factorization) may also disagree with

Page 20: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Conclusion• Modeled inter-topic preferences by matrix factorization

• Our approach accurately predicts missing stancesby 80-94% accuracy

• Future work• Use methods of targeted sentiment analysis

instead of using linguistic patterns

• Extend our approach to other domains• product, company, music, etc

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 20

Page 21: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 21

Page 22: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 22

Page 23: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Appendix

SENTIMENT 1: Other Topics You May Also Agree or Disagree:

Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

23

Page 24: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Ex1: Determining the dimension parameter 𝒌• We observed that the reconstruction error decreased

as the iterative method of libmf progressed

• Based on this result,we concluded that

is sufficient forreconstructing theoriginal matrix

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 24

𝑘 = 100

𝑅

Page 25: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Ex2: Predicting missing stances

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 25

• majority baseline: predict missing values as majority one of agree/disagree in regard to the topic

1.0 -1.0

-1.0

1.0 -1.0

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

1.0

1.0

-1.0

agree

agreedisagr

ee

disagr

ee

Page 26: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Ex2: Predicting missing stances

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 26

• Since preferences of vocal users deviated from those of the average users, majority baseline decreased

Page 27: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Ex3: Correlation betweenhuman judgements• Created a dataset of pairwise inter-topic

preferences by using a crowdsourcing service

• Obtained 6-10 human judgements for everytopic pair, then computed the mean of the points

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 27

Q. People who agree with topic A also agree with topic B?

A1. those who agree/disagree with topic A may also agree/disagree with topic B

A2. those who agree/disagree with topic A may conversely disagree/agree with topic B

A3. otherwise(no associaction between topic A and topic B

+1

-1

±0

Page 28: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

topic A topic B

Spearman’s rank correlation coefficient = 0.2210

cosine similarity

1.0 0.6

topic C topic D0.8 0.7

topic Y topic Z-1.0 -0.3

human judgement

Ex3: Correlation betweenhuman judgements• Compared human judgements and

similarity between vectors of pairs

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 28

450 topic pairs

a moderate correlationeven though

inter-topic preferences are highly subjective

Page 29: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 29

Sub1: Example of predicted missing topic preference (qualitative)

Page 30: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Sub2: Similarity between topic vectors

• Do the topic vectors obtained by matrix factorization capture inter-topic preferences?

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

Topic:LiberalDemocraticParty(LDP)Top 7 of similar topics cosisne

similarityAbs’s LDP 0.3937resuming nuclear power plant operations

0.3765

bus rapid transit (BRT) 0.3410hate speech countermeasure law 0.3373Henoko relocation 0.3353C-130 0.3338Abe administration 0.3248

Synonymous topics successfully have similar

vectors

Page 31: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

• Do the topic vectors obtained by matrix factorization capture inter-topic preferences?

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

Topic:LiberalDemocraticParty(LDP)Top 7 of similar topics cosisne

similarityAbe’s LDP 0.3937resuming nuclear power plant operations

0.3765

bus rapid transit (BRT) 0.3410hate speech countermeasure law 0.3373Henoko relocation 0.3353C-130 0.3338Abe administration 0.3248

Topics promoted by LDP also have similar vectors

Sub2: Similarity between topic vectors

Page 32: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Sub2: Similarity between topic vectors

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 32

Page 33: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 33

Page 34: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 34

Page 35: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Unused slides

SENTIMENT 1: Other Topics You May Also Agree or Disagree:

Modeling Inter-Topic Preferences using Tweets and Matrix Factorization

35

Page 36: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 36

How can we use intrinsic knowledge in stance classification?

Input Output

Text: It is better to promote domestic consumption Topic: TPP Stance: Disagree

Input Output

Text: It is better to promote free trade Topic: TPP Stance: Agree

TPP

domestic consumption

free trade

revision of copyright law

distribution of pharmaceuticals

PROMOTE

PROMOTE SUPPRESS

SUPPRESS

Background knowledge

assume we know that“better to promote X”

means agreement to X

Page 37: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 37

Previously, we manually annotated these PROMOTE/SUPPRESS knowledge and utilized in stance

classification (Sasaki+, WI2016)

How can we use intrinsic knowledge in stance classification?

Page 38: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Challenge formodeling inter-topic preference• Intuitively, we can see a topic as a vector

consisting of users’ declared stances

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 38

Those who agree with topic A also agree

with topic B

Those who agree with topic A

disagreewith topic B

Topic

A

User 1

cosine similarity = 1

1 : the user agrees with the topic : the user disagrees with the topic-1

1-11-1

1-11-1

User 2

User 3

User 4

Topic

B

Topic

A

User 1

cosine similarity = -1

1-11-1

1-1

1-1

User 2

User 3

User 4

Topic

B

Page 39: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Challenge formodeling inter-topic preference• However, a lot of people declare

agreement/disagreement to only a few topics

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 39

Empty cell meansundeclared stance

1.0 -1.0

-1.0 0.7

-0.4 1.0 -1.0

0.5

User 1

User 2

User 3

User 4

Topic

1To

pic 2

Topic

3To

pic 4

Page 40: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

Other usage of inter-topic preference• Public opinion survey• analyze people’s political ideology at low cost

(cf. public opinion poll, census)• finer-grained than liberal/conservative

• Electoral campaigns• we can assume

“those who agree with topic A also vote for party B”

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 40

Page 41: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 41

Page 42: Other Topics You May Also Agree or Disagree: Modeling Inter-Topic Preferences using ... · 2018. 7. 27. · Stance classification •Goal •Classify stances of texts in regard to

SENTIMENT 1: Other Topics You May Also Agree or Disagree:Modeling Inter-Topic Preferences using Tweets and Matrix Factorization 42