Top Banner
Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 06/13/22 1
16

Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Jan 20, 2016

Download

Documents

Amber Hunter
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Question Identification on Twitter

Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang

04/21/23 1

Page 2: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Agenda

• Background• Two-phase Classification• Experiments• Conclusion

04/21/23 2

Page 3: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Background

04/21/23 3

Page 4: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

04/21/23 4

Page 5: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Two Challenges

• 140 characters

• Special features

04/21/23 5

Page 6: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Two-phase Classification

• Interrogative Tweet Detection– Tweets which contain question sentences

• Qweet Extraction– Interrogative tweets which require some information

or help and thus need to be answered

Interrogative Tweet

DetectionTweets Qweet

ExtractionQweetsInterrogative

Tweets

04/21/23 6

Page 7: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Interrogative Tweet Detection

• Rule-based Approach– Question marks– 5W1H words and Refined 5W1H words – Heuristic Rules (Efron and Winget, 2010)

• Learning-based Approach– Frequent question patterns mining (Pei et al.,

2001) + One-class SVM (Schölkopf et al., 2001)– Over 850,000 QA pairs in community question

answering (CQA) portals were used

04/21/23 7

Page 8: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Qweet Extraction

• Types of Interrogative Tweets

04/21/23 8

Page 9: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Qweet Extraction

• Types of Interrogative Tweets

04/21/23 9

Page 10: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Qweet Extraction

• Types of Interrogative Tweets

04/21/23 10

Page 11: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Qweet Extraction

• Feature Extraction

04/21/23 11

Page 12: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Experiments

• Data Set

04/21/23 12

Page 13: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Results: Interrogative Tweet Detection

• Heuristics– H1: Must appear at the beginning of one sentence– H2: Add auxiliary words to the original 5W1H words

• “what” -> “what is” and “what are”

04/21/23 13

Page 14: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Results: Qweet Extraction

• Context features are of great importance in distinguishing qweets from non-qweets

• Tweet-specific features also help in qweet identification

04/21/23 14

Page 15: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Conclusion

• First Attempt in discovering questions from tweets automatically

• Two-phase classification – Interrogative Tweet Detection– Qweet Extraction

• Limitations and future work– Tweets containing rhetorical questions and

complicated self-ask-self-answer sentences– Real-time clustering (Ahmed et al., 2011)– Question analysis and classification

04/21/23 15

Page 16: Question Identification on Twitter Baichuan Li, Xiance Si, Michael R. Lyu, Irwin King, and Edward Y. Chang 10/9/20151.

Thank You!

Q&A

04/21/23 16