Top Banner
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages Soudip Roy Chowdhury, Muhammad Imran , Muhammad Rizwan Asghar, Sihem Amer-Yahia, Carlos Castillo
15

Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Jun 28, 2015

Download

Education

Muhammad Imran

This work describes our work presented at the ISCRAM-2013 conference. We presented Tweet4act system, which is used to detect and classify crisis-related messages communicated over a microblogging platform. Our system relies on extracting content features from each message. These features and the use of an incident-specific dictionary allow us to determine the period type of an incident that each message belongs to.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Soudip Roy Chowdhury, Muhammad Imran, Muhammad Rizwan Asghar, Sihem Amer-Yahia, Carlos Castillo

Page 2: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Disaster & Social Media

Disaster Strikes, Social Media Responds

Page 3: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Virtual Collaboration, Information Sharing

• Valuable information

• Contribute to situational awareness

• Highly useful, if analyzed timely and effectively(Starbird et al., 2010; Latonero and Shklovski,

2010)

Page 4: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Social Media Response to Disaster Phases

Before

During

After

Page 5: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Disaster Management, Crisis Informatics

- Caution, warnings- Alerts etc.

- Damage- Causalities etc.

- Request for help- Donations etc.

• The main goals of our research:1. Identify messages related to an incident.2. Classify incident-messages with the corresponding

period (PRE, DURING, POST).

Page 6: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Datasets & Examples

1. Joplin Tornado on May 22, 2011

2. Nesat Typhoon in Philipines on Sep 27, 2011

3. Haiti Earthquake on Jan 12, 2010

• [PRE] New #tropical storm forms in the West #Pacific. #Nesat may hit the #Philippines & #China as a #typhoon next week

• [DURING] @Yahoo News: Powerful #typhoon with winds up to 106 mph makes landfall in #Philippines as 100,000 odered to fless homes

• [POST] News5 Action center is now accepting donations for the victims of Typhoon “pedring. Drop boxes are located @ TV5 Office :)

Page 7: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Tweet4Act System• Collection -> Filtering -> Period Classification

Page 8: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

1. Filtering Process

• Normalization: remove “RT @username” and “@username” prefixes and remove duplicate messages

• Apply the k-mediod method with the manhattan distance between medoids and messages in each cluster

• Discard all cluster having a negative number or zero as silhouette coefficient

• Select from each cluster the fraction m messages closer to the mediod

Page 9: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Filtering Process Validation

• Using CrowdFlower crowdsourcing platform

Page 10: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

2. Dictionary Based Period Classification• Most frequent words across datasets

• “warning” & “alert” typically found in the Pre

• “now”, “sweeps” etc. typically found in During

• “aftermath”, “donate” etc. typically found in Post

Page 11: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

3. NLP-Based Period Classification

• Tense of verbs can help identify period. (A. Iyengar

et al., 2011)

POS tagging

1. Dictionary based verbs get +1 (ignore below)

2. Aux verbs get +1(e.g., could-PRE, are-DURING, did-POST)

3. If a main verb in future/present/past tense, add +0.5 to pre/during/post period, respectively.

Ties: PRE > DURING > POST

Page 12: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Manual Period Classification

• CrowdFlower crowdsourcing period labeling

Page 13: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Performance of Tweet4Act

PRE

PRE

PRE

DURING

DURING

DURING

POST

POST

POST

AVG

AVG

AVG

Page 14: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

References

• A. Iyengar, T. Finin, and A. Joshi (2011) Content-based prediction of temporal boundaries for events in Twitter. In Proceedings of the Third IEEE International Conference on Social Computing.

• K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010) Chatter on the red: what hazards threat reveals about the social life of microblogged information. In Proceedings of the 2010 ACM conference on Computer supported cooperative work, pages 241–250. ACM.

• Latonero, Mark, and Irina Shklovski. "“Respectfully Yours in Safety and Service”: Emergency Management & Social Media Evangelism.” Proceedings of the 7th International ISCRAM Conference– Seattle. Vol. 1. 2010.

Page 15: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Thank you!Muhammad [email protected]