Top Banner
Avar Monitoring the blogosphere for emerging, health related events, so Health Officials don‘t have to Team Mentor: Avaré Stewa
9

Avar

Feb 22, 2016

Download

Documents

Avar. Health. Sentinel. Public. Dashboard. Health Alert Level. Team Mentor: Avaré Stewart. Monitoring the blogosphere for emerging, health related events, so Health Officials don‘t have to. Event-Based Biosurveillance. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Avar

Avar

Monitoring the blogosphere for emerging, health related events, so Health Officials don‘t have to

Team Mentor: Avaré Stewart

Page 2: Avar

Event-Based BiosurveillanceMonitor time series, textual

data to provide early

alerts to anomalies ...

stimulate investigation of potential outbreaks

Shmueli 2010

Page 3: Avar

What If ....?

Could we have detected theemergence of the2009 Swine FluPandemic from Blog Social Media ?

Page 4: Avar

• What approach can be used to create Simulated Outbreaks for blog text?

– What outbreak patterns/signatures exist?

– How can text be generated to simulate a given outbreak pattern?

• What tools would be useful in creating Simulated Textual and Numerical Outbreak Data?

Questions to Consider?

Page 5: Avar

Feature Selection and Counts for Outbreak Data

Event-Based (Text) Data

• Select (textual features)i.e.: Number documents

containing mentions „flu“• Get timestamps• Create counts from features

Time Series Data

• Select features: i.e.: hospital visits,

death rate• Get timestamps• Create counts from

features

Page 6: Avar

What is the Task?Using existing data, tools and references:

• Part 1: Design an approach to creating Simulated Data from example data• Part 2: Design an approach for adding Noise to the Simulated Data• Part 3: List and summarize any additional tools and approaches that

would be useful in creating Simulated Data

• Document your design:

• Discuss the motivation for your work/results• Outline algorithms with PseudoCode• Provide several example input and outputs • Hightlight strengths and shortcomings

Page 7: Avar

• Build Simulated Data from Event-Based Biosurveillance

• How to organize, design, implement, deliver small-scale project results

• That your contributions are valuable ....

What Will You Learn?

Page 8: Avar

Starting PointsPapers :

The Nature of Outbreaks and Their Determination (Shmueli, Section 3.2 )Characteristic shapes of outbreak news. (Collier, Figure 5)

Model-Specific Generation:

Data Simulation Using Rhttp://biostat.mc.vanderbilt.edu/wiki/pub/Main/AngelAn/myslides5.pdf

Random Generation:

“Generating Random Text with Bigrams“http://nltk.googlecode.com/svn/trunk/doc/book/ch02.html

“How to Generate Random Text in Word 2003” http://www.ehow.com/how_2183058_generate-random-text-word.html

Page 9: Avar

Background Reading

1. Statistical Challenges Facing Early Outbreak Detection in Biosurveillance, Galit Shmueli

2. What‘s Unusual in Online Disease Outbreat News, Nigel Collier

3. Detecting Influenza Outbreaks by Analyting Twitter Messages, Aron Culotta