Top Banner
People on Drugs : Credibility of User Statements in Health Forums Subhabrata Mukherjee 1 Gerhard Weikum 1 Cristian Danescu-Niculescu-Mizil 2 1 Max Planck Institute for Informatics 2 Max Planck Institute for Software Systems KDD 2014 August 25, 2014
42

People on Drugs: Credibility of User Statements in Health Forums

Jun 26, 2015

Download

Healthcare

People on Drugs: Credibility of User Statements in Health Communities. Subhabrata Mukherjee, Gerhard Weikum and Cristian Danescu-Niculescu-Mizil. Proc. of the 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). 2014
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: People on Drugs: Credibility of User Statements in Health Forums

People on Drugs : Credibility of UserStatements in Health Forums

Subhabrata Mukherjee 1 Gerhard Weikum 1

Cristian Danescu-Niculescu-Mizil 2

1Max Planck Institute for Informatics

2Max Planck Institute for Software Systems

KDD 2014

August 25, 2014

Page 2: People on Drugs: Credibility of User Statements in Health Forums

Motivation: Internet as a healthcare resource

59% of US population use internet for health information[Pew Research Center Report, 2013]

Half of US physicians rely on online resources[IMS Health Report, 2014]

This work:Credibility of user-generated online health information

Page 3: People on Drugs: Credibility of User Statements in Health Forums

Motivation: Internet as a healthcare resource

59% of US population use internet for health information[Pew Research Center Report, 2013]

Half of US physicians rely on online resources[IMS Health Report, 2014]

This work:Credibility of user-generated online health information

Page 4: People on Drugs: Credibility of User Statements in Health Forums

Posts from Healthboards.com

“My girlfriend always gets a bad dry skin, rash on her upper arm,cheeks, and shoulders when she is on [Depo]. . . . ”

“I have had no side effects from [Depo] (except ... ), but otherwise norashes. She should see her gyno. She may be allergic to something”

Page 5: People on Drugs: Credibility of User Statements in Health Forums

Posts from Healthboards.com

“My girlfriend always gets a bad dry skin, rash on her upper arm,cheeks, and shoulders when she is on [Depo]. . . . ”

“I have had no side effects from [Depo] (except ... ), but otherwise norashes. She should see her gyno. She may be allergic to something”

Page 6: People on Drugs: Credibility of User Statements in Health Forums

Our IntuitionUsers, language and credibility influence each other

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3u3

s1

Trustworthy users write credible postsAgree with each other on credible statements

Page 7: People on Drugs: Credibility of User Statements in Health Forums

Our Intuition

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3u3

s1

Page 8: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“I heard Xanax can have pretty bad side-effects. You may have peelingof skin, and apparently some friend of mine told me you can developulcers in the lips also. If you take this medicine for a long time then youwould probably develop a lot of other physical problems. Which ofthese did you experience ?”

Usage of modals, indefinite determiner, conditional, probabilisticadverb, question particle, etc.

Page 9: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“I heard Xanax can have pretty bad side-effects. You may have peelingof skin, and apparently some friend of mine told me you can developulcers in the lips also. If you take this medicine for a long time then youwould probably develop a lot of other physical problems. Which of thesedid you experience ?”

Usage of modals, indefinite determiner, conditional, probabilisticadverb, question particle, etc.

Page 10: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“I heard Xanax can have pretty bad side-effects. You may have peelingof skin, and apparently some friend of mine told me you can developulcers in the lips also. If you take this medicine for a long time then youwould probably develop a lot of other physical problems. Which of thesedid you experience ?”

Usage of modals, indefinite determiner, conditional, probabilisticadverb, question particle, etc.

Page 11: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“I heard Xanax can have pretty bad side-effects. You may have peelingof skin, and apparently some friend of mine told me you can developulcers in the lips also. If you take this medicine for a long time then youwould probably develop a lot of other physical problems. Which ofthese did you experience ?”

Usage of modals, indefinite determiner, conditional, probabilisticadverb, question particle, etc.

Page 12: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“I heard Xanax can have pretty bad side-effects. You may have peelingof skin, and apparently some friend of mine told me you can developulcers in the lips also. If you take this medicine for a long time then youwould probably develop a lot of other physical problems. Which ofthese did you experience ?”

Usage of modals, indefinite determiner, conditional, probabilisticadverb, question particle, etc.

Page 13: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“Depo is very dangerous as a birth control and has too many long termside-effects like reducing bone density. Hence, I will never recommendanyone using this as a birth control. Some women tolerate it well butthose are the minority. Most women have horrible long lastingside-effects from it.”

Uses inferential conjunction, modal, definite determiners, etc.

Page 14: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“Depo is very dangerous as a birth control and has too many long termside-effects like reducing bone density. Hence, I will never recommendanyone using this as a birth control. Some women tolerate it well butthose are the minority. Most women have horrible long lastingside-effects from it.”

Uses inferential conjunction, modal, definite determiners, etc.

Page 15: People on Drugs: Credibility of User Statements in Health Forums

Language: Stylistic Features

“Depo is very dangerous as a birth control and has too many long termside-effects like reducing bone density. Hence, I will never recommendanyone using this as a birth control. Some women tolerate it well butthose are the minority. Most women have horrible long lastingside-effects from it.”

Uses inferential conjunction, modal, definite determiners, etc.

Page 16: People on Drugs: Credibility of User Statements in Health Forums

Language: Objectivity

“I started Cymbalta, but now I’m having a panic attack or an allergicreaction. I have a hardcore burning sensation in my chest and warmsensations all over. It’s like my body can’t decide whether it wants to becold or hot. I feel if I close my eyes I’ll lose control, go crazy and passout.”

Page 17: People on Drugs: Credibility of User Statements in Health Forums

Our Intuition

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3u3

s1

Page 18: People on Drugs: Credibility of User Statements in Health Forums

User Features

I User demographic features like age, gender, location

I Engagegement features like number of posts, questions, answers,thanks

I User post properties like avg. post length

Page 19: People on Drugs: Credibility of User Statements in Health Forums

Objective

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3u3

s1This is what we want

Page 20: People on Drugs: Credibility of User Statements in Health Forums

Probabilistic Inference: CRF

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3

s1

Observed Features Observed Features

CRF

Labels ?

Predict the most likely label assignment of statements

Page 21: People on Drugs: Credibility of User Statements in Health Forums

Semi Supervised LearningProtects against users conveying misinformation using confidentand objective language

I took a cocktail of meds. Xanax gave me hallucinations and a demonic feel.

Xanax and Prozacare known to cause drowsiness.

Xanax made medizzy and sleepless.

Statement Credibility

User Trustworthiness Language Objectivity

s1

s2 s3?

u1u2

p1p2

p3u3

s1

Observed Features Observed Features

CRF

Labels ?

Expert stated side-effects of drugs from MayoClinic portal

Page 22: People on Drugs: Credibility of User Statements in Health Forums

Semi-Supervised CRF (Sketch)

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

Page 23: People on Drugs: Credibility of User Statements in Health Forums

Semi-Supervised CRF (Sketch)

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

Page 24: People on Drugs: Credibility of User Statements in Health Forums

Semi-Supervised CRF (Sketch)

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

Page 25: People on Drugs: Credibility of User Statements in Health Forums

Semi-Supervised CRF (Sketch)

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

FalseDepo → dry skin

Page 26: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

1. Estimate user trustworthiness :

Page 27: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

1. Estimate user trustworthiness :

10.5

0

Page 28: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2?

u2

p1

p2

p3

True

Unknown

u1

False

2. E-Step : Estimate label of unknown statements by Gibbs' sampling :

Page 29: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2

u2

p1

p2

p3

True

Unknown

u1

False

2. E-Step : Estimate label of unknown statements by Gibbs' sampling :

Page 30: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2

u2

p1

p2

p3

True

Unknown

u1

False

3. M-Step : Maximize log-likelihood to estimate feature weights using Trust Region Newton :

Page 31: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2

u2

p1

p2

p3

True

Unknown

u1

False

4. Re-Estimate user trustworthiness :

Page 32: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2

u2

p1

p2

p3

True

Unknown

u1

False

10.5

1

4. Re-Estimate user trustworthiness :

Page 33: People on Drugs: Credibility of User Statements in Health Forums

Statement Credibility

User Trustworthiness Language Objectivity

s1

u3

s2

u2

p1

p2

p3

True

Unknown

u1

False

10.5

1

4. Re-Estimate user trustworthiness :

5. Apply E-Step and M-Step until convergence

Page 34: People on Drugs: Credibility of User Statements in Health Forums

Dataset

Healthboards.com community (www.healthboards.com) with850,000 registered users and 4.5 million messagesI We sampled 15,000 users with 2.8 million messages

Expert labels about drugs from MayoClinic portalI 2172 drugs categorized in 837 drug familiesI 6 widely used drugs used for experimentation

Page 35: People on Drugs: Credibility of User Statements in Health Forums

Dataset

Healthboards.com community (www.healthboards.com) with850,000 registered users and 4.5 million messagesI We sampled 15,000 users with 2.8 million messages

Expert labels about drugs from MayoClinic portalI 2172 drugs categorized in 837 drug familiesI 6 widely used drugs used for experimentation

Page 36: People on Drugs: Credibility of User Statements in Health Forums

Drug Statisticsa

aData available at : http://www.mpi-inf.mpg.de/impact/peopleondrugs/

Drugs Treatment For # Users # Postsalprazolam anxiety, depression, panic disorder 2.8K 21Kibuprofen pain, symptoms of arthritis 5.7K 15Komeprazole acidity in stomach and ulcers 1K 4Kmetformin high blood sugar, diabetes .8K 3.6Klevothyroxine hypothyroidism .4K 2.4Kmetronidazole bacterial infection .5K 1.6K

Page 37: People on Drugs: Credibility of User Statements in Health Forums

Baselines

I Frequency of statements

I SVM ClassificationI Feature vector for each statement using all our features

I SVM Classification with Distant SupervisionI Each user, post and statement instance constitutes a feature

vectorI Aggregate labels of all such instances for a statement by

majority voting

Page 38: People on Drugs: Credibility of User Statements in Health Forums

Accuracy Comparison

Page 39: People on Drugs: Credibility of User Statements in Health Forums

Use-Case: Following Trustworthy Users

What users should I follow to get information on drug X ?

Baseline: Rank users based on #thanks from community

Page 40: People on Drugs: Credibility of User Statements in Health Forums

Use-Case: Following Trustworthy Users

Compare with human annotations

Page 41: People on Drugs: Credibility of User Statements in Health Forums

Conclusions

Proposed a probabilistic graphical model to jointly learn usertrustworthiness, statement credibility and language use

I To extract side-effects of drugs from communities

I Identify expert users

Provides a framework to incorporate richer linguistic (e.g., bias,discourse) and user (e.g., perspective, expertise) features

Page 42: People on Drugs: Credibility of User Statements in Health Forums

Thank you