Top Banner
Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems Dr. Aaron C. Elkins The University of Arizona
48

Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Feb 23, 2016

Download

Documents

Hachi

Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems. Dr. Aaron C. Elkins The University of Arizona. Emotional Voice. Can computers perceive vocal emotion?. Yes…. but, The science of the emotional voice is young Communication is complex and dynamic - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalic Markers of Deception and Cognitive Dissonance for Automated

Emotion Detection Systems

Dr. Aaron C. ElkinsThe University of Arizona

Page 2: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Emotional Voice

2

Page 3: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Can computers perceive vocal emotion?

Yes…. but,The science of the emotional voice is

youngCommunication is complex and

dynamicMoods and emotions contextually switch

Emotion is computationally ill-definedMeasuring emotion may inform theory

3

Page 4: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Emotional Dimensions

4

DISGUST?

Page 5: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Four Components of Speech

Voiced vs. Unvoiced sounds [v] vs. [f]

Airstream through mouth or nose [m] vs. [o]

5

Page 6: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Speech Sounds

(1) pitch, (2) loudness, and (3) qualitySound is small variations in air pressure that

occur rapidly in successionVocal folds superimpose outgoing air of

voiced soundsThe vocal folds vibrate to create a periodic

vibration (100 – 250 Hz)We measure these features digitally

6

Page 7: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Recording Father – Digital Audio

7

Waveform measures pulses of vocal foldsBased on air pressure disturbance (dB)

Voiced vs. Unvoiced (low pressure)

Each peak occurs every 100th of a second (100 Hz)

Page 8: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vowel Articulation

8

Source-Filter Theory (Müller, 1848)

Vocal Folds vibrate at same speed (pitch)

Resonance changes in vocal tract to filter frequencies (formants)

Page 9: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

9

Vocalics

Vocalic AnalysisExamines how it

was saidAmplitude Pitch (frequency)Response latencyTempo

LinguisticsExamines what

was said

Page 10: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Sound Production is Complex

When we tense our muscles, such during stress, our larynx tenses Higher Pitch

The process is complex Emotions affect the normal operation Deception takes away cognitive resources away and is

stressful More mistakes, lower quality, increased average and variation in

pitch Sympathetic Nervous system response

Increased auditory acuity Heightened arousal

10

Page 11: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Standard Vocal Measures Calculated with Praat and Custom Signal Processing Software

11

Page 12: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Nemesysco LVA 6.50Commercial Vocalic Software Evaluated

12

Page 13: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Five Vocalic Studies Summarized

Study One (Deception Experiment)Study Two (Cognitive Dissonance)Study Three (Embodied Conversational

Agent and Trust)Study Four (Embodied Conversational

Agent Security Screening - Bomber)Study Five (Embodied Conversational

Agent Security Screening - Imposter)

Page 14: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Deception (Study 1) – Experimental Design

N = 96 $10 reward for appearing credible to professional

interviewer Two Sequences:

First Sequence: DT DDTT TD TTDD TSecond Sequence: DT TTDD TD DDTT T

13 Short-Answer Questions Only 8 had variation both within and between subjects Two types of questions: Charged and Neutral

14

Page 15: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Results

Built-in classification performed at chance level Vocal measures independent of system discriminated

deception: FMain, AVJ, and SOS Possible Latent Variables measuring Conflicting Thoughts,

Cognitive Effort, and Emotional Fear Logistic regression performed best on charged questions

Higher pitch, cognitive effort, and hesitations are predictive of deception in more stressful interactions

The claim that the vocal analysis software measures stress, cognitive effort, or emotion cannot be completely dismissed

Deception and Stress can be predicted by Acoustic measures of Voice Quality and Pitch when controlling for speaker characteristics

15

Page 16: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Dissonance (Study 2) –Experimental Design

Modified Induced-Compliance ParadigmParticipants (N=52) made two vocal

counter-attitudinal arguments for cutting funding for service for the disabled

Choice is manipulated High vs. Low (IV) High N = 24, Low N = 28

Participants report attitude towards argument issue (DV)

Page 17: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Arousal (Vocal Pitch)

17

High choice had a 10Hz higher pitchF(1,50) = 4.43,

p = .04All participants

reduced their pitch over timeF(1,50) = 4.90,

p = .03

Page 18: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Cognitive Difficulty

High Choice had nearly 2x the response latency on argument twoF(1,50) = 4.53, p

= .04Arousal

moderation

18

Page 19: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Cognitive Difficulty

Participants spoke with 33% more nonfluencies on the second argumentF(1,50) = 4.03,

p = .05

19

Page 20: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

The Importance of Language (Imagery as Abstract Language)

20

Page 21: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Dissonance Model

χ²(1, N = 51), p = .49 SRMR = .02R² Attitude Change = .17, Imagery = .11

21

Page 22: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

From the lab to the AVATAR

22

Page 23: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

First Kiosk

23

Page 24: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Kiosk from Last Year

24

Page 25: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Third-Generation Kiosk

25

Page 26: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Gender and Demeanor

26

Page 27: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Trust (Study 3) – Experimental Design

• Participants completed pre-survey

• Packed bag before ECA screening interviewing

• Completed security screening

• All responses to ECA recorded for vocal analysis

Page 28: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

ECA Demeanor and Gender

28

Question Block 1

Question Block 2

Question Block 3

Question Block4

Repeated Measures Latin Square DesignAll participants interacted with all demeanor and gender ECA combinations4 Questions Per block, 16 Total Questions

N = 88 Participants (53 Males, 35 Females)

Page 29: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Trust and Time

Main effects Initial Trust = 4.09 Trust Rate of Change

.04 per second increase p < .01

Duration .05 decrease in trust for

every second spent answering the ECA overthe 7.6 second average

p < .001

29

Multilevel Growth Model Specified with Trust as the DV (N = 218) with Subject as random effect (N=60)

Page 30: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Pitch, Time, and Trust

Main Effect of Pitch For every 1Hz increase

in pitch over 156Hz trust drops by .01

p = .03 Interaction Pitch and

Time Pitch x Time b = 9.3e-

05, p = .03 Over time pitch predicts

trust less and less

30

Page 31: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Results

Human perceptions of trust transfer to ECA Time plays in important role in the interaction All participants trusted the ECA more over

time, particularly when it smiled 48 increase in trust when ECA smiles

Vocal measures of pitch predicted trust, but only early on For every 1Hz increase in pitch over 156Hz trust

drops by .01 Over time pitch predicts trust less and less

31

Page 32: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalics of a Bomber (Study 4) Experimental Design

• 29 EU border guards were randomly assigned to build a bomb (N = 16) or Control (N = 13) then pack a bag

• Identical to Study 3,but no breaks in the interview

• Only male neutral demeanor ECA interviewed participants

• Bomb Makers were instructed to successfully smuggle the bomb past the ECA

Page 33: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocal Analysis

Recorded responses to question:“Has anyone given you a prohibited

substance to transport through this checkpoint?”Average Response 2.68 sec (SD = 1.66)

Responses such as “No” or “of course not”Vocal measures of Pitch and Pitch

Variation

33

Page 34: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Results of Vocal Pitch

Voice Quality, Gender, and Intensity included as covariates

No difference in mean vocal pitch F(1,22)=0.38, p = .54

Main Effect of pitch variation Bomb Makers had 25.34%

more variation F(1,22)=4.79, p=.04

34

Page 35: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Pitch Contours

35

Page 36: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

36

Eye Gaze: Guilty

Page 37: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

37

Eye Gaze: Innocent

Page 38: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalics of an Imposter (Study 5) – Experimental Design

38 EU Border GuardsAll required to present visa and passport

through multiphase screening E-gate Manual Processing AVATAR Screening Interview

Four randomly assigned imposters carrying false documents with hostile intentions through screening

Page 39: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

AVATAR Interaction Example

Page 40: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

iPad Output for Screener

40

Page 41: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Voice Quality Change from Baseline Question (What is your full name?)

41

Page 42: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalic Classification Model

42

Page 43: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalic Resulting Classification

7 innocents falsely classified as terrorists 27 correctly classified as innocent All “guilty” referred to secondary Overall accuracy = 81%

TPR = 100% TNR = 79% FPR = 20% FNR = 0%

43

Page 44: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Eye Fixations on Visa

44

Page 45: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Date of Birth Results – Correct?

45

Page 46: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Final Decision Model

46

Page 47: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Vocalic Resulting Classification

3 innocents falsely classified as terrorists One of these three was actually lying

Actually a True Positive 31 correctly classified as innocent All “guilty” referred to secondary Overall accuracy = 94.47%

TPR = 100% TNR = 88.24% FPR = 5.8% Reduced by 3/4 FNR = 0%

47

Page 48: Vocalic Markers of Deception and Cognitive Dissonance for Automated Emotion Detection Systems

Questions?

Isn’t the voice amazing?