Top Banner
03/25/22 1 Predicting Phrasing and Accent Julia Hirschberg
25
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 1

Predicting Phrasing and Accent

Julia Hirschberg

Page 2: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

Demos

• Current schedule for individual demos– 8/25

• 2pm will• 2:30pm karl

– 8/28• 3:30 Ros

– Lambbook?• Everyone else: Please make sure your system can be

run by me. Have a non-team leader run in the team leader’s $HOMEDIR to make sure permissions are set correctly and paths are specified.

Page 3: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

Today

• Motivation and issues• Approaches: hand-built vs. corpus-based rules• Predicting phrasing• Predicting accent• Future research: emotion, personalization,

personality

04/18/23 3

Page 4: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 4

Synthesizing the News

A car bomb attack on a police station in the northern Iraqi city of Kirkuk early Monday killed four civilians and wounded 10 others U.S. military officials said. A leading Shiite member of Iraq's Governing Council on Sunday demanded no more "stalling" on arranging for elections to rule this country once the U.S.-led occupation ends June 30. Abdel Aziz al-Hakim a Shiite cleric and Governing Council member said the U.S.-run coalition should have begun planning for elections months ago.

-- Loquendo

Page 5: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

How do we assign intonation in TTS?

• Take a sentence like:

Today the world lost a major talent – Amy Winehouse.

• Or

You are clearly just totally insane, dude!• Or

Do you want to leave from Dallas or Denver?• Creating/finding appropriate prosody in TTS

Page 6: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 6

Why is Intonation Assignment Important?

• TTS and CTS– Naturalness– Intelligibility– Acceptability

Page 7: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 7

What are the indicators of phrasing in speech?

• Timing– Pause– Lengthening

• F0 changes• Vocal fry/glottalization

Page 8: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 8

Problems Assigning Phrasing

• Traditional hand-built rules– Use punctuation: 234-5682, New York, NY– Context/function word: no breaks after

function word: • He went to dinner. • He came to and went to dinner.

– Syntax: • She favors the nuts and bolts approach. • She went home and Dave stayed.

Page 9: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 9

What are the indicators of accent in speech?

• F0 excursion• Durational lengthening• Voice quality• Vowel quality• Loudness

Page 10: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 10

Problems Assigning Accent

• Traditional Hand-built rules– Function/content distinction

• He went out the back door• He threw out the trash

– Complex nominals: • Main Street/Park Avenue• city hall parking lot (stress shift)• German teachers

Page 11: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

What are the indicators of contour in speech?

• Sequence of pitch accents, phrase accents and boundary tones

• Predicing phrasing and accent – which comes first?

Page 12: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 12

Problems Assigning Contours

• Simple rules– ‘.’ = declarative contour– ‘?’ = yes-no-question contour unless wh-word

present at/near front of sentence• Well then, how did he do it? And what do you

know?

– What should we do with ‘!’– How can we assign other contours from text

input?– Why should we try?

Page 13: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 13

How can we improve intonation assignment from text analysis?

• Simplest rules– Accent content words, deaccent function words– Use punctuation to determine phrasing and

sentence-final prosody– Limitations

• Doesn’t work well, especially for longer sentences• Hard to add new rules – consider consequences for

entire system: Bell Labs ‘Frend’

• Current solution: Corpus-based approaches (Sproat et al ’92)– Train prosodic variation on large hand-labeled

corpora using machine learning techniques

Page 14: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 14

– Accent and phrasing decisions trained separately

• Binary prediction• Feat1, Feat2,…Accent• Feat1, Feat2,…Boundary

– Associate prosodic labels with simple features of transcripts that can be automatically computed

• distance from beginning or end of phrase• orthography: punctuation, paragraphing• part of speech, constituent information

– Apply automatically learned rules when processing text

Page 15: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 15

Statistical learning methods

• Classification and regression trees (CART)• Rule induction (Ripper), Support Vector

Machines, HMMs, Neural Nets• All take vector of independent variables and one

dependent (predicted) variable, e.g. ‘there’s a phrase boundary here’ or ‘there’s not’

Feat1 Feat2 …FeatN DepVar• Input from hand labeled dependent variable and

automatically extracted independent variables• Result can be integrated into TTS text processor

Page 16: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 17

What linguistic and contextual features are linked to phrasing?

• Syntactic information– Abney ’91 chunking major constituents– Steedman ’90, Oehrle ’91 CCGs …– Which ‘chunks’ tend to stick together?– Which ‘chunks’ tend to be separated intonationally?

• Largest constituent dominating w(i) but not w(j)NP[The man in the moon] |? VP[looks down on you]• Smallest constituent dominating w(i),w(j)NP[The man PP[in |? moon]]

– Part-of-speech of words around potential boundary site

The/DET man/NN |? in/Prep moon/NN• Sentence-level information

– Length of sentence

Page 17: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 18

This is a very |? very very long sentence ?| which thus might have a lot of phrase boundaries in?| it ?| don’t you think?

This |? isn’t.• Orthographic information

– They live in Butte, ?| Montana, ?| don’t they?• Word co-occurrence information

Vampire ?| bat …powerful ?| but benign… • Are words on each side accented or not?

The cat in |? the• Where is the most recent previous phrase boundary?

He asked for pills | but |?• What else?

Page 18: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 19

Integrating More Syntactic Information

• Incremental improvements continue:– Adding higher-accuracy parsing (Koehn et al

‘00)• Collins ‘99 parser• Different learning algorithms (Schapire & Singer

‘00)• Different syntactic representations: relational?

Tree-based?• Ranking vs. classification?

• Rules always impoverished• Where to next?

Page 19: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 21

What phenomena are associated with accent?

• Word class: content vs. function words• Information status:

– Given/new He likes dogs and dogs like him.– Topic/Focus Dogs he likes.– Contrast He likes dogs but not cats.

• Grammatical function– The dog ate his kibble.

• Surface position in sentence: Today George is hungry.

Page 20: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 22

• Association with focus:– John only introduced Mary to Sue.

• Semantic parallelism– John likes beer but Mary prefers wine.

• Other semantic phenomena?• How many of these are easy to compute

automatically?

Page 21: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 23

How can we approximate such information?

• POS window• Position of candidate word in sentence• Location of prior phrase boundary• Pseudo-given/new• Location of word in complex nominal and stress

prediction for that nominal

City hall, parking lot, city hall parking lot• Word co-occurrence

Blood vessel, blood orange

Page 22: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 24

How do we evaluate the result?

• How to define a Gold Standard?– Natural speech corpus– Multi-speaker/same text– Subjective judgments

• No simple mapping from text to prosody– Many variants can be acceptable

The car was driven to the border last spring while its owner an elderly man was taking an extended vacation in the south of France.

Page 23: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 25

Current Research

• Concept-to-Speech (CTS)– Systems should be able to specify “better”

prosody: the system knows what it wants to say and can specify how

• Information status– Given/new– Topic/focus– Preposition/particle distinction– PP attachment and phrasing

Page 24: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 26

Future Intonation Prediction: Beyond Phrasing and Accent

• Assigning contour – how?• Assigning affect (emotion) from text – how?• Personalizing TTS: modeling individual style in

intonation – how?• Conveying personality, charisma – how?

Page 25: 6/28/20151 Predicting Phrasing and Accent Julia Hirschberg.

04/18/23 27

Next Class

• Information status: focus and given/new information