Page 1
Stylization and Trajectory Modellingof Short and Long Term Speech Prosody Variations
Nicolas Obin 1,2
Anne Lacheret 2, Xavier Rodet 1
1 Analysis-Synthesis Team, IRCAM, Paris, France2 Modyco Lab., University of Paris Ouest - La Defense, Nanterre, France
[email protected] , [email protected] , [email protected]
AbstractIn this paper, a unified trajectory model based on the styl-ization and the modelling of f0 variations simultaneouslyover various temporal domains is proposed1. The syllable isused as the minimal temporal domain for the description ofspeech prosody, and short-term and long-term f0 variations arestylized and modelled simultaneously over various temporaldomains. During the training, a context-dependent model isestimated according to the joint stylized f0 contours over thesyllable and a set of long-term temporal domains. During thesynthesis, f0 variations are determined using the long-termvariations as trajectory constraints. In a subjective evaluationin speech synthesis, the stylization and trajectory modellingof short and long term speech prosody variations is shownto consistently model speech prosody and to outperform theconventional short-term modelling.
Index Terms: speech prosody, stylization, trajectory model,speech synthesis.
1. IntroductionIn parallel to the development of high-quality speech synthesissystems [1], the modelling of speech prosody has raised as amajor concern to improve the naturalness, the liveliness, andthe variety of the synthetic speech. Speech prosody is generallydescribed as the co-occurrence of acoustic gestures occurringsimultaneously over different temporal domains [2, 3] andassociated to different communicative functions (linguistic,expressive). A high-quality modelling of speech prosodyis desirable for natural and expressive speech synthesis andadequate modelling of speaking style, and a prerequisite in realmulti-media applications (e.g., avatars, story telling, dialoguesystems, numeric arts).
A variety of methods has been proposed to model speechprosody variations (f0 [4], temporal structure [5]), and localand global variations [6, 7]. However, conventional methodsusually models short-term variations of speech prosody(frame-based, or instantaneous variations), while long-termvariations of speech prosody are not explicitly considered.Recent studies have been proposed to integrate long-termvariations into HMM modelling, either for the modellingof f0 variations [8, 9], or with extension to state-duration
1This study was partially funded by “La Fondation Des Treilles”,and supported by ANR Rhapsodie 07 Corp-030-01; reference prosodycorpus of spoken French; French National Agency of research; 2008-2012.
modelling [10]. However, the proposed methods remain amixed model, i.e. the conventional model is used to model theinstantaneous variations of f0, while stylization of long-termvariations are used as trajectory constraints only. In particular,the instantaneous variations remain the minimal and targettemporal domain for the modelling of speech prosody.
In this paper, a unified trajectory model based on the stylizationand the joint modelling of f0 variations over various temporaldomains is proposed. In the proposed approach, the syllableis used as the minimal temporal domain for the description ofspeech prosody, and f0 variations are stylized and modelledsimultaneously over various temporal domains which covershort-term and long-term variations. During the training, acontext-dependent model is estimated according to the jointstylized f0 contours over the syllable and a set of long-termtemporal domains. During the synthesis, f0 variations aredetermined using the long-term variations as trajectory con-straints.
4.25
4.3
4.35
4.4
4.45
4.5
4.55
4.6
4.65
4.7
4.75
f 0 (log
)
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
74 CHAPTER 8. SPEAKER-DEPENDENT PROSODIC STRUCTURE MODEL
a ∼
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
conventional HMM
4.25
4.3
4.35
4.4
4.45
4.5
4.55
4.6
4.65
4.7
4.75
f 0 (log
)
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
74 CHAPTER 8. SPEAKER-DEPENDENT PROSODIC STRUCTURE MODEL
a ∼
73
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
Chapter 1
Introduction
Contents1.1 General Background . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.2 Scope of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3 An Introduction to Speech Prosody . . . . . . . . . . . . . . . . 15
1.3.1 Prologue: La Voix & le Dialogue de l’Ombre Double . . . . . . . 15
1.3.2 Speech Communication . . . . . . . . . . . . . . . . . . . . . . . 16
1.3.3 Speech Domains . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.3.4 Speech Prosody: From Signal to Communicative Functions . . . 18
1.3.5 Making Sense of Variations . . . . . . . . . . . . . . . . . . . . . 20
1.3.6 Speaking Style: a matter of Identity, Genre & Time . . . . . . . 23
SPEECH
DATABASE
text
transcription
speech
signal
text
analysis
linguistic
labels
prosodic structure
parameters extraction
13
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic model
acoustic model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
speech
synthesizer
SYNTHESIZED SPEECH
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
14 CHAPTER 1. INTRODUCTION
prosodic acoustic
parameters extraction
prosodic
labels
prosodic
acoustic parameters
speech
segmentation
prosody
labeling
prosody
labeling
linguistic +
prosodic labels
training of symbolic HMM models
training of acoustic HMM models
HMM models
symbolic
model
acoustic
model
TEXT
inference of symbolic parameters
inference of acoustic parameters
prosodic parameters
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
1.1. GENERAL BACKGROUND 15
speech
synthesizer
TRAINING
SYNTHESIS
1.1 General Background
1.2 Scope of the Thesis
Figure 8.1: Overall architecture of a speech prosody synthesizer.
##lo ∼t
syllable-based HMM with stylization of f0 contours
Figure 1: Schematic comparison of frame-based and syllable-based modelling of f0 variations.
2. Stylization of Speech ProsodyThe Discrete Cosine Transform (DCT) is used to stylize the f0variations over various temporal domains [11] (figure 2). The