7/27/2019 MPEG Audio Compression-1
1/19
MPEG AUDIO COMPRESSION1
7/27/2019 MPEG Audio Compression-1
2/19
1. PSYCHOACOUSTICS
| Psychoacoustics is the research where you aim tounderstand how the ear and brain interact as varioussounds enter the ear.
|A perceptual audio codec is a codec that takesadvantage of this human characteristic.
| The range of human hearing is about 20 Hz to about 20
kHz
| The easiest frequency range to perceive by humans istypically from about 2 kHz to 4 kHz
| The dynamic range, the ratio of the maximum soundamplitude to the quietest sound that humans can hear,is on the order of about 120 dB
2
7/27/2019 MPEG Audio Compression-1
3/19
EQUAL-LOUDNESS RELATIONS
| Fletcher-Munson Curves
y Equal loudness curves that display the relationship
,
stimulus sound volume (Sound Pressure Level, alsoin dB), as a function of frequency
| Fig. 1 shows the ears perception of equal loudness:
y
stimulus is required to produce the perception of a 10
dB sound
loudness level gives the same loudness as for that
loudness level of a pure tone at 1 kHz
3
7/27/2019 MPEG Audio Compression-1
4/19
EQUAL-LOUDNESS RELATIONS
. - -and Dadson)
4
7/27/2019 MPEG Audio Compression-1
5/19
THRESHOLD OF HEARING
|A plot of the threshold of human hearing for a puretone
Fig. 2: Threshold of human hearing, for pure tones 5
7/27/2019 MPEG Audio Compression-1
6/19
THRESHOLD OF HEARING (CONTD)
| The threshold of hearing curve: if a sound is above
the dB level shown then the sound is audible
| urn ng up a one so a equa s or surpasses
the curve means that we can then distinguish the
sound
|An approximate formula exists for this curve:
20.8 0.6( /1000 3.3) 3 4Threshold 3.64 /1000 6.5 10 /1000fe = +
y The threshold units are dB; the frequency for the origin
Threshold(f) = 0 at f =2 kHz
6
7/27/2019 MPEG Audio Compression-1
7/19
FREQUENCYMASKING
| Experiments have shown that the human ear has
24 frequency bands (Basilar membrane).
harder to distinguish by the human ear.
7
7/27/2019 MPEG Audio Compression-1
8/19
FREQUENCYMASKING
| Suppose there is a dominant tonal component present in an
audio signal. The dominant noise will introduce a masking
threshold that will mask out frequencies in the same critical
band (see Figure below).
| This frequency-domain phenomenon is known as
simultaneous masking, which has been observed within
critical bands.
8
7/27/2019 MPEG Audio Compression-1
9/19
FREQUENCYMASKING CURVES
| Frequency masking is studied by playing a
particular pure tone, say 1 kHz again, at a loud
,
ability to hear tones nearby in frequency
y one would enerate a 1 kHz maskin tone at a fixed
sound level of 60 dB, and then raise the level of a
nearby tone, e.g., 1.1 kHz, until it is just audible
| The threshold in Fig. 3 plots the audible level for a
single masking tone (1 kHz)
| ig. 4 s ows ow t e p ot c anges i ot er mas ingtones are used
9
7/27/2019 MPEG Audio Compression-1
10/19
FREQUENCYMASKING CURVES
Fig. 3: Effect on threshold for 1 kHz masking tone10
7/27/2019 MPEG Audio Compression-1
11/19
FREQUENCYMASKING CURVES
Fig. 4: Effect of masking tone at three different frequencies
11
7/27/2019 MPEG Audio Compression-1
12/19
CRITICAL BANDS
| Critical bandwidth represents the ears
resolving power for simultaneous tones or partials
y e ow- requency en , a cr ca an s ess an
100 Hz wide, while for high frequencies the width canbe greater than 4 kHz
|
Experiments indicate that the critical bandwidth:
approximately constant in width ( about 100 Hz)
y for masking frequencies > 500 Hz: increases
approx ma e y near y w requency
12
7/27/2019 MPEG Audio Compression-1
13/19
. -
13
7/27/2019 MPEG Audio Compression-1
14/19
14
7/27/2019 MPEG Audio Compression-1
15/19
BARKUNIT
| Bark unit is defined as the width of one critical
band, for any masking frequency
| e ea o e ar un : every cr ca an w
is roughly equal in terms of Barks (refer to Fig. 5)
g. : ec o mas ng ones, expresse n ar un s 15
7/27/2019 MPEG Audio Compression-1
16/19
CONVERSION: FREQUENCY& CRITICAL BAND NUMBER
| Conversion expressed in the Bark unit:
2
/100, for 500 ,Critical band number (Bark)
9 4 lo /1000 for 500.%
f f
7/27/2019 MPEG Audio Compression-1
17/19
TEMPORAL MASKING
| Phenomenon: any loud tone will cause the hearing
receptors in the inner ear to become saturated and
| The following figures show the results of Maskingexperiments:
17
Fig. 6: The louder is the test tone, the shorter it takes for our
hearing to get over hearing the masking.
7/27/2019 MPEG Audio Compression-1
18/19
TEMPORAL MASKING
| Temporal masking occurs in the time-domain. A stronger
tonal component (masker) will mask a weaker one (maskee)
if they appear within a small interval of time.
| The masking threshold will mask weaker signals pre and
post to the masker.
will last from 50 to 300 ms, depending on the strength and
duration of the masker as shown in Figure below.
18
7/27/2019 MPEG Audio Compression-1
19/19
TEMPORAL MASKING
Fig. 14.8: For a masking tone that is played for a longer time, it
a es onger e ore a es one can e ear .Solid curve: masking tone played for 200 ms;
dashed curve: masking tone played for 100 ms.19