Top Banner
© 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 December 2, 2002 Page 1 1 © 2000-2002 BDTI Digital Audio Compression: Digital Audio Compression: Why, What, and How Why, What, and How An Absurdly Short Course An Absurdly Short Course Jeff Bier Jeff Bier Berkeley Design Technology, Inc. Berkeley Design Technology, Inc. ©2000 2000- 2002 BDTI 2002 BDTI 2 © 2000-2002 BDTI Outline Outline Why Compress? What is Audio Compression? How Does it Work? Conclusions
15

Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

Mar 11, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 1

1© 2000-2002 BDTI

Digital Audio Compression:Digital Audio Compression:Why, What, and HowWhy, What, and How

An Absurdly Short CourseAn Absurdly Short Course

Jeff BierJeff Bier

Berkeley Design Technology, Inc.Berkeley Design Technology, Inc.

©©20002000--2002 BDTI2002 BDTI

2© 2000-2002 BDTI

OutlineOutlineWhy Compress?What is Audio Compression?How Does it Work?Conclusions

Page 2: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 2

3© 2000-2002 BDTI

Why: Too Much Data!Why: Too Much Data!

For example, Compact DiscRate: 2 channels * 44.1 ksamples/sec * 16 bits = 1.4 Mbit/second audio dataAudio CD capacity: 6.3 Gbits audio data

6-channel: 6 channels * 48 ksamples / sec * 16 bits = 4.6 Mbit/second

Typical compressed audio today: a few hundred kbit/sec for even 6 channels

4© 2000-2002 BDTI

Crossing ThresholdsCrossing ThresholdsMemory Capacity - Mbytes/chip

0.0080.032

0.1250.5

28

32128

512

0.001

0.01

0.1

1

10

100

1000

1975 1980 1985 1990 1995 2000 2005 2010Year

Mby

tes

/ chi

p

Audio CD

DVD Sound Track MP3 Album

MP3 Song

Page 3: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 3

5© 2000-2002 BDTI

Commercial ApplicationsCommercial ApplicationsCinema (digital film sound) Consumer devices

Mini-disc (Sony)Handheld playersGamesDVD

DistributionInternet distributionAudio over cable (set-top box)Satellite/terrestrial audio broadcastDigital television

6© 2000-2002 BDTI

What: Compression Goals What: Compression Goals Reduced bandwidth and/or storageMake decoded signal sound as close as possible to original signalLowest implementation complexityReasonable data-type requirementsApplicable to as many signal types as possibleRobustScalableExtensible

Page 4: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 4

7© 2000-2002 BDTI

How: PsychoacousticsHow: PsychoacousticsWhat is it?

Relationship between what arrives at the ear and what we hear

Why is it important for compression?Don’t transmit what the ear can’t hearAllow added noise that can’t be heard

What will we cover today?Range of human hearingMasking

FrequencyTime

8© 2000-2002 BDTI

Range of Human HearingRange of Human Hearing

ZwickerZwicker/Fastl p. 17/Fastl p. 17

Page 5: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 5

9© 2000-2002 BDTI

Auditory MaskingAuditory Masking

One signal can make another inaudible

MaskerMasker

Signal not Signal not maskedmasked

FreqFreq..

Am

pA

mp

..

Masked Masked signalssignals

Shifted Shifted hearing hearing

thresholdthreshold

Hearing Hearing thresholdthreshold

10© 2000-2002 BDTI

Auditory Masking (cont’d)Auditory Masking (cont’d)Simultaneous masking, narrow-band noise masker

Zwicker/Fastl, p. 64

Page 6: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 6

11© 2000-2002 BDTI

Auditory Masking (cont’d)Auditory Masking (cont’d)Temporal Masking

After Zwicker/Fastl p. 78, After Zwicker/Fastl p. 78, Buser/Imbert p. 47Buser/Imbert p. 47

Pre-masking(“Back-ward”)

10-30 msec

Shifted Threshold

MaskerDuration

Time Time →→→→→→→→

So

un

d P

ress

ure

Lev

el

So

un

d P

ress

ure

Lev

el →→ →→→→ →→

Post-masking(“Forward”)

100-200 msec

12© 2000-2002 BDTI

Remove Inaudible

Components

Perceptual CompressionPerceptual Compression

Window (Snapshot)

Spectral Analysis

Coding Scheme

QuantizePack

Data into Frames

Calculate Thresholds

Page 7: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 7

13© 2000-2002 BDTI

How: Spectral AnalysisHow: Spectral AnalysisWhat is it?

Break a signal into spectrumRecover the signal from its spectrum

14© 2000-2002 BDTI

Spectral Analysis (cont’d)Spectral Analysis (cont’d)How is it done?

Filter bankDCT

......

++

......

Page 8: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 8

15© 2000-2002 BDTI

Spectral Analysis (cont’d)Spectral Analysis (cont’d)

Analysis/resynthesis can be an “identity system” with proper constraints:

Design of windowHow often take transformHow to overlap windows in analysis/resynthesis

16© 2000-2002 BDTI

How: Noise AllocationHow: Noise AllocationQuantize coefficients for audible spectral components to minimum # bits

Davidson et al., 1994Davidson et al., 1994

Adds noiseKeep below masking threshold

Page 9: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 9

17© 2000-2002 BDTI

How: More TricksHow: More Tricks

FilterBand-limit input signalLFE bandlimited to <120 Hz

Differential coding of spectral valuesCoupling

Across timeAcross channels ...

18© 2000-2002 BDTI

How: Data CompressionHow: Data Compression

Entropy (e.g. Huffman) coding Quantization table Run-lengthVector quantization

Page 10: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 10

19© 2000-2002 BDTI

Artifacts: “PreArtifacts: “Pre--echo”echo”

Quantization noise spreadNoise components ≥≥≥≥ 1-2 msec before impulsive signal not maskedFix: Shorter window; wavelets (ePAC)

Original

(5.3 msec)

Resynthesized

20© 2000-2002 BDTI

Comparing SpecificationsComparing Specifications

Bit rate ranges: < 8 kbps - 9.6 Mbps Bit widths: 16-24 bitsSample rates: 8-192 kHzNumber of channels: 1-many dozenSpectral bins: 128-1024Time resolution: 4-12 msec

Page 11: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 11

21© 2000-2002 BDTI

Which Algorithms?Which Algorithms?MPEG-1

1992MPEG-2

1994AAC 1997

MPEG-4 1999 ...

ATRAC (Sony) 1992

PAC (Lucent)

1992

AC-3 (Dolby)

1995

Coherent Acoustics

(DTS) 1996

Qdesign 1999

TwinVQ NTT

MLP (Meridian)

G2 (RealNetworks)

Windows Media Audio (Microsoft)

22© 2000-2002 BDTI

MPEG FamilyMPEG Family

Moving Pictures Experts GroupVideo and audio compression standardsMPEG-1, MPEG-2 (MP3), MPEG-4Ongoing standardization effort (MPEG-7)

Page 12: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 12

23© 2000-2002 BDTI

MPEGMPEG--1 Audio1 Audio

1992One or two channels

singletwo independent channelsstereostereo with joint coding

32 kHz, 44.1 kHz, 48 kHzSpecifies bit stream format, decoder structure, but not encoder

24© 2000-2002 BDTI

MPEGMPEG--1 Audio (cont’d)1 Audio (cont’d)

Layer 1: simplest; Philips DCCLayer 2: more efficient coding; DAB, CD-ILayer 3: higher frequency and time resolution; ISDN, InternetDecoder for one layer also decodes lower-numbered layersHigher-numbered layers have more complex decoder

Page 13: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 13

25© 2000-2002 BDTI

MPEGMPEG--2 Audio2 Audio

1994MPEG-2 video used for digital TV, DVDBackward compatible with MPEG-1Add lower sample rates

16, 22.05, 24 kHz

5.1 + up to 7 multilingual/commentary channels“MP3” = MPEG-1/2 Layer 3 (not “MPEG-3”)

26© 2000-2002 BDTI

ConclusionsConclusionsEntertainment is going digital

Audio is a key component

Many new market opportunities

Internet audio is hot; audio may be the Internet “killer app”

Audio compression is a key technologyMany algorithms, many applications

Better algorithms -> better quality, more compression

Computation requirements are going up

Page 14: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 14

27© 2000-2002 BDTI

In the Future…In the Future…

Photo credit: Dr. Richard O. Duda, SJSU

28© 2000-2002 BDTI

Want Copies of the Slides?Want Copies of the Slides?

www.BDTI.com/articles/info_articles.htm

Page 15: Digital Audio Compression: Why, What, and How · 2019-12-18 · © 2000-2002 Berkeley Design Technology, Inc. Digital Audio Compression: Why, What, and How UCB EE20 Page 1 December

© 2000-2002 Berkeley Design Technology, Inc.

Digital Audio Compression: Why, What, and How

UCB EE20 December 2, 2002Page 15

29© 2000-2002 BDTI

Want More Information?Want More Information?http://www.BDTI.com

http://www.eg3.com/dsp

comp.dsp

Microprocessor Report

DSP Processor Fundamentals,BDTI

Or, join BDTI... (see www.BDTI.com)

Collection of BDTI's papers on DSP processors, tools, and benchmarking

Links to other good DSP sites

Usenet group

For info on newer DSPs

Textbook on DSP processors