DCSP-13

DCSP-13

Jianfeng Feng

Department of Computer Science Warwick Univ., UK

[email protected]

http://www.dcs.warwick.ac.uk/~feng/dsp.html

mailto:[email protected]

Applications

• Power spectrum estimate

• Compression

clear allclose allsampling_rate=100; %Hzomega=20; %signal frequecy HzN=10000; %total number of samplesfor i=1:N x_sound(i)=cos(2*pi*omega*i/sampling_rate); %signal x(i)=cos(2*pi*omega*i/sampling_rate)+2*randn(1,1); %signal+noise axis(i)=sampling_rate*i/N; % for psd time(i)=i/sampling_rate; % for time traceend

subplot(1,2,1)plot(time,x); %signal + noise, time tracexlabel('time (sec)');ylabel('signal')subplot(1,2,2)plot(axis,abs(fft(x)).^2,'r'); % power of signalxlabel('Frequency') ylabel('Power')sound(x_sound, sampling_rate); %true signal sound

• Singnal processing demo: transformation

• A few words on Matlab

periodgram (fft)

pwelch (overlapped windows)

Power spectrum for white noiseNoise is a stochastic process x(t), for time t (discrete or continuous)

Most noisy noise should have no memory, which impliese that

E x(t)x(t+s) = 0 if s is not zero E x(t)x(t) = 1 or in another words E x(t)x(t+s) =d(s)

Therefore the psd of the white noise is flat: it has constant power for all frequencies, as confirmed in the previous matlabe example

Different from all meaningful signals we encount

Spectrogram

• A spectrogram is an image that shows how the power spectrum of a signal varies with time.

0 500 1000 1500 2000 2500-1

-0.8

-0.6

-0.4

-0.2

0

0.2

0.4

0.6

0.8

1

Time

Frequency

• t=0:0.001:20; % 2 secs @ 1kHz sample rate• y=chirp(t,100,1,200,'q'); % Start @ 100Hz, cross 200Hz at t=1sec• spectrogram(y,128,120,128,1E3); % Display the spectrogram• title('Quadratic Chirp: start at 100Hz and cross 200Hz at t=1sec');• sound(y)

Compression

Sampling and reconstruction

The question we consider here is under what conditions we can completely reconstruct the original signal

x(t)

from its discretely sampled signal

x(n).

The use in MP3 is designed to greatly reduce the amount of data required to represent the audio recording and still sound like a faithful reproduction of the original uncompressed audio for most listeners.

The use in MP3 is designed to greatly reduce the amount of data required to represent the audio recording and still sound like a faithful reproduction of the original uncompressed audio for most listeners.

An MP3 file could result in a file that is about 1/11th the size of the file created from the original audio source.

The compression works by reducing accuracy of certain parts of sound that are deemed beyond the auditory resolution ability of most people.


This method is commonly referred to as perceptual coding.


This method is commonly referred to as perceptual coding.

It internally provides a representation of sound within a short-term time/frequency analysis window, by using psychoacoustic models to discard or reduce precision of components less audible to human hearing, and recording the remaining information in an efficient manner.

This technique is often presented as relatively conceptually similar to the principles used by JPEG, an image compression format.

DCSP-13

Documents

signal noise

original signal xt

signal xi

representation of sound

signal noise axisi

audio recording

original uncompressed

signal frequecy hzn