Chapter 6 FFT Chip Design Simulation and Analysis · FFT Chip Design Simulation and Analysis In the previous chapter, we introduced our new architecture including, the radix-8 butterfly

Chapter 6 FFT Chip Design Simulation and Analysis

In the previous chapter, we introduced our new architecture including, the

radix-8 butterfly operation, variable length twiddle factor ROM table, and new

address pointer generation for any length FFT. We designed a variable length

(512/1024/2048/4096) real to complex FFT processor chip. In this chapter, we discuss

and analyze the Matlab simulation. After Matlab simulation, we write verilog code to

implement the FFT hardware design. Finally, the pad location, the floorplan and the

layout of our FFT chip are listed in section 6.3.

6.1 Matlab Simulation and Analysis

We simulate fixed radix-8 and mixed radix algorithm by using the Matlab

software package. We also determine the number of bits for the data bus and twiddle

factor by using Matlab simulation.

6.1.1 Simulation for Fixed and Mixed Radix FFT Algorithm

Figure 6.1 is the flow chart of fixed and mixed radix FFT algorithm. The

procedures are as follows:

(1) Select FFT length 512,1024,2048 or 4096 point.

(2) Input sine waveform

(3) If FFT length is power of 8, the fixed radix-8 algorithm is used. If the length

-94-

is not a power of 8, the mixed radix algorithm is used.

(4) The resulting FFT data can be transformed by the IFFT by using Equation

(6.1).

1

0: ( ) ( ( )) ( )

NnkN

nDFT X k DFT x n x n W

−

=

= =∑

1

0

* *

1: ( ) ( ).

1 [ { ( )}

Nnk

Nn

IDFT x n X n WN

j DFT jX nN

−−

=

=

= ×

∑

] (6.1)

(5) If the output waveform is equal to the input waveform, our algorithm has

been verified to be correct. We can see the simulation results for a 512-point

FFT given in Figure 6.2 (a) and (b).

-95-

input sine wave form

start

FFT

sel fft mode 512, 1024, 2048, 4096

IFFT

output signal

equal signal

system ok

Yes

No

system error

-96-

Figure 6.1 Fixed and mixed radix –8 FFT algorithm flowchart

-97-

(a) Input signal

(b) Output signal

Figure 6.2 Comparison of input and output data for 512 –point FFT simulation with

radix-8 algorithm

6.1.2 Simulation of Twiddle Factor and Databus Bit Length

Before the implementation of the FFT IC design, we need to know the data bus

size (in bits) and the length of the twiddle factor coefficient value. So, we simulate the

whole DMT modulation to determine the optimal data and coefficient wordlength.

The processing flow of this simulation, depicted in Figure 6.3 is as follows:

(1) We perform I and Q values from 256 QAM constellation.

(2) Input coding data is randomly generated.

(3) We set initial databus and coefficient lengths.

(4) The resulting input coding data are mapped to the complex input of the FFT.

(5) The IFFT of the mapped data values is computed and a Guard Interval (GI)

is added to the result.

(6) After removing the Guard Interval, we perform the FFT operation to recover

the complex data values.

(7) These data values are demapped into binary data values.

(8) Finally, we decode the data.

(9) We compare the coded data and the decoded data.

(10) If the databus and coefficient wordlengths are optimal, the DMT simulation

is OK.

(11) If we get the non-optimal values, we adjust their bit length and run this

processing flow again.

After simulation, we find that the optimal length of the input and output databus

is 16 bits (1 bit for the sign bit, 5 bits for the integer portion, and 10 bits for the

fractional portion). Again, the optimal length of the twiddle factor coefficient is 12

bits (1 bit for the sign bit, 1 bit for the integer portion, and 10 bits for the fractional

-98-

input QAM table

Start DMT system

input coding data

sel fft mode 512, 1024, 2048, 4096

QAM map

data record

Yes

No

radom generator coding data

IFFT

Add GI

channel

Remove GI

FFT

De QAM map

compare data

inital twiddle bit length initial databus bit length

input coding data

decoding data

finsh DMTsystem

finish

A

A

new twiddle bit length new databus bit length

Figure 6.3 Processing flow chart of simulation with twiddle factor and databus

wordlength

-99-

portion). Figure 6.4 gives SNR ratios of different size fractional portions for the data

and twiddle factor coefficients. From this simulation, 10 bits is sufficiently large for

input and out put wordlength and twiddle factor.

-100-

Figure 6.4 Simulation of the optimal wordlength for data bus and twiddle factor

6.2 Verilog Simulation and Analysis

The new fabrication process used is TSMC 0.25 µm CMOS technology. It is

synthesized with the Synopsys cell library. Figure 6.5 gives the design flow chart and

CAD tools used in our new FFT architecture. The simulation result for 512-point FFT

with read/write address generation is described in Figure 6.6. Figure 6.7 is the

simulation result of 2048-point with input/output data and butterfly operation.

RTL Design

Synthesis(Synopsys)

Verilog Simulation

FloorPlan

Physical Optimization(Timing Closure)

Gate Level Verilog Simulation

RC Extraction

DRC/LVS verification

Post layout simulation

Static Timing Analysis

Architecture Design

Place and Route

Fail

Fail

Fail

Fail

Synopsys model

Apollo library

Verilog library

GDSII

SDF

Gate Level Verilog Simulation

Gate Level Netlist

Gate Level Netlist

-101-

(1) FFT chip design flow and CAD tools used

Figure 6.5 Flow chart of FFT chip design

-102-

(2) Simulation results for FFT processor

Figure 6.6 Simulation waveform of 512-point with read/write address generation

Figure 6.7 Simulation of 2048-point with input/output data and butterfly operation

(3) Verification for FFT IC design

We must determine whether the FFT IC design is correct or not. Here we will

describe how we verify our FFT IC design in Figure 6.8. We perform the procedure of

the shaded region as shown in Figure 6.9. First, we dump the FFT output data from

the Matlab simulation to produce the input data to the FFT. Second, we implement the

FFT IC according to the flow chart of Figure 6.5. Third, we dump the output data of

the FFT IC operation. Fourth, we use the resulting IC data output to deQAM mapping

with Matlab simulation. Finally, we perform data decoding and comparison with input

coding data, described in Figure 6.10.

Verification for FFT IC system block

QAM Modulation

data input

FFTIFFTDe-QAM Modulation

data outputChannel

IC operation

Figure 6.8 Verification for FFT IC system block

-103-

input QAM table

Start DMT system

input coding data

sel fft mode 512, 1024, 2048, 4096

QAM map

radom generator coding data

IFFT

Add GI

channel

Remove GI

Down IFFT data

De QAM map

compare data

inital twiddle bit length initial databus bit length

input coding data

decoding data

system finish

decoding data

Input IC_FFT

IC_FFT Operation

Ouput IC_FFT data

IC operation

-104-

Verification for FFT IC flowchart

Figure 6.9 Verification for FFT IC flowchart

-105-

Figure 6.10 Compare coding and decoding data to verify FFT IC design

6.3 Pad Location and Floorplan

We sketch a floor plan for the FFT chip, depicted in Figure 6.11. The objective is

to determine the overall structure of the auto placement and routing. Figure 6.12

shows a schematic of QFP package for our FFT chip design. Its pin description is

given in Table 6.1. Table 6.2 is the features of our FFT chip design. The die size of

our FFT chip is 2600 2600m mµ µ× excluding memory. It synthesizes with 146576

gate counts and the critical path delay is which reported by Synopsys Design

Analyzer. We do the automatic placement and routing by using Apollo. Its layout view

of the 4096-point our memory based FFT chip is shown in Figure 6.13.

18 ns

3735 363230 3127 2928 33

vdd

26 34

100 99 98 97 96 95 94 93 92 91 90 89

1234

6

8910

1112

100 QFP

fft chip

151617181920

4947 484442 4339 4140 4538 46 50

88 87 86 85 84 83 82 81 80 79 78 77

5

1314

21222324

7

n_resetext_clk

fft_mode0fft_mode1

fft_mode2

fft_startfft_valid

vssin_fft_real0

in_fft_real1in_fft_real2in_fft_real3

in_fft_real4in_fft_real5in_fft_real6in_fft_real7

25

vddin_fft_real8in_fft_real9

in_fft_real10in_fft_real11in_fft_real12in_fft_real13

vss

vdd

in_f

ft_re

al14

in_f

ft_re

al15

in_f

ft_im

g0in

_fft_

img1

in_f

ft_im

g2in

_fft_

img3

in_f

ft_im

g4in

_fft_

img5

in_f

ft_im

g6in

_fft_

img7

vss

in_f

ft_im

g8in

_fft_

img9

in_f

ft_im

g10

in_f

ft_im

g11

in_f

ft_im

g12

in_f

ft_im

g13

in_f

ft_im

g14

in_f

ft_im

g15

vdd

vss

nc nc nc

75747372

70

686766

6564

616059585756

71

6362

55545352

69

51

76

vdd

ou_fft_real0

tes0tes1tes2vss

vddout_data_clkout_data_valid

vss

ou_fft_real1ou_fft_real2ou_fft_real3ou_fft_real4

ou_fft_real5

ou_fft_real6ou_fft_real7

vdd

vss

ou_fft_real8


ou_fft_real11


vdd

ou_f

ft_re

al14

ou_f

ft_re

al15

ou_f

ft_im

g0

ou_f

ft_im

g1ou

_fft_

img2

ou_f

ft_im

g3ou

_fft_

img4

ou_f

ft_im

g5ou

_fft_

img6

ou_f

ft_im

g7vs

sou

_fft_

img8

ou_f

ft_im

g9

ou_f

ft_im

g10

ou_f

ft_im

g11

ou_f

ft_im

g12

ou_f

ft_im

g13

ou_f

ft_im

g14

ou_f

ft_im

g15

vdd

vss

ncncnc

-106-

(1) Floorplan of our FFT chip

S R A M _ b a n k 7

S R A M _ b a n k 6

S R A M _ b a n k 5

S R A M _ b a n k 4

S R A M _ b a n k 3

S R A M _ b a n k 2

S R A M _ b a n k 1

S R A M _ b a n k 0

s e r i a l p a r a l l e l

f f t _ c t r l

r o m t a b l e

r a d i x 8 _ b u f f e r f l yS R A M _ m o d e l

Figure 6.11 A sketch of FFT chip floorplan

(2) Package definition of our FFT chip

Figure 6.12 FFT package definition

(3) Pin configuration of our FFT chip

Table 6.1 FFT pin configuration

Name Pin# I/O Description Vdd 1 power Power source (core) N_reset 2 Input For system reset Ext_clk 3 Input Clk input Fft_mode0 4 Input Sel fft mode Fft_mode1 5 Input Sel fft mode Fft_mode2 6 Input Sel fft mode Fft_start 7 Input Start fft system Fft_valid 8 Input Fft data valid Vss 9 ground Ground source (core) In_fft_real0 10 Input Fft real0 input In_fft_real1 11 Input Fft real1 input In_fft_real2 12 Input Fft real2 input In_fft_real3 13 Input Fft real3 input In_fft_real4 14 Input Fft real4 input In_fft_real5 15 Input Fft real5 input In_fft_real6 16 Input Fft real6 input In_fft_real7 17 Input Fft real7 input Vdd 18 power Power source (core) In_fft_real8 19 Input Fft real8 input In_fft_real9 20 Input Fft real9 input In_fft_real10 21 Input Fft real10 input In_fft_real11 22 Input Fft real11 input In_fft_real12 23 Input Fft real12 input In_fft_real13 24 Input Fft real13 input Vss 25 ground Ground source(core) Vdd 26 Power Power source(pad) In_fft_real14 27 Input Fft real14 input In_fft_real15 28 Input Fft real15 input In_fft_img0 29 Input Fft img0 input In_fft_img1 30 Input Fft img1 input In_fft_img2 31 Input Fft img2 input In_fft_img3 32 Input Fft img3 input

-107-

In_fft_img4 33 Input Fft img4 input In_fft_img5 34 Input Fft img5 input In_fft_img6 35 Input Fft img6 input In_fft_img7 36 Input Fft img7 input Vss 37 Input Ground source(pad) In_fft_img8 38 Input Fft img8 input In_fft_img9 39 Input Fft img9 input In_fft_img10 40 Input Fft img10 input In_fft_img11 41 Input Fft img11 input In_fft_img12 42 Input Fft img12 input In_fft_img13 43 Input Fft img13 input In_fft_img14 44 Input Fft img14 input In_fft_img15 45 Input Fft img15 input Vdd 46 Power Power source(pad) Vss 47 ground Ground source(pad) Nc 48 Nc 49 Nc 50 Name Pin# I/O Description Vdd 51 power Power source (pad) Test 0 52 Input Sel Test pattern Test 1 53 Input Sel Test pattern Test 2 54 Input Sel Test pattern Vss 55 ground Ground source(pad) Vdd 56 power Power source (pad) Out_data_clk 57 output Fft output data clk Out_data_valid 58 output Fft output data valid Vss 59 ground Ground source (pad) Ou_fft_real0 60 output Fft real0 output Ou_fft_real1 61 output Fft real1 output Ou_fft_real2 62 output Fft real2 output Ou_fft_real3 63 output Fft real3 output Ou_fft_real4 64 output Fft real4 output Ou_fft_real5 65 output Fft real5 output Ou_fft_real6 66 output Fft real6 output Ou_fft_real7 67 output Fft real7 out put

-108-

vdd 68 power Power source (pad) Ou_fft_real8 69 output Fft real8 output Ou_fft_real9 70 output Fft real9 output Ou_fft_real10 71 output Fft real10 output Ou_fft_real11 72 output Fft real11 output Ou_fft_real12 73 output Fft real12 output Ou_fft_real13 74 output Fft real13 output Vss 75 ground Ground source(pad) Vdd 76 Power Power source(pad) Out_fft_real14 77 output Fft real14 output Ou_fft_real15 78 output Fft real15 output Ou_fft_img0 79 output Fft img0 output Ou_fft_img1 80 output Fft img1 output Ou_fft_img2 81 output Fft img2 output Ou_fft_img3 82 output Fft img3 output Ou_fft_img4 83 output Fft img4 output Ou_fft_img5 84 output Fft img5 output Ou_fft_img6 85 output Fft img6 output Ou_fft_img7 86 output Fft img7 output Vss 87 ground Ground source(pad) Ou_fft_img8 88 output Fft img8 output Ou_fft_img9 89 output Fft img9 output Ou_fft_img10 90 output Fft img10 output Ou_fft_img11 91 output Fft img11 output Ou_fft_img12 92 output Fft img12 output Ou_fft_img13 93 output Fft img13 output Ou_fft_img14 94 output Fft img14 output Ou_fft_img15 95 output Fft img15 output Vdd 96 Power Power source(pad) Vss 97 ground Ground source(pad) Nc 98 Nc 99 Nc 100

-109-

-110-

(4) Features of our FFT chip design

Table 6.2 Features of our FFT chip design Process TSMC 0.25 1P4M

die size excluding memory 2600 2600m mµ µ×

Gate count 146576 FFT length 512,1024,2048,4096

Maximum clock speed 50 MHz Package 100 QFP

Power supply 2.5V Input/Output data bit 16 bits

(5) Layout view of our variable length FFT chip excluding memory

Figure 6.13 Layout view of our variable length FFT chip

Chapter 6 FFT Chip Design Simulation and Analysis · FFT Chip Design Simulation and Analysis In the previous chapter, we introduced our new architecture including, the radix-8 butterfly

Documents