Laboratorio de Tecnologías de Información Arquitectura de Computadoras Multicycle- 1 Designing a Multicycle Designing a Multicycle Processor Processor Arquitectura de Computadoras Arquitectura de Computadoras Arturo D Arturo D í í az P az P é é rez rez Centro de Investigaci Centro de Investigaci ó ó n y de Estudios Avanzados del IPN n y de Estudios Avanzados del IPN Laboratorio de Tecnolog Laboratorio de Tecnolog í í as de Informaci as de Informaci ó ó n n [email protected][email protected]
41
Embed
Arquitectura de Computadoras - CINVESTAVadiaz/ArqComp/10-Multicycle.pdf · Tecnologías de Información Arquitectura de Computadoras Multicycle- 1 Designing a Multicycle Processor
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 1
Designing a Multicycle Designing a Multicycle ProcessorProcessor
Arquitectura de ComputadorasArquitectura de ComputadorasArturo DArturo Dííaz Paz Péérezrez
Centro de InvestigaciCentro de Investigacióón y de Estudios Avanzados del IPNn y de Estudios Avanzados del IPNLaboratorio de TecnologLaboratorio de Tecnologíías de Informacias de Informacióónn
S <– A + SExt(Im16)M <– MEM[S]R[rd] <– M; PC <– PC + 4
Tim
e
Exe
c
Reg
. Fi
le
Mem
Acc
ess
Dat
aM
em
S
M
Reg
File
PC
Nex
t PC
IR
Inst
. Mem A
B
E
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 18
Step 4 : StoreStep 4 : Store
♦
Logical Register Transfer
♦
Physical Register Transfers
inst Logical Register Transfers
SW MEM[R[rs] + SExt(Im16)] <– R[rt];
PC <– PC + 4
inst Physical Register TransfersIR <–
MEM[pc]SW A<– R[rs]; B <– R[rt]
S <– A + SExt(Im16); MEM[S] <– B PC <– PC + 4
Tim
e
Exe
c Reg
. Fi
le
Mem
Acc
ess
Dat
aM
em
S
M
Reg
File
PC
Nex
t PC
IR
Inst
. Mem A
B
E
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 19
Step 4 : BranchStep 4 : Branch
♦
Logical Register Transfer
♦
Physical Register Transfers
Exe
c
Reg
. Fi
le
Mem
Acc
ess
Dat
aM
em
S
M
Reg
File
PC
Nex
t PC
IR
Inst
. Mem A
B
E
inst Logical Register Transfers
BEQ if R[rs] == R[rt]
then PC <= PC + 4+SExt(Im16) || 00
else PC <= PC + 4inst Physical Register Transfers
IR <–
MEM[pc]BEQ E<– (R[rs] = R[rt])
if E then PC <– PC + 4 else PC <–PC+4+SExt(Im16)||00
Tim
e
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 20
Alternative datapath (book): Alternative datapath (book): Multiple Cycle DatapathMultiple Cycle Datapath♦
Minimizes Hardware: 1 memory, 1 adder
IdealMemoryWrAdrDin
RAdr
32
32
32Dout
MemWr32
AL
U
3232
ALUOp
ALUControl
Instruction Reg
32
IRWr
32
Reg File
Ra
Rw
busW
Rb5
5
32busA
32busB
RegWr
Rs
Rt
Mux
0
1
Rt
Rd
PCWr
ALUSelA
Mux 01
RegDst
Mux
0
1
32
PC
MemtoReg
Extend
ExtOp
Mux
0
132
0
1
23
4
16Imm 32
<< 2
ALUSelB
Mux
1
0
Target32
Zero
ZeroPCWrCond PCSrc BrWr
32
IorD
AL
U O
ut
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 21
Our Control ModelOur Control Model
♦
State specifies control points for Register Transfer♦
Transfer occurs upon exiting state (same falling edge)
State X
Register TransferControl Points
Depends on Input
Control State
Next StateLogic
Output Logic
inputs (conditions)
outputs (control points)
Step 4 Step 4 ⇒⇒
Control Specification for Control Specification for multicyclemulticycle procproc
IR <= MEM[PC]
R-type
A <= R[rs]B <= R[rt]
S <= A fun B
R[rd] <= SPC <= PC + 4
S <= A or ZX
R[rt] <= SPC <= PC + 4
ORi
S <= A + SX
R[rt] <= MPC <= PC + 4
M <= MEM[S]
LW
S <= A + SX
MEM[S] <= BPC <= PC + 4
BEQ
PC <=Next(PC,Equal)
SW
“instruction fetch”
“decode / operand fetch”
Exe
cute
Mem
ory
Writ
e-ba
ck
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 23
Traditional FSM ControllerTraditional FSM Controller
State
6
4
11nextState
op
Equal
control points
state op condnextstate control points
Truth Table
datapath
State
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 24
Step 5 Step 5 ⇒⇒
((datapathdatapath + state + state diagramdiagram ⇒ ⇒
control)control)
♦
Translate RTs
into control points
♦
Assign states
♦
Then go build the controller
Mapping Mapping RTsRTs to Control Pointsto Control PointsIR <= MEM[PC]
R-type
A <= R[rs]B <= R[rt]
S <= A fun B
R[rd] <= SPC <= PC + 4
S <= A or ZX
R[rt] <= SPC <= PC + 4
ORi
S <= A + SX
R[rt] <= MPC <= PC + 4
M <= MEM[S]
LW
S <= A + SX
MEM[S] <= BPC <= PC + 4
BEQPC <=
Next(PC,Equal)
SW
“instruction fetch”
“decode”
imem_rd, IRen
ALUfun, Sen
RegDst, RegWr,PCen
Aen, Ben,Een
Exe
cute
Mem
ory
Writ
e-ba
ck
Assigning StatesAssigning StatesIR <= MEM[PC]
R-type
A <= R[rs]B <= R[rt]
S <= A fun B
R[rd] <= SPC <= PC + 4
S <= A or ZX
R[rt] <= SPC <= PC + 4
ORi
S <= A + SX
R[rt] <= MPC <= PC + 4
M <= MEM[S]
LW
S <= A + SX
MEM[S] <= BPC <= PC + 4
BEQPC <= Next(PC)
SW
“instruction fetch”
“decode”
0000
0001
0100
0101
0110
0111
1000
1001
1010
00111011
1100
Exe
cute
Mem
ory
Writ
e-ba
ck
Detailed Control SpecificationDetailed Control Specification
0000 ??????? 0001 10001 BEQ x 0011 1 1 0001 R-type x 0100 1 1 0001 orI x 0110 1 1 0001 LW x 1000 1 1 0001 SW x 1011 1 1
0011 xxxxxx 0 0000 1 00011 xxxxxx 1 0000 1 10100 xxxxxx x 0101 0 1 fun 10101 xxxxxx x 0000 1 0 0 1 10110 xxxxxx x 0111 0 0 or 10111 xxxxxx x 0000 1 0 0 1 01000 xxxxxx x 1001 1 0 add 11001 xxxxxx x 1010 1 0 01010 xxxxxx x 0000 1 0 1 1 01011 xxxxxx x 1100 1 0 add 11100 xxxxxx x 0000 1 0 0 1
State
Op field
Eq
Next IR
PC
Ops
Exec
Mem
Write-Backen sel
A B Ex Sr
ALU S R W M
M-R Wr
Dst
R:
ORi:
LW:
SW:
-all same in Moore machine
BEQ:
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 28
Performance EvaluationPerformance Evaluation
♦
What is the average CPI?■
state diagram gives CPI for each instruction type
■
workload gives frequency of each type
Type
CPIi
for type
Frequency
CPIi
x freqIiArith/Logic
4
40%
1.6
Load
5
30%
1.5
Store
4
10%
0.4
branch
3
20%
0.6
Average CPI:4.1
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 29
Controller DesignController Design
♦
The state diagrams that arise define the controller for an instruction set processor are highly structured
♦
Use this structure to construct a simple “microsequencer”♦
Control reduces to programming this very simple device■
⇒ microprogramming
sequencercontrol
datapath
control
micro-PCsequencer
microinstruction
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 30
Example: JumpExample: Jump--CounterCounter
op-codeMap ROM
Counterzeroincload
0000i
i+1
i
None of above: Do nothing(for wait states)
Using a Jump CounterUsing a Jump CounterIR <= MEM[PC]
R-type
A <= R[rs]B <= R[rt]
S <= A fun B
R[rd] <= SPC <= PC + 4
S <= A or ZX
R[rt] <= SPC <= PC + 4
ORi
S <= A + SX
R[rt] <= MPC <= PC + 4
M <= MEM[S]
LW
S <= A + SX
MEM[S] <= BPC <= PC + 4
BEQPC <= Next(PC)
SW
“instruction fetch”
“decode”
0000
0001
0100
0101
0110
0111
1000
1001
1010
00111011
1100
inc
load
zero zerozero
zero
zeroinc inc inc inc
inc
Exe
cute
Mem
ory
Writ
e-ba
ck
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 32
Our Our MicrosequencerMicrosequencer
Micro-PC
Map ROM
op-code
Z I L datapath
control
taken
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 33
Microprogram Control SpecificationMicroprogram Control Specification
0000 ? inc 10001 0 load 1 1
0011 0 zero 1 00011 1 zero 1 10100 x inc 0 1 fun 10101 x zero 1 0 0 1 10110 x inc 0 0 or 10111 x zero 1 0 0 1 01000 x inc 1 0 add 11001 x inc 1 0 01010 x zero 1 0 1 1 01011 x inc 1 0 add 11100 x zero 1 0 0 1
Control may be designed using one of several initial representations. The choice of sequence control, and how logic is represented, can then be determined independently; the control can then be implemented with one of several methods using a structured logic technique.
Initial Representation Finite State Diagram Microprogram
Sequencing Control
Explicit Next State Microprogram counter Function + Dispatch ROMs
Logic Representation
Logic Equations
Truth Tables
Implementation PLA
R
OM Technique
“hardwired control” “microprogrammed control”
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 40
SummarySummary
♦
Disadvantages of the Single Cycle Processor■
Long cycle time
■
Cycle time is too long for all instructions except the Load
♦
Multiple Cycle Processor:■
Divide the instructions into smaller steps
■
Execute each step (instead of the entire instruction) in one cycle
♦
Partition datapath
into equal size chunks to minimize cycle time■
~10 levels of logic between latches
♦
Follow same 5-step method for designing “real”
processor
Laboratorio deTecnologías de Información
Arquitectura de Computadoras Multicycle- 41
Summary (contSummary (cont’’d)d)
♦
Control is specified by finite state diagram♦
Specialize state-diagrams easily captured by microsequencer■