Power Grid Analysis in VLSI Designs A Thesis Submitted for the Degree of Master of Science (Engineering) In the Faculty of Engineering By Kalpesh Shah Super Computer Education and Research Centre Indian Institute of Science Bangalore – 560012 March 2007
Power has become an important design closure parameter in today’s ultra low submicron digital designs. The impact of the increase in power is multi-discipline to researchers ranging from power supply design, power converters or voltage regulators design, system, board and package thermal analysis, power grid design and signal integrity analysis to minimizing power itself. This work focuses on challenges arising due to increase in power to power grid design and analysis.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Power Grid Analysis in VLSI Designs
A Thesis
Submitted for the Degree of
Master of Science (Engineering) In the Faculty of Engineering
By Kalpesh Shah
Super Computer Education and Research Centre
Indian Institute of Science Bangalore – 560012
March 2007
2
3
Acknowledgements
My sincere gratitude to both my guides - Prof S K Nandy and Dr. Vish Visvanathan. Prof Nandy, thank you for your guidance right from the start of the MS curriculum till the end. I
would not have dreamt of the final chapters had it not been for your timely guidance. To Vish, thank you for bearing with me and guiding me from the beginning till end, in your
busy schedule at office. You are the one who encouraged me from enrolling for this program till end. Thank you for your valuable inputs and comments on the material. My
sincere thanks to IISc and specifically SERC staff who helped me through various administrative work.
To my colleagues and managers at Texas Instruments, thank you for your cooperation -you are a team I am proud of. Thanks for your support and the camaraderie. A special
thanks to Harinath for approving my MS Program and Venugopal Puvvada, my manager when most of this work happened. Discussions with him made this work relevant to Multi-
million gate designs and found real application.
Thanks to many of my friends with whom I discussed similar topics like my research
throughout this period – Ananth, Gokul, Mallik, Suravi, Saby, Bram, Ashish, Aishwarya and Sumedha. A special thanks to Anjana Ghose for all that you did for me while I was not in
Bangalore.
Thanks to my family for having stood behind me like a rock. To my parents, thanks for
your support and affection – your unrelenting persistence helped me to complete last step. To Pratiksha – thank you for being my invisible strength. Your constant reassuring
presence and confidence in me drove me to this point in journey. To Bhavesh and Deepti – thank you for being my savior at times of load at home. Without you folks, this thesis
would not have materialized. And finally, thanks to little Harsh who came to this world halfway through my MS and Darsh who saw my MS from the age of 1 year – you kept me
giving unasked needed breaks and made everything so live.
4
5
Table of Contents Acknowledgements..................................................................................................................3 Abstract ...................................................................................................................................11 1 Introduction ...................................................................................................................13 1.1 Motivation ........................................................................................................................................13 1.1.1 Power Estimation ................................................................................................................................... 16 1.1.2 Power Supply Noise ............................................................................................................................... 17 1.1.3 MTCMOS Analysis ................................................................................................................................. 22 1.2 Terms ..............................................................................................................................................24 1.3 Thesis outline and Contribution......................................................................................................25 2 Toggle Activity Estimation...........................................................................................27 2.1 Overview .........................................................................................................................................27 2.2 Toggle Activity Estimation ..............................................................................................................29 2.3 Multi-million gate solution ...............................................................................................................30 2.3.1 Deriving automatic toggle frequency values.............................................................................................. 31 2.3.2 Hierarchical Modeling ............................................................................................................................. 35 2.4 Validation and Results ....................................................................................................................37 2.5 Summary .........................................................................................................................................38 3 Power Estimation..........................................................................................................39 3.1 Overview .........................................................................................................................................39 3.2 Current approaches to Power Analysis..........................................................................................42 3.3 Power analysis Tools ......................................................................................................................45 3.3.1 Power Compiler: [67] .............................................................................................................................. 45 3.3.2 Power Mill (or Nano Sim) [4][68] .............................................................................................................. 46 3.3.3 Prime Power [66].................................................................................................................................... 47 3.3.4 Other Tools ............................................................................................................................................ 47 3.4 Validation Flow................................................................................................................................48 3.4.1 Netlist Setup:.......................................................................................................................................... 50 3.4.2 Vector Generation .................................................................................................................................. 50 3.4.3 Interconnect setup .................................................................................................................................. 51 3.5 Validation and Results ....................................................................................................................51 3.6 Power estimation applications ........................................................................................................60 3.6.1 Average power/ground bus currents ........................................................................................................ 60 3.6.2 Average power dissipation ...................................................................................................................... 61 3.6.3 Electro migration failures......................................................................................................................... 61 3.6.4 Power Routing........................................................................................................................................ 61 3.6.5 Gate Oxide Integrity Analysis .................................................................................................................. 62 3.7 Summary .........................................................................................................................................62 4 Power Supply Noise Analysis .....................................................................................63 4.1 Overview .........................................................................................................................................63 4.2 Cell Characterization.......................................................................................................................64 4.2.1 Current Characterization Methodology..................................................................................................... 65 4.2.2 Current Characterization Flow................................................................................................................. 71 4.3 Power Grid network modeling ........................................................................................................72 4.3.1 Power Grid Current Waveform Modeling .................................................................................................. 74 4.4 Complete Flow ................................................................................................................................78
6
4.4.1 Timing Information Generation ................................................................................................................ 80 4.4.2 Power Grid Generator............................................................................................................................. 80 4.4.3 SPICE Simulation................................................................................................................................... 82 4.5 Validation and Results ....................................................................................................................82 4.5.1 Peak Power Results ............................................................................................................................... 83 4.5.2 Peak Dynamic IR Drop Results ............................................................................................................... 84 4.6 Summary .........................................................................................................................................87 5 Power Up Analysis........................................................................................................89 5.1 Switched PG Networks ...................................................................................................................91 5.2 Switch Network Analysis.................................................................................................................94 5.2.1 Switch Characterization .......................................................................................................................... 95 5.2.2 Current or Switch Prediction.................................................................................................................... 96 5.3 Results and Analysis.......................................................................................................................99 5.4 Summary .......................................................................................................................................104 6 Conclusion...................................................................................................................105 6.1 Summary .......................................................................................................................................105 6.2 Scope of Future Work...................................................................................................................106 7 References...................................................................................................................109 Appendix A Sample SDC file...............................................................................................115 Appendix B Sample SPEF Format......................................................................................116 Appendix C Power Waveforms Analysis...........................................................................118 Appendix D Current Characterization – sample spice deck ...........................................119 Appendix E Waveform transformation example...............................................................120
7
Table of Figures Figure 1.1 Power Dissipation in CMOS designs ......................................................................................13 Figure 1.2 Power Density trend in CMOS designs...................................................................................14 Figure 1.3 Leakage and Dynamic Power Dissipation [2].........................................................................15 Figure 1.4 Schematic of Power Grid in CMOS designs...........................................................................18 Figure 1.5 Normalized delay and normalized delay to voltage ratio........................................................21 Figure 1.6 Total power break up into leakage and active........................................................................23 Figure 2.1 Schematic of logic circuit 1......................................................................................................31 Figure 2.2 Schematic of Logic Circuit 2....................................................................................................32 Figure 2.3 Gated clock example ...............................................................................................................34 Figure 2.4 Gate Level Netlist for 'simple' design......................................................................................36 Figure 2.5 Timing Arcs in extracted model of 'simple' design..................................................................37 Figure 3.1 Venn diagram of Power Components.....................................................................................40 Figure 3.2 Power Estimation in Design Stages........................................................................................45 Figure 3.3 Power Estimation Validation Flow...........................................................................................49 Figure 3.4 Legends for Validation Flow....................................................................................................49 Figure 4.1 Voltage over time representation at an internal design node ................................................63 Figure 4.2 Schematic circuit for instantaneous voltage drop analysis ....................................................64 Figure 4.3 Inverter waveforms measured at different nodes...................................................................66 Figure 4.4 transition time vs. peak power for Inverter..............................................................................68 Figure 4.5 Transition time vs. peak power for nand gate.........................................................................68 Figure 4.6 Load vs. peak power for AND gate.........................................................................................69 Figure 4.7 Load vs. Peak power for OR gate...........................................................................................69 Figure 4.8 State Dependency on cell switching .......................................................................................70 Figure 4.9 Cell Characterization Flow.......................................................................................................72 Figure 4.10 Power Grid Modeling .............................................................................................................73 Figure 4.11 Peak IR drop Computation Flow...........................................................................................79 Figure 4.12 Prime Time flow for arrival time computation .......................................................................80 Figure 4.13 Power Grid Generation Flow.................................................................................................81 Figure 4.14 PSN waveform of Proposed Method.....................................................................................86 Figure 4.15 PSN Reference Waveform....................................................................................................86 Figure 5.1 Gated Power Supply ([74]) ......................................................................................................89 Figure 5.2 Layout of 1M gate with switch network...................................................................................92 Figure 5.3 Current Glitch and Voltage Ramp at arbitrary switch output..................................................92 Figure 5.4 Typical PG network with Power Switches...............................................................................93 Figure 5.5 Schematic Switch network Analysis Flow...............................................................................95 Figure 5.6 Analysis model of Virtual Power Network...............................................................................96 Figure 5.7 Infinitesimal Time Division for Current Prediction...................................................................97 Figure 5.8 Reduced Switch Network for validation ................................................................................100 Figure 5.9 Voltage Ramp up over Time for various nodes ....................................................................103 Figure 5.10 Current comparison over time.............................................................................................103 Figure 1 1MHz, Peak: 838.9 uW.............................................................................................................118 Figure 2 100MHz, Peak: 840.7 uW.........................................................................................................118
List of Tables Table 1.1 Consolidation of ITRS2003 Predictions ...................................................................................14 Table 1.2 Generic Term Definitions ..........................................................................................................25 Table 2.1 Comparison of Static vs Dynamic approaches for Power Estimation.....................................28 Table 3.1 Power Modeling for CMOS gates.............................................................................................43 Table 3.2 ISCAS89 circuit description ......................................................................................................54 Table 3.3 Runtime comparison between vector less and SPICE............................................................55 Table 3.4 Clock Power vs. Total Power....................................................................................................57 Table 3.5 Power Estimation across various tools ....................................................................................60 Table 4.1 Comparison of Peak power Dissipation...................................................................................84 Table 4.2 Comparison of percentage peak instantaneous IR drop.........................................................85 Table 4.3 Comparison of percentage peak IR drop on ISCAS89 circuits...............................................85 Table 5.1 Switch Prediction by proposed algorithm...............................................................................102 Table 5.2 Voltage Prediction...................................................................................................................102 Table 5.3 Power Up analysis - Runtime Comparison ............................................................................103
10
11
Abstract
Power has become an important design closure parameter in today’s ultra low submicron
digital designs. The impact of the increase in power is multi-discipline to researchers ranging
from power supply design, power converters or voltage regulators design, system, board and
package thermal analysis, power grid design and signal integrity analysis to minimizing power
itself. This work focuses on challenges arising due to increase in power to power grid design
and analysis.
Challenges arising due to lower geometries and higher power are very well researched topics
and there is still lot of scope to continue work. Traditionally, designs go through average IR
drop analysis. Average IR drop analysis is highly dependent on current dissipation estimation.
This work proposes a vector less probabilistic toggle estimation which is extension of one of
the approaches proposed in literature. We have further used toggles computed using this
approach to estimate power of ISCAS89 benchmark circuits. This provides insight into quality
of toggles being generated. Power Estimation work is further extended to comprehend with
various state of the art methodologies available i.e. spice based power estimation, logic
simulation based power estimation, commercially available tool comparisons etc. We finally
arrived at optimum flow recommendation which can be used as per design need and schedule.
Today’s design complexity – high frequencies, high logic densities and multiple level clock and
power gating - has forced design community to look beyond average IR drop. High rate of
switching activities induce power supply fluctuations to cells in design which is known as
12
instantaneous IR drop. However, there is no good analysis methodology in place to analyze this
phenomenon. Ad hoc decoupling planning and on chip intrinsic decoupling capacitance helps
to contain this noise but there is no guarantee. This work also applies average toggle
computation approach to compute instantaneous IR drop analysis for designs. Instantaneous IR
drop is also known as dynamic IR drop or power supply noise. We are proposing cell
characterization methodology for standard cells. This data is used to build power grid model of
the design. Finally, the power network is solved to compute instantaneous IR drop.
Leakage Power Minimization has forced design teams to do complex power gating – multi
level MTCMOS usage in Power Grid. This puts additonal analysis challenge for Power Grid in
terms of ON/OFF sequencing and noise injection due to it. This work explains the state of art
here and highlights some of the issues and trade offs using MTCMOS logic. It further suggests
a simple approach to quickly access the impact of MTCMOS gates in Power Grid in terms of
peak currents and IR drop. Alternatively, the approach suggested also helps in MTCMOS gate
optimization. Early leakage optimization overhead can be computed using this approach.
13
1 Introduction
1.1 Motivation
VLSI industry is facing one of the biggest challenges in its evolution – Power Integrity closure
– the next after cross talk induced integrity issues in previous decade. Power Dissipation has
phenomenally increased across years as shown in Figure 1.1 giving rise to this challenge.
Figure 1.2 shows the increase in power density due to ultra low scaling and hence increasing
the components cramped in unit area.
5KW 18KW
1.5KW 500W
40048008
80808085
8086286
386486
Pentium® proc
0.1
1
10
100
1000
10000
100000
1971 1974 1978 1985 1992 2000 2004 2008Year
Pow
er (W
atts
)
5KW 18KW
1.5KW 500W
40048008
80808085
8086286
386486
Pentium® proc
0.1
1
10
100
1000
10000
100000
1971 1974 1978 1985 1992 2000 2004 2008Year
Pow
er (W
atts
)
Figure 1.1 Power Dissipation in CMOS designs
14
400480088080
8085
8086
286 386486
Pentium® procP6
1
10
100
1000
10000
1970 1980 1990 2000 2010Year
Pow
er D
ensi
ty (W
/cm
2)
Hot Plate
NuclearReactor
RocketNozzle
400480088080
8085
8086
286 386486
Pentium® procP6
1
10
100
1000
10000
1970 1980 1990 2000 2010Year
Pow
er D
ensi
ty (W
/cm
2)
Hot PlateHot Plate
NuclearReactorNuclearReactorNuclearReactor
RocketNozzleRocketNozzleRocketNozzle
Figure 1.2 Power Density trend in CMOS designs
Table 1.1 below shows consolidation of ITRS2003 [1] predictions on power as well as its
impact on design as well as operating voltages.
2003 2004
(90u) 2005 2006
2007
(65u) 2008 2009
2010
(45u) 2012
Vdd(High Perf) 1.2 1.2 1.1 1.1 1.1 1 1 1 0.9
Vdd(Low Power) 1 0.9 0.9 0.9 0.8 0.8 0.8 0.7 0.7
High Perf Power (W) 149 158 167 180 189 200 210 218 240
and lower level power minimization. In this case, power dissipation is described as below.
P = (A*C*V^2*f) + (τ*A*V*Ishort) + (V*Ileak)
Where
A = activity factor à this specifies the amount of switching at various internal
nodes of design. Note that ‘f’ is clock frequency which is readily available for
most designs. Activity factor specifies about how much a node toggles per ‘f’
transitions of clock. The activity factor can be derived from simulation patterns
of the logic.
C = capacitance à Interconnect load capacitance or wire capacitance
V = dynamic voltage à voltage at which the logic operates
f = frequency à clock frequency at which the logic operates
40
Ishort = short-circuit current during switching à During transition in CMOS
logic, both NMOS and PMOS are ON for a momentarily of time. This time
current finds a direct path from Power Supply to Ground. This is called short
circuit current. It is dependent on input transition duration of CMOS.
τ = duration of short-circuit current
Ileak = leakage current [72-80][32]
Figure 3.1 defines various components of power and their relation ship or contribution to total
power estimation.
Switching power (70-80%) power dissipated by the
charging and discharging of the load capacitance.
∑∀Cell
iTRiCloadVDD ))(*)((*)2^(
Static (leakage) power (5%): power dissipated by a gate
when it is not switching
∑∀ )(iCell
ge(i)PCellLeaka
Dynamic Power consists of Switching Power and Short Circuit Power
ASIC Flow characterizes libraries for average and leakage power.
Short Circuit powerpower dissipated by a momentary short circuit between the P and N transistors of a gate during switching
Cell Internal Switching Power –can vary based on macro Size
InternalInternalPowerPower
Switching power (70-80%) power dissipated by the
charging and discharging of the load capacitance.
∑∀Cell
iTRiCloadVDD ))(*)((*)2^(
Static (leakage) power (5%): power dissipated by a gate
when it is not switching
∑∀ )(iCell
ge(i)PCellLeaka
Dynamic Power consists of Switching Power and Short Circuit Power
ASIC Flow characterizes libraries for average and leakage power.
Short Circuit powerpower dissipated by a momentary short circuit between the P and N transistors of a gate during switching
Cell Internal Switching Power –can vary based on macro Size
InternalInternalPowerPower
Figure 3.1 Venn diagram of Power Components
41
In this work, above power components and their computation are extensively studied. To
address the problem in systematic manner, power estimation has been simplified the following
way. These assumptions are acceptable given the global analysis that we are considering.
Power supply and ground voltage levels throughout the chip are fixed so that it becomes
simpler to compute the power by estimating the current drawn by every sub-circuit assuming a
given fixed power supply voltage. Note that this does not mean that different blocks can not be
at different voltage level. This allows pre-characterizing library components for required
voltage points.
The circuit is built of logic gates and latches or reusable IPs, and has the popular and well-
structured design style of a synchronous sequential circuit. In other words, it consists of flops
driven by a common clock and combinational logic blocks whose inputs (outputs) are derived
from flop outputs (inputs). It is also assumed that the flops are edge-triggered and, with the use
of CMOS design technology, the circuit draws no steady-state supply current. This allows
breaking down average power dissipation of the circuit into 2 components
• The power consumed by the flops
• The power consumed by the combinational logic blocks.
This chapter is organized as below. In the next section, we have further explained cell based
power analysis. Next section briefly introduces tools used to compare power estimation as
performed by toggle computation described in previous chapter. Later validation and results are
described.
42
3.2 Current approaches to Power Analysis
Cell based power estimation consists of cell characterization and logic simulation or activity
estimation. The characterization phase entails a set of electrical simulations of each library cell
for all possible input transitions and for a wide range of fanin and fanout conditions. Timing
and power information obtained in this way is used to construct lookup tables for the basic
library elements [46][69].
Summing the leakage power of the design’s constituent library cells derives the total leakage
power of a circuit:
PleakageTotal = ∑∀ )(iCell
ge(i)PCellLeaka (3)
Where PcellLeakage(I) is the leakage power dissipation of each cell. Technology library developers
annotate the library cells with the approximate total leakage power dissipated by each cell.
There is usually a single static power number per library cell but sometimes leakage power can
depend on the logical condition of the cell. In this case, the library cell is annotated with a state
dependent static power.
A cell’s internal power is the sum of the internal power of all of the cell’s inputs and outputs as
modeled in the technology library:
∑∀
=)(
)(*)(*iPin
ifiAEiInternal
P (4)
Where Ei is the internal energy of each pin. In practice, the internal energy if a pin is
characterized in the technology library and can be accessed by simple table look-up. Depending
43
on the required accuracy, different look-up tables can be provided by the library designers as
explained in Table 3.1.
Lookup Table Pin
Direction Indices
One-
dimensional
Input/
Output
Input Transition OR Output load capacitance
Two-
dimensional
Output Input transition and output load capacitance
Three-
dimensional
Output Input transition and output load capacitance of the two outputs
that have equal or opposite logic values
Table 3.1 Power Modeling for CMOS gates
The switching power is calculated in the following way:
∑∀
=Cell
ifiAiCloadVDDPswitching ))(*)(*)((*)2^( (5)
Where Cload(i) is the capacitive load of net i. Without any physical information, the load
capacitance Cload(i) is calculated using the wire load model of the net and the fanout of the
driving pin. Usually, this approach achieves relative accuracy.
Apart from the approaches mentioned above, the following factors are also important for
accurate power estimation.
44
1. Temperature dependency of power. Power consumption in CMOS depends on mobility
factors, threshold voltage and doping concentrations. These factors are temperature
dependent. Hence power also varies according to variation in temperature.
2. Voltage dependency of power. Voltage dependency of power is well known.
(P=C*V*V*f). This is true for CMOS technology also. If we model, the CMOS
component as a capacitor, it is clear that power varies based on the variation on supply
voltage.
3. Power increases with increase in frequency of operation. In fact, many designs now a
day have different modes of operation. A high frequency mode when the device is
operational and a low frequency mode when the device is in standby mode. The impact
of frequency on power estimation is already being discussed in previous section.
4. Now a day, most of the designs have a significant chunk of flops or registers. According
to one statistics, around 40-50% logic of the design contains flops. If all the flops are
clocked throughout the operation, clock network consumes almost 50% of total power.
It is sometimes helpful to analyze power consumption on clock network. This work
analyzes clock power contribution to total power.
5. Process corner also impacts the currents and power consumption. This is especially true
for leakage power. A typical VLSI process has leakage power variation of order of 4-6
from worst process to best process.
45
Based on power sensitivity and tool study analysis in this section, we propose a power
estimation flow in typical design cycle as shown in Figure 3.2 below. Note that the power
analysis varies from RTL design to pre layout netlist to post layout netlist.
* SAIF - Switching Activity File based approach
Architecture
Recommended Least Preferred
Power Estimation(spreadsheet)
RTL
Placed Netlist
Detailed Route Over
RC SPICE Netlist
Unplaced Netlist
Placed Netlist
Detailed Route Over
Toggle Frequency Calculator
Power Estimation in Power
Compiler (wire load, global SPEF,
Detailed SPEF)
RC SPICE Netlist
NanoSim
PIF File Generation
PrimePower
Forward SAIF*Or Frequency Constraints
Logi
c Si
mul
atio
n
* SAIF - Switching Activity File based approach
Architecture
Recommended Least Preferred
Power Estimation(spreadsheet)
RTL
Placed Netlist
Detailed Route Over
RC SPICE Netlist
Unplaced Netlist
Placed Netlist
Detailed Route Over
Toggle Frequency Calculator
Power Estimation in Power
Compiler (wire load, global SPEF,
Detailed SPEF)
RC SPICE Netlist
NanoSim
PIF File Generation
PrimePower
Forward SAIF*Or Frequency Constraints
Logi
c Si
mul
atio
n
Figure 3.2 Power Estimation in Design Stages
3.3 Power analysis Tools
3.3.1 Power Compiler: [67]
Formerly known as Design Power, power compiler is currently most widely used Synopsys tool.
Power compiler, typically being used during synthesis, does power optimization as well as
power estimation. This tool has static algorithms for calculating switching activity at various
46
circuit nodes and propagates the same. It is known fact that power compiler cannot estimate
good switching activity for sequential cells. It should be also noted that most ASIC vendors
have cell power modeling based on Synopsys Liberty syntax so it is highly important to have
single cell power estimation close to Power Compiler number. Synopsys Reference Manual on
Power Compiler [18] gives basic power calculation theory and description of terms being used
in its tools.
We used power compiler in two modes.
One mode was to use power compiler as complete solution for power estimation. In this
approach, we generated input switching activity from our vectors and specified to
power compiler. Power compiler propagated the switching activity based on switching
probability. It then calculates power. In this method, it used some assignment method
for sequential cells and we went ahead with that because our aim was to verify default
switching activity propagation algorithm of Power Compiler.
Second mode was to use power compiler just as power calculation engine. In this
approach, we generated switching activity at all the nodes by using methodology
defined in Chapter 3 and used the power calculation engine. As mentioned earlier,
power calculation engine is quite accurate and so based on power estimation; our aim
was to evaluate switching activity determination accuracy of other methods.
3.3.2 Power Mill (or Nano Sim) [4][68]
Power Mill is Synopsys tool (currently known as Nano Sim) with fast SPICE engine at core. It
has been identified as nicely correlating for two of the single cell circuits and one small design
47
with SPICE. Power Mill is dynamic simulation based tool and hence it requires patterns for
simulation.
We used Power Mill to calculate average and peak power. The main reason was runtime
advantage of PowerMill compare to SPICE. It should be noted here that Power Mill is capable
of taking SPICE net list as input so any switching between from Power Mill and SPICE is
transparent, if needed.
3.3.3 Prime Power [66]
Prime Power is another offering in Synopsys power portfolio. This is dynamic vector based
solution. However the key difference with Power Mill is that Power Mill is SPICE based tool
whereas Prime Power is logic simulation based tool. In other words, Power Mill is more tuned
for accuracy and Analog kind of designs whereas Prime Power is tuned to digital and
specifically ASIC kind of designs with reasonably good accuracy. Prime Power has PLI
interface with leading industry simulators e.g. VCS, Modelsim, Verilog etc. While doing logic
verification with these simulators, if we instantiate one call/command, the PLI dumps binary
files. These binary files can be used in Prime Power to do power estimation. It should be noted
that Prime Power can do peak power analysis also.
We used Prime Power for both average and peak power analysis. The simulator interface being
used was VCS.
3.3.4 Other Tools
This project used VTRAN for converting vectors to SPICE stimulus. VTRAN is one of the
offerings as part of Synopsys and is generic translator of vectors from one format to another. It
48
is supporting all major industry formats as well as internal formats of many prominent
ASIC/EDA vendors.
VCS was used for logic simulation. There is no specific reason for using this simulator except
that it is Synopsys offering so will go with Prime Power without major hurdles.
There are few TI internal programs used to set up an automated flow. They are listed below.
1. genFuncTDL – An internal utility to generate random vectors with specified clock rate.
2. SimOut – A test constraint validation environment.
3. SDFAligner – for translating SDF from one simulator to other simulator compatible
format.
4. SigProbGen – For converting vectors to input switching activity and probability
calculator.
5. DREPGEN – for generating data compatible for TFC.
6. ASCII benchmark data to Verilog netlist and SPICE netlist translator.
3.4 Validation Flow
The validation flow diagram, data management and color convention is shown in Figure 3.3.
Some of the key steps are described below.
49
VERILOGNETLIST
POWER
RANDOMTDL
SIGPROBGEN
PIF
TESTBench
GENFUNCTDL
POWERESTIMATION
DREPGEN
VTRAN
SMOUT
VCS_PIF
PrimePower
PWLFILE
TFC
TRANSLATERSPICE
POWERMILL
USERFREQFILE
SWITCHINGACTIVITYFILE
DREPFILE+ DATA
SpiceNETLIST
VTRAN cmd
SDF
CFG
CMD
Full VCD
COMPARISON ANDREPORT
DC Scripts
ISCAS89Circuits
TRANSLATERVerilog
VERILOGNETLIST
POWER
RANDOMTDL
SIGPROBGEN
PIF
TESTBench
GENFUNCTDL
POWERESTIMATION
DREPGEN
VTRAN
SMOUT
VCS_PIF
PrimePower
PWLFILE
TFC
TRANSLATERSPICE
POWERMILL
USERFREQFILE
SWITCHINGACTIVITYFILE
DREPFILE+ DATA
SpiceNETLIST
VTRAN cmd
SDF
CFG
CMD
Full VCD
COMPARISON ANDREPORT
DC Scripts
ISCAS89Circuits
TRANSLATERVerilog
Figure 3.3 Power Estimation Validation Flow
n White : Third Party toolsn Green : Automatically generated data or written translatorn Grey : TI toolsn Default : standard inputs/outputsn Blue: Final Outputn Elipse : Data file(s)n Rhombus : Process Block(s)
Figure 3.4 Legends for Validation Flow
50
3.4.1 Netlist Setup:
Standard industry benchmark circuits – ISCAS89 are used for the validation. The circuits’
complexity ranges from 14 gates to 22000 gates. The detail statistics of the circuit is mentioned
in Table 2. [71]
To make the validation complete, two single cell circuits are added for ‘micro’ level validation.
ISCAS89 benchmark circuits were mapped to 130nm technology for analysis. Note that there is
no optimization or synthesis being used while mapping the circuits to 130nm technology
however predetermined set of cells was used. They are,
• 2,3,4 inputs AND/NAND gates
• 2,3,4 inputs OR and NOR gates
• Buffers and inverters
• 2,3 inputs ex-or and ex-nor gates
• Flops
3.4.2 Vector Generation
Random vectors were generated for all the ISCAS89 circuits. The numbers of vectors were
based on circuit complexity and number of gates. They vary from 4 vectors to 38000 vectors
approximately. The same set of vectors is used for logic simulation and SPICE simulation as
well as derivation of switching activity and static probabilities for Input Pins.
51
3.4.3 Interconnect setup
All the circuits can be estimated as synthesized Verilog netlist and hence the parasitic
information was not available. To make comparison more realistic, no load modes were used in
power compiler and in SPICE simulation. The logic simulation was based on SDF generated
from Synopsys.
3.5 Validation and Results
The complete data from different tools are shown in Table 3.5. Table 3.2 describes circuits used
for benchmarking. Table 3.3 compares run time between dynamic method and modified toggle
computation method for some of the big design blocks. Table 3.4 shows power estimation for
clock network vs. total power estimation. All the power data is dynamic power in uW.
• The power numbers mainly reflect the cell internal power and switching power only due
to gate input capacitances as no interconnects were assumed.
• All the experiments are done at nominal operating point i.e. normal process, 25 C
temperatures and 1.2 voltage (nominal voltage).
• Clock network power is 50% of total dynamic power but this is not true in all cases.
• Run time reduction from static approach is more than 1000 times.
• Prime Power reported power is optimistic in many cases to PowerMill. This is not in
our expectation and we are looking into it.
• TFC is within 30% of PowerMill reported power. However there are certain exceptions
where it reports 30% optimistic power or >50% pessimistic power.
• Power Compiler is >50% pessimistic in most of the cases.
52
Design Name
IN OUT Flops Boolean (gates+inv)
s111 8 1 0 8
s1196 14 14 18 388+141
s1238 14 14 18 428+80
s13207 31 121 669 2573+5378
s13207_1 62 152 638 2573+5378
s1423 17 5 74 490+167
s1488 8 19 6 550+103
s1494 8 19 6 558+89
s15850 14 87 597 3448+6324
s15850_1 77 150 534 3448+6324
s208_1 10 1 8 66+38
s27 4 1 3 8+2
s298 3 6 14 75+44
s344 9 11 15 101+59
s349 9 11 15 104+57
53
Design Name
IN OUT Flops Boolean (gates+inv)
s35932 35 320 1728 12204+3861
s382 3 6 21 99+59
s38417 28 106 1636 8709+13470
s38584 12 278 1452 11448+7805
s38584_1 38 304 1426 11448+7805
s386 7 7 6 118+41
s4 2 1 1 0
s400 3 6 21 106+58
s420_1 18 1 16 140+78
s444 3 6 21 119+62
s5 2 1 0 1+0
s510 19 7 6 179+32
s526 3 6 21 141+52
s526n 3 6 21 140+54
s5378 35 49 179 1004+1775
s641 35 24 19 107+272
54
Design Name
IN OUT Flops Boolean (gates+inv)
s713 35 23 19 139+254
s820 18 19 5 256+33
s832 18 19 5 262+25
s838_1 34 1 32 288+158
s9234 19 22 228 2027+3570
s9234_1 36 39 211 2027+3570
s953 16 23 29 311+84
Table 3.2 ISCAS89 circuit description
Design TFC + Power Compiler Runtimes (in mts) PowerMill runtime (CPU Hr)
S13207 3 23
S13207_1 3 24
S15850 3 25
S15850_1 3 26
S35932 6 250
55
Design TFC + Power Compiler Runtimes (in mts) PowerMill runtime (CPU Hr)
S38417 6 189
S38584 7 205
S38584_1 7 212
Table 3.3 Runtime comparison between vector less and SPICE