[Engineering Feature]
Stanch The Bleeding Of Leakage Power At 65 nm
Even as leakage overwhelms their power budgets, IC design teams are finding ways to plug the holes that are costing them dearly at sub-micron nodes.
David Maliniak
ED Online ID #17402
November 5, 2007
Copyright © 2006 Penton Media, Inc., All rights reserved. Printing of this document is for personal use only.
Reprints
As 90-nm process technologies began
entering the mainstream a few years ago,
it became clear that device delays were no
longer the chief culprit. Interconnect
delays had caught and passed them,
becoming the number-one contributor to
timing woes.
Now as the 65-nm node is hitting its
stride, a parallel trend has arisen for designers in the power
domain. No longer is dynamic power consumption the dominant
factor in total power budgets. Rather, leakage power
dominates those budgets. That's power down the drain (pun
intended) that can't be used for greater performance. But
why is leakage power such a huge concern?
"For cell phones, manufacturers want standby power consumption
to be no more than 5% of the full operational power
consumption," says Dian Yang, general manager and vice
president of product management at Apache Design Systems.
"So if full power consumption is, say, 100 mW, standby power
can't be more than 5 mW. But at deep-submicron nodes, over
50% of total power consumption is lost to leakage."
Leakage is a fact of life for CMOS transistors. And what's
already a bad situation at 65 nm can get worse - much
worse - at 45 nm. Yet design techniques are available to help
mitigate the leakage situation. Upcoming materials and
process tweaks also hold promise.
Root causes
There are two primary sources of leakage
in MOS transistors (Fig. 1). One is the subthreshold leakage,
which is leakage from drain to source (or power to
ground). Subthreshold leakage is rising with each process
node and shows no sign of abating. The mechanics of subthreshold
leakage are based on the fact that no transistor is a
perfect switch.
"In digital logic we all think of them as perfect switches,
but they never really turn off completely," says Jerry Frenkil,
CTO and vice president of research and development at
Sequence Design.
The issue can be seen in terms of the three main regions of
operation for a transistor. There's the cutoff region, where current
is effectively zero. In the saturation region, the transistor
is completely on and can pump a lot of current. In the linear
region, the device essentially functions as a linear amplifier.
"Between the linear and cutoff regions, there's a weak
inversion current flowing between source and drain. The
transistor begins to invert, but it's in a sensitive region where
a small change in gate voltage results in a large change in
current," says Frenkil.
"The degree of change in the current is directly related to
how low the threshold voltage is. The drain current on a
transistor is a function of, among other things, the voltage on
the source, drain, and gate. You can't make that term in the
equation go completely to zero, so there's always a little bit
of current flowing," he adds.
The other main component of the overall leakage issue is
gate-oxide leakage (Fig. 1, again). Gate leakage (as it's commonly
known) is an unhappy byproduct of progress. Transistor
gates are composed of polysilicon sitting on silicon dioxide,
which has the advantage of being very easy to fabricate.
But as semiconductor processes have scaled downward, gate
lengths are obviously shorter. The downward scaling affects all
dimensions, so that silicon dioxide gate layer has become thinner as well to increase gate capacitance and thereby drive current.
Consequently, gate leakage manifests itself as electron
tunneling through the gate oxide.
Differentiating between these two primary sources of leakage
power is critical. While gate leakage is an issue that can,
and in all likelihood will, be solved with process and materials
improvements, subthreshold leakage is entirely a designrelated
problem in terms of any possible fixes.
"In the long run, designers have to worry about subthreshold
leakage but not gate leakage," says Frenkil. "At 65 nm,
there's no convenient process solution for gate leakage. But
at the smaller nodes, there will be."
Attacking the problem
So the design community
and EDA vendors have turned their attention largely to the
control of subthreshold leakage. Several low-power design
methodologies are in use, and they all have their tradeoffs in
terms of relative benefits and impacts (see the table).
One of the most common techniques being applied of late
is multi-VT cell swapping. This technique involves use of
libraries with two or more voltage thresholds. The idea is to
provide synthesis options for simultaneous optimization of
timing, area, and power. A library with a lower VT will leak
more, but it will be faster than the high-VT library. Designers
can opt to use slower, but less power-hungry, cells on noncritical
paths.
"Multi-VT complicated things the least," says Sequence's
Frenkil. "Use of multi-VT may mean a modestly longer timing
closure period." However, Frenkil adds, the leakage gains
to be had from multi-VT are not huge. "It usually cuts leakage
in half," he says.
Reverse body biasing of transistors can also help with subthreshold
leakage by essentially turning the transistor "more
off." Gate leakage is directly proportional to the gate-to-substrate
voltage, VGS. Increasing VGS reduces leakage, but it
also lowers performance.
Opinions differ on the merits of reverse biasing. According
to Frenkil, reverse body biasing is losing favor at advanced
nodes. "It has less effect on leakage with scaling," Frenkil
notes. But Apache's Dian Yang believes
that back biasing can be combined with
variable-threshold CMOS (VTCMOS)
technology to dynamically alter VGS as
necessary for leakage control in critical
paths. For non-critical paths, a higher
VGS can come into play full time for
leakage reduction.
Apache's RedHawk-ALP tool for
physical power integrity supports a
number of techniques for leakage control,
including VTCMOS back biasing
and the insertion of power gating for
memory IP.
Speaking of power gating, it's a technique
that will come into play more at
65 and 45 nm. Power gating (or power
shutoff, as some term it) entails the insertion
of switches that shut off power to
inactive functional blocks. There's good
news and not-so-good news associated
with power gating, however.
The good news is that it can profoundly
reduce leakage power from one
to three orders of magnitude. "For
those seeking ultra-low leakage, they'll
need one flavor or another of power
gating," says Frenkil. The not-so-good
news is that power gating comes with a host of complications to the design flow. In addition to having
to figure out where to place power switches, you have to figure
out how large or small to make them.
"The sizing of the switches is critical," says Frenkil. The
larger the switches, the less they cost in terms of performance.
But larger switches consume more area and degrade leakage
reduction. Smaller switches save on area, performance suffers
more, but there's more leakage reduction.
Power shutoff switches also can wreak havoc for chip floorplanning,
says Frenkil. "If you are power-gating blocks on the
chip, their power rails have to be separated from non-powergated
domains. If it's more than just one or two, it's a real
headache for floorplanning," he says.
Power-shutoff switches also can cause issues with rush currents
and wakeup times. Upon closing of a power switch to a
block, the rush current can be large enough to be damaging if
not managed properly.
Finally, power-shutoff switches bring a number of issues
related to functional verification. Are the
control signals for all switches correct?
Have floating outputs been rectified?
Will there be issues with state retention
for blocks that are turned off?
Focus on flows
For all of the
above reasons, it behooves designers to
consider tool flows that account for
these and other leakage-related factors.
For example, Sequence's tools begin
with exploration of the effects of power
gating at RTL, enabling what-if analyses
of the effects of gating various
blocks. The flow moves on to automatic
sizing and insertion of switches and
then to a final voltage-drop analysis
stage, including analysis of the effects of
voltage drops on delays.
Sequence's flow, which includes Power
Theatre, CoolTime, and CoolPower,
takes a holistic approach to power from
RTL to GDSII (Fig. 2). It's prudent for
designers to consider the entire design
flow when considering leakage, including
the architectural level.
Of course, all of the EDA industry's
three large RTL-to-GDSII tool vendors
have some form of an integrated flow
that attempts to address low-power
design. Magma Design Systems throws
two tools in particular at the problem.
Talus Power operates on the optimization
aspect, spanning RTL to GDSII,
while Quartz Rail performs both power
analysis as well as static and dynamic
voltage-drop analysis. The latter analyzes
the impact of IR drop on delay
and also performs thermal analysis.
"These tools work hand in hand,"
explains Arvind Narayanan, a Magma
director specializing in low power. "If you do multi-VT
optimization, the optimization engine has visibility into
power, timing, and area." As with all integrated implementation
flows, the ability to perform concurrent optimization
has the best likelihood of delivering improved quality of
results without multiple iterations.
Architecture matters
It would seem counterintuitive
to think that much could be accomplished at the architectural
stage of the design cycle with regard to leakage management.
ChipVision Design Systems is one EDA vendor that
has targeted the architectural level for optimization. Earlier
this year, it announced electronic-system-level (ESL) technology
that lets RTL designers work interactively with systemlevel
descriptions to generate power-optimized RTL code.
ChipVision also is part of a European initiative to control
leakage power under the aegis of the OFFIS research and
development consortium. The initiative, called Controlling LEAkage power in NanoCMOS SoCs
(CLEAN), has a number of European
industrial houses and research institutes
among its members.
According to Wolfgang Nebel,
ChipVision's chief technology advisor
and CLEAN's scientific leader, the
effort's objective is to find ways to
reduce leakage for the current generation
of process technology as well as for
future generations.
"We've made substantial progress in
understanding the potential of the lower-
level techniques," says Nebel.
"We've made good progress in modeling
their impact at the higher levels.
There's still work to be done to really
apply all these techniques. The first of
them are already used by the industrial
partners in CLEAN."
The CLEAN development effort,
which counts Infineon and STMicroelectronics
among its industrial partners,
began its three-year term in 2006
and will conclude by the end of 2008.
Also, Nebel points to some pending
CMOS technology improvements that
should, in combination with high-k
dielectrics, go a long way toward solving
the gate-leakage problem (Fig. 3).
The next generation of CMOS technology,
variously termed thin- or
ultra-thin-body CMOS, will provide
much better control over gate leakage,
virtually eliminating it. In its initial
appearance, expected in the 2010
timeframe, subthreshold leakage will
also be substantially lower compared
to bulk CMOS.
The long-term solution, Nebel
believes, is dual-gate or FinFET technology,
which the ITRS expects to
appear with the 40-nm node in 2011.
"FinFETs are named for how they'll
look, standing vertically on the substrate
rather than lying horizontally
and looking something like a shark's
fin," says Nebel.
FinFETs will be completely surrounded
by a gate, providing much greater
control over the channel. "FinFETs will
take us back to the 'good old days'
when the dynamic power consumption
was the biggest contributor to the overall
power budget," says Nebel.
FinFET technology is on the
roadmap for at least one large systems
house. According to Kazu Yamada, vice
president and general manager of the Custom SoC business unit at NEC Electronics
America, NEC's research and
development laboratory is already
working with FinFET.
"But when will we move to that technology
is a moving target," says Yamada.
"Five years ago, we thought it
would be at 32 nm. Now, perhaps it'll
come into play at 28 nm or 24 nm. And
before then, there may be further
breakthroughs that will allow us to
hold off further."
Leveraging multicore
The
growing emphasis on multicore architectures
is an important trend to watch.
Anmol Mathur, chief technology officer
at Calypto Design, points out that leakage
is significantly reduced by moving
to multicore architectures.
"So far, most of the things done at
RTL and above are geared at reduction
of dynamic power," says Mathur. "Typically,
leakage is addressed more at
implementation using multi-VT cells
and other techniques."
But Mathur believes that the tide is
turning. "People are starting to think
about at RTL and at the architectural level.
They're taking it to a slightly higher
level of abstraction," he says.
The advantage of multicore architectures
with regard to leakage stems from
the breaking apart of a very power-hungry
function. "You have a fixed amount
of area on the chip," Mathur says.
"You can use a very fast single core,
with fast memories and caches, or you
can use that same real estate for, say,
four smaller cores, each at a lower frequency,
and get the same aggregate
throughput," he adds. Those four lower-
frequency cores enable scaling back
the power supply, reducing dynamic
power and leakage power.
Examination of power management
at the microarchitectural level is critical,
according to Mohit Bhatnagar of
Encounter product marketing at
Cadence Design Systems (Fig. 4). "At
65 nm, you have to answer questions,"
he says. "What is the range of techniques
I'm using? If I use power shutoff
switches, what is the range of voltage
domains I should use? Should I use my
foundry's generic process or their lowpower
process, compromising performance
but making power goals more
attainable?"
Equally important, says Bhatnagar, is
availing oneself of an automated flow for
these architectural explorations. "The
range of choices is very large. You don't
want to undertake this process manually,"
Bhatnagar says. Partitioning blocks
into various voltage domains might also
mean hierarchies within blocks with subblocks
at lower voltages.
What can you do to control sub-threshold
leakage when the chip is fully awake
and active? See "Control Leakage In
Active Mode" at www.electronicdesign.com, Drill Deeper 17400. Also, find out
how recent advances in materials can
help solve the gate-leakage problem in
"Materials Play A Key Role In Stopping
Leakage," Drill Deeper 17401.
|