Title: INTRODUCTION TO LOWPOWER DESIGN
1INTRODUCTION TOLOW-POWER DESIGN
2Why Low-Power Devices?
- Practical reasons
- (Reducing power requirements of high throughput
portable applications) - Financial reasons
- (Reducing packaging costs and achieving memory
savings) - Technological reasons
- (Excessive heat prevents the realization of high
density chips and limits their functionalities)
3Application Fields
- Portable Electronics (PC, PDA, Wireless)
- IC Cost (Packaging and Cooling)
- Reliability (Electromigration, Latch-up)
- Signal Integrity (Switching Noise, DC Voltage
Drop) - Thermal Design
- Ultra-low-power applications
- Space missions (miniaturized satellites)
4Different Constraints for Different Application
Fields
- Portable devices Battery life-time
- Telecom and military Reliability (reduced power
decreases electromigration, hence increases
reliability) - High volume products Unit cost
- (reduced power decreases packaging cost)
5Design Technology Trend
6Driving Forces for Low-Power Portable
Applications
- The market of portable applications is growing
rapidly.
Global Market for Cellular Phones
7Driving Forces for Low-Power Deep-Submicron
Technology
- ADVANTAGES
- Smaller geometries
- Higher clock frequencies
- DISADVANTAGES
- Higher power consumption
- Lower reliability
8Driving Forces for Low-PowerBattery Limitations
- Battery maximum power and capacity increases
10-15 per year - Increasing gap with respect to power demand
9What has worked up to now?
- Voltage and process scaling
- Design methodologies
- Power-aware design flows and tools, trade area
for lower power - Architecture Design
- Power down techniques
- Clock gating, dynamic power management
- Dynamic voltage scaling based on workload
- Power conscious RT/ logic synthesis
- Better cell library design and resizing methods
- Cap. reduction, threshold control, transistor
layout
10Opportunities for Power Savings
11Realistic Estimation Expectations
12Power Metrics
- Average power Related to battery lifetime.
- Peak power Related to reliability and thermal
failure - RMS power Related to cycle-by-cycle power
- Energypower ? time Related to power-delay
product.
13Why CMOS?
VDD
- Intrinsically low power consuming
- (when the input is static, there is no power
consumption) - Reference technology
- Ease of design.
PMOS
Vin
Vout
NMOS
Basic CMOS configuration
VSS
14An NMOS gate
Polysilicon or Metal
Oxide
Gate
Source
Drain
p
n
n
15A biased NMOS gate
VGSgt0
p
n
n
16A biased NMOS gate
VGSgt0
p
n
n
17A biased NMOS gate
VGSVTn
p
n
n
When VGS ? VTn , the n-channel is developed and
the device can start operation by applying a
positive VDS
18Standard (Non-Adiabatic) Capacitance Change
19Energy Stored in C
20Energy Supplied by the Battery
21Heat Energy Dissipated (Non-Adiabatic)
22Quasi-Adiabatic Charging of C
23Sources of Power Consumption in CMOS
24Power Dissipation in CMOS Circuits
- Ptotal Pswitching Pshort-circuit Pleakage
Due to charging and discharging capacitors
(dynamic power consumption)
Due to direct paths
Due to leaking diodes and transistors
75
5
20
25Inverter A Basic CMOS Gate
26Inverter Analysis
27Inverter Analysis
28Energy Consumed (related to Battery Power)
- Energy consumed due to a complete cycle 0?1?0.
29Dynamic Power Consumption(related to Battery
Power)
- Average power consumption by a node cycling at
each period T - (each period has a 0?1 or a 1 ?0 transition)
- Average power consumed by a node with partial
activity - (only a fraction ? of the periods has a
transition)
30Dynamic Power Consumption(related to Inverter)
- Average power consumption by a node cycling at
each period T - (each period has a 0?1 or a 1 ?0 transition)
- Average power consumed by a node with partial
activity - (only a fraction ? of the periods has a
transition)
31Dynamic Power Consumption
- Define effective capacitance Ceff
- To minimize switching power
- Reduce VDD
- Reduce Ceff
32Factors Influencing Ceff
- Circuit function
- Circuit technology
- Input probabilities
- Circuit topology
33Some Basic Definitions
- Signal probability of a signal g(t) is given by
- Signal activity of a logic signal g(t) is given by
where ng(t) is the number of transitions of g(t)
in the time interval between T/2 and T/2.
34Factors Influencing CeffCircuit Function
- Assume that there are M mutually independent
signals g1, g2,...gM each having a signal
probability Pi and a signal activity Ai, for i ?
n. - For static CMOS, the signal probability at the
output of a gate is determined according to the
probability of 1s (or 0s) in the logic
description of the gate
P1
P1
P1P2
1-(1-P1)(1- P2)
P1
1-P1
P2
P2
35Factors Influencing CeffCircuit Function
(Static CMOS)
- Transistors connected to the same input are
turning on and off simultaneously when the input
changes - CL of a static CMOS gate is charged to VDD any
time a 0?1 transition at the output node is
required. - CL of a static CMOS gate is discharged to ground
any time a 1? 0 transition at the output node is
required.
NOR Gate
36Factors Influencing CeffCircuit Function
(Static CMOS)
- Two-input NOR gate
- Assume only one input transition per cycle is
allowed - Assume inputs are equiprobable pApB1/2.
- The probability for the output to be 1 is
- pY(1-pA)(1-pB)1/4
- The probability for the output to be 0 is
- pY1-pY3/4
37Factors Influencing CeffCircuit Function
(Static CMOS)
- State transition diagram of the NOR gate
38Factors Influencing CeffCircuit Function
(Static CMOS)
- Two-input XOR gate
- Assume only one input transition per cycle is
allowed - Assume inputs are equiprobable pApB1/2.
- The probability for the output to be 1 is
- pY(1-pA)pB(1-pB)pA1/2
- The probability for the output to be 0 is
- pY1-pY1/2
39Factors Influencing CeffCircuit Function
(Static CMOS)
- State transition diagram of the NOR gate
40Factors Influencing CeffCircuit Function
(Dynamic CMOS)
- At each cycle, MD is precharged to VDD.
- CL is precharged to VDD at each clock cycle
- It is discharged to ground any time a 1? 0
transition at the output node is required.
MD
41Factors Influencing CeffCircuit Function
(Dynamic CMOS)
- Two-input NOR gate
- Assume only one input transition per cycle is
allowed - Assume inputs are equiprobable pApB1/2.
- The probability for the output to be discharged
is - pY3/4
- The probability of CL to be re-charged at the
next cycle is pY.
42Factors Influencing CeffCircuit Function
(Dynamic vs Static)
- ?dynamic CMOS ? ?static CMOS
- Ceff (dynamic CMOS) ? Ceff (static CMOS)
- Power due to glitching is much smaller in dynamic
CMOS than it is in static CMOS. - In static CMOS, the transition probability
depends on both input probabilities and previous
state. - In dynamic CMOS, the transition probability
depends on solely input probabilities. - In static CMOS, the gate output does not switch
if the inputs do not change between subsequent
cycles. - In dynamic CMOS, the gate output may switch even
if the inputs do not change between subsequent
cycles.
43Factors Influencing CeffInput Probabilities
(Static CMOS)
- Two-input NOR gate
- Assume only one input transition per cycle is
allowed - Assume inputs are not equiprobable pA , pB
- The probability for the output to be 1 is
- pY(1-pA)(1-pB)
- The probability for the output to be 0 is
- pY1-pY
44Factors Influencing CeffInput Probabilities
(Static CMOS)
- The probability for the output of a NOR gate to
have a 0?1 transition
45Factors Influencing CeffInput Probabilities
(Static CMOS)
- Signal probability calculation
- For each input signal and gate output in the
circuit, assign a unique variable - Starting from at the inputs and proceeding to the
outputs, write the expression for the output of
each gate as a function of its input expression - Suppress all exponents in a given expression to
obtain the correct probability for that signal
(Recall that an exponent of a binary number is
also a binary number) -
46Factors Influencing CeffInput Probabilities
(Static CMOS)
- Signal activity calculation Boolean Difference
-
- It signifies the condition under which output f
is sensitized to input xi - If the primary inputs to function f are not
spatially correlated, the signal activity at f is
47Factors Influencing Ceff Input Probabilities
(Static CMOS)
- Signal activity through basic gates
P1 , A1
P2 A1 P1 A2
P1 , A
A
P2 , A2
P1 , A1
(1-P2 ) A1 (1-P1) A2
P2 , A2
- Signal activity is used to determine dynamic
power due to glitches.
48Factors Influencing Ceff Circuit Topology
- Circuit topology may have high impact on Ceff
- Example Chain and Tree implementation of a four
input NAND gate - Assume static CMOS
- Assume all inputs are equiprobable.
49Factors Influencing Ceff Circuit Topology
- Globally chain implementation has a lower
switching activity in the static behavior of the
circuit. - Timing skew between signals may cause hazards
resulting in extra power dissipation. - Consider 1110?1011 in chain circuit with unit
delay of each gate.
50Factors Influencing Ceff Circuit Topology
- The chain circuit suffers from hazards, but the
tree circuit does not (due to its balanced paths) - Dynamic CMOS is glitch-free because the gate
output can make at most one power consuming
transition per clock cycle.
51Power Dissipation in CMOS Circuits
- Ptotal Pswitching Pshort-circuit Pleakage
Due to charging and discharging capacitors
(dynamic power consumption)
Due to direct paths
Due to leaking diodes and transistors
75
5
20
52Power Dissipation due to Short Circuit Currents
53Power Dissipation due to Short Circuit Currents
- Short circuit current flows when both devices are
on simultaneously
- Short circuit current, Isc, flows from VDD to
ground - The power dissipated by a CMOS gate due to short
circuit current is
54Power Dissipation due to Short Circuit Currents
- Isc is significant when the input rise-fall time
is much larger than the output rise-fall times. - When input and output rise-fall times are equal,
Isc tends to zero - Isc 0 when
- because both transistors will never be on
simultaneously
55Power Dissipation due to Short Circuit Currents
Fast rising input
56Power Dissipation due to Short Circuit Currents
- The energy dissipated by a CMOS gate due to short
circuit current is
- If the current isc(t) is a triangle, then
57Power Dissipation due to Short Circuit Currents
Weight of short circuit power in the total power
consumption
58Power Dissipation due to Short Circuit Currents
Weight of short circuit power of the total energy
59Power Dissipation due to Short Circuit Currents
- Short circuit component of the total power
consumption may be important and may increase
while scaling - Short circuit power increases with signal
transition times at inputs - Short circuit power decreases with signal
increasing load capacitance (because Imax
decreases)
60Power Dissipation in CMOS Circuits
- Ptotal Pswitching Pshort-circuit Pleakage
Due to charging and discharging capacitors
(dynamic power consumption)
Due to direct paths
Due to leaking diodes and transistors
75
5
20
61Power Dissipation due to Leakage Currents
- Leakage currents are important in the systems
with long periods of inactivity - Reverse bias diode current through the transistor
drain IL - Subthreshold current through the channel of an
off transistor Ids
62Power Reduction MethodsVoltage Supply Scaling
- Historically most adapted method is the reduction
of voltage supply, VDD
75
5
20
63Power Reduction MethodsVoltage Supply Scaling
- Gate delay, Td, increases as VDD decreases!
- ?The circuit cannot be switched very fast!
64Additional Power Reduction Methods
- Preserving circuit speed and computational
throughput mandatory. - Two solutions
- Threshold voltage scaling
- Architecture driven voltage scaling based on
- Pipelining
- Parallelization
65Power Reduction MethodsThreshold Voltage Scaling
- Reduce threshold voltage while reducing supply
voltage - Example
- Circuit A VDD1.5V, VTh1V
- Circuit B VDD0.9V, VTh0.5V
- Circuits A and B approximately have the same
performance
66Power Reduction MethodsThreshold Voltage Scaling
- Td increases as VDD approaches to VTh
67Power Reduction MethodsThreshold Voltage Scaling
- If threshold voltage scaling is required,
low-threshold MOS devices must be used for the
design. - The limit on the threshold voltage scaling is
imposed by the noise margin and the increase of
the subthreshold current (Ids) - Tradeoff between Pswitching (decreases as VTh
decreases) and Pleakage (increases as VTh
decreases)
68Power Reduction MethodsThreshold Voltage Scaling
69Power Reduction MethodsArchitecture Driven
Supply Voltage Scaling
- Strategy
- 1. Modify the architecture of the system so as to
make it faster. - 2. Reduce VDD so as to restore the original
speed. Power consumption has decreased. - The most common architectural changes rely on the
exploitation of parallelization and pipelining. - Drawback
- The additional circuitry required to compensate
the speed degradation may dominate, and the power
consumption may increase. - Consequence
- Parallelism and pipelining do not always
pay-off.
70Parallel and Pipelined Architectures
Example Reference Adder-Comparator
71Parallel and Pipelined Architectures
- Supply voltage VRef 5V .
- Assume the worst-case delay (through adder and
comparator) to be 25nsec. - Best clock period allowed TRef 25nsec.
- Total effective capacitance Cref
- Power consumption
- PRef 0.5 CRef V 2Ref (Tref)-1
- CRef has been obtained assuming equiprobable
inputs. - For maximum throughput, no voltage scaling is
allowed (no additional delay) ? no power
reduction can be obtained.
72Parallel Architectures
Ppar0.36Pref
73Parallel Architectures
- Best possible clock period 25nsec.
- However, the required throughput is guaranteed if
the clock period is doubled - TPar 50nsec.
- The speed of the adder and the comparator can
thus be halved. - Supply voltage VPar 2.9V O.58 Vref
- Total effective capacitance CPar 2.15 CRef
- (factor 2.15 instead of 2 is due to extra
routing). - Since TPar 2 TRef, power consumption is
- PPar O.36 PRef
74Parallel Architectures
- Parallelism does not pay off when VDD approaches
to VTh
75Pipelined Architectures
Ppar0.39Pref
76Pipelined Architectures
- Power consumption is reduced by a factor of 2.5,
approximately as in the case of parallel
realization. - Area penalty is much more Iimited than in the
parallel case. - The use of pipelining also reduces the sequential
depth of the circuit, thus reducing power
dissipation due to hazards and critical races.
77Parallel-Pipelined Architectures
Ppar0.2Pref
78Comparison of Architectures
79Comments on Parallel and Pipelined Architectures
- Total reduction of dynamic power that can be
applied through voltage scaling is always quite
remarkable - Yet, there are some cases where modifying the
clock frequency or the relative speed of some
components through architectural changes is
infeasible (Example Time-multiplexed
architectures like DSP and microprocessors.
80Different Supply Voltages for Different Units
- Partition the chip into multiple sub-units each
of which is designed to operate at a specific
supply voltage
3V
5V
5V
SLOW
3V
FAST
5V
SLOW
SLOW
3V
3V
SLOW
3V
81Other Methods in Power Reduction
- Supply voltage scaling is not the only possible
solution to reduce power consumption. - Considerable results can be obtained through
minimization of Ceff. - Ceff is proportional to switching Ceff ?CL
- Design and synthesis techniques have been
developed to reduce both the capacitive Ioad, CL
and the switching activity, ?, at all stages of
the design process.
82SoC Design Flow
83Power Analysis
- Fast and accurate analysis in the design process
- Power budgeting
- Knowledge-based architectural and implementation
decisions - Package selection
- Power hungry module identification
- Detailed and comprehesive analysis at the later
stages - Satisfaction of power budget and constraints
- Hot spots
84Average vs Detailed Power Analysis
- Average Power Analysis is for
- Heat dissipation
- Power budgeting
- Package selection
- Implementation trade-offs for power
- Detailed Power Analysis is for
- Determining power supply specifications
- Voltage drop
- Hot spots
- Peak power