0% found this document useful (0 votes)

3 views

LPV_06

The document discusses various techniques for low power design in architecture and systems, focusing on power and performance management, including methods for reducing active power and leakage. It highlights the importance of hardware/software trade-offs, power analysis using EDA tools, and design techniques such as parallelism, pipelining, and loop unrolling. Additionally, it covers specific power-saving modes in microprocessors and strategies for managing power consumption effectively.

Uploaded by

basavalingaswamy2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

LPV_06

Uploaded by

basavalingaswamy2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 52

Low Power Design

Low power Architecture & Systems: Power &

performance management, switching activity reduction,
parallel architecture with voltage reduction, flow graph
transformation, low power arithmetic components, low
power memory design.
Managing the Power Problem

• System Design
 Hardware/software trade-offs and optimization
• Technology
 Transistor scaling, Voltage scaling
• SoC Design
 Low power memories and macros
 Logic
• Dynamic, leakage power opt w/ EDA tools to meet power budgets
• Power Analysis
 Analyze Power using EDA tools to identify Power Problems at a
realistically earlier level.
Reducing active power
• Downsizing transistors (CL)
▫ Slows down logic
• Lowering the supply voltage (VDD)
▫ Slows down logic
▫ Reducing swing slows down
the succeeding stage Pdyn ~  CL Vswing VDD f
• Reducing frequency (f)
▫ Does not reduce energy E ~  CL Vswing VDD
• Reducing switching activity (a)
▫ Logic restructuring
• Reducing glitching
▫ Balancing logic
Reducing Active Power

• Downsizing, lowering the supply on the critical path will lower

the operating frequency
▫ Downsize non-critical paths
▫ Narrows down the path delay distribution
▫ Increases impact of variations

Target
Path count

delay
Original delay
distribution

Delay
Reducing Leakage
• Using higher thresholds
▫ Channel doping
▫ Body biasing
▫ Reduces drive current
• Using stack effect
▫ Stacked devices
▫ Sleep transistors
• Using longer transistors
▫ Limited benefit
▫ Increase in active current
Power-Performance Optimization
Energy/op
Unoptimized
design

Emax

Emin
Dmin Dmax Delay

Maximize throughput for given energy or

Minimize energy for given throughput
Power-Performance Optimization
There are many sets of parameters to adjust
Tuning variables topology A
Devices
Circuit

Energy/op
(sizing, supply, threshold)
Logic style
(std. cells, custom , …)
Block topology
(adder: CLA, CSA, …) topology B
Micro-architecture
(parallel, pipelined) Delay
Design for Low-Power Techniques

• Reduced supply voltage

▫ Charging power varies as VDD2
▫ Reduce transistor threshold voltages to maintain noise margins
▫ But reduced thresholds increase leakage currents exponentially
• Change your CMOS logic family – use a low-power one
• Transistor resizing to speed-up circuit and reduce power
• Use parallelism and pipelining in system architecture – use more,
but slower, hardware
• Standby modes – clock disabling and power-down of selected
logic blocks
• Adiabatic computing – avoid gain/loss of heat during computing
• Software redesign to lower power dissipation
Low Power Design Techniques
• Low power applications
▫Remote systems (e.g., satellite)
▫Portable systems (e.g., mobile phone)
• Methods of low power design
▫Reduced supply voltage
▫Adiabatic switching and charge recovery
▫Clock suppression
▫Logic design for reduced activity
▫Reduce Hazards & Glitches (40% in arithmetic logic)
▫Transistor sizing
▫Pass-transistor logic
▫Pseudo-NMOS logic
▫Multi Threshold gates
▫Software techniques
• Reference: Chandrakasan and Brodersen
Performance Maximization
Techniques
• Use parallelism and pipelining in system architecture –
use more, but slower, hardware
• High throughput
• High router utilization
Power & performance
management
• Microprocessor sleep modes
• Performance Management
• Adaptive filtering
Power & performance
management
• It refer to general class of Low power
techniques that carefully manage the
performance and throughput of a system.
• Do not waste power by designing hardware
that has more performance than necessary.
• Throughput : the amount of work that a
system can do in a given time period
Microprocessor sleep modes
• Deactivate some functional units when no
computation is required.
• At different level of the design
• Subsystems, modules, buses, functional
units, state machine etc.
• Motorola PowerPC603 processor
Motorola PowerPC603 processor
• The CPU has three primary power saver
modes called
• DOZE
• NAP
• SLEEP
• controlled by software.
• In DOZE mode, most functional unit of the
processor are stopped except the on chip cache
memory to maintain cache coherency.

• In NAP mode, cache is also turned off to conserve

power and the processor wakes up after a fixed
amount of time or upon external interrupt.

• In SLEEP mode, the entire processor clock may be

halted and an external reset or interrupt can
resume its operation.
MODE 66 MHz 80 MHz

No power 2.18 W 2.54W

management
Dynamic power 1.89W 2.20W
management

DOZE 307mW 366mW

NAP 113mW 135mW
SLEEP 89mW 105mW
SLEEP without 18mW 19mW
PLL
SLEEP without 2mW 2mW
system clock
Performance Management:

Slid
e 17
Adaptive filtering
• The basic principle is to adjust the filter’s order
length depending on the noise characteristics of
the input signal.

• The quality or order length requirement of a

digital filter depends on:
▫ The desired signal to noise ratio of the output.
▫ The noise energy level of the input signal.
Adaptive Filtering:

FIR
Filter

Slid
e 19
Switching activity reduction

• Some of the techniques available in the reducing

the switching activity are as follows:
• Guarded Evaluation.
• Bus Multiplexing
• Glitch reduction by pipelining
Guarded Evaluation:
• It is a technique to reduce switching activities by
adding latches or blocking gates at the inputs of a
combinational modules if the outputs are not
used.
Example: Guarded Evaluation
• For example, consider a multiplier whose outputs
are used only under certain conditions.
• In this case, the input to a multiplier can be
stopped from toggling whenever outputs are not
used. It will stop unnecessary switching from
entering into the multiplier.
Bus Multiplexing
• Highly congested designs tend to consume more
power due to longer wire lengths.
• Placement has to be porous and more spread out
to route the design, resulting in longer wire
lengths and more switches per wire.
• All of this contributes to bad timing results, as
well as increased power consumption.
Bus Multiplexing (cont..)
• Reducing such busses helps both timing and
power.
• The busses carrying correlated data should be
multiplexed together to further reduce switching
into the MUX/DEMUX logic.
Glitch Reduction by Pipelining
• Glitches are unwanted switching activities that
occur before a signal settles to its intended value.

• Pipelining is another technique that involves

introducing the registers in the middle of long
combinatorial paths.

• This adds latency but increases the speed and

reduces the levels of logic.

• The introduction of extra registers consumes

power but minimizes the glitches drastically.
Parallel architecture with voltage
reduction
• Used to improve the computation throughput of
high performance digital system.
• Uniprocessing system
• Parallel system
Uniprocessing system

Input Capacitance = C
Output
Processor Voltage = V
Frequency = f
Power = CV2f

• In a uniprocessing system, the power dissipation

will be given by:
• Puni = CV2f
Parallel Architecture

Capacitance =
Processor 2.2C
Voltage = 0.6V
Frequency = 0.5f
Output Power = 0.396CV2f
f/2
Input • The power dissipation of the
parallel system is:

Processor f • Ppar = 0.396CV2f

= 0.396Puni

• About 60% reduction in power

f/2
obtainable.

For N stage Parallelism :

2
V   f  Puni
Ppar ( nC )     2
 n  n n
Pipelined Architecture
• Capacitance = 1.2C

Register
Input • Voltage = 0.6V
Proc. Proc.
• Frequency = f
• Power = 0.432CV2f
f
• The power dissipation of
the pipelined system is:

For N stage Pipelining : • Ppip = 0.432CV2f

= 0.432Puni
2
V  Puni • About 60% reduction in
Ppip C   f  2 power obtainable.
 n n
Flow graph transformation
• This is a system level technique for the design of
special purpose DSP systems, which are
characterized by computation intensive data path
operations with simple control structures.
• This is also known as Control Data Flow Graph.
Control Data Flow Graph

• The graph consists of control nodes and data nodes

connected by edges.

• Control node change the flow of data that pass through it.
• Examples: multiplexers, condition selectors etc.

• Data nodes provide computation operators for the input

data streams such as addition, multiplication, shift etc.

• The graph edges represent the data streams of the

system.
Control Data Flow Graph (Cont..)

• A control data flow graph expresses the

conceptual algorithm of the system.

• The control data flow graph is often the starting

point to derive the actual hardware architecture
of a system by mapping the operators and edges
to actual hardware modules and busses
respectively.
Control Data Flow Graph
(Cont..)
• Draw a control flow graph of a system that
computes the equation
y n a n bn  3a n  1
y n a n bn  3a n  1
Hardware architecture
Operator Reduction
• Draw a control flow graph of a system that
computes the equation
• Y=AB+AC
Operator Reduction(Cont..)
• Draw a control flow graph of a system that
computes the equation
• Y=A(B+C)
Operator Reduction (Cont..)
Architecture and System

Slid
e 39
x 2  x1 sin   y1 cos 
( x1  y1 ) sin   y1 (cos  sin  )
y 2  x1 (1  sin  )  y1 sin 
 x1  ( x1  y1 ) sin 
Operator Reduction (Cont..)
Operator Reduction (Cont..)
Operator Reduction (Cont..)
Architecture and System

Slid
e 44
Loop Unrolling
• The technique of loop unrolling replicates the body of a loop some
number of times (unrolling factor u) and then iterates by step u
instead of step 1. This transformation reduces the loop overhead,
increases the instruction parallelism and improves register, data
cache or TLB locality.
for i = 2 to N - 2 step 2
for i = 2 to N - 1
A(i ) = A(i ) + A(i - 1) A(i + 1)
A(i ) = A(i ) + A(i - 1) A(i + 1)
A(i  1) = A(i  1) + A(i ) A(i + 2)

Loop overhead is cut in half because two iterations are performed in

each iteration.
If array elements are assigned to registers, register locality is improved
because A(i) and A(i +1) are used twice in the loop body.
Instruction parallelism is increased because the second assignment
can be performed while the results of the first are being stored and the
loop variables are being updated.
Loop Unrolling (IIR filter example)
loop unrolling : localize the data to reduce the activity of the inputs of the
functional units or two output samples are computed in parallel based on
two input samples.
Yn  1  X n  1  A Yn  2
Yn  X n  A Yn  1  X n  A ( X n  1  A Yn  2 )

Neither the capacitance switched nor the voltage is altered. However,

loop unrolling enables several other transformations (distributivity,
constant propagation, and pipelining). After distributivity and constant
propagation,
Yn  1  X n  1  A Yn  2
Yn  X n  A Yn  1  A2 Yn  2
I I R Filter
Loop Unrolling is a method to apply parallelism to the
computation.

Slid
e 47
Architecture and System

Slid
e 48
Loop Unrolling for Low Power
Loop Unrolling for Low Power
Loop Unrolling for Low Power
Effective Resource Utilization
7 S 7
S
+ + + +

D
D
5 1 2 6
+ + 1 2 6
5
+ D +
D Retiming
D
3 4 D

3 4
D

Before AFTER

CYCLE Multipliers
1 Adder Multipliers Adder
1 1, 3 - 2 8
2 2, 4 5 1 6
1 - 6, 8 3 7
1 - 7 4 5

Can reducd interconnect capacitance.

Jira Interview Questions and Answers
No ratings yet
Jira Interview Questions and Answers
75 pages
Power and Speed Trade-Offs in Data Path Structures Array Subsystems
100% (1)
Power and Speed Trade-Offs in Data Path Structures Array Subsystems
54 pages
2539
No ratings yet
2539
29 pages
Chapter 4 (1)
No ratings yet
Chapter 4 (1)
35 pages
Cmos Low Power
No ratings yet
Cmos Low Power
5 pages
Unit v -Sources of Power Dissipation
No ratings yet
Unit v -Sources of Power Dissipation
52 pages
Adopted From Low Power Design Essentials - Jan M. Rabaey
No ratings yet
Adopted From Low Power Design Essentials - Jan M. Rabaey
52 pages
Low Power Vlsi Design: Assignment-1 G Abhishek Kumar Reddy, M Manoj Varma
No ratings yet
Low Power Vlsi Design: Assignment-1 G Abhishek Kumar Reddy, M Manoj Varma
17 pages
eytu_lecture2-3
No ratings yet
eytu_lecture2-3
114 pages
Lecture13 03 PDF
No ratings yet
Lecture13 03 PDF
35 pages
Low Power Solutions
No ratings yet
Low Power Solutions
56 pages
3 Anandi
No ratings yet
3 Anandi
27 pages
Low-Power VLSI Design TOC
No ratings yet
Low-Power VLSI Design TOC
3 pages
30VLSI System Level
No ratings yet
30VLSI System Level
49 pages
Week 12 A
No ratings yet
Week 12 A
19 pages
Unit 5
No ratings yet
Unit 5
11 pages
Signal Processing (E.g. For Multimedia and Wireless Communications)
No ratings yet
Signal Processing (E.g. For Multimedia and Wireless Communications)
10 pages
Low Power Implem
No ratings yet
Low Power Implem
19 pages
Low Power VLSI Design
No ratings yet
Low Power VLSI Design
12 pages
Chapter Five
No ratings yet
Chapter Five
13 pages
Chapter 17: Low-Power Design: Keshab K. Parhi and Viktor Owall
No ratings yet
Chapter 17: Low-Power Design: Keshab K. Parhi and Viktor Owall
34 pages
LP Main
No ratings yet
LP Main
10 pages
Low Power Syntheis
100% (3)
Low Power Syntheis
18 pages
4.CMOS Power Consumption&Low Power Technique Final (1)
No ratings yet
4.CMOS Power Consumption&Low Power Technique Final (1)
46 pages
Lec 38
No ratings yet
Lec 38
31 pages
Designing For Low Power in Soc Projects
No ratings yet
Designing For Low Power in Soc Projects
14 pages
Low Power Design Methodologies and Flows
No ratings yet
Low Power Design Methodologies and Flows
52 pages
1 s2.0 0026269296000109 Main PDF
No ratings yet
1 s2.0 0026269296000109 Main PDF
14 pages
Chapter-4 Low Power Computing: Sources of Energy Consumptions
No ratings yet
Chapter-4 Low Power Computing: Sources of Energy Consumptions
3 pages
Low Power VLSI Design
No ratings yet
Low Power VLSI Design
6 pages
Zhou 2008
No ratings yet
Zhou 2008
7 pages
A Study of Low Power Design Techniques For Application Specific Processors
No ratings yet
A Study of Low Power Design Techniques For Application Specific Processors
2 pages
Kaxiras - Computer Architecture Techniques For Power Efficiency - 2008
No ratings yet
Kaxiras - Computer Architecture Techniques For Power Efficiency - 2008
219 pages
Dynamic Power Reduction WP
No ratings yet
Dynamic Power Reduction WP
6 pages
High-Level Power Analysis and Optimization
No ratings yet
High-Level Power Analysis and Optimization
185 pages
Lecture Notes: B.Tech
No ratings yet
Lecture Notes: B.Tech
68 pages
Rends and Challenges in Vlsi: BY: Bhanuteja Labishetty
No ratings yet
Rends and Challenges in Vlsi: BY: Bhanuteja Labishetty
35 pages
CHAPTER Five
No ratings yet
CHAPTER Five
26 pages
Low Power Vlsi Design1
No ratings yet
Low Power Vlsi Design1
23 pages
Unit 5 digi
No ratings yet
Unit 5 digi
73 pages
Low Power Design: Dr. Paul D. Franzon
No ratings yet
Low Power Design: Dr. Paul D. Franzon
16 pages
LPV_04
No ratings yet
LPV_04
110 pages
Q Electrical Dinamic Power
No ratings yet
Q Electrical Dinamic Power
8 pages
Lecture 24
No ratings yet
Lecture 24
21 pages
lpvd u3
No ratings yet
lpvd u3
10 pages
Power aware Architecture
No ratings yet
Power aware Architecture
46 pages
11 - Chepter 3 PDF
No ratings yet
11 - Chepter 3 PDF
17 pages
JNTUA Low Power VLSI Circuits & Systems Notes - R15
No ratings yet
JNTUA Low Power VLSI Circuits & Systems Notes - R15
68 pages
U I - Lecture 4 Basic Principle of Low Power Design
50% (2)
U I - Lecture 4 Basic Principle of Low Power Design
17 pages
Lecture1 ch1 Fundamentals of Quantitative Design and Analysis
No ratings yet
Lecture1 ch1 Fundamentals of Quantitative Design and Analysis
28 pages
File 1501
No ratings yet
File 1501
31 pages
Power Analysis Methodology and Objectives for TI wireless platform PDF
No ratings yet
Power Analysis Methodology and Objectives for TI wireless platform PDF
19 pages
Vlsi Interview Questions
No ratings yet
Vlsi Interview Questions
26 pages
Low Power Design of Digital Systems
No ratings yet
Low Power Design of Digital Systems
28 pages
Low Power Design Techniques and Implementation Strategies Adopted in VLSI Circuits
No ratings yet
Low Power Design Techniques and Implementation Strategies Adopted in VLSI Circuits
4 pages
LP VLSI Syllabus
No ratings yet
LP VLSI Syllabus
2 pages
Low Power Design Techniques and Power Dissipation Management
No ratings yet
Low Power Design Techniques and Power Dissipation Management
11 pages
LPVD U1,2
No ratings yet
LPVD U1,2
34 pages
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
From Everand
Analog Dialogue, Volume 45, Number 4: Analog Dialogue, #4
Analog Dialogue
No ratings yet
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
From Everand
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 2
Kerwin Mathew
No ratings yet
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 1
From Everand
Reference Guide To Useful Electronic Circuits And Circuit Design Techniques - Part 1
Kerwin Mathew
2.5/5 (3)
8.Algorithm & architectural level methodologies
No ratings yet
8.Algorithm & architectural level methodologies
53 pages
7.Low Power Clock Distribution
No ratings yet
7.Low Power Clock Distribution
99 pages
LPV_07
No ratings yet
LPV_07
67 pages
LPV_05
No ratings yet
LPV_05
84 pages
AP-MTT Society Chapter Details-Bangalore Section
No ratings yet
AP-MTT Society Chapter Details-Bangalore Section
1 page
Untitled
100% (1)
Untitled
321 pages
Figma Notes
No ratings yet
Figma Notes
2 pages
Fronius Single Interface Devicnet R-J3iB and Higher
No ratings yet
Fronius Single Interface Devicnet R-J3iB and Higher
34 pages
Introduction To Computer Software
No ratings yet
Introduction To Computer Software
4 pages
2022 - Model-Free Repetitive Control Design and Implementation For Dynamical Galvanometer-Based Raster Scanning
No ratings yet
2022 - Model-Free Repetitive Control Design and Implementation For Dynamical Galvanometer-Based Raster Scanning
11 pages
BS-480 - Serial Port Card Installation Guidance - V1.0 - EN
No ratings yet
BS-480 - Serial Port Card Installation Guidance - V1.0 - EN
8 pages
UTU Summer Date Sheet For B.Tech
No ratings yet
UTU Summer Date Sheet For B.Tech
8 pages
Parallelism strategies in Machine Learning, get the Free cheat sheet __-2
No ratings yet
Parallelism strategies in Machine Learning, get the Free cheat sheet __-2
32 pages
Unit5 - Data Compression and Cryptography
No ratings yet
Unit5 - Data Compression and Cryptography
59 pages
E Tech Presentation
No ratings yet
E Tech Presentation
16 pages
Certif Eddy
No ratings yet
Certif Eddy
7 pages
Big Data Manual - Edited
No ratings yet
Big Data Manual - Edited
69 pages
iRAC5000 Seies Controller Self Diagnostics
No ratings yet
iRAC5000 Seies Controller Self Diagnostics
10 pages
Blackbook 2023
No ratings yet
Blackbook 2023
4 pages
Unit 5 Introduction To A Text Editor
0% (1)
Unit 5 Introduction To A Text Editor
7 pages
MCA Syallbus 2020-21
No ratings yet
MCA Syallbus 2020-21
58 pages
Solved Unit 1 Q-Bank
No ratings yet
Solved Unit 1 Q-Bank
19 pages
Exam Seat Allocation-Report
No ratings yet
Exam Seat Allocation-Report
35 pages
Microsoft-365-F5
No ratings yet
Microsoft-365-F5
1 page
Competency Based Learning Material
33% (3)
Competency Based Learning Material
52 pages
Register and Memory Package
No ratings yet
Register and Memory Package
33 pages
Microprocessor and Interfacing
No ratings yet
Microprocessor and Interfacing
4 pages
Python Numbers
No ratings yet
Python Numbers
1 page
Soundgrid Studio Manual
No ratings yet
Soundgrid Studio Manual
123 pages
DigitalForensics Book
No ratings yet
DigitalForensics Book
3 pages
Viper Steel DDR4 Performance Memory DRAM
No ratings yet
Viper Steel DDR4 Performance Memory DRAM
2 pages
TVL-ICT-CSS-11-Q3_ICCS-Week-5-6
No ratings yet
TVL-ICT-CSS-11-Q3_ICCS-Week-5-6
9 pages
11.Exception Handling in python.docx
No ratings yet
11.Exception Handling in python.docx
9 pages
Chapter 4 Advanced CSS
No ratings yet
Chapter 4 Advanced CSS
70 pages

LPV_06

Uploaded by

LPV_06

Uploaded by

Low Power Design

Low power Architecture & Systems: Power &

• Downsizing, lowering the supply on the critical path will lower

Maximize throughput for given energy or

• Reduced supply voltage

• In NAP mode, cache is also turned off to conserve

• In SLEEP mode, the entire processor clock may be

No power 2.18 W 2.54W

DOZE 307mW 366mW

• The quality or order length requirement of a

• Some of the techniques available in the reducing

• Pipelining is another technique that involves

• This adds latency but increases the speed and

• The introduction of extra registers consumes

• In a uniprocessing system, the power dissipation

Processor f • Ppar = 0.396CV2f

• About 60% reduction in power

For N stage Parallelism :

For N stage Pipelining : • Ppip = 0.432CV2f

• The graph consists of control nodes and data nodes

• Data nodes provide computation operators for the input

• The graph edges represent the data streams of the

• A control data flow graph expresses the

• The control data flow graph is often the starting

Loop overhead is cut in half because two iterations are performed in

Neither the capacitance switched nor the voltage is altered. However,

Can reducd interconnect capacitance.

You might also like