
Clock Concurrent Optimization

Paul Cunningham, Marc Swinnen, Steev Wilcox

Electronic Design Processes

April 10, 2009

© Azuro, Inc. 2009 1

The Clock Timing Gap


Traditional Design Flows

Flow: RTL → Synthesis → Initial Placement → Physical Optimization → CTS → Post-CTS Optimization → Routing → Post-Route Optimization → Final Layout

– Pre-CTS steps: chip speed measured using “ideal” clocks
– Post-CTS steps: chip speed measured using “propagated” clocks

TRADITIONAL PURPOSE OF CTS:

Make propagated clocks look like ideal clocks by building “balanced” clock networks


Reality Today

Flow: RTL (e.g. Verilog) → Synthesis → Initial Placement → Physical Optimization → CTS → Post-CTS Optimization → Routing → Post-Route Optimization → Final Layout

– Pre-CTS steps: Ideal Timing
– Post-CTS steps: Propagated Timing
– BIG DIFFERENCE!! between the two at CTS
– MANY ITERATIONS

Sources of the difference:
– Clock Gating
– Clock Muxing
– Clock Generators (especially for hold)
– Complex Scan Chains
– OCV derates and CPPR
– Multi-corner (especially for hold)
– Multi-mode


Technology Trends
Opening the Clock Timing Gap


Trends Driving the Clock Timing Gap

[Figure: flip-flops A, B and C on a clock of period T. Clock paths are 2X to 5X the clock period, with OCV ± 10% on each branch. CPPR differs per pair of flip-flops (CPPR_AB, CPPR_BC). Data path delay D ≈ clock period. “Skew” does not include OCV effects.]

Traditional Optimization
– D < T - skew

OCV
– OCV affects each pair of FFs differently (CPPR)
– OCV effect can be very big - e.g. 10% of 3T
– CTS cannot predict OCV impact
– So, “skew=0” does not mean FFs are really balanced
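The OCV point above can be illustrated with a small calculation. This is a minimal sketch, not the deck's own model: it assumes the launch clock path is derated late and the capture clock path is derated early, that CPPR removes the derate on the shared (common) part of the two clock paths, and it ignores setup time. The function name `setup_slack` and all parameter names are hypothetical.

```python
def setup_slack(period, data_delay, launch_ins, capture_ins, common, derate):
    """Setup slack for one FF pair under a simple OCV model (assumed, not from the slides).

    launch_ins / capture_ins: nominal clock insertion delay to the launch / capture FF.
    common: part of both clock paths that is shared (no derate applied there, as with CPPR).
    derate: OCV fraction, e.g. 0.10 for +/- 10%.
    """
    launch_late = common + (launch_ins - common) * (1 + derate)
    capture_early = common + (capture_ins - common) * (1 - derate)
    return period + capture_early - launch_late - data_delay


T = 1.0          # clock period
D = 0.9 * T      # data path delay, close to the clock period
INS = 3.0 * T    # nominal insertion delay of 3T to both FFs, so nominal skew is 0

# Both FFs share the entire clock path: OCV has no effect, slack matches ideal timing.
shared = setup_slack(T, D, INS, INS, common=INS, derate=0.10)

# The FFs share none of the clock path: same "skew = 0", very different slack.
disjoint = setup_slack(T, D, INS, INS, common=0.0, derate=0.10)

print(f"fully shared clock path: slack = {shared:+.2f} T")
print(f"disjoint clock paths:    slack = {disjoint:+.2f} T")
```

Both pairs report zero nominal skew, yet only the pair with a shared clock path meets timing, which is why a skew number alone cannot stand in for propagated timing with OCV.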


Trends Driving the Clock Timing Gap

[Figure: a clock of period T driving clock gates (CG) in a clock tree 2T … 5T deep, with an enable path into a CG and the resulting CG offset. “Skew” does not include gate offsets.]

Traditional Optimization
– D < T - skew

Clock Gating
– Clock gates are supposed to have a very big skew
– Traditional optimization tries to prevent this by ‘cloning’ the gates and pushing them down the tree
– Traditional approach cannot correctly optimize or time CG enable paths


Trends Driving the Clock Timing Gap

[Figure: clocks Clk-A, Clk-B, Clk-C and Clk-D (period T), with an AOI2 gate in the clock network, driving FF groups of 10,000, 2,000, 1,000, 1,000 and 500 FFs. Are all FFs “balanced”? “Skew” does not include interclock skew.]

Traditional Optimization
– D < T - skew

Clock Complexity
– Clock balancing becomes very difficult, or even theoretically impossible
– Requires extensive manual intervention
– Final clock implementation is very different from original, ideal assumptions


The Clock Timing Gap is Growing

Propagated clocks timing and ideal clocks timing are diverging

The clock timing gap is growing exponentially

[Figure: distribution of paths comparing the Pre-CTS Timing Report with the Post-CTS Timing Report. X axis: Difference in Pre- to Post-CTS Timing (% of period T). Y axis: Number of Paths.]
– 180nm, σ = 7% of T
– 65nm, σ = 27% of T
– 45nm, σ = 50% of T



Ideal vs. Propagated Clocks Timing Gap

Difference between ideal and propagated timing across 60 chips
– Top 10% worst violating paths
– Difference measured as a %age of clock period

[Figure: bar chart. X axis: process node (180nm, 130nm, 65nm, 45nm). Y axis: difference as % of clock period, 0% to 60%. Series: reg-to-reg, reg-to-cg, inter-clock, with ocv.]


Key Limitation of Traditional Flows

Flow: RTL → Synthesis → Initial Placement → Physical Optimization → CTS → Post-CTS Optimization → Routing → Post-Route Optimization → Final Layout

– “Ideal clocks” (pre-CTS): big decisions about chip speed vs. area vs. power made here using ideal clocks
– CTS: two worlds tearing apart (more than 50% at 40nm!!)
– “Propagated clocks” (post-CTS): downstream steps don’t have the freedom to correct all the mistakes made pre-CTS in the flow


The Key Problems

Physical timing optimization today is all based on ideal clocks timing

– Timing opt is based on wrong information (like wire load models in the past)
– Cannot see the real timing situation

Clock balancing is not achievable, not necessary, and not helpful

– Even if CTS skew=0, Propagated timing ≠ Ideal timing
– Clock balancing imposes severe restrictions on timing optimization – for no benefit


Solution: Clock Concurrent Optimization

Flow: RTL → Synthesis → Initial Placement → Clock Concurrent Optimization → Routing → Post-Route Optimization → Final Layout

– Pretend clocks (“ideal clocks”): up to Initial Placement
– Clock Concurrent Optimization: build clocks and optimize logic at the same time
– Real clocks (“propagated clocks”): from Clock Concurrent Optimization onward


Clock Concurrent Optimization


Clock Concurrent Technology

Extend physical optimization into the clocks: more degrees of freedom

Traditional Physical Optimization (clock period T)
– Gmax < T - skew
– Gmax is variable; T and skew are fixed

Clock Concurrent Optimization (clock period T)
– L + Gmax < T + C
– L, Gmax and C are variable; T is fixed
– L and C label the two clock paths in the figure
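The two inequalities on this slide are related by a one-line rearrangement. The slide does not define its symbols, so this sketch assumes that L and C are the delays of the launch and capture clock paths, and that skew is L − C:

```latex
\begin{align*}
L + G_{\max} &< T + C \\
G_{\max} &< T - (L - C) \\
G_{\max} &< T - \text{skew}, \qquad \text{skew} := L - C
\end{align*}
```

Read this way, the traditional constraint is the clock concurrent constraint with L − C frozen at whatever CTS produced, leaving Gmax as the only quantity the optimizer may change.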

Time Borrowing in Clock Concurrent Opt.

[Figure: registers on a common clock, with slack moving between adjacent stages]

Using CC-Opt, slack can flow across register boundaries
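A small numeric sketch of slack flowing across a register boundary, using the constraint L + Gmax < T + C from the previous slide. The function `stage_slacks`, the two-stage pipeline, and its delay values are hypothetical illustrations; setup time, hold checks, and OCV are ignored.

```python
from typing import List


def stage_slacks(period: float, stage_delays: List[float],
                 clock_latencies: List[float]) -> List[float]:
    """Setup slack of each stage, computed as T + C - L - G.

    stage_delays[i] is the logic delay G between register i and register i+1.
    clock_latencies[i] is the clock arrival time at register i, so it acts as
    L for stage i and as C for stage i-1.
    """
    if len(clock_latencies) != len(stage_delays) + 1:
        raise ValueError("need one clock latency per register")
    return [
        period + clock_latencies[i + 1] - clock_latencies[i] - g
        for i, g in enumerate(stage_delays)
    ]


T = 10.0
delays = [12.0, 7.0]  # first stage is too slow, second stage has spare time

# Balanced clocks: every register sees the clock at the same time.
balanced = stage_slacks(T, delays, [0.0, 0.0, 0.0])

# Delay the middle register's clock by 2.5: slack moves from stage 2 to stage 1.
shifted = stage_slacks(T, delays, [0.0, 2.5, 0.0])

print("balanced clocks:", balanced)
print("shifted clock:  ", shifted)
```

With balanced clocks the first stage fails by 2.0 while the second has 3.0 to spare; shifting the middle clock leaves both stages with 0.5 of positive slack. The total slack across the two stages is unchanged, it has only been redistributed.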

Logic Chains Limit Time Borrowing

[Figure: a Looping Chain and an IO Chain on a common clock]

Speed is Not Limited by the Critical Path

The “critical path” does NOT limit the chip speed

CC-Opt can easily move slack along a chain to where it is needed

critical path

slack

CC-Opt will optimize “non-critical” paths to create spare slack

Speed is Limited by the Critical Chain

The “CRITICAL CHAIN” is the focus of CC-Opt

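– Critical chain is the chain with the longest delay/stage

[Figure: logic delay of each stage in two chains]

– Chain containing the traditional critical path: stage delays 11, 9, 19, 8, 13. Delay/Stage = (11+9+19+8+13)/5 = 12
– Critical chain: stage delays 15, 16, 11. Delay/Stage = (15+16+11)/3 = 14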
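The delay/stage comparison can be reproduced in a few lines. The stage delays are the ones on the slide; the dictionary keys and function names are hypothetical labels chosen for this sketch.

```python
from typing import Dict, List


def delay_per_stage(stage_delays: List[float]) -> float:
    """Average logic delay per stage of a chain."""
    return sum(stage_delays) / len(stage_delays)


def critical_chain(chains: Dict[str, List[float]]) -> str:
    """Name of the chain with the largest delay per stage."""
    return max(chains, key=lambda name: delay_per_stage(chains[name]))


chains = {
    "chain_with_critical_path": [11, 9, 19, 8, 13],
    "critical_chain": [15, 16, 11],
}

for name, delays in chains.items():
    print(f"{name}: worst stage = {max(delays)}, "
          f"delay/stage = {delay_per_stage(delays)}")

print("limits speed:", critical_chain(chains))
```

The first chain holds the single slowest stage (19) but averages 12 per stage, while the second chain never exceeds 16 yet averages 14, so it is the second chain that bounds the clock period once slack can move along a chain.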

CC-Opt Benefits

Summary of CC-Opt

Flow: RTL (e.g. Verilog) → Synthesis → Initial Placement → Clock Concurrent Optimization → Routing → Post-Route Optimization → Final Layout

– Ideal Timing up to Initial Placement; Propagated Timing from Clock Concurrent Optimization onward

Build clocks directly for timing not skew balancing
– Consider setup and hold timing
– Understand OCV timing
– Understand clock gate timing
– Understand clock mux timing
– Understand clock generator timing
– Understand multi-corner
– Understand multi-mode

Eliminate need to configure any skew groups
– Skew groups are just a work-around for a broken flow!

Key Benefits of CC-Opt.
Up to 20% increase in clock speed
– Fundamentally more degrees of freedom during optimization
All the benefits of useful skew and more!
– Directly targets propagated timing

Accelerated timing closure

– No requirement to configure any skew groups
– Automatically handles clock muxing, clock gating, clock generators,
OCV, multi-corner (setup & hold), and multi-mode

Reduced iterations to the frontend

– No need to manually “retime” logic across register boundaries

Reduced IR-drop
– Clocks are not balanced!

Reduced power
– Clock buffers are only used where it is necessary for timing

Rubix™ - An Implementation

Rubix™ Flow and Key Features

Flow: RTL → Synthesis → Placement → Phys. Opt. → CTS → Post-CTS Opt. → Routing → PRO → GDSII

– RUBIX™ spans the Phys. Opt., CTS and Post-CTS Opt. steps
– In: Verilog netlist, Placed DEF, SDC
– Out: Verilog netlist, Placed DEF

Full industry standard STA
– SDC constraint format
– Multi-corner and multi-mode
– OCV derates and CPPR

Global routing
– Ability to export “route guides”

Physical Optimization
– Timing-driven incremental placement
– Timing-driven high-fanout net buffering
– Cell sizing and logic transformations
– Legalization

Clocks
– Comprehensive skew group support (can mix and overlap with timing windows based CTS)

Multi-voltage
– Clock buffering and net buffering across voltage islands

Timing driven scan-chain reordering
– Setup and hold aware

Thanks!

For more information see CC-Opt White Paper at

www.azuro.com
