"Optimizing for Both Area and Timing: An Introduction to Techniques in Logic Synthesis"


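This article presents techniques for area optimization in logic synthesis and explains how area optimization and timing closure can be achieved together. The author points out that area optimization benefits a design in many ways, including improved timing. No single "magic switch" yields the smallest design that meets timing, but a combination of factors can produce good results.

The author first outlines several key steps commonly used in logic synthesis: logic synthesis, technology mapping, and place and route. Logic synthesis converts a high-level logic description into a simplified gate-level representation, technology mapping maps those gates onto the cells of an actual physical library, and place and route determines where the gates sit and how they are connected.

One key factor in area optimization is gate selection. Different gates have different area and timing characteristics, so choosing suitable gates gives better area results. The author recommends weighing area, timing, power and other factors together when selecting gates, and evaluating the trade-offs with appropriate tools and methods.

Another important technique is re-encoding the logic circuit. The author explains the principle and method of re-encoding and notes that it can reduce the gate count, and therefore the area, by assigning the input and output signals of the gates more compactly and efficiently.

The author also proposes using timing constraints to optimize area. Setting timing constraints sensibly restricts and adjusts the timing paths in the circuit, which allows further area optimization. Timing-path optimization is achieved through repeated iteration and experimentation, and requires experience and deep understanding from the designer.

Finally, the author stresses the relationship between area optimization and timing closure. Although the two are usually regarded as opposing goals, they are in fact related: area optimization can improve timing by reducing the gate count and optimizing timing paths. In summary, by jointly considering gate selection, re-encoding and timing constraints, designers can reduce circuit area while finding the best balance between area and timing requirements.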

SNUG San Jose 2001

Have Your Cake And Eat It Too: How To Optimize For Area AND Timing

4.2. Do Not Over-Constrain

Another key methodology issue involves the practice of over-constraining a design. Over-constraining consumes area by causing designs to be overbuilt, using higher than necessary drive strengths on cells and forcing more area-hungry implementations of DesignWare components. Realistic and successively refined constraints are much more area friendly. Default constraints, which should be built into the scripts and methodology, should reflect reasonable input and output delays, input drivers and output loading. For example, use the “Q” output pin of a 1X drive flop as the default driving cell pin, and 4 times the load of the “A” input pin of a 1X drive NAND gate as the default output load for a block. For a default input delay, allow for the clock-to-Q of the driving flop plus a small wire delay. For a default output delay, allow for the setup time of the receiving flop plus a small wire delay. Use virtual clocks to constrain the I/Os of combinatorial blocks, or preferably ungroup combinatorial blocks into their parent designs. Combinatorial and “snaking” paths, and possibly some sequential paths, will be under-constrained at first. This would cause great difficulty in a single-pass methodology. By using a multi-pass methodology, however, the constraints will be refined in subsequent passes without over-constraining the entire design.
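The default constraints described above might be scripted roughly as follows. This is only a sketch in dc_shell Tcl syntax: the library name, cell names, clock port name, clock period and delay numbers are all hypothetical placeholders, not values from the paper, and must be replaced with figures from the actual technology library.

```tcl
# Sketch of default block-level constraints (all names and numbers are assumptions).
set LIB        my_tech_lib   ;# hypothetical technology library name
set FLOP_1X    DFFX1         ;# hypothetical 1X drive flop
set NAND_1X    NAND2X1       ;# hypothetical 1X drive NAND gate
set CLK_PERIOD 10.0          ;# assumed clock period
set CLK_TO_Q   0.5           ;# assumed clock-to-Q of the driving flop
set SETUP      0.3           ;# assumed setup time of the receiving flop
set WIRE       0.1           ;# assumed small wire delay

# Assumes the block has a clock port named "clk".
create_clock -name clk -period $CLK_PERIOD [get_ports clk]
set data_inputs [remove_from_collection [all_inputs] [get_ports clk]]

# Default input driver: the Q pin of a 1X flop.
set_driving_cell -lib_cell $FLOP_1X -pin Q $data_inputs

# Default output load: 4 times the A input pin load of a 1X NAND gate.
set_load [expr {4 * [load_of ${LIB}/${NAND_1X}/A]}] [all_outputs]

# Default I/O delays: clock-to-Q or setup time, plus a small wire delay.
set_input_delay  [expr {$CLK_TO_Q + $WIRE}] -clock clk $data_inputs
set_output_delay [expr {$SETUP + $WIRE}]    -clock clk [all_outputs]

# Variant for a purely combinatorial block, which has no clock port:
# create_clock -name vclk -period $CLK_PERIOD   ;# virtual clock, no source object
# set_input_delay  [expr {$CLK_TO_Q + $WIRE}] -clock vclk [all_inputs]
# set_output_delay [expr {$SETUP + $WIRE}]    -clock vclk [all_outputs]
```

Keeping these defaults in one shared script means every block starts from the same realistic assumptions, and later passes only need to override the ports whose real budgets are known.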

4.3. Margin With Design Rules

Traditionally, over-constraining was done to minimize the timing losses incurred by layout tools that were not timing savvy. With the present widespread use of timing-driven layout tools, timing accuracy (especially with respect to predicted clock insertion delay and skew) can be much more important than timing margin. It is still advisable to add margin, but with design rules instead of timing constraints. One example is to tighten the max_transition value before layout, and relax it to an acceptable value after layout. Another example is to use a max_fanout of 10 to 20 on compiled designs to reduce unexpected timing swings in layout. A third example is to set a max_fanout on all the input ports of compiled blocks. The max_fanout constraints on inputs and designs combine to prevent high fanout loading surprises further up the design hierarchy, and limit the amount of buffering to be performed by the timing-driven layout tools. I have found an input max_fanout value of one to be quite effective, and this constraint can sometimes be lifted on the last pass in a multi-pass compile strategy. Any resulting unnecessary buffering will be optimized away by the core-level incremental optimization in synthesis. Once again, these design-rule-based margining techniques should be built into the scripts and methodology.
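As a rough illustration, the design-rule margins described in this section could be expressed in dc_shell Tcl as shown below. The transition values are assumed placeholders (the paper gives no numbers for max_transition), and the design-level fanout of 16 is simply one value picked from the 10 to 20 range mentioned in the text.

```tcl
# Sketch of design-rule-based margining (numeric values are assumptions).

# Before layout: a tightened transition limit on the current design.
set_max_transition 0.8 [current_design]

# Fanout limit on the compiled design, chosen from the 10 to 20 range.
set_max_fanout 16 [current_design]

# Fanout limit of one on every input port of the block.
set_max_fanout 1 [all_inputs]

# After layout: relax the transition limit to the acceptable value.
# set_max_transition 1.2 [current_design]
```

Because these are design rules and not timing constraints, they add robustness against layout surprises without telling the optimizer that the paths themselves are slower than they really are.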

4.4. Use Selective Ungrouping

Yet another key methodology issue involves the use of selective ungrouping and set_dont_touch on compiled designs. Very small designs, especially purely combinational ones, should be ungrouped into their parent blocks whenever possible. Compile-generated hierarchy, including DesignWare and MUX_OP components, should also be ungrouped. Ungrouping these components removes boundary conditions, allowing further area optimization. Ungrouping before the compile can result in longer run times, since synthesis has to look at more of the design at once. Compiling hierarchically can have a runtime advantage, but consumes more area and also requires hierarchical saves, leading to multiple versions of the same leaf block throughout the design. To obtain the best of both worlds, load the parent and leaf blocks and then compile
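The ungrouping and set_dont_touch steps named in this section can be sketched in dc_shell Tcl as follows. The design and instance names are hypothetical, and this fragment does not attempt to reproduce the compile strategy that the truncated sentence above goes on to describe; it only shows the individual commands under the assumption that the leaf block contains no user hierarchy of its own.

```tcl
# Sketch of selective ungrouping (all design and instance names are hypothetical).

# Compile a leaf block, then flatten the hierarchy generated during compile
# (for example DesignWare and MUX_OP components) inside it.
current_design leaf_block
compile
ungroup -all -flatten

# In the parent, fold a very small combinational sub-block into the parent
# so that its boundary no longer restricts optimization.
current_design parent_block
ungroup [get_cells u_small_decode]

# Protect the already compiled leaf instance from being re-optimized.
set_dont_touch [get_cells u_leaf_block]
compile
```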
