Results 1 -
6 of
6
An approach to placement-coupled logic replication”, DAC
, 2004
"... We present a set of techniques for placement-coupled, timingdriven logic replication. Two components are at the core of the approach. First is an algorithm for optimal timingdriven fanin tree embedding; the algorithm is very general in that it can easily incorporate complex objective functions (e.g. ..."
Abstract
-
Cited by 18 (2 self)
- Add to MetaCart
We present a set of techniques for placement-coupled, timingdriven logic replication. Two components are at the core of the approach. First is an algorithm for optimal timingdriven fanin tree embedding; the algorithm is very general in that it can easily incorporate complex objective functions (e.g., placement costs) and can perform embedding on any graph-based target. Second we introduce the Replication Tree which allows us to induce large fanin trees from a given circuit which can then be optimized by the embedder. We have built an optimization engine around these two ideas and report promising results for the FPGA domain including clock period reductions of up to 36 % compared with a timing-driven placement from VPR [12] and almost double the average improvement of local replication from [1]. These results are achieved with modest area and runtime overhead.
Complexity Issues in Gate Duplication
- IN WORKSHOP NOTES, INTERNATIONAL WORKSHOP ON LOGIC SYNTHESIS
, 2000
"... In this paper we address the complexity issues associated with gate duplication for delay optimization. Gate duplication for general circuits has been proved NP-Complete [1]. In this paper we show that even the local delay optimization by gate duplication is NPComplete. Local fanout optimization (bu ..."
Abstract
-
Cited by 4 (4 self)
- Add to MetaCart
In this paper we address the complexity issues associated with gate duplication for delay optimization. Gate duplication for general circuits has been proved NP-Complete [1]. In this paper we show that even the local delay optimization by gate duplication is NPComplete. Local fanout optimization (buffer insertion) for fixed net topology can be solved in polynomial time [9]. Even the global fanout optimization problem has polynomial time complexityifall the pin to pin parameters of a gate are the same and the topology of all the nets is fixed [13]. Hence weshow that gate duplication is much harder than buffer insertion.
VLSI CAD Tool Protection by Birthmarking Design Solutions
- 15th IEEE /ACM Great Lakes Symposium on VLSI (GLSVLSI’05
, 2005
"... Many techniques have been proposed in the past for the protection of VLSI design IPs (intellectual property). CAD tools and algorithms are intensively used in all phases of modern VLSI designs; however, little has been done to protect them. Basically, given a problem P and a solution S, we want to b ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Many techniques have been proposed in the past for the protection of VLSI design IPs (intellectual property). CAD tools and algorithms are intensively used in all phases of modern VLSI designs; however, little has been done to protect them. Basically, given a problem P and a solution S, we want to be able to determine whether S is obtained by a particular tool or algorithm. We propose two techniques that intentionally leave some trace or birthmark, which refers to certain easy detectable properties, in the design solutions to facilitate CAD tool tracing and protection. The pre-processing technique provides the ideal protection at the cost of losing control of solution’s quality. The post-processing technique balances the level of protection and design quality. We conduct a case study on how to protect a timing-driven gate duplication algorithm. Experimental results on a large set of MCNC benchmarks confirm that the pre-processing technique results in a significant reduction (about 48%) of the optimization power of the tool, while the post-processing technique has almost no penalty (less than 2%) on the tool’s performance.
Algorithms for Simultaneous Consideration of Multiple Physical Synthesis Transforms for Timing Closure ∗
"... We propose a post-placement physical synthesis algorithm that can apply multiple circuit synthesis and placement transforms on a placed circuit to improve the critical path delay under area constraints by simultaneously considering the benefits and costs of all transforms (as opposed to considering ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
We propose a post-placement physical synthesis algorithm that can apply multiple circuit synthesis and placement transforms on a placed circuit to improve the critical path delay under area constraints by simultaneously considering the benefits and costs of all transforms (as opposed to considering them sequentially after applying each transform). The circuit transforms we employ include, but are not limited to, incremental placement, two types of buffer insertion, cell resizing and cell replication. The problem is modeled as a min-cost network flow problem, in which nodes represent circuit transform options. By carefully determining the structure of the network graph and the cost of each arc, a set of near-optimal transform options can be obtained as those whose corresponding nodes in the network graph have the min-cost flow passing through them. We also tie the transform selection network graph to a detailed placement network graph with TD arc costs for cell movements. This enables our algorithms to incorporate considerations of detailed placement cost for each synthesis transform along with the basic cost of applying the transform in the circuit. We have tested our algorithms on three sets of benchmarks under 3-10 % area increase constraints, and obtained up to 48 % and an average of 27.8 % timing improvement. Our average improvement is relatively 40 % better (8.2 % better by an absolute measure) than applying the same set of transforms in a good sequential order that is used in many current techniques. Considering only synthesis transforms (no replacement), our technique is relatively 50 % better than the sequential approach. 1
Addressing the Effects of Reconvergence on Placement-Coupled Logic Replication
"... We present a general and robust approach to timing-driven, placement-coupled logic replication. The approach is similar in nature to [7] and it is designed to address issues that arise due to the reconvergence in the circuit specification. We build on the Replication Tree idea and modify the timing- ..."
Abstract
- Add to MetaCart
We present a general and robust approach to timing-driven, placement-coupled logic replication. The approach is similar in nature to [7] and it is designed to address issues that arise due to the reconvergence in the circuit specification. We build on the Replication Tree idea and modify the timing-driven fanin tree embedding algorithm to optimize sub-critical paths. We have built optimization engine for the FPGA domain and report promising preliminary results including clock period reduction of up to 38 % compared with a timing-driven placement from VPR [14] and up to 9 % compared to RT-Embedding replication algorithm from [7].
Broadening the Scope of Multi-Objective Optimizations in Physical Synthesis of Integrated Circuits
, 2010
"... would not have been possible without the immeasurable self-sacrifice of my perfect wife, Amy. She has worked day and night by my side for years to make our home and family prosperous. I love you very much. Our two beautiful sons George and Victor have brought me indescribable joy and gave me hope fo ..."
Abstract
- Add to MetaCart
would not have been possible without the immeasurable self-sacrifice of my perfect wife, Amy. She has worked day and night by my side for years to make our home and family prosperous. I love you very much. Our two beautiful sons George and Victor have brought me indescribable joy and gave me hope for the future when it seemed all was lost. I love you two in ways I never thought possible. I am eternally grateful to her for the faith she has placed in me. I will do everything I can to reward her investment. I am also deeply indebted to her parents Ren Fang Zhang and Yue Xia Gong who have come from their home in China to live with us and help raise our babies. Without them, I don’t know how it would be possible for me to balance graduate school, a full-time job, and a new family. I will be sorry when they return home. My advisor, Professor Igor Markov, has also poured an incredible amount of work into training me to be capable of writing this dissertation. He has defended me when it was not convenient, supported me when it seemed hopeless, and never gave up on me until the task was complete. I am grateful for all of his efforts as well as all of the opportunities and second chances he has given me. I truly hope it has been as worth it for him as it has been

