Timing-Constrained I/O Buffer Placement for Flip- Chip Designs

Size: px
Start display at page:

Download "Timing-Constrained I/O Buffer Placement for Flip- Chip Designs"

Transcription

1 Timing-Constrained I/O Buffer Placement for Flip- Chip Designs Zhi-Wei Chen 1 and Jin-Tai Yan 2 1 College of Engineering, 2 Department of Computer Science and Information Engineering Chung-Hua University, Hsinchu, Taiwan, R.O.C {m9423, yan}@chu.edu.tw Astract Due to inappropriate assignment of ump pads or improper placement of I/O uffers, the configured delays of I/O signals may not satisfy the timing requirement inside die core. In this paper, the prolem of timing-constrained I/O uffer placement in an area-io flip-chip design is firstly formulated. Furthermore, an efficient two-phase approach is proposed to place I/O uffers onto feasile uffer locations etween I/O pins and ump pads with the consideration of the timing constraints. Compared with Peng s SA-ased approach[7], with no timing constraint, our approach can reduce 71.82% of total wirelength and 55.74% of the maximum delay for 7 tested cases on the average. Under the given timing constraints, our result otains higher timing-constrained satisfaction ratio(tcsr) than the SAased approach[7]. I. INTRODUCTION With the increase of circuit complexity and the decrease of feature size, the dramatic demand for I/O counts ecomes a significant issue in VLSI designs. An advanced packaging technology, flip-chip package[1], is introduced to meet the higher integration density and the larger I/O counts in modern VLSI designs. In general, a flip-chip design as shown in Fig. 1 descries the methodology of electrically connecting the die core to the package carrier ased on the technology of the flipchip package. Because of the reduced signal inductance and package footprint, the flip-chip technology has een widely used in high-performance circuit designs. However, the placement of I/O uffers is not exactly mapped onto the locations of ump pads in an area-i/o flip-chip design. Hence, an extra metal layer, Re-Distriution Layer(RDL), is used to redistriute the connections from I/O uffers on the die to the ump pads on the RDL. Fig. 1. Flip Chip Die Package Build-up Layer Package Core I/O Buffer Package Build-up Layer PCB Die interconnect RDL Bump Pad Bump Ball Package Ball Connection of IO signal in an area-i/o flip-chip design In general, the performance in a chip design is evaluated y the timing effect inside the die core and most of works are concentrated on the timing delay inside the die core for the performance optimization. In an area-io flip-chip design, I/O uffers can e placed onto white space inside the die to ridge I/O pins in circuit locks to ump pads on RDL. After optimizing the performance inside the die core, the configured delay of I/O signals may not satisfy the timing requirement inside the die core due to inappropriate assignment of ump pads or improper placement of IO uffers. As a result, the final performance of the chip may e dominated y the timing effect at package stage. The I/O uffer placement prolem in conventional wireond designs[2, 3, 4, 5] has een extensively studied. However, these approaches are inapplicale in area-io flip-chip designs. Recently, many works[6, 7, 8, 9, 1] on I/O uffer placement have een pulished in area-io flip-chip designs. Hsieh et. al.[6] proposed a constructive approach with considering the total path delay and signal skew to place circuit locks and I/O uffers simultaneously. A SA-ased approach with the similar consideration was further introduced in [7]. However, these two approaches are concentrated on the skew minimization on the asis of wirelength. Xiong et al. [8] studied the I/O uffer placement with considering signal skew and other constraints. However, it is assumed that I/O uffers are placed at fixed locations for usage. Lai et. al.[9] considered the additional differential pairs into the IO uffer placement prolem. Wang et al. [1] proposed an I/O uffer placement approach to take the routaility issue for escape routing into consideration. However, all the works only considered how to place the circuit locks and I/O uffers at the same time and the timing constraint did not e considered for I/O uffer placement. For I/O uffer placement in an area-io flip-chip design, the prolem of timing-constrained I/O uffer placement in an area- IO flip-chip design is firstly formulated. Furthermore, an efficient two-phase approach is proposed to assign necessary I/O uffers for all the I/O signals such that the timing constraint of each I/O signal is satisfied. In the first phase, all I/O pins are assigned onto the feasile ump pads under the consideration of timing constraints. Furthermore, the I/O uffers are inserted to complete all the I/O signals from I/O pins to ump pads. Compared with Peng s SA-ased approach[7], with no timing constraint, our approach can reduce 71.82% of total wirelength and 55.74% of the maximum delay for 7 tested cases on the average. Under the given timing constraints, our result otains higher timing-constrained satisfaction ratio(tcsr) than the SA-ased approach[7] /DATE11/ 211 EDAA

2 II. PROBLEM FORMULATION According to the design specification in an area-io flipchip design, I/O uffers can e placed on white space in a floorplan plane and ump pads must e placed at pre-defined locations on RDL. It is assumed that a set of locks with some I/O pins is located in a floorplan plane and an array of ump pads is located on RDL. Generally speaking, any I/O pin must e assigned onto a unique ump pad and a necessary I/O uffer must e required to ridge the connection etween the I/O pin and its corresponding ump pad. To assign an I/O uffer onto an availale space in a floorplan plane to connect each I/O pin onto the corresponding ump pad, two essential constraints, timing constraint and non-overlapping constraint, must e satisfied on RDL. In an area-io flip-chip design, I/O signal is defined as an I/O connection to connect an I/O pin to a ump pad via an I/O uffer as shown in Fig. 2. In general, the delay of any I/O signal is constrained y a given timing delay of its I/O pin and divided into three parts: The first one is the delay etween the I/O pin and the I/O uffer, the second part is the intrinsic delay of the I/O uffer, and the final part is the delay etween the I/O uffer and the ump pad. Besides the intrinsic delay of the I/O uffer, the remaining delays are dependent on the lengths of the two connections. To satisfy the given timing delay, the timing constraint of each I/O connection must e considered in I/O uffer placement. On the other hand, the overlapping relation etween any I/O uffer and any lock is not allowed. Hence, I/O uffers are only placed on white space in a floorplan plane. In general, white spaces in a floorplan plane can e otained y performing the sweeping line algorithm[11]. Furthermore, all availale locations for I/O uffers can e achieved according to the size of an I/O uffer and the size of each white space in the plane. To satisfy the valid locations of I/O uffers, the nonoverlapping constraint must e must e considered in I/O uffer placement. Fig. 2. I/O pin I/O uffer Bump pad RDL Device layer I/O connection from an I/O pin to a ump pad via an I/O uffer Given a set of m locks, M = {B 1, B 2,...,B m }, and a set of s uffer locations, S = {s 1, s 2,...,s s }, in a floorplan plane, a set of n I/O pins, P = {,,...,p n }, on the locks, an array of r ump pads, B = {,,..., r }, on the given RDL and the timing constraint, t i, each I/O pin, p i, 1 i n, the timing-constrained I/O uffer placement is to map each I/O pin onto a unique ump pad to construct an I/O signal and assign an I/O uffer on a feasile location for each I/O signal in the given floorplan plane under the timing and non-overlapping constraints. As illustrated in Fig. 3(a), given a set of 7 locks, B 1, B 2,..., and B 7, and a set of 13 uffer locations, s 1, s 2,..., and s 13, in a floorplan plane, a set of 9 I/O pins,,,..., and p 9, on the locks, a set of 9 ump pads,,,, and, on the given RDL and the timing constraint of each I/O pin, the 9 resultant I/O signals, (, s 6, ), (, s 2, ), (, s 1, ), (, s 8, ), (, s 3, ), (, s 9, ), (, s 13, ), (, s 12, ) and (p 9, s 11, ), with the location assignment of 9 I/O uffers can e otained in timingconstrained I/O uffer placement and shown in Fig. 3(). s 1 s 2 B 2 Fig. 3. B 3 B 4 B s 7 1 s 1 8 B p s 6 s 7 s 8 s 9 s 12 B 5 B 1 s 13 (a) s 3 s 4 s 5 B 2 B 3 B 1 B 4 () B p B 5 B 6 Timing-constrained I/O uffer placement in an area-io flip-chip design III. TIMING-CONSTRAINED I/O BUFFER PLACEMENT For timing-constrained I/O uffer placement in an area-io flip-chip design, an efficient approach is proposed to assign I/O pins onto feasile ump pads to construct I/O signals and assign necessary I/O uffers onto feasile uffer locations for all I/O signals. Basically, the placement process can e divided into three main steps: Extraction of timing-constrained routing length, I/O signal assignment and Timing-constrained I/O uffer placement and the design flow for the proposed approach is illustrated in Fig. 4. (1) A given set of locks with some I/O pins and a given set of uffer locations in a floorplan plane (2) A given array of ump pads (3) The timing constraints of all the I/O pins Extraction of Timing -Constrained Routing Length I/O Signal Assignment Construction of Manhattan Circular Regions Graph Construction for I/O Signal Assignment Fig. 4. Matching-Based I/O Signal Assignment Timing-Constrained I/O Buffer placement Construction of Feasile Location Regions Location Construction for I/O Buffers Graph Construction for I/O Buffer Placement Matching-Based I/O Buffer Placement Timing-Constrained I/O Buffer Placement result Design flow of timing-constrained I/O uffer placement

3 A. Extraction of Timing-Constrained Routing Length For any I/O pin in a floorplan plane, its timing constraint determines all the feasile ump pads to construct an I/O signal. In other words, the location of any valid ump pad must connect from the I/O pin to the pad to satisfy the given timing constraint. To find all the valid ump pads under the timing constraint, the maximal routing length of the I/O pin must e extracted from the given timing constraint. Basically, an I/O connection can e modeled as a RC circuit and the delay of the RC circuit can e computed y using Elmore delay model. In Elmore delay model, the delay, t, of any I/O connection with length, l, can e otained as r c 2 t = l 2 ( Rc r C) l RC where R is the driver strength, C is the driving load, r is the unit resistance and c is the unit capacitance. It is assumed that t max is the maximum timing delay of a given I/O connection with length, l. It is known that the delay, t, of the I/O connection with length, l, must satisfy the timing constraint, t t. By using the delay expression in Elmore max delay model, the length, l, of the I/O connection must satisfy the following constraint. l c R r C 2r c t c R r C max, r c Hence, the maximal length, l max, of the I/O connection can e otained as, feasile to e assigned onto the pin, p k, ecause the two ump pads are located inside MCR k. Hence, the two pads, i,j and i,j1, are assigned as the pad candidates of the pin, p k. Fig. 5. l k i,j1 i,j MCR k p k (x k,y k ) i1,j1 i1,j Bump pad candiates for I/O pin Refer to the set of 9 I/O pins,,,..., and p 9, and the set of 9 ump pads,,,..., and, in Fig. 3(a), ased on the extraction of the maximum routing lengths of the I/O pins, all the MCRs can e achieved and illustrated in Fig. 6(a). According to the locations of all the ump pads and all the MCRs, the pad candidates of each I/O pin can e otained and all the possile I/O signals can e constructed as illustrated in Fig. 6(). Clearly, I/O pin,, has a set of pad candidates,, and, I/O pin,, has a set of pad candidates, and, I/O pin,, has a set of pad candidates, and, I/O pin,, has a set of pad candidates,,, and, I/O pin,, has a set of pad candidates, and, I/O pin,, has a set of pad candidates, and, I/O pin,, has a set of pad candidates, and, I/O pin,, has a set of pad candidates, and, and I/O pin, p 9, has a pad candidate,. c R r C 2r c t c R r C max max =, r c l p3 4 According to the length extraction of any I/O connection under a given timing constraint, t i, for I/O pin, p i, the maximum routing length, l i, of the I/O pin can e extracted. p2 p4 p5 p8 p9 4 p 9 B. I/O Signal Assignment According to the length extraction under a given timing constraint, the maximum routing lengths of all I/O pins in a floorplan plane can e extracted. In a Manhattan routing model, the collection of points within a fixed distance of a given point is called a Manhattan circular region(mcr) whose oundary is composed of two line segments with slope 1 and two line segments with slop -1. Furthermore, the radius of the MCR is the Manhattan distance etween the given point and its oundary. For any I/O pin, p i, with its timing constraint, t i, its feasile pad candidates must e inside the Manhattan circular region, MCR i, whose center is the coordinate, (x i, y i ), of the I/O pin, p i, and whose radius is the maximum routing length, l i, under the timing constraint, t i. As illustrated in Fig. 5, I/O pin, p k, constructs its MCR k ased on the coordinate, (x k, y k ), of the I/O pin as the MCR center and the maximum routing length, l k, as the MCR radius. It is assumed that there are four ump pads, i,j, i1,j, i,j1, and i1,j1, around the MCR k. Clearly, the ump pads, i1,j and i1,j1, are not feasile to e assigned onto the pin, p k, ecause the two ump pads are not located inside MCR k. In contrast, the two ump pads, i,j and i,j1, are Fig. 6. p1 p6 (a) p7 () Distriution of Manhattan circular regions for I/O pins and all the possile I/O signal for 9 I/O pins According to the possile I/O signals for all the I/O pins, a ipartite graph, G=(V p, V, E), can e constructed as follows: the vertex set in G can e divided into a vertex set, V p, for I/O pins and a vertex set, V, for pad candidates. For v i in V p and v j in V, there exists an edge, e i,j, in E if the connection etween the corresponding I/O pin in v i and the corresponding pad candidate in v i is a possile I/O signal under a given timing constraint. Clearly, an assignment solution can e otained if the numer of pad candidates is greater than or equal to the numer of I/O pins and each I/O pin has at least one pad candidate. To make each I/O pin to assign a feasile pad candidate, the I/O pins with only one pad candidate must have the highest priority to construct I/O signals. Therefore, a

4 matching-ased I/O signal assignment can e applied as follows: Firstly, any vertex in V p with degree 1 and its connecting vertex in V are selected to form an I/O signal and the original ipartite graph is modified y deleting the selected vertex pair in V. If there is no vertex in V p with degree 1, the vertex in V p with the smallest degree and its nearest connecting vertex in V are selected to form an I/O signal and the original ipartite graph is modified y deleting the selected vertex pair in V. Until the vertex set, V p, is empty or there is no edge in G, the assignment process will stop and the final I/O signal assignment result will e otained. Refer to all the possile I/O signals in Fig. 6(), a ipartite graph, G=(V p, V, E) can e uilt as illustrated in Fig. 7, where V p ={,,,,,,,, p 9 }, V ={,,,,,,,, } and E={e 1,2, e 1,3, e 1,6, e 2,1, e 2,2, e 3,1, e 3,4, e 4,2, e 4,3, e 4,5, e 4,6, e 5,4, e 5,7, e 6,5, e 6,6, e 7,8, e 7,9, e 8,7, e 8,8, e 9,8 }. By performing a matching-ased I/O signal assignment, the edge, e 9,8, is firstly selected to form an I/O signal and the edges, e 7,8 and e 8,8, are deleted ecause the pad candidate,, is assigned onto the pin, p 9. Furthermore, for the vertices in V p with degree 1, the edges, e 7,9, e 8,7, e 5,4, e 3,1 and e 2,2, are sequentially selected to form I/O signals. Finally, for the vertices in V p with the smallest degree, the edges, e 6,6, e 1,3 and e 4,5, are sequentially selected to form I/O signals and the assignment result of 9 I/O signals is illustrated in Fig. 8. Fig. 7. p 9 Construction of a ipartite graph for all the I/O signals of 9 I/O pins delay of an I/O signal from its I/O pin to its ump pad via an I/O uffer violates the given timing constraint, the placement of the I/O uffer for the I/O signal is not allowed. Hence, the location of the placed uffer for an I/O signal must e one availale uffer location under its timing constraint. Fig. 9. s 1 s 2 s 6 s 7 s 8 s 9 s 3 s 4 s 5 p 9 s 1 s 11 s 12 s 13 Availale locations for I/O uffer placement For any I/O signal from I/O pin, p i, to ump pad, j, ased on the construction of a Manhattan circular region, two Manhattan circular regions, MCR p i, and MCR j, must e uilt from I/O pin, p i, to the placed I/O uffer and from the placed I/O uffer to the corresponding ump pad, j, respectively. Furthermore, the feasile location region, FLR, of the placed I/O uffer for the corresponding I/O signal can e defined as the intersection region etween two MCRs, MCR p i, and MCR j, as illustrated in Fig. 1, where the center of the MCR p i is the coordinate, (x i, y i ), of the I/O pin, p i, and the radius, l p i, of the MCR p i is the maximum routing length under the given timing delay, t p i, and the center of the MCR j is the coordinate, (x j, y j ), of the ump pad, j, and the radius, l j, of the MCR j is the maximum routing length under the given timing delay, t j. l j FLR j (x j,y j ) IO l p i p i (x i,y i ) p 9 MCR j MCR p i Fig. 8. I/O signal assignment for 9 I/O pins C. Timing-Constrained I/O Buffer Placement After a set of I/O signals is constructed in I/O signal assignment, necessary I/O uffers must e further placed onto feasile uffer locations to complete I/O connections etween IO pins and ump pads. As illustrated in Fig. 9, according to the size of an I/O uffer, the whitespace in a floorplan plane can e assigned as 13 uffer locations. On the other hand, if the Fig. 1. Feasile location region for an I/O uffer To satisfy the given timing constraint of any I/O pin, p i, the FLR etween MCR p i, and MCR j, for the placed I/O uffer must e constrained into two necessary conditions. Firstly, the timing constraint of the I/O signal for the I/O pin, p i, must e satisfied as t p i t uf t j t max, where t p i is the timing delay from the I/O pin, p i, to the placed I/O uffer, t uf is the intrinsic delay of the placed I/O uffer, and t j is the timing delay from the placed I/O uffer to the ump pad, j. Based on the length, l p i, etween the I/O pin, p i, and the placed I/O uffer and the length, l j, etween the placed I/O uffer and the ump pad, j, t p i and t j can e otained as

5 p rc t i = li 2 p ( Rc r C ) li RC rc t j = l j j 2 ( R c r C) l R C and where R and C are the output resistance and input capacitance of an placed I/O uffer, respectively. On the other hand, to form the intersection region etween two MCRs, the total length of oth radiuses must e greater than or equal to the distance, d ij, etween the points, (x i, y i ) and (x j, y j ), that is, l p il j d ij. According to all the availale locations of I/O uffers for any I/O signal in a floorplan plane, a ipartite graph, G(V s, V, E), can e constructed as follows: the vertex set in G can e divided into a vertex set, V s, for I/O signals and a vertex set, V, for availale uffer locations. For v i in V s and v j in V, there exists an edge, e i,j, in E if the corresponding I/O signal in v i can use the corresponding uffer location in v i to place an I/O uffer under a given timing constraint. Clearly, a placement solution can e otained if the numer of uffer locations is greater than or equal to the numer of I/O signals and any I/O signal has at least one uffer location to place an I/O uffer. To make each I/O signal to place an I/O uffer, the I/O signals with only one uffer location must have the highest priority to place I/O uffers. Therefore, a matching-ased I/O uffer placement can e applied as follows: Firstly, any vertex in V s with degree 1 and its connecting vertex in V are selected to place an I/O uffer and the original ipartite graph is modified y deleting the selected vertex pair in V. If there is no vertex in V s with degree 1, the vertex in V s with the smallest degree and its nearest connecting vertex in V are selected to place an I/O uffer and the original ipartite graph is modified y deleting the selected vertex pair in V. Until the vertex set, V s, is empty or there is no edge in G, the placement process will stop and the final I/O uffer placement result will e otained. Refer to the set of 9 I/O signals, n 1, n 2,..., and n 9, and the set of 13 uffer locations, s 1, s 2,..., and s 13, in Fig. 9, a ipartite graph, G(V s, V, E) can e uilt as illustrated in Fig. 11, where V s ={n 1, n 2, n 3, n 4, n 5, n 6, n 7, n 8, n 9 }, V s ={s 1, s 2, s 3, s 4, s 5, s 6, s 7, s 8, s 9, s 1, s 11, s 12, s 13 } and E={e 1,3, e 1,4, e 2,2, e 3,1, e 3,2, e 4,5, e 4,6, e 5,7, e 5,8, e 6,5, e 6,6, e 7,13, e 8,1, e 8,11, e 9,11, e 9,12 }. By performing a matching-ased I/O uffer placement, the edges, e 2,2 and e 7,13, are selected to place two I/O uffers and the edges, e 3,2, is deleted ecause the location, s 2, is used for the I/O signal, n 2. Furthermore, for the vertices in V s with degree 1, the edge, e 3,1, is selected to place an I/O uffer. Finally, for the vertices in V p with the smallest degree, the edges, e 1,3, e 4,5, e 6,6, e 5,7, e 8,11, and e 9,12, are sequentially selected to place I/O uffers and the placement result of 9 I/O uffers is illustrated in Fig. 12. Fig. 11. n 1 n 2 n 3 n 4 n 5 n 6 n 7 n 8 n 9 s 1 s 2 s 3 s 4 s 5 s 6 s 7 s 8 s 9 s 1 s 11 s 12 s 13 Construction of a ipartite graph for 9 I/O signals and 13 availale uffer locations, Fig. 12. B 2 B 3 B 1 B 4 B 5 B 6 p B Timing-constrained I/O uffer placement for 9 I/O pins D. Analysis of Time Complexity Based on the timing constraints of all I/O pins, it is clear that the time complexity of extracting the timing-constrained routing lengths of all the pins in Elmore delay model is O(n), where n is the numer of I/O pins in a floorplan plane. Furthermore, the time complexity of constructing the Manhattan circular regions(mcrs) for all the pins under their timing constraints is O(n). Based on all the possile I/O signal, it is clear that the time complexity of constructing a ipartite graph is O(nr) and the time complexity in a matching-ased I/O signal assignment is O(n 2 ), where r is the numer of ump pads in the area-i/o flip-chip design. In the placement of I/O uffers, firstly, the time complexity of finding the possile uffer locations for all the I/O signals under their timing constraints is O(n 2 ). Furthermore, the time complexity of constructing a ipartite graph is O(ns). Finally, the time complexity in a matching I/O uffer placement is O(n 2 ). Hence, the time complexity of timing-constrained I/O uffer placement is O(n 2 nrns). As a result, the time complexity in the proposed approach is O(nrns) ecause the numer of ump pads is generally greater than the numer of I/O pins. IV. EXPERIMENTAL RESULTS For timing-constrained I/O uffer placement, our proposed approach has een implemented y standard C language and run on an Intel Core2 Quad 2.6GHz machine with 3.5G memory. We take seven industrial circuits in [7] as tested cases. Due to no timing constraint in these tested cases, we set the timing delay of the distance etween two adjacent ump pads as a unit delay and the timing constraint for all I/O pins as the window size of the delay of multiple units. The parameters in.18μm process from NTRS 97 are used to compute the delay of a RC circuit. In the experiments, the driver strength and the driving load are set as 18(Ω) and 23.4(pF), respectively, r is assigned as.76(ω/μm) and c is assigned as.118(pf/μm). Besides that, one kind of I/O uffers with input capacitance of 23.4(pF), output resistance of 18(Ω) and the intrinsic delay of 36.4(ps) is applied for the tested cases. Here, the timingconstrained satisfaction ratio(tcsr) is defined as the ratio etween the numer of I/O signals which satisfy the given timing constraints and the total numer of I/O signals in an area-i/o flip-chip design. Two experiments are set up to evaluate the TCSR and the results in our proposed approach are compared with those in

6 Peng s SA-ased approach[7]. In the first experiment, no timing constraint is considered in 7 tested cases and the experimental results including the total wirelength and the maximum delay are shown in Tale I. The experimental results in Tale I show that our approach can reduce 71.82% of the total wirelength and 55.74% of the maximum delay for 7 tested cases on the average. Under the various timing constraints from the window sizes(w.s) of the delay of 6, 12 and 18 units, the tested cases are used to evaluate the TCSR in our second experiment and the experimental results are shown in Tale II. Compared with Peng s SA-ased approach[7], the experimental results show that the SA-ased approach only has low TCSR in 7 tested cases to satisfy the given timing constraints. Under the first timing constraint, our proposed approach otains 86.5% TCSR in 7 tested cases in on the average. As the timing constraint is modified as the window size of the delay 18 units, all the tested cases are completely assigned y our proposed approach. V. CONCLUSIONS The prolem of timing-constrained I/O uffer placement in an area-io flip-chip design is firstly formulated. Given a set of locks with some I/O pins and the timing constraints of the I/O pins in a floorplan plane, our proposed approach is effectively to assign all the I/O signals and place the necessary I/O uffers for the assigned I/O signals with satisfying the given timing constraints. Experimental results present that our proposed approach reduces the total wirelength and the maximum delay with no timing constraint and achieves higher TCSR with the given timing constraints than the previous work. REFERENCES [1] P. Dehkordi and D. Bouldin, Design for packageaility: the impact of onding technology on the size and layout of VLSI dies, Multi-chip Module Conference, pp , [2] P. Buffet, J. Natonio, R. Proctor, Y. Sun and G. Yasar, Methodology for I/O cell placement and checking in ASIC designs using area-array power grid, IEEE Custom Integrated Circuits Conference, pp , 2. [3] A. E. Caldwell, A. B. Kahng and I. L. Markov, Can recursive isection alone produce routale placement?, ACM/IEEE Design Automation Conference, pp , 2. [4] A. R. Agnihotri, M. C. Yildiz, A. Khatkhate, A. Mathur, S. Ono and P. H. Madden, Fractical cut: improved recursive isection placement, IEEE/ACM International Conference on Computed-Aided Design, pp , 23. [5] S. N. Adya, I. L. Markov and P. G. Villarruia, On whitespace in mixed-size placement and physical synthesis, IEEE/ACM International Conference on Computed-Aided Design, pp , 23. [6] H. Y. Hsieh and T. C. Wang, Simple yet effective algorithms for lock and I/O uffer placement in flip-chip design, IEEE International Symposium on Circuits and Systems, pp , 25. [7] C. Y. Peng, W. C. Chao, Y. W. Chang and J. H. Wang, Simultaneous lock and I/O uffer floorplanning for flip-chip design, IEEE Asia and South Pacific Design Automation Conference, pp , 26. [8] J. Xiong, Y. C. Wong, E. Sarto and L. He, Constraint driven I/O planning and placement for chip-package co-design, IEEE/ACM Asia and South Pacific Design Automation Conference, pp , 27. [9] M. F. Lai and H. M. Chen, An implementation of performance-driven lock and I/O placement for chip-package codesign,, IEEE International Symposium on Quality Electronic Design, pp.64-67, 28. [1] C. Y. Wang and W. K. Mak, Signal skew aware floorplanning and umper signal assignment technique for flip-chip, IEEE/ACM Asia and South Pacific Design Automation Conference, pp , 29. [11] M. de Berg, O. Cheong, M. van Kreveld and M. Overmars, Computational Geometry: Algorithms and Applications, 3 rd edition, Springer-Verlag, 25. TABLE I. EXPERIMENTAL RESULTS FOR I/O BUFFER PLACEMENT WITH NO TIMING CONSTRAINT Ex #Pin #Block #Bump Chip area Peng s SA-ased Approach[7] #WL(μm) TCSR Max. delay(ps) #WL(μm) TCSR Max. delay(ps) Time(s) case x (1%) 1% (1%) 1213(75.2%) 1% 21.41(92.36%).16 case x (1%) 1% (1%) 16236(22.85%) 1% 62.92(51.35%).562 case x (1%) 1% 17.3(1%) 3394(19.4%) 1% 69.61(4.93%) 1.97 case x (1%) 1% (1%) 25226(13.23%) 1% 54.43(36.82%) 4.94 case x (1%) 1% (1%) 417(18.86%) 1% 73.46(42.24%) case x (1%) 1% (1%) 38534(16.47%) 1% 68.65(19.84%) case x (1%) 1% 33.95(1%) 69726(31.55%) 1% 87.12(26.32%) TABLE II. EXPERIMENTAL RESULTS FOR TIMING-CONSTRAINED I/O BUFFER PLACEMENT UNDER THREE WINDOW SIZES = 6, 12 AND 18 Ex Peng s SA-ased Approach[7] W.S = 6 W.S = 12 W.S = 18 Peng s SA-ased Approach[7] Peng s SA-ased Approach[7] TCSR #WL(μm) TCSR #WL(μm) TCSR #WL(μm) TCSR #WL(μm) TCSR #WL(μm) TCSR #WL(μm) case1 1% % % % % % 1213 case2 N/A N/A 89.2% N/A N/A 1% % 355 1% case3 N/A N/A 82.5% N/A N/A 1% N/A N/A 1% 3394 case4 N/A N/A 98.4% N/A N/A 1% N/A N/A 1% case5 N/A N/A 83.% 2537 N/A N/A 1% N/A N/A 1% 417 case6 N/A N/A 86.1% % % % 154 1% case7 1.4% % % % % % 69726

Routability-Driven Bump Assignment for Chip-Package Co-Design

Routability-Driven Bump Assignment for Chip-Package Co-Design 1 Routability-Driven Bump Assignment for Chip-Package Co-Design Presenter: Hung-Ming Chen Outline 2 Introduction Motivation Previous works Our contributions Preliminary Problem formulation Bump assignment

More information

Constraint Driven I/O Planning and Placement for Chip-package Co-design

Constraint Driven I/O Planning and Placement for Chip-package Co-design Constraint Driven I/O Planning and Placement for Chip-package Co-design Jinjun Xiong, Yiuchung Wong, Egino Sarto, Lei He University of California, Los Angeles Rio Design Automation, Inc. Agenda Motivation

More information

Physical Design Implementation for 3D IC Methodology and Tools. Dave Noice Vassilios Gerousis

Physical Design Implementation for 3D IC Methodology and Tools. Dave Noice Vassilios Gerousis I NVENTIVE Physical Design Implementation for 3D IC Methodology and Tools Dave Noice Vassilios Gerousis Outline 3D IC Physical components Modeling 3D IC Stack Configuration Physical Design With TSV Summary

More information

Floorplan Management: Incremental Placement for Gate Sizing and Buffer Insertion

Floorplan Management: Incremental Placement for Gate Sizing and Buffer Insertion Floorplan Management: Incremental Placement for Gate Sizing and Buffer Insertion Chen Li, Cheng-Kok Koh School of ECE, Purdue University West Lafayette, IN 47907, USA {li35, chengkok}@ecn.purdue.edu Patrick

More information

On Increasing Signal Integrity with Minimal Decap Insertion in Area-Array SoC Floorplan Design

On Increasing Signal Integrity with Minimal Decap Insertion in Area-Array SoC Floorplan Design On Increasing Signal Integrity with Minimal Decap Insertion in Area-Array SoC Floorplan Design Chao-Hung Lu Department of Electrical Engineering National Central University Taoyuan, Taiwan, R.O.C. Email:

More information

Symmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment

Symmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment Symmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment Xin-Wei Shih, Tzu-Hsuan Hsu, Hsu-Chieh Lee, Yao-Wen Chang, Kai-Yuan Chao 2013.01.24 1 Outline 2 Clock Network Synthesis Clock network

More information

On Improving Recursive Bipartitioning-Based Placement

On Improving Recursive Bipartitioning-Based Placement Purdue University Purdue e-pubs ECE Technical Reports Electrical and Computer Engineering 12-1-2003 On Improving Recursive Bipartitioning-Based Placement Chen Li Cheng-Kok Koh Follow this and additional

More information

Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations

Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations Renshen Wang Department of Computer Science and Engineering University of California, San Diego La Jolla,

More information

Physical Implementation

Physical Implementation CS250 VLSI Systems Design Fall 2009 John Wawrzynek, Krste Asanovic, with John Lazzaro Physical Implementation Outline Standard cell back-end place and route tools make layout mostly automatic. However,

More information

ECE 5745 Complex Digital ASIC Design Topic 13: Physical Design Automation Algorithms

ECE 5745 Complex Digital ASIC Design Topic 13: Physical Design Automation Algorithms ECE 7 Complex Digital ASIC Design Topic : Physical Design Automation Algorithms Christopher atten School of Electrical and Computer Engineering Cornell University http://www.csl.cornell.edu/courses/ece7

More information

Non-Rectangular Shaping and Sizing of Soft Modules for Floorplan Design Improvement

Non-Rectangular Shaping and Sizing of Soft Modules for Floorplan Design Improvement Non-Rectangular Shaping and Sizing of Soft Modules for Floorplan Design Improvement Chris C.N. Chu and Evangeline F.Y. Young Abstract Many previous works on floorplanning with non-rectangular modules [,,,,,,,,,,,

More information

Problem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets.

Problem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Clock Routing Problem Formulation Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Better to develop specialized routers for these nets.

More information

Power-Mode-Aware Buffer Synthesis for Low-Power Clock Skew Minimization

Power-Mode-Aware Buffer Synthesis for Low-Power Clock Skew Minimization This article has been accepted and published on J-STAGE in advance of copyediting. Content is final as presented. IEICE Electronics Express, Vol.* No.*,*-* Power-Mode-Aware Buffer Synthesis for Low-Power

More information

An Effective Decap Insertion Method Considering Power Supply Noise during Floorplanning *

An Effective Decap Insertion Method Considering Power Supply Noise during Floorplanning * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 115-127 (2008) Short Paper An Effective Decap Insertion Method Considering Power Supply Noise during Floorplanning * CHAO-HUNG LU, HUNG-MING CHEN ** AND

More information

An Efficient and Fast Algorithm for Routing

An Efficient and Fast Algorithm for Routing VLSI DESIGN 1996, Vol. 5, No. 1, pp. 1-10 Reprints availale directly from the pulisher Photocopying permitted y license only (C) 1996 OPA (Overseas Pulishers Association) Amsterdam B.V. Pulished in The

More information

On GPU Bus Power Reduction with 3D IC Technologies

On GPU Bus Power Reduction with 3D IC Technologies On GPU Bus Power Reduction with 3D Technologies Young-Joon Lee and Sung Kyu Lim School of ECE, Georgia Institute of Technology, Atlanta, Georgia, USA yjlee@gatech.edu, limsk@ece.gatech.edu Abstract The

More information

ICS 252 Introduction to Computer Design

ICS 252 Introduction to Computer Design ICS 252 Introduction to Computer Design Placement Fall 2007 Eli Bozorgzadeh Computer Science Department-UCI References and Copyright Textbooks referred (none required) [Mic94] G. De Micheli Synthesis and

More information

Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence

Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence Chen-Wei Liu 12 and Yao-Wen Chang 2 1 Synopsys Taiwan Limited 2 Department of Electrical Engineering National Taiwan University,

More information

Very Large Scale Integration (VLSI)

Very Large Scale Integration (VLSI) Very Large Scale Integration (VLSI) Lecture 6 Dr. Ahmed H. Madian Ah_madian@hotmail.com Dr. Ahmed H. Madian-VLSI 1 Contents FPGA Technology Programmable logic Cell (PLC) Mux-based cells Look up table PLA

More information

Signal Integrity Comparisons Between Stratix II and Virtex-4 FPGAs

Signal Integrity Comparisons Between Stratix II and Virtex-4 FPGAs White Paper Introduction Signal Integrity Comparisons Between Stratix II and Virtex-4 FPGAs Signal integrity has become a critical issue in the design of high-speed systems. Poor signal integrity can mean

More information

Routing. Robust Channel Router. Figures taken from S. Gerez, Algorithms for VLSI Design Automation, Wiley, 1998

Routing. Robust Channel Router. Figures taken from S. Gerez, Algorithms for VLSI Design Automation, Wiley, 1998 Routing Robust Channel Router Figures taken from S. Gerez, Algorithms for VLSI Design Automation, Wiley, 1998 Channel Routing Algorithms Previous algorithms we considered only work when one of the types

More information

Binary Decision Tree Using Genetic Algorithm for Recognizing Defect Patterns of Cold Mill Strip

Binary Decision Tree Using Genetic Algorithm for Recognizing Defect Patterns of Cold Mill Strip Binary Decision Tree Using Genetic Algorithm for Recognizing Defect Patterns of Cold Mill Strip Kyoung Min Kim 1,4, Joong Jo Park, Myung Hyun Song 3, In Cheol Kim 1, and Ching Y. Suen 1 1 Centre for Pattern

More information

New Optimal Layer Assignment for Bus-Oriented Escape Routing

New Optimal Layer Assignment for Bus-Oriented Escape Routing New Optimal Layer ssignment for us-oriented scape Routing in-tai Yan epartment of omputer Science and nformation ngineering, hung-ua University, sinchu, Taiwan, R. O.. STRT n this paper, based on the optimal

More information

L14 - Placement and Routing

L14 - Placement and Routing L14 - Placement and Routing Ajay Joshi Massachusetts Institute of Technology RTL design flow HDL RTL Synthesis manual design Library/ module generators netlist Logic optimization a b 0 1 s d clk q netlist

More information

Topological Routing to Maximize Routability for Package Substrate

Topological Routing to Maximize Routability for Package Substrate Topological Routing to Maximize Routability for Package Substrate Speaker: Guoqiang Chen Authors: Shenghua Liu, Guoqiang Chen, Tom Tong Jing, Lei He, Tianpei Zhang, Robby Dutta, Xian-Long Hong Outline

More information

Iterative-Constructive Standard Cell Placer for High Speed and Low Power

Iterative-Constructive Standard Cell Placer for High Speed and Low Power Iterative-Constructive Standard Cell Placer for High Speed and Low Power Sungjae Kim and Eugene Shragowitz Department of Computer Science and Engineering University of Minnesota, Minneapolis, MN 55455

More information

ASIC Physical Design Top-Level Chip Layout

ASIC Physical Design Top-Level Chip Layout ASIC Physical Design Top-Level Chip Layout References: M. Smith, Application Specific Integrated Circuits, Chap. 16 Cadence Virtuoso User Manual Top-level IC design process Typically done before individual

More information

Parallel algorithms for fast air pollution assessment in three dimensions

Parallel algorithms for fast air pollution assessment in three dimensions HPC-UA 2014 (Ukraine, Kyiv, Octoer 14, 2014) Parallel algorithms for fast air pollution assessment in three dimensions Bohaienko V.O. 1 1 Glushkov Institute of Cyernetic of NAS of Ukraine, Kyiv, Ukraine

More information

Minimum Implant Area-Aware Placement and Threshold Voltage Refinement

Minimum Implant Area-Aware Placement and Threshold Voltage Refinement Minimum Implant Area-Aware Placement and Threshold Voltage Refinement Seong-I Lei Wai-Kei Ma Chris Chu Department of Computer Science Department of Computer Science Department of Electrical and Computer

More information

An Interconnect-Centric Design Flow for Nanometer. Technologies

An Interconnect-Centric Design Flow for Nanometer. Technologies An Interconnect-Centric Design Flow for Nanometer Technologies Jason Cong Department of Computer Science University of California, Los Angeles, CA 90095 Abstract As the integrated circuits (ICs) are scaled

More information

Crosslink Insertion for Variation-Driven Clock Network Construction

Crosslink Insertion for Variation-Driven Clock Network Construction Crosslink Insertion for Variation-Driven Clock Network Construction Fuqiang Qian, Haitong Tian, Evangeline Young Department of Computer Science and Engineering The Chinese University of Hong Kong {fqqian,

More information

Cell Density-driven Detailed Placement with Displacement Constraint

Cell Density-driven Detailed Placement with Displacement Constraint Cell Density-driven Detailed Placement with Displacement Constraint Wing-Kai Chow, Jian Kuang, Xu He, Wenzan Cai, Evangeline F. Y. Young Department of Computer Science and Engineering The Chinese University

More information

Diagonal Routing in High Performance Microprocessor Design

Diagonal Routing in High Performance Microprocessor Design Diagonal Routing in High Performance Microprocessor Design Noriyuki Ito, Hideaki Katagiri, Ryoichi Yamashita, Hiroshi Ikeda, Hiroyuki Sugiyama, Hiroaki Komatsu, Yoshiyasu Tanamura, Akihiko Yoshitake, Kazuhiro

More information

An Efficient Routing Tree Construction Algorithm with Buffer Insertion, Wire Sizing and Obstacle Considerations

An Efficient Routing Tree Construction Algorithm with Buffer Insertion, Wire Sizing and Obstacle Considerations An Efficient Routing Tree Construction Algorithm with uffer Insertion, Wire Sizing and Obstacle Considerations Sampath Dechu Zion Cien Shen Chris C N Chu Physical Design Automation Group Dept Of ECpE Dept

More information

Topics. ! PLAs.! Memories: ! Datapaths.! Floor Planning ! ROM;! SRAM;! DRAM. Modern VLSI Design 2e: Chapter 6. Copyright 1994, 1998 Prentice Hall

Topics. ! PLAs.! Memories: ! Datapaths.! Floor Planning ! ROM;! SRAM;! DRAM. Modern VLSI Design 2e: Chapter 6. Copyright 1994, 1998 Prentice Hall Topics! PLAs.! Memories:! ROM;! SRAM;! DRAM.! Datapaths.! Floor Planning Programmable logic array (PLA)! Used to implement specialized logic functions.! A PLA decodes only some addresses (input values);

More information

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006 Physical Design of Digital Integrated Circuits (EN029 S40) Sherief Reda Division of Engineering, Brown University Fall 2006 Lecture 09: Routing Introduction to Routing Global Routing Detailed Routing 2

More information

Chap5 The Theory of the Simplex Method

Chap5 The Theory of the Simplex Method College of Management, NCTU Operation Research I Fall, Chap The Theory of the Simplex Method Terminology Constraint oundary equation For any constraint (functional and nonnegativity), replace its,, sign

More information

1 Model checking and equivalence checking

1 Model checking and equivalence checking 978--52-85972- - Practical Design Verification Model checking and equivalence checking Masahiro Fujita. Introduction Owing to the advances in semiconductor technology, a large and complex system that has

More information

Place and Route for FPGAs

Place and Route for FPGAs Place and Route for FPGAs 1 FPGA CAD Flow Circuit description (VHDL, schematic,...) Synthesize to logic blocks Place logic blocks in FPGA Physical design Route connections between logic blocks FPGA programming

More information

On Refining Standard Cell Placement for Self-aligned Double Patterning *

On Refining Standard Cell Placement for Self-aligned Double Patterning * On Refining Standard Cell Placement for Self-aligned Double Patterning * Ye-Hong Chen, Sheng-He Wang and Ting-Chi Wang Department of Computer Science, National Tsing Hua University, Hsinchu City 30013,

More information

A Hierarchical Bin-Based Legalizer for Standard-Cell Designs with Minimal Disturbance

A Hierarchical Bin-Based Legalizer for Standard-Cell Designs with Minimal Disturbance A Hierarchical Bin-Based Legalizer for Standard- Designs with Minimal Disturbance Yu-Min Lee, Tsung-You Wu, and Po-Yi Chiang Department of Electrical Engineering National Chiao Tung University ASPDAC,

More information

On the Rectangle Escape Problem

On the Rectangle Escape Problem On the Rectangle Escape Problem A. Ahmadinejad S. Assadi E. Emamjomeh-Zadeh S. Yazdanbod H. Zarrabi-Zadeh Abstract Motivated by the bus escape routing problem in printed circuit boards, we study the following

More information

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006

Physical Design of Digital Integrated Circuits (EN0291 S40) Sherief Reda Division of Engineering, Brown University Fall 2006 Physical Design of Digital Integrated Circuits (EN029 S40) Sherief Reda Division of Engineering, Brown University Fall 2006 Lecture 08: Interconnect Trees Introduction to Graphs and Trees Minimum Spanning

More information

Satisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits

Satisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits Satisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits Suchandra Banerjee Anand Ratna Suchismita Roy mailnmeetsuchandra@gmail.com pacific.anand17@hotmail.com suchismita27@yahoo.com

More information

Basic Idea. The routing problem is typically solved using a twostep

Basic Idea. The routing problem is typically solved using a twostep Global Routing Basic Idea The routing problem is typically solved using a twostep approach: Global Routing Define the routing regions. Generate a tentative route for each net. Each net is assigned to a

More information

Efficient Rectilinear Steiner Tree Construction with Rectangular Obstacles

Efficient Rectilinear Steiner Tree Construction with Rectangular Obstacles Proceedings of the th WSES Int. onf. on IRUITS, SYSTEMS, ELETRONIS, ONTROL & SIGNL PROESSING, allas, US, November 1-3, 2006 204 Efficient Rectilinear Steiner Tree onstruction with Rectangular Obstacles

More information

A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithmetic with Decomposed LUT

A Novel Approach of Area-Efficient FIR Filter Design Using Distributed Arithmetic with Decomposed LUT IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735. Volume 7, Issue 2 (Jul. - Aug. 2013), PP 13-18 A Novel Approach of Area-Efficient FIR Filter

More information

Obstacle-Aware Longest-Path Routing with Parallel MILP Solvers

Obstacle-Aware Longest-Path Routing with Parallel MILP Solvers , October 20-22, 2010, San Francisco, USA Obstacle-Aware Longest-Path Routing with Parallel MILP Solvers I-Lun Tseng, Member, IAENG, Huan-Wen Chen, and Che-I Lee Abstract Longest-path routing problems,

More information

Detailed Placement Algorithm for VLSI Design with Double-Row Height Standard Cells

Detailed Placement Algorithm for VLSI Design with Double-Row Height Standard Cells Detailed Placement Algorithm for VLSI Design with Double-Row Height Standard Cells Gang Wu and Chris Chu Department of Electrical and Computer Engineering, Iowa State University, IA Email: {gangwu, cnchu}@iastate.edu

More information

A Study of IR-drop Noise Issues in 3D ICs with Through-Silicon-Vias

A Study of IR-drop Noise Issues in 3D ICs with Through-Silicon-Vias A Study of IR-drop Noise Issues in 3D ICs with Through-Silicon-Vias Moongon Jung and Sung Kyu Lim School of Electrical and Computer Engineering Georgia Institute of Technology, Atlanta, Georgia, USA Email:

More information

Circuit Model for Interconnect Crosstalk Noise Estimation in High Speed Integrated Circuits

Circuit Model for Interconnect Crosstalk Noise Estimation in High Speed Integrated Circuits Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 8 (2013), pp. 907-912 Research India Publications http://www.ripublication.com/aeee.htm Circuit Model for Interconnect Crosstalk

More information

Chapter 5 Global Routing

Chapter 5 Global Routing Chapter 5 Global Routing 5. Introduction 5.2 Terminology and Definitions 5.3 Optimization Goals 5. Representations of Routing Regions 5.5 The Global Routing Flow 5.6 Single-Net Routing 5.6. Rectilinear

More information

VERY large scale integration (VLSI) design for power

VERY large scale integration (VLSI) design for power IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 7, NO. 1, MARCH 1999 25 Short Papers Segmented Bus Design for Low-Power Systems J. Y. Chen, W. B. Jone, Member, IEEE, J. S. Wang,

More information

Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations

Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations XXVII IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, OCTOBER 5, 2009 Symmetrical Buffer Placement in Clock Trees for Minimal Skew Immune to Global On-chip Variations Renshen Wang 1 Takumi Okamoto 2

More information

ECE260B CSE241A Winter Placement

ECE260B CSE241A Winter Placement ECE260B CSE241A Winter 2005 Placement Website: / courses/ ece260b- w05 ECE260B CSE241A Placement.1 Slides courtesy of Prof. Andrew B. Slides courtesy of Prof. Andrew B. Kahng VLSI Design Flow and Physical

More information

Floorplan considering interconnection between different clock domains

Floorplan considering interconnection between different clock domains Proceedings of the 11th WSEAS International Conference on CIRCUITS, Agios Nikolaos, Crete Island, Greece, July 23-25, 2007 115 Floorplan considering interconnection between different clock domains Linkai

More information

Clock Skew Optimization Considering Complicated Power Modes

Clock Skew Optimization Considering Complicated Power Modes Clock Skew Optimization Considering Complicated Power Modes Chiao-Ling Lung 1,2, Zi-Yi Zeng 1, Chung-Han Chou 1, Shih-Chieh Chang 1 National Tsing-Hua University, HsinChu, Taiwan 1 Industrial Technology

More information

Introduction. A very important step in physical design cycle. It is the process of arranging a set of modules on the layout surface.

Introduction. A very important step in physical design cycle. It is the process of arranging a set of modules on the layout surface. Placement Introduction A very important step in physical design cycle. A poor placement requires larger area. Also results in performance degradation. It is the process of arranging a set of modules on

More information

Can Recursive Bisection Alone Produce Routable Placements?

Can Recursive Bisection Alone Produce Routable Placements? Supported by Cadence Can Recursive Bisection Alone Produce Routable Placements? Andrew E. Caldwell Andrew B. Kahng Igor L. Markov http://vlsicad.cs.ucla.edu Outline l Routability and the placement context

More information

On the Rectangle Escape Problem

On the Rectangle Escape Problem CCCG 2013, Waterloo, Ontario, August 8 10, 2013 On the Rectangle Escape Problem Sepehr Assadi Ehsan Emamjomeh-Zadeh Sadra Yazdanbod Hamid Zarrabi-Zadeh Abstract Motivated by a PCB routing application,

More information

Visibility: Finding the Staircase Kernel in Orthogonal Polygons

Visibility: Finding the Staircase Kernel in Orthogonal Polygons Visibility: Finding the Staircase Kernel in Orthogonal Polygons 8 Visibility: Finding the Staircase Kernel in Orthogonal Polygons Tzvetalin S. Vassilev, Nipissing University, Canada Stefan Pape, Nipissing

More information

A Trade-off Oriented Placement Tool 1

A Trade-off Oriented Placement Tool 1 A Trade-off Oriented Placement Tool Huaiyu Xu Maogang Wang Bo-Kyung Choi Majid Sarrafzadeh Computer Science Department University of California, Los Angeles, CA, 90095 huaiyu@cs.ucla.edu, mgwang@cadence.com,

More information

Full Custom Layout Optimization Using Minimum distance rule, Jogs and Depletion sharing

Full Custom Layout Optimization Using Minimum distance rule, Jogs and Depletion sharing Full Custom Layout Optimization Using Minimum distance rule, Jogs and Depletion sharing Umadevi.S #1, Vigneswaran.T #2 # Assistant Professor [Sr], School of Electronics Engineering, VIT University, Vandalur-

More information

Monolithic 3D IC Design for Deep Neural Networks

Monolithic 3D IC Design for Deep Neural Networks Monolithic 3D IC Design for Deep Neural Networks 1 with Application on Low-power Speech Recognition Kyungwook Chang 1, Deepak Kadetotad 2, Yu (Kevin) Cao 2, Jae-sun Seo 2, and Sung Kyu Lim 1 1 School of

More information

Processing Rate Optimization by Sequential System Floorplanning

Processing Rate Optimization by Sequential System Floorplanning Processing Rate Optimization by Sequential System Floorplanning Jia Wang Ping-Chih Wu Hai Zhou EECS Department Northwestern University Evanston, IL 60208, U.S.A. {jwa112, haizhou}@ece.northwestern.edu

More information

Calibrating Achievable Design GSRC Annual Review June 9, 2002

Calibrating Achievable Design GSRC Annual Review June 9, 2002 Calibrating Achievable Design GSRC Annual Review June 9, 2002 Wayne Dai, Andrew Kahng, Tsu-Jae King, Wojciech Maly,, Igor Markov, Herman Schmit, Dennis Sylvester DUSD(Labs) Calibrating Achievable Design

More information

Analysis of different legalisation methods for unequal sized recursive Min-cut placement

Analysis of different legalisation methods for unequal sized recursive Min-cut placement IOSR Journal of VLSI and Signal Processing (IOSR-JVSP) Volume 4, Issue 4, Ver. III (Jul-Aug. 04), PP -60 e-issn: 3 400, p-issn No. : 3 4 Analysis of different legalisation methods for unequal sized recursive

More information

Configurable Rectilinear Steiner Tree Construction for SoC and Nano Technologies

Configurable Rectilinear Steiner Tree Construction for SoC and Nano Technologies Configurable Rectilinear Steiner Tree Construction for SoC and Nano Technologies Iris Hui-Ru Jiang and Yen-Ting Yu Department of Electronics Engineering & Institute of Electronics National Chiao Tung University,

More information

Recent Results from Analyzing the Performance of Heuristic Search

Recent Results from Analyzing the Performance of Heuristic Search Recent Results from Analyzing the Performance of Heuristic Search Teresa M. Breyer and Richard E. Korf Computer Science Department University of California, Los Angeles Los Angeles, CA 90095 {treyer,korf}@cs.ucla.edu

More information

ESE 570 Cadence Lab Assignment 2: Introduction to Spectre, Manual Layout Drawing and Post Layout Simulation (PLS)

ESE 570 Cadence Lab Assignment 2: Introduction to Spectre, Manual Layout Drawing and Post Layout Simulation (PLS) ESE 570 Cadence Lab Assignment 2: Introduction to Spectre, Manual Layout Drawing and Post Layout Simulation (PLS) Objective Part A: To become acquainted with Spectre (or HSpice) by simulating an inverter,

More information

A Routing Method based on Nearest Via Assignment for 2-Layer Ball Grid Array Packages

A Routing Method based on Nearest Via Assignment for 2-Layer Ball Grid Array Packages A Routing Method based on Nearest Via Assignment for 2-Layer Ball Grid Array Packages Yoshiaki KURATA Yoichi TOMIOKA Yukihide KOHIRA Atsushi TAKAHASHI Tokyo Institute of Technology Dept. of Communications

More information

Additional Slides for Lecture 17. EE 271 Lecture 17

Additional Slides for Lecture 17. EE 271 Lecture 17 Additional Slides for Lecture 17 Advantages/Disadvantages of Wire Bonding Pros Cost: cheapest packages use wire bonding Allows ready access to front side of die for probing Cons Relatively high inductance

More information

8D-3. Experiences of Low Power Design Implementation and Verification. Shi-Hao Chen. Jiing-Yuan Lin

8D-3. Experiences of Low Power Design Implementation and Verification. Shi-Hao Chen. Jiing-Yuan Lin Experiences of Low Power Design Implementation and Verification Shi-Hao Chen Global Unichip Corp. Hsin-Chu Science Park, Hsin-Chu, Taiwan 300 +886-3-564-6600 hockchen@globalunichip.com Jiing-Yuan Lin Global

More information

Double Patterning-Aware Detailed Routing with Mask Usage Balancing

Double Patterning-Aware Detailed Routing with Mask Usage Balancing Double Patterning-Aware Detailed Routing with Mask Usage Balancing Seong-I Lei Department of Computer Science National Tsing Hua University HsinChu, Taiwan Email: d9762804@oz.nthu.edu.tw Chris Chu Department

More information

Real-Time Median Filtering for Embedded Smart Cameras

Real-Time Median Filtering for Embedded Smart Cameras Real-Time Median Filtering for Emedded Smart Cameras Yong Zhao Brown University Yong Zhao@rown.edu Gariel Tauin Brown University tauin@rown.edu Figure 1: System architecture of our proof-of-concept Visual

More information

Metal-Density Driven Placement for CMP Variation and Routability

Metal-Density Driven Placement for CMP Variation and Routability Metal-Density Driven Placement for CMP Variation and Routability ISPD-2008 Tung-Chieh Chen 1, Minsik Cho 2, David Z. Pan 2, and Yao-Wen Chang 1 1 Dept. of EE, National Taiwan University 2 Dept. of ECE,

More information

a) wire i with width (Wi) b) lij C coupled lij wire j with width (Wj) (x,y) (u,v) (u,v) (x,y) upper wiring (u,v) (x,y) (u,v) (x,y) lower wiring dij

a) wire i with width (Wi) b) lij C coupled lij wire j with width (Wj) (x,y) (u,v) (u,v) (x,y) upper wiring (u,v) (x,y) (u,v) (x,y) lower wiring dij COUPLING AWARE ROUTING Ryan Kastner, Elaheh Bozorgzadeh and Majid Sarrafzadeh Department of Electrical and Computer Engineering Northwestern University kastner,elib,majid@ece.northwestern.edu ABSTRACT

More information

Further Studies on Placement Optimality

Further Studies on Placement Optimality 1 Further Studies on Placement Optimality Jason Cong, Michalis Romesis, Joseph R. Shinnerl, and Min Xie UCLA Computer Science Dept. Abstract Recent studies of the quality of results produced by leading

More information

Lecture 3: Art Gallery Problems and Polygon Triangulation

Lecture 3: Art Gallery Problems and Polygon Triangulation EECS 396/496: Computational Geometry Fall 2017 Lecture 3: Art Gallery Problems and Polygon Triangulation Lecturer: Huck Bennett In this lecture, we study the problem of guarding an art gallery (specified

More information

Non Uniform On Chip Power Delivery Network Synthesis Methodology

Non Uniform On Chip Power Delivery Network Synthesis Methodology Non Uniform On Chip Power Delivery Network Synthesis Methodology Patrick Benediktsson Institution of Telecomunications, University of Lisbon, Portugal Jon A. Flandrin Institution of Telecomunications,

More information

Graph Models for Global Routing: Grid Graph

Graph Models for Global Routing: Grid Graph Graph Models for Global Routing: Grid Graph Each cell is represented by a vertex. Two vertices are joined by an edge if the corresponding cells are adjacent to each other. The occupied cells are represented

More information

On the Rectangle Escape Problem

On the Rectangle Escape Problem CCCG 2013, Waterloo, Ontario, August 8 10, 2013 On the Rectangle Escape Problem Sepehr Assadi Ehsan Emamjomeh-Zadeh Sadra Yazdanbod Hamid Zarrabi-Zadeh Abstract Motivated by a bus routing application,

More information

S 1 S 2. C s1. C s2. S n. C sn. S 3 C s3. Input. l k S k C k. C 1 C 2 C k-1. R d

S 1 S 2. C s1. C s2. S n. C sn. S 3 C s3. Input. l k S k C k. C 1 C 2 C k-1. R d Interconnect Delay and Area Estimation for Multiple-Pin Nets Jason Cong and David Zhigang Pan Department of Computer Science University of California, Los Angeles, CA 90095 Email: fcong,pang@cs.ucla.edu

More information

Performance Analysis on the Target Detection of Linear Camera of CCD Vertical Target Coordinate Measurement System

Performance Analysis on the Target Detection of Linear Camera of CCD Vertical Target Coordinate Measurement System Sensors & Transducers, Vol. 17, Issue 7, July 21, pp. 285-291 Sensors & Transducers 21 y IFSA Pulishing, S. L. http://www.sensorsportal.com Performance Analysis on the Target Detection of Linear Camera

More information

2 A Plane Sweep Algorithm for Line Segment Intersection

2 A Plane Sweep Algorithm for Line Segment Intersection EECS 396/496: Computational Geometry Fall 2017 Lecturer: Huck Bennett Lecture 2: Line Segment Intersection In this lecture, we will study the problem of computing all intersections of a set of line segmentn

More information

Efficient Multilayer Routing Based on Obstacle-Avoiding Preferred Direction Steiner Tree

Efficient Multilayer Routing Based on Obstacle-Avoiding Preferred Direction Steiner Tree Efficient Multilayer Routing Based on Obstacle-Avoiding Preferred Direction Steiner Tree Ching-Hung Liu, Yao-Hsin Chou, Shin-Yi Yuan, and Sy-Yen Kuo National Taiwan University 1 Outline 2 Outline 3 The

More information

A Modified Playfair Cipher for a Large Block of Plaintext

A Modified Playfair Cipher for a Large Block of Plaintext International Journal of Computer Theory and Engineering, Vol 1, No 5, Decemer, 2009 A Modified layfair Cipher for a Large Block of laintext V Umakanta Sastry, N Ravi Shankar, and S Durga Bhavani Astract

More information

ECE260B CSE241A Winter Routing

ECE260B CSE241A Winter Routing ECE260B CSE241A Winter 2005 Routing Website: / courses/ ece260bw05 ECE 260B CSE 241A Routing 1 Slides courtesy of Prof. Andrew B. Kahng Physical Design Flow Input Floorplanning Read Netlist Floorplanning

More information

High-Density Integration of Functional Modules Using Monolithic 3D-IC Technology

High-Density Integration of Functional Modules Using Monolithic 3D-IC Technology High-Density Integration of Functional Modules Using Monolithic 3D-IC Technology Shreepad Panth 1, Kambiz Samadi 2, Yang Du 2, and Sung Kyu Lim 1 1 Dept. of Electrical and Computer Engineering, Georgia

More information

Simultaneous OPC- and CMP-Aware Routing Based on Accurate Closed-Form Modeling

Simultaneous OPC- and CMP-Aware Routing Based on Accurate Closed-Form Modeling Simultaneous OPC- and CMP-Aware Routing Based on Accurate Closed-Form Modeling Shao-Yun Fang, Chung-Wei Lin, Guang-Wan Liao, and Yao-Wen Chang March 26, 2013 Graduate Institute of Electronics Engineering

More information

An Enhanced Congestion-Driven Floorplanner

An Enhanced Congestion-Driven Floorplanner An Enhanced Congestion-Driven Floorplanner Yu-Cheng Lin 1 Shin-Jia Chen 1 1 Hsin-Hsiung Huang 2 1 Dept. of Information and Electronic Commerce, Kainan University, Taoyuan, Taiwan 2 Dept. of EE., Lunghwa

More information

CAD Algorithms. Placement and Floorplanning

CAD Algorithms. Placement and Floorplanning CAD Algorithms Placement Mohammad Tehranipoor ECE Department 4 November 2008 1 Placement and Floorplanning Layout maps the structural representation of circuit into a physical representation Physical representation:

More information

Cluster-based approach eases clock tree synthesis

Cluster-based approach eases clock tree synthesis Page 1 of 5 EE Times: Design News Cluster-based approach eases clock tree synthesis Udhaya Kumar (11/14/2005 9:00 AM EST) URL: http://www.eetimes.com/showarticle.jhtml?articleid=173601961 Clock network

More information

Eliminating Routing Congestion Issues with Logic Synthesis

Eliminating Routing Congestion Issues with Logic Synthesis Eliminating Routing Congestion Issues with Logic Synthesis By Mike Clarke, Diego Hammerschlag, Matt Rardon, and Ankush Sood Routing congestion, which results when too many routes need to go through an

More information

VLSI Physical Design: From Graph Partitioning to Timing Closure

VLSI Physical Design: From Graph Partitioning to Timing Closure Chapter 5 Global Routing Original uthors: ndrew. Kahng, Jens, Igor L. Markov, Jin Hu Chapter 5 Global Routing 5. Introduction 5.2 Terminology and Definitions 5.3 Optimization Goals 5. Representations of

More information

Agus Harjoko Lab. Elektronika dan Instrumentasi, FMIPA, Universitas Gadjah Mada, Yogyakarta, Indonesia

Agus Harjoko Lab. Elektronika dan Instrumentasi, FMIPA, Universitas Gadjah Mada, Yogyakarta, Indonesia A Comparison Study of the Performance of the Fourier Transform Based Algorithm and the Artificial Neural Network Based Algorithm in Detecting Faric Texture Defect Agus Harjoko La. Elektronika dan Instrumentasi,

More information

Linking Layout to Logic Synthesis: A Unification-Based Approach

Linking Layout to Logic Synthesis: A Unification-Based Approach Linking Layout to Logic Synthesis: A Unification-Based Approach Massoud Pedram Department of EE-Systems University of Southern California Los Angeles, CA February 1998 Outline Introduction Technology and

More information

Burn-in & Test Socket Workshop

Burn-in & Test Socket Workshop Burn-in & Test Socket Workshop IEEE March 4-7, 2001 Hilton Mesa Pavilion Hotel Mesa, Arizona IEEE COMPUTER SOCIETY Sponsored By The IEEE Computer Society Test Technology Technical Council COPYRIGHT NOTICE

More information

Multilayer Routing on Multichip Modules

Multilayer Routing on Multichip Modules Multilayer Routing on Multichip Modules ECE 1387F CAD for Digital Circuit Synthesis and Layout Professor Rose Friday, December 24, 1999. David Tam (2332 words, not counting title page and reference section)

More information

Finding Shortest Path on Land Surface

Finding Shortest Path on Land Surface Finding Shortest Path on Land Surface Lian Liu, Raymond Chi-Wing Wong Hong Kong University of Science and Technology June 14th, 211 Introduction Land Surface Land surfaces are modeled as terrains A terrain

More information