RALP:Reconvergence-Aware Layer Partitioning For 3D FPGAs*
|
|
- Dominick Bridges
- 5 years ago
- Views:
Transcription
1 RALP:Reconvergence-Aware Layer Partitioning For 3D s* Qingyu Liu 1, Yuchun Ma 1, Yu Wang 2, Wayne Luk 3, Jinian Bian 1 1 Department of Computer Science and Technology, Tsinghua University, Beijing, China 2 Department of Electronic and Engineer, Tsinghua University, Beijing, China 3 Imperial College, London, United Kingdom myc@mail.tsinghua.edu.cn Abstract In 3D designs, the circuit elements are distributed among multiple layers. Therefore, the partition strategies will influence the optimization of the entire design. Without the layout information, it is quite difficult to evaluate the effect of partitioning before placement. As a prior estimation model, reconvergence has shown its efficiency to estimate wire length before placement in 2D designs. However, when it comes to 3D, the traditional method is no longer applicable due to the change of routing architecture. In this paper, we propose a novel prior estimator called 3D-reconvergence to evaluate wire length of the netlists in 3D designs. A reconvergenceaware layer partition (RALP) algorithm for 3D design is proposed. Experimental results show that our partitioning approach could lead to better physical layout results. Compared with the traditional min-cut based partitioning approach, the design flow with RALP can obtain better routing results by reducing 7.06% wire length and 4.86% delay for 2-layer designs, 4.71% wire length and 4.73% delay for 3-layer designs. Keywords Reconvergence; Partition; 3D; I. INTRODUCTION Nowadays, three dimensional (3D) ICs have shown significant performance improvements compared with their 2D counterparts. For design area, both industrial and academic communities now are also focusing on the 3D designs. However, comparing with the traditional 2D designs, though 3D s have good quality in delay, wire length and so on [5][7][8][9], the specific routing architecture with multiple stacking layers also invokes the challenges for the layout optimization. As shown in Fig.1(a), the architecture of traditional is composed of two main parts: routing resource (SB: switching box; CB: connecting box) and logic resource (CLB: configure logic block). The CLBs are connected to the horizontal and vertical channels with a CB. Connections are switched at the intersection of horizontal and vertical channels by a SB [3]. For 3D which stacks multiple layers, as shown in Fig. 1(b), to switch connections from a layer to another, some SBs must be extended to 3D-SBs with through-silicon vias (TSVs) to realize the communications between layers [2]. Normally, there are several TSVs clustered in each 3D-SB. The architecture shown in [12] utilized 20 TSVs for a 3D-SB. Given the specific architecture of chips, the distribution of 3D-SBs is fixed. And normally only parts of the SBs on the chip are 3D-SBs due to the fabrication cost. Therefore, the physical This work is supportted by NSFC /13/$ IEEE layout optimization is seriously restricted by the distribution of 3D-SBs. Fig. 1. architecture Normally in 3D designs, there is a fundamental process before the placement called partitioning in which the blocks are assigned to the specific layers. With the constraints of the 3D- SBs, the partitions among layers influence the later design process of placement and routing (P&R) greatly. There are several researches focusing on the layout optimization with layer partitioning. TPR [5][6] and 3D MEANDER [7] are two major synthesis frameworks targeting 3D so far. Both of them contain the layer partition step based on min-cut approach, which tries to minimize the number of used TSVs. However, as shown in Fig. 1(b), both the number and distribution of TSVs clustered in 3D-SBs in 3D s are initially fixed. As shown in Fig.2, K. Siozios[8] quantifies the utilization of fabricated TSVs regarding the 20 largest MCNC benchmarks and it shows that there are just about 46% TSVs used, and 54% TSVs are still vacant. Therefore, instead of minimizing the number of the vertical connections, the partitioning approach should take advantage of existed TSVs to improve the designs. Though the layer partition algorithms proposed in [8] and [9] are not based on min-cut approach, their partitioning schemes are realized by either iterative heuristic optimization or simulated annealing, which may consume much time but cannot guarantee the quality of the results. What s more, most of the previous approaches are focusing on the number of the vertical connections. In fact, the partitions influence the placement and routing greatly. Even with the same number of vertical connections, different partitions may lead to different layout results. However, due to the lack of the layout information before placement, it is difficult to consider the layout optimization during the partitioning stage.
2 TSV constraints: The cut nets in layer partition step will utilize TSVs to realize communications between different layers in P&R. As fabricated TSVs are fixed in 3D, the number of utilized TSVs in a layer partition result should not exceed the total quantity of fabricated TSVs. Unlike the previous works whose objectives are to minimize the number of nets being cut, in this paper, we introduce an extra objective to improve the performance during the partitioning process. Fig. 2. Utilization of TSVs in 3D In this paper, we propose a novel prior wirelength estimator called 3D-reconvergence which could be used to evaluate different partitions before placement. Actually the term reconvergence has been used in 2D designs to estimate wire length before placement [10][11]. But with multiple layers stacked, the original definition of reconvergence is no longer suitable for 3D s. By representing the partitioned netlists considering the vertical connections, we propose a new estimator 3D-reconvergence accordingly. We also propose a reconvergence-aware layer partition algorithm which could provide better layer partition result for the later-on placement and routing. The contributions of this paper are summarized as follow: A new estimator 3D-reconvergence is proposed to reflect the relationship between reconvergence and wire length in 3D designs. A novel reconvergence-aware layer partition algorithm (RALP) based on 3D-reconvergence is proposed to provide better layer partition results for P&R in 3D designs. RALP could fit in 3D design flow smoothly and experimental results show that our approach obtain better result compared with traditional partitioning approach by considering the wirelength estimation at the early design stage. II. PROBLEM FORMULATION Layer partitioning for 3D is a physical design process in which a netlist of circuit blocks is mapped onto multiple chip layers. The input netlist is modeled as a hypergraph H(V, E). In the hypergraph each vertices corresponds to a logic block and each hyperedge corresponds to a net. Layer partitioning contains two steps: partition and assignment. The partition step first partitions the input netlist into multiple parts and the assignment then assigns each part to a layer in 3D. A good layer partition strategy should favor P&R under following constraints: Resource balance: As a general 3D basically stacks multiple identical 2D layers together, the resource used such as the number of utilized CLBs in different layers should be balanced to minimize the waste of the chip area resource. III. 3D-RECONVERGENCE To evaluate the partitioning results` performance without detailed layout information, a prior estimator is needed. Reconvergence [10] is used efficiently to estimate wire length before placement in 2D. However, the method is no longer applicable for 3D design. New techniques are needed to signify the relationship between reconvergence and wire length in 3D to guide the layer partitioning. A. Reconvergence for 2D Reconvergence occurs when multiple fan outs from a single node in the circuit branch back together at a later point. As shown in Fig.3, there is a reconvergence between node A and F. Based on this idea, [10] first put forwards the definition of reconvergence as follows: Definition 1. R : Reconvergence is defined to originate on a node x, if branches of multiple fan outs of x join later at a node y. The reconvergence will be referred by the name R xy. x is the origin and y is the destination of the R xy. For sequential circuits, the circuit is divided into different sequential levels at the flip-flop boundaries, and reconvergences of a sequential circuit are only within same sequential levels. In R, for each path p, the length of the path is noted as l (p). For R, the most important characteristic is its average path length. We could define it as the value of R, namely RV. RV l( p), p N R [10] uses R to generate test circuits. [11] uses R to estimate wire length before placement for 2D. In [11] the weight of every node is related to RV which it belongs to and a net`s length is linked to weights of its nodes. [11] has shown its efficiency. It achieves an average error around 10% w.r.t. semiperimeter wire length measured from the optimized layout using VPR. It suggests reconvergence dictates a kind of forces exerted by cells on each other. If there are a lot of reconvergences in a circuit, cells will compete with each other for sites in the layout so as to minimize the wire length in the placement. Therefore, a netlist with many complex reconvergences is hard to be placed. Since the larger reconvergence value indicates the possible larger wirelength of the layout, for 3D s, we expect our layer partition algorithm to partition a netlist into parts in a way which could reduce and simplify reconvergences in each part. However, during the partition process, the cut parts in the
3 netlist are connected by vertical connections and therefore the sub netlists on different layers are not completely separated with each other. Their placements depend on each other. Therefore, the traditional method on a planar space is no longer applicable due to that it is unable to reflect the vertical connection of these partitioned reconvergences. We need new models to estimate the wire length of the partitioned reconvergences so as to obtain a layer partition algorithm which favors P&R. Fig. 3. Different placement results of R AF B. 3D-reconvergence Since the vertical connections play an important role in partitions, we need to model it properly in our formulation. The physical location of the TSVs among layers constrains the layout of the blocks. The cut on the same reconvergence will lead to several TSVs which are required to be near to each other to favor the later-on placement and routing. Most placement algorithms for 3D place blocks layer by layer. As shown in Fig. 3, R AF is required to be placed on a 2-layer and we place layer 0 first. If TSVs are not close to each other as shown in Fig. 3(a), the optimal placement on layer 0 could lead to a bad result for layer 1 with total wire length of 8. Nevertheless, if we cluster TSVs in R AF, it could shorten wire length from 8 to 6 under the same placement strategy as shown in Fig. 3(b). Especially, there are indeed several TSVs clustered in each 3D- SB in the architecture. Furthermore, the proposed 3D `s architecture in [12] has 20 TSVs for a 3D-SB. Therefore, to represent the cut, we add a virtual node T on the netlist. As shown in Fig. 4, suppose R AF has been partitioned into two parts with AB and DE cut. A virtual node T is introduced connecting four nodes accordingly which is just representing the partition among layer. Fig. 4. 3D-Reconvergence Definition 1. Cutting node: If the R is partitioned into two parts, we add virtual nets and a virtual node T called cutting node in the netlist and the original R is turned into two small reconvergences R XT and R TY. In the real layout T stands for the TSVs in R resulted from nets cut during partition. Now with the cutting node, R AF is transformed into two small reconvergences R AT and R TF. We use R AT and R TF as a 3D-reconvergence: 3D-R AF which represents R AF after it is partitioned into sub-reconvergence. The definition of 3Dreconvergence is as follows: Definition 2. 3D-R: 3D-reconvergence is composed of several 2D reconvegences connected by cutting nodes. Suppose R is cut by cutting node T, 3D-R = {R XT, R TY}. For a reconvergence partitioned into N (N>2) parts, it is recursive defined. To evaluate the wirelength, the half parameter of the bounding box of the netlist is commonly used. In 3D designs, the wirelength is always evaluated by the bounding box of the projected elements on the planar space. With the cutting nodes representing the TSVs in the center of the netlists, we could roughly evaluate the wirelength by the largest bounding box among layers. Therefore, for each individual part on the same layer, we could estimate the wirelength according to its 2D reconveregence and the value of 3D-R (3D-RV ) should be evaluated by the maximal 2D reconvergence value among layers. Suppose 3D-R = {R XT1, R T1T2,, R TkY}, then 3D RV max( RV ), R 3D R ij Fig. 5. Different partition leads to different P&R results. In fact, different cut positions in R will form different 3D- R and also will lead to different wire length after P&R. Fig. 5 shows an example. R AF is same in Fig. 5(a) and Fig. 5(b). However, AB, AF and AD are cut in Fig. 5(a) while BC, AF and DE are cut in Fig. 5(b). This difference in layer partitioning forms different 3D-R and leads to different wire length after P&R. Fig. 5(a) tends to have a wire length of 6 while Fig. 5(b) tends to have a wire length of 5. Now we calculate 3D-RV in Fig. 5. 3D-RV for Fig. 5(a) = max(1, 7/3) = 7/3, while for Fig. 5(b) it is max(5/3, 5/3) = 5/3. This corresponds to the fact that Fig. 5(a) have larger wirelength and verifies the validity of 3D-RV. ij
4 IV. RALP FOR 3D A. Layer Partition in RALP As 3D-R with smaller value is likely to lead to a better P&R result shown in section above, the goal of RALP is to utilize limited TSV resources to cut reconvergences while at the same time to minimize the average 3D-RV of all 3Dreconvergences. As 3D-RV is determined by the maximal 2D-RV among layers, in a reconvergence, it is optimal to cut edge with the balanced sub netlists. When an edge in the reconvergence is cut, the corresponding path will be cut into two parts. Suppose edge E ij is on the path p of R. If E ij is cut during partition, according to equation (1) and (2), when calculating 3D-RV, the length of p is no longer l(p), but is max{l XT(p), l TY(p)}. In this case, by cutting E ij and partitioning p into two parts, we bring a benefit of min{l(p)-l XT(p), l(p)-l TY(p)} to path p. Therefore, from this view, we could define CB Rxy ij as the cut benefit of E ij in R. CB 0, Eij R R ij min{ l( p) lxt ( p), l( p) lty ( p )}, Eij R Based on (3), we can evaluate the effect of cutting each edge. Therefore, we can assign the netlist graph with the corresponding weight on each edge by its cutting benefit. Then to reduce the estimated wire length, we are aiming at finding the optimized cut on the graph with the maximal cutting benefits. Similar to TPR, we also use hmetis [13] for layer partition due to its quality and speed. Hmetis is a software package for partitioning large hypergraphs. The weights of vertices and hyperedges in the hypergraph are two main important parameters. TPR sets the same weight for all vertices due to that all clbs have the same area and also sets the same weight for all hyperedges when layer partition. The weights for vertices in RALP are same, too. However, the weights for hyperedges are different and based on cut benefits. Since Hmetis is based on min-cut, the goal of which is to minimize the total cut weight, namely nets with small weights tend to be cut, to maximize the cut benefits, we assign the reverse of the average value of cut benefits as the edge weights according to equation (4). With a limitation that total cuts are no more than TSVs` number, equation (4) could insure the goal of RALP discussed above. Specifically, weights are calculated as follows: 1, Ni 0 W N i i, Ni 0 R CB ij 0, Ei R Ni 1, Ei R B. Layer Assignment in RALP After the nodes are partitioned into different groups, it is still not settled which layer should be assigned to each group. Normally, the objective of this assignment is to minimize the utilization of TSVs. Based on the goal, RALP assigns layer to each group by greedily choosing the best solution among n! choices if the has n layers. Most placement algorithms for 3D place blocks layer by layer. Therefore after the assignment, we still need to determine the sequence of placement. Since the positions of blocks are greatly constrained by TSVs, it is reasonable to place the part with more blocks related to TSVs first. As shown in Fig. 5(a), we should place layer with block B, C, D, E, F first. Definition 4. 3D-Ri and 3D-Ri: For 3D-R, 3D-Ri represents the number of blocks in 3D-R in part i. 3D-Ri represents the total number of blocks of all 3D-reconvergences in part i. 3D-Ri just right reflects the number of blocks related to TSVs in reconvergences. Therefore RALP also uses 3Dreconvergence to determine the sequence of placement. It depends on the values of 3D-Ri. For a part with a larger 3D-Ri, it should be placed earlier. C. Total Flow of RALP Similar to hmetis, RALP partitions a netlist into an arbitrary number of parts using recursive bisection, namely a series of 2 partitions. Using three layers partition as an example, RALP includes two phases. Firstly, we perform an unbalanced 2 partition in which one part has 2/3 of the total number of vertices, and the other part has 1/3. Secondly, we bisects the part which has 2/3 of the total number of vertices into two equal-size parts. After these two phases, we could obtain three equal-size parts, each of which is 1/3 of the initial netlist. Fig. 6. RALP flow RALP could partition a netlist into n parts for a n-layer design and then determine the sequence of placement of each layer. The first step in RALP is also the most important step, which constructs a hypergraph according to input netlist and obtains the reconvergence information. Then according to equation (4), weights are calculated for hyperedges and hmetis is utilized to do the partition job. Layer assignment is then performed. Fig. 6 shows the total flow of RALP.
5 V. EXPERIMENTAL RESULTS This section evaluates the efficiency of RALP. RALP has been implemented with C++ and is tested on Red Hat Enterprise Linux 6 with 2.30GHz Intel Core i5-2410m CPU and 1 GB memory. In the test, a total of 9 circuits from the benchmark given by [6] are evaluated on the optimal 3D architecture of SP (20, 2)[12] (similar to the architecture shown in Fig. 1(b), the distance between two adjacent 3D-SBs in X- direction is 2 and there are 20 TSVs in every 3D-SB). The P&R algorithms used for RALP results are the same as TPR. A. The estimation effect of 3D-reconvergence In order to demonstrate 3D-RV could be used to evaluate the wire length trend in 3D design, in Table Ⅰ we show half perimeter (HP) obtained after placement together with average 3D-RV value obtained before placement for two different benchmarks: alu4 and des, for 2-layer and 3-layer designs. For every benchmark, we perform experiments for 10 times. The values of HP are sorted in ascending order. In Fig. 7, we show corresponding data in graphs for 2-layer cases. From the table and the figure we could see the 3D-RV value increase with the HP values accordingly which could verify the feasibility to use 3D-reconvergence as a prior wirelength estimator before placement. B. The Optimization of RALP As shown in Fig.6, though RALP is before placement and routing, the optimization effects of partitioning approach should be evaluated by the placement and routing results. To show the partitioning effects to the final layout results, we need to consider the wirelength (WL) and delay(delay) for both the placement level and the routing level. To show the advantage of our approach, we also compare our results with TPR[5,6] which utilizes the min-cut partitioning scheme. To make a fair comparison, the placement and routing tools from two approaches are using the same algorithms and settings. Table Ⅱ shows the results when 3D has 2 layers and Table Ⅲ shows the results when layer number is 3. From these tables we can see RALP leads to an average reduction of wire length and delay about 7.06% and 4.86% with 7.60% more TSVs utilized when layer number is 2 and an average reduction of wire length and delay about 4.71% and 4.73% with 6.74% more TSVs utilized when layer number is 3. These definitely show the efficiency of RALP. As what has been discussed that the ratio of used TSVs in 3D is rather small, the cost to use more TSVs could be ignored compared with the gain of the reduction of delay and wire length. Though RALP proposes an additional estimation for the partition step, it is still very efficient and the overhead is little compared with the run time of later placement and routing stages. Table IV shows the runtime of RALP, P&R for 2-layer s. From the results we can see that the runtime of RALP occupies less than 1% of the total run time of the physical design flow. VI. CONCLUSION In this paper, we first show the important role reconvergence could play in 3D design. In order to signify the relationship between reconvergence and wire length in 3D architecture, a new model called 3D-reconvergence is proposed. Based on 3D-reconvergence, we propose a reconvergenceaware layer partition algorithm for 3D. Experimental results show that compared to existing 3D partitioning approach, average wire length and delay could be reduced by 7.06% and 4.86% for 2 layers, 4.71% and 4.73% for 3 layers, respectively. VII. REFERENCES [1] R.S. Patti, Three-dimensional integrated circuits and the future of system-on-chip designs, Proc. IEEE, June 2006, vol. 94, no. 6, pp [2] Young-Su Kwon, Payam Lajevardi, Anantha P. Chandrakasan, Frank Honore, Donald E. Troxel, A 3-D wire resource prediction model validated using a 3-D placement and routing tool, SLIP, San Francisco, April 2005, pp [3] J. Rose, S. Brown, Flexibility of interconnection structures for fieldprogrammable gate arrays, IEEE Journal of Solid-State Circuits, 1991, vol. 26, pp [4] V. Betz and J. Rose, VPR: a new packing, placement and routing tool for research, Seventh International Workshop on Field- Programmable Logic and Applications, London, UK, 1997, pp [5] C. Ababei, H. Mogal, K. Bazargan, Three-dimensional place and route for s, IEEE Trans. Computer-Aided Design Integrated Circuits and Systems, June 2006, vol. 25, no. 6, pp [6] [7] K. Siozios, A. Bartzas, D. Soudirs, Architecture-level exploration of alternative interconnection schemes targeting 3D s: a softwaresupported methodology, Int`l Journal of Reconfigurable Computing, 2008, vol [8] K. Siozios, D. Soudris, A tabu-based partitioning and layer assignment algorithm for 3-D s, Embedded Systems Letters, IEEE, 2011, vol. 3, pp [9] Wentao Sui, Sheqin Dong, Jinian Bian, Wirelength-driven forcedirected 3D placement, Pro. Of the 20 th symposium on Great lakes symposium on VLSI, May [10] [M. Hutton, Characterization and parameterized generation of digital circuits, Ph.D. dissertation, Dept. Comput. Sci., Univ. Toronto, Toronto, ON, Canada, [11] S. Balachandran, D. Bhatia, A prior wirelength and interconnect estimation based on circuit characteristics, IEEE transactions on Computer-Aided Design of Integrated Circuit and Systems, 2005, vol. 24, pp [12] [Chia-I Chen, Bau-Cheng Lee, Juinn-Dar Huang, Architectural exploration of 3D s towards a better balance between area and delay, DATE, 2011, pp [13] G. Karypis, R. Aggarwal, V. Kumar, Shashi Shekhar, Multilevel hypergraph partitioning: applications in VLSI domain, IEEE Transactions on VLSI Systems, 1999, vol. 7, pp
6 bench mark ALU4 DES arch 2-layer 3-layer 2-layer 3-layer TABLE I. 3D-RV & HP FOR ALU4 & DES experiments D-RV HP D-RV HP D-RV HP D-RV HP DRV HP 3DRV HP (a) alue4 (b) des Fig. 7 3D-RV & HP for 2-layer deisgns of alu4 and des TABLE II. EXPERIMENTAL RESULTS FOR 2-LAYER (TSV) TPR RALP Placement-level result Routing-level result TSV Placement-level result Routing-level result TSV WL Delay WL Delay number WL Delay WL Delay number Alu E E E E Misex E E E E Ex5p E E E E Apex E E E E Apex E E E E Seq E E E E Des E E E E Ex E E E E Pdc E E E E Average E E E E Improve % -5.61% -7.06% -4.86% +7.60% TABLE III. EXPERIMENTAL RESULTS FOR 3-LAYER TPR RALP Placement-level result Routing-level result TSV Placement-level result Routing-level result TSV WL Delay WL Delay number WL Delay WL Delay number Alu E E E E Misex E E E E Ex5p E E E E Apex E E E E Apex E E E E Seq E E E E Des E E E E Ex E E E E Pdc E E E E Average E E E E Improve -8.90% -1.86% -4.71% -4.73% +6.74% TABLE IV. RUNTIME (SECONDS) OF RALP ALU4 MISEX3 EX5P APEX4 APEX2 SEQ DES EX1010 PDC RALP Place Route
How Much Logic Should Go in an FPGA Logic Block?
How Much Logic Should Go in an FPGA Logic Block? Vaughn Betz and Jonathan Rose Department of Electrical and Computer Engineering, University of Toronto Toronto, Ontario, Canada M5S 3G4 {vaughn, jayar}@eecgutorontoca
More informationModular Placement for Interposer based Multi-FPGA Systems
Modular Placement for Interposer based Multi-FPGA Systems Fubing Mao 1, Wei Zhang 2, Bo Feng 3, Bingsheng He 1, Yuchun Ma 3 1 School of Computer Engineering, Nanyang Technological University, Singapore
More informationFaster Placer for Island-style FPGAs
Faster Placer for Island-style FPGAs Pritha Banerjee and Susmita Sur-Kolay Advanced Computing and Microelectronics Unit Indian Statistical Institute 0 B. T. Road, Kolkata, India email:{pritha r, ssk}@isical.ac.in
More informationDetailed Router for 3D FPGA using Sequential and Simultaneous Approach
Detailed Router for 3D FPGA using Sequential and Simultaneous Approach Ashokkumar A, Dr. Niranjan N Chiplunkar, Vinay S Abstract The Auction Based methodology for routing of 3D FPGA (Field Programmable
More informationMODULAR PARTITIONING FOR INCREMENTAL COMPILATION
MODULAR PARTITIONING FOR INCREMENTAL COMPILATION Mehrdad Eslami Dehkordi, Stephen D. Brown Dept. of Electrical and Computer Engineering University of Toronto, Toronto, Canada email: {eslami,brown}@eecg.utoronto.ca
More information160 M. Nadjarbashi, S.M. Fakhraie and A. Kaviani Figure 2. LUTB structure. each block-level track can be arbitrarily connected to each of 16 4-LUT inp
Scientia Iranica, Vol. 11, No. 3, pp 159{164 c Sharif University of Technology, July 2004 On Routing Architecture for Hybrid FPGA M. Nadjarbashi, S.M. Fakhraie 1 and A. Kaviani 2 In this paper, the routing
More informationDesigning Heterogeneous FPGAs with Multiple SBs *
Designing Heterogeneous FPGAs with Multiple SBs * K. Siozios, S. Mamagkakis, D. Soudris, and A. Thanailakis VLSI Design and Testing Center, Department of Electrical and Computer Engineering, Democritus
More informationFast Timing-driven Partitioning-based Placement for Island Style FPGAs
.1 Fast Timing-driven Partitioning-based Placement for Island Style FPGAs Pongstorn Maidee Cristinel Ababei Kia Bazargan Electrical and Computer Engineering Department University of Minnesota, Minneapolis,
More informationICS 252 Introduction to Computer Design
ICS 252 Introduction to Computer Design Placement Fall 2007 Eli Bozorgzadeh Computer Science Department-UCI References and Copyright Textbooks referred (none required) [Mic94] G. De Micheli Synthesis and
More informationAN ACCELERATOR FOR FPGA PLACEMENT
AN ACCELERATOR FOR FPGA PLACEMENT Pritha Banerjee and Susmita Sur-Kolay * Abstract In this paper, we propose a constructive heuristic for initial placement of a given netlist of CLBs on a FPGA, in order
More informationMultilevel Algorithms for Multi-Constraint Hypergraph Partitioning
Multilevel Algorithms for Multi-Constraint Hypergraph Partitioning George Karypis University of Minnesota, Department of Computer Science / Army HPC Research Center Minneapolis, MN 55455 Technical Report
More informationOn Improving Recursive Bipartitioning-Based Placement
Purdue University Purdue e-pubs ECE Technical Reports Electrical and Computer Engineering 12-1-2003 On Improving Recursive Bipartitioning-Based Placement Chen Li Cheng-Kok Koh Follow this and additional
More informationPlacement Algorithm for FPGA Circuits
Placement Algorithm for FPGA Circuits ZOLTAN BARUCH, OCTAVIAN CREŢ, KALMAN PUSZTAI Computer Science Department, Technical University of Cluj-Napoca, 26, Bariţiu St., 3400 Cluj-Napoca, Romania {Zoltan.Baruch,
More informationSynthesizable FPGA Fabrics Targetable by the VTR CAD Tool
Synthesizable FPGA Fabrics Targetable by the VTR CAD Tool Jin Hee Kim and Jason Anderson FPL 2015 London, UK September 3, 2015 2 Motivation for Synthesizable FPGA Trend towards ASIC design flow Design
More informationDouble Patterning-Aware Detailed Routing with Mask Usage Balancing
Double Patterning-Aware Detailed Routing with Mask Usage Balancing Seong-I Lei Department of Computer Science National Tsing Hua University HsinChu, Taiwan Email: d9762804@oz.nthu.edu.tw Chris Chu Department
More informationCAD Algorithms. Circuit Partitioning
CAD Algorithms Partitioning Mohammad Tehranipoor ECE Department 13 October 2008 1 Circuit Partitioning Partitioning: The process of decomposing a circuit/system into smaller subcircuits/subsystems, which
More informationA Routing Approach to Reduce Glitches in Low Power FPGAs
A Routing Approach to Reduce Glitches in Low Power FPGAs Quang Dinh, Deming Chen, Martin Wong Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign This research
More informationFloorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence
Floorplan and Power/Ground Network Co-Synthesis for Fast Design Convergence Chen-Wei Liu 12 and Yao-Wen Chang 2 1 Synopsys Taiwan Limited 2 Department of Electrical Engineering National Taiwan University,
More informationPower-Mode-Aware Buffer Synthesis for Low-Power Clock Skew Minimization
This article has been accepted and published on J-STAGE in advance of copyediting. Content is final as presented. IEICE Electronics Express, Vol.* No.*,*-* Power-Mode-Aware Buffer Synthesis for Low-Power
More informationWirelength Estimation based on Rent Exponents of Partitioning and Placement Λ
Wirelength Estimation based on Rent Exponents of Partitioning and Placement Λ Xiaojian Yang Elaheh Bozorgzadeh Majid Sarrafzadeh Computer Science Department University of California at Los Angeles Los
More informationIterative-Constructive Standard Cell Placer for High Speed and Low Power
Iterative-Constructive Standard Cell Placer for High Speed and Low Power Sungjae Kim and Eugene Shragowitz Department of Computer Science and Engineering University of Minnesota, Minneapolis, MN 55455
More informationGeneral Models for Optimum Arbitrary-Dimension FPGA Switch Box Designs
General Models for Optimum Arbitrary-Dimension FPGA Switch Box Designs Hongbing Fan Dept. of omputer Science University of Victoria Victoria B anada V8W P6 Jiping Liu Dept. of Math. & omp. Sci. University
More informationSUBMITTED FOR PUBLICATION TO: IEEE TRANSACTIONS ON VLSI, DECEMBER 5, A Low-Power Field-Programmable Gate Array Routing Fabric.
SUBMITTED FOR PUBLICATION TO: IEEE TRANSACTIONS ON VLSI, DECEMBER 5, 2007 1 A Low-Power Field-Programmable Gate Array Routing Fabric Mingjie Lin Abbas El Gamal Abstract This paper describes a new FPGA
More informationPARALLEL PERFORMANCE DIRECTED TECHNOLOGY MAPPING FOR FPGA. Laurent Lemarchand. Informatique. ea 2215, D pt. ubo University{ bp 809
PARALLEL PERFORMANCE DIRECTED TECHNOLOGY MAPPING FOR FPGA Laurent Lemarchand Informatique ubo University{ bp 809 f-29285, Brest { France lemarch@univ-brest.fr ea 2215, D pt ABSTRACT An ecient distributed
More informationVariation Aware Routing for Three-Dimensional FPGAs
Variation Aware Routing for Three-Dimensional FPGAs Chen Dong, Scott Chilstedt, and Deming Chen Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign {cdong3, chilste1,
More informationIntroduction. A very important step in physical design cycle. It is the process of arranging a set of modules on the layout surface.
Placement Introduction A very important step in physical design cycle. A poor placement requires larger area. Also results in performance degradation. It is the process of arranging a set of modules on
More informationWirelength Estimation based on Rent Exponents of Partitioning and Placement 1
Wirelength Estimation based on Rent Exponents of Partitioning and Placement 1 Xiaojian Yang, Elaheh Bozorgzadeh, and Majid Sarrafzadeh Synplicity Inc. Sunnyvale, CA 94086 xjyang@synplicity.com Computer
More informationEN2911X: Reconfigurable Computing Lecture 13: Design Flow: Physical Synthesis (5)
EN2911X: Lecture 13: Design Flow: Physical Synthesis (5) Prof. Sherief Reda Division of Engineering, rown University http://scale.engin.brown.edu Fall 09 Summary of the last few lectures System Specification
More informationA Novel Net Weighting Algorithm for Timing-Driven Placement
A Novel Net Weighting Algorithm for Timing-Driven Placement Tim (Tianming) Kong Aplus Design Technologies, Inc. 10850 Wilshire Blvd., Suite #370 Los Angeles, CA 90024 Abstract Net weighting for timing-driven
More informationMulti-level Quadratic Placement for Standard Cell Designs
CS258f Project Report Kenton Sze Kevin Chen 06.10.02 Prof Cong Multi-level Quadratic Placement for Standard Cell Designs Project Description/Objectives: The goal of this project was to provide an algorithm
More informationSatisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits
Satisfiability Modulo Theory based Methodology for Floorplanning in VLSI Circuits Suchandra Banerjee Anand Ratna Suchismita Roy mailnmeetsuchandra@gmail.com pacific.anand17@hotmail.com suchismita27@yahoo.com
More informationVdd Programmability to Reduce FPGA Interconnect Power
Vdd Programmability to Reduce FPGA Interconnect Power Fei Li, Yan Lin and Lei He Electrical Engineering Department University of California, Los Angeles, CA 90095 ABSTRACT Power is an increasingly important
More informationBasic Idea. The routing problem is typically solved using a twostep
Global Routing Basic Idea The routing problem is typically solved using a twostep approach: Global Routing Define the routing regions. Generate a tentative route for each net. Each net is assigned to a
More informationGeneric Integer Linear Programming Formulation for 3D IC Partitioning
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING (ID: 100500, REVISED VERSION) Generic Integer Linear Programming Formulation for 3D IC Partitioning Department of Electronics Engineering & Institute of Electronics
More informationOn GPU Bus Power Reduction with 3D IC Technologies
On GPU Bus Power Reduction with 3D Technologies Young-Joon Lee and Sung Kyu Lim School of ECE, Georgia Institute of Technology, Atlanta, Georgia, USA yjlee@gatech.edu, limsk@ece.gatech.edu Abstract The
More informationAutomated system partitioning based on hypergraphs for 3D stacked integrated circuits. FOSDEM 2018 Quentin Delhaye
Automated system partitioning based on hypergraphs for 3D stacked integrated circuits FOSDEM 2018 Quentin Delhaye Integrated circuits: Let s go 3D Building an Integrated Circuit (IC) Transistors to build
More informationCAD Flow for FPGAs Introduction
CAD Flow for FPGAs Introduction What is EDA? o EDA Electronic Design Automation or (CAD) o Methodologies, algorithms and tools, which assist and automatethe design, verification, and testing of electronic
More informationPushPull: Short Path Padding for Timing Error Resilient Circuits YU-MING YANG IRIS HUI-RU JIANG SUNG-TING HO. IRIS Lab National Chiao Tung University
PushPull: Short Path Padding for Timing Error Resilient Circuits YU-MING YANG IRIS HUI-RU JIANG SUNG-TING HO IRIS Lab National Chiao Tung University Outline Introduction Problem Formulation Algorithm -
More informationTECHNOLOGY MAPPING FOR THE ATMEL FPGA CIRCUITS
TECHNOLOGY MAPPING FOR THE ATMEL FPGA CIRCUITS Zoltan Baruch E-mail: Zoltan.Baruch@cs.utcluj.ro Octavian Creţ E-mail: Octavian.Cret@cs.utcluj.ro Kalman Pusztai E-mail: Kalman.Pusztai@cs.utcluj.ro Computer
More informationVERY large scale integration (VLSI) design for power
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 7, NO. 1, MARCH 1999 25 Short Papers Segmented Bus Design for Low-Power Systems J. Y. Chen, W. B. Jone, Member, IEEE, J. S. Wang,
More informationUnit 5A: Circuit Partitioning
Course contents: Unit 5A: Circuit Partitioning Kernighang-Lin partitioning heuristic Fiduccia-Mattheyses heuristic Simulated annealing based partitioning algorithm Readings Chapter 7.5 Unit 5A 1 Course
More informationA Methodology and Tool Framework for Supporting Rapid Exploration of Memory Hierarchies in FPGAs
A Methodology and Tool Framework for Supporting Rapid Exploration of Memory Hierarchies in FPGAs Harrys Sidiropoulos, Kostas Siozios and Dimitrios Soudris School of Electrical & Computer Engineering National
More informationEstimation of Wirelength
Placement The process of arranging the circuit components on a layout surface. Inputs: A set of fixed modules, a netlist. Goal: Find the best position for each module on the chip according to appropriate
More informationAn Enhanced Perturbing Algorithm for Floorplan Design Using the O-tree Representation*
An Enhanced Perturbing Algorithm for Floorplan Design Using the O-tree Representation* Yingxin Pang Dept.ofCSE Univ. of California, San Diego La Jolla, CA 92093 ypang@cs.ucsd.edu Chung-Kuan Cheng Dept.ofCSE
More informationThree-Dimensional Integrated Circuits: Performance, Design Methodology, and CAD Tools
Three-Dimensional Integrated Circuits: Performance, Design Methodology, and CAD Tools Shamik Das, Anantha Chandrakasan, and Rafael Reif Microsystems Technology Laboratories Massachusetts Institute of Technology
More informationBasic Block. Inputs. K input. N outputs. I inputs MUX. Clock. Input Multiplexors
RPack: Rability-Driven packing for cluster-based FPGAs E. Bozorgzadeh S. Ogrenci-Memik M. Sarrafzadeh Computer Science Department Department ofece Computer Science Department UCLA Northwestern University
More informationFast FPGA Routing Approach Using Stochestic Architecture
. Fast FPGA Routing Approach Using Stochestic Architecture MITESH GURJAR 1, NAYAN PATEL 2 1 M.E. Student, VLSI and Embedded System Design, GTU PG School, Ahmedabad, Gujarat, India. 2 Professor, Sabar Institute
More informationWhitespace-Aware TSV Arrangement in 3D Clock Tree Synthesis
2013 IEEE Computer Society Annual Symposium on VLSI Whitespace-Aware TSV Arrangement in 3D Clock Tree Synthesis Xin Li, Wulong Liu, Haixiao Du, Yu Wang, Yuchun Ma, Huazhong Yang Tsinghua National Laboratory
More information3-D INTEGRATED CIRCUITS (3-D ICs) are emerging
862 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 21, NO. 5, MAY 2013 Study of Through-Silicon-Via Impact on the 3-D Stacked IC Layout Dae Hyun Kim, Student Member, IEEE, Krit
More informationImplementation of Multi-Way Partitioning Algorithm
Implementation of Multi-Way Partitioning Algorithm Kulpreet S. Sikand, Sandeep S. Gill, R. Chandel, and A. Chandel Abstract This paper presents a discussion of methods to solve partitioning problems and
More informationFloorplan considering interconnection between different clock domains
Proceedings of the 11th WSEAS International Conference on CIRCUITS, Agios Nikolaos, Crete Island, Greece, July 23-25, 2007 115 Floorplan considering interconnection between different clock domains Linkai
More informationPlace and Route for FPGAs
Place and Route for FPGAs 1 FPGA CAD Flow Circuit description (VHDL, schematic,...) Synthesize to logic blocks Place logic blocks in FPGA Physical design Route connections between logic blocks FPGA programming
More informationPreclass Warmup. ESE535: Electronic Design Automation. Motivation (1) Today. Bisection Width. Motivation (2)
ESE535: Electronic Design Automation Preclass Warmup What cut size were you able to achieve? Day 4: January 28, 25 Partitioning (Intro, KLFM) 2 Partitioning why important Today Can be used as tool at many
More informationUNIVERSITY OF CALGARY. Force-Directed Partitioning Technique for 3D IC. Aysa Fakheri Tabrizi A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES
UNIVERSITY OF CALGARY Force-Directed Partitioning Technique for 3D IC by Aysa Fakheri Tabrizi A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE
More informationSYNTHETIC CIRCUIT GENERATION USING CLUSTERING AND ITERATION
SYNTHETIC CIRCUIT GENERATION USING CLUSTERING AND ITERATION Paul D. Kundarewich and Jonathan Rose Department of Electrical and Computer Engineering, University of Toronto Toronto, ON, M5S G4, Canada {kundarew,
More informationCAD Algorithms. Placement and Floorplanning
CAD Algorithms Placement Mohammad Tehranipoor ECE Department 4 November 2008 1 Placement and Floorplanning Layout maps the structural representation of circuit into a physical representation Physical representation:
More informationPartitioning. Course contents: Readings. Kernighang-Lin partitioning heuristic Fiduccia-Mattheyses heuristic. Chapter 7.5.
Course contents: Partitioning Kernighang-Lin partitioning heuristic Fiduccia-Mattheyses heuristic Readings Chapter 7.5 Partitioning 1 Basic Definitions Cell: a logic block used to build larger circuits.
More informationChapter 5 Global Routing
Chapter 5 Global Routing 5. Introduction 5.2 Terminology and Definitions 5.3 Optimization Goals 5. Representations of Routing Regions 5.5 The Global Routing Flow 5.6 Single-Net Routing 5.6. Rectilinear
More informationAn LP-based Methodology for Improved Timing-Driven Placement
An LP-based Methodology for Improved Timing-Driven Placement Qingzhou (Ben) Wang, John Lillis and Shubhankar Sanyal Department of Computer Science University of Illinois at Chicago Chicago, IL 60607 {qwang,
More informationTCG-Based Multi-Bend Bus Driven Floorplanning
TCG-Based Multi-Bend Bus Driven Floorplanning Tilen Ma Department of CSE The Chinese University of Hong Kong Shatin, N.T. Hong Kong Evangeline F.Y. Young Department of CSE The Chinese University of Hong
More informationVery Large Scale Integration (VLSI)
Very Large Scale Integration (VLSI) Lecture 6 Dr. Ahmed H. Madian Ah_madian@hotmail.com Dr. Ahmed H. Madian-VLSI 1 Contents FPGA Technology Programmable logic Cell (PLC) Mux-based cells Look up table PLA
More informationMinimizing Clock Domain Crossing in Network on Chip Interconnect
Minimizing Clock Domain Crossing in Network on Chip Interconnect Parag Kulkarni 1, Puneet Gupta 2, Rudy Beraha 3 1 Synopsys 2 UCLA 3 Qualcomm Corp. R&D Outline Motivation The Router Coloring Problem Approaches
More informationParallel Implementation of VLSI Gate Placement in CUDA
ME 759: Project Report Parallel Implementation of VLSI Gate Placement in CUDA Movers and Placers Kai Zhao Snehal Mhatre December 21, 2015 1 Table of Contents 1. Introduction...... 3 2. Problem Formulation...
More informationRoutability-Driven Bump Assignment for Chip-Package Co-Design
1 Routability-Driven Bump Assignment for Chip-Package Co-Design Presenter: Hung-Ming Chen Outline 2 Introduction Motivation Previous works Our contributions Preliminary Problem formulation Bump assignment
More informationCircuit Placement: 2000-Caldwell,Kahng,Markov; 2002-Kennings,Markov; 2006-Kennings,Vorwerk
Circuit Placement: 2000-Caldwell,Kahng,Markov; 2002-Kennings,Markov; 2006-Kennings,Vorwerk Andrew A. Kennings, Univ. of Waterloo, Canada, http://gibbon.uwaterloo.ca/ akenning/ Igor L. Markov, Univ. of
More informationToward More Efficient Annealing-Based Placement for Heterogeneous FPGAs. Yingxuan Liu
Toward More Efficient Annealing-Based Placement for Heterogeneous FPGAs by Yingxuan Liu A thesis submitted in conformity with the requirements for the degree of Master of Applied Science Graduate Department
More informationA Trade-off Oriented Placement Tool 1
A Trade-off Oriented Placement Tool Huaiyu Xu Maogang Wang Bo-Kyung Choi Majid Sarrafzadeh Computer Science Department University of California, Los Angeles, CA, 90095 huaiyu@cs.ucla.edu, mgwang@cadence.com,
More informationOptimality and Scalability Study of Existing Placement Algorithms
Optimality and Scalability Study of Existing Placement Algorithms Abstract - Placement is an important step in the overall IC design process in DSM technologies, as it defines the on-chip interconnects,
More informationFault Grading FPGA Interconnect Test Configurations
* Fault Grading FPGA Interconnect Test Configurations Mehdi Baradaran Tahoori Subhasish Mitra* Shahin Toutounchi Edward J. McCluskey Center for Reliable Computing Stanford University http://crc.stanford.edu
More informationGENETIC ALGORITHM BASED FPGA PLACEMENT ON GPU SUNDAR SRINIVASAN SENTHILKUMAR T. R.
GENETIC ALGORITHM BASED FPGA PLACEMENT ON GPU SUNDAR SRINIVASAN SENTHILKUMAR T R FPGA PLACEMENT PROBLEM Input A technology mapped netlist of Configurable Logic Blocks (CLB) realizing a given circuit Output
More informationConstraint-Driven Floorplanning based on Genetic Algorithm
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 147 Constraint-Driven Floorplanning based on Genetic Algorithm
More informationEfficient Second-Order Iterative Methods for IR Drop Analysis in Power Grid
Efficient Second-Order Iterative Methods for IR Drop Analysis in Power Grid Yu Zhong Martin D. F. Wong Dept. of Electrical and Computer Engineering Dept. of Electrical and Computer Engineering Univ. of
More informationSafeChoice: A Novel Approach to Hypergraph Clustering for Wirelength-Driven Placement
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN, 2011 1 SafeChoice: A Novel Approach to Hypergraph Clustering for Wirelength-Driven Placement Jackey Z. Yan, Chris Chu and Wai-Kei Mak Abstract This paper presents
More informationPlacement Strategies for 2.5D FPGA Fabric Architectures
Placement Strategies for 2.5D FPGA Fabric Architectures Chirag Ravishankar 3100 Logic Dr. Longmont, Colorado Email: chiragr@xilinx.com Dinesh Gaitonde 2100 Logic Dr. San Jose, California Email: dineshg@xilinx.com
More informationTiming-driven Partitioning-based Placement for Island Style FPGAs
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 24, No. 3, Mar. 2005 1 Timing-driven Partitioning-based Placement for Island Style FPGAs Pongstorn Maidee, Cristinel
More informationAn Interconnect-Centric Design Flow for Nanometer Technologies
An Interconnect-Centric Design Flow for Nanometer Technologies Jason Cong UCLA Computer Science Department Email: cong@cs.ucla.edu Tel: 310-206-2775 URL: http://cadlab.cs.ucla.edu/~cong Exponential Device
More informationWojciech P. Maly Department of Electrical and Computer Engineering Carnegie Mellon University 5000 Forbes Ave. Pittsburgh, PA
Interconnect Characteristics of 2.5-D System Integration Scheme Yangdong Deng Department of Electrical and Computer Engineering Carnegie Mellon University 5000 Forbes Ave. Pittsburgh, PA 15213 412-268-5234
More informationLinking Layout to Logic Synthesis: A Unification-Based Approach
Linking Layout to Logic Synthesis: A Unification-Based Approach Massoud Pedram Department of EE-Systems University of Southern California Los Angeles, CA February 1998 Outline Introduction Technology and
More informationGenetic Placement: Genie Algorithm Way Sern Shong ECE556 Final Project Fall 2004
Genetic Placement: Genie Algorithm Way Sern Shong ECE556 Final Project Fall 2004 Introduction Overview One of the principle problems in VLSI chip design is the layout problem. The layout problem is complex
More informationPerformance-Preserved Analog Routing Methodology via Wire Load Reduction
Electronic Design Automation Laboratory (EDA LAB) Performance-Preserved Analog Routing Methodology via Wire Load Reduction Hao-Yu Chi, Hwa-Yi Tseng, Chien-Nan Jimmy Liu, Hung-Ming Chen 2 Dept. of Electrical
More informationCOE 561 Digital System Design & Synthesis Introduction
1 COE 561 Digital System Design & Synthesis Introduction Dr. Aiman H. El-Maleh Computer Engineering Department King Fahd University of Petroleum & Minerals Outline Course Topics Microelectronics Design
More informationSymmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment
Symmetrical Buffered Clock-Tree Synthesis with Supply-Voltage Alignment Xin-Wei Shih, Tzu-Hsuan Hsu, Hsu-Chieh Lee, Yao-Wen Chang, Kai-Yuan Chao 2013.01.24 1 Outline 2 Clock Network Synthesis Clock network
More informationNon-Rectangular Shaping and Sizing of Soft Modules for Floorplan Design Improvement
Non-Rectangular Shaping and Sizing of Soft Modules for Floorplan Design Improvement Chris C.N. Chu and Evangeline F.Y. Young Abstract Many previous works on floorplanning with non-rectangular modules [,,,,,,,,,,,
More informationHeterogeneous Technology Mapping for FPGAs with Dual-Port Embedded Memory Arrays
Heterogeneous Technology Mapping for FPGAs with Dual-Port Embedded Memory Arrays Steven J.E. Wilton Department of Electrical and Computer Engineering University of British Columbia Vancouver, BC, Canada,
More informationProblem Formulation. Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets.
Clock Routing Problem Formulation Specialized algorithms are required for clock (and power nets) due to strict specifications for routing such nets. Better to develop specialized routers for these nets.
More informationA Path Based Algorithm for Timing Driven. Logic Replication in FPGA
A Path Based Algorithm for Timing Driven Logic Replication in FPGA By Giancarlo Beraudo B.S., Politecnico di Torino, Torino, 2001 THESIS Submitted as partial fulfillment of the requirements for the degree
More informationA Multicast Routing Algorithm for 3D Network-on-Chip in Chip Multi-Processors
Proceedings of the World Congress on Engineering 2018 ol I A Routing Algorithm for 3 Network-on-Chip in Chip Multi-Processors Rui Ben, Fen Ge, intian Tong, Ning Wu, ing hang, and Fang hou Abstract communication
More informationGraph Models for Global Routing: Grid Graph
Graph Models for Global Routing: Grid Graph Each cell is represented by a vertex. Two vertices are joined by an edge if the corresponding cells are adjacent to each other. The occupied cells are represented
More informationHardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur
Hardware Modeling using Verilog Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture 01 Introduction Welcome to the course on Hardware
More informationFPGA Placement by Graph Isomorphism
FPGA Placement by Graph Isomorphism Hossein Omidian Savarbaghi Dept. of Computer Eng. Islamic Azad University Science and research branch Tehran, Iran h_omidian_sa@yahoo.com Kia Bazargan Dept. of Elec.
More informationKarthik Narayanan, Santosh Madiraju EEL Embedded Systems Seminar 1/41 1
Karthik Narayanan, Santosh Madiraju EEL6935 - Embedded Systems Seminar 1/41 1 Efficient Search Space Exploration for HW-SW Partitioning Hardware/Software Codesign and System Synthesis, 2004. CODES + ISSS
More informationA New Optimal State Assignment Technique for Partial Scan Designs
A New Optimal State Assignment Technique for Partial Scan Designs Sungju Park, Saeyang Yang and Sangwook Cho The state assignment for a finite state machine greatly affects the delay, area, and testabilities
More informationAn Introduction to FPGA Placement. Yonghong Xu Supervisor: Dr. Khalid
RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS UNIVERSITY OF WINDSOR An Introduction to FPGA Placement Yonghong Xu Supervisor: Dr. Khalid RESEARCH CENTRE FOR INTEGRATED MICROSYSTEMS UNIVERSITY OF WINDSOR
More informationHypergraph Partitioning With Fixed Vertices
Hypergraph Partitioning With Fixed Vertices Andrew E. Caldwell, Andrew B. Kahng and Igor L. Markov UCLA Computer Science Department, Los Angeles, CA 90095-596 Abstract We empirically assess the implications
More informationLeakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction
44.1 Leakage Efficient Chip-Level Dual-Vdd Assignment with Time Slack Allocation for FPGA Power Reduction Yan Lin and Lei He Electrical Engineering Department University of California, Los Angeles, CA
More informationAbstraction and Optimization of Consistent Floorplanning with Pillar Block Constraints
Abstraction and Optimization of Consistent Floorplanning with Pillar Block Constraints Ning FU, Shigetoshi NAKATAKE, Yasuhiro TAKASHIMA, Yoji KAJITANI School of Environmental Engineering, University of
More informationTiming Optimization of FPGA Placements by Logic Replication
13.1 Timing Optimization of FPGA Placements by Logic Replication Giancarlo Beraudo ECE Department, University of Illinois at Chicago 851 S. Morgan St., Chicago IL, 60607 gberaudo@ece.uic.edu John Lillis
More informationIntroduction Warp Processors Dynamic HW/SW Partitioning. Introduction Standard binary - Separating Function and Architecture
Roman Lysecky Department of Electrical and Computer Engineering University of Arizona Dynamic HW/SW Partitioning Initially execute application in software only 5 Partitioned application executes faster
More informationPartitioned Branch Condition Resolution Logic
1 Synopsys Inc. Synopsys Module Compiler Group 700 Middlefield Road, Mountain View CA 94043-4033 (650) 584-5689 (650) 584-1227 FAX aamirf@synopsys.com http://aamir.homepage.com Partitioned Branch Condition
More informationMincut Placement with FM Partitioning featuring Terminal Propagation. Brett Wilson Lowe Dantzler
Mincut Placement with FM Partitioning featuring Terminal Propagation Brett Wilson Lowe Dantzler Project Overview Perform Mincut Placement using the FM Algorithm to perform partitioning. Goals: Minimize
More information