35 th Design Automation Conference Copyright 1998 ACM

Size: px
Start display at page:

Download "35 th Design Automation Conference Copyright 1998 ACM"

Transcription

1 Using Reongurable Computing Tehniques to Aelerate Problems in the CAD Domain: A Case Study with Boolean Satisability Peixin Zhong, Pranav Ashar, Sharad Malik and Margaret Martonosi Prineton University and NEC CCRL pzhong, sharad, mrm@ee.prineton.edu, ashar@rl.nj.ne.om Abstrat The Boolean satisability problem lies at the ore of several CAD appliations, inluding automati test pattern generation and logi synthesis. This paper desribes and evaluates an approah for aelerating Boolean satisability using on- gurable hardware. Our approah harnesses the inreasing speed and apaity of eld-programmable gate arrays by tailoring the SAT-solver iruit to the partiular formula being solved. This input-spei tehnique gets high performane due both to (i) a diret mapping of Boolean operations to logi gates, and (ii) large amounts of ne-grain parallelism in the impliation proessing. Overall, these strategies yields impressive speedups (>200X in many ases) ompared to urrent software approahes, and they require only modest amounts of hardware. In a broader sense, this paper alerts the hardware design ommunity to the inreasing importane of input-spei designs, and douments their promise via a quantitative study of input-spei SAT solving. 1 Introdution While hardware aelerators for CAD are not new, the inreasing apaity and speed of eld-programmable gate arrays (FPGAs) oers exibility and omputing power not available before. The hardware an be reused for dierent appliations and an be easily updated to aommodate new developments in algorithms. This paper introdues an novel approah for aelerating the Boolean satisability (SAT) problem using ongurable omputing hardware. In our implementation of the SAT problem, we have harnessed this exibility by implementing the SAT solver iruit on an \input-spei" basis. That is, a iruit is generated espeially for eah SAT formula to be solved. Sine Boolean satisability approahes are logial-operation-intensive, the hardware approah an show large advantages by mapping the SAT expressions diretly to logi gates, and by harnessing large amounts of parallelism in the evaluation of the logi. While the bulk of the ongurable omputing appliations urrently implemented have been fairly simple dataentri systoli designs, our design shows the potential of ustom omputing on diult problems with omplex ontrol strutures. We have implemented a tree searh algorithm, as opposed to the generally-simpler signal proessing pipelines ommon to ongurable omputing thus far. In addition, wehave also implemented a hardware version of nonhronologial baktraking similar to that seen in software SAT solvers [11]. This requires omplex ontrol strutures to manage the baktraking steps. These implementations demonstrate ongurable omputing's potential for impat on a muh broader set of appliations, partiularly in CAD domain, than may initially have been onsidered. 2 The SAT Problem The Boolean satisability (SAT) problem is a well-known onstraint satisfation problem with many appliations in omputer-aided design of integrated iruits, suh as test generation, logi veriation and timing analysis. Given a Boolean formula, the goal is either to nd an assignment of 0-1 values to the variables so that the formula evaluates to 1, or to establish that no suh assignment exists. The Boolean formula is typially expressed in onjuntive normal form (CNF), also alled produt-of-sums form. Eah sum term (lause) in the CNF is a sum of single literals, where a literal is a variable or its negation. In order for the entire formula to evaluate to 1, eah lause must be satised, i.e., at least one of its literals should be 1. An assignment of 0-1 values to a subset of variables (alled a partial assignment) might satisfy some lauses and leave the others undetermined. If an undetermined lause has only one unassigned literal in it, that literal must evaluate to 1 in order to satisfy the lause. In suh a ase, the orresponding variable is said to be implied to that value. A variable is onsidered free if neither assigned nor implied. A onit or ontradition arises if the same variable is implied to be dierent values. This means that the orresponding partial assignment annot be a part of any valid solution. Most urrent SAT solvers are based on the Davis-Putnam algorithm [4]. This is a baktrak searh algorithm. The basi algorithm begins from an empty assignment. It proeeds by assigning a 0 or 1 value to one free variable at a time. After eah assignment, the algorithm determines the diret and transitive impliations of that assignment on other variables. If no ontradition is deteted after the impliation proedure, the algorithm piks the next free variable, and repeats the proedure (forward searh). Otherwise, the algorithm attempts a new partial assignment by omplementing the most-reently assigned variable (baktrak). If this also leads to ontradition, this variable is reset to the free value and the next most-reently assigned variable is omplemented. The algorithm terminates when: (i) no free variables are available and no ontraditions have been enountered (a solution has been found), or (ii) it wants to baktrak beyond the rst variable, whih means all possible assignments have been exhausted and there is no solution to the problem. Determining impliations is ruial to pruning the searh spae sine it allows the algorithm to skip entire regions of the searh spae orresponding to ontraditory partial assignments. Every implied variable orresponds to one less free variable to branh on. Unfortunately, deteting impli x-98/0006/$ th Design Automation Conferene Copyright 1998 ACM DAC98-06/98 San Franiso, CA USA

2 ations in software is very slow. Eah lause ontaining the newly assigned or implied variable is sanned and updated sequentially, with the proess repeated until no new impliations are deteted. Our intuition for hardware speedup potential in the SAT algorithm stems from reognizing that the impliation proedure entral to the algorithm is both highly parallelizable and easily mapped to basi logi gates. Our entire hardware arhiteture is designed to take advantage of this parallelism. Setion 4 details our mapping of the algorithm onto reon- gurable hardware in a formula-spei manner. Reent software implementations of the SAT algorithm have enhaned it in several ways while maintaining the same basi ow [8, 3, 12, 11]. The ontribution of the GRASP work [11] is notable sine it applies nonhronologial baktraking and dynami lause addition to prune the searh spae further. Very signiant improvements in run time are reported. Reognizing nonhronologial baktraking as an important feature, Setion 5 shows how it an be mapped to reongurable hardware with signiant improvements in run time. 3 The Promise of Congurable Computing Speial-purpose hardware aelerators for CAD problems are not new. This approah, however, usually suers from long development time and high ost. The growth of eld programmable logi devies allows appliation-spei hardware without the osts of fabriation. The SRAM-based programmable logi devies (suh as Xilinx FPGAs) allow the same hip to be reprogrammed to dierent iruits an unlimited number of times. One suessful appliation of suh devies is logi emulation. A logi design is mapped to an FPGA array and simulated before it is atually fabriated. An emulator is many orders of magnitude faster than simulation on a general purpose omputer. The development of FPGAs has also led to the advent of eld-programmable ustom omputing mahines. The abundant programmable logi omponents and routing resoures are used to form speial-purpose omputers exploiting ne-grain parallelism. Suh ustom omputing mahines have ahieved very high performane in signal proessing, geneti analysis, and ryptography appliations. Previous work mostly onentrated on data-entri problems with relatively simple logi ontrol. Inreasing density in programmable logi opens up exiting opportunities for hardware aeleration for omplex omputations. As feature sizes shrink even further, 1M-gate FPGAs will be feasible by roughly the year 2001 [10]. With programmability and high integration densities, we an reate a mahine speialized not only to the appliation but also to the input data assoiated with a spei problem. Our hoie of Boolean satisability (SAT) as a ase study here was guided by two main fators: It is a searh-intensive appliation, as opposed to dataintensive appliations with simple ontrol found in many previous studies of ongurable omputing. It is also a basi NP-omplete problem that is ore to many important problems. Gaining experiene with this problem will help us aelerate a wide range of appliations. It ontains a signiant amount of bit-level logi operations. This makes it very amenable for mapping to ongurable hardware. At this time, we know of three interesting proposals for solving SAT using reongurable hardware [13, 1, 9]. Suyama et al. [13] have proposed their own SAT algorithm - instead of the Davis-Putnam approah - for implementation in reongurable hardware. The searh uses full assignment to explore the spae. In their paper, they aknowledge that this is not as eient as a baktrak searh algorithm. The proposal by Abramovii and Saab [1] is also not an implementation of the Davis-Putnam algorithm. Their proposal basially amounts to an implementation of a PODEMbased [6] algorithm in reongurable hardware. Therefore it is diult to handle a generi CNF formula diretly. On the other hand, our approah is more general and an exploit heuristis not available to PODEM. Rashid et al. [9] implement small instanes of the PO- DEM algorithm on a Xilinx XC6216. While mapping and ompilation issues are presented in detail, there are urrently no performane results for their approah. 4 FPGA Mapping: Basi Baktrak Searh 4.1 Hardware Organization Our hardware implementation has two parts: (i) the impliation iruit (ii) a state mahine to manage the baktrak searh. Given a SAT formula, a VHDL desription of the hardware is generated automatially. The speedup over software arises from our implementation of the impliation iruit. It nds all diret impliations of newly-assigned or newly-implied variables in a single lok yle. Consequently, all transitive impliations of a new variable assignment an be determined in a few yles. Sine all lauses are examined in one yle, parallelism is quite onsiderable in pratie. For the aim yes1-1 example in the DIMACS benhmark suite, this approah yields an average of 41.9 eetive lause evaluations per yle. b out out out dout e out out a state a imp astate a imp D Q D Q GClear a out LContra a out LChange Figure 1: Ciruit for impliation and onit detetion. Figure 1 shows the details of the impliation iruit. The state of eah variable in the iruit is enoded into two bits, (v; v). A free literal is (0; 0). The 1 value of the variable is represented by (1;0) and the 0 value by (0;1). Impliations are easy to determine. For a lause with n literals, if n-1 literals are 0, the last literal is implied to 1. In our enoding, given the lauses ( + a)( d + a)( b + + a)(e +a)( +a), a is implied by + d + b and a is implied by e+, as shown in Figure 1. If both v and v are implied or assigned to be true, there is a ontradition. No solution exists with this partial assignment.

3 GContra GChange GClear E il E ir /10 init E il (v imp+v imp)/01 E il(v imp+v imp)/00 GContra/10 ative 1 ative 0 v i-1 v i v i+1 Clok GChange GContra GClear E il E ol E ir Clk LChange LContra LClear E or GChange LChange GContra LContra GClear LClear E il E or E ol E ir Clk GChange LChange GContra LContra GClear LClear E il E or E ol E ir Clk E ir/10 passive 1 passive 0 Figure 2: Global Ciruit Topology The transitive impliation may inlude yli loops. Sine the impliation is monotoni, hanging from 0 to 1 due to our 2-bit enoding, it will never ause osillation. A D ip- op is used with eah literal, so new impliations will propagate one level in one yle. The LChange signal heks new value hanges. Impliations nish when no new hanges are found. In order to avoid heavy ommuniation with a host omputer, searh ontrol is also implemented in hardware. A state mahine is implemented for eah variable and they are onneted aording to the order of searh, as in Figure 2. Eah box in the gure ontains the impliation iruit and the state mahine for eah variable. This distributed topology keeps global signals to a minimum and redues hardware osts. Atany instant, only one state mahine is in ontrol. One that state mahine has nished proessing, it asserts Eor to transfer ontrol to the state mahine on the right (if progressing forward in the omputation) or it asserts Eol to pass ontrol to the left (if baktraking). Eah state mahine is aware of whether its variable has been assigned, implied or is free. The state mahine for a single variable is shown in Figure 3. The ve states in the state mahine are enoded by three bits. Two bits orrespond to the values of the positive and negative literals of its variable. The third bit indiates whether this partiular state mahine is ative. The inputs to eah state mahine are the Enable signals from its left and right neighbors, and the global ontradition (GContra) and hange (GChange) signals. State mahine outputs are the enable signals, Eol and Eor that pass ontrol to the left or right. System operation is fairly simple. After initialization, all the state mahines are in the init state. When a state mahine reeives an enable signal, it hanges its states. If ontrol is transferred from the left, it an only be in the init state. If this variable's value is not implied, it asserts value 1(ative 1) and determines all the transitive impliations of that assignment. If a ontradition is deteted, it will try the value 0 instead (ative 0) and repeat the impliation step. If the impliation settles without a ontradition, it transfers ontrol to the state mahine on its right and it transitions to the passive 1 state. From ative 0, ifa ontradition is deteted, it will baktrak. Its state is reset to init and the ontrol is passed to the left. On the other hand, if impliations settle without a ontradition, then it moves from ative 0 to passive 0 and passes the ontrol to the right. If, as above, ontrol is transferred to a state mahine from the left, but its variable's value has already been implied by a previous assignment, then it merely passes ontrol to the state mahine on its right on the next lok yle and re- Figure 3: State Diagram for Baktraking Mahine mains in the init state. Its output value will be maintained by the impliation. If ontrol is transferred to a state mahine from the right, the system is baktraking. If the variable has been implied by some previous assignment, the state mahine is in the init state now. It simply passes ontrol to the variable on its left. If the variable urrently has the assigned value 1(passive 1), then it is hanged to 0 (ative 0). The impliations are omputed. If the variable already has the assigned value 0, we have already tried both possible values for this variable. In this ase, the state mahine resets to init and baktraks further by transferring ontrol to the left. Finding a Solution A solution has been found when the rightmost state mahine further attempts to pass ontrol to the right. No solution exists if the leftmost state mahine attempts to baktrak by further passing ontrol to the left. 4.2 Performane Results In this setion we ompare the performane of our hardware implementation of the basi Davis-Putnam SAT algorithm to its implementation in GRASP [11]. In order to estimate the hardware performane, we use simulation to ount the exat number of hardware lok yles needed to solve eah problem and divide by lok rate to ompute the run time. We have used a VHDL model to verify design orretness on moderate examples. However, sine VHDL simulation an be slow, we have generated a C-language hardware model that is signiantly faster then general VHDL simulation. We use this for performane estimation of longrunning problems. Like a VHDL simulation, the C-language simulation is also apable of ounting the number of hardware lok yles required exatly. The number of hardware lok yles is translated into run time based on a lok rate of 1.33 MHz. This is a lok rate ahieved on a large example on an IKOS FPGA board [7] as detailed in the setion on hardware implementation issues. The GRASP runs were timed on a Sun 5 workstation with 64 MB of RAM and a 110 MHz proessor. As disussed earlier, our hardware implementation urrently requires variables be ordered statially prior to the hardware implementation. Our ordering strategy plaes earlier the variables with more appearanes in the formula. We ompared the performane for all the examples in the DIMACS SAT benhmark suite [5]. There are 240 problems in the DIMACS suite. Some of them take avery long time to solve and neither GRASP nor our C-model simulation gets

4 a result in reasonable amount of time (several hours). There are 134 instanes in whih at least one program nished. Instanes v =1 12 v =1 11 v 12=0 v 11=0 v =0 10 v 1 =1 v 2 =0 v 3 =0 v =0 1 v =1 14 v 12 =1 v =1 13 v =1 10 v = Figure 5: Searh example for basi baktraking. Speed up ratio Figure 4: Speedup ratio histogram omparing our FPGA approah to a GRASP run, both with basi baktraking. In all examples for whih the implementation of the basi Davis-Putnam algorithm ompletes in GRASP, our hardware implementation also ompletes usually in muh shorter time. Figure 4 shows a histogram in whih eah bar orresponds to a speedup range. Speedup is dened as the ratio of the GRASP run time to the hardware run time. (That is, a 10X speedup means that our approah is ten times faster than GRASP.) The height of the bar orresponds to the number of DIMACS SAT examples for whih the orresponding speedup is obtained. The histogram indiates that more than 90% of the examples have speedup greater than 20X and more than 45% of the examples have speedup greater than 100X. The results learly demonstrate that utilizing parallelism by diret logi mapping in the SAT algorithm ahieves signiant speedup even when the hardware lok is muh slower than that of the general purpose omputer. 5 FPGA Mapping: Nonhronologial Baktrak 5.1 Algorithm When the Davis-Putnam algorithm baktraks to the most reently assigned variable, its baktraking is said to be hronologial. On the other hand, when an algorithm jumps over several previously-assigned variables to a variable more than one level above the urrent variable, the baktraking is said to be nonhronologial. In order to jump diretly to a previous level, the algorithm must rst determine that no ombination of values on the skipped variables will result in a satisfying assignment. GRASP is a reent SAT implementation with nonhronologial baktraking. The GRASP work demonstrated that nonhronologial baktraking an lead to signiant redutions in run time. We desribe an algorithm to ahieve a similar goal in hardware, but with less ontrol omplexity than GRASP. GRASP maintains a data struture alled the impliation graph, from whih one derives the transitive impliations leading to the ontradition. From the ontradition, it traes bak to a set of assignments ontributing to the ontradition. It jumps to the most reently assigned variable in the set. It is diult, however, to traverse the impliation iruit in hardware. As a result, we devised the following alternative. Rather than analyzing the impliations, we take advantage of the fat that determining impliations is very fast in our hardware. When a ontradition ours for both assignments of the variable, it should baktrak. If we reset this variable to be free, the observed ontradition will disappear, but may reur when the same variable is assigned a value in the future. We want tohange the value of the variable that really ontributes to the observed ontradition. If a ontradition was deteted for both assignments to vi, with i being its level, our proedure works its way bak up the levels, one at a time. At eah level j (< i), the proedure alls the impliation routine twie, one for eah vi value, while vj has a ipped value. The variables vk between i and j (j < k < i) are left unassigned when the impliation routine is alled, i.e. both literals vk and vk are set to 0. The algorithm must baktrak to vjif a noontradition ase was found for one of the two assignments. If both ause ontradition, the variable vj an be skipped, and the proedure repeats this step for variable vj,1. This proedure requires only 2n alls to the impliation routine if the algorithm reverts bak bynlevels. Sine the proedure does not all the impliation routine an exponential number of times, and sine the impliation hardware is very fast, this analysis for nonhronologial baktraking an be expeted to be very fast also. In omparison, GRASP has the overhead of maintaining the impliation graph data struture and analyzing it when a ontradition is found, but it does not do the 2n impliations. The following simple example shows how our approah works. Suppose we have the following formula: (v1 0 + v v 13)(v1 0 +v12 0 +v13)(v v 12 +v 14)(v1 0 +v 12 +v14) 0. Assume that no assignment for variables v 2, v 11 leads to a ontradition. The basi searh tree for this formula is shown in Figure 5. In the gure, eah solid arrow represents assigning a new value and determining its impliations. It begins by assigning v 1 = 1 and goes on until v 11 = 1. Finally, v 12 =1 is tried. This leads to a ontradition sine v 13 is required to be both 0 and 1 simultaneously. Similarly, v 12 = 0 also leads to a ontradition. The normal baktrak proedure would reset v 12 to unknown and it would baktrak tov 11. The onit disappears temporarily but will appear again whenever v 12 is set. Figure 6(a) shows our version of nonhronologial baktraking. Figure 6(b) shows how GRASP diretly baktraks to v 1. The dotted arrows indiate skipping and do

5 v 11=0 v 3 =0 v 10=0 v 1 =1 v 2 =0 v 1 =0 v 1 =0 v 1 =1 v 1 =0 Instanes v 12=1 n v 12 =1 v 13=1 v 12=1 v 12 =1 v 13=1 5 (a) v 14=1 Figure 6: Searh examples for nonhronologial baktraking. (a) Algorithm in our hardware algorithm. (b) Software algorithm used in GRASP. v 14=1 (b) Speed up ratio Figure 8: Speedup ratio histogram for nonhronologial baktraking leaf 0 leaf 1 GContra/10 ative 0 previously. Although the state mahine has beome bigger, the extra states require only one more enoding bit. Sine most of the iruit is the same and sine the state mahine is not instrumental in determining the lok yle time, the yle time from the basi algorithm remains valid here. E (v +v )/01 il imp imp E /10 ir E il E il(v imp+v imp )/00 init ative 1 E /00 ir passive 1 bk 0 a bk 0 b GContra/10 passive 0 E /10 ir 5.3 Performane Results We ompare our runtime with nonhronologial baktraking to a GRASP implementation with nonhronologial baktraking. Similar to Figure 4, the results are graphed in Figure 8. Due to the improved performane, there are 149 problems that nished with either GRASP of our approah. While GRASP's more ompliated nonhronologial baktraking allows it to garner a bigger improvement from this tehnique than our hardware does, we still obtain a median speedup over GRASP of 63.5X. From the full DIMACS suite, we oer 100X or greater speedups on 63 of the examples. Figure 7: State diagram for nonhronologial baktraking mahine not inur any omputation; they simply show what values are assumed before an impliation alulation. For both methods, the searh is the same before reahing v 12. Baktraking diers, however. When baktraking at eah level, our impliation proedure is alled twie. That is, we baktrak to v 1after eetively alling the impliation proedure only 22 times, rather than 4082 attempts with the basi baktraking approah. While this simple example is mainly pedagogi, dramati speedup potential also exists in real benhmarks. For example, in the aim yes1-1 benhmark from the SAT suite, hronologial baktraking visits 606,578 partial assignments; the nonhronologial baktraking algorithm only visits 1384 partial assignments. 5.2 Hardware Organization The only modiations required to implement our nonhronologial baktraking algorithm are to the state mahine. No new global signals are added and the impliation iruit is not modied. The state mahine to perform the nonhronologial baktraking is shown in Figure 7. It has more states and transitions to reet the omplex baktraking proedure outlined 6 Implementation Issues 6.1 Envisioned Usage Depending on the problem harateristis, the ongurable omputing implementation an often oer signiant speedups over a software SAT solver. In order for the aeleration to be useful, however, it must oer performane advantages even after hardware ompile time and onguration time are onsidered. For this reason, we envision that the on- gurable hardware tehniques will mainly be used on SAT problems with very long GRASP run-times (hours or days) or in ases where GRASP aborts. In suh ases, the hardware synthesis times required will be aeptable. Furthermore, urrent ommerial CAD tools for FPGAs are quite general and therefore quite slow in this usage; it is not unommon for synthesis and plae-and-route times to exeed one hour. Sine our designs are quite regular and easy-toroute, however, ompiler tools speialized for our template design ould be muh faster, and ould thus broaden the range of problems for whih hardware aeleration is useful. Overall, we envision a system where easy SAT problems are still solved in software. Very large problems, ones that often timeout today, will invoke the hardware ompiler and ongure an FPGA board to assist in solving them. The envisioned hardware platform is a high-performane workstation with an FPGA board attahed via an exter-

6 nal onnetion or the I/O bus. Easy problems are handled by software running on the workstation itself, while longerrunning problems are handled by the FPGA board. Requirements for the FPGA board itself are partly a funtion of the hardware size of the problems being solved; this is disussed in the following subsetion. 6.2 Hardware Resoures Hardware requirements for our approah are a funtion of the size and omplexity of the formula being solved. The hardware usage is estimated in terms of CLBs of Xilinx XC4000 series FPGAs.[14]. The SAT solver's CLB requirements varies widely with the formula to be solved. Aross the DIMACS benhmarks, the median CLB requirements are Overall, relatively few (less than 30%) of these problems will t within the about 2000 CLB limit of urrent FPGA hips. However, 202 of 240 problems require fewer than CLBs and should t on fewer than ten FPGAs. Partitioning the design aross FPGAs is straightforward beause of the regular topology used. In our experiments, we have used the partitioning and pin multiplexing tehniques in the Ikos SLI logi emulation system originally developed as part of the MIT Virtual Wires eort [2, 7]. This partitioning software an multiplex several inter-hip signals to use the same physial pins, thereby irumventing the pin limitations that an often limit the CLB utilization of FPGA designs. Furthermore, the software performs this multiplexing with minimal impat on hardware yle times. We have ompiled many of the DIMACS problems for the IKOS system. Their lok rate ranges from 700KHz to 2 MHz. We use a 1.33 MHz lok rate as a typial value for performane omparison. If SAT is implemented on a diretly-onneted array, the lok rate is expeted to be 10 to 20 MHz. 7 Future Work As evidened by the organization of our paper, we began our work by implementing the basi Davis-Putnam proedure in reongurable hardware. Reent software implementations of the Davis-Putnam proedure have grown to be very sophistiated however [11]. To beat them onsistently, our implementation should math that sophistiation. We zeroed in on nonhronologial baktraking as a key requirement and have been able to implement it with a small inrease in hardware. The GRASP implementation still ontains two additional features that our implementation does not: 1. The ability to add lauses on the y plays a signiant role in GRASP's ability to solve hard problems. Variable relationships derived in the searh help prune searh tree in the future. It is hard to hange the iruit and reroute the FPGA in the middle of the algorithm. We are working on low ost ways of implementing this feature based on multiple onguration ontexts. 2. The ability tohoose the next deision variable on the y also ontributes to the eieny of the software implementations. While this is very simple to do in software, it an be very hardware intensive in reon- gurable hardware, as evidened by the Abramovii and Saab proposal [1]. In pratie, we have found that a good stati ordering of variables works quite well. If dynami ordering an provide signiant improvement, we will implement itaswell. 8 Conlusions Overall, the ontributions of this paper are two-fold. First, we provide a system design for formula-spei Boolean satisability solutions based on a ongurable hardware implementation. Our design's hardware requirements are quite modest ompared to other reent proposals. Moreover, our hardware performane results indiate that the ongurable hardware approah oers dramati improvements over even the best urrent software-based tehniques. We have observed speedups of well over 200X ompared to software SAT solutions on several of the DIMACS SAT benhmarks. The seond ontribution of this paper is muh broader. We present our SAT-solver as a ase study of a lass of inputspei ongurable hardware appliations. These appliations aggressively harness the inreasing integration levels and reongurability of urrent FPGAs by performing template-driven hardware designs for eah problem/data set being solved. Their amenability to parameterized, automated design makes it easy and fast to ompile ongurations for them. Overall, this paper alerts the hardware design ommunity to an inreasingly-important appliation style, and douments its signiant promise via a detailed ase study on formula-spei SAT-solving. Referenes [1] M. Abramovii and D. Saab. Satisability on Reongurable Hardware. In Seventh Intl. Workshop on Field Programmable Logi and Appliations, Sept [2] J. Babb, R. Tessier, and A. Agarwal. Virtual Wires: Overoming pin limitations in FPGA-based logi emulators. In Pro. IEEE Workshop on FPGA-based Custom Computing Mahines, pages 142{151, Apr [3] S. Chakradhar, V. Agrawal, and S. Rothweiler. A transitive losure algorithm for test generation. IEEE Trans. on Computer-Aided Design of Integrated Ciruits and Systems, 12(7):1015{1028, July [4] M. Davis and H. Putnam. A Computing Proedure for Quantiation Theory. Journal of the ACM, 7:201{215, [5] DIMACS. DIMACS Challenge benhmarks and UCSC benhmarks. Available at ftp://dimas.rutgers.edu/pub/hallenge/sat/benhmarks/nf. [6] P. Goel. An Impliit Enumeration Algorithm to Generate Tests for Combinational Logi Ciruits. IEEE Tran. on Computers, C30(3):215{222, Marh [7] IKOS Systems. Virtual Logi SLI Doumentation. Version 1.6. [8] T. Larrabee. Test Pattern Generation Using Boolean Satis- ability. In IEEE Trans. on Computer-Aided Design, volume 11, pages 4{15, January [9] A. Rashid, J. Leonard, and W. H. Mangione-Smith. Dynami Ciruit Generation for Solving Spei Problem Instanes of Boolean Satisability. In Proeedings IEEE Workshop on FPGA-based Custom Computing Mahines, Apr [10] J. Rose and D. Hill. Arhitetural and Physial Design Challenges for One Million Gate FPGAs and Beyond. In Pro ACM/SIGDA Fifth Intl. Symp. on Field- Programmable Gate Arrays, Feb [11] J. Silva and K. Sakallah. GRASP-A New Searh Algorithm for Satisability. In IEEE ACM Intl. Conf. on CAD-96, pages 220{227, Nov [12] P. Stephan, R. Brayton, and A. Sangiovanni-Vinentelli. Combinational Test Generation Using Satisability. Department of Eletrial Engineering and Computer Sienes, University of California at Berkeley, UCB/ERL Memo M92/112. [13] T. Suyama, M. Yokoo, and H. Sawada. Solving Satisability Problems on FPGAs. In 6th Int'l Workshop on Field- Programmable Logi and Appliations, Sept [14] Xilinx Corp. The Programmable Logi Data Book. Xilinx Corp. San Jose, CA, 1996.

Pipelined Multipliers for Reconfigurable Hardware

Pipelined Multipliers for Reconfigurable Hardware Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,

More information

Partial Character Decoding for Improved Regular Expression Matching in FPGAs

Partial Character Decoding for Improved Regular Expression Matching in FPGAs Partial Charater Deoding for Improved Regular Expression Mathing in FPGAs Peter Sutton Shool of Information Tehnology and Eletrial Engineering The University of Queensland Brisbane, Queensland, 4072, Australia

More information

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2 On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,

More information

Improved Circuit-to-CNF Transformation for SAT-based ATPG

Improved Circuit-to-CNF Transformation for SAT-based ATPG Improved Ciruit-to-CNF Transformation for SAT-based ATPG Daniel Tille 1 René Krenz-Bååth 2 Juergen Shloeffel 2 Rolf Drehsler 1 1 Institute of Computer Siene, University of Bremen, 28359 Bremen, Germany

More information

Approximate logic synthesis for error tolerant applications

Approximate logic synthesis for error tolerant applications Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu

More information

Constructing Transaction Serialization Order for Incremental. Data Warehouse Refresh. Ming-Ling Lo and Hui-I Hsiao. IBM T. J. Watson Research Center

Constructing Transaction Serialization Order for Incremental. Data Warehouse Refresh. Ming-Ling Lo and Hui-I Hsiao. IBM T. J. Watson Research Center Construting Transation Serialization Order for Inremental Data Warehouse Refresh Ming-Ling Lo and Hui-I Hsiao IBM T. J. Watson Researh Center July 11, 1997 Abstrat In typial pratie of data warehouse, the

More information

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425) Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University

More information

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem Calulation of typial running time of a branh-and-bound algorithm for the vertex-over problem Joni Pajarinen, Joni.Pajarinen@iki.fi Otober 21, 2007 1 Introdution The vertex-over problem is one of a olletion

More information

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Malaysian Journal of Computer Siene, Vol 10 No 1, June 1997, pp 36-41 A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Md Rafiqul Islam, Harihodin Selamat and Mohd Noor Md Sap Faulty of Computer Siene and

More information

Outline: Software Design

Outline: Software Design Outline: Software Design. Goals History of software design ideas Design priniples Design methods Life belt or leg iron? (Budgen) Copyright Nany Leveson, Sept. 1999 A Little History... At first, struggling

More information

Extracting Partition Statistics from Semistructured Data

Extracting Partition Statistics from Semistructured Data Extrating Partition Statistis from Semistrutured Data John N. Wilson Rihard Gourlay Robert Japp Mathias Neumüller Department of Computer and Information Sienes University of Strathlyde, Glasgow, UK {jnw,rsg,rpj,mathias}@is.strath.a.uk

More information

CleanUp: Improving Quadrilateral Finite Element Meshes

CleanUp: Improving Quadrilateral Finite Element Meshes CleanUp: Improving Quadrilateral Finite Element Meshes Paul Kinney MD-10 ECC P.O. Box 203 Ford Motor Company Dearborn, MI. 8121 (313) 28-1228 pkinney@ford.om Abstrat: Unless an all quadrilateral (quad)

More information

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays nalysis of input and output onfigurations for use in four-valued D programmable logi arrays J.T. utler H.G. Kerkhoff ndexing terms: Logi, iruit theory and design, harge-oupled devies bstrat: s in binary,

More information

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering A Novel Bit Level Time Series Representation with Impliation of Similarity Searh and lustering hotirat Ratanamahatana, Eamonn Keogh, Anthony J. Bagnall 2, and Stefano Lonardi Dept. of omputer Siene & Engineering,

More information

Definitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework

Definitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework EECS 33 There be Dragons here http://ziyang.ees.northwestern.edu/ees33/ Teaher: Offie: Email: Phone: L477 Teh dikrp@northwestern.edu 847 467 2298 Today s material might at first appear diffiult Perhaps

More information

Exploring the Commonality in Feature Modeling Notations

Exploring the Commonality in Feature Modeling Notations Exploring the Commonality in Feature Modeling Notations Miloslav ŠÍPKA Slovak University of Tehnology Faulty of Informatis and Information Tehnologies Ilkovičova 3, 842 16 Bratislava, Slovakia miloslav.sipka@gmail.om

More information

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines The Minimum Redundany Maximum Relevane Approah to Building Sparse Support Vetor Mahines Xiaoxing Yang, Ke Tang, and Xin Yao, Nature Inspired Computation and Appliations Laboratory (NICAL), Shool of Computer

More information

This fact makes it difficult to evaluate the cost function to be minimized

This fact makes it difficult to evaluate the cost function to be minimized RSOURC LLOCTION N SSINMNT In the resoure alloation step the amount of resoures required to exeute the different types of proesses is determined. We will refer to the time interval during whih a proess

More information

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study What are Cyle-Stealing Systems Good For? A Detailed Performane Model Case Study Wayne Kelly and Jiro Sumitomo Queensland University of Tehnology, Australia {w.kelly, j.sumitomo}@qut.edu.au Abstrat The

More information

Space- and Time-Efficient BDD Construction via Working Set Control

Space- and Time-Efficient BDD Construction via Working Set Control Spae- and Time-Effiient BDD Constrution via Working Set Control Bwolen Yang Yirng-An Chen Randal E. Bryant David R. O Hallaron Computer Siene Department Carnegie Mellon University Pittsburgh, PA 15213.

More information

Performance Benchmarks for an Interactive Video-on-Demand System

Performance Benchmarks for an Interactive Video-on-Demand System Performane Benhmarks for an Interative Video-on-Demand System. Guo,P.G.Taylor,E.W.M.Wong,S.Chan,M.Zukerman andk.s.tang ARC Speial Researh Centre for Ultra-Broadband Information Networks (CUBIN) Department

More information

The AMDREL Project in Retrospective

The AMDREL Project in Retrospective The AMDREL Projet in Retrospetive K. Siozios 1, G. Koutroumpezis 1, K. Tatas 1, N. Vassiliadis 2, V. Kalenteridis 2, H. Pournara 2, I. Pappas 2, D. Soudris 1, S. Nikolaidis 2, S. Siskos 2, and A. Thanailakis

More information

Cross-layer Resource Allocation on Broadband Power Line Based on Novel QoS-priority Scheduling Function in MAC Layer

Cross-layer Resource Allocation on Broadband Power Line Based on Novel QoS-priority Scheduling Function in MAC Layer Communiations and Networ, 2013, 5, 69-73 http://dx.doi.org/10.4236/n.2013.53b2014 Published Online September 2013 (http://www.sirp.org/journal/n) Cross-layer Resoure Alloation on Broadband Power Line Based

More information

Parametric Abstract Domains for Shape Analysis

Parametric Abstract Domains for Shape Analysis Parametri Abstrat Domains for Shape Analysis Xavier RIVAL (INRIA & Éole Normale Supérieure) Joint work with Bor-Yuh Evan CHANG (University of Maryland U University of Colorado) and George NECULA (University

More information

HEXA: Compact Data Structures for Faster Packet Processing

HEXA: Compact Data Structures for Faster Packet Processing Washington University in St. Louis Washington University Open Sholarship All Computer Siene and Engineering Researh Computer Siene and Engineering Report Number: 27-26 27 HEXA: Compat Data Strutures for

More information

mahines. HBSP enhanes the appliability of the BSP model by inorporating parameters that reet the relative speeds of the heterogeneous omputing omponen

mahines. HBSP enhanes the appliability of the BSP model by inorporating parameters that reet the relative speeds of the heterogeneous omputing omponen The Heterogeneous Bulk Synhronous Parallel Model Tiani L. Williams and Rebea J. Parsons Shool of Computer Siene University of Central Florida Orlando, FL 32816-2362 fwilliams,rebeag@s.uf.edu Abstrat. Trends

More information

Direct-Mapped Caches

Direct-Mapped Caches A Case for Diret-Mapped Cahes Mark D. Hill University of Wisonsin ahe is a small, fast buffer in whih a system keeps those parts, of the ontents of a larger, slower memory that are likely to be used soon.

More information

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality INTERNATIONAL CONFERENCE ON MANUFACTURING AUTOMATION (ICMA200) Multi-Piee Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality Stephen Stoyan, Yong Chen* Epstein Department of

More information

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and

More information

Gray Codes for Reflectable Languages

Gray Codes for Reflectable Languages Gray Codes for Refletable Languages Yue Li Joe Sawada Marh 8, 2008 Abstrat We lassify a type of language alled a refletable language. We then develop a generi algorithm that an be used to list all strings

More information

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks Abouberine Ould Cheikhna Department of Computer Siene University of Piardie Jules Verne 80039 Amiens Frane Ould.heikhna.abouberine @u-piardie.fr

More information

Evaluation of Benchmark Performance Estimation for Parallel. Fortran Programs on Massively Parallel SIMD and MIMD. Computers.

Evaluation of Benchmark Performance Estimation for Parallel. Fortran Programs on Massively Parallel SIMD and MIMD. Computers. Evaluation of Benhmark Performane Estimation for Parallel Fortran Programs on Massively Parallel SIMD and MIMD Computers Thomas Fahringer Dept of Software Tehnology and Parallel Systems University of Vienna

More information

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Arne Hamann, Razvan Rau, Rolf Ernst Institute of Computer and Communiation Network Engineering Tehnial University of Braunshweig,

More information

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1.

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1. Fuzzy Weighted Rank Ordered Mean (FWROM) Filters for Mixed Noise Suppression from Images S. Meher, G. Panda, B. Majhi 3, M.R. Meher 4,,4 Department of Eletronis and I.E., National Institute of Tehnology,

More information

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks International Journal of Advanes in Computer Networks and Its Seurity IJCNS A Load-Balaned Clustering Protool for Hierarhial Wireless Sensor Networks Mehdi Tarhani, Yousef S. Kavian, Saman Siavoshi, Ali

More information

On the Generation of Multiplexer Circuits for Pass Transistor Logic

On the Generation of Multiplexer Circuits for Pass Transistor Logic Preprint from Proeedings of DATE 2, Paris, rane, Marh 2 On the Generation of Multiplexer Ciruits for Pass Transistor Logi Christoph Sholl Bernd Beker Institute of Computer Siene Albert Ludwigs University

More information

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints Smooth Trajetory Planning Along Bezier Curve for Mobile Robots with Veloity Constraints Gil Jin Yang and Byoung Wook Choi Department of Eletrial and Information Engineering Seoul National University of

More information

Accommodations of QoS DiffServ Over IP and MPLS Networks

Accommodations of QoS DiffServ Over IP and MPLS Networks Aommodations of QoS DiffServ Over IP and MPLS Networks Abdullah AlWehaibi, Anjali Agarwal, Mihael Kadoh and Ahmed ElHakeem Department of Eletrial and Computer Department de Genie Eletrique Engineering

More information

1 The Knuth-Morris-Pratt Algorithm

1 The Knuth-Morris-Pratt Algorithm 5-45/65: Design & Analysis of Algorithms September 26, 26 Leture #9: String Mathing last hanged: September 26, 27 There s an entire field dediated to solving problems on strings. The book Algorithms on

More information

Detection and Recognition of Non-Occluded Objects using Signature Map

Detection and Recognition of Non-Occluded Objects using Signature Map 6th WSEAS International Conferene on CIRCUITS, SYSTEMS, ELECTRONICS,CONTROL & SIGNAL PROCESSING, Cairo, Egypt, De 9-31, 007 65 Detetion and Reognition of Non-Oluded Objets using Signature Map Sangbum Park,

More information

A Novel Validity Index for Determination of the Optimal Number of Clusters

A Novel Validity Index for Determination of the Optimal Number of Clusters IEICE TRANS. INF. & SYST., VOL.E84 D, NO.2 FEBRUARY 2001 281 LETTER A Novel Validity Index for Determination of the Optimal Number of Clusters Do-Jong KIM, Yong-Woon PARK, and Dong-Jo PARK, Nonmembers

More information

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications System-Level Parallelism and hroughput Optimization in Designing Reonfigurable Computing Appliations Esam El-Araby 1, Mohamed aher 1, Kris Gaj 2, arek El-Ghazawi 1, David Caliga 3, and Nikitas Alexandridis

More information

Multi-Channel Wireless Networks: Capacity and Protocols

Multi-Channel Wireless Networks: Capacity and Protocols Multi-Channel Wireless Networks: Capaity and Protools Tehnial Report April 2005 Pradeep Kyasanur Dept. of Computer Siene, and Coordinated Siene Laboratory, University of Illinois at Urbana-Champaign Email:

More information

Department of Electrical and Computer Engineering University of Wisconsin Madison. Fall

Department of Electrical and Computer Engineering University of Wisconsin Madison. Fall Department of Eletrial and Computer Engineering University of Wisonsin Madison ECE 553: Testing and Testable Design of Digital Systems Fall 2014-2015 Assignment #2 Date Tuesday, September 25, 2014 Due

More information

Multiple Assignments

Multiple Assignments Two Outputs Conneted Together Multiple Assignments Two Outputs Conneted Together if (En1) Q

More information

Interconnection Styles

Interconnection Styles Interonnetion tyles oftware Design Following the Export (erver) tyle 2 M1 M4 M5 4 M3 M6 1 3 oftware Design Following the Export (Client) tyle e 2 e M1 M4 M5 4 M3 M6 1 e 3 oftware Design Following the Export

More information

SVC-DASH-M: Scalable Video Coding Dynamic Adaptive Streaming Over HTTP Using Multiple Connections

SVC-DASH-M: Scalable Video Coding Dynamic Adaptive Streaming Over HTTP Using Multiple Connections SVC-DASH-M: Salable Video Coding Dynami Adaptive Streaming Over HTTP Using Multiple Connetions Samar Ibrahim, Ahmed H. Zahran and Mahmoud H. Ismail Department of Eletronis and Eletrial Communiations, Faulty

More information

Zippy - A coarse-grained reconfigurable array with support for hardware virtualization

Zippy - A coarse-grained reconfigurable array with support for hardware virtualization Zippy - A oarse-grained reonfigurable array with support for hardware virtualization Christian Plessl Computer Engineering and Networks Lab ETH Zürih, Switzerland plessl@tik.ee.ethz.h Maro Platzner Department

More information

Gradient based progressive probabilistic Hough transform

Gradient based progressive probabilistic Hough transform Gradient based progressive probabilisti Hough transform C.Galambos, J.Kittler and J.Matas Abstrat: The authors look at the benefits of exploiting gradient information to enhane the progressive probabilisti

More information

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department

More information

Query Optimization for Structured Documents Based on Knowledge. on the Document Type Denition. Institute of Inf. Systems. ETH Zentrum Dolivostr.

Query Optimization for Structured Documents Based on Knowledge. on the Document Type Denition. Institute of Inf. Systems. ETH Zentrum Dolivostr. Query Optimization for Strutured Douments Based on Knowledge on the Doument Type Denition Klemens Bohm Karl Aberer Institute of Inf. Systems GMD{IPSI ETH Zentrum Dolivostr. 15 8092 Zurih, Switzerland 64293

More information

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating Capturing Large Intra-lass Variations of Biometri Data by Template Co-updating Ajita Rattani University of Cagliari Piazza d'armi, Cagliari, Italy ajita.rattani@diee.unia.it Gian Lua Marialis University

More information

DECT Module Installation Manual

DECT Module Installation Manual DECT Module Installation Manual Rev. 2.0 This manual desribes the DECT module registration method to the HUB and fan airflow settings. In order for the HUB to ommuniate with a ompatible fan, the DECT module

More information

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT?

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? 3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? Bernd Girod, Peter Eisert, Marus Magnor, Ekehard Steinbah, Thomas Wiegand Te {girod eommuniations Laboratory, University of Erlangen-Nuremberg

More information

A {k, n}-secret Sharing Scheme for Color Images

A {k, n}-secret Sharing Scheme for Color Images A {k, n}-seret Sharing Sheme for Color Images Rastislav Luka, Konstantinos N. Plataniotis, and Anastasios N. Venetsanopoulos The Edward S. Rogers Sr. Dept. of Eletrial and Computer Engineering, University

More information

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advaned Researh in Computer Siene and Software Engineering Researh Paper Available online at: www.ijarsse.om A New-Fangled Algorithm

More information

Trajectory Tracking Control for A Wheeled Mobile Robot Using Fuzzy Logic Controller

Trajectory Tracking Control for A Wheeled Mobile Robot Using Fuzzy Logic Controller Trajetory Traking Control for A Wheeled Mobile Robot Using Fuzzy Logi Controller K N FARESS 1 M T EL HAGRY 1 A A EL KOSY 2 1 Eletronis researh institute, Cairo, Egypt 2 Faulty of Engineering, Cairo University,

More information

Make your process world

Make your process world Automation platforms Modion Quantum Safety System Make your proess world a safer plae You are faing omplex hallenges... Safety is at the heart of your proess In order to maintain and inrease your ompetitiveness,

More information

M32: A Constructive Multilevel Logic Synthesis System*

M32: A Constructive Multilevel Logic Synthesis System* M32: A Construtive Multilevel Logi Synthesis System* Vitor N. Kravets Karem A. Sakallah Department of Eletrial Engineering and Computer Siene University of Mihigan, Ann Arbor, MI 48109 {vkravets, karem}@ees.umih.edu

More information

Performance Improvement of TCP on Wireless Cellular Networks by Adaptive FEC Combined with Explicit Loss Notification

Performance Improvement of TCP on Wireless Cellular Networks by Adaptive FEC Combined with Explicit Loss Notification erformane Improvement of TC on Wireless Cellular Networks by Adaptive Combined with Expliit Loss tifiation Masahiro Miyoshi, Masashi Sugano, Masayuki Murata Department of Infomatis and Mathematial Siene,

More information

RAC 2 E: Novel Rendezvous Protocol for Asynchronous Cognitive Radios in Cooperative Environments

RAC 2 E: Novel Rendezvous Protocol for Asynchronous Cognitive Radios in Cooperative Environments 21st Annual IEEE International Symposium on Personal, Indoor and Mobile Radio Communiations 1 RAC 2 E: Novel Rendezvous Protool for Asynhronous Cognitive Radios in Cooperative Environments Valentina Pavlovska,

More information

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks Unsupervised Stereosopi Video Objet Segmentation Based on Ative Contours and Retrainable Neural Networks KLIMIS NTALIANIS, ANASTASIOS DOULAMIS, and NIKOLAOS DOULAMIS National Tehnial University of Athens

More information

Alleviating DFT cost using testability driven HLS

Alleviating DFT cost using testability driven HLS Alleviating DFT ost using testability driven HLS M.L.Flottes, R.Pires, B.Rouzeyre Laboratoire d Informatique, de Robotique et de Miroéletronique de Montpellier, U.M. CNRS 5506 6 rue Ada, 34392 Montpellier

More information

Exploiting Enriched Contextual Information for Mobile App Classification

Exploiting Enriched Contextual Information for Mobile App Classification Exploiting Enrihed Contextual Information for Mobile App Classifiation Hengshu Zhu 1 Huanhuan Cao 2 Enhong Chen 1 Hui Xiong 3 Jilei Tian 2 1 University of Siene and Tehnology of China 2 Nokia Researh Center

More information

DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Euncheol Kim, Gwan Choi, Mark Yeary *

DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Euncheol Kim, Gwan Choi, Mark Yeary * DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Eunheol Kim, Gwan Choi, Mark Yeary * Dept. of Eletrial Engineering, Texas A&M University, College Station, TX-77840

More information

References. December 1992, pp. 71 { 81. pp.457{467. Magazine, June for very large high throughput database systems,"

References. December 1992, pp. 71 { 81. pp.457{467. Magazine, June for very large high throughput database systems, the overall working time for other appliations. In ase, data ltering was the only appliation being run, then using distributed indexing, we an serve 00 times as many requests. 6 Conlusion We have explored

More information

Chapter 2: Introduction to Maple V

Chapter 2: Introduction to Maple V Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard

More information

Parallelizing Frequent Web Access Pattern Mining with Partial Enumeration for High Speedup

Parallelizing Frequent Web Access Pattern Mining with Partial Enumeration for High Speedup Parallelizing Frequent Web Aess Pattern Mining with Partial Enumeration for High Peiyi Tang Markus P. Turkia Department of Computer Siene Department of Computer Siene University of Arkansas at Little Rok

More information

Boosted Random Forest

Boosted Random Forest Boosted Random Forest Yohei Mishina, Masamitsu suhiya and Hironobu Fujiyoshi Department of Computer Siene, Chubu University, 1200 Matsumoto-ho, Kasugai, Aihi, Japan {mishi, mtdoll}@vision.s.hubu.a.jp,

More information

splitting tehniques that partition live ranges have been proposed to solve both the spilling problem[5][8] and the assignment problem[8][9]. The parti

splitting tehniques that partition live ranges have been proposed to solve both the spilling problem[5][8] and the assignment problem[8][9]. The parti Load/Store Range Analysis for Global Register Alloation Priyadarshan Kolte and Mary Jean Harrold Department of Computer Siene Clemson University Abstrat Live range splitting tehniques divide the live ranges

More information

Dynamic Algorithms Multiple Choice Test

Dynamic Algorithms Multiple Choice Test 3226 Dynami Algorithms Multiple Choie Test Sample test: only 8 questions 32 minutes (Real test has 30 questions 120 minutes) Årskort Name Eah of the following 8 questions has 4 possible answers of whih

More information

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing デンソーテクニカルレビュー Vol. 15 2010 特集 Road Border Reognition Using FIR Images and LIDAR Signal Proessing 高木聖和 バーゼル ファルディ Kiyokazu TAKAGI Basel Fardi ヘンドリック ヴァイゲル Hendrik Weigel ゲルド ヴァニーリック Gerd Wanielik This paper

More information

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks A Dual-Hamiltonian-Path-Based Multiasting Strategy for Wormhole-Routed Star Graph Interonnetion Networks Nen-Chung Wang Department of Information and Communiation Engineering Chaoyang University of Tehnology,

More information

Reduced-Complexity Column-Layered Decoding and. Implementation for LDPC Codes

Reduced-Complexity Column-Layered Decoding and. Implementation for LDPC Codes Redued-Complexity Column-Layered Deoding and Implementation for LDPC Codes Zhiqiang Cui 1, Zhongfeng Wang 2, Senior Member, IEEE, and Xinmiao Zhang 3 1 Qualomm In., San Diego, CA 92121, USA 2 Broadom Corp.,

More information

Semi-Supervised Affinity Propagation with Instance-Level Constraints

Semi-Supervised Affinity Propagation with Instance-Level Constraints Semi-Supervised Affinity Propagation with Instane-Level Constraints Inmar E. Givoni, Brendan J. Frey Probabilisti and Statistial Inferene Group University of Toronto 10 King s College Road, Toronto, Ontario,

More information

1. Introduction. 2. The Probable Stope Algorithm

1. Introduction. 2. The Probable Stope Algorithm 1. Introdution Optimization in underground mine design has reeived less attention than that in open pit mines. This is mostly due to the diversity o underground mining methods and omplexity o underground

More information

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT 1 ZHANGGUO TANG, 2 HUANZHOU LI, 3 MINGQUAN ZHONG, 4 JIAN ZHANG 1 Institute of Computer Network and Communiation Tehnology,

More information

Series/1 GA File No i=:: IBM Series/ Battery Backup Unit Description :::5 ~ ~ >-- ffi B~88 ~0 (] II IIIIII

Series/1 GA File No i=:: IBM Series/ Battery Backup Unit Description :::5 ~ ~ >-- ffi B~88 ~0 (] II IIIIII Series/1 I. (.. GA34-0032-0 File No. 51-10 a i=:: 5 Q 1 IBM Series/1 4999 Battery Bakup Unit Desription B88 0 (] o. :::5 >-- ffi "- I II1111111111IIIIII1111111 ---- - - - - ----- --_.- Series/1 «h: ",

More information

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application World Aademy of Siene, Engineering and Tehnology 8 009 Performane of Histogram-Based Skin Colour Segmentation for Arms Detetion in Human Motion Analysis Appliation Rosalyn R. Porle, Ali Chekima, Farrah

More information

Concurrent Timing Optimization of. Hong-Yean Hsieh, Wentai Liu, Ralph K. Cavin. Electronics Research Laboratory. North Carolina State University

Concurrent Timing Optimization of. Hong-Yean Hsieh, Wentai Liu, Ralph K. Cavin. Electronics Research Laboratory. North Carolina State University Conurrent Timing Optimization of Lath-Based Digital Systems 1 Hong-Yean Hsieh, Wentai Liu, Ralph K. Cavin Eletronis Researh Laboratory Department of Eletrial and Computer Engineering North Carolina State

More information

Acoustic Links. Maximizing Channel Utilization for Underwater

Acoustic Links. Maximizing Channel Utilization for Underwater Maximizing Channel Utilization for Underwater Aousti Links Albert F Hairris III Davide G. B. Meneghetti Adihele Zorzi Department of Information Engineering University of Padova, Italy Email: {harris,davide.meneghetti,zorzi}@dei.unipd.it

More information

Dr.Hazeem Al-Khafaji Dept. of Computer Science, Thi-Qar University, College of Science, Iraq

Dr.Hazeem Al-Khafaji Dept. of Computer Science, Thi-Qar University, College of Science, Iraq Volume 4 Issue 6 June 014 ISSN: 77 18X International Journal of Advaned Researh in Computer Siene and Software Engineering Researh Paper Available online at: www.ijarsse.om Medial Image Compression using

More information

Z8530 Programming Guide

Z8530 Programming Guide Z8530 Programming Guide Alan Cox alan@redhat.om Z8530 Programming Guide by Alan Cox Copyright 2000 by Alan Cox This doumentation is free software; you an redistribute it and/or modify it under the terms

More information

Facility Location: Distributed Approximation

Facility Location: Distributed Approximation Faility Loation: Distributed Approximation Thomas Mosibroda Roger Wattenhofer Distributed Computing Group PODC 2005 Where to plae ahes in the Internet? A distributed appliation that has to dynamially plae

More information

COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY

COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY Dileep P, Bhondarkor Texas Instruments Inorporated Dallas, Texas ABSTRACT Charge oupled devies (CCD's) hove been mentioned as potential fast auxiliary

More information

Automated System for the Study of Environmental Loads Applied to Production Risers Dustin M. Brandt 1, Celso K. Morooka 2, Ivan R.

Automated System for the Study of Environmental Loads Applied to Production Risers Dustin M. Brandt 1, Celso K. Morooka 2, Ivan R. EngOpt 2008 - International Conferene on Engineering Optimization Rio de Janeiro, Brazil, 01-05 June 2008. Automated System for the Study of Environmental Loads Applied to Prodution Risers Dustin M. Brandt

More information

Bayesian Belief Networks for Data Mining. Harald Steck and Volker Tresp. Siemens AG, Corporate Technology. Information and Communications

Bayesian Belief Networks for Data Mining. Harald Steck and Volker Tresp. Siemens AG, Corporate Technology. Information and Communications Bayesian Belief Networks for Data Mining Harald Stek and Volker Tresp Siemens AG, Corporate Tehnology Information and Communiations 81730 Munih, Germany fharald.stek, Volker.Trespg@mhp.siemens.de Abstrat

More information

PROJECT PERIODIC REPORT

PROJECT PERIODIC REPORT FP7-ICT-2007-1 Contrat no.: 215040 www.ative-projet.eu PROJECT PERIODIC REPORT Publishable Summary Grant Agreement number: ICT-215040 Projet aronym: Projet title: Enabling the Knowledge Powered Enterprise

More information

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION Ken Sauer and Charles A. Bouman Department of Eletrial Engineering, University of Notre Dame Notre Dame, IN 46556, (219) 631-6999 Shool of

More information

High Speed Area Efficient VLSI Architecture for DCT using Proposed CORDIC Algorithm

High Speed Area Efficient VLSI Architecture for DCT using Proposed CORDIC Algorithm International Journal of Innovative Researh in Siene, Engineering and Tehnology Website: www.ijirset.om High Speed Area Effiient VLSI Arhiteture for DCT using Proposed CORDIC Algorithm Deepnarayan Sinha

More information

OvidSP Quick Reference Card

OvidSP Quick Reference Card OvidSP Quik Referene Card Searh in any of several dynami modes, ombine results, apply limits, use improved researh tools, develop strategies, save searhes, set automati alerts and RSS feeds, share results...

More information

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization Self-Adaptive Parent to Mean-Centri Reombination for Real-Parameter Optimization Kalyanmoy Deb and Himanshu Jain Department of Mehanial Engineering Indian Institute of Tehnology Kanpur Kanpur, PIN 86 {deb,hjain}@iitk.a.in

More information

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks Flow Demands Oriented Node Plaement in Multi-Hop Wireless Networks Zimu Yuan Institute of Computing Tehnology, CAS, China {zimu.yuan}@gmail.om arxiv:153.8396v1 [s.ni] 29 Mar 215 Abstrat In multi-hop wireless

More information

Implication Circuit Gclear FD LContra FD LChange x state x state GChange GContra E il E ol FSM E or E ir Figure 1: Zhong's Search Block for variable x

Implication Circuit Gclear FD LContra FD LChange x state x state GChange GContra E il E ol FSM E or E ir Figure 1: Zhong's Search Block for variable x Reducing Compilation Time of Zhong's FPGA-based SAT solver Pak K. Chan, M.J. Boyd, S. Goren, K. Klenk, V. Kodavati, R. Kundu, M. Margolese, J. Sun, K. Suzuki, E. Thorne, X. Wang, J. Xu, M. Zhu Department

More information

arxiv: v1 [cs.db] 13 Sep 2017

arxiv: v1 [cs.db] 13 Sep 2017 An effiient lustering algorithm from the measure of loal Gaussian distribution Yuan-Yen Tai (Dated: May 27, 2018) In this paper, I will introdue a fast and novel lustering algorithm based on Gaussian distribution

More information

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS Proeedings of the COST G-6 Conferene on Digital Audio Effets (DAFX-), Verona, Italy, Deember 7-9, INTERPOLATED AND WARPED -D DIGITAL WAVEGUIDE MESH ALGORITHMS Vesa Välimäki Lab. of Aoustis and Audio Signal

More information

Design Implications for Enterprise Storage Systems via Multi-Dimensional Trace Analysis

Design Implications for Enterprise Storage Systems via Multi-Dimensional Trace Analysis Design Impliations for Enterprise Storage Systems via Multi-Dimensional Trae Analysis Yanpei Chen, Kiran Srinivasan, Garth Goodson, Randy Katz University of California, Berkeley, NetApp In. {yhen2, randy}@ees.berkeley.edu,

More information

Divide-and-conquer algorithms 1

Divide-and-conquer algorithms 1 * 1 Multipliation Divide-and-onquer algorithms 1 The mathematiian Gauss one notied that although the produt of two omplex numbers seems to! involve four real-number multipliations it an in fat be done

More information

Design of High Speed Mac Unit

Design of High Speed Mac Unit Design of High Speed Ma Unit 1 Harish Babu N, 2 Rajeev Pankaj N 1 PG Student, 2 Assistant professor Shools of Eletronis Engineering, VIT University, Vellore -632014, TamilNadu, India. 1 harishharsha72@gmail.om,

More information

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8 Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introdution... 1 1.1. Internet Information...2 1.2. Internet Information Retrieval...3 1.2.1. Doument Indexing...4 1.2.2. Doument Retrieval...4

More information