Binary Adder Architectures for Cell-Based VLSI and their Synthesis

Size: px
Start display at page:

Download "Binary Adder Architectures for Cell-Based VLSI and their Synthesis"

Transcription

1 Dss. ETH No. Bnary Adder Archtectures for Cell-Based VLSI and ther Synthess A dssertaton submtted to the SWISS FEDERAL INSTITUTE OF TECHNOLOGY ZURICH for the degree of Doctor of techncal scences presented by RETO ZIMMERMANN Dpl. Informatk-Ing. ETH born ctzen of Vechgen BE accepted on the recommendaton of Prof. Dr. W. Fchtner, examner Prof. Dr. L. Thele, co-examner Acknowledgments I would lke to thank my advsor, Prof. Wolfgang Fchtner, for hs overall support and for hs confdence n me and my work. I would also lke to thank Prof. Lothar Thele for readng and co-examnng the thess. I am greatly ndebted n Hubert Kaesln and Norbert Felber for ther encouragement and support durng the work as well as for proofreadng and commentng on my thess. I also want to express my grattude to all colleagues at the Integrated Systems Laboratory who contrbuted to the perfect workng envronment. In partcular, I want to thank the secretares for keepng the admnstraton, Hanspeter Mathys and Hansjörg Gsler the nstallatons, Chrstoph Wck and Adam Fegn the computers, and Andreas Weland the VLSI desgn tools runnng. I want to thank Hanspeter Kunz and Patrck Müller for the valuable contrbutons durng ther student projects. Also, I am grateful to Rajv Gupta, Duncan Fsher, and all other people who supported me durng my nternshp at Rockwell Semconductor Systems n Newport Beach, CA. I acknowledge the fnancal support of McroSwss, a Mcroelectroncs Program of the Swss Government. Fnally my specal thanks go to my parents for ther support durng my educaton and for ther understandng and tolerance durng the last couple of years. 997

2 Acknowledgments Contents Acknowledgments Abstract x Zusammenfassung x Introducton. : : : : : : : : : : : : : : : : : : : : : : : : : Motvaton. : : : : : : : : : : : : : : : : : : : : : : : RelatedWork. : : : : : : : : : : : : : : : : : : : : : GoalsofthsWork. : : : : : : : : : : : : : : : : : : StructureoftheThess Basc Condtons and Implcatons. : : : : : : : : : : : : ArthmetcOperatonsandUnts.. : : : : : : : : : : : : : : : : : : : : Applcatons 6.. : : : : : : : : : : : Bascarthmetcoperatons 6.. Number representaton : : : : : : : schemes.. Sequental and combnatonal : : : : crcuts.. Synchronous and self-tmed : : : : : : crcuts

3 v Contents Contents v..6 Carry-propagate and carry-save : : adders..7 : : : : : : : : : : : : : : : : : : : : Implcatons. Crcut and Layout Desgn : : : : : : : : Technques.. Layout-based desgn : : : : : : : technques.. : : : : : : : : : Cell-baseddesgntechnques.. : : : : : : : : : : : : : : : : : : : : Implcatons. : : : : : : : : : : : : : : : : : : SubmcronVLSIDesgn.. : : : : : : : : : : : : : Multlevelmetalroutng.. : : : : : : : : : : : : : : : : Interconnectdelay 6.. : : : : : : : : : : : : : : : : : : : : Implcatons 6. Automated Crcut Synthess and : : : Optmzaton 6.. : : : : : : : : : : : : : : : : Hgh-levelsynthess 6.. : : : : : : : : : : : : : : : : Low-levelsynthess 7.. : : : : : : : : : : : : : : : Data-pathsynthess 7.. Optmzaton of combnatonal : : : : crcuts 7.. Hardware descrpton : : : : : : : languages..6 : : : : : : : : : : : : : : : : : : : : Implcatons. Crcut Complexty and Performance : : Modelng.. : : : : : : : : : : : : : : : : : : Areamodelng 9.. : : : : : : : : : : : : : : : : : : Delaymodelng.. : : : : : : : : Powermeasuresandmodelng.. Combned crcut performance : : measures.. : : : : : : : : : : : : : : : : : : : : Implcatons.6 : : : : : : : : : : : : : : : : : : : : : : : : : : Summary Basc Addton Prncples and Structures 7. : : : : : : : : : : : : : : -BtAdders,(m,k)-Counters 7.. : : : : : : : : : : : : Half-Adder,(,)-Counter.. : : : : : : : : : : : : Full-Adder,(,)-Counter 9.. : : : : : : : : : : : : : : : : : : (m,k)-counters. : : : : : : : : : : : : Carry-PropagateAdders(CPA). : : : : : : : : : : : : : : : : Carry-SaveAdders(CSA). : : : : : : : : : : : : : : : : : : Mult-OperandAdders.. : : : : : : : : : : : : : : : : : : : ArrayAdders.. : : : : : : : : : : : : : : : : (m,)-compressors 6.. : : : : : : : : : : : : : : : : : : : : TreeAdders 9.. : : : : : : : : : : : : : : : : : : : : : : Remarks. : : : : : : : : : : : : : : : : : : : : : : PrefxAlgorthms.. : : : : : : : : : : : : : : : : : : Prefxproblems.. : : : : : : : : : : : : : : Seral-prefxalgorthm.. : : : : : : : : : : : : : : Tree-prefxalgorthms.. : : : : : : : : : : : : : Group-prefxalgorthms.. Bnary addton as a prefx : : : : : : problem.6 : : : : : : : : : BascAddtonSpeed-UpTechnques 6.6. Bt-Level or Drect CPA : : : : : : : : Schemes.6. Block-Level or Compound CPA : : Schemes 9.6. : : : : : : : : : : : : CompostonofSchemes 6 AdderArchtectures 67

4 v Contents Contents v. AnthologyofAdderArchtectures : : : : : : : : : : : 67.. Rpple-CarryAdder(RCA) : : : : : : : : : : : 67.. Carry-SkpAdder(CSKA) : : : : : : : : : : : : 6.. Carry-SelectAdder(CSLA) : : : : : : : : : : : 7.. Condtonal-SumAdder(COSA) : : : : : : : : 7.. Carry-IncrementAdder(CIA) : : : : : : : : : 7..6 Parallel-Prefx/ Carry-Lookahead Adders(PPA /CLA) : : : : : : : : : : : : : : : : : : : : : : : :..7 HybrdAdderArchtectures : : : : : : : : : : :. Complexty and Performance Comparsons : : : : : 9.. Adder Archtectures Compared : : : : : : : : 9.. Comparsons Based on Unt-Gate Area and DelayModels : : : : : : : : : : : : : : : : : : : 9.. Comparson Based on Standard-Cell Implementatons : : : : : : : : : : : : : : : : : : : : : 9.. ResultsandDscusson : : : : : : : : : : : : : : 97.. MoreGeneralObservatons : : : : : : : : : :..6 ComparsonDagrams : : : : : : : : : : : : : :. Summary: Optmal Adder Archtectures : : : : : : : SpecalAdders. AdderswthFlagGeneraton : : : : : : : : : : : : : :. AddersforLateInputCarry : : : : : : : : : : : : : : :. Adders wth Relaxed Tmng Constrants : : : : : : : 6. Adders wth Non-Equal Bt Arrval Tmes : : : : : : : : 6. : : : : : : : : : : : : : : : : : : : : : : ModuloAdders.. AddtonModulo (? ) : : : : : : : : : : : : n.. AddtonModulo ( + ) : : : : : : : : : : : : n.6 : : : : : : : : : : : : : : : : : : : : : Dual-SzeAdders 6.7 : : : : : : : : : : : : : RelatedArthmetcOperatons 9.7. : : : : : : : : : : scomplementsubtractors 9.7. : : : : : : : : : Incrementers/Decrementers.7. : : : : : : : : : : : : : : : : : : : Comparators 6 AdderSynthess 6. : : : : : : : : : : : : : : : : : : : : : : : : Introducton 6. : : : : : : : : : : : PrefxGraphsandAdderSynthess 6. Synthess of Fxed Parallel-Prefx : : : : : : Structures 6.. : : : : : : : : : : GeneralSynthessAlgorthm 6.. : : : : : : : : : : : : : : : : Seral-PrefxGraph : : : : : : : : : : SklanskyParallel-PrefxGraph 6.. Brent-Kung Parallel-Prefx : : : : : : : : Graph Level Carry-Increment Parallel-Prefx Graph Level Carry-Increment Parallel-Prefx Graph 6. Synthess of Flexble Parallel-Prefx : : : : : Structures 6.. : : : : : : : : : : : : : : : : : : : : Introducton 6.. Parallel-Prefx Adders : : : : : : : : : Revsted 6.. Optmzaton and Synthess of Prefx Structures 6.. Expermental Results and : : : : : : Dscusson

5 v Contents Contents x 6.. Parallel-Prefx Schedules wth Resource Constrants : : : : : : : : : : : : : : : : : : : : : : : 6. Valdty and Verfcaton of Prefx Graphs : : : : : : : Propertes of the Prefx Operator : : : : : : : : GeneralzedPrefxProblem : : : : : : : : : : : Transformatons of Prefx Graphs : : : : : : : : ValdtyofPrefxGraphs : : : : : : : : : : : : : IrredundancyofPrefxGraphs : : : : : : : : : VerfcatonofPrefxGraphs : : : : : : : : : : : Summary : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : : ComplexCells 7. : : : : : : : : : : : : : : : : : : : PpelnngofAdders 7.6 : : : : : : : : : : : : : : : : : : : : : AddersonFPGAs : : : : : : : : : : : : : Coarse-GranedFPGAs 7.6. : : : : : : : : : : : : : : : Fne-GranedFPGAs Conclusons 9 Bblography 97 Currculum Vtae 7 VLSI Aspects of Adders 7 7. VerfcatonofParallel-PrefxAdders : : : : : : : : : : VerfcatonGoals : : : : : : : : : : : : : : : : : VerfcatonTestBench : : : : : : : : : : : : : : 7 7. Transstor-LevelDesgnofAdders : : : : : : : : : : : : Dfferences between Gate- and Transstor- LevelDesgn : : : : : : : : : : : : : : : : : : : : LogcStyles : : : : : : : : : : : : : : : : : : : : Transstor-Level Arthmetc Crcuts : : : : : : : ExstngCustomAdderCrcuts : : : : : : : : : Proposed Custom Adder Crcut : : : : : : : : LayoutofCustomAdders : : : : : : : : : : : : : : : : 7. LbraryCellsforCell-BasedAdders : : : : : : : : : : : 7.. SmpleCells : : : : : : : : : : : : : : : : : : : :

6 Abstract The addton of two bnary numbers s the fundamental and most often used arthmetc operaton on mcroprocessors, dgtal sgnal processors (DSP), and data-processng applcaton-specfc ntegrated crcuts (ASIC). Therefore, bnary adders are crucal buldng blocks n very large-scale ntegrated (VLSI) crcuts. Ther effcent mplementaton s not trval because a costly carrypropagaton operaton nvolvng all operand bts has to be performed. Many dfferent crcut archtectures for bnary addton have been proposed over the last decades, coverng a wde range of performance characterstcs. Also, ther realzaton at the transstor level for full-custom crcut mplementatons has been addressed ntensvely. However, the sutablty of adder archtectures for cell-based desgn and hardware synthess both prerequstes for the ever ncreasng productvty n ASIC desgn was hardly nvestgated. Based on the varous speed-up schemes for bnary addton, a comprehensve overvew and a qualtatve evaluaton of the dfferent exstng adder archtectures are gven n ths thess. In addton, a new multlevel carryncrement adder archtecture s proposed. It s found that the rpple-carry, the carry-lookahead, and the proposed carry-ncrement adders show the best overall performance characterstcs for cell-based desgn. These three adder archtectures, whch together cover the entre range of possble area vs. delay trade-offs, are comprsed n the more general prefx adder archtecture reported n the lterature. It s shown that ths unversal and flexble prefx adder structure also allows the realzaton of varous customzed adders and of adders fulfllng arbtrary tmng and area constrants. A non-heurstc algorthm for the synthess and optmzaton of prefx adders s proposed. It allows the runtme-effcent generaton of area-optmal adders for gven tmng constrants. x

7 Zusammenfassung De Addton zweer bnärer Zahlen st de grundlegende und am mesten verwendete arthmetsche Operaton n Mkroprozessoren, dgtalen Sgnalprozessoren (DSP) und datenverarbetenden anwendungsspezfschen ntegrerten Schaltungen (ASIC). Deshalb stellen bnäre Adderer krtsche Komponenten n hochntegrerten Schaltungen (VLSI) dar. Deren effzente Realserung st ncht trval, da ene teure carry-propagaton Operaton ausgeführt werden muss. Ene Velzahl verschedener Schaltungsarchtekturen für de bnäre Addton wurden n den letzten Jahrzehnten vorgeschlagen, welche sehr unterschedlche Egenschaften aufwesen. Zudem wurde deren Schaltungsrealserung auf Transstornveau berets engehend behandelt. Anderersets wurde de Egnung von Addererarchtekturen für zellbaserte Entwcklungstechnken und für de automatsche Schaltungssynthese bedes Grundvoraussetzungen für de hohe Produktvtätsstegerung n der ASIC Entwcklung bsher kaum untersucht. Baserend auf den manngfaltgen Beschleungungstechnken für de bnäre Addton wrd n deser Arbet ene umfassende Überscht und en qualtatver Verglech der verschedenen exsterenden Addererarchtekturen gegeben. Zudem wrd ene neue multlevel carry-ncrement Addererarchtektur vorgeschlagen. Es wrd gezegt, dass der rpple-carry, der carry-lookahead und der vorgeschlagene carry-ncrement Adderer de besten Egenschaften für de zellbaserte Schaltungsentwcklung aufwesen. Dese dre Addererarchtekturen, welche zusammen den gesamten Berech möglcher Kompromsse zwschen Schaltungsfläche und Verzögerungszet abdecken, snd n der allgemeneren Prefx-Addererarchtektur enthalten, de n der Lteratur beschreben st. Es wrd gezegt, dass dese unverselle und flexble Prefx-Addererstruktur de Realserung von verschedensten spezalx

8 xv Zusammenfassung serten Adderern mt belebgen Zet- und Flächenanforderungen ermöglcht. En ncht-heurstscher Algorthmus für de Synthese und de Zetoptmerung von Prefx-Adderern wrd vorgeschlagen. Deser erlaubt de recheneffzente Genererung flächenoptmaler Adderer unter gegebenen Anforderungen and de Verzögerungszet. Introducton. Motvaton The core of every mcroprocessor, dgtal sgnal processor (DSP), and dataprocessng applcaton-specfc ntegrated crcut (ASIC) s ts data path. It s often the crucal crcut component f de area, power dsspaton, and especally operaton speed are of concern. At the heart of data-path and addressng unts n turn are arthmetc unts, such as comparators, adders, and multplers. Fnally, the basc operaton found n most arthmetc components s the bnary addton. Besdes of the smple addton of two numbers, adders are also used n more complex operatons lke multplcaton and dvson. But also smpler operatons lke ncrementaton and magntude comparson base on bnary addton. Therefore, bnary addton s the most mportant arthmetc operaton. It s also a very crtcal one f mplemented n hardware because t nvolves an expensve carry-propagaton step, the evaluaton tme of whch s dependent on the operand word length. The effcent mplementaton of the addton operaton n an ntegrated crcut s a key problem n VLSI desgn. Productvty n ASIC desgn s constantly mproved by the use of cellbased desgn technques such as standard cells, gate arrays, and feldprogrammable gate arrays (FPGA) and by low- and hgh-level hardware synthess. Ths asks for adder archtectures whch result n effcent cell-based

9 Introducton. Structure of the Thess crcut realzatons whch can easly be syntheszed. Furthermore, they should provde enough flexblty n order to accommodate custom tmng and area constrants as well as to allow the mplementaton of customzed adders.. RelatedWork Much work has been done and many publcatons have been wrtten on crcut archtectures for bnary addton. Dfferent well-known adder archtectures are wdely used and can be found n any book on computer arthmetc [Kor9, Cav, Spa, Hwa79, Zm97]. Many adder crcut mplementatons at the transstor level are reported n the lterature whch use a varety of dfferent adder archtectures and combnatons thereof [D + 9, G + 9, M + 9, OV9, O + 9, M + 9]. On the other hand, a systematc overvew of the basc addton speedup technques wth ther underlyng concepts and relatonshps can hardly be found. Ths, however, s a prerequste for optmal adder mplementatons and versatle synthess algorthms. Furthermore, optmalty of adder archtectures for cell-based desgns was not nvestgated ntensvely and comprehensve performance comparsons were carred out only margnally [Tya9]. Most work so far has focused on the standard two-operand addton. The effcent realzaton of customzed adders such as adders wth flag generaton, non-unform sgnal arrval tmes [Okl9], fast carry-n processng, modulo [ENK9] and dual-sze adders were not consdered wdely. Fnally, the synthess of adder crcuts was addressed only margnally up to now. Ths s because the generaton of fxed adder archtectures s rather straghtforward and because no effcent synthess algorthms for flexble adder archtectures were known. Exceptons are some publcatons on the computaton of optmal block szes e.g. for carry-skp adders [Tur9] and on heurstc algorthms for the optmzaton of parallel-prefx adders [Fs9, GBB9].. Goals of ths Work As a consequence, the followng goals have been formulated for ths work: Establsh an overvew of the basc addton speed-up schemes, ther characterstcs, and ther relatonshps. Derve all possble adder archtectures from the above speed-up schemes and compare them qualtatvely and quanttatvely wth focus on cellbased crcut mplementaton, sutablty for synthess, and realzaton of customzed adders. Try to unfy the dfferent adder archtectures as much as possble n order to come up wth more generc adder structures. The deal soluton would be a flexble adder archtecture coverng the entre range of possble area-delay trade-offs wth mnor structural changes. Elaborate effcent and versatle synthess algorthms for the best performng adder archtectures found n the above comparsons. The deal soluton would consst of one unversal algorthm for a generc adder archtecture, whch takes automatcally nto account arbtrary tmng and area constrants. Incorporate the realzaton and generaton of customzed adders nto the above adder archtectures and synthess algorthms. Address other mportant VLSI aspects such as crcut verfcaton, layout topologes, and ppelnng for the chosen adder archtectures.. Structure of the Thess As a startng pont, the basc condtons and ther mplcatons are summarzed n Chapter. It s substantated why cell-based combnatonal carry-propagate adders and ther synthess are mportant n VLSI desgn and thus worthwhle to be covered by ths thess. Chapter ntroduces the basc addton prncples and structures. Ths ncludes -bt and mult-operand adders as well as the formulaton of carrypropagaton as a prefx problem and ts basc speed-up prncples. The dfferent exstng adder archtectures are descrbed n Chapter. In addton, a new carry-ncrement adder archtecture s ntroduced. Qualtatve and quanttatve comparsons are carred out and documented on the bass of a unt-gate model and of standard-cell mplementatons. It s shown that the best-performng adders are all prefx adders.

10 Introducton The mplementaton of specal adders usng the prefx adder archtecture s treated n Chapter. In Chapter 6, synthess algorthms are gven for the best-performng adder archtectures. Also, an effcent non-heurstc algorthm s presented for the synthess and optmzaton of arbtrary prefx graphs used n parallel-prefx adders. An algorthm for the verfcaton of prefx graphs s also elaborated. Varous mportant VLSI aspects relatng to the desgn of adders are summarzed n Chapter 7. These nclude verfcaton, transstor-level desgn, and layout of adder crcuts, lbrary aspects for cell-based adders, ppelnng of adders, and the realzaton of adder crcuts on FPGAs. Fnally, the man results of the thess are summarzed and conclusons are drawn n Chapter. Basc Condtons and Implcatons Ths chapter formulates the motvaton and goals as well as the basc condtons for the work presented n ths thess by answerng the followng questons: Why s the effcent mplementaton of combnatonal carry-propagate adders mportant? What wll be the key layout desgn technologes n the future, and why do cell-based desgn technques such as standard cells get more and more mportance? How does submcron VLSI challenge the desgn of effcent combnatonal cell-based crcuts? What s the current status of hghand low-level hardware synthess wth respect to arthmetc operatons and adders n partcular? Why s hardware synthess ncludng the synthess of effcent arthmetc unts becomng a key ssue n VLSI desgn? How can area, delay, and power measures of combnatonal crcuts be estmated early n the desgn cycle? How can the performance and complexty of adder crcuts be modeled by takng nto account archtectural, crcut, layout, and technology aspects? Although some of the followng aspects can be stated for VLSI desgn n general, the emphass wll be on the desgn of arthmetc crcuts.. Arthmetc Operatons and Unts The tasks of a VLSI chp whether as applcaton-specfc ntegrated crcut (ASIC) or as general-purpose mcroprocessor are the processng of data and

11 6 Basc Condtons and Implcatons. Arthmetc Operatons and Unts 7 the control of nternal or external system components. Ths s typcally done by algorthms whch base on logc and arthmetc operatons on data tems. based on operaton fxed-pont floatng-pont.. Applcatons related operaton <<, >> Applcatons of arthmetc operatons n ntegrated crcuts are manfold. Mcroprocessors and dgtal sgnal processors (DSPs) typcally contan adders and multplers n ther data path, formng dedcated nteger and/or floatng-pont unts and multply-accumulate (MAC) structures. Specal crcut unts for fast dvson and square-root operatons are sometmes ncluded as well. Adders, ncrementers/decrementers, and comparators are arthmetc unts often used for address calculaton and flag generaton purposes n controllers. Applcaton-specfc ICs use arthmetc unts for the same purposes. Dependng on ther applcaton, they even may requre dedcated crcut components for specal arthmetc operators, such as for fnte feld arthmetc used n cryptography, error correcton codng, and sgnal processng... Basc arthmetc operatons =, < +, +/ sqrt (x) exp (x) log (x) arthops.eps 7 mm trg (x) +, hyp (x) complexty +, (same as on the left for floatng-pont numbers) The arthmetc operatons that can be computed n electronc equpment are (ordered by ncreasng complexty, see Fg..) [Zm97]: Fgure.: Dependences of arthmetc operatons. shft / extenson operatons equalty and magntude comparson ncrementaton / decrementaton complementaton (negaton) addton / subtracton multplcaton dvson square root exponentaton logarthmc functons trgonometrc and nverse trgonometrc functons hyperbolc functons For trgonometrc and logarthmc functons as well as exponentaton, varous teratve algorthms exst whch make use of smpler arthmetc operatons. Multplcaton, dvson and square root extracton can be performed usng seral or parallel methods. In both methods, the computaton s reduced to a sequence of condtonal addtons/subtractons and shft operatons. Exstng speed-up technques try to reduce the number of requred addton/subtracton operatons and to mprove ther speed. Subtracton corresponds to the addton of a negated operand. The addton of two n-bt numbers tself can be regarded as an elementary operaton. In fact, decomposton nto a seres of ncrements and shfts s possble but of no relevance. The algorthm for complementaton (negaton)

12 Basc Condtons and Implcatons. Arthmetc Operatons and Unts 9 of a number depends on the chosen number representaton, but s usually accomplshed by bt nverson and ncrementaton. Incrementaton and decrementaton are smplfed addtons wth one nput operand beng constantly or -. Equalty and magntude comparson operatons can also be regarded as smplfed addtons, where only some the respectve addton flags, but no sum bts are used as outputs. Fnally, shft by a constant number of bts and extenson operatons, as used n some of the above more complex arthmetc functons, can be accomplshed by approprate wrng and thus requre no addtonal hardware. Ths short overvew shows that the addton s the key arthmetc operaton, whch most other operatons are based on. Its mplementaton n hardware s therefore crucal for the effcent realzaton of almost every arthmetc unt n VLSI. Ths s n terms of crcut sze, computaton delay, and power consumpton... Number representaton schemes The representaton of numbers and the hardware mplementaton of arthmetc unts are strongly dependent on each other. On one hand, each number representaton requres dedcated computaton algorthms. On the other hand, effcent crcut realzatons may ask for adequate number representatons. Only fxed-pont number representatons are consdered n ths thess. Ths s justfed snce arthmetc operatons on floatng-pont numbers are accomplshed by applyng varous fxed-pont operatons on mantssa and exponent. Moreover, fxed-pont numbers are reduced to ntegers heren, snce every nteger can be consdered as a fracton multpled by a constant factor. Bnary number systems The radx- or bnary number system s the most wdely used number representaton, whch s due to ts mplementaton effcency and smplcty n dgtal crcut desgn. An n-bt number s represented as A = (a n? ; a n? ; : : : ; a ; a ), where a f; g. The followng representatons for unsgned and sgned fxed-pont numbers are used: Unsgned numbers are used for the representaton of postve ntegers (.e., natural numbers). Value: A = P n? = a, Range: [; n? ]. Two s complement s the standard representaton of sgned numbers. Value: A =?a n? n? + P n? = a, Range: [? n? ; n?? ], Complement:?A = n? A = A +, where A = (a n? ; a n? ; : : : ; a ; a ), Sgn: a n?, Propertes: asymmetrc range (.e., n? negatve numbers, ( n?? ) postve numbers), compatble wth unsgned numbers n most arthmetc operatons. One s complement s a smlar representaton as the two s complement. Value: A =?a n? ( n? + ) + P n? = a, Range: [?( n?? ); n?? ], Complement:?A = n? A? = A, Sgn: a n?, Propertes: double representaton of zero, symmetrc range, modulo ( n? ) number system. Sgn magntude s an alternatve representaton of sgned numbers. Here, the bts a n? ; a n? ; : : : ; a are the true magntude. Value: A =?a n? P n? = a, Range: [?( n?? ); n?? ], Complement:?A = (a n? ; a n? ; : : : ; a ; a ), Sgn: a n?, Propertes: double representaton of zero, symmetrc range. Due to ther advantages and wde-spread use, the unsgned and two s complement sgned number representatons wll be consdered throughout the thess.

13 Basc Condtons and Implcatons. Arthmetc Operatons and Unts Redundant number systems Some redundant number systems exst, whch e.g. allow for speedng-up arthmetc operatons [Kor9]. Carry-save s the redundant representaton of the result when addng up three numbers wthout carry propagaton (.e., the ndvdual carry bts are saved for later carry propagaton). A carry-save number conssts of two numbers, one contanng all carry bts and the other all sum bts. Delayed-carry or half-adder form [LJ96] s the correspondng representaton when addng up only two numbers. Sgned-dgt s a redundant number system, whch makes use of the dgt set f?; ; g. The carry-save number representaton plays an mportant role n multoperand adders (see Sec..). Otherwse, redundant number systems are of no concern n carry-propagate adders, snce they are used precsely to avod carry propagaton... Sequental and combnatonal crcuts Many arthmetc operatons can be realzed as combnatonal or sequental crcuts. Bt-seral or ppelned adders are examples for sequental adder crcuts. However, snce adder archtectures deal wth speedng up carrypropagaton logc, only combnatonal adder mplementatons are covered n ths thess... Synchronous and self-tmed crcuts The realzaton of a synchronous crcut can be done n a synchronous or a self-tmed asynchronous fashon, whch also nfluences the mplementaton of the combnatonal crcuts. In partcular, self-tmed combnatonal crcuts have to provde completon sgnals, whch are not trval to generate. As a matter of fact, synchronous crcut technques are standard n the VLSI desgn communty. However, adders are very appealng for self-tmed realzaton snce they have a short average carry-propagaton length (.e., O(log n)) [GO96]. Because the smplest adder archtecture namely the rpple-carry adder takes most advantage of self-tmed mplementaton, a further study of adder archtectures for self-tmed crcut realzaton makes no sense. Resdue number systems Resdue number system (RNS) do not use a fxed radx for all dgts, but are constructed from a set of dfferent resdues, so that each dgt has a dfferent radx [Kor9]. Arthmetc operatons n RNS can be computed on each dgt ndependently and n parallel. The resultng speed-up s consderable, but converson from and to conventonal number systems s very expensve. The ndvdual operatons performed on each sngle dgt are done usng normal or modular nteger arthmetc, and agan manly addtons. The nvestgatons on effcent nteger addton n ths thess thus also become mportant for RNS systems. In redundant number systems, the number of representable dgts s larger than the radx, thus allowng for multple representatons of the same number...6 Carry-propagate and carry-save adders Addton s a prefx problem (see Sec..), whch means that each result bt s dependent on all nput bts of equal or lower magntude. Propagaton of a carry sgnal from each bt poston to all hgher bt postons s necessary. Carrypropagate adders perform ths operaton mmedately. The requred carry propagaton from the least to the most sgnfcant bt results n a consderable crcut delay, whch s a functon of the word length of the nput operands. The most effcent way to speed-up addton s to avod carry propagaton, thus savng the carres for later processng. Ths allows the addton of two or more numbers n a very short tme, but yelds results n a redundant (carry-save) number representaton. Carry-save adders as the most commonly used redundant arthmetc

14 Basc Condtons and Implcatons. Crcut and Layout Desgn Technques adders play an mportant role n the effcent mplementaton of multoperand addton crcuts. They are very fast due to the absence of any carry-propagaton paths, ther structure s very smple, but the potental for further optmzaton s mnmal. The same holds for sgned-dgt adders, whch use a slghtly dfferent redundant number representaton. The addton results, however, usually have to be converted nto an rredundant nteger representaton n order to be processed further. Ths operaton s done usng a carry-propagate adder...7 Implcatons As we have seen so far, the combnatonal, bnary carry-propagate adder s one of the most often used and most crucal buldng block n dgtal VLSI desgn. Varous well-known methods exst for speedng-up carry-propagaton n adders, offerng very dfferent performance characterstcs, advantages, and dsadvantages. Some lack of understandng of the basc concepts and relatonshps often lead to suboptmal adder mplementatons. One goal of ths thess s the systematc nvestgaton and performance comparson of all exstng adder archtectures as well as ther optmzaton wth respect to cellbased desgn technologes.. Crcut and Layout Desgn Technques Table.: IC classfcaton scheme based on fabrcaton depth and desgn level. Fabrcaton Programmng Sem-custom Full-custom depth only fabrcaton fabrcaton Desgn Cell-based, as obtaned from Hand level schematc entry and/or synthess layout Type of Programm- Gate-array or Standard cell IC Fullntegrated able IC sea-of-gates IC (possbly also custom IC crcut (PLD, FPGA, wth macrocells CPLD, etc.) and megacells) and layouts are often collected n lbrares together wth automatc generators. Mega-cells are full-custom cells for unversal functons whch need no parameterzaton, e.g., mcroprocessor cores and perpherals. Macro-cells are used for large crcut components wth regular structure and need for wordlength parameterzaton, e.g., multplers, ROMs, and RAMs. Data paths are usually realzed n a bt-slced layout style, whch allows parameterzaton of word length (frst dmenson) and concatenaton of arbtrary data-path elements (second dmenson) for logc, arthmetc, and storage functons. Snce adders are too small to be mplemented as macro-cells, they are usually realzed as data-path elements. IC fabrcaton technologes can be classfed nto full-custom, sem-custom, and programmable ICs, as summarzed n Table. (taken from [Kae97]). Further dstnctons are made wth respect to crcut desgn technques and layout desgn technques, whch are strongly related... Layout-based desgn technques In layout-based desgn technques, dedcated full-custom layout s drawn manually for crcuts desgned at the transstor-level. The ntal desgn effort s very hgh, but maxmum crcut performance and layout effcency s acheved. Full-custom cells are entrely desgned by hand for dedcated hgh-performance unts, e.g., arthmetc unts. The tled-layout technque can be used to smplfy, automate, and parameterze the layout task. For reuse purposes, the crcuts.. Cell-based desgn technques At a hgher level of abstracton, arbtrary crcuts can be composed from elementary logc gates and storage elements contaned n a lbrary of pre-desgned cells. The layout s automatcally composed from correspondng layout cells usng dedcated layout strateges, dependng on the used IC technology. Cellbased desgn technques are used n standard-cell, gate-array, sea-of-gates, and feld-programmable gate-array (FPGA) technologes. The desgn of logc crcuts does not dffer consderably among the dfferent cell-based IC technologes. Crcuts are obtaned from ether schematc entry, behavoral synthess, or crcut generators (.e., structural synthess). Due to the requred generc propertes of the cells, more conventonal logc styles have to be used for ther crcut mplementaton.

15 Basc Condtons and Implcatons. Submcron VLSI Desgn The advantages of cell-based desgn technques le n ther unversal usage, automated synthess and layout generaton for arbtrary crcuts, portablty between tools and lbrares, hgh desgn productvty, hgh relablty, and hgh flexblty n floorplannng. Ths comes at the prce of lower crcut performance wth respect to speed and area. Cell-based desgn technques are manly used for the mplementaton of random logc (e.g., controllers) and custom crcuts for whch no approprate lbrary components are avalable and custom mplementaton would be too costly. Cell-based desgn technques are wdely used n the ASIC desgn communty. Standard cells Standard cells represent the hghest performance cell-based technology. The layout of the cells s full-custom, whch mandates for full-custom fabrcaton of the wavers. Ths n turn enables the combnaton of standard cells wth customlayout components on the same de. For layout generaton, the standard cells are placed n rows and connected through ntermedate routng channels. Wth the ncreasng number of routng layers and over-the-cell routng capabltes n modern process technologes, the layout densty of standard cells gets close to the densty obtaned from full-custom layout. The remanng drawback s the restrcted use of hgh-performance (transstor-level) crcut technques. desgn of arbtrary crcuts. Turnaround tmes are very fast makng FPGAs the deal soluton for rapd prototypng. On the other hand, low crcut performance, lmted crcut complexty, and hgh de costs severely lmt ther area of applcaton... Implcatons In the feld of hgh-performance IC desgn, where layout-based and transstorlevel desgn technques are appled, much research effort has been nvested n the realzaton of effcent adder crcuts, and many dfferent mplementatons have been proposed. Effcent adder mplementatons for cell-based desgn, however, have hardly been addressed so far. Here, the ssues to be nvestgated are the technology mappng, cell lbrary propertes, routng, synthess, and portablty aspects. The wdespread use of cell-based desgn technques justfes a closer nspecton of the effcent crcut mplementaton of addton and related arthmetc operatons.. Submcron VLSI Desgn Gate-arrays and sea-of-gates On gate-arrays and sea-of-gates, preprocessed wafers wth unconnected crcut elements are used. Thus, only metalzaton used for the nterconnect s customzed, resultng n lower producton costs and faster turnaround tmes. Crcut performance and layout flexblty s lower than for standard cells, whch n partcular decreases mplementaton effcency of regular structures such as macro-cells. Wth evolvng process technologes, feature szes of.m,.m, and less become standard. These submcron technologes offer smaller and faster crcut structures at lower supply voltages, resultng n consderably faster and more complex ICs wth a lower power dsspaton per gate. Changng physcal characterstcs, however, strongly nfluence crcut desgn. Increasng gate denstes and clockng frequences lead to hgher power denstes, makng low power an mportant ssue n order to be able to dsspate the hgh energy of large chps. FPGAs Feld-programmable gate-arrays (FPGA) are electrcally programmable generc ICs. They are organzed as an array of logc blocks and routng channels, and the confguraton s stored n a statc memory or programmed e.g. usng antfuses. Agan, a lbrary of logc cells and macros allows flexble and effcent.. Multlevel metal routng As processes wth three and more metalzaton levels become avalable, routng denstes ncrease massvely. Over-the-cell routng elmnates the drawback of area-consumng routng channels n cell-based technologes, yeldng layout denstes comparable to custom-layout. Ths also results n a larger amount

16 6 Basc Condtons and Implcatons. Automated Crcut Synthess and Optmzaton 7 of local nterconnects (crcut localty), hgher layout flexblty, and more effcent automated routers. Especally standard-cell technologes beneft from these advantages, provdng both hgh desgn productvty as well as good crcut and layout performance... Interconnectdelay The delay of nterconnectons becomes domnant over swtchng delays n submcron VLSI. Ths s because RC delays ncrease (hgher wre resstances at roughly constant capactances) and wre lengths typcally scale wth chp sze but not wth feature sze. Therefore, crcut connectvty, localty, and fan-out are becomng mportant performance optmzaton crtera... Implcatons Cell-based desgn technques take advantage from emergng submcron VLSI technologes, partly approachng denstes and performance of full-custom technques. Interconnect aspects have to be accounted for, also wth respect to the optmalty of crcut archtectures.. Automated Crcut Synthess and Optmzaton Crcut synthess denotes the automated generaton of logc networks from behavoral descrptons at an arbtrary level. Synthess s becomng a key ssue n VLSI desgn for many reasons. Increasng crcut complextes, shorter development tmes, as well as effcent and flexble usage of cell and component lbrares can only be handled wth the ad of powerful desgn automaton tools. Arthmetc synthess addresses the effcent mappng of arthmetc functons onto exstng arthmetc components and logc gates... Hgh-levelsynthess Hgh-level synthess, or behavoral/archtectural synthess, allows the translaton of algorthmc or behavoral descrptons of hgh abstracton level (e.g., by way of data dependency graphs) down to RTL (regster-transfer level) representaton, whch can be processed further by low-level synthess tools. The nvolved archtectural synthess, ncludng resource allocaton, resource bndng, and schedulng tasks, s far from beng trval and s currently researched ntensvely. Hgh-level arthmetc synthess makes use of arthmetc transformatons n order to optmze hardware usage under gven performance crtera. Thereby, arthmetc lbrary components are regarded as the resources for mplementng the basc arthmetc operatons... Low-levelsynthess Low-level synthess, or logc synthess, translates an RTL specfcaton nto a generc logc network. For random logc, synthess s acheved by establshng the logc equatons for all outputs and mplementng them n a logc network... Data-pathsynthess Effcent arthmetc crcuts contan very specfc structures of large logc depth and hgh factorzaton degree. Ther drect synthess from logc equatons s not feasble. Therefore, parameterzed netlst generators usng dedcated algorthms are used nstead. Most synthess tools nclude generators for the basc arthmetc functons, such as comparators, ncrementers, adders, and multplers. For other mportant operatons (e.g., squarng, dvson) and specalzed functons (e.g., addton wth flag generaton, multplcaton wthout fnal addton) usually no generators are provded and thus synthess of effcent crcutry s not avalable. Also, the performance of the commonly used crcut archtectures vares consderably, whch often leads to suboptmal cell-based crcut mplementatons... Optmzaton of combnatonal crcuts The optmzaton of combnatonal crcuts connotes the automated mnmzaton of a logc netlst wth respect to area, delay, and power dsspaton measures of the resultng crcut, and the technology mappng (.e., mappng of the logc network onto the set of logc cells provded by the used technology/lbrary). The appled algorthms are very powerful for optmzaton of random logc by performng steps lke flattenng, logc mnmzaton, tmng-drven factorzaton, and technology mappng. However, the potental for optmzaton

17 Basc Condtons and Implcatons. Crcut Complexty and Performance Modelng 9 s rather lmted for networks wth large logc depth and hgh factorzaton degree, especally arthmetc crcuts. There, only local logc mnmzaton s possble, leavng the global crcut archtecture bascally unchanged. Thus, the realzaton of well-performng arthmetc crcuts reles more on effcent data-path synthess than on smple logc optmzaton... Hardware descrpton languages Hardware descrpton languages allow the specfcaton of hardware at dfferent levels of abstracton, servng as entry ponts to hardware synthess. VHDL, as one of the most wdely used and most powerful languages, enables the descrpton of crcuts at the behavoral and structural level. In partcular, parameterzed netlst generators can be wrtten n structural VHDL. Synthess of arthmetc unts s ntated by usng the standard arthmetc operator symbols n the VHDL code, for whch the correspondng bult-n netlst generators are called by the synthess tool. Bascally, the advantages of VHDL over schematc entry le n the possblty of behavoral hardware descrpton, the parameterzablty of crcuts, and portablty of code thanks to language standardzaton...6 Implcatons Due to ther manyfold occurrences and flexble usage, arthmetc unts form an ntegral part n automated hardware synthess for hgh-productvty VLSI desgn. The used crcut archtectures must be hghly flexble and easly parameterzable and must result n smple netlst generators and effcent crcut mplementatons. Thus, ths thess also focuses on algorthms for the synthess of adder crcuts and nvestgates the sutablty of varous adder archtectures wth respect to netlst synthess and optmzaton.. Crcut Complexty and Performance Modelng One mportant aspect n desgn automaton s the complexty and performance estmaton of a crcut early n the desgn cycle,.e., pror to the tme-consumng logc synthess and physcal layout phases. At a hgher desgn level, ths s acheved by usng characterzaton nformaton of the hgh-level components to be used and by complexty estmaton of the nterconnect. At gate level, however, estmaton s more dffcult and less accurate because crcut sze and performance strongly depend on the gate-level synthess results and on the physcal cell arrangement and routng. For a rough prelmnary characterzaton of adder archtectures, we are nterested n smple complexty and performance models for gate-level crcuts. Gven a crcut specfed by logc formulae or a generc netlst (.e., a netlst bult from basc logc gates), we need estmatons of the expected area, speed, and power dsspaton for a compled cell-based crcut as a functon of the operand word length... Areamodelng Slcon area on a VLSI chp s taken up by the actve crcut elements and ther nterconnectons. In cell-based desgn technques, the followng crtera for area modelng can be formulated: Total crcut complexty (GE total ) can be measured by the number of gate equvalents GE ( -nput NAND-gate MOSFETs). Crcut area (A crcut ) s occuped by logc cells and nter-cell wrng. In technologes wth three and more metal layers, over-the-cell routng capabltes allow the overlap of cell and wrng areas, as opposed to -metal technologes. Ths means that most of the cell area can also be used for wrng, resultng n very low routng area (A factors. = crcut cells + A wrng ) A Total cell area (A cells ) s roughly proportonal to the number of transstors or gate (GE equvalents total ) contaned n a crcut. Ths number s nfluenced by technology mappng, but not by physcal layout. Thus, cell area can be roughly estmated from a generc crcut descrpton (e.g. logc equatons or netlst wth smple gates) and can be precsely determned from a syntheszed (A netlst. / GE total cells ) Wrng area (A wrng ) s proportonal to the total wre length. The exact wre lengths, however, are not known pror to physcal layout. wrng / L total ) (A

18 Basc Condtons and Implcatons. Crcut Complexty and Performance Modelng Total wre length (L total ) can be estmated from the number of nodes and the average wre length of a node [Feu, KP9] or, more accurate, from the sum of cell fan-out and the average wre length of cell-tocell connectons (.e. accounts for the longer wre length of nodes wth hgher fan-out). The wre lengths also depend on crcut sze, crcut connectvty (.e., localty of connectons), and layout topology, whch are not known pror to crcut parttonng and physcal layout [RK9]. total / F O total ) (L Cell fan-out (F O) s the number of cell nputs a cell output s drvng. Fan-n s the number of nputs to a cell [WE9], whch for many combnatonal gates s proportonal to the sze of the cell. Snce the sum of cell (F O fan-out total ) of a crcut s equvalent to the sum of cell fan-n, t s also proportonal to crcut (F O sze. / GE total total ) Therefore, n a frst approxmaton, cell area as well as wrng area are proportonal to the number of gate equvalents. More accurate area estmatons before performng actual technology mappng and crcut parttonng are hardly possble. For crcut comparson purposes, the proportonalty factor s of no (A concern. / GE total / F O total crcut ) Our area estmaton model we are nterested n must be smple to compute whle beng as accurate as possble, and t should antcpate from logc equatons or generc netlsts (.e. netlsts composed of smple logc gates) alone. By consderng the above observatons, possble canddates are: Unt-gate area model Ths s the smplest and most abstract crcut area model, whch s often used n the lterature [Tya9]. A unt gate s a basc, monotonc -nput gate (or logc operaton, f logc equatons are concerned), such as AND, OR, NAND, and NOR. Basc, non-monotonc -nput gates lke XOR and XNOR are counted as two unt gates, reflectng ther hgher crcut complextes. Complex gates as well as mult-nput basc gates are bult from -nput basc gates and ther gate count equals the sum of gate counts of the composng cells. Fan-n area model In the fan-n model, the sze of - and mult-nput basc cells s measured by countng the number of nputs (.e., fan-n). Complex cells are agan composed of basc cells wth ther fan-n numbers summed up, whle the XOR/XNOR-gates are treated ndvdually. The obtaned numbers bascally dffer from the unt-gate numbers only by an offset of (e.g., the AND-gate counts as one unt gate but has a fan-n of two). Other area models The two prevous models do not account for transstorlevel optmzaton possbltes n complex gates, e.g., n multplexers and full-adders. More accurate area models need ndvdual gate count numbers for such complex gates. However, some degree of abstracton s sacrfced and applcaton on arbtrary logc equatons s not possble anymore. The same holds true for models whch take wrng aspects nto consderaton. One example of a more accurate area model s the gate-equvalents model (GE) mentoned above, whch bases on gate transstor counts and therefore s only applcable after synthess and technology mappng. Inverters and buffers are not accounted for n the above area models, whch makes sense for pre-synthess crcut descrptons. Note that the bggest dfferences n bufferng costs are found between low fan-out and hgh fan-out crcuts. Wth respect to area occupaton however, these effects are partly compensated because hgh fan-out crcuts need addtonal bufferng whle low fan-out crcuts usually have more wrng. Investgatons showed that the unt-gate model approach for the area estmaton of complex gates, such as multplexers and full-adders, does not ntroduce more naccuraces than e.g. the neglecton of crcut connectvty for wrng area estmaton. Wth the XOR/XNOR beng treated separately, the unt-gate model yelds acceptable accuracy at the gven abstracton level. Also, t perfectly reflects the structure of logc equatons by modelng the basc logc operators ndvdually and by regardng complex logc functons as composed from basc ones. Investgatons showed comparable performance for the fan-n and the unt-gate models due to ther smlarty. After all, the unt-gate model s very commonly used n the lterature. Therefore, t s used n ths work for area estmatons and comparsons from logc crcut specfcatons. Comparson results of placed and routed standard-cell solutons wll follow n Secton.... Delaymodelng Propagaton delay n a crcut s determned by the cell and nterconnecton delays on the crtcal path (.e. longest sgnal propagaton path n a combna-

19 Basc Condtons and Implcatons. Crcut Complexty and Performance Modelng tonal crcut). As opposed to area estmaton, not average and total numbers are of nterest, but ndvdual cell and node values are relevant for path delays. Crtcal path evaluaton s done by statc tmng analyss whch nvolves graph-based search algorthms. Of course, tmngs are also dependent on temperature, voltage, and process parameters whch, however, are not of concern for our comparson purposes. Maxmum delay (t crt path ) of a crcut s equal to the sum of cell nertal delays, cell output ramp delays, and wre delays on the crtcal (t path. path = P crt path ((t cell + t ramp ) + P crt path t crt wre) Cell delay (t cell ) depends on the transstor-level crcut mplementaton and the complexty of a cell. All smple gates have comparable delays. Complex gates usually contan tree-lke crcut and transstor arrangements, resultng n logarthmc delay-to-area dependences. cell / log (A cell )) (t Ramp delay (t ramp ) s the tme t takes for a cell output to drve the attached capactve load, whch s made up of nterconnect and cell nput loads. The ramp delay depends lnearly on the capactve load attached, whch n turn depends lnearly on the fan-out of the cell. ramp / F O cell ) (t Wre delay or nterconnecton delay (t wre ) s the RC-delay of a wre, whch depends on the wre length. RC-delays, however, are neglgble compared to cell and ramp delays for small crcuts such as the adders nvestgated n ths (t work. = wre ). Thus, a rough delay estmaton s possble by consderng szes and, wth a smaller weghtng factor, fan-out of the cells on the crtcal path. (log path / P crt path (A cell) + kf O cell crt )) (t Possble delay estmaton models are: Unt-gate delay model The unt-gate delay model s smlar to the unt-gate area model. Agan, the basc -nput gates (AND, OR, NAND, NOR) count as one gate delay wth the excepton of the XOR/XNOR-gates whch count as two gate delays [Tya9]. Complex cells are composed of basc cells usng the fastest possble arrangement (.e., tree structures wherever possble) wth the total gate delay determned accordngly. Fan-n delay model As for area modelng, fan-n numbers can be taken nstead of unt-gate numbers. Agan, no advantages over the unt-gate model are observed. Fan-out delay model The fan-out delay model bases on the unt-gate model but ncorporates fan-out numbers, thus accountng for gate fan-out numbers and nterconnecton delays [WT9]. Indvdual fan-out numbers can be obtaned from a generc crcut descrpton. A proportonalty factor has to be determned for approprate weghtng of fan-out wth respect to unt-gate delay numbers. Other delay models Varous delay models exst at other abstracton levels. At the transstor level, transstors can be modeled to contrbute one unt delay each (-model [CSTO9]). At a hgher level, complex gates lke full-adders and multplexers can agan be modeled separately for hgher accuracy [Kan9, CSTO9]. The mpact of large fan-out on crcut delay s hgher than on area requrements. Ths s because hgh fan-out nodes lead to long wres and hgh capactve loads and requre addtonal bufferng, resultng n larger delays. Therefore, the fan-out delay model s more accurate than the unt-gate model. However, due to the much smpler calculaton of the unt-gate delay model and ts wdespread use, as well as for compatblty reasons wth the chosen unt-gate area model, ths model wll be used for the crcut comparsons n ths work. As already mentoned, delay calculaton for a crcut requres statc tmng analyss, whch corresponds to the search for the longest path n a weghted drected acyclc graph. In our case, false path detecton [MB9] s not of mportance snce false paths do not occur n adder crcuts wth one excepton, whch wll be dscussed later... Power measures and modelng An ncreasngly mportant performance parameter for VLSI crcuts s power dsspaton. Peak power s a problem wth respect to crcut relablty (e.g. voltage drop on power buses, ground bounce) whch, however, can be dealt wth by careful desgn. On the other hand, average power dsspaton s A false path s a sgnal path n a combnatonal crcut whch cannot be senstzed.

Conditional Speculative Decimal Addition*

Conditional Speculative Decimal Addition* Condtonal Speculatve Decmal Addton Alvaro Vazquez and Elsardo Antelo Dep. of Electronc and Computer Engneerng Unv. of Santago de Compostela, Span Ths work was supported n part by Xunta de Galca under grant

More information

RADIX-10 PARALLEL DECIMAL MULTIPLIER

RADIX-10 PARALLEL DECIMAL MULTIPLIER RADIX-10 PARALLEL DECIMAL MULTIPLIER 1 MRUNALINI E. INGLE & 2 TEJASWINI PANSE 1&2 Electroncs Engneerng, Yeshwantrao Chavan College of Engneerng, Nagpur, Inda E-mal : mrunalngle@gmal.com, tejaswn.deshmukh@gmal.com

More information

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach Data Representaton n Dgtal Desgn, a Sngle Converson Equaton and a Formal Languages Approach Hassan Farhat Unversty of Nebraska at Omaha Abstract- In the study of data representaton n dgtal desgn and computer

More information

CHAPTER 4 PARALLEL PREFIX ADDER

CHAPTER 4 PARALLEL PREFIX ADDER 93 CHAPTER 4 PARALLEL PREFIX ADDER 4.1 INTRODUCTION VLSI Integer adders fnd applcatons n Arthmetc and Logc Unts (ALUs), mcroprocessors and memory addressng unts. Speed of the adder often decdes the mnmum

More information

Lecture 3: Computer Arithmetic: Multiplication and Division

Lecture 3: Computer Arithmetic: Multiplication and Division 8-447 Lecture 3: Computer Arthmetc: Multplcaton and Dvson James C. Hoe Dept of ECE, CMU January 26, 29 S 9 L3- Announcements: Handout survey due Lab partner?? Read P&H Ch 3 Read IEEE 754-985 Handouts:

More information

Outline. Digital Systems. C.2: Gates, Truth Tables and Logic Equations. Truth Tables. Logic Gates 9/8/2011

Outline. Digital Systems. C.2: Gates, Truth Tables and Logic Equations. Truth Tables. Logic Gates 9/8/2011 9/8/2 2 Outlne Appendx C: The Bascs of Logc Desgn TDT4255 Computer Desgn Case Study: TDT4255 Communcaton Module Lecture 2 Magnus Jahre 3 4 Dgtal Systems C.2: Gates, Truth Tables and Logc Equatons All sgnals

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Petri Net Based Software Dependability Engineering

Petri Net Based Software Dependability Engineering Proc. RELECTRONIC 95, Budapest, pp. 181-186; October 1995 Petr Net Based Software Dependablty Engneerng Monka Hener Brandenburg Unversty of Technology Cottbus Computer Scence Insttute Postbox 101344 D-03013

More information

Explicit Formulas and Efficient Algorithm for Moment Computation of Coupled RC Trees with Lumped and Distributed Elements

Explicit Formulas and Efficient Algorithm for Moment Computation of Coupled RC Trees with Lumped and Distributed Elements Explct Formulas and Effcent Algorthm for Moment Computaton of Coupled RC Trees wth Lumped and Dstrbuted Elements Qngan Yu and Ernest S.Kuh Electroncs Research Lab. Unv. of Calforna at Berkeley Berkeley

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Area Efficient Self Timed Adders For Low Power Applications in VLSI

Area Efficient Self Timed Adders For Low Power Applications in VLSI ISSN(Onlne): 2319-8753 ISSN (Prnt) :2347-6710 Internatonal Journal of Innovatve Research n Scence, Engneerng and Technology (An ISO 3297: 2007 Certfed Organzaton) Area Effcent Self Tmed Adders For Low

More information

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Newton-Raphson division module via truncated multipliers

Newton-Raphson division module via truncated multipliers Newton-Raphson dvson module va truncated multplers Alexandar Tzakov Department of Electrcal and Computer Engneerng Illnos Insttute of Technology Chcago,IL 60616, USA Abstract Reducton n area and power

More information

Mallathahally, Bangalore, India 1 2

Mallathahally, Bangalore, India 1 2 7 IMPLEMENTATION OF HIGH PERFORMANCE BINARY SQUARER PRADEEP M C, RAMESH S, Department of Electroncs and Communcaton Engneerng, Dr. Ambedkar Insttute of Technology, Mallathahally, Bangalore, Inda pradeepmc@gmal.com,

More information

CPE 628 Chapter 2 Design for Testability. Dr. Rhonda Kay Gaede UAH. UAH Chapter Introduction

CPE 628 Chapter 2 Design for Testability. Dr. Rhonda Kay Gaede UAH. UAH Chapter Introduction Chapter 2 Desgn for Testablty Dr Rhonda Kay Gaede UAH 2 Introducton Dffcultes n and the states of sequental crcuts led to provdng drect access for storage elements, whereby selected storage elements are

More information

FPGA IMPLEMENTATION OF RADIX-10 PARALLEL DECIMAL MULTIPLIER

FPGA IMPLEMENTATION OF RADIX-10 PARALLEL DECIMAL MULTIPLIER FPGA IMPLEMENTATION OF RADIX-10 PARALLEL DECIMAL MULTIPLIER A Dssertaton Submtted In Partal Fulflment of the Requred for the Degree of MASTER OF TECHNOLOGY In VLSI Desgn Submtted By GEETA Roll no. 601361009

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Verification by testing

Verification by testing Real-Tme Systems Specfcaton Implementaton System models Executon-tme analyss Verfcaton Verfcaton by testng Dad? How do they know how much weght a brdge can handle? They drve bgger and bgger trucks over

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Floating-Point Division Algorithms for an x86 Microprocessor with a Rectangular Multiplier

Floating-Point Division Algorithms for an x86 Microprocessor with a Rectangular Multiplier Floatng-Pont Dvson Algorthms for an x86 Mcroprocessor wth a Rectangular Multpler Mchael J. Schulte Dmtr Tan Carl E. Lemonds Unversty of Wsconsn Advanced Mcro Devces Advanced Mcro Devces Schulte@engr.wsc.edu

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

FPGA Based Fixed Width 4 4, 6 6, 8 8 and Bit Multipliers using Spartan-3AN

FPGA Based Fixed Width 4 4, 6 6, 8 8 and Bit Multipliers using Spartan-3AN IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.11 No.2, February 211 61 FPGA Based Fxed Wdth 4 4, 6 6, 8 8 and 12 12-Bt Multplers usng Spartan-3AN Muhammad H. Ras and Mohamed H.

More information

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)

For instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1) Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A

More information

Using Delayed Addition Techniques to Accelerate Integer and Floating-Point Calculations in Configurable Hardware

Using Delayed Addition Techniques to Accelerate Integer and Floating-Point Calculations in Configurable Hardware Draft submtted for publcaton. Please do not dstrbute Usng Delayed Addton echnques to Accelerate Integer and Floatng-Pont Calculatons n Confgurable Hardware Zhen Luo, Nonmember and Margaret Martonos, Member,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Integrated interconnect circuit modeling for VLSI design

Integrated interconnect circuit modeling for VLSI design Integrated nterconnect crcut modelng for VLSI desgn Won-Young Jung, Ghun-Up Cha, Young-Bae Km, Jun-Ho Baek, Choon-Kyung Km LG Semcon Co.,Ltd. D.A. Center 16, Woomyeon-dong, Seocho-gu, Seoul, 137-140, Korea.

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Real-Time Systems. Real-Time Systems. Verification by testing. Verification by testing

Real-Time Systems. Real-Time Systems. Verification by testing. Verification by testing EDA222/DIT161 Real-Tme Systems, Chalmers/GU, 2014/2015 Lecture #8 Real-Tme Systems Real-Tme Systems Lecture #8 Specfcaton Professor Jan Jonsson Implementaton System models Executon-tme analyss Department

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) , Fax: (370-5) ,

VRT012 User s guide V0.1. Address: Žirmūnų g. 27, Vilnius LT-09105, Phone: (370-5) , Fax: (370-5) , VRT012 User s gude V0.1 Thank you for purchasng our product. We hope ths user-frendly devce wll be helpful n realsng your deas and brngng comfort to your lfe. Please take few mnutes to read ths manual

More information

Harvard University CS 101 Fall 2005, Shimon Schocken. Assembler. Elements of Computing Systems 1 Assembler (Ch. 6)

Harvard University CS 101 Fall 2005, Shimon Schocken. Assembler. Elements of Computing Systems 1 Assembler (Ch. 6) Harvard Unversty CS 101 Fall 2005, Shmon Schocken Assembler Elements of Computng Systems 1 Assembler (Ch. 6) Why care about assemblers? Because Assemblers employ some nfty trcks Assemblers are the frst

More information

Review of approximation techniques

Review of approximation techniques CHAPTER 2 Revew of appromaton technques 2. Introducton Optmzaton problems n engneerng desgn are characterzed by the followng assocated features: the objectve functon and constrants are mplct functons evaluated

More information

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL)

Circuit Analysis I (ENGR 2405) Chapter 3 Method of Analysis Nodal(KCL) and Mesh(KVL) Crcut Analyss I (ENG 405) Chapter Method of Analyss Nodal(KCL) and Mesh(KVL) Nodal Analyss If nstead of focusng on the oltages of the crcut elements, one looks at the oltages at the nodes of the crcut,

More information

High level vs Low Level. What is a Computer Program? What does gcc do for you? Program = Instructions + Data. Basic Computer Organization

High level vs Low Level. What is a Computer Program? What does gcc do for you? Program = Instructions + Data. Basic Computer Organization What s a Computer Program? Descrpton of algorthms and data structures to acheve a specfc ojectve Could e done n any language, even a natural language lke Englsh Programmng language: A Standard notaton

More information

AADL : about scheduling analysis

AADL : about scheduling analysis AADL : about schedulng analyss Schedulng analyss, what s t? Embedded real-tme crtcal systems have temporal constrants to meet (e.g. deadlne). Many systems are bult wth operatng systems provdng multtaskng

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits Repeater Inserton for Two-Termnal Nets n Three-Dmensonal Integrated Crcuts Hu Xu, Vasls F. Pavlds, and Govann De Mchel LSI - EPFL, CH-5, Swtzerland, {hu.xu,vasleos.pavlds,govann.demchel}@epfl.ch Abstract.

More information

Assembler. Building a Modern Computer From First Principles.

Assembler. Building a Modern Computer From First Principles. Assembler Buldng a Modern Computer From Frst Prncples www.nand2tetrs.org Elements of Computng Systems, Nsan & Schocken, MIT Press, www.nand2tetrs.org, Chapter 6: Assembler slde Where we are at: Human Thought

More information

Rapid Development of High Performance Floating-Point Pipelines for Scientific Simulation 1

Rapid Development of High Performance Floating-Point Pipelines for Scientific Simulation 1 Rapd Development of Hgh Performance Floatng-Pont Ppelnes for Scentfc Smulaton 1 G. Lenhart, A. Kugel and R. Männer Dept. for Computer Scence V, Unversty of Mannhem, B6-26B, D-68131 Mannhem, Germany {lenhart,kugel,maenner}@t.un-mannhem.de

More information

A New Memory Reduced Radix-4 CORDIC Processor For FFT Operation

A New Memory Reduced Radix-4 CORDIC Processor For FFT Operation IOSR Journal of VLSI and Sgnal Processng (IOSR-JVSP) Volume, Issue 5 (May. Jun. 013), PP 09-16 e-issn: 319 400, p-issn No. : 319 4197 www.osrjournals.org A New Memory Reduced Radx-4 CORDIC Processor For

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Storage Binding in RTL synthesis

Storage Binding in RTL synthesis Storage Bndng n RTL synthess Pe Zhang Danel D. Gajsk Techncal Report ICS-0-37 August 0th, 200 Center for Embedded Computer Systems Department of Informaton and Computer Scence Unersty of Calforna, Irne

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Brave New World Pseudocode Reference

Brave New World Pseudocode Reference Brave New World Pseudocode Reference Pseudocode s a way to descrbe how to accomplsh tasks usng basc steps lke those a computer mght perform. In ths week s lab, you'll see how a form of pseudocode can be

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Assembler. Shimon Schocken. Spring Elements of Computing Systems 1 Assembler (Ch. 6) Compiler. abstract interface.

Assembler. Shimon Schocken. Spring Elements of Computing Systems 1 Assembler (Ch. 6) Compiler. abstract interface. IDC Herzlya Shmon Schocken Assembler Shmon Schocken Sprng 2005 Elements of Computng Systems 1 Assembler (Ch. 6) Where we are at: Human Thought Abstract desgn Chapters 9, 12 abstract nterface H.L. Language

More information

AVO Modeling of Monochromatic Spherical Waves: Comparison to Band-Limited Waves

AVO Modeling of Monochromatic Spherical Waves: Comparison to Band-Limited Waves AVO Modelng of Monochromatc Sphercal Waves: Comparson to Band-Lmted Waves Charles Ursenbach* Unversty of Calgary, Calgary, AB, Canada ursenbach@crewes.org and Arnm Haase Unversty of Calgary, Calgary, AB,

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Run-Time Energy Estimation in System-On-a-Chip Designs *

Run-Time Energy Estimation in System-On-a-Chip Designs * Run-Tme Energy Estmaton n System-On-a-Chp Desgns * J. Had, G. Kaefer, Ch. Steger, R. Wess Insttute for Techncal Informatcs Graz Unversty of Technology Graz, Austra Abstract - In ths paper, a co-processor

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Parallel Inverse Halftoning by Look-Up Table (LUT) Partitioning

Parallel Inverse Halftoning by Look-Up Table (LUT) Partitioning Parallel Inverse Halftonng by Look-Up Table (LUT) Parttonng Umar F. Sddq and Sadq M. Sat umar@ccse.kfupm.edu.sa, sadq@kfupm.edu.sa KFUPM Box: Department of Computer Engneerng, Kng Fahd Unversty of Petroleum

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Simple March Tests for PSF Detection in RAM

Simple March Tests for PSF Detection in RAM Smple March Tests for PSF Detecton n RAM Ireneusz Mroze Balysto Techncal Unversty Computer Scence Department Wejsa 45A, 5-35 Balysto POLAND mroze@.pb.balysto.pl Eugena Buslowsa Balysto Techncal Unversty

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Resource Efficient Design and Implementation of Standard and Truncated Multipliers using FPGAs

Resource Efficient Design and Implementation of Standard and Truncated Multipliers using FPGAs Proceedngs of the World Congress on Engneerng 2011 Vol II, July 6-8, 2011, London, U.K. Resource Effcent Desgn and Implementaton of Standard and Truncated Multplers usng FPGAs Muhammad H. Ras, Member,

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks

Decision Strategies for Rating Objects in Knowledge-Shared Research Networks Decson Strateges for Ratng Objects n Knowledge-Shared Research etwors ALEXADRA GRACHAROVA *, HAS-JOACHM ER **, HASSA OUR ELD ** OM SUUROE ***, HARR ARAKSE *** * nsttute of Control and System Research,

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Modeling Multiple Input Switching of CMOS Gates in DSM Technology Using HDMR

Modeling Multiple Input Switching of CMOS Gates in DSM Technology Using HDMR 1 Modelng Multple Input Swtchng of CMOS Gates n DSM Technology Usng HDMR Jayashree Srdharan and Tom Chen Dept. of Electrcal and Computer Engneerng Colorado State Unversty, Fort Collns, CO, 8523, USA (jaya@engr.colostate.edu,

More information

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.15 No.10, October 2015 1 Evaluaton of an Enhanced Scheme for Hgh-level Nested Network Moblty Mohammed Babker Al Mohammed, Asha Hassan.

More information

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup

More information

Configuration Management in Multi-Context Reconfigurable Systems for Simultaneous Performance and Power Optimizations*

Configuration Management in Multi-Context Reconfigurable Systems for Simultaneous Performance and Power Optimizations* Confguraton Management n Mult-Context Reconfgurable Systems for Smultaneous Performance and Power Optmzatons* Rafael Maestre, Mlagros Fernandez Departamento de Arqutectura de Computadores y Automátca Unversdad

More information

Analysis of Min Sum Iterative Decoder using Buffer Insertion

Analysis of Min Sum Iterative Decoder using Buffer Insertion Analyss of Mn Sum Iteratve ecoder usng Buffer Inserton Saravanan Swapna M.E II year, ept of ECE SSN College of Engneerng M. Anbuselv Assstant Professor, ept of ECE SSN College of Engneerng S.Salvahanan

More information