ALU Augmentation for MPEG-4 Repetitive Padding

Size: px
Start display at page:

Download "ALU Augmentation for MPEG-4 Repetitive Padding"

Transcription

1 ALU Augmetatio for MPEG-4 Repetitive Paddig Georgi Kuzmaov Stamatis Vassiliadis Computer Egieerig Lab, Electrical Egieerig Departmet, Faculty of formatio Techology ad Systems, Delft Uiversity of Techology, P.O. Box 5031, 2600 GA Delft, The Netherlads fG.Kuzmaov, Abstract this paper we augmet a geeral purpose ALU with a extra fuctioality - a repetitive paddig operatio. The proposed solutio eables the processor to perform the time exhaustive MPEG-4 paddig algorithm i real time. At trivial hardware costs of a few hudred 2x2 AND-OR (or equivalet) logical gates, we achieve a order of magitude speed-up whe compared to software ruig o a geeral purpose processor. Our approach allows the MPEG-4 software paddig algorithm to ru i real time for its most demadig profiles. As a example, a 64-bit pipelied implemetatio is discussed i details. The approach is geeral ad fits ito differet architectural cocepts ad ALU operad widths. More specifically it is show that 344 extra gates are ecessary for a 64-bit implemetatio ad at these expeses a processig speed of more tha 7 millio macroblocks per secod (MB/s) ca be achieved. This is a order of magitude higher tha the requiremets of the most-demadig MPEG-4 profile levels. Speed ad hardware estimatios are also reported for 32 ad 128-bit ALUs. 1. troductio Assumig MPEG stadards, MPEG-4 [5, 6] is the first to deal with cotet-based codig of audio-visual scees. To allow the efficiet implemetatio of the stadard, MPEG-4 defies several profiles. These profiles group the large set of required tools, accordig to the targeted classes of applicatios. Withi each profile, a umber of levels costrai the computatioal complexity ad the required data badwidth of the applicatio. literature [8, 9], complexity aalysis results idicate that the computatioal requiremets of the highest profiles ad levels of MPEG-4 are measured i billios of RSC-like istructios per secod. These umbers will sigificatly exceed the capabilities of the geeral purpose processors, despite the ear future techology achievemets. The work preseted i this paper is a part of a research activity, i which we focus o speedig MPEG-4 applicatios up. Our basic approach is to augmet geeral-purpose Arithmetic- Logical-Uits (ALU) with applicatio specific fuctioalities. The specific fuctios are used to redefie the architecture ad are implemeted i hardware. Due to the differet requiremets of the MPEG-4 profiles ad the large umber of fuctioalities ivolved i it, the requiremets for costeffective implemetatios are essetial. Oe importat ew feature i MPEG-4 is the paddig techique, defied at all Levels i the Core ad Mai Profiles of the stadard. Software profilig results, reported i [2, 3, 8, 14, 15], idicate that paddig is a computatioally demadig ad time cosumig process, which restricts the real time operatio of the MPEG-4 codecs. this paper we propose a ALU augmetatio circuitry, which eables a geeral purpose processor to perform the paddig algorithm i real time. Thus we utilize the available resources of the ALU, ad by exploitig the sub-word parallelism ad dedicated istructios, we dramatically icrease the computatioal power of the architecture. The proposed solutio proved the followig advatages: ffl Real time processig for all MPEG-4 profiles ad levels, utilizig the paddig algorithm ca be achieved ffl Scalable implemetatio, tuable to differet operad sizes ffl Geeral implemetatio for differet architectural cocepts The implemetatio is estimated aalytically ad its ifluece o the performace ad the hardware cost of augmeted ALUs with differet operad widths is discussed. Our results idicate high processig speed, achieved at trivial implemetatio costs. The achieved worst-case beefits for a 64-bit ALU example are: ffl Approximately 30 times faster processig tha the highest MPEG-4 requiremets. 45 SBN:

2 ffl Over 1000 times less MPS (Millios of RSC-like structios per Secod) tha the software implemetatio o geeral purpose processors. ffl Oly 344 AND-OR gates extra hardware pealty. Our results strogly suggest that software ruig o a processor icorporatig our proposal will always meet the highest stadard requiremets. The remaider of the discussio i this paper is orgaized as follows. Sectio 2 gives some backgroud iformatio ad the motivatio for the preseted research, icludig a descriptio of the repetitive paddig algorithm. Sectio 3, we describe the proposed implemetatio i details. Sectio 4 gives a quatitative evaluatio of the desig. Fially, the coclusios are preseted i Sectio Backgroud ad Motivatio For cotet-based codig, MPEG-4 uses the cocept of a Video Object Plae (VOP). A VOP is a arbitrarily shaped regio of a frame, which usually correspods to a sematic object i the visual scee. A sequece of VOPs i the time domai is referred to as a Video Object (VO). Each VOP is described by its shape ad texture. Shape is maily represeted i biary format. This format represets the shape as a bitmap, referred to as biary alpha plae. Each pixel i this plae takes oe of two possible values, which idicate whether the pixel belogs to the object or ot. The biary alpha plae is divided ito 16x16-pixel blocks called Biary Alpha Blocks (B). The texture of a VOP represets its color by macroblocks. Each macroblock cosists of oe 16x16 array of lumiace (grayscale) pixels ad two 8x8 arrays of chromiace (color) pixels, which represet the full-color of the correspodig 16x16 area of a VOP. As its precedig visual data compressio stadards, MPEG-4 adopts motio compesatio techiques to exploit temporal redudacies i the ecoded video sequeces. MPEG-4, this process icludes a search algorithm for best matchig betwee the macroblock to be ecoded ad a area of previously ecoded frame. The Repetitive Paddig Algorithm. The purpose of paddig i MPEG-4 is to esure more accurate block matchig i motio compesatio algorithms for arbitrary shaped visual objects. The paddig process defies the fullcolor values (lumiace + chromiace) for pixels outside the shape of a VOP. paddig, two types of macroblocks are of iterest. Macroblocks, which lie o the boudary of the VOP are referred to as boudary blocks. They are processed by the so called repetitive paddig. Exterior macroblocks (completely outside the VOP) are padded usig the exteded paddig method. Sice repetitive paddig is the most demadig paddig algorithm, i this paper we will cosider the paddig of boudary macroblocks. The repetitive paddig algorithm is described i [5, 13], but i literature some modificatios ca be met. [4, 7, 11] ew algorithms or algorithm modificatios are proposed to redefie or eve substitute the origial repetitive paddig. All of them, however, suggest software improvemets of the codig efficiecy ad visual quality ad do ot focus o the real time performace. A hardware acceleratio of the paddig is discussed i [1]. the same paper, the paddig algorithm is modified to support specific istructio set extesios as the horizotal ad vertical paddig processes are divided ito two phases each. These two phases cosequetly sca the lies/colums ito two opposite directios ad perform the paddig operatios. The proposed hardware solutio there is a hardwired dedicated paddig uit. the preset paper we use the stadard repetitive paddig algorithm (differetiates from [4, 7, 11]) o a geeral ALU (differetiates from [1, 4, 7, 11]), where a boudary block is separately processed horizotally, per sca-lie basis ad vertically - per colums. We still sca each lie ad colum of a macroblock bidirectioally, but we do it i parallel (differetiates from [1]), thus savig a umber of processig cycles. The stadard repetitive paddig algorithm, as defied i [5], is equivalet to the followig steps: 1. Defie ay pixel outside the object boudary as a zero pixel. Make a duplicate biary alpha map. 2. Sca each horizotal lie of a block. Each sca lie is possibly composed of zero ad ozero lie segmets (accordig to the shape bits i the biary alpha map). (a) zero segmets, betwee a ed poit of the sca lie ad the ed poit of a ozero segmet, all zero pixels are replaced by the pixel value of the ed pixel of ozero segmet. (b) zero segmets, betwee the ed poits of two differet ozero segmets, all zero pixels take the average value of these two ed poits. Nozero segmets are ot processed. All shape bits, correspodig to padded pixels are set i the duplicate biary alpha map. 3. Sca each vertical lie of the block ad perform the idetical procedure as described for the horizotal lie. The updated shape iformatio from the duplicate biary alpha map is used. Ulike its predecessors, MPEG-4 is much more demadig i terms of computatioal complexity with eve more data itesive algorithms, which is illustrated i Table 1. While the computatioal complexity of the future geeral purpose processors may meet the demads of the lowest 46 SBN:

3 Simple Profile Levels, the challege is still to meet the requiremets of the most-demadig Core ad Mai Visual Profile of MPEG-4. Table 1. Visual Defiitios ad Processig Speed i MacroBlocks per secod [MB/s] Profile Le- Sessio # Max. Boudavel Size VO MB/s ry MB/s Mai L4 1920x L3 CCR L2 CF L1 N.A. N.A. N.A. N.A. Core L2 CF L1 QCF Simple L2 CF N.A. Scalable L1 CF N.A. Simple L3 CF N.A. L2 CF N.A. L1 QCF N.A. Motivatio. A summary of the computatioal complexity of the QCF, Core Profile Level 1 of MPEG-4 is reported i [8]. Sice this is the lowest profile level, utilizig the paddig algorithm, we shall cosider its real-time requiremets as the miimum for a hardware implemetatio. At this level, the computatioal power, reported for the software ecodig of a sigle object is i the order of 4500 Millio RSC-like structios Per Secod (MPS). Assumig a software performace optimizatio by a factor of up to 10, the total computatioal complexity is just withi the computatioal capabilities of the cotemporary geeral purpose processors( MPS). the case of 4 video objects (see Table 1), however, the real-time software feasibility becomes problematic. Therefore, the eed of a hardware acceleratio of MPEG-4 is evidet, eve at this relatively low profile level. Further aalysis of the requiremets for the software implemetatio idicates that the paddig algorithm occupies some 175 MPS for a sigle video object, or aroud 700 MPS for the maximum 4 video objects, stated at Level 1 of the Core profile (Table 1). A geeral purpose processor with such computatioal power is still very expesive to be dedicated just for the repetitive paddig calculatios. f we cosider Table 1, we ca estimate that the required speed of 5940 MB/s for the Core Profile Level 1 is approximately 82 times lower tha the speed requiremets of the highest - Mai@Level4 Profile ( MB/s). A simple arithmetic estimatio idicates that for the highest MPEG-4 profile level, the o-optimized software paddig would require approximately MPS ad whe extremely optimized (10 times speed-up) - i the order of 6000 MPS. Eve for the sigificatly less complex decoder part of MPEG-4, the paddig algorithm will require some MPS for o-optimized software implemetatio dow to 2500 MPS i dramatically optimized programmig. These umbers, huge eve for the expected techology levels of the ear future, motivated our research to focus o a cost-effective hardware solutio of the MPEG- 4 algorithms ad the repetitive paddig i particular. The large umber of alteratively used algorithms i MPEG-4, however, makes the implemetatio of dedicated hardware uits iefficiet, sice a lot of them may remai uutilized. A good example is the paddig algorithm, which is ot icluded i the Simple Profile Levels of MPEG-4. Thus, a multi-profile codec would ot utilize a hardwired paddig accelerator, whe ruig at ay of the Simple Profile levels. A promisig solutio of this problem is the recofigurable implemetatio of hardware accelerators [10, 16]. Aother possible approach to icrease the efficiecy of a geeral purpose architecture is adopted i this paper. We augmet the ALU with additioal circuitry that eriches its fuctioality with oe more operatio - the repetitive paddig. Thus we utilize the available resources of the ALU, ad by exploitig a sub-word parallelism ad dedicated istructios, we dramatically icrease the computatioal power of the architecture. 3. The Augmeted ALU Sice paddig is performed over horizotal ad vertical pixel lies i idetical maer ad the width of the pixel data is relatively small (8 or 12 bits i MPEG-4), a wider geeral purpose ALU ca ot compute it efficietly. To improve the computatioal efficiecy of such ALUs, we ca exploit a sub-word data parallelism. this sectio we augmet a geeral purpose ALU, to make it capable to perform a paddig operatio. Sice 8-bit iteger chromiace ad lumiace data represetatios are the most frequetly used, i this paper we assume the same data formats. Pixel Processig. A sigle byte processig structure, which is dedicated to process each pixel of a block, is depicted i Figure 1. The same structure is used both for lumiace ad chromiace blocks paddig. We pipelie the processig flow by dividig it ito two stages. The first stage cotais a Propagatio Node (PN) ad two multiplexers. The multiplexers are required to preserve the origial fuctioality of the ALU. A byte adder ad a output multiplexer build the secod pipelie stage. The byte adder is a part of the origial multi-byte ALU adder, with cotrollable carries betwee the bytes. The paddig output multiplexer ca be merged with the existig ALU output multiplexer ad that is depicted i Figure 1 by the dash-lied arrow, from the logical part of the ALU (LU). The fuctio of the PN is to propagate the appropriate 47 SBN:

4 L (OP1) S(OP2) R S ffl f the iput shape bit S is set (the pixel belogs to the object), the: Stage 1 Stage 2 Cotrol A R N C OUT L N Byte Adder O B S C =0 N From LU RO OP1 OP2 Cotrol L 3 - Register PN AB "PN" - Propagatio Node - Multiplexer RO R 1. The output O takes the value of the iput, i.e. the pixel keeps its color. 2. The value of the iput (pixel) is propagated to the left ad to the right (via outputs ad RO) for further processig. The shape iput bit S is propagated by the same multiplexers ad occupies the most-sigificat bits of ad RO. 3. The output bit S 0 is set, meaig the pixel has bee processed. Figure 1. A ALU augmetatio for a sigle pixel paddig values to its eighborig paddig PNs ad to supply data ad cotrol sigals to the byte adder ad the output multiplexer. Table 2 represets the cotrol sigals of the output multiplexer (SR(C OUT,ADD) meas Shift Right with 1-bit the ADDer output together with its carry). Sigal Cotrol determies whether the ALU will perform paddig, or its origial operatio(s). Table 2. Truth Table for the Cotrol Sigals of the Output Multiplexer Cotrol S R N R N O 0 ADD ADD ADD SR(C OUT,ADD) 1 1 The followig equatios describe the fuctioality of the PN: = S js; j _S R; RO = S js; j _S L; (1) S 0 = S _ L 8 _ R 8 ; (2) L; R are left ad right 9 bit iput vectors; ; RO are left ad right 9 bit output vectors; is the data iput 8 bit vector; S is the shape (iput) bit before processig; S 0 is a mask output bit after processig; js; j deotes the cocateatio of bit S ad vector ; ad L N represets the N th bit of vector L. The operatio of the PN is as follows. ffl f the iput shape bit S is zero (the pixel does ot belog to the object ad has to be padded), the: 1. The outputs A ad B propagate to the byte adder the least sigificat 8-bits of R ad L respectively. The propagated eighborig shape bit values (L 8, R 8 ) ad bit S, are issued as cotrol sigals to the output multiplexer. The multiplexer redirects to O either the output of the adder or its right shifted value (see Table 2) i.e. the pixel takes the padded value. 2. The L value is propagated via RO ad the R - via icludig color ad shape iformatio. 3. The output bit S 0 is set, meaig the pixel has bee processed. Lie / Colum Paddig. To process a lie or a colum from a block by a -byte ALU, we have to implemet a chai of PN (i.e. -pixel parallel paddig). A sectio of such processig circuitry for two eighborig pixels is depicted i Figure 2. This structure is scalable ad ca cotai a arbitrary umber of propagatio odes, depedig o the ALU width (i.e. elemets for a -byte ALU). t has the followig features: 1. Operads are bypassig the first pipelie stage for a covetioal ALU operatio, thus preservig the origial timig. 2. Operads are passed through the Propagatio Nodes oly whe a paddig operatio is performed 3. The critical path of the operads through the PNs for a -byte adder is equal to the delay of multiplexers. 4. The critical path pealty for a read ALU operad is oly oe 2-1 multiplexer. 5. The ALU critical path pealty is a sigle 2-way AND elemet. 48 SBN:

5 Op A Op B Op A Op B 64 Stage 1 L PN S R RO L PN S Stage 1 PN Chai 64 Stage 2 C N + + C OUT C N Shape A B + From Logical Uit Stage 2 64-bit Adder Shape Result (a) Geeral View of the Pipelie Stages Figure 2. A Sca Lie / Colum Paddig Augmetatio of a ALU PN Chai The last two pealties may be avoided if special desigs are employed (see [12]). Puttig Everythig Together. Sice a macroblock cosist of oe 16x16 lumiace ad two 8x8 chromiace blocks, it is efficiet to implemet structures processig 8 or 16 pixels simultaeously. This meas that with 64 or 128- bit ALUs we will be able to pad a etire lie or colum of chromiace ad/or lumiace blocks i pipelie maer for two ALU cycles. However, the proposed structure is capable to perform paddig eve whe it is implemeted o smaller ALUs. Here, we will explai i more details the paddig process flow, performed by a 64-bit ALU. the ext sectio we will give geeralized estimatios for -byte ALUs. First, for the proper circuit operatio, the left-most ad right-most iputs of the structure should be iitialized. Figure 3(a) depicts a geeral view of the 64-bit iitializatio for lumiace lie / colum paddig. The ad buffers are 8-bits wide ad cotai the iitializatio values, required by the uit to start the operatio. Figure 3(b), the cycle partitioig of the lumiace lie paddig is illustrated. Sice a lumiace lie (16x8-bit) ca ot be processed i oe pass by a 64 bit (8x8-bit) ALU, we assume that the left-most half of the lie is processed first. Depedig o the right-most shape bit of this first half of the lie, the full-lie paddig would require oe or two more cycles. f the right-most shape bit of the first half of the lumiace lie is 0, it meas that data from the other half of the lie is required to pad the first half. Thus the data, propagated to the right is stored ito register. the ext cycle this data is used to fully pad the secod half of the lie, ad the byte, which has to be propagated to the left is PN Chai PN Chai BN - Buffer N; -Do t care; 0-zero. (b) Data Bufferig by Cycles 0 Figure 3. Data itializatio ad Bufferig for Lumiace Lie / Colum Processig by a 64- bit ALU. stored i buffer. Now, a third cycle is executed o the first half of the lumiace lie, with the proper right-to-left propagatio value stored i the buffer. f the right-most shape bit of the first half of the lumiace lie is 1, the pixel value to be propagated right is stored ito buffer, ad the first half of the lumiace lie is completely padded. Just oe more cycle is required to pad the secod half of the lumiace lie, provided the right propagatio value is drive to the left iput of the circuit from. Sice the right-most shape bit of the first half of the lumiace lie is available before the ext operads (the other half of the padded lie) are issued, we have brach determiatio (a perfect brach predictio) for the pipelie. Therefore, we ca coclude that a complete lumiace lie paddig by a 64-bit paddig augmeted ad pipelied ALU would take o average 2.5 cycles to perform. The paddig 49 SBN:

6 of a chromiace lie by the same ALU would take approximately oe cycle for a log data sequece. The processig flow is similar for a 32-bit ALU, but the required average cycle umber is 5.5 (at least 4 at most 7 cycles), ad the aalysis is made o the right-most shape bits of each byte (4x8-bits) to be processed. Give the shape iformatio of the whole lie is available a priory we ca still make a perfect brach predictio, i.e. we ca predict the operads issue sequece perfectly. Note o Figure 3(b) that the left ad right-most iputs of the whole lumiace lie are iitialized with 0, icludig the propagatio of the shape bit. This is also valid for the chromiace lie processig ad for all up or dow scalig ALU implemetatios (say 32 or 128 bit ALUs). The ext sectio gives more accurate estimatio of the processig speed of several differet implemetatios. 4. Evaluatio whole macroblock will be padded for 32 F (N P 8 + N P 16 ) [s]. We ca formulate the Processig speed of the ALU as follows: F P rocessig speed ß 32 (N P 8 + N P 16 [MB=s] (3) ) Assumig a realistic value of F = 1GHz ad usig the data from Table 3 ito Equatio 3, we calculated the processig speed results, give i Table 4. Table 4. Processig Speed at F =1GHz ALU Speed [MB/s] bits Average Worst Case 32-bit bit bit Speed estimatio. We ca easily evaluate the processig speed of the structure, give its operatig speed 1. Let s assume a 8-bit paddig augmeted ALU like the oe, depicted i Figure 2, operatig at frequecy F [Hz]. The values of with practical sigificace are 4, 8, 16 (32, 64 or eve 128-bit ALU). We state two more parameters of the particular implemetatio: N P8 ad N P16 respectively deote the umbers of cycles, ecessary to process a 8-pixel (chromiace) ad a 16- pixel (lumiace) lie from a log data sequece. Some potetial values of these parameters are show i Table 3. Because of the data structures, imposed by MPEG-4, N P 8 [MacroBlocks / secod] Our scheme MPEG-4 Requiremets Table 3. Values of N P 8 P 16 ad N ALU Average Worst Case bits N P 8 N P 16 N P 8 N P bit bit bit is a variable for < 8, ad approximately costat for 8 (64-bit ALU). N P 16 is a variable for < 16, ad a costat for 16 (128-bit ALU). Thus the processig 16 NP of 16 pixels by ay 8 ALU will take [secods] F NP F ad for a 256-pixel lumiace block- [s]. detically, the processig of two 8x8-pixel chromiace blocks will take 16 NP 8 [s] i the same ALU cofiguratio. Sice F a macroblock cosists of 256 lumiace ad 128 (2 x 64) chromiace pixels, padded vertically ad horizotally, a 1 this paper we distiguish (data) processig speed, measured i [macroblocks/sec] (or [MB/s]) from the device operatig speed (frequecy), measured i [Hz] Number of ALU bits Figure 4. Processig Speed for Differet ALU Operad Sizes ad F =1GHz (ote the logarithmic scale) The most demadig profile level, level 4 of the Mai MPEG-4 profile, requires Boudary MB/s (maximum MB/s) for a high resolutio sessio type (1920 x 1088) ad 32 objects (Table 1). These rates are a order of magitude lower tha the reported speed results for the feasible paddig augmeted ALU implemetatios (see Figure 4). The potetials of the structure idicate capabilities to meet eve more-demadig future profiles of the visual data compressio stadards. Furthermore, a software implemetatio of the paddig algorithm o a paddig-augmeted ALU would cost a eglectable umber of MPS (Millios of RSC-like structios per Secod), compared to the MPS umbers, discussed i the Sectio 2. Table 5 cotais the estimated computatioal load of differet paddig augmeted ALU sizes for Level4 of the Mai MPEG-4 profile. 50 SBN:

7 Table 5. Computatioal load for Mai Profile - Level 4 at F =1GHz ALU [MPS] bits Average Worst Case 32-bit bit bit Hardware estimatios. We choose the 2x2 AND-OR logic block as a basis for the hardware estimatios. A oebit 2 to 1 multiplexor is a 2x2 AND-OR gate. The hardware pealty for a sigle byte paddig structure is: 2 x 9-bit multiplexers, 2 x 8-bit multiplexers ad 1 OR gate. That makes 2x9+2x8+1=35 2x2 AND-OR gates. A -byte implemetatio will cost 35 AND-OR gates plus additioal cost for the ALU multiplexer of 8 gates, i.e. 43 2x2 AND- OR gates. Table 6 cotais the exact values of the hardware pealties for differet ALU sizes. 5. Coclusios Table 6. Hardware estimatio ALU Number of bits extra gates 32-bit bit bit this paper we itroduced a scheme for geeral purpose ALU augmetatio, which accelerates the MPEG-4 paddig algorithm with orders of magitude. We proposed a pipelied implemetatio of the idea, thus preservig the origial fuctioality ad timig scheme of the target ALU. At a trivial hardware cost of oly a few hudred elemetary 2x2 AND-OR logical gates, we could easily achieve a real-time performace at the most-demadig profile levels of MPEG-4. We proved that the proposed desig is scalable by applyig it o ALUs with differet operad widths. The approach is geeral ad ca fit ito differet architectural cocepts. 6. Ackowledgemets This research is supported by PROGRESS, the embedded systems research program of the Dutch orgaizatio for Scietific Research NWO, the Dutch Miistry of Ecoomic Affairs, the Techology Foudatio STW (project AES.5021) ad PHLPS Research Laboratories, Eidhove, The Netherlads. Refereces [1] M. Berekovic, H.-J. Stolberg, M. B. Kulaczewski, P. Pirsh, H. Moler, H. Ruge, J. Keip, ad B. Staberack. structio set extesios for mpeg-4 video. Joural of VLS Sigal Processig, 23(1):27 49, October [2] H.-C. Chag, L.-G. Che, M.-Y. Hsu, ad Y.-C. Chag. Performace aalysis ad architecture evaluatio of MPEG-4 video codec system. EEE teratioal Symposium o Circuits ad Systems, volume, pages , Geeva, Switzerlad, May [3] H.-C. Chag, Y.-C. Wag, M.-Y. Hsu, ad L.-G. Che. Efficiet algorithms ad architectures for MPEG-4 object-based video codig. EEE Workshop o Sigal Processig Systems, pages 13 22, Oct [4] E. A. Edirisighe, J. Jiag, ad C. Grecos. Shape adaptive paddig for MPEG-4. EEE Trasactios o Cosumer Electroics, 46(3): , August [5] SO/EC JTC11/SC29/WG11, N3312. MPEG-4 video verificatio model versio [6] SO/EC JTC11/SC29/WG11 N4030. MPEG-4 overview, March [7] A. Kaup. Object-based texture codig of movig video i MPEG-4. EEE Trasactios o Circuits ad Systems for Video Techology, 9(1):5 15, February [8] J. Keip, S. Bauer, J. Vollmer, B. Schmale, P. Kuh, ad M. Reissma. The MPEG-4 video codig stadard - a VLS poit of view. EEE Workshop o Sigal Processig Systems,(SPS98), pages 43 52, 8-10 Oct [9] P. Kuh ad W. Stechele. Complexity aalysis of the emergig MPEG-4 stadard as a basis for VLS implemetatio. SPE Visual Comuicatios ad mage Processig (VCP), volume 3309, pages , Sa Jose, Ja [10] G. Kuzmaov, S. Vassiliadis, ad J. va Eijdhove. A Paddig Processor for MPEG-4. 12th Aual Workshop o Circuits, Systems, ad Sigal Processig (ProR- SC2001), Veldhove, The Netherlads, October [11] J.-H. Moo, J.-H. Kweo, ad H.-K. Kim. Boudary blockmergig (M) techique for efficiet texture codig of arbitrarily shaped object. EEE Trasactios o Circuits ad Systems for Video Techology, 9(1):35 43, February [12] M. Putrio, S. Vassiliadis, ad E. Schwarz. Parallel biary byte adder / subtracter. teratioal Joural of Electroics, 65(2): , February [13] Y. Q. Shi ad H. Su. mage ad Video Compressio for Multimedia Egieerig. Boca Rato CRC Press, [14] H.-J. Stolberg, M. Berekovic, P. Pirsch, H. Ruge, H. Moller, ad J. Keip. The M-PRE MPEG-4 codec DSP ad its macroblock egie. EEE teratioal Symposium o Circuits ad Systems, volume, pages , Geeva, Switzerlad, May [15] S. Vassiliadis, G. Kuzmaov, ad S. Wog. MPEG-4 ad the New Multimedia Architectural Challeges. 15th teratioal Coferece SAER 2001, St.Kostati, Bulgaria, Sept [16] S. Vassiliadis, S. Wog, ad S. Cotofaa. The MOLEN rm-coded processor. 11th teratioal Coferece o Field Programmable Logic ad Applicatios (FPL), Belfast, Norther relad, UK, August SBN:

Appendix D. Controller Implementation

Appendix D. Controller Implementation COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Appedix D Cotroller Implemetatio Cotroller Implemetatios Combiatioal logic (sigle-cycle); Fiite state machie (multi-cycle, pipelied);

More information

Elementary Educational Computer

Elementary Educational Computer Chapter 5 Elemetary Educatioal Computer. Geeral structure of the Elemetary Educatioal Computer (EEC) The EEC coforms to the 5 uits structure defied by vo Neuma's model (.) All uits are preseted i a simplified

More information

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design

CSC 220: Computer Organization Unit 11 Basic Computer Organization and Design College of Computer ad Iformatio Scieces Departmet of Computer Sciece CSC 220: Computer Orgaizatio Uit 11 Basic Computer Orgaizatio ad Desig 1 For the rest of the semester, we ll focus o computer architecture:

More information

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 )

EE260: Digital Design, Spring /16/18. n Example: m 0 (=x 1 x 2 ) is adjacent to m 1 (=x 1 x 2 ) and m 2 (=x 1 x 2 ) but NOT m 3 (=x 1 x 2 ) EE26: Digital Desig, Sprig 28 3/6/8 EE 26: Itroductio to Digital Desig Combiatioal Datapath Yao Zheg Departmet of Electrical Egieerig Uiversity of Hawaiʻi at Māoa Combiatioal Logic Blocks Multiplexer Ecoders/Decoders

More information

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits

Reversible Realization of Quaternary Decoder, Multiplexer, and Demultiplexer Circuits Egieerig Letters, :, EL Reversible Realizatio of Quaterary Decoder, Multiplexer, ad Demultiplexer Circuits Mozammel H.. Kha, Member, ENG bstract quaterary reversible circuit is more compact tha the correspodig

More information

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method

A New Morphological 3D Shape Decomposition: Grayscale Interframe Interpolation Method A ew Morphological 3D Shape Decompositio: Grayscale Iterframe Iterpolatio Method D.. Vizireau Politehica Uiversity Bucharest, Romaia ae@comm.pub.ro R. M. Udrea Politehica Uiversity Bucharest, Romaia mihea@comm.pub.ro

More information

End Semester Examination CSE, III Yr. (I Sem), 30002: Computer Organization

End Semester Examination CSE, III Yr. (I Sem), 30002: Computer Organization Ed Semester Examiatio 2013-14 CSE, III Yr. (I Sem), 30002: Computer Orgaizatio Istructios: GROUP -A 1. Write the questio paper group (A, B, C, D), o frot page top of aswer book, as per what is metioed

More information

Chapter 3 Classification of FFT Processor Algorithms

Chapter 3 Classification of FFT Processor Algorithms Chapter Classificatio of FFT Processor Algorithms The computatioal complexity of the Discrete Fourier trasform (DFT) is very high. It requires () 2 complex multiplicatios ad () complex additios [5]. As

More information

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO

DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO DESIGN AND ANALYSIS OF LDPC DECODERS FOR SOFTWARE DEFINED RADIO Sagwo Seo, Trevor Mudge Advaced Computer Architecture Laboratory Uiversity of Michiga at A Arbor {swseo, tm}@umich.edu Yumig Zhu, Chaitali

More information

Low Complexity H.265/HEVC Coding Unit Size Decision for a Videoconferencing System

Low Complexity H.265/HEVC Coding Unit Size Decision for a Videoconferencing System BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 15, No 6 Special Issue o Logistics, Iformatics ad Service Sciece Sofia 2015 Prit ISSN: 1311-9702; Olie ISSN: 1314-4081 DOI:

More information

GPUMP: a Multiple-Precision Integer Library for GPUs

GPUMP: a Multiple-Precision Integer Library for GPUs GPUMP: a Multiple-Precisio Iteger Library for GPUs Kaiyog Zhao ad Xiaowe Chu Departmet of Computer Sciece, Hog Kog Baptist Uiversity Hog Kog, P. R. Chia Email: {kyzhao, chxw}@comp.hkbu.edu.hk Abstract

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 19 Query Optimizatio Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio Query optimizatio Coducted by a query optimizer i a DBMS Goal:

More information

Performance Plus Software Parameter Definitions

Performance Plus Software Parameter Definitions Performace Plus+ Software Parameter Defiitios/ Performace Plus Software Parameter Defiitios Chapma Techical Note-TG-5 paramete.doc ev-0-03 Performace Plus+ Software Parameter Defiitios/2 Backgroud ad Defiitios

More information

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming

Lecture Notes 6 Introduction to algorithm analysis CSS 501 Data Structures and Object-Oriented Programming Lecture Notes 6 Itroductio to algorithm aalysis CSS 501 Data Structures ad Object-Orieted Programmig Readig for this lecture: Carrao, Chapter 10 To be covered i this lecture: Itroductio to algorithm aalysis

More information

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON

A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON A SOFTWARE MODEL FOR THE MULTILAYER PERCEPTRON Roberto Lopez ad Eugeio Oñate Iteratioal Ceter for Numerical Methods i Egieerig (CIMNE) Edificio C1, Gra Capitá s/, 08034 Barceloa, Spai ABSTRACT I this work

More information

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Part A Datapath Design COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter The Processor Part A path Desig Itroductio CPU performace factors Istructio cout Determied by ISA ad compiler. CPI ad

More information

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation

Improvement of the Orthogonal Code Convolution Capabilities Using FPGA Implementation Improvemet of the Orthogoal Code Covolutio Capabilities Usig FPGA Implemetatio Naima Kaabouch, Member, IEEE, Apara Dhirde, Member, IEEE, Saleh Faruque, Member, IEEE Departmet of Electrical Egieerig, Uiversity

More information

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control

EE 459/500 HDL Based Digital Design with Programmable Logic. Lecture 13 Control and Sequencing: Hardwired and Microprogrammed Control EE 459/500 HDL Based Digital Desig with Programmable Logic Lecture 13 Cotrol ad Sequecig: Hardwired ad Microprogrammed Cotrol Refereces: Chapter s 4,5 from textbook Chapter 7 of M.M. Mao ad C.R. Kime,

More information

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution

Multi-Threading. Hyper-, Multi-, and Simultaneous Thread Execution Multi-Threadig Hyper-, Multi-, ad Simultaeous Thread Executio 1 Performace To Date Icreasig processor performace Pipeliig. Brach predictio. Super-scalar executio. Out-of-order executio. Caches. Hyper-Threadig

More information

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS

APPLICATION NOTE PACE1750AE BUILT-IN FUNCTIONS APPLICATION NOTE PACE175AE BUILT-IN UNCTIONS About This Note This applicatio brief is iteded to explai ad demostrate the use of the special fuctios that are built ito the PACE175AE processor. These powerful

More information

. Written in factored form it is easy to see that the roots are 2, 2, i,

. Written in factored form it is easy to see that the roots are 2, 2, i, CMPS A Itroductio to Programmig Programmig Assigmet 4 I this assigmet you will write a java program that determies the real roots of a polyomial that lie withi a specified rage. Recall that the roots (or

More information

The Magma Database file formats

The Magma Database file formats The Magma Database file formats Adrew Gaylard, Bret Pikey, ad Mart-Mari Breedt Johaesburg, South Africa 15th May 2006 1 Summary Magma is a ope-source object database created by Chris Muller, of Kasas City,

More information

UNIVERSITY OF MORATUWA

UNIVERSITY OF MORATUWA UNIVERSITY OF MORATUWA FACULTY OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING B.Sc. Egieerig 2014 Itake Semester 2 Examiatio CS2052 COMPUTER ARCHITECTURE Time allowed: 2 Hours Jauary 2016

More information

Fast Fourier Transform (FFT) Algorithms

Fast Fourier Transform (FFT) Algorithms Fast Fourier Trasform FFT Algorithms Relatio to the z-trasform elsewhere, ozero, z x z X x [ ] 2 ~ elsewhere,, ~ e j x X x x π j e z z X X π 2 ~ The DFS X represets evely spaced samples of the z- trasform

More information

Ones Assignment Method for Solving Traveling Salesman Problem

Ones Assignment Method for Solving Traveling Salesman Problem Joural of mathematics ad computer sciece 0 (0), 58-65 Oes Assigmet Method for Solvig Travelig Salesma Problem Hadi Basirzadeh Departmet of Mathematics, Shahid Chamra Uiversity, Ahvaz, Ira Article history:

More information

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM

BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM MATEC Web of Cofereces 79, 01014 (016) DOI: 10.1051/ mateccof/0167901014 T 016 BOOLEAN DIFFERENTIATION EQUATIONS APPLICABLE IN RECONFIGURABLE COMPUTATIONAL MEDIUM Staislav Shidlovskiy 1, 1 Natioal Research

More information

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem

Exact Minimum Lower Bound Algorithm for Traveling Salesman Problem Exact Miimum Lower Boud Algorithm for Travelig Salesma Problem Mohamed Eleiche GeoTiba Systems mohamed.eleiche@gmail.com Abstract The miimum-travel-cost algorithm is a dyamic programmig algorithm to compute

More information

How do we evaluate algorithms?

How do we evaluate algorithms? F2 Readig referece: chapter 2 + slides Algorithm complexity Big O ad big Ω To calculate ruig time Aalysis of recursive Algorithms Next time: Litterature: slides mostly The first Algorithm desig methods:

More information

6.854J / J Advanced Algorithms Fall 2008

6.854J / J Advanced Algorithms Fall 2008 MIT OpeCourseWare http://ocw.mit.edu 6.854J / 18.415J Advaced Algorithms Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 18.415/6.854 Advaced Algorithms

More information

Computers and Scientific Thinking

Computers and Scientific Thinking Computers ad Scietific Thikig David Reed, Creighto Uiversity Chapter 15 JavaScript Strigs 1 Strigs as Objects so far, your iteractive Web pages have maipulated strigs i simple ways use text box to iput

More information

CMSC Computer Architecture Lecture 10: Caches. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 10: Caches. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 10: Caches Prof. Yajig Li Uiversity of Chicago Midterm Recap Overview ad fudametal cocepts ISA Uarch Datapath, cotrol Sigle cycle, multi cycle Pipeliig Basic idea,

More information

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance

Pseudocode ( 1.1) Analysis of Algorithms. Primitive Operations. Pseudocode Details. Running Time ( 1.1) Estimating performance Aalysis of Algorithms Iput Algorithm Output A algorithm is a step-by-step procedure for solvig a problem i a fiite amout of time. Pseudocode ( 1.1) High-level descriptio of a algorithm More structured

More information

EE123 Digital Signal Processing

EE123 Digital Signal Processing Last Time EE Digital Sigal Processig Lecture 7 Block Covolutio, Overlap ad Add, FFT Discrete Fourier Trasform Properties of the Liear covolutio through circular Today Liear covolutio with Overlap ad add

More information

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis

Analysis Metrics. Intro to Algorithm Analysis. Slides. 12. Alg Analysis. 12. Alg Analysis Itro to Algorithm Aalysis Aalysis Metrics Slides. Table of Cotets. Aalysis Metrics 3. Exact Aalysis Rules 4. Simple Summatio 5. Summatio Formulas 6. Order of Magitude 7. Big-O otatio 8. Big-O Theorems

More information

Chapter 3. Floating Point Arithmetic

Chapter 3. Floating Point Arithmetic COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 3 Floatig Poit Arithmetic Review - Multiplicatio 0 1 1 0 = 6 multiplicad 32-bit ALU shift product right multiplier add

More information

Optimization for framework design of new product introduction management system Ma Ying, Wu Hongcui

Optimization for framework design of new product introduction management system Ma Ying, Wu Hongcui 2d Iteratioal Coferece o Electrical, Computer Egieerig ad Electroics (ICECEE 2015) Optimizatio for framework desig of ew product itroductio maagemet system Ma Yig, Wu Hogcui Tiaji Electroic Iformatio Vocatioal

More information

1. SWITCHING FUNDAMENTALS

1. SWITCHING FUNDAMENTALS . SWITCING FUNDMENTLS Switchig is the provisio of a o-demad coectio betwee two ed poits. Two distict switchig techiques are employed i commuicatio etwors-- circuit switchig ad pacet switchig. Circuit switchig

More information

BOOLEAN MATHEMATICS: GENERAL THEORY

BOOLEAN MATHEMATICS: GENERAL THEORY CHAPTER 3 BOOLEAN MATHEMATICS: GENERAL THEORY 3.1 ISOMORPHIC PROPERTIES The ame Boolea Arithmetic was chose because it was discovered that literal Boolea Algebra could have a isomorphic umerical aspect.

More information

Data diverse software fault tolerance techniques

Data diverse software fault tolerance techniques Data diverse software fault tolerace techiques Complemets desig diversity by compesatig for desig diversity s s limitatios Ivolves obtaiig a related set of poits i the program data space, executig the

More information

Enhancing Efficiency of Software Fault Tolerance Techniques in Satellite Motion System

Enhancing Efficiency of Software Fault Tolerance Techniques in Satellite Motion System Joural of Iformatio Systems ad Telecommuicatio, Vol. 2, No. 3, July-September 2014 173 Ehacig Efficiecy of Software Fault Tolerace Techiques i Satellite Motio System Hoda Baki Departmet of Electrical ad

More information

Bayesian approach to reliability modelling for a probability of failure on demand parameter

Bayesian approach to reliability modelling for a probability of failure on demand parameter Bayesia approach to reliability modellig for a probability of failure o demad parameter BÖRCSÖK J., SCHAEFER S. Departmet of Computer Architecture ad System Programmig Uiversity Kassel, Wilhelmshöher Allee

More information

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe

Copyright 2016 Ramez Elmasri and Shamkant B. Navathe Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe CHAPTER 18 Strategies for Query Processig Copyright 2016 Ramez Elmasri ad Shamkat B. Navathe Itroductio DBMS techiques to process a query Scaer idetifies

More information

Neural Networks A Model of Boolean Functions

Neural Networks A Model of Boolean Functions Neural Networks A Model of Boolea Fuctios Berd Steibach, Roma Kohut Freiberg Uiversity of Miig ad Techology Istitute of Computer Sciece D-09596 Freiberg, Germay e-mails: steib@iformatik.tu-freiberg.de

More information

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved.

Chapter 1. Introduction to Computers and C++ Programming. Copyright 2015 Pearson Education, Ltd.. All rights reserved. Chapter 1 Itroductio to Computers ad C++ Programmig Copyright 2015 Pearso Educatio, Ltd.. All rights reserved. Overview 1.1 Computer Systems 1.2 Programmig ad Problem Solvig 1.3 Itroductio to C++ 1.4 Testig

More information

Image Segmentation EEE 508

Image Segmentation EEE 508 Image Segmetatio Objective: to determie (etract) object boudaries. It is a process of partitioig a image ito distict regios by groupig together eighborig piels based o some predefied similarity criterio.

More information

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5

Morgan Kaufmann Publishers 26 February, COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 5 Morga Kaufma Publishers 26 February, 28 COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 5 Set-Associative Cache Architecture Performace Summary Whe CPU performace icreases:

More information

Parallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware

Parallel Polygon Approximation Algorithm Targeted at Reconfigurable Multi-Ring Hardware Parallel Polygo Approximatio Algorithm Targeted at Recofigurable Multi-Rig Hardware M. Arif Wai* ad Hamid R. Arabia** *Califoria State Uiversity Bakersfield, Califoria, USA **Uiversity of Georgia, Georgia,

More information

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 12: Virtual Memory. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 12: Virtual Memory Prof. Yajig Li Uiversity of Chicago A System with Physical Memory Oly Examples: most Cray machies early PCs Memory early all embedded systems

More information

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved.

Chapter 11. Friends, Overloaded Operators, and Arrays in Classes. Copyright 2014 Pearson Addison-Wesley. All rights reserved. Chapter 11 Frieds, Overloaded Operators, ad Arrays i Classes Copyright 2014 Pearso Addiso-Wesley. All rights reserved. Overview 11.1 Fried Fuctios 11.2 Overloadig Operators 11.3 Arrays ad Classes 11.4

More information

Computer Graphics Hardware An Overview

Computer Graphics Hardware An Overview Computer Graphics Hardware A Overview Graphics System Moitor Iput devices CPU/Memory GPU Raster Graphics System Raster: A array of picture elemets Based o raster-sca TV techology The scree (ad a picture)

More information

Vision & Perception. Simple model: simple reflectance/illumination model. image: x(n 1,n 2 )=i(n 1,n 2 )r(n 1,n 2 ) 0 < r(n 1,n 2 ) < 1

Vision & Perception. Simple model: simple reflectance/illumination model. image: x(n 1,n 2 )=i(n 1,n 2 )r(n 1,n 2 ) 0 < r(n 1,n 2 ) < 1 Visio & Perceptio Simple model: simple reflectace/illumiatio model Eye illumiatio source i( 1, 2 ) image: x( 1, 2 )=i( 1, 2 )r( 1, 2 ) reflectace term r( 1, 2 ) where 0 < i( 1, 2 ) < 0 < r( 1, 2 ) < 1

More information

Pattern Recognition Systems Lab 1 Least Mean Squares

Pattern Recognition Systems Lab 1 Least Mean Squares Patter Recogitio Systems Lab 1 Least Mea Squares 1. Objectives This laboratory work itroduces the OpeCV-based framework used throughout the course. I this assigmet a lie is fitted to a set of poits usig

More information

Design of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT

Design of Efficient Pipelined Radix-2 2 Single Path Delay Feedback FFT IOSR Joural of VLSI ad Sigal Processig IOSR-JVSP Volume Issue Ver. I May-Ju. 0 PP 88-9 e-iss: 9 00 p-iss o. : 9 97 www.iosrjourals.org Desig of Efficiet Pipelied Radi- Sigle Path Delay Feedbac FFT isha

More information

Isn t It Time You Got Faster, Quicker?

Isn t It Time You Got Faster, Quicker? Is t It Time You Got Faster, Quicker? AltiVec Techology At-a-Glace OVERVIEW Motorola s advaced AltiVec techology is desiged to eable host processors compatible with the PowerPC istructio-set architecture

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045 Oe Brookigs Drive St. Louis, Missouri 63130-4899, USA jaegerg@cse.wustl.edu

More information

Dynamic Programming and Curve Fitting Based Road Boundary Detection

Dynamic Programming and Curve Fitting Based Road Boundary Detection Dyamic Programmig ad Curve Fittig Based Road Boudary Detectio SHYAM PRASAD ADHIKARI, HYONGSUK KIM, Divisio of Electroics ad Iformatio Egieerig Chobuk Natioal Uiversity 664-4 Ga Deokji-Dog Jeoju-City Jeobuk

More information

IMP: Superposer Integrated Morphometrics Package Superposition Tool

IMP: Superposer Integrated Morphometrics Package Superposition Tool IMP: Superposer Itegrated Morphometrics Package Superpositio Tool Programmig by: David Lieber ( 03) Caisius College 200 Mai St. Buffalo, NY 4208 Cocept by: H. David Sheets, Dept. of Physics, Caisius College

More information

GE FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III

GE FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III GE2112 - FUNDAMENTALS OF COMPUTING AND PROGRAMMING UNIT III PROBLEM SOLVING AND OFFICE APPLICATION SOFTWARE Plaig the Computer Program Purpose Algorithm Flow Charts Pseudocode -Applicatio Software Packages-

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19

CIS 121 Data Structures and Algorithms with Java Spring Stacks, Queues, and Heaps Monday, February 18 / Tuesday, February 19 CIS Data Structures ad Algorithms with Java Sprig 09 Stacks, Queues, ad Heaps Moday, February 8 / Tuesday, February 9 Stacks ad Queues Recall the stack ad queue ADTs (abstract data types from lecture.

More information

FEATURE BASED RECOGNITION OF TRAFFIC VIDEO STREAMS FOR ONLINE ROUTE TRACING

FEATURE BASED RECOGNITION OF TRAFFIC VIDEO STREAMS FOR ONLINE ROUTE TRACING FEATURE BASED RECOGNITION OF TRAFFIC VIDEO STREAMS FOR ONLINE ROUTE TRACING Christoph Busch, Ralf Dörer, Christia Freytag, Heike Ziegler Frauhofer Istitute for Computer Graphics, Computer Graphics Ceter

More information

Chapter 4 The Datapath

Chapter 4 The Datapath The Ageda Chapter 4 The Datapath Based o slides McGraw-Hill Additioal material 24/25/26 Lewis/Marti Additioal material 28 Roth Additioal material 2 Taylor Additioal material 2 Farmer Tae the elemets that

More information

Structuring Redundancy for Fault Tolerance. CSE 598D: Fault Tolerant Software

Structuring Redundancy for Fault Tolerance. CSE 598D: Fault Tolerant Software Structurig Redudacy for Fault Tolerace CSE 598D: Fault Tolerat Software What do we wat to achieve? Versios Damage Assessmet Versio 1 Error Detectio Iputs Versio 2 Voter Outputs State Restoratio Cotiued

More information

Alpha Individual Solutions MAΘ National Convention 2013

Alpha Individual Solutions MAΘ National Convention 2013 Alpha Idividual Solutios MAΘ Natioal Covetio 0 Aswers:. D. A. C 4. D 5. C 6. B 7. A 8. C 9. D 0. B. B. A. D 4. C 5. A 6. C 7. B 8. A 9. A 0. C. E. B. D 4. C 5. A 6. D 7. B 8. C 9. D 0. B TB. 570 TB. 5

More information

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria.

Computer Science Foundation Exam. August 12, Computer Science. Section 1A. No Calculators! KEY. Solutions and Grading Criteria. Computer Sciece Foudatio Exam August, 005 Computer Sciece Sectio A No Calculators! Name: SSN: KEY Solutios ad Gradig Criteria Score: 50 I this sectio of the exam, there are four (4) problems. You must

More information

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting)

MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fitting) MATHEMATICAL METHODS OF ANALYSIS AND EXPERIMENTAL DATA PROCESSING (Or Methods of Curve Fittig) I this chapter, we will eamie some methods of aalysis ad data processig; data obtaied as a result of a give

More information

EE University of Minnesota. Midterm Exam #1. Prof. Matthew O'Keefe TA: Eric Seppanen. Department of Electrical and Computer Engineering

EE University of Minnesota. Midterm Exam #1. Prof. Matthew O'Keefe TA: Eric Seppanen. Department of Electrical and Computer Engineering EE 4363 1 Uiversity of Miesota Midterm Exam #1 Prof. Matthew O'Keefe TA: Eric Seppae Departmet of Electrical ad Computer Egieerig Uiversity of Miesota Twi Cities Campus EE 4363 Itroductio to Microprocessors

More information

Improving Template Based Spike Detection

Improving Template Based Spike Detection Improvig Template Based Spike Detectio Kirk Smith, Member - IEEE Portlad State Uiversity petra@ee.pdx.edu Abstract Template matchig algorithms like SSE, Covolutio ad Maximum Likelihood are well kow for

More information

RESEARCH ON AUTOMATIC INSPECTION TECHNIQUE OF REAL-TIME RADIOGRAPHY FOR TURBINE-BLADE

RESEARCH ON AUTOMATIC INSPECTION TECHNIQUE OF REAL-TIME RADIOGRAPHY FOR TURBINE-BLADE RESEARCH ON AUTOMATIC INSPECTION TECHNIQUE OF REAL-TIME RADIOGRAPHY FOR TURBINE-BLADE Z.G. Zhou, S. Zhao, ad Z.G. A School of Mechaical Egieerig ad Automatio, Beijig Uiversity of Aeroautics ad Astroautics,

More information

Big-O Analysis. Asymptotics

Big-O Analysis. Asymptotics Big-O Aalysis 1 Defiitio: Suppose that f() ad g() are oegative fuctios of. The we say that f() is O(g()) provided that there are costats C > 0 ad N > 0 such that for all > N, f() Cg(). Big-O expresses

More information

Lecture 5. Counting Sort / Radix Sort

Lecture 5. Counting Sort / Radix Sort Lecture 5. Coutig Sort / Radix Sort T. H. Corme, C. E. Leiserso ad R. L. Rivest Itroductio to Algorithms, 3rd Editio, MIT Press, 2009 Sugkyukwa Uiversity Hyuseug Choo choo@skku.edu Copyright 2000-2018

More information

3D Model Retrieval Method Based on Sample Prediction

3D Model Retrieval Method Based on Sample Prediction 20 Iteratioal Coferece o Computer Commuicatio ad Maagemet Proc.of CSIT vol.5 (20) (20) IACSIT Press, Sigapore 3D Model Retrieval Method Based o Sample Predictio Qigche Zhag, Ya Tag* School of Computer

More information

An Efficient Algorithm for Graph Bisection of Triangularizations

An Efficient Algorithm for Graph Bisection of Triangularizations Applied Mathematical Scieces, Vol. 1, 2007, o. 25, 1203-1215 A Efficiet Algorithm for Graph Bisectio of Triagularizatios Gerold Jäger Departmet of Computer Sciece Washigto Uiversity Campus Box 1045, Oe

More information

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Single-Cycle Disadvantages & Advantages

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. Chapter 4. The Processor. Single-Cycle Disadvantages & Advantages COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Iterface 5 th Editio Chapter 4 The Processor Pipeliig Sigle-Cycle Disadvatages & Advatages Clk Uses the clock cycle iefficietly the clock cycle must

More information

Algorithms for Disk Covering Problems with the Most Points

Algorithms for Disk Covering Problems with the Most Points Algorithms for Disk Coverig Problems with the Most Poits Bi Xiao Departmet of Computig Hog Kog Polytechic Uiversity Hug Hom, Kowloo, Hog Kog csbxiao@comp.polyu.edu.hk Qigfeg Zhuge, Yi He, Zili Shao, Edwi

More information

Evaluation scheme for Tracking in AMI

Evaluation scheme for Tracking in AMI A M I C o m m u i c a t i o A U G M E N T E D M U L T I - P A R T Y I N T E R A C T I O N http://www.amiproject.org/ Evaluatio scheme for Trackig i AMI S. Schreiber a D. Gatica-Perez b AMI WP4 Trackig:

More information

k (check node degree) and j (variable node degree)

k (check node degree) and j (variable node degree) A Parallel Turbo Decodig Message Passig Architecture for Array LDPC Codes Kira Guam, Pakaj Bhagawat, Weihuag Wag, Gwa Choi, Mark Yeary * Dept. of Electrical Egieerig, Texas A&M Uiversity, College Statio,

More information

Lower Bounds for Sorting

Lower Bounds for Sorting Liear Sortig Topics Covered: Lower Bouds for Sortig Coutig Sort Radix Sort Bucket Sort Lower Bouds for Sortig Compariso vs. o-compariso sortig Decisio tree model Worst case lower boud Compariso Sortig

More information

Introduction CHAPTER Computers

Introduction CHAPTER Computers Iside a Computer CHAPTER Itroductio. Computers A computer is a electroic device that accepts iput, stores data, processes data accordig to a set of istructios (called program), ad produces output i desired

More information

CMSC Computer Architecture Lecture 2: ISA. Prof. Yanjing Li Department of Computer Science University of Chicago

CMSC Computer Architecture Lecture 2: ISA. Prof. Yanjing Li Department of Computer Science University of Chicago CMSC 22200 Computer Architecture Lecture 2: ISA Prof. Yajig Li Departmet of Computer Sciece Uiversity of Chicago Admiistrative Stuff Lab1 out toight Due Thursday (10/18) Lab1 review sessio Tomorrow, 10/05,

More information

Outline. Applications of FFT in Communications. Fundamental FFT Algorithms. FFT Circuit Design Architectures. Conclusions

Outline. Applications of FFT in Communications. Fundamental FFT Algorithms. FFT Circuit Design Architectures. Conclusions FFT Circuit Desig Outlie Applicatios of FFT i Commuicatios Fudametal FFT Algorithms FFT Circuit Desig Architectures Coclusios DAB Receiver Tuer OFDM Demodulator Chael Decoder Mpeg Audio Decoder 56/5/ 4/48

More information

Computer Systems - HS

Computer Systems - HS What have we leared so far? Computer Systems High Level ENGG1203 2d Semester, 2017-18 Applicatios Sigals Systems & Cotrol Systems Computer & Embedded Systems Digital Logic Combiatioal Logic Sequetial Logic

More information

Efficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO

Efficient Hardware Design for Implementation of Matrix Multiplication by using PPI-SO Efficiet Hardware Desig for Implemetatio of Matrix Multiplicatio by usig PPI-SO Shivagi Tiwari, Niti Meea Dept. of EC, IES College of Techology, Bhopal, Idia Assistat Professor, Dept. of EC, IES College

More information

CMSC Computer Architecture Lecture 11: More Caches. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 11: More Caches. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 11: More Caches Prof. Yajig Li Uiversity of Chicago Lecture Outlie Caches 2 Review Memory hierarchy Cache basics Locality priciples Spatial ad temporal How to access

More information

Lecture 1: Introduction and Strassen s Algorithm

Lecture 1: Introduction and Strassen s Algorithm 5-750: Graduate Algorithms Jauary 7, 08 Lecture : Itroductio ad Strasse s Algorithm Lecturer: Gary Miller Scribe: Robert Parker Itroductio Machie models I this class, we will primarily use the Radom Access

More information

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve

Analysis of Server Resource Consumption of Meteorological Satellite Application System Based on Contour Curve Advaces i Computer, Sigals ad Systems (2018) 2: 19-25 Clausius Scietific Press, Caada Aalysis of Server Resource Cosumptio of Meteorological Satellite Applicatio System Based o Cotour Curve Xiagag Zhao

More information

The isoperimetric problem on the hypercube

The isoperimetric problem on the hypercube The isoperimetric problem o the hypercube Prepared by: Steve Butler November 2, 2005 1 The isoperimetric problem We will cosider the -dimesioal hypercube Q Recall that the hypercube Q is a graph whose

More information

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13

CIS 121 Data Structures and Algorithms with Java Spring Stacks and Queues Monday, February 12 / Tuesday, February 13 CIS Data Structures ad Algorithms with Java Sprig 08 Stacks ad Queues Moday, February / Tuesday, February Learig Goals Durig this lab, you will: Review stacks ad queues. Lear amortized ruig time aalysis

More information

Stone Images Retrieval Based on Color Histogram

Stone Images Retrieval Based on Color Histogram Stoe Images Retrieval Based o Color Histogram Qiag Zhao, Jie Yag, Jigyi Yag, Hogxig Liu School of Iformatio Egieerig, Wuha Uiversity of Techology Wuha, Chia Abstract Stoe images color features are chose

More information

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb

( n+1 2 ) , position=(7+1)/2 =4,(median is observation #4) Median=10lb Chapter 3 Descriptive Measures Measures of Ceter (Cetral Tedecy) These measures will tell us where is the ceter of our data or where most typical value of a data set lies Mode the value that occurs most

More information

CMSC Computer Architecture Lecture 5: Pipelining. Prof. Yanjing Li University of Chicago

CMSC Computer Architecture Lecture 5: Pipelining. Prof. Yanjing Li University of Chicago CMSC 22200 Computer Architecture Lecture 5: Pipeliig Prof. Yajig Li Uiversity of Chicago Admiistrative Stuff Lab1 Due toight Lab2: out later today; due 2 weeks from ow Review sessio this Friday Turig award

More information

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence

9.1. Sequences and Series. Sequences. What you should learn. Why you should learn it. Definition of Sequence _9.qxd // : AM Page Chapter 9 Sequeces, Series, ad Probability 9. Sequeces ad Series What you should lear Use sequece otatio to write the terms of sequeces. Use factorial otatio. Use summatio otatio to

More information

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP

Introduction. Nature-Inspired Computing. Terminology. Problem Types. Constraint Satisfaction Problems - CSP. Free Optimization Problem - FOP Nature-Ispired Computig Hadlig Costraits Dr. Şima Uyar September 2006 Itroductio may practical problems are costraied ot all combiatios of variable values represet valid solutios feasible solutios ifeasible

More information

Lecture 28: Data Link Layer

Lecture 28: Data Link Layer Automatic Repeat Request (ARQ) 2. Go ack N ARQ Although the Stop ad Wait ARQ is very simple, you ca easily show that it has very the low efficiecy. The low efficiecy comes from the fact that the trasmittig

More information

An Efficient Implementation Method of Fractal Image Compression on Dynamically Reconfigurable Architecture

An Efficient Implementation Method of Fractal Image Compression on Dynamically Reconfigurable Architecture A Efficiet Implemetatio Method of Fractal Image Compressio o Dyamically Recofigurable Architecture Hidehisa Nagao, Akihiro Matsuura, ad Akira Nagoya NTT Commuicatio Sciece Laboratories 2-4 Hikaridai, Seika-cho,

More information

ANN WHICH COVERS MLP AND RBF

ANN WHICH COVERS MLP AND RBF ANN WHICH COVERS MLP AND RBF Josef Boští, Jaromír Kual Faculty of Nuclear Scieces ad Physical Egieerig, CTU i Prague Departmet of Software Egieerig Abstract Two basic types of artificial eural etwors Multi

More information

An Estimation of Distribution Algorithm for solving the Knapsack problem

An Estimation of Distribution Algorithm for solving the Knapsack problem Vol.4,No.5, 214 Published olie: May 25, 214 DOI: 1.7321/jscse.v4.5.1 A Estimatio of Distributio Algorithm for solvig the Kapsack problem 1 Ricardo Pérez, 2 S. Jös, 3 Arturo Herádez, 4 Carlos A. Ochoa *1,

More information

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c

Harris Corner Detection Algorithm at Sub-pixel Level and Its Application Yuanfeng Han a, Peijiang Chen b * and Tian Meng c Iteratioal Coferece o Computatioal Sciece ad Egieerig (ICCSE 015) Harris Corer Detectio Algorithm at Sub-pixel Level ad Its Applicatio Yuafeg Ha a, Peijiag Che b * ad Tia Meg c School of Automobile, Liyi

More information

Neuro Fuzzy Model for Human Face Expression Recognition

Neuro Fuzzy Model for Human Face Expression Recognition IOSR Joural of Computer Egieerig (IOSRJCE) ISSN : 2278-0661 Volume 1, Issue 2 (May-Jue 2012), PP 01-06 Neuro Fuzzy Model for Huma Face Expressio Recogitio Mr. Mayur S. Burage 1, Prof. S. V. Dhopte 2 1

More information

Automatic Generation of Polynomial-Basis Multipliers in GF (2 n ) using Recursive VHDL

Automatic Generation of Polynomial-Basis Multipliers in GF (2 n ) using Recursive VHDL Automatic Geeratio of Polyomial-Basis Multipliers i GF (2 ) usig Recursive VHDL J. Nelso, G. Lai, A. Teca Abstract Multiplicatio i GF (2 ) is very commoly used i the fields of cryptography ad error correctig

More information

Xiaozhou (Steve) Li, Atri Rudra, Ram Swaminathan. HP Laboratories HPL Keyword(s): graph coloring; hardness of approximation

Xiaozhou (Steve) Li, Atri Rudra, Ram Swaminathan. HP Laboratories HPL Keyword(s): graph coloring; hardness of approximation Flexible Colorig Xiaozhou (Steve) Li, Atri Rudra, Ram Swamiatha HP Laboratories HPL-2010-177 Keyword(s): graph colorig; hardess of approximatio Abstract: Motivated b y reliability cosideratios i data deduplicatio

More information