High-level synthesis under I/O Timing and Memory constraints

Size: px
Start display at page:

Download "High-level synthesis under I/O Timing and Memory constraints"

Transcription

1 Highlevel synthesis under I/O Timing and Memory onstraints Philippe Coussy, Gwenolé Corre, Pierre Bomel, Eri Senn, Eri Martin To ite this version: Philippe Coussy, Gwenolé Corre, Pierre Bomel, Eri Senn, Eri Martin. Highlevel synthesis under I/O Timing and Memory onstraints. IEEE. 2005, IEEE, pp , <hal > HAL Id: hal Submitted on 30 May 2006 HAL is a multidisiplinary open aess arhive for the deposit and dissemination of sientifi researh douments, whether they are published or not. The douments may ome from teahing and researh institutions in Frane or abroad, or from publi or private researh enters. L arhive ouverte pluridisiplinaire HAL, est destinée au dépôt et à la diffusion de douments sientifiques de niveau reherhe, publiés ou non, émanant des établissements d enseignement et de reherhe français ou étrangers, des laboratoires publis ou privés.

2 Highlevel synthesis under I/O Timing and Memory onstraints Philippe Coussy, Gwenole Corre, Pierre Bomel, Eri Senn, Eri Martin LESTER LAB, UBS University, CNRS FRE 2734 Abstrat The design of omplex SystemsonChips implies to take into aount ommuniation and memory aess onstraints for the integration of dediated hardware aelerator. In this paper, we present a methodology and a tool that allow the HighLevel Synthesis of DSP algorithm, under both I/O timing and memory onstraints. Based on formal models and a generi arhiteture, this tool helps the designer to find a reasonable tradeoff between both the required I/O timing behavior and the internal memory aess parallelism of the iruit. The interest of our approah is demonstrated on the ase study of a FFT algorithm. I. INTRODUCTION Eletroni design omplexity has inreased hugely sine the birth of integrated iruits. System level tehnologies, over reent years, have moved from Appliation Speifi Integrated Ciruits (ASICs) and Appliation Speifi Signal Proessors (ASSPs) to omplete SystemOnChip (SoC) designs. This inrement in the hip omplexity requires an equivalent shift in the design methodology and a more diret path from the funtionality down to the silion. In [13], the authors propose system synthesis approahes where the algorithms of the funtional speifiation orrespond to predesigned omponents in a library. Maro generators produe the RTL arhiteture for hardware bloks by using the generi / generate VHDL mehanisms: the synthesis proess an hene be summarized as a blok instantiation. However, though suh omponents may be parameterizable, they rely on fixed arhitetural models with very restrited ustomization apabilities. This lak of flexibility in RTL bloks is espeially true for both the ommuniation unit, whih I/O sheduling and/or I/O timing requirements are defined, and the memory unit, whih data distribution is set. HighLevel Synthesis (HLS) an be used to redue this lak of flexibility. For example, SystemC Compiler [4] from Synopsys, and Monet from Mentor Graphis, propose a set of I/O sheduling modes (ylefixed, superstate, freefloating) that allow to target alternative arhitetural solutions. Communiation is speified using wait statements and is mixed with the omputation speifiation what limits the flexibility of the input behavioral desription. In these two tools, memory aesses are represented as multiyle operations in a Control and Data Flow Graph (CDFG). Memory verties are sheduled as operative verties by onsidering onflits among data aesses. In pratie, the number of nodes in their input speifiations must be limited to obtain a realisti and satisfying arhitetural solution. This limitation is mainly due to the omplexity of the algorithms that are used for the sheduling. Only a few works really shedule the memory aesses [5], [6]. They inlude preise temporal models of those aesses, and try to improve performanes without onsidering the possibility of simultaneous aesses that would ease the subsequent task of register and memory alloation. In the domain of realtime and dataintensive appliations, proessing resoures have to deal with ever growing data streams. The system/arhiteture design has therefore to fous on avoiding bottleneks in the buses and I/O buffers for datatransfer, while reduing the ost of data storage and satisfying strit timing onstraints and highdata rates. The methodology that an permit suh a design must rely on (1) onstraint modeling for both I/O timing and internal data memory, (2) onstraint analysis for feasibility heking and (3) highlevel synthesis. In [7] and [8], we proposed a methodology for SoC design that is based on the reusing of algorithmi desription. Our approah is based on highlevel synthesis tehniques under I/O timing onstraints and aims to optimally design the orresponding omponent by taking into aount the system integration onstraints: the data rate, the tehnology, and I/O timing properties. In [9], we have introdued a new approah to take into aount the memory arhiteture and the memory mapping in the behavioral synthesis of realtime VLSI iruits. A memorymapping file was used to inlude those memory onstraints in our HLS tool GAUT [10]. In this paper, we propose a design flow based on formal models that allow highlevel synthesis under both I/O timing and memory onstraints for digital signal proessing algorithms. DSP systems designers speify the I/O timing, the omputation lateny, the memory distribution and the appliation s data rate requirements that are the onstraints for the synthesis of the hardware omponents. This paper is organized as follows: In setion 2 we formulate the problem of synthesis under I/O timing and memory onstraints. Setion 3 presents the main steps of our approah, and its underlying formal models. In setion 4, we demonstrate the effiieny of our approah with the didati example of the Fast Fourier Transform (FFT). II. PROBLEM FORMULATION In this setion, we illustrate the interdependeny between the aess parallelism to memory and the timing performanes as well as the influene of these two parameters on the resulting omponent arhiteture. Let us onsider a hardware omponent based on a generi arhiteture omposed of two main funtional units: one memory unit MU and one proessing unit PU. Suppose the omputation proessed to be = (av1 v3) (bv2v4) where v1, v2, v3 and v4 are variables values stored in memory. Fig. 1(a) shows the Signal Flow Graph (SFG) of this algorithm. This omponent reeives input data a and b from the environment through an input port and sends its result on the output port. All the data used and produed by the proessing unit are respetively read and written in a fixed order

3 Reg1 a b a v1 b v2 Reg2 v1 v2 Reg3 x1 v3 x1 v4 x2 Reg4 x2 Reg5 v3 v4 Reg6 y1 x3 x4 Reg7 y2 Reg8 Lateny (a) (b) Fig. 1: (a) Signal Flow Graph SFG, (b) Timing behavior, sequene S =(a,b,): i.e. t a <t b < t. The read sequene of two variables v1 and v2 is ompletely deterministi i.e.: t v1 < t v2. with t v1 = t a and t v2 = t b. However, a sheduling hoie is needed to aess data v3 and v4 sine a single memory bank is available in the omponent. In our example, we hoose to aess v3 before v4. In this ontext, the minimum lateny is therefore equal to 5 yles (Fig. 1(b)). Fig. 2 presents a possible orresponding arhiteture of the proessing unit that inludes 1 multiplier, 1 adder, 1 substrator and 8 registers. A B Memory Unit reg2 reg1 v1, v2, v3, v4 reg3 Proessing unit reg6 reg7 Fig. 2: Sequential arhiteture reg8 Let us now onsider the following data transfer sequene S busses = (a b, ): i.e. t a =t b < t. If the lateny required to produe the result is long enough ( 5 yles) to allow a reordering (serialization) of input data a and b, then the previously designed arhiteture inluding one memory bank an be reused. However, this solution need to design an input wrapper omposed of 1 register, 1 multiplexer and 1 ontroller. If the required lateny is not long enough (i.e. = 3 yles), the designer must design a new omponent inluding 2 multipliers, 2 adders, 11 registers and 2 memory banks (see Fig. 3). In suh a ase, beause of their restrited ustomization apabilities, neither a predesigned omponent nor a maro generator would be flexible enough to respond to the new design onstraints. A B Memory Unit #1 v1 v3 reg2 reg1 reg12 reg11 reg3 v2 v4 Memory Unit #2 Proessing unit reg7 reg8 Fig. 3: Parallel arhiteture As stated before, a new design flow, based on synthesis under onstraints, is needed to get flexibility and to make the DSP C S t omponent design easier. This inludes (1) modeling styles to represent I/O timing and memory onstraints, (2) analysis steps to hek the feasibility of the onstraints (3) methods and tehniques for optimal synthesis. III. DESIGN APPROACH OVERVIEW The input of our HLS tool [10] is an algorithmi desription that speifies the funtionality disregarding implementation details. This initial desription is ompiled to obtain an intermediate representation: the Signal Flow Graph SFG (see Fig. 4). A. Timing Constraint Graph In a first step, we generate an Algorithmi Constraint Graph ACG from the operator latenies and the data dependenies expressed in the SFG. The latenies of the operators are assigned to operation verties of the ACG during the operator s seletion step in the behavioral synthesis flow. Starting from the system desription and its arhitetural model, the integrator, for eah bus or port that onnets the omponent to others in the SoC, speifies I/O rates, data sequene orders and transfer timing information. We defined a formal model named IOCG (IO Constraint Graph) that supports the expression of integration onstraints for eah bus (id. port) of the omponent. Finally we generate a Global Constraint Graph (GCG) by merging the ACG with the IOCG graph. Merging is done by mapping the verties and assoiated onstraints of IOCG onto the input and output verties set of ACG. A minimum timing onstraint on output verties (earliest date for data transfer) of the IOCG are transformed into the GCG in maximum timing onstraints (latest date for data omputation/prodution). After having desribed the behavior of the omponent and the design onstraints in a formal model, we analyze the feasibility between the appliation rate and the data dependenies of the algorithm, in funtion of the tehnologial onstraints. We analyze the I/O timing speifiations aording to the algorithmi ones: we hek if the required onstraints on output data are always verified with the behavior speified for input data. The entry point of the IP ore design task is the global onstraint graph GCG. B. Memory Constraint Graph As outlined in the previous subsetion, A Signal Flow Graph (SFG) is first generated from the algorithmi speifiation. A Memory Constraint Graph is a yli direted polar graph MCG(V',E',W') where V'={v'0,..., v'n} is the set of data verties plaed in memory. A memory Constraint Graph ontains V' =n1 verties whih represent the memory size, in term of memory elements. The set of edges E'=(v'i, v'j) represents possible onseutive memory aesses, and W' is a funtion that represents the aess delay between two data nodes. W' has only two possible values: Wseq (sequential) for an adjaent memory aess in memory, or Wrand (randomize) for a non adjaent memory aess. In our approah, this SFG is parsed and a memory table is reated. All data verties are extrated from the SFG to onstrut the memory table. The designer an hoose the data to be plaed in memory and defines a memory mapping. For every memory in the memory table, we onstrut a weighted Memory Constraint Graph (MCG). It represents onflits and sheduling possibilities between all nodes plaed in this memory. The MCG is onstruted from the SFG and the memory mapping file. It will be used during the sheduling proess.

4 operators Memory plaement & variable distribution MCG Algorithm Compilation SFG Seletion ACG Merging GCG Analysis IOCG Synthesis under I/O timing and Memory onstraints RTL Fig. 4: Proposed Synthesis Flow I/O refinement Fig. 6(b) shows a MCG for the presented example with one simple port memory bank. The variable data v1, v2, v3 and v4 are plaed onseutively in one bank. Dotted edges represent sequential aesses (two adjaent memory addresses) and plain edges represent random aesses (nonadjaent addresses). Further information about the formal models and the memory design an be found in [7], [8], [9]. C. Sheduling under I/O and Memory Constraints The lassial list sheduling algorithm relies on heuristis in whih ready operations (operations to be sheduled) are listed by priority order. In our tool, an early sheduling is performed on the GCG. In this sheduling, the priority funtion depends on the mobility riterion. For operations that have the same mobility, the priority is defined using the operation margin. Next, operations are sheduled and bind to operators (see Fig. 5). Sheduling_Funtion 1) Operation_Mobility_omputing(GCG) 2) For (time = 0; time < End; time = time t_yle) 3) List = Operation_Priority_listing(GCG) 4) Ready_Ops = Find_shedulable_operation(List, time) 5) Binding(Ready_Ops, operators_set, MCG, time) 6) End for Binding Funtion 1) While (Ready_Ops!= NULL) 2) Ops_low_mobility = Get_first(Ready_Ops) 3) if(op_low_mobility>margin > 0) 4) If(Find_mem_onfli(MCG, Ops_low_mobility) = FALSE) 5) If(operators_set!= NULL) 6) Ops_Binding(sh_list, operator) 7) else //no opr or mem onflit 8) Posponed(Ops_low_mobility) 9) else // margin = 0 10) If(Find_mem_onfli(MCG, Ops_low_mobility) = FALSE) 11) Operator_retation() 12) Ops_Binding(sh_list, operator) 13) else 14) Exit(yle, operator, operation, memory bank, ) 15) end if 16) End while nok Fig. 5: Pseudo ode of the sheduling algorithm An operation an be sheduled if the urrent yle is greater than the ASAP time. Whenever two ready operations need to aess the same resoure (this is a soalled resoure onflit), the operation with the lower mobility has the highest priority and is sheduled. The other is postponed. When the mobility is equal to zero, one new operator is alloated to this operation. To perform a sheduling under memory onstraint, we introdue memory aess operators and add an aessibility riterion based on the MCG. A memory has as muh aess operators as aess ports. The list of ready operations is still organised aording to the mobility riterion, but all the operations that do not math the aessibility ondition are removed from this list. Hene, when the mobility is equal to zero, the synthesis proess exits and the designer have to target an alternative solution for the omponent arhiteture by reviewing the memory mapping and/or modifying some ommuniation features. Our sheduling tehnique is illustrated in Fig. 6 using the previously presented example where the timing onstraints are now the following: S =(a b,) i.e. t a = t b < t. The memory table (Fig. 6(a)) is extrated from the SFG and is used by the designer to define the memory mapping. Internal data v1, v2, v3 and v4 are respetively plaed @2 in the bank0. Our tool onstruts one Memory Constraint Graph MCG (Fig. 6(b)). In addition to the mapping onstraint the designer also speifies two lateny Lat1=5 yles and Lat2=3yles. For lateny Lat1, the sequential aess sequene is v1 v2 v3 v4 : it inludes 3 dotted edges (with weight Wseq). To deal with the memory bank aess onflits, we define a table of aesses for eah port of a memory bank. In our example, the table has only one line for the single memory bank0. The table of memory aess has Data_rate / Sequential_aess_time elements. The value of eah element of the table indiates if a memory aess operator is idle or not at the urrent time (ontrol step _step). We use the MCG to produe a sheduling that permits to aess the memory in burst mode. If two operations have the same priority ( margin = Lat1T()T() = 1 yles) and request the same memory bank, the operation that is sheduled is the operation that involves an aess at an address that follows the preeding aess. For example, multipliation operation (av1) and (bv2) have the same mobility. At _step s_1, they are both exeutable and the both operands v1 and v2 are stored in bank0. MCG_1 indiates that the sequene v1 v2 is shorter than v2 v1. We then shedule (av1) at _step s_1 and (bv2) at _step s_2 to favour the sequential aess (see Fig. 6 ()). At _step s_3, addition (x1v3) and (x2v4) have the same mobility, the MCG indiates that sequene v2 v3 is shorter than v2 v4. Addition (x1v3) is sheduled at _step s_3 and (x2v4) at _step s_3. v1 0 0 v2 0 1 v3 0 2 v4 0 3 v2 v1 v3 v4 v1 a v3 x1 v2 b v4 (a)memory (b) MCG () Sheduling Table Fig. 6: Sheduling under I/O timing and lateny onstraint For lateny Lat2, multipliation operation (av1) and (bv2) have the same mobility that is null. Both operations must then be sheduled in _step s_1. Beause of the memory aess onflit, there is no solution to the sheduling problem: the designer has hene to review its design onstraints. He an target an alternative solution by adding one memory bank or by inreasing the omputing lateny. x2

5 IV. EXPERIMENTAL RESULTS We desribed in the two previous setions our synthesis design flow and the sheduling under I/O timing and memory onstraints. We present now the results of synthesis under onstraints obtained using the HLS tool GAUT [10]. The algorithm used for this experiene is a Fast Fourier Transform (FFT). This FFT reads 128 real input Xr(k) and produes the output Y(k) omposed of two parts: one real Yr(k) and one imaginary Yi(k). The SFG inludes edges and 8451 verties. Several syntheses have been realized using a 200MHz lok frequeny and a tehnologial library in whih the multiplier lateny is 2 yles and the lateny of the adder and the subtrator is 1 yle. A. Experiment 1: Synthesis under I/O timing onstraints In this first experiment we synthesized the FFT omponent under I/O timing onstraints and analyzed the requirements on memory banks. In order to generate a global onstraint graph GCG, minimum and maximum timing onstraints have been introdued between I/O verties of the ACG graph using the IOCG model. The FFT lateny is defined by a maximum timing onstraint between the first input and the first output verties. The speified lateny (that is the shortest one aording to the data dependenies and the operator latenies) orresponds to a 261 yles delay. The FFT omponent is onstrained to read one Xr sample and to produe one Y sample every yle. The resulting FFT omponent ontains 20 multipliers, 8 adders and 10 subtrators (see Exp#1 at Table 1). 8 memory banks are required for those I/O timing onstraints. However, the internal oeffiients are mapped in a nonlinear sheme in memory. A large amount of memory bank is needed to get enough parallel aesses to reah the speified lateny. Moreover, oeffiients an possibly be loated in multiple banks what requires the design of a omplex memory unit. B. Experiment 2: Synthesis under memory onstraints In this seond experiment we synthesized a FFT omponent only under memory onstraints. Only the maximal number of onurrent aess to the memory banks limits the minimal lateny. Thus, with a large amount of operators, a lateny equal to the ritial path delay of the SFG ould be obtained. For this reason, we synthesized the FFT with the same number of operators than in the first experiment. Then, we analyzed the requirement on I/O ports and omputation lateny. The memory onstraints are the following: 2 memory banks respeting a simple mapping onstraint: the 128 real oeffiient Wr in bank0 and the 128 imaginary oeffiient Wi in bank1. The shortest lateny imposed by the memory mapping and the number of operators orresponds to a 215 yles delay (Exp#2 at Table 1). This delay is shorter that the delay obtained in the previous experiment. This arhiteture requires 36 input busses and 14 outputs. However, a large amount of busses with nontrivial data ordering (nonlinear data index progression) is needed. If the environment imposes the exhange of data over a smaller number of I/O busses, a ommuniation unit should be designed. This unit would be able to add extra lateny to serialize data. C. Experiment 3: Synthesis under I/O timing and memory onstraints In this last experiment, we synthesized the FFT omponent under both I/O timing and memory onstraints. We kept the memory mapping used for the seond experiment and founded the shortest lateny that allows to respet the I/O rates defined in the first experiment. The resulting arhiteture ontains 17 multipliers, 8 adders and 10 subtrators (see Exp#3 at Table 1). It produes its first result after 343 yles. Memory bank. TABLE 1: SYNTHESIS RESULTS Input busses Output busses Sub. Add. Mult. Lateny (in yle) Exp# Exp# Exp# Beause of both the memory mapping and the I/O onstraints, the lateny is greater than in experiment 1 and 2. However, the arhiteture omplexity is equivalent to the previous ones in term of operators. Hene, it appears that synthesis under both I/O timing and memory onstraints allows to manage both the system s ommuniation and memory, while keeping a reasonable arhiteture omplexity. V. CONCLUSION In this paper, a design methodology for DSP omponent under I/O timing and memory onstraints is presented. This approah, that relies on onstraints modeling, onstraints analysis, and synthesis, helps the designer to effiiently implement omplex appliations. Experimental results in the DSP domain show the interest of the methodology and modeling, that allow tradeoffs between the lateny, I/O rate and memory mapping. We are urrently working on heuristi rules that ould help the designer in exploring more easily different arhitetural solutions, while onsidering memory mapping and I/O timing requirements. ACKNOWLEDGEMENTS These works have been realized within the Frenh RNRT Projet ALIPTA. REFERENCES [1] J. RuizAmaya, and Al., MATLAB/SIMULINKBased High Level Synthesis of DisreteTime and ContinuousTime Σ Modulators, In Pro. of DATE [2] L. Reyneri, F. Cuinotta, A. Serra, and L. Lavagno. A hardware/software odesign flow and IP library based on Simulink, In Pro. of DAC, [3] Codesimulink, [4] H. Ly, D. Knapp, R. Miller, and D. MMillen, Sheduling using behavioral templates, in Pro. Design Automation Conferene DAC'95, June 1995 [5] N. Passos, and al, Multidimensional interleaving for timeandmemory design optimization, in Pro. of ICCD, 1995 [6] A. Niolau and S. Novak, Trailblazing a hierarhial approah to perolation sheduling, in Pro. ICPP'93, 1993, [7] P. Coussy, A. Baganne, E. Martin, " Communiation and Timing Constraints Analysis for IP Design and Integration", In Pro. of IFIP WG 10.5 VLSISOC Conferene, [8] P. Coussy, D. Gnaedig, and al., A Methodology for IP Integration into DSP SoC: A Case Study of a MAP Algorithm for Turbo Deoder, In Pro. of ICASSP, 2004 [9] G. Corre, E. Senn, and al., Memory aesses Management During High Level Synthesis, In Pro. of CODESISSS, [10] GAUT HLS Tool for DSP,

This fact makes it difficult to evaluate the cost function to be minimized

This fact makes it difficult to evaluate the cost function to be minimized RSOURC LLOCTION N SSINMNT In the resoure alloation step the amount of resoures required to exeute the different types of proesses is determined. We will refer to the time interval during whih a proess

More information

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2 On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,

More information

Pipelined Multipliers for Reconfigurable Hardware

Pipelined Multipliers for Reconfigurable Hardware Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,

More information

Outline: Software Design

Outline: Software Design Outline: Software Design. Goals History of software design ideas Design priniples Design methods Life belt or leg iron? (Budgen) Copyright Nany Leveson, Sept. 1999 A Little History... At first, struggling

More information

Folding. Hardware Mapped vs. Time multiplexed. Folding by N (N=folding factor) Node A. Unfolding by J A 1 A J-1. Time multiplexed/microcoded

Folding. Hardware Mapped vs. Time multiplexed. Folding by N (N=folding factor) Node A. Unfolding by J A 1 A J-1. Time multiplexed/microcoded Folding is verse of Unfolding Node A A Folding by N (N=folding fator) Folding A Unfolding by J A A J- Hardware Mapped vs. Time multiplexed l Hardware Mapped vs. Time multiplexed/mirooded FI : y x(n) h

More information

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks A Dual-Hamiltonian-Path-Based Multiasting Strategy for Wormhole-Routed Star Graph Interonnetion Networks Nen-Chung Wang Department of Information and Communiation Engineering Chaoyang University of Tehnology,

More information

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department

More information

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks Abouberine Ould Cheikhna Department of Computer Siene University of Piardie Jules Verne 80039 Amiens Frane Ould.heikhna.abouberine @u-piardie.fr

More information

Reduced-Complexity Column-Layered Decoding and. Implementation for LDPC Codes

Reduced-Complexity Column-Layered Decoding and. Implementation for LDPC Codes Redued-Complexity Column-Layered Deoding and Implementation for LDPC Codes Zhiqiang Cui 1, Zhongfeng Wang 2, Senior Member, IEEE, and Xinmiao Zhang 3 1 Qualomm In., San Diego, CA 92121, USA 2 Broadom Corp.,

More information

Partial Character Decoding for Improved Regular Expression Matching in FPGAs

Partial Character Decoding for Improved Regular Expression Matching in FPGAs Partial Charater Deoding for Improved Regular Expression Mathing in FPGAs Peter Sutton Shool of Information Tehnology and Eletrial Engineering The University of Queensland Brisbane, Queensland, 4072, Australia

More information

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications System-Level Parallelism and hroughput Optimization in Designing Reonfigurable Computing Appliations Esam El-Araby 1, Mohamed aher 1, Kris Gaj 2, arek El-Ghazawi 1, David Caliga 3, and Nikitas Alexandridis

More information

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT 1 ZHANGGUO TANG, 2 HUANZHOU LI, 3 MINGQUAN ZHONG, 4 JIAN ZHANG 1 Institute of Computer Network and Communiation Tehnology,

More information

Alleviating DFT cost using testability driven HLS

Alleviating DFT cost using testability driven HLS Alleviating DFT ost using testability driven HLS M.L.Flottes, R.Pires, B.Rouzeyre Laboratoire d Informatique, de Robotique et de Miroéletronique de Montpellier, U.M. CNRS 5506 6 rue Ada, 34392 Montpellier

More information

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Malaysian Journal of Computer Siene, Vol 10 No 1, June 1997, pp 36-41 A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Md Rafiqul Islam, Harihodin Selamat and Mohd Noor Md Sap Faulty of Computer Siene and

More information

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks International Journal of Advanes in Computer Networks and Its Seurity IJCNS A Load-Balaned Clustering Protool for Hierarhial Wireless Sensor Networks Mehdi Tarhani, Yousef S. Kavian, Saman Siavoshi, Ali

More information

Extracting Partition Statistics from Semistructured Data

Extracting Partition Statistics from Semistructured Data Extrating Partition Statistis from Semistrutured Data John N. Wilson Rihard Gourlay Robert Japp Mathias Neumüller Department of Computer and Information Sienes University of Strathlyde, Glasgow, UK {jnw,rsg,rpj,mathias}@is.strath.a.uk

More information

Test Case Generation from UML State Machines

Test Case Generation from UML State Machines Test Case Generation from UML State Mahines Dirk Seifert To ite this version: Dirk Seifert. Test Case Generation from UML State Mahines. [Researh Report] 2008. HAL Id: inria-00268864

More information

SVC-DASH-M: Scalable Video Coding Dynamic Adaptive Streaming Over HTTP Using Multiple Connections

SVC-DASH-M: Scalable Video Coding Dynamic Adaptive Streaming Over HTTP Using Multiple Connections SVC-DASH-M: Salable Video Coding Dynami Adaptive Streaming Over HTTP Using Multiple Connetions Samar Ibrahim, Ahmed H. Zahran and Mahmoud H. Ismail Department of Eletronis and Eletrial Communiations, Faulty

More information

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Arne Hamann, Razvan Rau, Rolf Ernst Institute of Computer and Communiation Network Engineering Tehnial University of Braunshweig,

More information

PROJECT PERIODIC REPORT

PROJECT PERIODIC REPORT FP7-ICT-2007-1 Contrat no.: 215040 www.ative-projet.eu PROJECT PERIODIC REPORT Publishable Summary Grant Agreement number: ICT-215040 Projet aronym: Projet title: Enabling the Knowledge Powered Enterprise

More information

Parallel Block-Layered Nonbinary QC-LDPC Decoding on GPU

Parallel Block-Layered Nonbinary QC-LDPC Decoding on GPU Parallel Blok-Layered Nonbinary QC-LDPC Deoding on GPU Huyen Thi Pham, Sabooh Ajaz and Hanho Lee Department of Information and Communiation Engineering, Inha University, Inheon, 42-751, Korea Abstrat This

More information

Automatic Generation of Transaction-Level Models for Rapid Design Space Exploration

Automatic Generation of Transaction-Level Models for Rapid Design Space Exploration Automati Generation of Transation-Level Models for Rapid Design Spae Exploration Dongwan Shin, Andreas Gerstlauer, Junyu Peng, Rainer Dömer and Daniel D. Gajski Center for Embedded Computer Systems University

More information

The AMDREL Project in Retrospective

The AMDREL Project in Retrospective The AMDREL Projet in Retrospetive K. Siozios 1, G. Koutroumpezis 1, K. Tatas 1, N. Vassiliadis 2, V. Kalenteridis 2, H. Pournara 2, I. Pappas 2, D. Soudris 1, S. Nikolaidis 2, S. Siskos 2, and A. Thanailakis

More information

Cross-layer Resource Allocation on Broadband Power Line Based on Novel QoS-priority Scheduling Function in MAC Layer

Cross-layer Resource Allocation on Broadband Power Line Based on Novel QoS-priority Scheduling Function in MAC Layer Communiations and Networ, 2013, 5, 69-73 http://dx.doi.org/10.4236/n.2013.53b2014 Published Online September 2013 (http://www.sirp.org/journal/n) Cross-layer Resoure Alloation on Broadband Power Line Based

More information

Approximate logic synthesis for error tolerant applications

Approximate logic synthesis for error tolerant applications Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu

More information

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study What are Cyle-Stealing Systems Good For? A Detailed Performane Model Case Study Wayne Kelly and Jiro Sumitomo Queensland University of Tehnology, Australia {w.kelly, j.sumitomo}@qut.edu.au Abstrat The

More information

Large Exploration for HW/SW partitioning of Multirate and Aperiodic Real-Time Systems

Large Exploration for HW/SW partitioning of Multirate and Aperiodic Real-Time Systems Large Exploration for HW/SW partitioning of Multirate and Aperiodi Real-Time Systems Abdenour Azzedine, Jean-hilippe Diguet, Jean-Lu illippe Université de Bretagne SUD; LESTER lab.; Lorient, Frane jean-philippe.diguet@univ-ubs.fr

More information

COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY

COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY COST PERFORMANCE ASPECTS OF CCD FAST AUXILIARY MEMORY Dileep P, Bhondarkor Texas Instruments Inorporated Dallas, Texas ABSTRACT Charge oupled devies (CCD's) hove been mentioned as potential fast auxiliary

More information

Parallelizing Frequent Web Access Pattern Mining with Partial Enumeration for High Speedup

Parallelizing Frequent Web Access Pattern Mining with Partial Enumeration for High Speedup Parallelizing Frequent Web Aess Pattern Mining with Partial Enumeration for High Peiyi Tang Markus P. Turkia Department of Computer Siene Department of Computer Siene University of Arkansas at Little Rok

More information

XML Data Streams. XML Stream Processing. XML Stream Processing. Yanlei Diao. University of Massachusetts Amherst

XML Data Streams. XML Stream Processing. XML Stream Processing. Yanlei Diao. University of Massachusetts Amherst XML Stream Proessing Yanlei Diao University of Massahusetts Amherst XML Data Streams XML is the wire format for data exhanged online. Purhase orders http://www.oasis-open.org/ommittees/t_home.php?wg_abbrev=ubl

More information

DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Euncheol Kim, Gwan Choi, Mark Yeary *

DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Euncheol Kim, Gwan Choi, Mark Yeary * DECODING OF ARRAY LDPC CODES USING ON-THE FLY COMPUTATION Kiran Gunnam, Weihuang Wang, Eunheol Kim, Gwan Choi, Mark Yeary * Dept. of Eletrial Engineering, Texas A&M University, College Station, TX-77840

More information

Multi-Channel Wireless Networks: Capacity and Protocols

Multi-Channel Wireless Networks: Capacity and Protocols Multi-Channel Wireless Networks: Capaity and Protools Tehnial Report April 2005 Pradeep Kyasanur Dept. of Computer Siene, and Coordinated Siene Laboratory, University of Illinois at Urbana-Champaign Email:

More information

The Implementation of RRTs for a Remote-Controlled Mobile Robot

The Implementation of RRTs for a Remote-Controlled Mobile Robot ICCAS5 June -5, KINEX, Gyeonggi-Do, Korea he Implementation of RRs for a Remote-Controlled Mobile Robot Chi-Won Roh*, Woo-Sub Lee **, Sung-Chul Kang *** and Kwang-Won Lee **** * Intelligent Robotis Researh

More information

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking Algorithms for External Memory Leture 6 Graph Algorithms - Weighted List Ranking Leturer: Nodari Sithinava Sribe: Andi Hellmund, Simon Ohsenreither 1 Introdution & Motivation After talking about I/O-effiient

More information

Zippy - A coarse-grained reconfigurable array with support for hardware virtualization

Zippy - A coarse-grained reconfigurable array with support for hardware virtualization Zippy - A oarse-grained reonfigurable array with support for hardware virtualization Christian Plessl Computer Engineering and Networks Lab ETH Zürih, Switzerland plessl@tik.ee.ethz.h Maro Platzner Department

More information

Design of High Speed Mac Unit

Design of High Speed Mac Unit Design of High Speed Ma Unit 1 Harish Babu N, 2 Rajeev Pankaj N 1 PG Student, 2 Assistant professor Shools of Eletronis Engineering, VIT University, Vellore -632014, TamilNadu, India. 1 harishharsha72@gmail.om,

More information

HEXA: Compact Data Structures for Faster Packet Processing

HEXA: Compact Data Structures for Faster Packet Processing Washington University in St. Louis Washington University Open Sholarship All Computer Siene and Engineering Researh Computer Siene and Engineering Report Number: 27-26 27 HEXA: Compat Data Strutures for

More information

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425) Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University

More information

M32: A Constructive Multilevel Logic Synthesis System*

M32: A Constructive Multilevel Logic Synthesis System* M32: A Construtive Multilevel Logi Synthesis System* Vitor N. Kravets Karem A. Sakallah Department of Eletrial Engineering and Computer Siene University of Mihigan, Ann Arbor, MI 48109 {vkravets, karem}@ees.umih.edu

More information

We don t need no generation - a practical approach to sliding window RLNC

We don t need no generation - a practical approach to sliding window RLNC We don t need no generation - a pratial approah to sliding window RLNC Simon Wunderlih, Frank Gabriel, Sreekrishna Pandi, Frank H.P. Fitzek Deutshe Telekom Chair of Communiation Networks, TU Dresden, Dresden,

More information

Dynamic Algorithms Multiple Choice Test

Dynamic Algorithms Multiple Choice Test 3226 Dynami Algorithms Multiple Choie Test Sample test: only 8 questions 32 minutes (Real test has 30 questions 120 minutes) Årskort Name Eah of the following 8 questions has 4 possible answers of whih

More information

CA Release Automation 5.x Implementation Proven Professional Exam (CAT-600) Study Guide Version 1.1

CA Release Automation 5.x Implementation Proven Professional Exam (CAT-600) Study Guide Version 1.1 Exam (CAT-600) Study Guide Version 1.1 PROPRIETARY AND CONFIDENTIAL INFORMATION 2016 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized

More information

Acoustic Links. Maximizing Channel Utilization for Underwater

Acoustic Links. Maximizing Channel Utilization for Underwater Maximizing Channel Utilization for Underwater Aousti Links Albert F Hairris III Davide G. B. Meneghetti Adihele Zorzi Department of Information Engineering University of Padova, Italy Email: {harris,davide.meneghetti,zorzi}@dei.unipd.it

More information

Uplink Channel Allocation Scheme and QoS Management Mechanism for Cognitive Cellular- Femtocell Networks

Uplink Channel Allocation Scheme and QoS Management Mechanism for Cognitive Cellular- Femtocell Networks 62 Uplink Channel Alloation Sheme and QoS Management Mehanism for Cognitive Cellular- Femtoell Networks Kien Du Nguyen 1, Hoang Nam Nguyen 1, Hiroaki Morino 2 and Iwao Sasase 3 1 University of Engineering

More information

Graph-Based vs Depth-Based Data Representation for Multiview Images

Graph-Based vs Depth-Based Data Representation for Multiview Images Graph-Based vs Depth-Based Data Representation for Multiview Images Thomas Maugey, Antonio Ortega, Pasal Frossard Signal Proessing Laboratory (LTS), Eole Polytehnique Fédérale de Lausanne (EPFL) Email:

More information

Sparse Certificates for 2-Connectivity in Directed Graphs

Sparse Certificates for 2-Connectivity in Directed Graphs Sparse Certifiates for 2-Connetivity in Direted Graphs Loukas Georgiadis Giuseppe F. Italiano Aikaterini Karanasiou Charis Papadopoulos Nikos Parotsidis Abstrat Motivated by the emergene of large-sale

More information

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality INTERNATIONAL CONFERENCE ON MANUFACTURING AUTOMATION (ICMA200) Multi-Piee Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality Stephen Stoyan, Yong Chen* Epstein Department of

More information

CleanUp: Improving Quadrilateral Finite Element Meshes

CleanUp: Improving Quadrilateral Finite Element Meshes CleanUp: Improving Quadrilateral Finite Element Meshes Paul Kinney MD-10 ECC P.O. Box 203 Ford Motor Company Dearborn, MI. 8121 (313) 28-1228 pkinney@ford.om Abstrat: Unless an all quadrilateral (quad)

More information

SSD Based First Layer File System for the Next Generation Super-computer

SSD Based First Layer File System for the Next Generation Super-computer SSD Based First Layer File System for the Next Generation Super-omputer Shinji Sumimoto, Ph.D. Next Generation Tehnial Computing Unit FUJITSU LIMITED Sept. 24 th, 2018 0 Outline of This Talk A64FX: High

More information

Fuzzy Meta Node Fuzzy Metagraph and its Cluster Analysis

Fuzzy Meta Node Fuzzy Metagraph and its Cluster Analysis Journal of Computer Siene 4 (): 9-97, 008 ISSN 549-3636 008 Siene Publiations Fuzzy Meta Node Fuzzy Metagraph and its Cluster Analysis Deepti Gaur, Aditya Shastri and Ranjit Biswas Department of Computer

More information

Implementing Load-Balanced Switches With Fat-Tree Networks

Implementing Load-Balanced Switches With Fat-Tree Networks Implementing Load-Balaned Swithes With Fat-Tree Networks Hung-Shih Chueh, Ching-Min Lien, Cheng-Shang Chang, Jay Cheng, and Duan-Shin Lee Department of Eletrial Engineering & Institute of Communiations

More information

1. Introduction. 2. The Probable Stope Algorithm

1. Introduction. 2. The Probable Stope Algorithm 1. Introdution Optimization in underground mine design has reeived less attention than that in open pit mines. This is mostly due to the diversity o underground mining methods and omplexity o underground

More information

Interconnection Styles

Interconnection Styles Interonnetion tyles oftware Design Following the Export (erver) tyle 2 M1 M4 M5 4 M3 M6 1 3 oftware Design Following the Export (Client) tyle e 2 e M1 M4 M5 4 M3 M6 1 e 3 oftware Design Following the Export

More information

Compilation Lecture 11a. Register Allocation Noam Rinetzky. Text book: Modern compiler implementation in C Andrew A.

Compilation Lecture 11a. Register Allocation Noam Rinetzky. Text book: Modern compiler implementation in C Andrew A. Compilation 0368-3133 Leture 11a Text book: Modern ompiler implementation in C Andrew A. Appel Register Alloation Noam Rinetzky 1 Registers Dediated memory loations that an be aessed quikly, an have omputations

More information

Performance Benchmarks for an Interactive Video-on-Demand System

Performance Benchmarks for an Interactive Video-on-Demand System Performane Benhmarks for an Interative Video-on-Demand System. Guo,P.G.Taylor,E.W.M.Wong,S.Chan,M.Zukerman andk.s.tang ARC Speial Researh Centre for Ultra-Broadband Information Networks (CUBIN) Department

More information

Constructing Transaction Serialization Order for Incremental. Data Warehouse Refresh. Ming-Ling Lo and Hui-I Hsiao. IBM T. J. Watson Research Center

Constructing Transaction Serialization Order for Incremental. Data Warehouse Refresh. Ming-Ling Lo and Hui-I Hsiao. IBM T. J. Watson Research Center Construting Transation Serialization Order for Inremental Data Warehouse Refresh Ming-Ling Lo and Hui-I Hsiao IBM T. J. Watson Researh Center July 11, 1997 Abstrat In typial pratie of data warehouse, the

More information

Announcements. Lecture Caching Issues for Multi-core Processors. Shared Vs. Private Caches for Small-scale Multi-core

Announcements. Lecture Caching Issues for Multi-core Processors. Shared Vs. Private Caches for Small-scale Multi-core Announements Your fous should be on the lass projet now Leture 17: Cahing Issues for Multi-ore Proessors This week: status update and meeting A short presentation on: projet desription (problem, importane,

More information

Exploring the Commonality in Feature Modeling Notations

Exploring the Commonality in Feature Modeling Notations Exploring the Commonality in Feature Modeling Notations Miloslav ŠÍPKA Slovak University of Tehnology Faulty of Informatis and Information Tehnologies Ilkovičova 3, 842 16 Bratislava, Slovakia miloslav.sipka@gmail.om

More information

The HuMAnS toolbox, a homogenous framework for motion capture, analysis and simulation

The HuMAnS toolbox, a homogenous framework for motion capture, analysis and simulation The HuMAnS toolbox, a homogenous framework for motion apture, analysis and simulation Pierre-Brie Wieber, Florene Billet, Laurene Boissieux, Roger Pissard-Gibollet To ite this version: Pierre-Brie Wieber,

More information

Multiple Assignments

Multiple Assignments Two Outputs Conneted Together Multiple Assignments Two Outputs Conneted Together if (En1) Q

More information

A Novel Validity Index for Determination of the Optimal Number of Clusters

A Novel Validity Index for Determination of the Optimal Number of Clusters IEICE TRANS. INF. & SYST., VOL.E84 D, NO.2 FEBRUARY 2001 281 LETTER A Novel Validity Index for Determination of the Optimal Number of Clusters Do-Jong KIM, Yong-Woon PARK, and Dong-Jo PARK, Nonmembers

More information

Establishing Secure Ethernet LANs Using Intelligent Switching Hubs in Internet Environments

Establishing Secure Ethernet LANs Using Intelligent Switching Hubs in Internet Environments Establishing Seure Ethernet LANs Using Intelligent Swithing Hubs in Internet Environments WOEIJIUNN TSAUR AND SHIJINN HORNG Department of Eletrial Engineering, National Taiwan University of Siene and Tehnology,

More information

Definitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework

Definitions Homework. Quine McCluskey Optimal solutions are possible for some large functions Espresso heuristic. Definitions Homework EECS 33 There be Dragons here http://ziyang.ees.northwestern.edu/ees33/ Teaher: Offie: Email: Phone: L477 Teh dikrp@northwestern.edu 847 467 2298 Today s material might at first appear diffiult Perhaps

More information

Batch Auditing for Multiclient Data in Multicloud Storage

Batch Auditing for Multiclient Data in Multicloud Storage Advaned Siene and Tehnology Letters, pp.67-73 http://dx.doi.org/0.4257/astl.204.50. Bath Auditing for Multilient Data in Multiloud Storage Zhihua Xia, Xinhui Wang, Xingming Sun, Yafeng Zhu, Peng Ji and

More information

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advaned Researh in Computer Siene and Software Engineering Researh Paper Available online at: www.ijarsse.om A New-Fangled Algorithm

More information

A Formal Hybrid Analysis Technique for Composite Web Services Verification

A Formal Hybrid Analysis Technique for Composite Web Services Verification A Formal Hybrid Analysis Tehnique for Composite Web Servies Verifiation MAY HAIDAR 1,2, HICHAM H. HALLAL 1 1 Computer Siene Department / Department of Eletrial Engineering Fahad Bin Sultan University P.O

More information

Multiple-Criteria Decision Analysis: A Novel Rank Aggregation Method

Multiple-Criteria Decision Analysis: A Novel Rank Aggregation Method 3537 Multiple-Criteria Deision Analysis: A Novel Rank Aggregation Method Derya Yiltas-Kaplan Department of Computer Engineering, Istanbul University, 34320, Avilar, Istanbul, Turkey Email: dyiltas@ istanbul.edu.tr

More information

Manipulation of Graphs, Algebras and Pictures. Essays Dedicated to Hans-Jörg Kreowski on the Occasion of His 60th Birthday

Manipulation of Graphs, Algebras and Pictures. Essays Dedicated to Hans-Jörg Kreowski on the Occasion of His 60th Birthday Eletroni Communiations of the EASST Volume 26 (2010) Manipulation of Graphs, Algebras and Pitures Essays Dediated to Hans-Jörg Kreowski on the Oasion of His 60th Birthday Autonomous Units for Solving the

More information

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and

More information

C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING

C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING C- BASED RAPID PROTOTYPING FOR DIGITAL SIGNAL PROCESSING Emmanuel Casseau, Bertrand Le Gal, Pierre Bomel, Christophe Jego, Sylvain Huet, Eric Martin To cite this version: Emmanuel Casseau, Bertrand Le

More information

Allocating Rotating Registers by Scheduling

Allocating Rotating Registers by Scheduling Alloating Rotating Registers by Sheduling Hongbo Rong Hyunhul Park Cheng Wang Youfeng Wu Programming Systems Lab Intel Labs {hongbo.rong,hyunhul.park,heng..wang,youfeng.wu}@intel.om ABSTRACT A rotating

More information

Direct-Mapped Caches

Direct-Mapped Caches A Case for Diret-Mapped Cahes Mark D. Hill University of Wisonsin ahe is a small, fast buffer in whih a system keeps those parts, of the ontents of a larger, slower memory that are likely to be used soon.

More information

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays nalysis of input and output onfigurations for use in four-valued D programmable logi arrays J.T. utler H.G. Kerkhoff ndexing terms: Logi, iruit theory and design, harge-oupled devies bstrat: s in binary,

More information

Chapter 2: Introduction to Maple V

Chapter 2: Introduction to Maple V Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard

More information

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks Flow Demands Oriented Node Plaement in Multi-Hop Wireless Networks Zimu Yuan Institute of Computing Tehnology, CAS, China {zimu.yuan}@gmail.om arxiv:153.8396v1 [s.ni] 29 Mar 215 Abstrat In multi-hop wireless

More information

RAC 2 E: Novel Rendezvous Protocol for Asynchronous Cognitive Radios in Cooperative Environments

RAC 2 E: Novel Rendezvous Protocol for Asynchronous Cognitive Radios in Cooperative Environments 21st Annual IEEE International Symposium on Personal, Indoor and Mobile Radio Communiations 1 RAC 2 E: Novel Rendezvous Protool for Asynhronous Cognitive Radios in Cooperative Environments Valentina Pavlovska,

More information

High Speed Area Efficient VLSI Architecture for DCT using Proposed CORDIC Algorithm

High Speed Area Efficient VLSI Architecture for DCT using Proposed CORDIC Algorithm International Journal of Innovative Researh in Siene, Engineering and Tehnology Website: www.ijirset.om High Speed Area Effiient VLSI Arhiteture for DCT using Proposed CORDIC Algorithm Deepnarayan Sinha

More information

Scheduling Multiple Independent Hard-Real-Time Jobs on a Heterogeneous Multiprocessor

Scheduling Multiple Independent Hard-Real-Time Jobs on a Heterogeneous Multiprocessor Sheduling Multiple Independent Hard-Real-Time Jobs on a Heterogeneous Multiproessor Orlando Moreira NXP Semiondutors Researh Eindhoven, Netherlands orlando.moreira@nxp.om Frederio Valente Universidade

More information

Improved flooding of broadcast messages using extended multipoint relaying

Improved flooding of broadcast messages using extended multipoint relaying Improved flooding of broadast messages using extended multipoint relaying Pere Montolio Aranda a, Joaquin Garia-Alfaro a,b, David Megías a a Universitat Oberta de Catalunya, Estudis d Informàtia, Mulimèdia

More information

splitting tehniques that partition live ranges have been proposed to solve both the spilling problem[5][8] and the assignment problem[8][9]. The parti

splitting tehniques that partition live ranges have been proposed to solve both the spilling problem[5][8] and the assignment problem[8][9]. The parti Load/Store Range Analysis for Global Register Alloation Priyadarshan Kolte and Mary Jean Harrold Department of Computer Siene Clemson University Abstrat Live range splitting tehniques divide the live ranges

More information

The recursive decoupling method for solving tridiagonal linear systems

The recursive decoupling method for solving tridiagonal linear systems Loughborough University Institutional Repository The reursive deoupling method for solving tridiagonal linear systems This item was submitted to Loughborough University's Institutional Repository by the/an

More information

13.1 Numerical Evaluation of Integrals Over One Dimension

13.1 Numerical Evaluation of Integrals Over One Dimension 13.1 Numerial Evaluation of Integrals Over One Dimension A. Purpose This olletion of subprograms estimates the value of the integral b a f(x) dx where the integrand f(x) and the limits a and b are supplied

More information

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application World Aademy of Siene, Engineering and Tehnology 8 009 Performane of Histogram-Based Skin Colour Segmentation for Arms Detetion in Human Motion Analysis Appliation Rosalyn R. Porle, Ali Chekima, Farrah

More information

CA Test Data Manager 4.x Implementation Proven Professional Exam (CAT-681) Study Guide Version 1.0

CA Test Data Manager 4.x Implementation Proven Professional Exam (CAT-681) Study Guide Version 1.0 Implementation Proven Professional Study Guide Version 1.0 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer

More information

Fast Distribution of Replicated Content to Multi- Homed Clients Mohammad Malli Arab Open University, Beirut, Lebanon

Fast Distribution of Replicated Content to Multi- Homed Clients Mohammad Malli Arab Open University, Beirut, Lebanon ACEEE Int. J. on Information Tehnology, Vol. 3, No. 2, June 2013 Fast Distribution of Repliated Content to Multi- Homed Clients Mohammad Malli Arab Open University, Beirut, Lebanon Email: mmalli@aou.edu.lb

More information

CONTROL SYSTEMS ANALYSIS & DESIGN SERVER. F. Morilla*, A. Fernández +, S. Dormido Canto*

CONTROL SYSTEMS ANALYSIS & DESIGN SERVER. F. Morilla*, A. Fernández +, S. Dormido Canto* CONTROL SYSTEMS ANALYSS & ESGN SERVER F. Morilla*, A. Fernández +, S. ormido Canto* * pto de nformátia y Automátia, UNE, Avda. Senda del Rey 9, 28040 Madrid, Spain. Phone:34-91-3987156, Fax:34-91-3986697,

More information

A PROTOTYPE OF INTELLIGENT VIDEO SURVEILLANCE CAMERAS

A PROTOTYPE OF INTELLIGENT VIDEO SURVEILLANCE CAMERAS INTERNATIONAL JOURNAL OF INFORMATION AND SYSTEMS SCIENCES Volume 1, 3, Number 1, 3, Pages 1-22 365-382 2007 Institute for Sientifi Computing and Information A PROTOTYPE OF INTELLIGENT VIDEO SURVEILLANCE

More information

Mining effective design solutions based on a model-driven approach

Mining effective design solutions based on a model-driven approach ata Mining VI 463 Mining effetive design solutions based on a model-driven approah T. Katsimpa 2, S. Sirmakessis 1,. Tsakalidis 1,2 & G. Tzimas 1,2 1 Researh ademi omputer Tehnology Institute, Hellas 2

More information

Outline. CS38 Introduction to Algorithms. Administrative Stuff. Administrative Stuff. Motivation/Overview. Administrative Stuff

Outline. CS38 Introduction to Algorithms. Administrative Stuff. Administrative Stuff. Motivation/Overview. Administrative Stuff Outline CS38 Introdution to Algorithms Leture 1 April 1, 2014 administrative stuff motivation and overview of the ourse stale mathings example graphs, representing graphs graph traversals (BFS, DFS) onnetivity,

More information

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks Unsupervised Stereosopi Video Objet Segmentation Based on Ative Contours and Retrainable Neural Networks KLIMIS NTALIANIS, ANASTASIOS DOULAMIS, and NIKOLAOS DOULAMIS National Tehnial University of Athens

More information

Gray Codes for Reflectable Languages

Gray Codes for Reflectable Languages Gray Codes for Refletable Languages Yue Li Joe Sawada Marh 8, 2008 Abstrat We lassify a type of language alled a refletable language. We then develop a generi algorithm that an be used to list all strings

More information

Accommodations of QoS DiffServ Over IP and MPLS Networks

Accommodations of QoS DiffServ Over IP and MPLS Networks Aommodations of QoS DiffServ Over IP and MPLS Networks Abdullah AlWehaibi, Anjali Agarwal, Mihael Kadoh and Ahmed ElHakeem Department of Eletrial and Computer Department de Genie Eletrique Engineering

More information

Cluster-based Cooperative Communication with Network Coding in Wireless Networks

Cluster-based Cooperative Communication with Network Coding in Wireless Networks Cluster-based Cooperative Communiation with Network Coding in Wireless Networks Zygmunt J. Haas Shool of Eletrial and Computer Engineering Cornell University Ithaa, NY 4850, U.S.A. Email: haas@ee.ornell.edu

More information

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8 Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introdution... 1 1.1. Internet Information...2 1.2. Internet Information Retrieval...3 1.2.1. Doument Indexing...4 1.2.2. Doument Retrieval...4

More information

CA Agile Requirements Designer 2.x Implementation Proven Professional Exam (CAT-720) Study Guide Version 1.0

CA Agile Requirements Designer 2.x Implementation Proven Professional Exam (CAT-720) Study Guide Version 1.0 Exam (CAT-720) Study Guide Version 1.0 PROPRIETARY AND CONFIDENTIAL INFORMATION 2017 CA. All rights reserved. CA onfidential & proprietary information. For CA, CA Partner and CA Customer use only. No unauthorized

More information

Computing Pool: a Simplified and Practical Computational Grid Model

Computing Pool: a Simplified and Practical Computational Grid Model Computing Pool: a Simplified and Pratial Computational Grid Model Peng Liu, Yao Shi, San-li Li Institute of High Performane Computing, Department of Computer Siene and Tehnology, Tsinghua University, Beijing,

More information

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem Calulation of typial running time of a branh-and-bound algorithm for the vertex-over problem Joni Pajarinen, Joni.Pajarinen@iki.fi Otober 21, 2007 1 Introdution The vertex-over problem is one of a olletion

More information

Make your process world

Make your process world Automation platforms Modion Quantum Safety System Make your proess world a safer plae You are faing omplex hallenges... Safety is at the heart of your proess In order to maintain and inrease your ompetitiveness,

More information

Distributed Resource Allocation Strategies for Achieving Quality of Service in Server Clusters

Distributed Resource Allocation Strategies for Achieving Quality of Service in Server Clusters Proeedings of the 45th IEEE Conferene on Deision & Control Manhester Grand Hyatt Hotel an Diego, CA, UA, Deember 13-15, 2006 Distributed Resoure Alloation trategies for Ahieving Quality of ervie in erver

More information

1 The Knuth-Morris-Pratt Algorithm

1 The Knuth-Morris-Pratt Algorithm 5-45/65: Design & Analysis of Algorithms September 26, 26 Leture #9: String Mathing last hanged: September 26, 27 There s an entire field dediated to solving problems on strings. The book Algorithms on

More information