Chapter 1. Introduction

Size: px
Start display at page:

Download "Chapter 1. Introduction"

Transcription

1 Chapter 1 Introducton 1.1 Parallel Processng There s a contnual demand for greater computatonal speed from a computer system than s currently possble (.e. sequental systems). Areas need great computatonal speed nclude numercal modelng and smulaton of scentfc and engneerng problems. For example; weather forecastng, predctng the moton of the astronomcal bodes n the space, vrtual realty, etc. Such problems are known as grand challenge problems. On the other hand, the grand challenge problem s the problem that cannot be solved n a reasonable amount of tme [1]. One way of ncreasng the computatonal speed s by usng multple processors n sngle case box or network of computers lke cluster operate together on a sngle problem. Therefore, the overall problem s needed to splt nto parttons, wth each partton s performed by a separate processor n parallel. Wrtng programs for ths form of computaton s known as parallel programmng [1]. How to execute the programs of applcatons n very fast way and on a concurrent manner? Ths s known as parallel processng. In the parallel processng, we must have underlne parallel archtectures, as well as, parallel programmng languages and algorthms. 1.2 Parallel Archtectures The man feature of a parallel archtecture s that there s more than one processor. These processors may communcate and cooperate wth one another to execute the program nstructons. There are dverse classfcatons for the parallel archtectures and the most popular one s the Flynn taxonomy (see Fgure 1.1) [2]. 1

2 1.2.1 The Flynn Taxonomy Mchael Flynn [2] has ntroduced taxonomy for varous computer archtectures based on notons of Instructon Streams (IS) and Data Streams (DS). Accordng to ths taxonomy, the parallel archtectures could be classfed nto four categores; Sngle Instructon Sngle Data (SISD), Multple Instructon Sngle Data (MISD), Sngle Instructon Multple Data (SIMD), and Multple Instructon Multple Data (MIMD). SISD Von Neumann Model Parallel computers SIMD MISD MIMD Array Processors Ppelned Vector Processors Ppelned Vector Processors Systolc Arrays Multprocessors Multcomputers Data Flow Machnes Network Cluster Grd Computng Fgure 1.1: Taxonomy of Parallel Processng Archtectures Sngle Instructon Sngle Data (SISD) Accordng to ths category, only one nstructon stream s executed on one data stream each tme. Conventonal sequental machnes are consdered SISD archtecture. The prototype of the sequental machne s shown n Fgure

3 I/O IS IS DS CU PE MU CU: s the Control Unt PE: Processor Element IS: Instructon Stream DS: Data Stream Fgure 1.2: SISD Unprocessor Archtecture. Sngle Instructon Multple Data Stream (SIMD) The SIMD archtectures consst of N processors (see Fgure 1.3). The processors operate synchronously, where at each cycle all processors execute the same nstructon, each on a dfferent datum. Some popular commercal SIMD archtectures are ILLIAC IV, DAP, and connecton machne CM-1 and CM-2. The SIMD archtectures can also support vector processng, whch can be accomplshed by assgnng vector of elements to ndvdual processors for concurrent computaton [3]. SIMD machne s a synchronzed machne, where all of the processors n the machne execute only one nstructon at a tme and the fastng processor has to wat the slowest one before t s startng executng next nstructon. Ths type of synchronzaton s called Lockstep [3]. These archtectures are used mostly for problems havng hgh degrees of data parallelsm. Interconnecton Network DS 1 DS 2 DS N PE 1 PE 2 PE N IS CU Fgure 1.3: SIMD Archtecture 3

4 Multple Instructons Sngle Data (MISD) The MISD archtectures consst of N processors each wth ts own control unt sharng a common memory where data resde (see Fgure 1.4) [3]. The archtecture of the MISD computers fall nto two dfferent categores: 1. A class of machnes that would requre dstnct processors that would receve dstnct nstructons to be performed on the same data. 2. A class of machnes such that data flows through a seres of processors (.e. ppelned archtectures). On type of ths machne s a systolc array. A systolc array s formed wth a network of functonal unts whch are locally connected. Ths array operates synchronously wth multdmensonal ppelnng. Therefore, ths class of multdmensonal ppelned array archtectures s desgned for mplementng fxed algorthms and t offers good performance for specal applcatons, lke sgnal and mage processng, wth smple, regular and modular layouts. PE 1 IS 1 CU 1 Memory DS PE 2 IS 2 CU 2 PE N IS N CU N Fgure 1.4: MISD Archtecture Multple Instructons Multple Data (MIMD) Ths class of archtectures s the most general and powerful one, where most of practcal applcatons need MIMD machnes [3]. The MIMD archtectures consst of N processors. Each processor operates under the control of an nstructon stream ssued by ts control unt (see Fgure 1.5). All processors are potentally executng dfferent subproblems on dfferent data whle solvng a sngle problem. Ths means that the processors typcally operate asynchronously. The communcaton between processors s preformed through a 4

5 shared memory or an nterconnecton network. MIMD computers wth a shared common memory are often referred to as multprocessors (or tghtly coupled machnes), whle those wth an nterconnecton network connectng processors are known as multcomputers (or loosely coupled machnes) [4]. Gordon Bell [5] has provded taxonomy of MIMD machnes (see Fgure 1.6). He consders shared-memory multprocessors as havng a sngle address space (.e. global address space) and multcomputers use dstrbuted memores as havng multple address space (.e. local address space). Shared Memory Or Interconnecton network DS 1 DS 2 DS N PE 1 IS 1 IS 2 PE 2 IS N PE N CU 1 CU 2 CU N Fgure 1.5: MIMD Archtecture Multcomputer Archtectures. Accordng to Bell taxonomy of MIMD archtectures, the multcomputer archtecture s classfed nto dstrbuted multcomputer archtectures and central multcomputer. Dstrbuted multcomputer archtecture (see Fgure 1.7) conssts of multple computers nterconnected by a message-passng network. Each computer conssts of a processor, local memory, and attached I/O perpherals. All local memores are prvate and not accessble by other processors. The communcaton between processors s acheved through a message-passng nterface protocol, va a general nterconnecton network. The dsadvantage of these archtectures s that a programmng model mposes a sgnfcant burden on the programmer, whch nduces consderable software overhead. On the other hand, these archtectures are clamed to have better scalablty and cost-effectveness [6]. 5

6 Dynamc bndng of addresses to processors KSR MIMD Multprocessors Dstrbuted memory multprocessors Central memory multprocessors Dstrbuted multcomputers Statc bndng, rng mult IEEE SCI standard proposal Statc bndng, cacheng Allant, DASH Statc program bndng BBN, Cedar, CM* Cross-pont or mult-stage Cray, Futsu, Htach, IBM, NEC, Tera Smple, rng mult,bus mult replacement Bus mults DEC, Encore,NCR, Sequent, SGI, Sun Mesh connected Intel Butterfly/Fat Tree, CM5 Hypercubes NCUBE Multcomputers Fast LANs for hgh avalablty and hgh capacty clusters. DEC, Tandem LANs for dstrbutd processng Workstatons, PCs Clusters PCs Clusters of Workstatons (CoWs) Central multcomputers Fgure 1.6: Bell s Taxonomy of MIMD Computers Interconnecton Network PE 1 PE 2 PE N M 1 M 2 M N Fgure 1.7: Generc Model of Message-Passng Multcomputer 6

7 Accordng to central multcomputers (see Fgure 1.8), there s one computer s consdered as a master one where all of the database as well as the applcaton programs are resdent ther. Before executng a program the master computer has to dstrbute the program parttons nto the other processors (slaves). After fnshng executng the program, the fnal results from the slaves are receved and organzed n the master. Ths type of multcomputers s referred as farm model. Master Slaves PE 1 PE LM 1 PE 2 LM 2 LM PE N LM N Fgure 1.8: Central Multcomputer Multprocessor Archtecture Multprocessor archtecture called shared memory archtecture, where a sngle global memory s shared among all processors (.e. global space memory model). Consequently, the multprocessor s classfed nto two category; shared memory multprocessor and dstrbuted shared memory (DSM) [6]. Accordng to shared memory multprocessor, only one memory s accessble by all processors equally (see Fgure 1.9). The man advantage of these archtectures s the smplcty n desgnng algorthms for them and transparent data access to the user. However, t suffers from ncreasng contenton n accessng the shared memory, whch lmts the scalablty [7]. A relatvely new concept - (DSM) - tres to combne the advantages of the multcomputer and multprocessors archtectures [8]. 7

8 The DSM archtecture can be generally vewed as a set of nodes or clusters, connected by an nterconnecton network (see Fgure 1.10). Each clusters organzed around a shared bus contans a physcally local memory module, whch s partally or entrely mapped to the DSM global address space. Prvate caches attached to the processors are nevtable for reducng memory latency. Regardless of the network topology a specfc nterconnecton controller wthn each cluster s needed to connect t nto the system. Therefore, the DSM archtecture logcally mplements the shared memory model n physcally dstrbuted memory archtecture [8]. PE 1 PE 2 Shared Memory PE N Fgure 1.9: Generc Model of Shared Memory Archtecture. Interconnecton network Cluster 1 Cluster 2 Cluster N Interconnecton controller Interconnecton controller Interconnecton controller Drectory DSM Porton Processors Caches Drectory DSM Porton Processors Caches Drectory DSM Porton Processors Caches DSM Shared Address Space Fgure 1.10: Structure and Organzaton of a DSM System. 8

9 1.3 The Parallel Algorthms Generally, a parallel algorthm can be defned as a set of nstructons that may be executed n parallel concurrently and may, connect, wth each other n order to solve a gven problem [3]. The term task or process may be defned as a part of a program that can be executed on a processor. The man phases of a parallel algorthm nclude four man phases; parttonng, communcaton, agglomeraton, and mappng phase (see Fgure 1.11). [3]. Problem Partton Communcaton Agglomeraton Mappng and schedulng Processor1 Processor2 Processor3 Fgure 1.11: Dfferent Phases of a Parallel Algorthm 9

10 1.3.1 Parttonng Phase The parttonng phase s ntended to expose opportuntes for parallel executon by decomposng computatons of assgned problem nto small tasks. A good parttonng method dvdes both the computaton and the data assocated wth a problem nto lttle nstructons [3] Communcaton Phase The communcaton phase s enables an approprate communcaton structure and algorthms to coordnate task executon. To allow computaton to proceed, the data must be transferred between tasks, whch are the outcome of the communcaton phase of a desgn. There are two communcaton structures: channel structure and message-passng structure [3]. In the channel structure, there s a lnk (drectly or ndrectly) between tasks that requre data wth other tasks that possess those data. In the message passng structure, the message s sent and receved on these channels Agglomeraton phase The frst two phases of the desgn process concern about dvdng the computatons nto a set of tasks and communcaton to provde data requred by these tasks. The agglomeraton phase s ntended to evaluate the tasks and the requred communcaton wth respect to the performance and mplementaton costs. For example, one crtcal ssue nfluencng the parallel performance s the communcaton cost. The performance mght be mproved by reducng the amount of tme spent communcatng (.e. sendng less data) [3] Mappng and Schedulng Phase The mappng and schedulng phase ams to specfy where each task has to be executed n whch processor and determne a sequence for ther executon such that the processor utlzaton s maxmzed and the communcaton costs, as well as, executon tme are mnmzed [3]. If these four phases (problems) are not properly handled, parallelzaton of an applcaton may not be benefcal. A large research efforts addressng theses problems has been 10

11 reported n the lterature [9,10]. Our work s concerned manly to study and mplement the mappng and schedule phase. Through the thess, we refer to ths phase as task schedulng problem. Two mportant ssues are nvolved n the task schedulng problem: enhancng concurrency and ncreasng localty. Enhancng concurrency deals wth placng tasks that can be executed smultaneously on dfferent processors, whle ncreasng localty refers to placng tasks that lkely communcate frequently on the same processor [3]. 1.4 The Task Schedulng Problem The task schedulng problem concerns wth determne whch computatonal tasks wll be executed on whch processor and at what tme. On the other hand, the schedulng and allocaton of multple nteractng tasks of a sngle parallel program s a hghly mportant ssue snce an napproprate schedulng of tasks can fal to explot the true performance of the system and can offset the gran from parallelzaton [10]. Therefore, ths problem has receved consderable attenton n recent years [11, 12]. The man obectve of the task schedulng s to assgn tasks to avalable processors such that precedence requrements between tasks are satsfed and, n the same tme, the overall completon tme whch known as schedule length (or make span) s mnmzed [13]. The task schedulng technques are manly classfed as statc and dynamc. In statc schedulng, the characterstc of a parallel program, ncludng task processng tmes, data dependences and synchronzaton, are known before program executon [10]. In dynamc schedulng algorthm, few assumptons about the parallel program can be made before executon, and thus, schedulng decsons have to be made on-the-fly [13]. On the other hand, the schedulng problem could be classfed nto preemptve and nonpreemptve schedulng [13]. The preemptve schedulng permts a task to be nterrupted and removed from the processor under the assumpton that t wll eventually receve the executon tme t requres. Ths nterrupton of tasks contrbutes to system overhead. Wth nonpreemptve schedulng, a task cannot be nterrupted once t has begun executon n a specfc processor. If there are no precedence relatons among the tasks formng a 11

12 program, the problem s known as task allocaton problem [13]. Our research work s focused on statc and nonpreemptve schedulng The Problem Model The model of the underlne parallel system to be consdered n ths research work could be descrbed as follows [14]: The archtecture s a network of arbtrary number of homogeneous processors. Assume P = {P 1, P 2, P 3 P m } denotes the set of m processors. Let a task graph G be a Drected, Acyclc Graph (DAG) composed of N nodes n, n2,, n, each node terms a task of the 1 N graph whch n turn s a set of nstructon that must be executed sequentally wthout preempton n the same processor. A node 1 has one or more nputs. When all nputs are avalable, the node s trggered to execute. A node wth no predecessor s called an entry node and a node wth no successor s called an ext node. The weght of node n s called the computaton cost of a node n and s denoted by weght n ). The graph also has E ( drected edges, where each edge e( n, n ) E representng a partal order among the tasks. The partal order ntroduced a precedence-constraned DAG and mples that f n n, then n s a successor ofn, whch cannot be started untl ts predecessor fnshes. The weght on an edge s called communcaton cost of the edge and s denoted by c n, n ) (. Ths cost s ncurred f n and s consdered to be zero f n and n are scheduled on dfferent processors and n are scheduled on the same processor. Let p n ) and Sc( n ) be the set of mmedate predecessors and the set of successors of the node n respectvely. Where, p( n ) { n : e( n, n ) E} and Sc( n ) { n : e( n, n ) E}. If a node n s scheduled to processor P, the start tme and fnsh tme of the node s denoted by ST n ) and FT n ) respectvely. After all nodes have been scheduled, the schedule ( ( length s defned as max { FT ( n )} across all processors. The obectve s to fnd an assgnment and the start tmes of the tasks to processors such that the schedule length s mnmzed and, n the same tme, the precedence constrans are preserved. A Crtcal Path ( n 1 task and node are exchangeable through the thess 12

13 (CP) of a task graph s defned as the path wth the maxmum sum of node and edge weghts from an entry node to an ext node. The nodes les on CP are denoted as CPNs, and the communcaton-to-computaton-rato (CCR) of a parallel program s defned as ts average edge weght dvded by ts average task weght. The task schedulng n general s known to be NP-complete problem except for some specal cases. The suggested taxonomy of the statc task schedulng algorthms s represented n Fgure Accordng to ths taxonomy, schedulng methods are dvded nto optmal soluton and non optmal soluton. Accordng to the non optmal solutons, many heurstcs have been suggested to tackle the problem under more pragmatc stuatons. These heurstcs algorthms could be dvded n two categores: greedy and non greedy (teratve). The greedy algorthms are ntalzed by a partal soluton and search to extend ths soluton untl a complete task schedulng s acheved [15]. At each step, one task assgnment s done and t cannot be changed n the remanng steps. The teratve algorthms are ntalzed by a complete schedulng and search to mprove t by movng a task from one processor to another or by exchangng the mappng of two tasks [15] Optmal Task Schedulng Algorthms The optmal soluton s known n few restrcted cases, such as polynomal tme algorthms whch descrbed as Hu algorthm [13], Coffman and Graham algorthm [16], and Papadmtrou and Yannakaks algorthm [13]. Statc schedulng Optmal Not Optmal Heurstcs Approxmate Iteratve Greedy Fgure 1.12: A Partal Taxonomy of Statc Schedulng Methods 13

14 1. HU Algorthm Hu [13] devsed a lnear-tme algorthm to schedulng problem called level algorthm. The algorthms s used to solve the schedule length n lnear tme when the task graph s ether an n-forest,.e., each task at most one mmedate successor, or an out-forset,.e., each task has at most one mmedate predecessor. The communcaton between tasks s gnored and all tasks have unt computatons. The algorthm begn wth calculate the level of each node n whch s defned as the maxmum number of nodes (ncludng n ) on any path from n to the ext node. The pseudo code of the algorthm s as follow: HU Algorthm Step1: The level of each node n the DAG s calculated and used as each nodes prorty. Step2: When the processor become avalable. Assgn t the ready node wth the hghest prorty. IF the number of nodes n a level s greater than the number of processors n the system, Then schedule the tasks usng round robn fashon. The complexty of the algorthm s (v) where v s the number of tasks n DAG[13]. Example By applyng HU algorthm to the DAG shown n Fgure 1.13a usng three processors, the lst of read tasks accordng to ts level s { 1,2,3,4,5,6,7,8,9}. The generated schedule length s shown n Fgure 1.13b. (a) n 1 n 2 n 3 n 4 n 5 n 6 n 7 n P 0 P 1 n 1 n 4 n 7 n 2 n 5 n 8 P 2 n 3 n 6 n 9 (b) n 9 Fgure1.13: (a) A Smple In-Forest Task Graph; (b) The Optmal Schedule of the Task Graph usng Three Processors System 14

15 2. Coffman and Graham Algorthm Coffman and Graham [16] devsed a quadratc-tme algorthm for schedulng an arbtrary structured DAG wth unt-weghted tasks and zero-weghted edges to a two-processors system. The algorthm starts by assgnng Labels to DAG s tasks startng wth the ext task. After that, a lst of tasks s constructed by sortng them n descendng order. Schedule each task to one of the two processors that allows earlest start tme. The pseudo code of the algorthm s as follow: Coffman and Graham Algorthm Step1: Assgn the number 1 to one of the ext tasks Step2: Let labels 1,2,, 1 have been assgned. Let S be the set of unassgned tasks wth no unlabeled successor. Select an element of S to be assgned label. For each task t n S, Defne l(t) as follow: Let n 1, n 2,,n k be the mmedate successors of t. Then l(t) s the decreasng sequence of ntegers formed by orderng the set { L( n1 ), L( n2 ),, L( nk )}. Let t be an element of S such that for all t n S, l( t) l( t ) (lexcographcally). Defne L (t) to be. End For Step3: When all tasks have been labeled, use the lst ( Tn, Tn 1,, T1 ) where for all, 1 n, L( T ) to schedule the tasks. The complexty of the algorthm s (v 2 ) where v s the number of tasks n DAG [16]. Example: Consder the task graph shown n Fgure 1.14(a). The two termnal task are assgned the label 1,2 respectvely. The set S of unassgned tasks wth no unlabeled successors becomes { 4,5}. Also t can be notced that l ( 4) {6} and l ( 5) {7,6}. Snce {6} < {7, 6} (lexcographcally), assgn label 3, 4 as shown n Fgure 1.14a. Begn the schedule by assgnng the tasks wth no predecessors after that assgn the task n decreasng order of ts label. Fgure 1.14b shows the output schedule on two processor. 15

16 5 n 1 6 n 2 7 n 3 3 n 4 4 n 5 n 6 n (a) P 0 P 1 n 3 n 2 n 1 n 5 n 4 n 7 n 6 (b) Fgure1.14: (a) A Smple Task Graph wth Unt-Tasks and No-Cost Communcaton Edges; (b) The Optmal Schedule of the Task Graph n a Two-Processors System 3. Papadmtrou and Yannakaks Algorthm Optmal schedulng algorthm has also been addressed by Papadmtrou and Yannakaks [13]. They desgned a lnear-tme algorthm to tackle the schedulng problem of an nterval-ordered DAG wth unt-weght nodes to an arbtrary number of processors. In an nterval-ordered DAG, two nodes are precedence-related f and only f the nodes can be mapped to non-overlappng ntervals on the real lne. The pseudo code of the algorthm s as follow: Papadmtrou and Yannakaks Algorthm 1. The number of successors of each node s used as ts prorty. 2. Whenever a processor becomes avalable, assgn t the ready task wth hgh prorty. Ths algorthm solves the schedulng problem for nterval order (V, A) n O( A v ). Example: Consder the problem of schedulng the nterval order gven n Fgure 1.15a on two dentcal processors. The result accordng to Papadmtrou and Yannakaks Algorthm s shown n Fgure 1.15b. 16

17 n 1 n 2 n 3 0 P 0 P 1 1 n 3 n 2 n 4 n 5 2 n 1 n 5 3 n 4 n 7 n 6 n 7 4 n 6 (a) (b) Fgure 1.15: (a) A Unt-Computatonal Interval Ordered DAG; (b) An Optmal Schedule of The DAG. 1.5 Performance Crtera The performance of task schedulng algorthms of parallel systems s generally evaluated and measured by means of some crtera (speedup, effcency, normalzed schedule length)[6, 17, 18] Speedup Speedup s a good measure for the executon of an applcaton program on a parallel system. The speedup relates to the tme for executng the program on a sngle processor to the tme for executng the same program on a parallel system. Lnear speedup means that the value of speedup ncreases as the number of processors n the parallel systems ncreases [6]. It s known that lnear speedup does not occur n dstrbuted memory archtectures because of the communcaton overhead. Assume T(1) s the tme requred for executng a program on a unprocessor computer and T(P) s the tme requred for executng the same program on a parallel computer contanng P processors. Thus the speedup can be estmated as: T(1) S( P) T( P) 17

18 Speedup Folklore Theorem [17]: For a gven computatonal problem, the speedup provded by a parallel program usng P processors, over the fastest possble sequental program for the problem, s at most equal to P. 1 S( P) P Effcency Effcency s an ndcaton to what percentage of a processor s tme s beng spent n useful computaton. The effcency of a parallel computer contanng P processors can be defned as S( P) E( P) P Snce 1 S( P) P, then 1/ P E( P) 1. The maxmum effcency s acheved when all P processors are fully utlzed throughout the executon [6] Normalzed Schedule Length The man performance measure of task schedulng algorthms s the schedule length. The Normalzed Schedule Length (NSL) of an algorthm s defned as: NSL S _ Length, weght ( n ) n CP Where S_Length s the maxmum fnsh tme of each task and weght (n ) s the executon tme of the node n. The sum of computaton costs on the CP represents a lower bound on the schedule length [18]. Such lower bound may not always be possble acheve, and the optmal schedule length may be larger than ths bound. 1.6 Obectve of the Thess The man obectve of the thess s developed and mplemented task schedulng algorthms usng genetc approach to mprove the performance of genetc algorthms, as well as, the heurstc one. So two genetc algorthms have been developed and 18

19 mplemented: Crtcal Path Genetc Algorthms (CPGA) and Task Duplcaton Genetc Algorthm (TDGA). 1.7 Organzaton of the Thess The organzaton of the thess s as follow: In chapter two, some of the early and recent task schedulng algorthms that were developed n the lterature are shortly revewed. Chapter three nclude the frst proposed algorthm called Crtcal Path Genetc Algorthm (CPGA). In ths chapter, we present brefly the basc dea of the algorthm wth some defntons, followed by the detals of the algorthm. Fnally, our CPGA compared wth Modfed Crtcal Path (MCP) algorthm. In chapter four, we present the second proposed algorthm Task Duplcaton Genetc Algorthm (TDGA). The TDGA s based on task duplcaton technque n an attempt to reduce the communcaton delays and then mnmze the overall executon tme. Therefore, the performance of the genetc algorthm s ncreased. The performance of our TDGA s compared wth a common heurstc task schedulng technque based on task duplcaton (DSH). 1.8 Concluson A survey of man types of the parallel archtectures, as well as, the man conceptes of the task schedulng has been dscussed. In the next chapter, a complete survey about man task schedulng algorthms wll be dscussed. One of these approaches s usng Genetc Algorthms (GAs) whch s consdered n ths research work. Also the prncple of GAs wll be ncluded n the next chapter. 19

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory Background EECS. Operatng System Fundamentals No. Vrtual Memory Prof. Hu Jang Department of Electrcal Engneerng and Computer Scence, York Unversty Memory-management methods normally requres the entre process

More information

AADL : about scheduling analysis

AADL : about scheduling analysis AADL : about schedulng analyss Schedulng analyss, what s t? Embedded real-tme crtcal systems have temporal constrants to meet (e.g. deadlne). Many systems are bult wth operatng systems provdng multtaskng

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach

Distributed Resource Scheduling in Grid Computing Using Fuzzy Approach Dstrbuted Resource Schedulng n Grd Computng Usng Fuzzy Approach Shahram Amn, Mohammad Ahmad Computer Engneerng Department Islamc Azad Unversty branch Mahallat, Iran Islamc Azad Unversty branch khomen,

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments

Comparison of Heuristics for Scheduling Independent Tasks on Heterogeneous Distributed Environments Comparson of Heurstcs for Schedulng Independent Tasks on Heterogeneous Dstrbuted Envronments Hesam Izakan¹, Ath Abraham², Senor Member, IEEE, Václav Snášel³ ¹ Islamc Azad Unversty, Ramsar Branch, Ramsar,

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES

A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES A SYSOLIC APPROACH O LOOP PARIIONING AND MAPPING INO FIXED SIZE DISRIBUED MEMORY ARCHIECURES Ioanns Drosts, Nektaros Kozrs, George Papakonstantnou and Panayots sanakas Natonal echncal Unversty of Athens

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems Real-tme Fault-tolerant Schedulng Algorthm for Dstrbuted Computng Systems Yun Lng, Y Ouyang College of Computer Scence and Informaton Engneerng Zheang Gongshang Unversty Postal code: 310018 P.R.CHINA {ylng,

More information

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup

More information

Cache Performance 3/28/17. Agenda. Cache Abstraction and Metrics. Direct-Mapped Cache: Placement and Access

Cache Performance 3/28/17. Agenda. Cache Abstraction and Metrics. Direct-Mapped Cache: Placement and Access Agenda Cache Performance Samra Khan March 28, 217 Revew from last lecture Cache access Assocatvty Replacement Cache Performance Cache Abstracton and Metrcs Address Tag Store (s the address n the cache?

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vidyanagar

CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vidyanagar CHARUTAR VIDYA MANDAL S SEMCOM Vallabh Vdyanagar Faculty Name: Am D. Trved Class: SYBCA Subject: US03CBCA03 (Advanced Data & Fle Structure) *UNIT 1 (ARRAYS AND TREES) **INTRODUCTION TO ARRAYS If we want

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION

CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION 24 CHAPTER 2 PROPOSED IMPROVED PARTICLE SWARM OPTIMIZATION The present chapter proposes an IPSO approach for multprocessor task schedulng problem wth two classfcatons, namely, statc ndependent tasks and

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Real-time Scheduling

Real-time Scheduling Real-tme Schedulng COE718: Embedded System Desgn http://www.ee.ryerson.ca/~courses/coe718/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrcal and Computer Engneerng Ryerson Unversty Overvew RTX

More information

Efficient Distributed File System (EDFS)

Efficient Distributed File System (EDFS) Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate

More information

Assembler. Building a Modern Computer From First Principles.

Assembler. Building a Modern Computer From First Principles. Assembler Buldng a Modern Computer From Frst Prncples www.nand2tetrs.org Elements of Computng Systems, Nsan & Schocken, MIT Press, www.nand2tetrs.org, Chapter 6: Assembler slde Where we are at: Human Thought

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Motivation. EE 457 Unit 4. Throughput vs. Latency. Performance Depends on View Point?! Computer System Performance. An individual user wants to:

Motivation. EE 457 Unit 4. Throughput vs. Latency. Performance Depends on View Point?! Computer System Performance. An individual user wants to: 4.1 4.2 Motvaton EE 457 Unt 4 Computer System Performance An ndvdual user wants to: Mnmze sngle program executon tme A datacenter owner wants to: Maxmze number of Mnmze ( ) http://e-tellgentnternetmarketng.com/webste/frustrated-computer-user-2/

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

A Genetic Algorithm Based Dynamic Load Balancing Scheme for Heterogeneous Distributed Systems

A Genetic Algorithm Based Dynamic Load Balancing Scheme for Heterogeneous Distributed Systems Proceedngs of the Internatonal Conference on Parallel and Dstrbuted Processng Technques and Applcatons, PDPTA 2008, Las Vegas, Nevada, USA, July 14-17, 2008, 2 Volumes. CSREA Press 2008, ISBN 1-60132-084-1

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Greedy Technique - Definition

Greedy Technique - Definition Greedy Technque Greedy Technque - Defnton The greedy method s a general algorthm desgn paradgm, bult on the follong elements: confguratons: dfferent choces, collectons, or values to fnd objectve functon:

More information

ELEC 377 Operating Systems. Week 6 Class 3

ELEC 377 Operating Systems. Week 6 Class 3 ELEC 377 Operatng Systems Week 6 Class 3 Last Class Memory Management Memory Pagng Pagng Structure ELEC 377 Operatng Systems Today Pagng Szes Vrtual Memory Concept Demand Pagng ELEC 377 Operatng Systems

More information

Task Scheduling for Directed Cyclic Graph. Using Matching Technique

Task Scheduling for Directed Cyclic Graph. Using Matching Technique Contemporary Engneerng Scences, Vol. 8, 2015, no. 17, 773-788 HIKARI Ltd, www.m-hkar.com http://dx.do.org/10.12988/ces.2015.56193 Task Schedulng for Drected Cyclc Graph Usng Matchng Technque W.N.M. Arffn

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

Tolerating Transient Faults in Statically Scheduled Safety-Critical Embedded Systems

Tolerating Transient Faults in Statically Scheduled Safety-Critical Embedded Systems Toleratng Transent Faults n Statcally Scheduled Safety-Crtcal Embedded Systems Nagaraan Kandasamy *, John P. Hayes *, and Bran T. Murray ** * Department of Electrcal Engneerng ** Delph Automotve Systems

More information

Scheduling Independent Tasks in Heterogeneous Environments under Communication Constraints

Scheduling Independent Tasks in Heterogeneous Environments under Communication Constraints Schedulng Independent Tasks n Heterogeneous Envronments under Communcaton Constrants Petros Lampsas 1 Thanass Loukopoulos 2 Fedon Dmopoulos 1 Marouso Athanasou 1 1 Department of Informatcs and Computer

More information

ARTICLE IN PRESS. Signal Processing: Image Communication

ARTICLE IN PRESS. Signal Processing: Image Communication Sgnal Processng: Image Communcaton 23 (2008) 754 768 Contents lsts avalable at ScenceDrect Sgnal Processng: Image Communcaton journal homepage: www.elsever.com/locate/mage Dstrbuted meda rate allocaton

More information

Storage Binding in RTL synthesis

Storage Binding in RTL synthesis Storage Bndng n RTL synthess Pe Zhang Danel D. Gajsk Techncal Report ICS-0-37 August 0th, 200 Center for Embedded Computer Systems Department of Informaton and Computer Scence Unersty of Calforna, Irne

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

A Hybrid Genetic Algorithm for Routing Optimization in IP Networks Utilizing Bandwidth and Delay Metrics

A Hybrid Genetic Algorithm for Routing Optimization in IP Networks Utilizing Bandwidth and Delay Metrics A Hybrd Genetc Algorthm for Routng Optmzaton n IP Networks Utlzng Bandwdth and Delay Metrcs Anton Redl Insttute of Communcaton Networks, Munch Unversty of Technology, Arcsstr. 21, 80290 Munch, Germany

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Conditional Speculative Decimal Addition*

Conditional Speculative Decimal Addition* Condtonal Speculatve Decmal Addton Alvaro Vazquez and Elsardo Antelo Dep. of Electronc and Computer Engneerng Unv. of Santago de Compostela, Span Ths work was supported n part by Xunta de Galca under grant

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

4/11/17. Agenda. Princeton University Computer Science 217: Introduction to Programming Systems. Goals of this Lecture. Storage Management.

4/11/17. Agenda. Princeton University Computer Science 217: Introduction to Programming Systems. Goals of this Lecture. Storage Management. //7 Prnceton Unversty Computer Scence 7: Introducton to Programmng Systems Goals of ths Lecture Storage Management Help you learn about: Localty and cachng Typcal storage herarchy Vrtual memory How the

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING

A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING A GENETIC ALGORITHM FOR PROCESS SCHEDULING IN DISTRIBUTED OPERATING SYSTEMS CONSIDERING LOAD BALANCING M. Nkravan and M. H. Kashan Department of Electrcal Computer Islamc Azad Unversty, Shahrar Shahreqods

More information

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions

Sorting Review. Sorting. Comparison Sorting. CSE 680 Prof. Roger Crawfis. Assumptions Sortng Revew Introducton to Algorthms Qucksort CSE 680 Prof. Roger Crawfs Inserton Sort T(n) = Θ(n 2 ) In-place Merge Sort T(n) = Θ(n lg(n)) Not n-place Selecton Sort (from homework) T(n) = Θ(n 2 ) In-place

More information

Evaluation of Parallel Processing Systems through Queuing Model

Evaluation of Parallel Processing Systems through Queuing Model ISSN 2278-309 Vkas Shnde, Internatonal Journal of Advanced Volume Trends 4, n Computer No.2, March Scence - and Aprl Engneerng, 205 4(2), March - Aprl 205, 36-43 Internatonal Journal of Advanced Trends

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Optimized Resource Scheduling Using Classification and Regression Tree and Modified Bacterial Foraging Optimization Algorithm

Optimized Resource Scheduling Using Classification and Regression Tree and Modified Bacterial Foraging Optimization Algorithm World Engneerng & Appled Scences Journal 7 (1): 10-17, 2016 ISSN 2079-2204 IDOSI Publcatons, 2016 DOI: 10.5829/dos.weasj.2016.7.1.22540 Optmzed Resource Schedulng Usng Classfcaton and Regresson Tree and

More information

Lecture 7 Real Time Task Scheduling. Forrest Brewer

Lecture 7 Real Time Task Scheduling. Forrest Brewer Lecture 7 Real Tme Task Schedulng Forrest Brewer Real Tme ANSI defnes real tme as A Real tme process s a process whch delvers the results of processng n a gven tme span A data may requre processng at a

More information

Needed Information to do Allocation

Needed Information to do Allocation Complexty n the Database Allocaton Desgn Must tae relatonshp between fragments nto account Cost of ntegrty enforcements Constrants on response-tme, storage, and processng capablty Needed Informaton to

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits

Repeater Insertion for Two-Terminal Nets in Three-Dimensional Integrated Circuits Repeater Inserton for Two-Termnal Nets n Three-Dmensonal Integrated Crcuts Hu Xu, Vasls F. Pavlds, and Govann De Mchel LSI - EPFL, CH-5, Swtzerland, {hu.xu,vasleos.pavlds,govann.demchel}@epfl.ch Abstract.

More information

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio,

Parallel and Distributed Association Rule Mining - Dr. Giuseppe Di Fatta. San Vigilio, Parallel and Dstrbuted Assocaton Rule Mnng - Dr. Guseppe D Fatta fatta@nf.un-konstanz.de San Vglo, 18-09-2004 1 Overvew Assocaton Rule Mnng (ARM) Apror algorthm Hgh Performance Parallel and Dstrbuted Computng

More information

Run-Time Operator State Spilling for Memory Intensive Long-Running Queries

Run-Time Operator State Spilling for Memory Intensive Long-Running Queries Run-Tme Operator State Spllng for Memory Intensve Long-Runnng Queres Bn Lu, Yal Zhu, and lke A. Rundenstener epartment of Computer Scence, Worcester Polytechnc Insttute Worcester, Massachusetts, USA {bnlu,

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Fibre-Optic AWG-based Real-Time Networks

Fibre-Optic AWG-based Real-Time Networks Fbre-Optc AWG-based Real-Tme Networks Krstna Kunert, Annette Böhm, Magnus Jonsson, School of Informaton Scence, Computer and Electrcal Engneerng, Halmstad Unversty {Magnus.Jonsson, Krstna.Kunert}@de.hh.se

More information

EFFICIENT SYNCHRONOUS PARALLEL DISCRETE EVENT SIMULATION

EFFICIENT SYNCHRONOUS PARALLEL DISCRETE EVENT SIMULATION EFFICIENT SYNCHRONOUS PARALLEL DISCRETE EVENT SIMULATION WITH THE ARMEN ARCHITECTURE C. Beaumont, B. Potter J.M. Flloque LIBr I.U.T. de Brest and LIBr Unversté de Bretagne Occdentale Télécom Bretagne BP

More information

Space-Optimal, Wait-Free Real-Time Synchronization

Space-Optimal, Wait-Free Real-Time Synchronization 1 Space-Optmal, Wat-Free Real-Tme Synchronzaton Hyeonjoong Cho, Bnoy Ravndran ECE Dept., Vrgna Tech Blacksburg, VA 24061, USA {hjcho,bnoy}@vt.edu E. Douglas Jensen The MITRE Corporaton Bedford, MA 01730,

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Performance Study of Parallel Programming on Cloud Computing Environments Using MapReduce

Performance Study of Parallel Programming on Cloud Computing Environments Using MapReduce Performance Study of Parallel Programmng on Cloud Computng Envronments Usng MapReduce Wen-Chung Shh, Shan-Shyong Tseng Department of Informaton Scence and Applcatons Asa Unversty Tachung, 41354, Tawan

More information

Number of processing elements (PEs). Computing power of each element. Amount of physical memory used. Data access, Communication and Synchronization

Number of processing elements (PEs). Computing power of each element. Amount of physical memory used. Data access, Communication and Synchronization Parallel Computer Archtecture A parallel computer s a collecton of processng elements that cooperate to solve large problems. Broad ssues nvolved: Resource Allocaton: Number of processng elements (PEs).

More information

Concurrent models of computation for embedded software

Concurrent models of computation for embedded software Concurrent models of computaton for embedded software and hardware! Researcher overvew what t looks lke semantcs what t means and how t relates desgnng an actor language actor propertes and how to represent

More information

Multitasking and Real-time Scheduling

Multitasking and Real-time Scheduling Multtaskng and Real-tme Schedulng EE8205: Embedded Computer Systems http://www.ee.ryerson.ca/~courses/ee8205/ Dr. Gul N. Khan http://www.ee.ryerson.ca/~gnkhan Electrcal and Computer Engneerng Ryerson Unversty

More information

Insertion Sort. Divide and Conquer Sorting. Divide and Conquer. Mergesort. Mergesort Example. Auxiliary Array

Insertion Sort. Divide and Conquer Sorting. Divide and Conquer. Mergesort. Mergesort Example. Auxiliary Array Inserton Sort Dvde and Conquer Sortng CSE 6 Data Structures Lecture 18 What f frst k elements of array are already sorted? 4, 7, 1, 5, 1, 16 We can shft the tal of the sorted elements lst down and then

More information

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005 Exercses (Part 4) Introducton to R UCLA/CCPR John Fox, February 2005 1. A challengng problem: Iterated weghted least squares (IWLS) s a standard method of fttng generalzed lnear models to data. As descrbed

More information

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe CSCI 104 Sortng Algorthms Mark Redekopp Davd Kempe Algorthm Effcency SORTING 2 Sortng If we have an unordered lst, sequental search becomes our only choce If we wll perform a lot of searches t may be benefcal

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Heterogeneous Parallel Computing: from Clusters of Workstations to Hierarchical Hybrid Platforms

Heterogeneous Parallel Computing: from Clusters of Workstations to Hierarchical Hybrid Platforms Heterogeneous Parallel Computng: from Clusters of Workstatons to Herarchcal Hybrd Platforms A.L. Lastovetsky 1 DOI: 10.14529/jsf140304 c The Author 2014. Ths paper s publshed wth open access at SuperFr.org

More information

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING

BIN XIA et al: AN IMPROVED K-MEANS ALGORITHM BASED ON CLOUD PLATFORM FOR DATA MINING An Improved K-means Algorthm based on Cloud Platform for Data Mnng Bn Xa *, Yan Lu 2. School of nformaton and management scence, Henan Agrcultural Unversty, Zhengzhou, Henan 450002, P.R. Chna 2. College

More information

Notes on Organizing Java Code: Packages, Visibility, and Scope

Notes on Organizing Java Code: Packages, Visibility, and Scope Notes on Organzng Java Code: Packages, Vsblty, and Scope CS 112 Wayne Snyder Java programmng n large measure s a process of defnng enttes (.e., packages, classes, methods, or felds) by name and then usng

More information

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT

DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT DESIGNING TRANSMISSION SCHEDULES FOR WIRELESS AD HOC NETWORKS TO MAXIMIZE NETWORK THROUGHPUT Bran J. Wolf, Joseph L. Hammond, and Harlan B. Russell Dept. of Electrcal and Computer Engneerng, Clemson Unversty,

More information

Maintaining temporal validity of real-time data on non-continuously executing resources

Maintaining temporal validity of real-time data on non-continuously executing resources Mantanng temporal valdty of real-tme data on non-contnuously executng resources Tan Ba, Hong Lu and Juan Yang Hunan Insttute of Scence and Technology, College of Computer Scence, 44, Yueyang, Chna Wuhan

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Mixed-Criticality Scheduling on Multiprocessors using Task Grouping

Mixed-Criticality Scheduling on Multiprocessors using Task Grouping Mxed-Crtcalty Schedulng on Multprocessors usng Task Groupng Jankang Ren Lnh Th Xuan Phan School of Software Technology, Dalan Unversty of Technology, Chna Computer and Informaton Scence Department, Unversty

More information

Multiblock method for database generation in finite element programs

Multiblock method for database generation in finite element programs Proc. of the 9th WSEAS Int. Conf. on Mathematcal Methods and Computatonal Technques n Electrcal Engneerng, Arcachon, October 13-15, 2007 53 Multblock method for database generaton n fnte element programs

More information

Verification by testing

Verification by testing Real-Tme Systems Specfcaton Implementaton System models Executon-tme analyss Verfcaton Verfcaton by testng Dad? How do they know how much weght a brdge can handle? They drve bgger and bgger trucks over

More information

3. CR parameters and Multi-Objective Fitness Function

3. CR parameters and Multi-Objective Fitness Function 3 CR parameters and Mult-objectve Ftness Functon 41 3. CR parameters and Mult-Objectve Ftness Functon 3.1. Introducton Cogntve rados dynamcally confgure the wreless communcaton system, whch takes beneft

More information

On Fault-Tolerant Embedding of Meshes and Tori in a Flexible Hypercube with Unbounded Expansion

On Fault-Tolerant Embedding of Meshes and Tori in a Flexible Hypercube with Unbounded Expansion On Fault-Tolerant Embeddng of Meshes and Tor n a Flexble Hypercube wth Unbounded Expanson *JEN-CHIH LIN Department of Dgtal Technology Desgn, Natonal Tape Unversty of Educaton, No.34, Sec. 2, Hepng E.

More information

FORESIGHTED JOINT RESOURCE RECIPROCATION AND SCHEDULING STRATEGIES FOR REAL-TIME VIDEO STREAMING OVER PEER-TO-PEER NETWORKS

FORESIGHTED JOINT RESOURCE RECIPROCATION AND SCHEDULING STRATEGIES FOR REAL-TIME VIDEO STREAMING OVER PEER-TO-PEER NETWORKS FORESIGHTED JOINT RESOURCE RECIPROCATION AND SCHEDULING STRATEGIES FOR REAL-TIME VIDEO STREAMING OVER PEER-TO-PEER NETWORKS Sunghoon Ivan Lee, Hyunggon Park, and Mhaela van der Schaar Electrcal Engneerng

More information

and NSF Engineering Research Center Abstract Generalized speedup is dened as parallel speed over sequential speed. In this paper

and NSF Engineering Research Center Abstract Generalized speedup is dened as parallel speed over sequential speed. In this paper Shared Vrtual Memory and Generalzed Speedup Xan-He Sun Janpng Zhu ICASE NSF Engneerng Research Center Mal Stop 132C Dept. of Math. and Stat. NASA Langley Research Center Msssspp State Unversty Hampton,

More information

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach Data Representaton n Dgtal Desgn, a Sngle Converson Equaton and a Formal Languages Approach Hassan Farhat Unversty of Nebraska at Omaha Abstract- In the study of data representaton n dgtal desgn and computer

More information