GPU Accelerated Blood Flow Computation using the Lattice Boltzmann Method

Size: px
Start display at page:

Download "GPU Accelerated Blood Flow Computation using the Lattice Boltzmann Method"

Transcription

1 GPU Accelerated Blood Flow Computaton usng the Lattce Boltmann Method Cosmn Nţă, Lucan Mha Itu, Constantn Sucu Department of Automaton Translvana Unversty of Braşov Braşov, Romana Constantn Sucu Corporate Technology Semens Braşov, Romana Abstract We propose a numercal mplementaton based on a Graphcs Processng Unt (GPU) for the acceleraton of the executon tme of the Lattce Boltmann Method (LBM). The study focuses on the applcaton of the LBM for patent-specfc blood flow computatons, and hence, to obtan hgher accuracy, double precson computatons are employed. The LBM specfc operatons are grouped nto two kernels, whereas only one of them uses nformaton from neghborng nodes. Snce for blood flow computatons regularly only 1/5 or less of the nodes represent flud nodes, an ndrect addressng scheme s used to reduce the memory requrements. Three GPU cards are evaluated wth dfferent 3D benchmark applcatons (Posseulle flow, ld-drven cavty flow and flow n an elbow shaped doman) and the best performng card s used to compute blood flow n a patent-specfc aorta geometry wth coarctaton. The speed-up over a mult-threaded CPU code s of 19.42x. The comparson wth a basc GPU based LBM mplementaton demonstrates the mportance of the optmaton actvtes. Keywords Lattce Boltmann Method, parallel computng, GPU, CUDA, coarctaton of the aorta I. INTRODUCTION In recent years, there has been consderable focus on computatonal approaches for modelng the flow of blood n the human cardovascular system. When used n conjuncton wth patent-specfc anatomcal models extracted from medcal mages, such technques provde mportant nsghts nto the structure and functon of the cardovascular system [1]. The Lattce Boltmann Method (LBM) has been ntroduced n the 80 s, and has developed nto an alternatve powerful numercal solver for the Naver-Stokes (NS) equatons for modelng flud flow. Specfcally, LBM has been used consstently n the last years n several blood flow applcatons (e.g. coronares [2], aneurysms [3], abdomnal aorta [4]). The LBM s a mesoscopc partcle based method, whch has ts orgn n the Lattce Gas Automata. It uses a smplfed knetc model of the essental physcs of mcroscopc processes, such that the macroscopc propertes of the system are governed by a certan set of equatons. The equaton of LBM s hyperbolc, and can be solved explctly and effcently on parallel computers [5]. Wth the ncreasng computatonal power of Graphcs Processng Unts (GPU), parallel computng has become avalable at a relatvely small cost. Wth the advent of CUDA (Compute Unfed Devce Archtecture), several researchers have dentfed the potental of GPUs to accelerate Computatonal Flud Dynamcs (CFD) applcatons to unprecedented levels [6]. Due to the hgh computatonal requrements, there has been a lot of nterest n explorng hgh performance computng technques for speedng up the LBM algorthms. Effcent CUDA based mplementatons of the 3D LBM have been proposed prevously n the lterature [7-10], whch were optmed for specfc applcatons. Tölke et al. [10] obtaned a speed-up of around 100x over a sequental mplementaton on the Intel Xeon CPU for the flow around a movng sphere. Obrecht et al. [9] studed the flow n an urban envronment and obtaned for a mult-gpu mplementaton a speed-up of 28x compared to a mult-threaded CPU based mplementaton. All these researches focused on sngle precson computatons. Wth the ntroducton of the Ferm and the Kepler archtecture, the performance of double precson computatons on NVIDIA GPU cards has ncreased substantally. In ths paper we ntroduce a parallel mplementaton of the LBM desgned for blood flow computatons. To meet the hgh accuracy requrements of blood flow applcatons, computatons are performed wth double precson. Three recently released GPUs have been consdered and, to correctly evaluate the speed-up potental, results are compared aganst both sngle-core and mult-core CPU-based mplementatons. The best performng GPU card s frst determned usng three popular benchmarkng applcatons, and then t s used for computng blood flow n a patent-specfc aorta geometry wth coarctaton (CoA), contanng the descendng aorta and the supra-aortc branches. CoA s a congental cardac defect usually consstng of a dscrete shelf-lke narrowng of the aortc meda nto the lumen of the aorta, occurrng n 5 to 8% of all patents wth congental heart dsease [11]. The narrowng can lead to a sgnfcant pressure drop, whch affects the health of the patent. Both the mportance and the potental of CFD based approaches for non-nvasve dagnoss of CoA patents have been recently emphased n a challenge [12], where the LBM produced good results. The paper s organed as follows. In secton two we frst brefly ntroduce the LBM used heren. Then we ntroduce the numercal mplementaton, focusng on ts optmed parallelaton on a GPU. Secton three frst presents detaled results for the speed-up obtaned wth dfferent GPUs for the benchmarkng applcatons, and then t dsplays the results obtaned wth the best performng GPU card for the patent

2 specfc CoA geometry. Fnally, n secton four, we draw the conclusons. II. METHODS A. The Lattce Boltmann Method For studyng the parallel mplementaton of the LBM, we consdered the sngle relaxaton tme verson of the equaton, based on the Bhatnagar-Gross-Krook (BGK) approxmaton, whch assumes that the macroscopc quanttes of the flud are not nfluenced by most of the molecular collsons: f t + c f 1 τ ( ) eq = f f, (1) where f represents the probablty dstrbuton functon along an axs c, τ s a relaxaton factor related to the flud vscosty, x represents the poston and t s the tme. The dscretaton n space and tme s performed wth fnte dfference formulas. Ths s usually done n two steps: Δt eq f t + Δ = f + ( f f ), (2a) τ and f ( x + c Δt, t + Δ = f t Δ. (2b) + The frst equaton s known as the collson step, whle the second one represents the streamng step. f eq s called the equlbrum dstrbuton and s gven by the followng formula: 2 2 eq ck u 1 ck u 1 u f = ω ρ(, ) 1+ + x t, (3) 2 2 cs 2 c 2 s cs where ω s a weghtng scalar, c s s the lattce speed of sound, c k s the drecton vector, and u s the flud velocty. ρ(x, s a scalar feld, commonly called densty, whch s related to the macroscopc flud pressure as follows: ρ p( x, =. (4) 3 Once all f have been computed, the macroscopc quanttes (velocty and densty) can be determned: 1 ( x = n u, c f, (5) ρ n = 0 = 0 ρ( x, = f. (6) The computatonal doman s smlar to a regular grd used for fnte dfference algorthms. For a more detaled descrpton of the Boltmann equaton and the collson operator we refer the reader to [5]. The current study focuses on 3D flow domans: we used the D3Q15 lattce structure, dsplayed n fg. 1 for a sngle grd node. The weghtng factors are: ω = 16/72 for = 0, ω = 8/72 for = 1 6, and ω = 1/72 for = The boundary condtons (nlet, outlet and wall) are crucal for any flud flow computaton. For the LBM, the macroscopc quanttes (flow rate/pressure) can not be drectly mposed at nlet and outlet. Instead, the known values of the macroscopc quanttes are used for computng the unknown dstrbuton functons near the boundary. For the nlet and outlet of the doman we used Zou-He [13] boundary condtons wth known velocty. For the outlet we used homogeneous Neumann boundary condton. The arteral geometry has complex boundares n patent-specfc blood flow computatons, and hence, for mprovng the accuracy of the results, we used advanced bounce-back boundary condtons based on nterpolatons [14]. The sold walls are defned as an sosurface of a scalar feld, commonly known as the level-set functon. B. GPU based parallel mplementaton of the Lattce Boltmann Method In the followng we focus on the GPU based parallelaton of the above descrbed LBM. The GPU s vewed as a compute devce whch s able to run a very hgh number of threads n parallel nsde a kernel (a functon, wrtten n C language, whch s executed on the GPU and launched by the CPU). The GPU contans several streamng multprocessors, each of them contanng several cores. The GPU contans a certan amount of global memory to/from whch the CPU thread can wrte/read, and whch s accessble by all multprocessors. Furthermore, each multprocessor also contans shared memory and regsters whch are splt between the thread blocks and the threads, whch run on the multprocessor, respectvely. The LBM s both computatonally expensve and memory demandng [15], but ts explct nature and the data localty (the computatons for a sngle grd node requre only the values of the neghborng nodes) make t deal for parallel mplementatons. Each node can be computed at each tme step ndependently from other nodes. A frst mportant dfference between the CPU and the GPU mplementaton of the LBM s the memory arrangement. Regularly, on the CPU, a data structure contanng all the requred floatng-pont values for a grd node s defned, and then an array of ths data structure s created (the Array Of Structures approach AOS). Ths approach s not a vable soluton on the GPU because the global memory accesses would not be coalesced and would drastcally decrease the performance [16]. Instead of AOS, the Structure Of Arrays (SOA) approach has been consdered [15]: a dfferent array s allocated for each varable of a node, leadng to a total of 35 arrays, 15 for the densty functons, another 15 for swappng the new densty functons wth the old ones after the streamng step, three for the velocty, one for the Fg. 1. The D3Q15 lattce structure, frst number n the notaton s the space dmenson, whle the second one s the lattce lnks number.

3 densty and one for the level-set functon. The memory access patterns for the AOS and SOA approaches are dsplayed n fg. 2 for the three velocty components. The workflow of the GPU-based LBM mplementaton s dsplayed n fg. 3. All computatons are performed on the GPU. Therefore, hostdevce memory copy operatons are only requred when storng ntermedate (transent or unsteady flows) or fnal results (steady flows). Two dfferent kernels have been defned and are called at each teraton. The operatons n (2) (6) have been assocated to the two kernels based on the necessty of accessng nformaton from the neghborng nodes. Kernel 1 frst computes the macroscopc quanttes (velocty and densty), based on (5) and (6), by teratng through the 15 probablty dstrbuton functons. Then t apples the Zou-He boundary condtons at the nlet of the doman and t performs the collson step: frst the equlbrum dstrbuton functon s computed usng (3) and then the new probablty dstrbuton functons are determned based on (2). The second kernel focuses on the streamng step, the nterpolated bounce-back boundary condton and the outlet boundary condton. All these operatons requre nformaton from the neghborng nodes. The operatons of the second kernel are more complex snce the grd nodes located at the boundary requre a dfferent treatment than the other nodes. Ths leads to dfferent code executon paths and therefore to reduced parallelsm. However, snce relatvely few grd nodes resde next to the boundary, ths aspect s not crucal for the overall performance. The workflow of the streamng step s dsplayed n fg. 4 (for smplcty, the treatment of the nodes of the outlet boundary s not dsplayed). One can see that, f a node s surrounded n opposte drectons by sold nodes, the smple bounce back rule s appled nstead of the nterpolated bounce back rule, whch would lead to numercal dvergence. Ths case s encountered relatvely often n geometres wth complex boundares, especally around sharp edges. For both kernels, one CUDA thread s mapped to one node and snce all arrays are one-dmensonal, also the executon confguraton of the kernels s one-dmensonal, both at block and at grd level. Due to the hgh accuracy requrements of blood flow computatons, and unlke prevous researches, all computatons were performed wth double precson. Because the arrays and the executon confguraton are one-dmensonal, t s necessary Fg. 2. Memory access patterns: Array of Structures (top), Structure of Arrays (bottom). Fg. 3. LBM workflow. to map the three-dmensonal coordnates nsde the grd to a global ndex used to access the data from the arrays: = N N + j N k. (7) g y + g = N N j = k = g g y, N N N y y N N, j N. where, j and k are the node coordnates n the 3D LBM grd. Note that these values are approxmated wth the floor functon, N x, N y and N are the grd ses n each drecton and g s the global ndex of the node n the one-dmensonal array. Equatons (7) and (8) are used nsde the second kernel for fndng the global ndex of the neghbourng nodes. The LBM s usually appled for a rectangular grd. For blood flow computatons, the rectangular grd s chosen so as to nclude the arteral geometry of nterest. In ths case though, the flud nodes represent only 1/5 or less of the total number of nodes. Hence, f the nature of the nodes (flud/sold) s not taken nto account, around 80% of the allocated memory s not used and around 80% of the threads do not perform any computatons. To avod ths problem, we used an ndrect addressng scheme, dsplayed n fg. 5. Memory s only (8)

4 Fg. 4. The workflow of the second kernel n fg. 3. Fg. 5. Indrect addressng. allocated for the flud nodes and an addtonal array (called flud ndex array) s ntroduced for mappng the global ndex determned wth (7) to the flud nodes arrays (negatve values n the flud ndex array correspond to sold nodes). The content of the flud ndex array s determned n the preprocessng stage on the CPU and s requred only durng the streamng step. Snce for the operatons performed nsde the frst kernel n fg. 3 no nformaton from the neghborng nodes s requred, the executon confguraton of the frst kernel s set up so as to generate a number of threads equal to the number of flud nodes. For the second kernel on the other sde, the number of threads n the executon confguraton s set equal to the total number of nodes, to avod the necessty of a search operaton n the flud ndex array. III. RESULTS To compare the performance of the CPU based mplementaton of the LBM wth the GPU based mplementaton for double precson computatons, we consdered three dfferent NVIDIA GPU cards: GeForce GTX 460, GeForce GTX 650 and GeForce GTX 680 (the frst one s based on the Ferm archtecture, whle the other two are based on the Kepler archtecture). The CPU based mplementaton was run on an eght-core 7 processor usng both sngle and mult-threaded code. Parallelaton of the CPU code was performed usng OpenMP. Three dfferent 3D benchmark applcatons were frst consdered for determnng the best performng GPU card: Posseulle flow, ld-drven cavty flow and flow n an elbow shaped doman. Dfferent grd resolutons were consdered and table I dsplays the executon tmes for all test cases, correspondng to one computaton step. The performance mprovements are sgnfcant and demonstrate that a GPU based mplementaton of the LBM s superor to a mult-core CPU based mplementaton. The best performance s obtaned for the GTX 680 (see table I). The speed-up s computed based on the mult-threaded CPU code. The speed-up compared to the sngle-threaded CPU code vares between 150x and 290x. Note that the performance of the GTX 650 card s on average around 2x lower than of the GTX 460. Ths confrms the concerns rased for the frst GPUs of the Kepler archtecture, the performance of whch are n fact lower than for the prevously released cards of the 400 and 500 GeForce seres (wth the advantage of lower power consumpton). Once the GTX680 was determned as best performng GPU card for double-precson 3D computatons, we used t to compute blood flow n a patent-specfc aorta model wth coarctaton, whch was recently used n a CFD challenge [12]. To obtan the correspondence between the lattce unts and the physcal unts, we used the method descrbed n [17]. The computatons were ntaled wth the equlbrum dstrbuton functon, and for the current research actvty we focused on steady-state computatons,.e. we mposed the average value of the flow rate profle specfed n the challenge. The grd se was set to 92x156x428 ( nodes), of whch only represented flud nodes (less than 10%). The total number of computaton steps to obtan convergence strongly depends on the grd resoluton,.e. the tme needed by the pressure wave to propagate from one end to the other, an aspect whch s gven by the lattce speed of sound. Fg. 6 dsplays the computaton results obtaned after tme steps (the converged soluton). Followng the dea n [18], namely that lower occupancy leads to better performance, we tested dfferent executon confguratons. The executon tmes obtaned for dfferent thread block confguratons, for the entre computaton, are dsplayed n table II alongsde the executon tme for the mult-threaded CPU code. As has been reported prevously [15], executon confguratons wth fewer threads per block lead to better performance. The best performng executon confguraton s wth 128 threads per block and the speed-up compared to the executon tme of the mult-threaded CPU mplementaton s of 19.42x.

5 TABLE I. EXECUTION TIMES OF BENCHMARKING APPLICATIONS FOR ONE COMPUTATION STEP FOR DIFFERENT GRID CONFIGURATIONS. Benchmark case Posseulle flow Ld-drven cavty flow Elbow Grd resoluton Snglethreaded CPU code [ms] Multthreaded CPU code [ms] GeForce GTX 680 GeForce GTX 650 GeForce GTX 460 Tme [ms] Speed-Up Tme [ms] Speed-Up Tme [ms] Speed-Up 100x100x x50x x25x x100x x50x x25x x200x x100x Fg. 7. Comparson of basc vs optmed LBM GPU mplementaton. was allocated for all nodes, ncludng the sold nodes), used four kernels for the operatons of the LBM at each teraton, and executed all kernels wth a total number of threads equal to the total number of nodes. The results are dsplayed n fg. 7 for dfferent thread block confguratons and show that the optmaton actvtes are crucal for the speed-up (wth the basc LBM GPU verson, the speed-up s of only 4.41x compared to the mult-threaded CPU code). The speed-up of the optmed LBM GPU verson compared to the basc LBM GPU verson s of 4.40x. Fg. 6. Computaton result (streamlnes) for the patent-specfc coarctaton geometry. TABLE II. COMPARISON OF EXECUTION TIMES FOR DIFFERENT EXECUTION CONFIGURATIONS Confguraton Executon tme [s] GPU - 64 threads/block GPU threads/block GPU threads/block GPU threads/block GPU threads/block CPU - multthreaded The mplementaton and optmaton aspects descrbed n the prevous secton were desgned specfcally for blood flow computatons. To evaluate the mpact of these actvtes we also performed the flow computatons n the same model wth a basc verson of the LBM GPU mplementaton. The basc LBM GPU verson dd not use ndrect addressng (memory IV. DISCUSSION AND CONCLUSIONS In ths paper, we ntroduced a GPU-based parallel mplementaton of the Lattce Boltmann Method, optmed for patent-specfc blood flow computatons. Double precson computatons were employed for hgher accuracy and three dfferent NVIDIA GPU cards were consdered. Based on three 3D benchmarkng applcatons, the GTX680 card was determned as best performng GPU and was subsequently used to compute blood flow n a aorta geometry wth coarctaton. To our knowledge, ths s the frst work to evaluate the potental of Kepler archtecture GPU cards for acceleratng the executon of the LBM. Moreover, t s the frst paper to consder double precson computatons for hgher accuracy. A detaled comparson wth prevous mplementatons [7-10] s dffcult to perform snce generally the mplementatons are optmed for specfc actvtes and dfferent GPUs have been used n dfferent studes. However, the overall results obtaned heren are remarkable: the speed-up over a sngle-threaded CPU mplementaton vares between 150x and 290x, whereas prevously a speed-up of 100x was reported [10]. The speed-up of the CoA geometry blood flow computaton was of 19.42x

6 compared to a mult-threaded CPU mplementaton, whereas prevously a speed-up of 28x was reported, but for a mult- GPU and not a sngle GPU mplementaton [9]. The optmaton actvtes were desgned for patentspecfc blood flow computatons n general (not n partcular for the coarctaton geometry), where the rato of flud nodes to total number of nodes s usually around 1/5 or less. Hence we used an ndrect adressng scheme and allocated memory only for the flud nodes. Furthermore, the operatons were grouped nto two kernels: the frst one performs operatons for whch nformaton from neghborng nodes s not requred, whle the second one uses nformaton from neghborng nodes. Ths way the number of kernels s reduced, and t was possble to use an executon confguraton wth reduced number of threads for the operatons for whch nformaton from the neghborng nodes s not requred. As proposed n the CFD challenge [12], we only consdered rgd wall computatons. If elastc arteral walls are consdered, then the flud ndex array n fg. 5 has to be recomputed at each tme step snce the classfcaton of nodes nto flud and sold nodes changes over tme. All LBM based results reported for [12] were obtaned for CPU based mplementatons. Although the LBM s faster than the classc CFD approach, based on the Naver-Stokes equatons, the acceleraton of the executon tme remans a crucal task for several reasons. Frst of all, when blood flow s modelled n patent-specfc geometres n a clncal settng, results are requred n a tmely manner not only to potentally treat the patent faster, but also to perform computatons for more patents n a certan amount of tme. Furthermore, when performng patent-specfc computatons, t s necessary to match certan patent-specfc characterstcs, lke pressure or flow rates. Hence, the parameters of the model need be tuned, and the computaton needs to be run repeatedly for the same geometry, thus ncreasng the total executon tme for a sngle patent [19]. Several future work actvtes have been dentfed. From a computatonal pont of vew, the global memory accesses of the second kernel can be further optmed, and a mult-gpu based mplementaton wll be consdered for further decreasng the executon tme. From a modelng pont of vew, for more severe coarctatons than the one dsplayed n fg. 6, the Reynolds number ncreases consderably and a Smagornsky sub-grd model needs to be employed [9]. [3] J. Bernsdorf, and D. Wang, Non-Newtonan blood flow smulaton n cerebral aneurysms, Computers & Mathematcs wth Applcatons, vol. 58 pp , [4] A.M. Artol, A.G. Hoekstra, and P.M.A. Sloot, Mesoscopc smulatons of systolc flow n the human abdomnal aorta, Journal of Bomechancs, vol. 39, pp , [5] S. Succ, The Lattce Boltmann Equaton - For Flud Dynamcs and Beyond. New York: Oxford Unversty Press, [6] D. Krk, and W.M. Hwu, Programmng Massvely Parallel Processors: A Hands-on Approach. London: Elsever, [7] P. Baley, J. Myre, S.D.C. Walsh, D.J. Llja, and M.O. Saar, Acceleratng lattce Boltmann flud flow smulatons usng graphcs processors, IEEE Internatonal Conference on Parallel Processng, Venna, Austra, pp , Sept [8] M. Bernasch, M. Fatca, S. Melchonna, S. Succ, and E. Kaxras, A flexble hgh-performance lattce Boltmann GPU code for the smulatons of flud flows n complex geometres, Concurrency Computaton: Practce & Experence, vol. 22, pp. 1-14, [9] C. Obrecht, F. Kunk, B. Tourancheau, and J.-J. Roux, Towards urban-scale flow smulatons usng the Lattce Boltmann Method, Buldng Smulaton Conference, Sydney, Australa, pp , Nov [10] J. Tölke, and M. Krafcyk, TeraFLOP computng on a desktop PC wth GPUs for 3D CFD, Internatonal Journal of Computatonal Flud Dynamcs, vol. 22, pp , [11] R.E. Rngel, and K. Jenkns, Coarctaton of the aorta stent tral (coas, 2007, [12] ***, CFD Challenge: Smulaton of Hemodynamcs n a Patent-Specfc Aortc Coarctaton Model, [13] Q. Zou, and X. He, On pressure and velocty boundary condtons for the Lattce Boltmann BGK model, Physcs of Fluds, vol. 9, pp , [14] M. Boud, M. Frdaouss, and P. Lallemand, Momentum transfer of a Boltmann-Lattce flud wth boundares, Physcs of Fluds, vol. 13, pp , [15] M. Astorno, J. Becerra Sagredo, and A. Quarteron, A modular lattce Boltmann solver for GPU computng processors, SeMA journal, vol. 59, pp , [16] NVIDIA Corporaton: CUDA, Compute Unfed Devce Archtecture Best Practces Gude v5.0 (2013). [17] J. Latt, Hydrodynamc lmt of lattce Boltmann equatons, PhD Thess, Unverste de Geneve, Geneve, Swterland, [18] V. Volkov, Better performance at lower occupancy, GPU Technology Conference, San Jose, USA, [19] D.R.Golbert, P.J. Blanco, A. Clausse, and R.A. Fejóo, Tunng a Lattce-Boltmann model for applcatons n computatonal hemodynamcs, Medcal Engneerng & Physcs, vol. 34, pp , ACKNOWLEDGMENT Ths work s supported by the program Partnershps n Prorty Domans (PN II), fnanced by ANCS, CNDI - UEFISCDI, under the project nr. 130/2012. REFERENCES [1] C.A. Taylor, and D.A. Stenman, Image-based modelng of blood flow and vessel wall dynamcs: applcatons, methods and future drectons, Annals of Bomedcal Engneerng, vol. 38, pp , [2] S. Melchonna, M. Bernasch, S. Succ, E. Kaxras, F.J. Rybck, Mtsouras D, et al., Hydroknetc approach to large-scale cardovascular blood flow, Computer Physcs Communcatons, vol. 181, pp , 2010.

S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION?

S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION? S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION? Célne GALLET ENSICA 1 place Emle Bloun 31056 TOULOUSE CEDEX e-mal :cgallet@ensca.fr Jean Luc LACOME DYNALIS Immeuble AEROPOLE - Bat 1 5, Avenue Albert

More information

Efficient parallel implementation of the lattice Boltzmann method on large clusters of graphic processing units

Efficient parallel implementation of the lattice Boltzmann method on large clusters of graphic processing units Invted Artcle Computer Scence & Technology March 2012 Vol.57 No.7: 707 715 do: 10.1007/s11434-011-4908-y SPECIAL TOPICS: Effcent parallel mplementaton of the lattce Boltzmann method on large clusters of

More information

Research Article Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU

Research Article Performance Optimization of 3D Lattice Boltzmann Flow Solver on a GPU Hndaw Scentfc Programmng Volume 2017, Artcle ID 1205892, 16 pages https://do.org/10.1155/2017/1205892 Research Artcle Performance Optmzaton of 3D Lattce Boltzmann Flow Solver on a GPU Nhat-Phuong Tran,

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier Some materal adapted from Mohamed Youns, UMBC CMSC 611 Spr 2003 course sldes Some materal adapted from Hennessy & Patterson / 2003 Elsever Scence Performance = 1 Executon tme Speedup = Performance (B)

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Dynamic wetting property investigation of AFM tips in micro/nanoscale

Dynamic wetting property investigation of AFM tips in micro/nanoscale Dynamc wettng property nvestgaton of AFM tps n mcro/nanoscale The wettng propertes of AFM probe tps are of concern n AFM tp related force measurement, fabrcaton, and manpulaton technques, such as dp-pen

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Accounting for the Use of Different Length Scale Factors in x, y and z Directions 1 Accountng for the Use of Dfferent Length Scale Factors n x, y and z Drectons Taha Soch (taha.soch@kcl.ac.uk) Imagng Scences & Bomedcal Engneerng, Kng s College London, The Rayne Insttute, St Thomas Hosptal,

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Chapter 6 Programmng the fnte element method Inow turn to the man subject of ths book: The mplementaton of the fnte element algorthm n computer programs. In order to make my dscusson as straghtforward

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

Array transposition in CUDA shared memory

Array transposition in CUDA shared memory Array transposton n CUDA shared memory Mke Gles February 19, 2014 Abstract Ths short note s nspred by some code wrtten by Jeremy Appleyard for the transposton of data through shared memory. I had some

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Modeling of Airfoil Trailing Edge Flap with Immersed Boundary Method

Modeling of Airfoil Trailing Edge Flap with Immersed Boundary Method Downloaded from orbt.dtu.dk on: Sep 27, 2018 Modelng of Arfol Tralng Edge Flap wth Immersed Boundary Method Zhu, We Jun; Shen, Wen Zhong; Sørensen, Jens Nørkær Publshed n: ICOWEOE-2011 Publcaton date:

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

A Product Model based Approach to Interactive CAE Design Optimization

A Product Model based Approach to Interactive CAE Design Optimization A Product Model based Approach to Interactve CAE Desgn Optmzaton T. Fahrg, B. Nachtwey, S. Geller, J. Tölke, M. Krafczyk Insttut für Computeranwendungen m Baungeneurwesen (CAB) Technsche Unverstät Braunschweg,

More information

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis Internatonal Mathematcal Forum, Vol. 6,, no. 7, 8 Soltary and Travelng Wave Solutons to a Model of Long Range ffuson Involvng Flux wth Stablty Analyss Manar A. Al-Qudah Math epartment, Rabgh Faculty of

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Modeling, Manipulating, and Visualizing Continuous Volumetric Data: A Novel Spline-based Approach

Modeling, Manipulating, and Visualizing Continuous Volumetric Data: A Novel Spline-based Approach Modelng, Manpulatng, and Vsualzng Contnuous Volumetrc Data: A Novel Splne-based Approach Jng Hua Center for Vsual Computng, Department of Computer Scence SUNY at Stony Brook Talk Outlne Introducton and

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory

Virtual Memory. Background. No. 10. Virtual Memory: concept. Logical Memory Space (review) Demand Paging(1) Virtual Memory Background EECS. Operatng System Fundamentals No. Vrtual Memory Prof. Hu Jang Department of Electrcal Engneerng and Computer Scence, York Unversty Memory-management methods normally requres the entre process

More information

Module 6: FEM for Plates and Shells Lecture 6: Finite Element Analysis of Shell

Module 6: FEM for Plates and Shells Lecture 6: Finite Element Analysis of Shell Module 6: FEM for Plates and Shells Lecture 6: Fnte Element Analyss of Shell 3 6.6. Introducton A shell s a curved surface, whch by vrtue of ther shape can wthstand both membrane and bendng forces. A shell

More information

An Influence of the Noise on the Imaging Algorithm in the Electrical Impedance Tomography *

An Influence of the Noise on the Imaging Algorithm in the Electrical Impedance Tomography * Open Journal of Bophyscs, 3, 3, 7- http://dx.do.org/.436/ojbphy.3.347 Publshed Onlne October 3 (http://www.scrp.org/journal/ojbphy) An Influence of the Nose on the Imagng Algorthm n the Electrcal Impedance

More information

Multiblock method for database generation in finite element programs

Multiblock method for database generation in finite element programs Proc. of the 9th WSEAS Int. Conf. on Mathematcal Methods and Computatonal Technques n Electrcal Engneerng, Arcachon, October 13-15, 2007 53 Multblock method for database generaton n fnte element programs

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

UrbaWind, a Computational Fluid Dynamics tool to predict wind resource in urban area

UrbaWind, a Computational Fluid Dynamics tool to predict wind resource in urban area UrbaWnd, a Computatonal Flud Dynamcs tool to predct wnd resource n urban area Karm FAHSSIS a, Gullaume DUPONT a, Perre LEYRONNAS a a Meteodyn, Nantes, France Presentng Author: Karm.fahsss@meteodyn.com,

More information

COMPARISON OF TWO MODELS FOR HUMAN EVACUATING SIMULATION IN LARGE BUILDING SPACES. University, Beijing , China

COMPARISON OF TWO MODELS FOR HUMAN EVACUATING SIMULATION IN LARGE BUILDING SPACES. University, Beijing , China COMPARISON OF TWO MODELS FOR HUMAN EVACUATING SIMULATION IN LARGE BUILDING SPACES Bn Zhao 1, 2, He Xao 1, Yue Wang 1, Yuebao Wang 1 1 Department of Buldng Scence and Technology, Tsnghua Unversty, Bejng

More information

AADL : about scheduling analysis

AADL : about scheduling analysis AADL : about schedulng analyss Schedulng analyss, what s t? Embedded real-tme crtcal systems have temporal constrants to meet (e.g. deadlne). Many systems are bult wth operatng systems provdng multtaskng

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.15 No.10, October 2015 1 Evaluaton of an Enhanced Scheme for Hgh-level Nested Network Moblty Mohammed Babker Al Mohammed, Asha Hassan.

More information

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm Resolvng Ambguty n Depth Extracton for Moton Capture usng Genetc Algorthm Yn Yee Wa, Ch Kn Chow, Tong Lee Computer Vson and Image Processng Laboratory Dept. of Electronc Engneerng The Chnese Unversty of

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method Internatonal Journal of Computatonal and Appled Mathematcs. ISSN 89-4966 Volume, Number (07), pp. 33-4 Research Inda Publcatons http://www.rpublcaton.com An Accurate Evaluaton of Integrals n Convex and

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

COMBINED VERTEX-BASED CELL-CENTRED FINITE VOLUME METHOD FOR FLOWS IN COMPLEX GEOMETRIES

COMBINED VERTEX-BASED CELL-CENTRED FINITE VOLUME METHOD FOR FLOWS IN COMPLEX GEOMETRIES Thrd Internatonal Conference on CFD n the Mnerals and Process Industres CSIRO, Melbourne, Australa 1-1 December 3 COMBINED VERTEX-BASED CELL-CENTRED FINITE VOLUME METHOD FOR FLOWS IN COMPLEX GEOMETRIES

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

An inverse problem solution for post-processing of PIV data

An inverse problem solution for post-processing of PIV data An nverse problem soluton for post-processng of PIV data Wt Strycznewcz 1,* 1 Appled Aerodynamcs Laboratory, Insttute of Avaton, Warsaw, Poland *correspondng author: wt.strycznewcz@lot.edu.pl Abstract

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Flow over Broad Crested Weirs: Comparison of 2D and 3D Models

Flow over Broad Crested Weirs: Comparison of 2D and 3D Models Journal of Cvl Engneerng and Archtecture 11 (2017) 769-779 do: 10.17265/1934-7359/2017.08.005 D DAVID PUBLISHING Flow over Broad Crested Wers: Comparson of 2D and 3D Models Shaymaa A. M. Al-Hashm 1, Huda

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Electrical analysis of light-weight, triangular weave reflector antennas

Electrical analysis of light-weight, triangular weave reflector antennas Electrcal analyss of lght-weght, trangular weave reflector antennas Knud Pontoppdan TICRA Laederstraede 34 DK-121 Copenhagen K Denmark Emal: kp@tcra.com INTRODUCTION The new lght-weght reflector antenna

More information

Simulation of a Ship with Partially Filled Tanks Rolling in Waves by Applying Moving Particle Semi-Implicit Method

Simulation of a Ship with Partially Filled Tanks Rolling in Waves by Applying Moving Particle Semi-Implicit Method Smulaton of a Shp wth Partally Flled Tanks Rollng n Waves by Applyng Movng Partcle Sem-Implct Method Jen-Shang Kouh Department of Engneerng Scence and Ocean Engneerng, Natonal Tawan Unversty, Tape, Tawan,

More information

An Efficient Garbage Collection for Flash Memory-Based Virtual Memory Systems

An Efficient Garbage Collection for Flash Memory-Based Virtual Memory Systems S. J and D. Shn: An Effcent Garbage Collecton for Flash Memory-Based Vrtual Memory Systems 2355 An Effcent Garbage Collecton for Flash Memory-Based Vrtual Memory Systems Seunggu J and Dongkun Shn, Member,

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Configuration Management in Multi-Context Reconfigurable Systems for Simultaneous Performance and Power Optimizations*

Configuration Management in Multi-Context Reconfigurable Systems for Simultaneous Performance and Power Optimizations* Confguraton Management n Mult-Context Reconfgurable Systems for Smultaneous Performance and Power Optmzatons* Rafael Maestre, Mlagros Fernandez Departamento de Arqutectura de Computadores y Automátca Unversdad

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain AMath 483/583 Lecture 21 May 13, 2011 Today: OpenMP and MPI versons of Jacob teraton Gauss-Sedel and SOR teratve methods Next week: More MPI Debuggng and totalvew GPU computng Read: Class notes and references

More information

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach

Data Representation in Digital Design, a Single Conversion Equation and a Formal Languages Approach Data Representaton n Dgtal Desgn, a Sngle Converson Equaton and a Formal Languages Approach Hassan Farhat Unversty of Nebraska at Omaha Abstract- In the study of data representaton n dgtal desgn and computer

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Preconditioning Parallel Sparse Iterative Solvers for Circuit Simulation

Preconditioning Parallel Sparse Iterative Solvers for Circuit Simulation Precondtonng Parallel Sparse Iteratve Solvers for Crcut Smulaton A. Basermann, U. Jaekel, and K. Hachya 1 Introducton One mportant mathematcal problem n smulaton of large electrcal crcuts s the soluton

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

VISCOELASTIC SIMULATION OF BI-LAYER COEXTRUSION IN A SQUARE DIE: AN ANALYSIS OF VISCOUS ENCAPSULATION

VISCOELASTIC SIMULATION OF BI-LAYER COEXTRUSION IN A SQUARE DIE: AN ANALYSIS OF VISCOUS ENCAPSULATION VISCOELASTIC SIMULATION OF BI-LAYER COEXTRUSION IN A SQUARE DIE: AN ANALYSIS OF VISCOUS ENCAPSULATION Mahesh Gupta Mchgan Technologcal Unversty Plastc Flow, LLC Houghton, MI 49931 Hancock, MI 49930 Abstract

More information

DESIGN OF A HAPTIC DEVICE FOR EXCAVATOR EQUIPPED WITH CRUSHER

DESIGN OF A HAPTIC DEVICE FOR EXCAVATOR EQUIPPED WITH CRUSHER DESIGN OF A HAPTIC DEVICE FOR EXCAVATOR EQUIPPED WITH CRUSHER Kyeong Won Oh, Dongnam Km Korea Unversty, Graduate School 5Ga-1, Anam-Dong, Sungbuk-Gu, Seoul, Korea {locosk, smleast}@korea.ac.kr Jong-Hyup

More information

CMPS 10 Introduction to Computer Science Lecture Notes

CMPS 10 Introduction to Computer Science Lecture Notes CPS 0 Introducton to Computer Scence Lecture Notes Chapter : Algorthm Desgn How should we present algorthms? Natural languages lke Englsh, Spansh, or French whch are rch n nterpretaton and meanng are not

More information

Inverse kinematic Modeling of 3RRR Parallel Robot

Inverse kinematic Modeling of 3RRR Parallel Robot ème Congrès Franças de Mécanque Lyon, 4 au 8 Août 5 Inverse knematc Modelng of RRR Parallel Robot Ouafae HAMDOUN, Fatma Zahra BAGHLI, Larb EL BAKKALI Modelng and Smulaton of Mechancal Systems Laboratory,

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Towards sibilant /s/ modelling: preliminary computational results

Towards sibilant /s/ modelling: preliminary computational results Acoustcs 8 Pars Towards sblant /s/ modellng: prelmnary computatonal results X. Grandchamp a, A. Van rtum a, X. Pelorson a, K. Nozak b and S. Shmoo b a Département Parole & Cognton, GIPSA-lab, 46, avenue

More information

Efficient Distributed File System (EDFS)

Efficient Distributed File System (EDFS) Effcent Dstrbuted Fle System (EDFS) (Sem-Centralzed) Debessay(Debsh) Fesehaye, Rahul Malk & Klara Naherstedt Unversty of Illnos-Urbana Champagn Contents Problem Statement, Related Work, EDFS Desgn Rate

More information

IP Camera Configuration Software Instruction Manual

IP Camera Configuration Software Instruction Manual IP Camera 9483 - Confguraton Software Instructon Manual VBD 612-4 (10.14) Dear Customer, Wth your purchase of ths IP Camera, you have chosen a qualty product manufactured by RADEMACHER. Thank you for the

More information

ELEC 377 Operating Systems. Week 6 Class 3

ELEC 377 Operating Systems. Week 6 Class 3 ELEC 377 Operatng Systems Week 6 Class 3 Last Class Memory Management Memory Pagng Pagng Structure ELEC 377 Operatng Systems Today Pagng Szes Vrtual Memory Concept Demand Pagng ELEC 377 Operatng Systems

More information

Kinematics of pantograph masts

Kinematics of pantograph masts Abstract Spacecraft Mechansms Group, ISRO Satellte Centre, Arport Road, Bangalore 560 07, Emal:bpn@sac.ernet.n Flght Dynamcs Dvson, ISRO Satellte Centre, Arport Road, Bangalore 560 07 Emal:pandyan@sac.ernet.n

More information

AP PHYSICS B 2008 SCORING GUIDELINES

AP PHYSICS B 2008 SCORING GUIDELINES AP PHYSICS B 2008 SCORING GUIDELINES General Notes About 2008 AP Physcs Scorng Gudelnes 1. The solutons contan the most common method of solvng the free-response questons and the allocaton of ponts for

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Distance Calculation from Single Optical Image

Distance Calculation from Single Optical Image 17 Internatonal Conference on Mathematcs, Modellng and Smulaton Technologes and Applcatons (MMSTA 17) ISBN: 978-1-6595-53-8 Dstance Calculaton from Sngle Optcal Image Xao-yng DUAN 1,, Yang-je WEI 1,,*

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Constructing Minimum Connected Dominating Set: Algorithmic approach

Constructing Minimum Connected Dominating Set: Algorithmic approach Constructng Mnmum Connected Domnatng Set: Algorthmc approach G.N. Puroht and Usha Sharma Centre for Mathematcal Scences, Banasthal Unversty, Rajasthan 304022 usha.sharma94@yahoo.com Abstract: Connected

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

PHYSICS-ENHANCED L-SYSTEMS

PHYSICS-ENHANCED L-SYSTEMS PHYSICS-ENHANCED L-SYSTEMS Hansrud Noser 1, Stephan Rudolph 2, Peter Stuck 1 1 Department of Informatcs Unversty of Zurch, Wnterthurerstr. 190 CH-8057 Zurch Swtzerland noser(stuck)@f.unzh.ch, http://www.f.unzh.ch/~noser(~stuck)

More information