Iterative methods for use with the Fast Multipole Method
|
|
- Georgina Margaret Quinn
- 5 years ago
- Views:
Transcription
1 Iterative methods for use with the Fast Multipole Method Ramani Duraiswami Perceptual Interfaces and Reality Lab. Computer Science & UMIACS University of Maryland, College Park, MD Joint work with Nail A. Gumerov
2 Fast Multipole Methods Follows from seminal work of Rokhlin and Greengard (1987) General method for accelerating large classes of dense matrix vector products Apply to matrices whose entries are associated with computations of pairs of points Reduce computational/memory complexity from O(N 2 ) to O(N log N) Potentially reduce many O(N 2 )/O(N 3 ) calculations to O(N log N) Nail Gumerov and I are applying it to many areas Acoustics (JASA 2002,2005), Room impulse responses (IEEE Trans. SAP, 2006), book (Elsevier, 2005) Maxwell s equations in 3D and scattering (submitted, 2006) Fast statistics (NIPS 2004), similarity measures (CVPR 2005), image processing, segmentation (ICCV 2003), tracking, learning (Data Mining 2006) Non uniform fast Fourier transforms and reconstruction Biharmonic equation (JCP 2006), fitting thin-plate splines
3 Solving Linear Systems Typically the FMM is used with an iterative method some preliminary work aims at directly constructing the inverse approximately (Rokhlin, Martinsson, Gimbutas, Cheng) FMM reduces the cost of the matrix-vector product step in the iterative solution of linear systems, and for an iteration requiring N iter steps, the complexity is O(N iter NlogN). To bound N iter appropriate pre-conditioning strategies must be used with the dense matrix. However, many conventional pre-conditioning strategies rely on sparsity in the matrix, or are ineffective
4 Preconditioner cost To be effective with the FMM the preconditioner cost should be O(N log N) or less Otherwise we negate the advantage of the FMM We present results for two preconditioners we have used First, for highly oscillatory kernel --- multiple scattering with the Helmholtz equation Second, we accelerate a highly successful preconditioner developed by Faul, Powell and Goodsell (2005), but which has a complexity of O(N 2 ) to O(N log N).
5 Radial basis function (RBF) interpolation Construct interpolating function for scattered data, with the interpolating function expressed as a sum of RBFs centered at the data points. Much work by Powell, Beatson and coworkers Culmination of this work: an iterative Krylov subspace algorithm Faul et al. (2005) A. C. Faul, G. Goodsell, M. J. D. Powell, "A Krylov subspace algorithm for multiquadric interpolation in many dimensions," IMA Journal of Numerical Analysis (2005) 25, 1--24
6 RBF Interpolation x are points in R d f are values of the function to be interpolated s is the interpolating function λ are the interpolation coefficients Interpolating RBFs are multiquadric or biharmonic
7 Linear system \Phi is the RBF
8 Preconditioner via cardinal functions Beatson and Powell (1992) created preconditioners via approximate cardinal functions Preconditioner must be close to the inverse Cardinal function has value one at the point and zeros at all other points Thus a cardinal function c satisfies Ac = [ ] t If we stack cardinal functions for each row, we obtain A -1 f BP92, proposed to use the smoothness of the interpolant and required the cardinality properties at each point and several other points in the domain (not the whole set) Selection of which points to include and which points to exclude, and their influence on preconditioner performance is the subject of several subsequent papers by the groups of Powell and Beatson.
9 Empirical comparison of iterative methods We compared several strategies proposed by the authors, and found that the proposal of Faul et al. 2005, works best For some of the proposed strategies, for very large (10 6 or more points), some of the proposed preconditioners do not converge within 1000 iterations. However Faul et al. s preconditioned iterative algorithm converges robustly within about 50 iterations.
10 Accelerating the Faul et al. algorithm Matrix vector product can be accelerated via the FMM (e.g., Gumerov and Duraiswami, 2006, JCP) Choice of points for the approximate cardinal functions is very particular in Faul et al. Chooses the point that is a member of the closest pair of points, and its q closest neighbors and builds an interpolant The first point is eliminated from future consideration Algorithm is recursively applied and a fine-to-coarse structure of the sets of approximate cardinal functions are obtained Preconditioned Conjugate Direction algorithm is applied along the directions of these functions Search for q closest neighbors with removal of points from the set is O(N 2 ).
11 Converting it to O(N log N) Use FMM data structure to build lists of closest neighbors Use heaps to develop a dynamic data structure that allows deletions Employed a lemma that bounds the number of closest neighbors a point can have is a fixed number that is a function of the dimension Achieve a point selection that is O(N log N). Details in a preprint being prepared. For a 110,000 point problem, time of the preset set went down from seconds to 115 seconds in 3D biharmonic interpolation
12 Original unstructured data points RBF/FMM interpolation to regular spatial grid FMM algorithm cost per iteration:9.6 seconds on a Pentium IV 3.2 GHz desktop
13 Error Iteration number
14 Original Data: surface mesh with vertices and faces RBF interpolation to the spatial grid + isosurface (size of the grid is shown near each figure) 5x5x5 11x11x11 26x26x26 51x51x51 101x101x x201x201
15 Original Image percent samples drawn from image FMM algorithm cost per iteration:6.2 seconds on a Pentium IV 3.2 GHz desktop
16 Multiquadric Interpolation of a 2D image 10 4 Error Iteration number
17 Interpolated Result Difference Image 14 seconds to interpolate whole image from the fit data
18 Equations and Boundary Conditions Formulation Wave Equation Fourier Transform Helmholtz Equation Impedance Boundary Conditions 3 Field Decomposition Sommerfield Radiation Condition 2 Incident Wave 6
19 Scattered Field Decomposition T-Matrix Method Scattered field expressed in terms of singular (multipole) wave functions that satisfy the Sommerfeld condition Singular Basis Functions Hankel Functions Expansion Coefficients Spherical Harmonics Vector Form: dot product
20 T-Matrix Method Solution of Multiple Scattering Problem Write a system of equations relating expansion coefficients for each scatterer Effective Incident Field Coupled System of Equations: Scattered Wave (S R)-Translation Matrix Incident Wave 6 Linear system of size p 2 N p 2 N
21 T-matrix Method Three Spheres Comparisons of BEM & MultisphereHelmholtz HRTF (db) o 30 o θ 1 = 0 o 90 o o BEM MultisphereHelmholtz 30 o o 90 o 120 o o 150 o o Three Spheres, ka 1 = Angle φ 1 (deg) BEM: 5184 triangular elements MH: N trunc = 9 (100 coefficients for each sphere)
22 Iterative Methods Krylov Subspace Method (GMRES) For larger systems use an iterative method Diagonal Matrix The product of this matrix by An arbitrary input vector can be done fast with the FMM
23 Iterative Methods FGMRES Unpreconditioned LA = E To converge requires N iter multiplications LG, where G is an input vector Cost: C N iter Right Preconditioner LM -1 (MA)=E 1). Internal Loop: Solve M -1 F=E Requires N (1) iter multiplications MG 2). External Loop: Requires N (2) iter multiplications LG Cost: C (1) N (1) iter+ C N (2) iter Substantial speed up if M L and C (1) «C
24 MLFMM FMM splits influence of scatterers in to far and near Fields Neighborhood (Near Field) Far Field
25 MLFMM Computation of the Far Field (1) 1). Set Data Structure (hierarchically subdivide space with an oct-tree) 2). multipole to multipole translate S- expansions for all scatterers in a box at the finest level to the center of the box and sum up (determine contribution to Far Field for each box at the finest level). 3). Recursively multipole to multipole translate S-expansions to center of the parent box and sum up (determine contribution to Far Field for each box at all coarser levels). y x i x c (n,l) Upward Pass (From the finest to the coarsest level)
26 MLFMM Computation of the Far Field (2) Steps 4 and 5 performed one after the other recursively 4). Multipole-to-local translate S- expansions for boxes which are inside the parent neighborhood but outside the box neighborhood to the center of the box (convert S-expansion to R- expansion). Downward Pass (From the coarsest to the finest level) 5). Local-to-local translate R-expansions from the center of the box to the center of its children boxes (determine Far Field for each box at all levels). 6). local translation of R-expansions from the center of the boxes at the finest level to the centers of the spheres.
27 Preconditioned FGMRES FGMRES Choice of preconditioning operator: near (sparse) or far (dense)? Experimentally we found that the far (dense) operator works well in all tests
28 Range of Parameters Results Number of Spheres: ; ka: ; Random and regularly spaced grids of spheres; Polydispersity: (ratio to the mean radius); Volume fractions:
29 Results 100 random spheres (MLFMM) Plane Wave Plane Wave ka = 1.6 ka = 2.8
30 1000 random spheres Results ka=1
31 10000 random spheres Results ka = 0.75 kd 0 = 90
32 A posteriori Error Check Results
33 Results GMRES vs FGMRES Volume Fraction = 0.2 ka = 0.5, D 0 /a = 60, p 2 = GHz, Xeon, 3.5 MB RAM
34 Performance Test Results Volume fraction = 0.2, ka = 0.5, p 2 = FMM+FGMRES y=cx 1.25 CPU Time (s) y = cx 1.25 Total y = bx External Loop y = ax 3.2GHz, Xeon, 3.5 MB RAM Matrix-Vector Multiplication 1 Internal Loop Number of Scatterers
35 Conclusions Need for preconditioners for dense matrices for FMM based algorithms For a highly oscillatory kernel, preconditioner based on the FMM algorithm itself gives good results Preconditioner theoretical performance remains to be proved Approach that interpolates the cardinal function approximately on carefully chosen data appears very promising for smoother kernels Geometrical computations involved in this method reduced to O(N log N) Extension to other kernels of this approach is the subject of
Efficient O(N log N) algorithms for scattered data interpolation
Efficient O(N log N) algorithms for scattered data interpolation Nail Gumerov University of Maryland Institute for Advanced Computer Studies Joint work with Ramani Duraiswami February Fourier Talks 2007
More informationFMM accelerated BEM for 3D Helmholtz equation
FMM accelerated BEM for 3D Helmholtz equation Nail A. Gumerov and Ramani Duraiswami Institute for Advanced Computer Studies University of Maryland, U.S.A. also @ Fantalgo, LLC, U.S.A. www.umiacs.umd.edu/~gumerov
More informationFast Multipole Accelerated Indirect Boundary Elements for the Helmholtz Equation
Fast Multipole Accelerated Indirect Boundary Elements for the Helmholtz Equation Nail A. Gumerov Ross Adelman Ramani Duraiswami University of Maryland Institute for Advanced Computer Studies and Fantalgo,
More informationc 2007 Society for Industrial and Applied Mathematics
SIAM J. SCI. COMPUT. Vol. 29, No. 5, pp. 1876 1899 c 2007 Society for Industrial and Applied Mathematics FAST RADIAL BASIS FUNCTION INTERPOLATION VIA PRECONDITIONED KRYLOV ITERATION NAIL A. GUMEROV AND
More informationGPU accelerated heterogeneous computing for Particle/FMM Approaches and for Acoustic Imaging
GPU accelerated heterogeneous computing for Particle/FMM Approaches and for Acoustic Imaging Ramani Duraiswami University of Maryland, College Park http://www.umiacs.umd.edu/~ramani With Nail A. Gumerov,
More informationFast Multipole and Related Algorithms
Fast Multipole and Related Algorithms Ramani Duraiswami University of Maryland, College Park http://www.umiacs.umd.edu/~ramani Joint work with Nail A. Gumerov Efficiency by exploiting symmetry and A general
More informationAPPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION.
APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION Martin Pollow Institute of Technical Acoustics RWTH Aachen University Neustraße
More informationLecture 7: Introduction to HFSS-IE
Lecture 7: Introduction to HFSS-IE 2015.0 Release ANSYS HFSS for Antenna Design 1 2015 ANSYS, Inc. HFSS-IE: Integral Equation Solver Introduction HFSS-IE: Technology An Integral Equation solver technology
More informationThe Fast Multipole Method (FMM)
The Fast Multipole Method (FMM) Motivation for FMM Computational Physics Problems involving mutual interactions of N particles Gravitational or Electrostatic forces Collective (but weak) long-range forces
More informationA Broadband Fast Multipole Accelerated Boundary Element Method for the 3D Helmholtz Equation. Abstract
A Broadband Fast Multipole Accelerated Boundary Element Method for the 3D Helmholtz Equation Nail A. Gumerov and Ramani Duraiswami (Dated: 31 December 2007, Revised 22 July 2008, Revised 03 October 2008.)
More informationFast Spherical Filtering in the Broadband FMBEM using a nonequally
Fast Spherical Filtering in the Broadband FMBEM using a nonequally spaced FFT Daniel R. Wilkes (1) and Alec. J. Duncan (1) (1) Centre for Marine Science and Technology, Department of Imaging and Applied
More informationA Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions
Shape Modeling International 2003 Seoul, Korea A Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions Yutaa Ohtae Alexander Belyaev Hans-Peter Seidel Objective
More informationFast Multipole Methods. Linear Systems. Matrix vector product. An Introduction to Fast Multipole Methods.
An Introduction to Fast Multipole Methods Ramani Duraiswami Institute for Advanced Computer Studies University of Maryland, College Park http://www.umiacs.umd.edu/~ramani Joint work with Nail A. Gumerov
More informationKernel Independent FMM
Kernel Independent FMM FMM Issues FMM requires analytical work to generate S expansions, R expansions, S S (M2M) translations S R (M2L) translations R R (L2L) translations Such analytical work leads to
More informationFMM Data Structures. Content. Introduction Hierarchical Space Subdivision with 2 d -Trees Hierarchical Indexing System Parent & Children Finding
FMM Data Structures Nail Gumerov & Ramani Duraiswami UMIACS [gumerov][ramani]@umiacs.umd.edu CSCAMM FAM4: 4/9/4 Duraiswami & Gumerov, -4 Content Introduction Hierarchical Space Subdivision with d -Trees
More informationTransactions on Modelling and Simulation vol 20, 1998 WIT Press, ISSN X
Parallel indirect multipole BEM analysis of Stokes flow in a multiply connected domain M.S. Ingber*, A.A. Mammoli* & J.S. Warsa* "Department of Mechanical Engineering, University of New Mexico, Albuquerque,
More informationA Kernel-independent Adaptive Fast Multipole Method
A Kernel-independent Adaptive Fast Multipole Method Lexing Ying Caltech Joint work with George Biros and Denis Zorin Problem Statement Given G an elliptic PDE kernel, e.g. {x i } points in {φ i } charges
More informationTerascale on the desktop: Fast Multipole Methods on Graphical Processors
Terascale on the desktop: Fast Multipole Methods on Graphical Processors Nail A. Gumerov Fantalgo, LLC Institute for Advanced Computer Studies University of Maryland (joint work with Ramani Duraiswami)
More informationParallel Interpolation in FSI Problems Using Radial Basis Functions and Problem Size Reduction
Parallel Interpolation in FSI Problems Using Radial Basis Functions and Problem Size Reduction Sergey Kopysov, Igor Kuzmin, Alexander Novikov, Nikita Nedozhogin, and Leonid Tonkov Institute of Mechanics,
More informationinter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE
Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 7.6 EFFICIENT ACOUSTIC
More informationComputation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation
Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation Nail A. Gumerov, Adam E. O Donovan, Ramani Duraiswami,
More informationCapturing, Computing, Visualizing and Recreating Spatial Sound
Capturing, Computing, Visualizing and Recreating Spatial Sound Ramani Duraiswami University of Maryland, College Park Joint work with Dmitry Zotkin, Zhiyun Li, Elena Grassi, Adam O Donovan, Nail Gumerov,
More informationFAST MATRIX-VECTOR PRODUCT BASED FGMRES FOR KERNEL MACHINES
FAST MATRIX-VECTOR PRODUCT BASED FGMRES FOR KERNEL MACHINES BALAJI VASAN SRINIVASAN, M.S. SCHOLARLY PAPER DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF MARYLAND, COLLEGE PARK Abstract. Kernel based approaches
More informationFast multipole methods for axisymmetric geometries
Fast multipole methods for axisymmetric geometries Victor Churchill New York University May 13, 2016 Abstract Fast multipole methods (FMMs) are one of the main numerical methods used in the solution of
More informationA fast direct solver for high frequency scattering from a cavity in two dimensions
1/31 A fast direct solver for high frequency scattering from a cavity in two dimensions Jun Lai 1 Joint work with: Leslie Greengard (CIMS) Sivaram Ambikasaran (CIMS) Workshop on fast direct solver, Dartmouth
More informationThe Fast Multipole Method on NVIDIA GPUs and Multicore Processors
The Fast Multipole Method on NVIDIA GPUs and Multicore Processors Toru Takahashi, a Cris Cecka, b Eric Darve c a b c Department of Mechanical Science and Engineering, Nagoya University Institute for Applied
More informationImproved Multilevel Fast Multipole Method for Higher-Order Discretizations
Improved Multilevel Fast Multipole Method for Higher-Order Discretizations Oscar Borries 1,, Peter Meincke, Erik Jørgensen, Stig Busk Sørensen, Per Christian Hansen 1 1 Technical University of Denmark,
More informationFMM implementation on CPU and GPU. Nail A. Gumerov (Lecture for CMSC 828E)
FMM implementation on CPU and GPU Nail A. Gumerov (Lecture for CMSC 828E) Outline Two parts of the FMM Data Structure Flow Chart of the Run Algorithm FMM Cost/Optimization on CPU Programming on GPU Fast
More informationAdaptive boundary element methods in industrial applications
Adaptive boundary element methods in industrial applications Günther Of, Olaf Steinbach, Wolfgang Wendland Adaptive Fast Boundary Element Methods in Industrial Applications, Hirschegg, September 30th,
More informationWhat is Multigrid? They have been extended to solve a wide variety of other problems, linear and nonlinear.
AMSC 600/CMSC 760 Fall 2007 Solution of Sparse Linear Systems Multigrid, Part 1 Dianne P. O Leary c 2006, 2007 What is Multigrid? Originally, multigrid algorithms were proposed as an iterative method to
More informationSpherical Microphone Arrays
Spherical Microphone Arrays Acoustic Wave Equation Helmholtz Equation Assuming the solutions of wave equation are time harmonic waves of frequency ω satisfies the homogeneous Helmholtz equation: Boundary
More informationUsing GPUs to compute the multilevel summation of electrostatic forces
Using GPUs to compute the multilevel summation of electrostatic forces David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of
More informationMethod of Moments enhancement technique for the analysis of Sierpinski pre-fractal antennas. Total number of pages: 10
FRACTALCOMS Exploring the limits of Fractal Electrodynamics for the future telecommunication technologies IST-2001-33055 Method of Moments enhancement technique for the analysis of Sierpinski pre-fractal
More informationCan Xia 1, 2,WanqingYou 2, and Yufa Sun 2, *
Progress In Electromagnetics Research Letters, Vol. 81, 133 139, 2019 Fast Calculation of Monostatic Radar Cross Section of Conducting Targets Using Hierarchical Characteristic Basis Function Method and
More informationLow-rank Properties, Tree Structure, and Recursive Algorithms with Applications. Jingfang Huang Department of Mathematics UNC at Chapel Hill
Low-rank Properties, Tree Structure, and Recursive Algorithms with Applications Jingfang Huang Department of Mathematics UNC at Chapel Hill Fundamentals of Fast Multipole (type) Method Fundamentals: Low
More informationLong time integrations of a convective PDE on the sphere by RBF collocation
Long time integrations of a convective PDE on the sphere by RBF collocation Bengt Fornberg and Natasha Flyer University of Colorado NCAR Department of Applied Mathematics Institute for Mathematics Applied
More informationIntroduction to Multigrid and its Parallelization
Introduction to Multigrid and its Parallelization! Thomas D. Economon Lecture 14a May 28, 2014 Announcements 2 HW 1 & 2 have been returned. Any questions? Final projects are due June 11, 5 pm. If you are
More informationFMM CMSC 878R/AMSC 698R. Lecture 13
FMM CMSC 878R/AMSC 698R Lecture 13 Outline Results of the MLFMM tests Itemized Asymptotic Complexity of the MLFMM; Optimization of the Grouping (Clustering) Parameter; Regular mesh; Random distributions.
More informationThe meshfree computation of stationary electric current densities in complex shaped conductors using 3D boundary element methods
Boundary Elements and Other Mesh Reduction Methods XXXVII 121 The meshfree computation of stationary electric current densities in complex shaped conductors using 3D boundary element methods A. Buchau
More informationUsing a Fast Multipole Method to Accelerate Spline Evaluations
Using a Fast Multipole Method to Accelerate Spline Evaluations FANG CHEN AND DAVID SUTER Monush University, Australia 4 e In considering the problem of interpolating scattered data using spline methods,
More informationEnhanced Characteristic Basis Function Method for Solving the Monostatic Radar Cross Section of Conducting Targets
Progress In Electromagnetics Research M, Vol. 68, 173 180, 2018 Enhanced Characteristic Basis Function Method for Solving the Monostatic Radar Cross Section of Conducting Targets Jinyu Zhu, Yufa Sun *,
More informationRecent Developments in Model-based Derivative-free Optimization
Recent Developments in Model-based Derivative-free Optimization Seppo Pulkkinen April 23, 2010 Introduction Problem definition The problem we are considering is a nonlinear optimization problem with constraints:
More informationTime domain construction of acoustic scattering by elastic targets through finite element analysis
Time domain construction of acoustic scattering by elastic targets through finite element analysis Aaron Gunderson*, Blake Simon, Anthony Bonomo, Marcia Isakson Applied Research Laboratories University
More informationComparison of iteration convergences of SIE and VSIE for solving electromagnetic scattering problems for coated objects
RADIO SCIENCE, VOL. 38, NO. 2, 1028, doi:10.1029/2002rs002610, 2003 Comparison of iteration convergences of SIE and VSIE for solving electromagnetic scattering problems for coated objects Cai-Cheng Lu
More informationS0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS
S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS John R Appleyard Jeremy D Appleyard Polyhedron Software with acknowledgements to Mark A Wakefield Garf Bowen Schlumberger Outline of Talk Reservoir
More informationCMSC 858M/AMSC 698R. Fast Multipole Methods. Nail A. Gumerov & Ramani Duraiswami. Lecture 20. Outline
CMSC 858M/AMSC 698R Fast Multipole Methods Nail A. Gumerov & Ramani Duraiswami Lecture 20 Outline Two parts of the FMM Data Structures FMM Cost/Optimization on CPU Fine Grain Parallelization for Multicore
More information3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs
3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs H. Knibbe, C. W. Oosterlee, C. Vuik Abstract We are focusing on an iterative solver for the three-dimensional
More informationElectromagnetic Field Transformations for Measurements and Simulations
Progress In Electromagnetics Research, Vol. 151, 127 150, 2015 Electromagnetic Field Transformations for Measurements and Simulations Thomas F. Eibert *, Emre Kılıç, Carlos Lopez, Raimund A. M. Mauermayer,
More informationNumerical Quadrature over the Surface of a Sphere
Numerical Quadrature over the Surface of a Sphere Bengt Fornberg University of Colorado at Boulder Department of Applied Mathematics in collaboration with: Jonah Reeger Air Force Institute of Technology,
More informationXinyu Dou Acoustics Technology Center, Motorola, Inc., Schaumburg, Illinois 60196
A unified boundary element method for the analysis of sound and shell-like structure interactions. II. Efficient solution techniques Shaohai Chen and Yijun Liu a) Department of Mechanical Engineering,
More informationTechnische Universiteit Delft. Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics Processing Unit C. Wagenaar
Technische Universiteit Delft Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics Processing Unit C. Wagenaar Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics
More informationSELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND
Student Submission for the 5 th OpenFOAM User Conference 2017, Wiesbaden - Germany: SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND TESSA UROIĆ Faculty of Mechanical Engineering and Naval Architecture, Ivana
More informationContents. I The Basic Framework for Stationary Problems 1
page v Preface xiii I The Basic Framework for Stationary Problems 1 1 Some model PDEs 3 1.1 Laplace s equation; elliptic BVPs... 3 1.1.1 Physical experiments modeled by Laplace s equation... 5 1.2 Other
More informationA broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation
A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation Nail A. Gumerov a,b and amani Duraiswami b,c Perceptual Interfaces and eality Laboratory, Institute
More informationCUDA Experiences: Over-Optimization and Future HPC
CUDA Experiences: Over-Optimization and Future HPC Carl Pearson 1, Simon Garcia De Gonzalo 2 Ph.D. candidates, Electrical and Computer Engineering 1 / Computer Science 2, University of Illinois Urbana-Champaign
More informationSurfaces, meshes, and topology
Surfaces from Point Samples Surfaces, meshes, and topology A surface is a 2-manifold embedded in 3- dimensional Euclidean space Such surfaces are often approximated by triangle meshes 2 1 Triangle mesh
More informationSmoothers. < interactive example > Partial Differential Equations Numerical Methods for PDEs Sparse Linear Systems
Smoothers Partial Differential Equations Disappointing convergence rates observed for stationary iterative methods are asymptotic Much better progress may be made initially before eventually settling into
More informationMeshless physical simulation of semiconductor devices using a wavelet-based nodes generator
Meshless physical simulation of semiconductor devices using a wavelet-based nodes generator Rashid Mirzavand 1, Abdolali Abdipour 1a), Gholamreza Moradi 1, and Masoud Movahhedi 2 1 Microwave/mm-wave &
More informationConstruction and application of hierarchical matrix preconditioners
University of Iowa Iowa Research Online Theses and Dissertations 2008 Construction and application of hierarchical matrix preconditioners Fang Yang University of Iowa Copyright 2008 Fang Yang This dissertation
More informationMultigrid Pattern. I. Problem. II. Driving Forces. III. Solution
Multigrid Pattern I. Problem Problem domain is decomposed into a set of geometric grids, where each element participates in a local computation followed by data exchanges with adjacent neighbors. The grids
More informationESPRESO ExaScale PaRallel FETI Solver. Hybrid FETI Solver Report
ESPRESO ExaScale PaRallel FETI Solver Hybrid FETI Solver Report Lubomir Riha, Tomas Brzobohaty IT4Innovations Outline HFETI theory from FETI to HFETI communication hiding and avoiding techniques our new
More informationAcceleration of the Multi Level Fast Multipole Method on a GPU
Thesis National Aerospace Laboratory NLR Acceleration of the Multi Level Fast Multipole Method on a GPU Author: Claudia Wagenaar(4090772) Supervisors: H. van der Ven D. van der Heul C. Vuik February 1,
More informationA Random Variable Shape Parameter Strategy for Radial Basis Function Approximation Methods
A Random Variable Shape Parameter Strategy for Radial Basis Function Approximation Methods Scott A. Sarra, Derek Sturgill Marshall University, Department of Mathematics, One John Marshall Drive, Huntington
More informationSubdivision Curves and Surfaces
Subdivision Surfaces or How to Generate a Smooth Mesh?? Subdivision Curves and Surfaces Subdivision given polyline(2d)/mesh(3d) recursively modify & add vertices to achieve smooth curve/surface Each iteration
More informationAccelerated flow acoustic boundary element solver and the noise generation of fish
Accelerated flow acoustic boundary element solver and the noise generation of fish JUSTIN W. JAWORSKI, NATHAN WAGENHOFFER, KEITH W. MOORED LEHIGH UNIVERSITY, BETHLEHEM, USA FLINOVIA PENN STATE 27 APRIL
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra)
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 20: Sparse Linear Systems; Direct Methods vs. Iterative Methods Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 26
More informationMultigrid solvers M. M. Sussman sussmanm@math.pitt.edu Office Hours: 11:10AM-12:10PM, Thack 622 May 12 June 19, 2014 1 / 43 Multigrid Geometrical multigrid Introduction Details of GMG Summary Algebraic
More informationIntroduction to Computer Graphics. Modeling (3) April 27, 2017 Kenshi Takayama
Introduction to Computer Graphics Modeling (3) April 27, 2017 Kenshi Takayama Solid modeling 2 Solid models Thin shapes represented by single polygons Unorientable Clear definition of inside & outside
More informationA comparison of parallel rank-structured solvers
A comparison of parallel rank-structured solvers François-Henry Rouet Livermore Software Technology Corporation, Lawrence Berkeley National Laboratory Joint work with: - LSTC: J. Anton, C. Ashcraft, C.
More informationsmooth coefficients H. Köstler, U. Rüde
A robust multigrid solver for the optical flow problem with non- smooth coefficients H. Köstler, U. Rüde Overview Optical Flow Problem Data term and various regularizers A Robust Multigrid Solver Galerkin
More informationHFSS Hybrid Finite Element and Integral Equation Solver for Large Scale Electromagnetic Design and Simulation
HFSS Hybrid Finite Element and Integral Equation Solver for Large Scale Electromagnetic Design and Simulation Laila Salman, PhD Technical Services Specialist laila.salman@ansys.com 1 Agenda Overview of
More informationTree-based methods on GPUs
Tree-based methods on GPUs Felipe Cruz 1 and Matthew Knepley 2,3 1 Department of Mathematics University of Bristol 2 Computation Institute University of Chicago 3 Department of Molecular Biology and Physiology
More informationThe Spherical Harmonics Discrete Ordinate Method for Atmospheric Radiative Transfer
The Spherical Harmonics Discrete Ordinate Method for Atmospheric Radiative Transfer K. Franklin Evans Program in Atmospheric and Oceanic Sciences University of Colorado, Boulder Computational Methods in
More informationAdvanced modelling of gratings in VirtualLab software. Site Zhang, development engineer Lignt Trans
Advanced modelling of gratings in VirtualLab software Site Zhang, development engineer Lignt Trans 1 2 3 4 Content Grating Order Analyzer Rigorous Simulation of Holographic Generated Volume Grating Coupled
More informationLocal Multilevel Fast Multipole Algorithm for 3D Electromagnetic Scattering
Progress In Electromagnetics Research Symposium 2005, Hangzhou, China, August 22-26 745 Local Multilevel Fast Multipole Algorithm for 3D Electromagnetic Scattering Jun Hu, Zaiping Nie, Lin Lei, and Jun
More information1.2 Numerical Solutions of Flow Problems
1.2 Numerical Solutions of Flow Problems DIFFERENTIAL EQUATIONS OF MOTION FOR A SIMPLIFIED FLOW PROBLEM Continuity equation for incompressible flow: 0 Momentum (Navier-Stokes) equations for a Newtonian
More informationPROGRAMMING OF MULTIGRID METHODS
PROGRAMMING OF MULTIGRID METHODS LONG CHEN In this note, we explain the implementation detail of multigrid methods. We will use the approach by space decomposition and subspace correction method; see Chapter:
More informationEfficient computation of source magnetic scalar potential
Adv. Radio Sci., 4, 59 63, 2006 Author(s) 2006. This work is licensed under a Creative Commons License. Advances in Radio Science Efficient computation of source magnetic scalar potential W. Hafla, A.
More informationA Parallel Implementation of A Fast Multipole Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers
A Parallel Implementation of A Fast Multipole Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers Yanhong Yuan and Prithviraj Banerjee Department of Electrical and Computer Engineering
More informationAlgorithms, System and Data Centre Optimisation for Energy Efficient HPC
2015-09-14 Algorithms, System and Data Centre Optimisation for Energy Efficient HPC Vincent Heuveline URZ Computing Centre of Heidelberg University EMCL Engineering Mathematics and Computing Lab 1 Energy
More informationHFSS Ansys ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary
HFSS 12.0 Ansys 2009 ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary Comparison of HFSS 11 and HFSS 12 for JSF Antenna Model UHF blade antenna on Joint Strike Fighter Inherent improvements in
More informationParallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of Earth s Mantle
ICES Student Forum The University of Texas at Austin, USA November 4, 204 Parallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of
More informationFast Radial Basis Functions for Engineering Applications. Prof. Marco Evangelos Biancolini University of Rome Tor Vergata
Fast Radial Basis Functions for Engineering Applications Prof. Marco Evangelos Biancolini University of Rome Tor Vergata Outline 2 RBF background Fast RBF on HPC Engineering Applications Mesh morphing
More informationParallel and Distributed Systems Lab.
Parallel and Distributed Systems Lab. Department of Computer Sciences Purdue University. Jie Chi, Ronaldo Ferreira, Ananth Grama, Tzvetan Horozov, Ioannis Ioannidis, Mehmet Koyuturk, Shan Lei, Robert Light,
More informationGeometric Modeling in Graphics
Geometric Modeling in Graphics Part 10: Surface reconstruction Martin Samuelčík www.sccg.sk/~samuelcik samuelcik@sccg.sk Curve, surface reconstruction Finding compact connected orientable 2-manifold surface
More informationLecture 15: More Iterative Ideas
Lecture 15: More Iterative Ideas David Bindel 15 Mar 2010 Logistics HW 2 due! Some notes on HW 2. Where we are / where we re going More iterative ideas. Intro to HW 3. More HW 2 notes See solution code!
More informationPerformance Evaluation of a New Parallel Preconditioner
Performance Evaluation of a New Parallel Preconditioner Keith D. Gremban Gary L. Miller October 994 CMU-CS-94-25 Marco Zagha School of Computer Science Carnegie Mellon University Pittsburgh, PA 523 This
More informationFast Multipole Method on GPU: Tackling 3-D Capacitance Extraction on Massively Parallel SIMD Platforms
Fast Multipole Method on GPU: Tackling 3-D Capacitance Extraction on Massively Parallel SIMD Platforms Xueqian Zhao Department of ECE Michigan Technological University Houghton, MI, 4993 xueqianz@mtu.edu
More informationHighly Parallel Multigrid Solvers for Multicore and Manycore Processors
Highly Parallel Multigrid Solvers for Multicore and Manycore Processors Oleg Bessonov (B) Institute for Problems in Mechanics of the Russian Academy of Sciences, 101, Vernadsky Avenue, 119526 Moscow, Russia
More informationUNIVERSITY OF CALGARY. Subdivision Surfaces. Advanced Geometric Modeling Faramarz Samavati
Subdivision Surfaces Surfaces Having arbitrary Topologies Tensor Product Surfaces Non Tensor Surfaces We can t find u-curves and v-curves in general surfaces General Subdivision Coarse mesh Subdivision
More informationScientific Computing on Graphical Processors: FMM, Flagon, Signal Processing, Plasma and Astrophysics
Scientific Computing on Graphical Processors: FMM, Flagon, Signal Processing, Plasma and Astrophysics Ramani Duraiswami Computer Science & UMIACS University of Maryland, College Park Joint work with Nail
More informationLecture 2: Introduction
Lecture 2: Introduction v2015.0 Release ANSYS HFSS for Antenna Design 1 2015 ANSYS, Inc. Multiple Advanced Techniques Allow HFSS to Excel at a Wide Variety of Applications Platform Integration and RCS
More informationRipplet: a New Transform for Feature Extraction and Image Representation
Ripplet: a New Transform for Feature Extraction and Image Representation Dr. Dapeng Oliver Wu Joint work with Jun Xu Department of Electrical and Computer Engineering University of Florida Outline Motivation
More informationGPU-Accelerated Algebraic Multigrid for Commercial Applications. Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA
GPU-Accelerated Algebraic Multigrid for Commercial Applications Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA ANSYS Fluent 2 Fluent control flow Accelerate this first Non-linear iterations Assemble
More informationTowards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers
Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,
More informationFeasibility of a meshfree post-processing for boundary element methods
Boundary Elements and Other Mesh Reduction Methods XXXVIII 327 Feasibility of a meshfree post-processing for boundary element methods A. Buchau & W. M. Rucker Institut für Theorie der Elektrotechnik, Universität
More informationFast Multipole BEM Modeling of Head Related Transfer Functions of a Dummy Head and Torso
Fast Multipole BEM Modeling of Head Related Transfer Functions of a Dummy Head and Torso P. Fiala 1,2, J. Huijssen 2, B. Pluymers 2, R. Hallez 3, W. Desmet 2 1 Budapest University of Technology and Economics,
More informationOptimum Array Processing
Optimum Array Processing Part IV of Detection, Estimation, and Modulation Theory Harry L. Van Trees WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Preface xix 1 Introduction 1 1.1 Array Processing
More informationFast Methods with Sieve
Fast Methods with Sieve Matthew G Knepley Mathematics and Computer Science Division Argonne National Laboratory August 12, 2008 Workshop on Scientific Computing Simula Research, Oslo, Norway M. Knepley
More informationEmpirical Analysis of Space Filling Curves for Scientific Computing Applications
Empirical Analysis of Space Filling Curves for Scientific Computing Applications Daryl DeFord 1 Ananth Kalyanaraman 2 1 Department of Mathematics 2 School of Electrical Engineering and Computer Science
More information