Iterative methods for use with the Fast Multipole Method

Size: px
Start display at page:

Download "Iterative methods for use with the Fast Multipole Method"

Transcription

1 Iterative methods for use with the Fast Multipole Method Ramani Duraiswami Perceptual Interfaces and Reality Lab. Computer Science & UMIACS University of Maryland, College Park, MD Joint work with Nail A. Gumerov

2 Fast Multipole Methods Follows from seminal work of Rokhlin and Greengard (1987) General method for accelerating large classes of dense matrix vector products Apply to matrices whose entries are associated with computations of pairs of points Reduce computational/memory complexity from O(N 2 ) to O(N log N) Potentially reduce many O(N 2 )/O(N 3 ) calculations to O(N log N) Nail Gumerov and I are applying it to many areas Acoustics (JASA 2002,2005), Room impulse responses (IEEE Trans. SAP, 2006), book (Elsevier, 2005) Maxwell s equations in 3D and scattering (submitted, 2006) Fast statistics (NIPS 2004), similarity measures (CVPR 2005), image processing, segmentation (ICCV 2003), tracking, learning (Data Mining 2006) Non uniform fast Fourier transforms and reconstruction Biharmonic equation (JCP 2006), fitting thin-plate splines

3 Solving Linear Systems Typically the FMM is used with an iterative method some preliminary work aims at directly constructing the inverse approximately (Rokhlin, Martinsson, Gimbutas, Cheng) FMM reduces the cost of the matrix-vector product step in the iterative solution of linear systems, and for an iteration requiring N iter steps, the complexity is O(N iter NlogN). To bound N iter appropriate pre-conditioning strategies must be used with the dense matrix. However, many conventional pre-conditioning strategies rely on sparsity in the matrix, or are ineffective

4 Preconditioner cost To be effective with the FMM the preconditioner cost should be O(N log N) or less Otherwise we negate the advantage of the FMM We present results for two preconditioners we have used First, for highly oscillatory kernel --- multiple scattering with the Helmholtz equation Second, we accelerate a highly successful preconditioner developed by Faul, Powell and Goodsell (2005), but which has a complexity of O(N 2 ) to O(N log N).

5 Radial basis function (RBF) interpolation Construct interpolating function for scattered data, with the interpolating function expressed as a sum of RBFs centered at the data points. Much work by Powell, Beatson and coworkers Culmination of this work: an iterative Krylov subspace algorithm Faul et al. (2005) A. C. Faul, G. Goodsell, M. J. D. Powell, "A Krylov subspace algorithm for multiquadric interpolation in many dimensions," IMA Journal of Numerical Analysis (2005) 25, 1--24

6 RBF Interpolation x are points in R d f are values of the function to be interpolated s is the interpolating function λ are the interpolation coefficients Interpolating RBFs are multiquadric or biharmonic

7 Linear system \Phi is the RBF

8 Preconditioner via cardinal functions Beatson and Powell (1992) created preconditioners via approximate cardinal functions Preconditioner must be close to the inverse Cardinal function has value one at the point and zeros at all other points Thus a cardinal function c satisfies Ac = [ ] t If we stack cardinal functions for each row, we obtain A -1 f BP92, proposed to use the smoothness of the interpolant and required the cardinality properties at each point and several other points in the domain (not the whole set) Selection of which points to include and which points to exclude, and their influence on preconditioner performance is the subject of several subsequent papers by the groups of Powell and Beatson.

9 Empirical comparison of iterative methods We compared several strategies proposed by the authors, and found that the proposal of Faul et al. 2005, works best For some of the proposed strategies, for very large (10 6 or more points), some of the proposed preconditioners do not converge within 1000 iterations. However Faul et al. s preconditioned iterative algorithm converges robustly within about 50 iterations.

10 Accelerating the Faul et al. algorithm Matrix vector product can be accelerated via the FMM (e.g., Gumerov and Duraiswami, 2006, JCP) Choice of points for the approximate cardinal functions is very particular in Faul et al. Chooses the point that is a member of the closest pair of points, and its q closest neighbors and builds an interpolant The first point is eliminated from future consideration Algorithm is recursively applied and a fine-to-coarse structure of the sets of approximate cardinal functions are obtained Preconditioned Conjugate Direction algorithm is applied along the directions of these functions Search for q closest neighbors with removal of points from the set is O(N 2 ).

11 Converting it to O(N log N) Use FMM data structure to build lists of closest neighbors Use heaps to develop a dynamic data structure that allows deletions Employed a lemma that bounds the number of closest neighbors a point can have is a fixed number that is a function of the dimension Achieve a point selection that is O(N log N). Details in a preprint being prepared. For a 110,000 point problem, time of the preset set went down from seconds to 115 seconds in 3D biharmonic interpolation

12 Original unstructured data points RBF/FMM interpolation to regular spatial grid FMM algorithm cost per iteration:9.6 seconds on a Pentium IV 3.2 GHz desktop

13 Error Iteration number

14 Original Data: surface mesh with vertices and faces RBF interpolation to the spatial grid + isosurface (size of the grid is shown near each figure) 5x5x5 11x11x11 26x26x26 51x51x51 101x101x x201x201

15 Original Image percent samples drawn from image FMM algorithm cost per iteration:6.2 seconds on a Pentium IV 3.2 GHz desktop

16 Multiquadric Interpolation of a 2D image 10 4 Error Iteration number

17 Interpolated Result Difference Image 14 seconds to interpolate whole image from the fit data

18 Equations and Boundary Conditions Formulation Wave Equation Fourier Transform Helmholtz Equation Impedance Boundary Conditions 3 Field Decomposition Sommerfield Radiation Condition 2 Incident Wave 6

19 Scattered Field Decomposition T-Matrix Method Scattered field expressed in terms of singular (multipole) wave functions that satisfy the Sommerfeld condition Singular Basis Functions Hankel Functions Expansion Coefficients Spherical Harmonics Vector Form: dot product

20 T-Matrix Method Solution of Multiple Scattering Problem Write a system of equations relating expansion coefficients for each scatterer Effective Incident Field Coupled System of Equations: Scattered Wave (S R)-Translation Matrix Incident Wave 6 Linear system of size p 2 N p 2 N

21 T-matrix Method Three Spheres Comparisons of BEM & MultisphereHelmholtz HRTF (db) o 30 o θ 1 = 0 o 90 o o BEM MultisphereHelmholtz 30 o o 90 o 120 o o 150 o o Three Spheres, ka 1 = Angle φ 1 (deg) BEM: 5184 triangular elements MH: N trunc = 9 (100 coefficients for each sphere)

22 Iterative Methods Krylov Subspace Method (GMRES) For larger systems use an iterative method Diagonal Matrix The product of this matrix by An arbitrary input vector can be done fast with the FMM

23 Iterative Methods FGMRES Unpreconditioned LA = E To converge requires N iter multiplications LG, where G is an input vector Cost: C N iter Right Preconditioner LM -1 (MA)=E 1). Internal Loop: Solve M -1 F=E Requires N (1) iter multiplications MG 2). External Loop: Requires N (2) iter multiplications LG Cost: C (1) N (1) iter+ C N (2) iter Substantial speed up if M L and C (1) «C

24 MLFMM FMM splits influence of scatterers in to far and near Fields Neighborhood (Near Field) Far Field

25 MLFMM Computation of the Far Field (1) 1). Set Data Structure (hierarchically subdivide space with an oct-tree) 2). multipole to multipole translate S- expansions for all scatterers in a box at the finest level to the center of the box and sum up (determine contribution to Far Field for each box at the finest level). 3). Recursively multipole to multipole translate S-expansions to center of the parent box and sum up (determine contribution to Far Field for each box at all coarser levels). y x i x c (n,l) Upward Pass (From the finest to the coarsest level)

26 MLFMM Computation of the Far Field (2) Steps 4 and 5 performed one after the other recursively 4). Multipole-to-local translate S- expansions for boxes which are inside the parent neighborhood but outside the box neighborhood to the center of the box (convert S-expansion to R- expansion). Downward Pass (From the coarsest to the finest level) 5). Local-to-local translate R-expansions from the center of the box to the center of its children boxes (determine Far Field for each box at all levels). 6). local translation of R-expansions from the center of the boxes at the finest level to the centers of the spheres.

27 Preconditioned FGMRES FGMRES Choice of preconditioning operator: near (sparse) or far (dense)? Experimentally we found that the far (dense) operator works well in all tests

28 Range of Parameters Results Number of Spheres: ; ka: ; Random and regularly spaced grids of spheres; Polydispersity: (ratio to the mean radius); Volume fractions:

29 Results 100 random spheres (MLFMM) Plane Wave Plane Wave ka = 1.6 ka = 2.8

30 1000 random spheres Results ka=1

31 10000 random spheres Results ka = 0.75 kd 0 = 90

32 A posteriori Error Check Results

33 Results GMRES vs FGMRES Volume Fraction = 0.2 ka = 0.5, D 0 /a = 60, p 2 = GHz, Xeon, 3.5 MB RAM

34 Performance Test Results Volume fraction = 0.2, ka = 0.5, p 2 = FMM+FGMRES y=cx 1.25 CPU Time (s) y = cx 1.25 Total y = bx External Loop y = ax 3.2GHz, Xeon, 3.5 MB RAM Matrix-Vector Multiplication 1 Internal Loop Number of Scatterers

35 Conclusions Need for preconditioners for dense matrices for FMM based algorithms For a highly oscillatory kernel, preconditioner based on the FMM algorithm itself gives good results Preconditioner theoretical performance remains to be proved Approach that interpolates the cardinal function approximately on carefully chosen data appears very promising for smoother kernels Geometrical computations involved in this method reduced to O(N log N) Extension to other kernels of this approach is the subject of

Efficient O(N log N) algorithms for scattered data interpolation

Efficient O(N log N) algorithms for scattered data interpolation Efficient O(N log N) algorithms for scattered data interpolation Nail Gumerov University of Maryland Institute for Advanced Computer Studies Joint work with Ramani Duraiswami February Fourier Talks 2007

More information

FMM accelerated BEM for 3D Helmholtz equation

FMM accelerated BEM for 3D Helmholtz equation FMM accelerated BEM for 3D Helmholtz equation Nail A. Gumerov and Ramani Duraiswami Institute for Advanced Computer Studies University of Maryland, U.S.A. also @ Fantalgo, LLC, U.S.A. www.umiacs.umd.edu/~gumerov

More information

Fast Multipole Accelerated Indirect Boundary Elements for the Helmholtz Equation

Fast Multipole Accelerated Indirect Boundary Elements for the Helmholtz Equation Fast Multipole Accelerated Indirect Boundary Elements for the Helmholtz Equation Nail A. Gumerov Ross Adelman Ramani Duraiswami University of Maryland Institute for Advanced Computer Studies and Fantalgo,

More information

c 2007 Society for Industrial and Applied Mathematics

c 2007 Society for Industrial and Applied Mathematics SIAM J. SCI. COMPUT. Vol. 29, No. 5, pp. 1876 1899 c 2007 Society for Industrial and Applied Mathematics FAST RADIAL BASIS FUNCTION INTERPOLATION VIA PRECONDITIONED KRYLOV ITERATION NAIL A. GUMEROV AND

More information

GPU accelerated heterogeneous computing for Particle/FMM Approaches and for Acoustic Imaging

GPU accelerated heterogeneous computing for Particle/FMM Approaches and for Acoustic Imaging GPU accelerated heterogeneous computing for Particle/FMM Approaches and for Acoustic Imaging Ramani Duraiswami University of Maryland, College Park http://www.umiacs.umd.edu/~ramani With Nail A. Gumerov,

More information

Fast Multipole and Related Algorithms

Fast Multipole and Related Algorithms Fast Multipole and Related Algorithms Ramani Duraiswami University of Maryland, College Park http://www.umiacs.umd.edu/~ramani Joint work with Nail A. Gumerov Efficiency by exploiting symmetry and A general

More information

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION.

APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION. APPLYING EXTRAPOLATION AND INTERPOLATION METHODS TO MEASURED AND SIMULATED HRTF DATA USING SPHERICAL HARMONIC DECOMPOSITION Martin Pollow Institute of Technical Acoustics RWTH Aachen University Neustraße

More information

Lecture 7: Introduction to HFSS-IE

Lecture 7: Introduction to HFSS-IE Lecture 7: Introduction to HFSS-IE 2015.0 Release ANSYS HFSS for Antenna Design 1 2015 ANSYS, Inc. HFSS-IE: Integral Equation Solver Introduction HFSS-IE: Technology An Integral Equation solver technology

More information

The Fast Multipole Method (FMM)

The Fast Multipole Method (FMM) The Fast Multipole Method (FMM) Motivation for FMM Computational Physics Problems involving mutual interactions of N particles Gravitational or Electrostatic forces Collective (but weak) long-range forces

More information

A Broadband Fast Multipole Accelerated Boundary Element Method for the 3D Helmholtz Equation. Abstract

A Broadband Fast Multipole Accelerated Boundary Element Method for the 3D Helmholtz Equation. Abstract A Broadband Fast Multipole Accelerated Boundary Element Method for the 3D Helmholtz Equation Nail A. Gumerov and Ramani Duraiswami (Dated: 31 December 2007, Revised 22 July 2008, Revised 03 October 2008.)

More information

Fast Spherical Filtering in the Broadband FMBEM using a nonequally

Fast Spherical Filtering in the Broadband FMBEM using a nonequally Fast Spherical Filtering in the Broadband FMBEM using a nonequally spaced FFT Daniel R. Wilkes (1) and Alec. J. Duncan (1) (1) Centre for Marine Science and Technology, Department of Imaging and Applied

More information

A Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions

A Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions Shape Modeling International 2003 Seoul, Korea A Multi-scale Approach to 3D Scattered Data Interpolation with Compactly Supported Basis Functions Yutaa Ohtae Alexander Belyaev Hans-Peter Seidel Objective

More information

Fast Multipole Methods. Linear Systems. Matrix vector product. An Introduction to Fast Multipole Methods.

Fast Multipole Methods. Linear Systems. Matrix vector product. An Introduction to Fast Multipole Methods. An Introduction to Fast Multipole Methods Ramani Duraiswami Institute for Advanced Computer Studies University of Maryland, College Park http://www.umiacs.umd.edu/~ramani Joint work with Nail A. Gumerov

More information

Kernel Independent FMM

Kernel Independent FMM Kernel Independent FMM FMM Issues FMM requires analytical work to generate S expansions, R expansions, S S (M2M) translations S R (M2L) translations R R (L2L) translations Such analytical work leads to

More information

FMM Data Structures. Content. Introduction Hierarchical Space Subdivision with 2 d -Trees Hierarchical Indexing System Parent & Children Finding

FMM Data Structures. Content. Introduction Hierarchical Space Subdivision with 2 d -Trees Hierarchical Indexing System Parent & Children Finding FMM Data Structures Nail Gumerov & Ramani Duraiswami UMIACS [gumerov][ramani]@umiacs.umd.edu CSCAMM FAM4: 4/9/4 Duraiswami & Gumerov, -4 Content Introduction Hierarchical Space Subdivision with d -Trees

More information

Transactions on Modelling and Simulation vol 20, 1998 WIT Press, ISSN X

Transactions on Modelling and Simulation vol 20, 1998 WIT Press,   ISSN X Parallel indirect multipole BEM analysis of Stokes flow in a multiply connected domain M.S. Ingber*, A.A. Mammoli* & J.S. Warsa* "Department of Mechanical Engineering, University of New Mexico, Albuquerque,

More information

A Kernel-independent Adaptive Fast Multipole Method

A Kernel-independent Adaptive Fast Multipole Method A Kernel-independent Adaptive Fast Multipole Method Lexing Ying Caltech Joint work with George Biros and Denis Zorin Problem Statement Given G an elliptic PDE kernel, e.g. {x i } points in {φ i } charges

More information

Terascale on the desktop: Fast Multipole Methods on Graphical Processors

Terascale on the desktop: Fast Multipole Methods on Graphical Processors Terascale on the desktop: Fast Multipole Methods on Graphical Processors Nail A. Gumerov Fantalgo, LLC Institute for Advanced Computer Studies University of Maryland (joint work with Ramani Duraiswami)

More information

Parallel Interpolation in FSI Problems Using Radial Basis Functions and Problem Size Reduction

Parallel Interpolation in FSI Problems Using Radial Basis Functions and Problem Size Reduction Parallel Interpolation in FSI Problems Using Radial Basis Functions and Problem Size Reduction Sergey Kopysov, Igor Kuzmin, Alexander Novikov, Nikita Nedozhogin, and Leonid Tonkov Institute of Mechanics,

More information

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 7.6 EFFICIENT ACOUSTIC

More information

Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation

Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation Nail A. Gumerov, Adam E. O Donovan, Ramani Duraiswami,

More information

Capturing, Computing, Visualizing and Recreating Spatial Sound

Capturing, Computing, Visualizing and Recreating Spatial Sound Capturing, Computing, Visualizing and Recreating Spatial Sound Ramani Duraiswami University of Maryland, College Park Joint work with Dmitry Zotkin, Zhiyun Li, Elena Grassi, Adam O Donovan, Nail Gumerov,

More information

FAST MATRIX-VECTOR PRODUCT BASED FGMRES FOR KERNEL MACHINES

FAST MATRIX-VECTOR PRODUCT BASED FGMRES FOR KERNEL MACHINES FAST MATRIX-VECTOR PRODUCT BASED FGMRES FOR KERNEL MACHINES BALAJI VASAN SRINIVASAN, M.S. SCHOLARLY PAPER DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF MARYLAND, COLLEGE PARK Abstract. Kernel based approaches

More information

Fast multipole methods for axisymmetric geometries

Fast multipole methods for axisymmetric geometries Fast multipole methods for axisymmetric geometries Victor Churchill New York University May 13, 2016 Abstract Fast multipole methods (FMMs) are one of the main numerical methods used in the solution of

More information

A fast direct solver for high frequency scattering from a cavity in two dimensions

A fast direct solver for high frequency scattering from a cavity in two dimensions 1/31 A fast direct solver for high frequency scattering from a cavity in two dimensions Jun Lai 1 Joint work with: Leslie Greengard (CIMS) Sivaram Ambikasaran (CIMS) Workshop on fast direct solver, Dartmouth

More information

The Fast Multipole Method on NVIDIA GPUs and Multicore Processors

The Fast Multipole Method on NVIDIA GPUs and Multicore Processors The Fast Multipole Method on NVIDIA GPUs and Multicore Processors Toru Takahashi, a Cris Cecka, b Eric Darve c a b c Department of Mechanical Science and Engineering, Nagoya University Institute for Applied

More information

Improved Multilevel Fast Multipole Method for Higher-Order Discretizations

Improved Multilevel Fast Multipole Method for Higher-Order Discretizations Improved Multilevel Fast Multipole Method for Higher-Order Discretizations Oscar Borries 1,, Peter Meincke, Erik Jørgensen, Stig Busk Sørensen, Per Christian Hansen 1 1 Technical University of Denmark,

More information

FMM implementation on CPU and GPU. Nail A. Gumerov (Lecture for CMSC 828E)

FMM implementation on CPU and GPU. Nail A. Gumerov (Lecture for CMSC 828E) FMM implementation on CPU and GPU Nail A. Gumerov (Lecture for CMSC 828E) Outline Two parts of the FMM Data Structure Flow Chart of the Run Algorithm FMM Cost/Optimization on CPU Programming on GPU Fast

More information

Adaptive boundary element methods in industrial applications

Adaptive boundary element methods in industrial applications Adaptive boundary element methods in industrial applications Günther Of, Olaf Steinbach, Wolfgang Wendland Adaptive Fast Boundary Element Methods in Industrial Applications, Hirschegg, September 30th,

More information

What is Multigrid? They have been extended to solve a wide variety of other problems, linear and nonlinear.

What is Multigrid? They have been extended to solve a wide variety of other problems, linear and nonlinear. AMSC 600/CMSC 760 Fall 2007 Solution of Sparse Linear Systems Multigrid, Part 1 Dianne P. O Leary c 2006, 2007 What is Multigrid? Originally, multigrid algorithms were proposed as an iterative method to

More information

Spherical Microphone Arrays

Spherical Microphone Arrays Spherical Microphone Arrays Acoustic Wave Equation Helmholtz Equation Assuming the solutions of wave equation are time harmonic waves of frequency ω satisfies the homogeneous Helmholtz equation: Boundary

More information

Using GPUs to compute the multilevel summation of electrostatic forces

Using GPUs to compute the multilevel summation of electrostatic forces Using GPUs to compute the multilevel summation of electrostatic forces David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of

More information

Method of Moments enhancement technique for the analysis of Sierpinski pre-fractal antennas. Total number of pages: 10

Method of Moments enhancement technique for the analysis of Sierpinski pre-fractal antennas. Total number of pages: 10 FRACTALCOMS Exploring the limits of Fractal Electrodynamics for the future telecommunication technologies IST-2001-33055 Method of Moments enhancement technique for the analysis of Sierpinski pre-fractal

More information

Can Xia 1, 2,WanqingYou 2, and Yufa Sun 2, *

Can Xia 1, 2,WanqingYou 2, and Yufa Sun 2, * Progress In Electromagnetics Research Letters, Vol. 81, 133 139, 2019 Fast Calculation of Monostatic Radar Cross Section of Conducting Targets Using Hierarchical Characteristic Basis Function Method and

More information

Low-rank Properties, Tree Structure, and Recursive Algorithms with Applications. Jingfang Huang Department of Mathematics UNC at Chapel Hill

Low-rank Properties, Tree Structure, and Recursive Algorithms with Applications. Jingfang Huang Department of Mathematics UNC at Chapel Hill Low-rank Properties, Tree Structure, and Recursive Algorithms with Applications Jingfang Huang Department of Mathematics UNC at Chapel Hill Fundamentals of Fast Multipole (type) Method Fundamentals: Low

More information

Long time integrations of a convective PDE on the sphere by RBF collocation

Long time integrations of a convective PDE on the sphere by RBF collocation Long time integrations of a convective PDE on the sphere by RBF collocation Bengt Fornberg and Natasha Flyer University of Colorado NCAR Department of Applied Mathematics Institute for Mathematics Applied

More information

Introduction to Multigrid and its Parallelization

Introduction to Multigrid and its Parallelization Introduction to Multigrid and its Parallelization! Thomas D. Economon Lecture 14a May 28, 2014 Announcements 2 HW 1 & 2 have been returned. Any questions? Final projects are due June 11, 5 pm. If you are

More information

FMM CMSC 878R/AMSC 698R. Lecture 13

FMM CMSC 878R/AMSC 698R. Lecture 13 FMM CMSC 878R/AMSC 698R Lecture 13 Outline Results of the MLFMM tests Itemized Asymptotic Complexity of the MLFMM; Optimization of the Grouping (Clustering) Parameter; Regular mesh; Random distributions.

More information

The meshfree computation of stationary electric current densities in complex shaped conductors using 3D boundary element methods

The meshfree computation of stationary electric current densities in complex shaped conductors using 3D boundary element methods Boundary Elements and Other Mesh Reduction Methods XXXVII 121 The meshfree computation of stationary electric current densities in complex shaped conductors using 3D boundary element methods A. Buchau

More information

Using a Fast Multipole Method to Accelerate Spline Evaluations

Using a Fast Multipole Method to Accelerate Spline Evaluations Using a Fast Multipole Method to Accelerate Spline Evaluations FANG CHEN AND DAVID SUTER Monush University, Australia 4 e In considering the problem of interpolating scattered data using spline methods,

More information

Enhanced Characteristic Basis Function Method for Solving the Monostatic Radar Cross Section of Conducting Targets

Enhanced Characteristic Basis Function Method for Solving the Monostatic Radar Cross Section of Conducting Targets Progress In Electromagnetics Research M, Vol. 68, 173 180, 2018 Enhanced Characteristic Basis Function Method for Solving the Monostatic Radar Cross Section of Conducting Targets Jinyu Zhu, Yufa Sun *,

More information

Recent Developments in Model-based Derivative-free Optimization

Recent Developments in Model-based Derivative-free Optimization Recent Developments in Model-based Derivative-free Optimization Seppo Pulkkinen April 23, 2010 Introduction Problem definition The problem we are considering is a nonlinear optimization problem with constraints:

More information

Time domain construction of acoustic scattering by elastic targets through finite element analysis

Time domain construction of acoustic scattering by elastic targets through finite element analysis Time domain construction of acoustic scattering by elastic targets through finite element analysis Aaron Gunderson*, Blake Simon, Anthony Bonomo, Marcia Isakson Applied Research Laboratories University

More information

Comparison of iteration convergences of SIE and VSIE for solving electromagnetic scattering problems for coated objects

Comparison of iteration convergences of SIE and VSIE for solving electromagnetic scattering problems for coated objects RADIO SCIENCE, VOL. 38, NO. 2, 1028, doi:10.1029/2002rs002610, 2003 Comparison of iteration convergences of SIE and VSIE for solving electromagnetic scattering problems for coated objects Cai-Cheng Lu

More information

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS John R Appleyard Jeremy D Appleyard Polyhedron Software with acknowledgements to Mark A Wakefield Garf Bowen Schlumberger Outline of Talk Reservoir

More information

CMSC 858M/AMSC 698R. Fast Multipole Methods. Nail A. Gumerov & Ramani Duraiswami. Lecture 20. Outline

CMSC 858M/AMSC 698R. Fast Multipole Methods. Nail A. Gumerov & Ramani Duraiswami. Lecture 20. Outline CMSC 858M/AMSC 698R Fast Multipole Methods Nail A. Gumerov & Ramani Duraiswami Lecture 20 Outline Two parts of the FMM Data Structures FMM Cost/Optimization on CPU Fine Grain Parallelization for Multicore

More information

3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs

3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs 3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs H. Knibbe, C. W. Oosterlee, C. Vuik Abstract We are focusing on an iterative solver for the three-dimensional

More information

Electromagnetic Field Transformations for Measurements and Simulations

Electromagnetic Field Transformations for Measurements and Simulations Progress In Electromagnetics Research, Vol. 151, 127 150, 2015 Electromagnetic Field Transformations for Measurements and Simulations Thomas F. Eibert *, Emre Kılıç, Carlos Lopez, Raimund A. M. Mauermayer,

More information

Numerical Quadrature over the Surface of a Sphere

Numerical Quadrature over the Surface of a Sphere Numerical Quadrature over the Surface of a Sphere Bengt Fornberg University of Colorado at Boulder Department of Applied Mathematics in collaboration with: Jonah Reeger Air Force Institute of Technology,

More information

Xinyu Dou Acoustics Technology Center, Motorola, Inc., Schaumburg, Illinois 60196

Xinyu Dou Acoustics Technology Center, Motorola, Inc., Schaumburg, Illinois 60196 A unified boundary element method for the analysis of sound and shell-like structure interactions. II. Efficient solution techniques Shaohai Chen and Yijun Liu a) Department of Mechanical Engineering,

More information

Technische Universiteit Delft. Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics Processing Unit C. Wagenaar

Technische Universiteit Delft. Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics Processing Unit C. Wagenaar Technische Universiteit Delft Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics Processing Unit C. Wagenaar Acceleration of the Multi Level Fast Multipole Algorithm on a Graphics

More information

SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND

SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND Student Submission for the 5 th OpenFOAM User Conference 2017, Wiesbaden - Germany: SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND TESSA UROIĆ Faculty of Mechanical Engineering and Naval Architecture, Ivana

More information

Contents. I The Basic Framework for Stationary Problems 1

Contents. I The Basic Framework for Stationary Problems 1 page v Preface xiii I The Basic Framework for Stationary Problems 1 1 Some model PDEs 3 1.1 Laplace s equation; elliptic BVPs... 3 1.1.1 Physical experiments modeled by Laplace s equation... 5 1.2 Other

More information

A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation

A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation A broadband fast multipole accelerated boundary element method for the three dimensional Helmholtz equation Nail A. Gumerov a,b and amani Duraiswami b,c Perceptual Interfaces and eality Laboratory, Institute

More information

CUDA Experiences: Over-Optimization and Future HPC

CUDA Experiences: Over-Optimization and Future HPC CUDA Experiences: Over-Optimization and Future HPC Carl Pearson 1, Simon Garcia De Gonzalo 2 Ph.D. candidates, Electrical and Computer Engineering 1 / Computer Science 2, University of Illinois Urbana-Champaign

More information

Surfaces, meshes, and topology

Surfaces, meshes, and topology Surfaces from Point Samples Surfaces, meshes, and topology A surface is a 2-manifold embedded in 3- dimensional Euclidean space Such surfaces are often approximated by triangle meshes 2 1 Triangle mesh

More information

Smoothers. < interactive example > Partial Differential Equations Numerical Methods for PDEs Sparse Linear Systems

Smoothers. < interactive example > Partial Differential Equations Numerical Methods for PDEs Sparse Linear Systems Smoothers Partial Differential Equations Disappointing convergence rates observed for stationary iterative methods are asymptotic Much better progress may be made initially before eventually settling into

More information

Meshless physical simulation of semiconductor devices using a wavelet-based nodes generator

Meshless physical simulation of semiconductor devices using a wavelet-based nodes generator Meshless physical simulation of semiconductor devices using a wavelet-based nodes generator Rashid Mirzavand 1, Abdolali Abdipour 1a), Gholamreza Moradi 1, and Masoud Movahhedi 2 1 Microwave/mm-wave &

More information

Construction and application of hierarchical matrix preconditioners

Construction and application of hierarchical matrix preconditioners University of Iowa Iowa Research Online Theses and Dissertations 2008 Construction and application of hierarchical matrix preconditioners Fang Yang University of Iowa Copyright 2008 Fang Yang This dissertation

More information

Multigrid Pattern. I. Problem. II. Driving Forces. III. Solution

Multigrid Pattern. I. Problem. II. Driving Forces. III. Solution Multigrid Pattern I. Problem Problem domain is decomposed into a set of geometric grids, where each element participates in a local computation followed by data exchanges with adjacent neighbors. The grids

More information

ESPRESO ExaScale PaRallel FETI Solver. Hybrid FETI Solver Report

ESPRESO ExaScale PaRallel FETI Solver. Hybrid FETI Solver Report ESPRESO ExaScale PaRallel FETI Solver Hybrid FETI Solver Report Lubomir Riha, Tomas Brzobohaty IT4Innovations Outline HFETI theory from FETI to HFETI communication hiding and avoiding techniques our new

More information

Acceleration of the Multi Level Fast Multipole Method on a GPU

Acceleration of the Multi Level Fast Multipole Method on a GPU Thesis National Aerospace Laboratory NLR Acceleration of the Multi Level Fast Multipole Method on a GPU Author: Claudia Wagenaar(4090772) Supervisors: H. van der Ven D. van der Heul C. Vuik February 1,

More information

A Random Variable Shape Parameter Strategy for Radial Basis Function Approximation Methods

A Random Variable Shape Parameter Strategy for Radial Basis Function Approximation Methods A Random Variable Shape Parameter Strategy for Radial Basis Function Approximation Methods Scott A. Sarra, Derek Sturgill Marshall University, Department of Mathematics, One John Marshall Drive, Huntington

More information

Subdivision Curves and Surfaces

Subdivision Curves and Surfaces Subdivision Surfaces or How to Generate a Smooth Mesh?? Subdivision Curves and Surfaces Subdivision given polyline(2d)/mesh(3d) recursively modify & add vertices to achieve smooth curve/surface Each iteration

More information

Accelerated flow acoustic boundary element solver and the noise generation of fish

Accelerated flow acoustic boundary element solver and the noise generation of fish Accelerated flow acoustic boundary element solver and the noise generation of fish JUSTIN W. JAWORSKI, NATHAN WAGENHOFFER, KEITH W. MOORED LEHIGH UNIVERSITY, BETHLEHEM, USA FLINOVIA PENN STATE 27 APRIL

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 20: Sparse Linear Systems; Direct Methods vs. Iterative Methods Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 26

More information

Multigrid solvers M. M. Sussman sussmanm@math.pitt.edu Office Hours: 11:10AM-12:10PM, Thack 622 May 12 June 19, 2014 1 / 43 Multigrid Geometrical multigrid Introduction Details of GMG Summary Algebraic

More information

Introduction to Computer Graphics. Modeling (3) April 27, 2017 Kenshi Takayama

Introduction to Computer Graphics. Modeling (3) April 27, 2017 Kenshi Takayama Introduction to Computer Graphics Modeling (3) April 27, 2017 Kenshi Takayama Solid modeling 2 Solid models Thin shapes represented by single polygons Unorientable Clear definition of inside & outside

More information

A comparison of parallel rank-structured solvers

A comparison of parallel rank-structured solvers A comparison of parallel rank-structured solvers François-Henry Rouet Livermore Software Technology Corporation, Lawrence Berkeley National Laboratory Joint work with: - LSTC: J. Anton, C. Ashcraft, C.

More information

smooth coefficients H. Köstler, U. Rüde

smooth coefficients H. Köstler, U. Rüde A robust multigrid solver for the optical flow problem with non- smooth coefficients H. Köstler, U. Rüde Overview Optical Flow Problem Data term and various regularizers A Robust Multigrid Solver Galerkin

More information

HFSS Hybrid Finite Element and Integral Equation Solver for Large Scale Electromagnetic Design and Simulation

HFSS Hybrid Finite Element and Integral Equation Solver for Large Scale Electromagnetic Design and Simulation HFSS Hybrid Finite Element and Integral Equation Solver for Large Scale Electromagnetic Design and Simulation Laila Salman, PhD Technical Services Specialist laila.salman@ansys.com 1 Agenda Overview of

More information

Tree-based methods on GPUs

Tree-based methods on GPUs Tree-based methods on GPUs Felipe Cruz 1 and Matthew Knepley 2,3 1 Department of Mathematics University of Bristol 2 Computation Institute University of Chicago 3 Department of Molecular Biology and Physiology

More information

The Spherical Harmonics Discrete Ordinate Method for Atmospheric Radiative Transfer

The Spherical Harmonics Discrete Ordinate Method for Atmospheric Radiative Transfer The Spherical Harmonics Discrete Ordinate Method for Atmospheric Radiative Transfer K. Franklin Evans Program in Atmospheric and Oceanic Sciences University of Colorado, Boulder Computational Methods in

More information

Advanced modelling of gratings in VirtualLab software. Site Zhang, development engineer Lignt Trans

Advanced modelling of gratings in VirtualLab software. Site Zhang, development engineer Lignt Trans Advanced modelling of gratings in VirtualLab software Site Zhang, development engineer Lignt Trans 1 2 3 4 Content Grating Order Analyzer Rigorous Simulation of Holographic Generated Volume Grating Coupled

More information

Local Multilevel Fast Multipole Algorithm for 3D Electromagnetic Scattering

Local Multilevel Fast Multipole Algorithm for 3D Electromagnetic Scattering Progress In Electromagnetics Research Symposium 2005, Hangzhou, China, August 22-26 745 Local Multilevel Fast Multipole Algorithm for 3D Electromagnetic Scattering Jun Hu, Zaiping Nie, Lin Lei, and Jun

More information

1.2 Numerical Solutions of Flow Problems

1.2 Numerical Solutions of Flow Problems 1.2 Numerical Solutions of Flow Problems DIFFERENTIAL EQUATIONS OF MOTION FOR A SIMPLIFIED FLOW PROBLEM Continuity equation for incompressible flow: 0 Momentum (Navier-Stokes) equations for a Newtonian

More information

PROGRAMMING OF MULTIGRID METHODS

PROGRAMMING OF MULTIGRID METHODS PROGRAMMING OF MULTIGRID METHODS LONG CHEN In this note, we explain the implementation detail of multigrid methods. We will use the approach by space decomposition and subspace correction method; see Chapter:

More information

Efficient computation of source magnetic scalar potential

Efficient computation of source magnetic scalar potential Adv. Radio Sci., 4, 59 63, 2006 Author(s) 2006. This work is licensed under a Creative Commons License. Advances in Radio Science Efficient computation of source magnetic scalar potential W. Hafla, A.

More information

A Parallel Implementation of A Fast Multipole Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers

A Parallel Implementation of A Fast Multipole Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers A Parallel Implementation of A Fast Multipole Based 3-D Capacitance Extraction Program on Distributed Memory Multicomputers Yanhong Yuan and Prithviraj Banerjee Department of Electrical and Computer Engineering

More information

Algorithms, System and Data Centre Optimisation for Energy Efficient HPC

Algorithms, System and Data Centre Optimisation for Energy Efficient HPC 2015-09-14 Algorithms, System and Data Centre Optimisation for Energy Efficient HPC Vincent Heuveline URZ Computing Centre of Heidelberg University EMCL Engineering Mathematics and Computing Lab 1 Energy

More information

HFSS Ansys ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary

HFSS Ansys ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary HFSS 12.0 Ansys 2009 ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary Comparison of HFSS 11 and HFSS 12 for JSF Antenna Model UHF blade antenna on Joint Strike Fighter Inherent improvements in

More information

Parallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of Earth s Mantle

Parallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of Earth s Mantle ICES Student Forum The University of Texas at Austin, USA November 4, 204 Parallel High-Order Geometric Multigrid Methods on Adaptive Meshes for Highly Heterogeneous Nonlinear Stokes Flow Simulations of

More information

Fast Radial Basis Functions for Engineering Applications. Prof. Marco Evangelos Biancolini University of Rome Tor Vergata

Fast Radial Basis Functions for Engineering Applications. Prof. Marco Evangelos Biancolini University of Rome Tor Vergata Fast Radial Basis Functions for Engineering Applications Prof. Marco Evangelos Biancolini University of Rome Tor Vergata Outline 2 RBF background Fast RBF on HPC Engineering Applications Mesh morphing

More information

Parallel and Distributed Systems Lab.

Parallel and Distributed Systems Lab. Parallel and Distributed Systems Lab. Department of Computer Sciences Purdue University. Jie Chi, Ronaldo Ferreira, Ananth Grama, Tzvetan Horozov, Ioannis Ioannidis, Mehmet Koyuturk, Shan Lei, Robert Light,

More information

Geometric Modeling in Graphics

Geometric Modeling in Graphics Geometric Modeling in Graphics Part 10: Surface reconstruction Martin Samuelčík www.sccg.sk/~samuelcik samuelcik@sccg.sk Curve, surface reconstruction Finding compact connected orientable 2-manifold surface

More information

Lecture 15: More Iterative Ideas

Lecture 15: More Iterative Ideas Lecture 15: More Iterative Ideas David Bindel 15 Mar 2010 Logistics HW 2 due! Some notes on HW 2. Where we are / where we re going More iterative ideas. Intro to HW 3. More HW 2 notes See solution code!

More information

Performance Evaluation of a New Parallel Preconditioner

Performance Evaluation of a New Parallel Preconditioner Performance Evaluation of a New Parallel Preconditioner Keith D. Gremban Gary L. Miller October 994 CMU-CS-94-25 Marco Zagha School of Computer Science Carnegie Mellon University Pittsburgh, PA 523 This

More information

Fast Multipole Method on GPU: Tackling 3-D Capacitance Extraction on Massively Parallel SIMD Platforms

Fast Multipole Method on GPU: Tackling 3-D Capacitance Extraction on Massively Parallel SIMD Platforms Fast Multipole Method on GPU: Tackling 3-D Capacitance Extraction on Massively Parallel SIMD Platforms Xueqian Zhao Department of ECE Michigan Technological University Houghton, MI, 4993 xueqianz@mtu.edu

More information

Highly Parallel Multigrid Solvers for Multicore and Manycore Processors

Highly Parallel Multigrid Solvers for Multicore and Manycore Processors Highly Parallel Multigrid Solvers for Multicore and Manycore Processors Oleg Bessonov (B) Institute for Problems in Mechanics of the Russian Academy of Sciences, 101, Vernadsky Avenue, 119526 Moscow, Russia

More information

UNIVERSITY OF CALGARY. Subdivision Surfaces. Advanced Geometric Modeling Faramarz Samavati

UNIVERSITY OF CALGARY. Subdivision Surfaces. Advanced Geometric Modeling Faramarz Samavati Subdivision Surfaces Surfaces Having arbitrary Topologies Tensor Product Surfaces Non Tensor Surfaces We can t find u-curves and v-curves in general surfaces General Subdivision Coarse mesh Subdivision

More information

Scientific Computing on Graphical Processors: FMM, Flagon, Signal Processing, Plasma and Astrophysics

Scientific Computing on Graphical Processors: FMM, Flagon, Signal Processing, Plasma and Astrophysics Scientific Computing on Graphical Processors: FMM, Flagon, Signal Processing, Plasma and Astrophysics Ramani Duraiswami Computer Science & UMIACS University of Maryland, College Park Joint work with Nail

More information

Lecture 2: Introduction

Lecture 2: Introduction Lecture 2: Introduction v2015.0 Release ANSYS HFSS for Antenna Design 1 2015 ANSYS, Inc. Multiple Advanced Techniques Allow HFSS to Excel at a Wide Variety of Applications Platform Integration and RCS

More information

Ripplet: a New Transform for Feature Extraction and Image Representation

Ripplet: a New Transform for Feature Extraction and Image Representation Ripplet: a New Transform for Feature Extraction and Image Representation Dr. Dapeng Oliver Wu Joint work with Jun Xu Department of Electrical and Computer Engineering University of Florida Outline Motivation

More information

GPU-Accelerated Algebraic Multigrid for Commercial Applications. Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA

GPU-Accelerated Algebraic Multigrid for Commercial Applications. Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA GPU-Accelerated Algebraic Multigrid for Commercial Applications Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA ANSYS Fluent 2 Fluent control flow Accelerate this first Non-linear iterations Assemble

More information

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,

More information

Feasibility of a meshfree post-processing for boundary element methods

Feasibility of a meshfree post-processing for boundary element methods Boundary Elements and Other Mesh Reduction Methods XXXVIII 327 Feasibility of a meshfree post-processing for boundary element methods A. Buchau & W. M. Rucker Institut für Theorie der Elektrotechnik, Universität

More information

Fast Multipole BEM Modeling of Head Related Transfer Functions of a Dummy Head and Torso

Fast Multipole BEM Modeling of Head Related Transfer Functions of a Dummy Head and Torso Fast Multipole BEM Modeling of Head Related Transfer Functions of a Dummy Head and Torso P. Fiala 1,2, J. Huijssen 2, B. Pluymers 2, R. Hallez 3, W. Desmet 2 1 Budapest University of Technology and Economics,

More information

Optimum Array Processing

Optimum Array Processing Optimum Array Processing Part IV of Detection, Estimation, and Modulation Theory Harry L. Van Trees WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Preface xix 1 Introduction 1 1.1 Array Processing

More information

Fast Methods with Sieve

Fast Methods with Sieve Fast Methods with Sieve Matthew G Knepley Mathematics and Computer Science Division Argonne National Laboratory August 12, 2008 Workshop on Scientific Computing Simula Research, Oslo, Norway M. Knepley

More information

Empirical Analysis of Space Filling Curves for Scientific Computing Applications

Empirical Analysis of Space Filling Curves for Scientific Computing Applications Empirical Analysis of Space Filling Curves for Scientific Computing Applications Daryl DeFord 1 Ananth Kalyanaraman 2 1 Department of Mathematics 2 School of Electrical Engineering and Computer Science

More information