Structure-preserving Smoothing for Seismic Amplitude Data by Anisotropic Diffusion using GPGPU

Size: px
Start display at page:

Download "Structure-preserving Smoothing for Seismic Amplitude Data by Anisotropic Diffusion using GPGPU"

Transcription

1 GPU Technology Conference 2016 April, 4-7 San Jose, CA, USA Structure-preserving Smoothing for Seismic Amplitude Data by Anisotropic Diffusion using GPGPU Joner Duarte

2 Outline Introduction Why is noise attenuation important? Objective Related Works Structure-preserving Smoothing Method Parallel Approach Results Conclusion 2

3 Introduction Why is noise attenuation important? Original seismic data crossline 905 of F3 block 1 Zoom area from the input data and the corresponding filtered image. 3 1 Volume of the Netherlands offshore F3 block downloaded from the Opendtect website

4 Introduction Why is noise attenuation important? Preserve relevant structural features is fundamental 4 Original image Gaussian filter with radius 1.0 Proposed filter

5 Introduction Why is noise attenuation important? Seismic attributes have been used to accelerate interpretations, and their results are directly related to the quality of the seismic data. 5 Input seismic data inline 640 of the Kerry data 1 1 Kerry data from New Zealand provided for use by New Zealand Petroleum and Minerals Zoom area of the input data and the corresponding semblance attribute Filtered data and the corresponding semblance attribute

6 Introduction Why is noise attenuation important? Curvature attributes are widely used to improve seismic interpretation process. The results of curvature attribute can be directly improved by our iterative noise attenuation process. Original Filtered 3x 6 Zoomed areas of time slice 396ms of F3 Block data Original Filtered 3x

7 Introduction Objective Remove noise preserving relevant details such as structural and stratigraphic discontinuities in interactive time. 7

8 Related Works [1990] Perona and Malik, Scale-Space and Edge Detection Using Anisotropic Diffusion [2002] Hocker and Fehmers, Fast structural interpretation with structure-oriented filtering [2009] Dave Hale, Structure-oriented smoothing and semblance [2011] Baddari et al., Seismic Noise Attenuation by Means of an Anisotropic Non-linear Diffusion Filter 8

9 Structure-preserving Smoothing Method Input Volume Do you wish iterate again? STEP 1: Computing Seismic Attributes STEP 2: Applying The Anisotropic Diffusion Filter Filtered Volume Overview Pampanelli P., Seismic amplitude smoothing by anisotropic diffusion preserving structural features, 2015 Use the instantaneous phase as horizon indicator attribute Two steps Compute the constraints Assemble and solve the system of equations Iterable 9

10 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes STEP 2: Applying The Anisotropic Diffusion Filter Compute Hilbert Transform Y t Compute Seismic Complex Trace Z t = X t + iy t Do you wish iterate again? Filtered Volume 10

11 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes STEP 2: Applying The Anisotropic Diffusion Filter Compute Horizon Indicator Attribute¹ Φ = δφ δx = 1 X 2 + Y 2 δφ δy = 1 X 2 + Y 2 δφ δx, δφ δy δy X δx δy X δy Y δx δx Y δx δy 11 Do you wish iterate again? Filtered Volume where Φ is the instantaneous phase attribute 1 Silva, P.M. et al., Horizon indicator attributes and applications. SEG Technical Program Expanded Abstracts 2012.

12 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes Compute Horizon Indicator Attribute¹ Φ = δφ δx, δφ δy STEP 2: Applying The Anisotropic Diffusion Filter instantaneous phase gradient Φ Φ Do you wish iterate again? Filtered Volume horizon 12

13 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes Compute Fault Attribute¹ F max = MAX X. Φ, Y. Φ Do you wish iterate again? STEP 2: Applying The Anisotropic Diffusion Filter Filtered Volume where X represents the normalized vector of the amplitude gradient and F max 1, 1 1 Pampaneli et al., A new fault-enhancement attribute based on first order directional derivatives of complex trace, EAGE,

14 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes STEP 2: Applying The Anisotropic Diffusion Filter Diffusion Tensor D = Φ Φ T Fault Preserving Factor ε = F max 2 Do you wish iterate again? Filtered Volume 14

15 Structure-preserving Smoothing Method Input Volume STEP 1: Computing Seismic Attributes The Anisotropic Diffusion Filter Solving the linear system Ax = b of size m x m, where m = w x h Stencil 3x3 STEP 2: Applying The Anisotropic Diffusion Filter δu δt = εd u Do you wish iterate again? Filtered Volume u n+1 x,y = u n x,y + t ε x,y n D n x,y n+1 u x,y 15

16 Parallel Approach Compute Seismic Attributes (STEP 1) CPU FFTW3 OpenMP 1 loop for each trace (Hilbert Transform) 1 loop with 1 inner loop Memory used in this step Input seismic section Hilbert Transform Y(t) Imaginary part of complex trace Horizon Atribute Instantaneous phase gradient GPU CUFFT 2 kernels (Hilbert Transform) 1 kernel to compute attributes 1 thread per sample Fault Attribute (ε) dxx dxy dyy 16 Diffusion tensor

17 Parallel Approach Assemble and Solve the System of Equations (STEP 2) System solver 50~70% of total time of one iteration Libraries for solving sparse linear systems Sparse matrix format: CSR (Compressed Sparse Row) Memory used in this step 31x input image size A x b A x = b 19x Input section size Input Input Input + 1x + 1x + 10x = section size section size section size 31x Input section size CSR Filtered image Input image NUMERICAL METHOD 17

18 Parallel Approach Numerical Methods Conjugate gradient (CG) Symmetric linear systems Symmetric and positive-definite matrix Ax = b => A T Ax = A T b Stop criteria Error tolerance: 10-4 Limit of 100 iterations No preconditioner used Biconjugate gradient stabilized (BiCGStab) Generalized minimal residual (GMRES) 18

19 Parallel Approach Libraries Intel MKL 11.3 CUSP ViennaCL There are other libraries that we may try in the future (MUMPS, cusolver, PARALUTION, etc) 19

20 Results HP Z820 Workstation 64 GB RAM Intel(R) Xeon(R) E cores (12 threads) Inline 240 of the volume of the Netherlands offshore F3 block¹ 951 x 462 Tesla k40 12 GB 2880 processors Linear system size: x Volume of the Netherlands offshore F3 block downloaded from the Opendtect website 20

21 Results Inline 240 of the volume of the Netherlands offshore F3 block Original slice Filtered Slice with 3 Iterations 21

22 Results Inline 240 of the volume of the Netherlands offshore F3 block Original slice 1 iteration 3 iterations 5 iterations 10 iterations 22

23 Results Number of iterations MKL (ms) CG CUSP (ms) ViennaCL (ms) x 16x 14x 12x 10x 8x 6x 4x 2x GPU Speedup using CG Iterations MKL CUSP ViennaCL 23

24 Results Number of iterations MKL (ms) BiCGSTAB CUSP (ms) ViennaCL (ms) x 16x 14x 12x 10x 8x 6x 4x 2x GPU Speedup using BiCGStab Iterations MKL CUSP ViennaCL 24

25 Results Number of iterations MKL (ms) GMRES CUSP (ms) ViennaCL (ms) x 16x 14x 12x 10x 8x 6x 4x 2x GPU Speedup using GMRES Iterations MKL CUSP ViennaCL 25

26 Time (ms) Time (ms) Results Filtering time with 1 iteration Filtering time with 10 iterations CG BiCGStab GMRES CG BiCGStab GMRES MKL ViennaCL CUSP MKL ViennaCL CUSP 26

27 Conclusions - Method that attenuates noise preserving structural features - Improves interpretation eficiency and also can enhance the results of other seismic attributes - Fast enough to be interactive 27

28 Conclusions - It can be used on demand during the interpretation process and also be integrated with tools like Cintiq tablet - CUSP library proved to be very fast for our problem with a high level interface that makes it very simple to use 28

29 Acknowledgements Questions??? 29

HYPERDRIVE IMPLEMENTATION AND ANALYSIS OF A PARALLEL, CONJUGATE GRADIENT LINEAR SOLVER PROF. BRYANT PROF. KAYVON 15618: PARALLEL COMPUTER ARCHITECTURE

HYPERDRIVE IMPLEMENTATION AND ANALYSIS OF A PARALLEL, CONJUGATE GRADIENT LINEAR SOLVER PROF. BRYANT PROF. KAYVON 15618: PARALLEL COMPUTER ARCHITECTURE HYPERDRIVE IMPLEMENTATION AND ANALYSIS OF A PARALLEL, CONJUGATE GRADIENT LINEAR SOLVER AVISHA DHISLE PRERIT RODNEY ADHISLE PRODNEY 15618: PARALLEL COMPUTER ARCHITECTURE PROF. BRYANT PROF. KAYVON LET S

More information

OpenFOAM + GPGPU. İbrahim Özküçük

OpenFOAM + GPGPU. İbrahim Özküçük OpenFOAM + GPGPU İbrahim Özküçük Outline GPGPU vs CPU GPGPU plugins for OpenFOAM Overview of Discretization CUDA for FOAM Link (cufflink) Cusp & Thrust Libraries How Cufflink Works Performance data of

More information

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS John R Appleyard Jeremy D Appleyard Polyhedron Software with acknowledgements to Mark A Wakefield Garf Bowen Schlumberger Outline of Talk Reservoir

More information

On the Parallel Solution of Sparse Triangular Linear Systems. M. Naumov* San Jose, CA May 16, 2012 *NVIDIA

On the Parallel Solution of Sparse Triangular Linear Systems. M. Naumov* San Jose, CA May 16, 2012 *NVIDIA On the Parallel Solution of Sparse Triangular Linear Systems M. Naumov* San Jose, CA May 16, 2012 *NVIDIA Why Is This Interesting? There exist different classes of parallel problems Embarrassingly parallel

More information

GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS. Kyle Spagnoli. Research EM Photonics 3/20/2013

GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS. Kyle Spagnoli. Research EM Photonics 3/20/2013 GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS Kyle Spagnoli Research Engineer @ EM Photonics 3/20/2013 INTRODUCTION» Sparse systems» Iterative solvers» High level benchmarks»

More information

Large scale Imaging on Current Many- Core Platforms

Large scale Imaging on Current Many- Core Platforms Large scale Imaging on Current Many- Core Platforms SIAM Conf. on Imaging Science 2012 May 20, 2012 Dr. Harald Köstler Chair for System Simulation Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen,

More information

A MAP Algorithm for AVO Seismic Inversion Based on the Mixed (L 2, non-l 2 ) Norms to Separate Primary and Multiple Signals in Slowness Space

A MAP Algorithm for AVO Seismic Inversion Based on the Mixed (L 2, non-l 2 ) Norms to Separate Primary and Multiple Signals in Slowness Space A MAP Algorithm for AVO Seismic Inversion Based on the Mixed (L 2, non-l 2 ) Norms to Separate Primary and Multiple Signals in Slowness Space Harold Ivan Angulo Bustos Rio Grande do Norte State University

More information

Exploiting GPU Caches in Sparse Matrix Vector Multiplication. Yusuke Nagasaka Tokyo Institute of Technology

Exploiting GPU Caches in Sparse Matrix Vector Multiplication. Yusuke Nagasaka Tokyo Institute of Technology Exploiting GPU Caches in Sparse Matrix Vector Multiplication Yusuke Nagasaka Tokyo Institute of Technology Sparse Matrix Generated by FEM, being as the graph data Often require solving sparse linear equation

More information

PARALUTION - a Library for Iterative Sparse Methods on CPU and GPU

PARALUTION - a Library for Iterative Sparse Methods on CPU and GPU - a Library for Iterative Sparse Methods on CPU and GPU Dimitar Lukarski Division of Scientific Computing Department of Information Technology Uppsala Programming for Multicore Architectures Research Center

More information

On Level Scheduling for Incomplete LU Factorization Preconditioners on Accelerators

On Level Scheduling for Incomplete LU Factorization Preconditioners on Accelerators On Level Scheduling for Incomplete LU Factorization Preconditioners on Accelerators Karl Rupp, Barry Smith rupp@mcs.anl.gov Mathematics and Computer Science Division Argonne National Laboratory FEMTEC

More information

Porting the NAS-NPB Conjugate Gradient Benchmark to CUDA. NVIDIA Corporation

Porting the NAS-NPB Conjugate Gradient Benchmark to CUDA. NVIDIA Corporation Porting the NAS-NPB Conjugate Gradient Benchmark to CUDA NVIDIA Corporation Outline! Overview of CG benchmark! Overview of CUDA Libraries! CUSPARSE! CUBLAS! Porting Sequence! Algorithm Analysis! Data/Code

More information

Accelerating a Simulation of Type I X ray Bursts from Accreting Neutron Stars Mark Mackey Professor Alexander Heger

Accelerating a Simulation of Type I X ray Bursts from Accreting Neutron Stars Mark Mackey Professor Alexander Heger Accelerating a Simulation of Type I X ray Bursts from Accreting Neutron Stars Mark Mackey Professor Alexander Heger The goal of my project was to develop an optimized linear system solver to shorten the

More information

GPU Acceleration for Seismic Interpretation Algorithms

GPU Acceleration for Seismic Interpretation Algorithms GPU Acceleration for Seismic Interpretation Algorithms Jon Marbach, Ph.D. Pate Motter TerraSpark Geosciences 2012 TerraSpark Geosciences, LLC. All Rights Reserved. Talk overview Who am I? (~1min) Who is

More information

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016 ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016 Challenges What is Algebraic Multi-Grid (AMG)? AGENDA Why use AMG? When to use AMG? NVIDIA AmgX Results 2

More information

Application of GPU technology to OpenFOAM simulations

Application of GPU technology to OpenFOAM simulations Application of GPU technology to OpenFOAM simulations Jakub Poła, Andrzej Kosior, Łukasz Miroslaw jakub.pola@vratis.com, www.vratis.com Wroclaw, Poland Agenda Motivation Partial acceleration SpeedIT OpenFOAM

More information

Iterative Fault Enhancement Workflow

Iterative Fault Enhancement Workflow Iterative Fault Enhancement Workflow. Introduction A greater degree of filtering can be obtained using three methods: (1) applying more aggressive filter parameters using the original window, (2) applying

More information

Two-Phase flows on massively parallel multi-gpu clusters

Two-Phase flows on massively parallel multi-gpu clusters Two-Phase flows on massively parallel multi-gpu clusters Peter Zaspel Michael Griebel Institute for Numerical Simulation Rheinische Friedrich-Wilhelms-Universität Bonn Workshop Programming of Heterogeneous

More information

Paralution & ViennaCL

Paralution & ViennaCL Paralution & ViennaCL Clemens Schiffer June 12, 2014 Clemens Schiffer (Uni Graz) Paralution & ViennaCL June 12, 2014 1 / 32 Introduction Clemens Schiffer (Uni Graz) Paralution & ViennaCL June 12, 2014

More information

THE DEVELOPMENT OF THE POTENTIAL AND ACADMIC PROGRAMMES OF WROCLAW UNIVERISTY OF TECH- NOLOGY ITERATIVE LINEAR SOLVERS

THE DEVELOPMENT OF THE POTENTIAL AND ACADMIC PROGRAMMES OF WROCLAW UNIVERISTY OF TECH- NOLOGY ITERATIVE LINEAR SOLVERS ITERATIVE LIEAR SOLVERS. Objectives The goals of the laboratory workshop are as follows: to learn basic properties of iterative methods for solving linear least squares problems, to study the properties

More information

PhD Student. Associate Professor, Co-Director, Center for Computational Earth and Environmental Science. Abdulrahman Manea.

PhD Student. Associate Professor, Co-Director, Center for Computational Earth and Environmental Science. Abdulrahman Manea. Abdulrahman Manea PhD Student Hamdi Tchelepi Associate Professor, Co-Director, Center for Computational Earth and Environmental Science Energy Resources Engineering Department School of Earth Sciences

More information

Sparse Matrices. This means that for increasing problem size the matrices become sparse and sparser. O. Rheinbach, TU Bergakademie Freiberg

Sparse Matrices. This means that for increasing problem size the matrices become sparse and sparser. O. Rheinbach, TU Bergakademie Freiberg Sparse Matrices Many matrices in computing only contain a very small percentage of nonzeros. Such matrices are called sparse ( dünn besetzt ). Often, an upper bound on the number of nonzeros in a row can

More information

Figure 6.1: Truss topology optimization diagram.

Figure 6.1: Truss topology optimization diagram. 6 Implementation 6.1 Outline This chapter shows the implementation details to optimize the truss, obtained in the ground structure approach, according to the formulation presented in previous chapters.

More information

Contents. I The Basic Framework for Stationary Problems 1

Contents. I The Basic Framework for Stationary Problems 1 page v Preface xiii I The Basic Framework for Stationary Problems 1 1 Some model PDEs 3 1.1 Laplace s equation; elliptic BVPs... 3 1.1.1 Physical experiments modeled by Laplace s equation... 5 1.2 Other

More information

Speedup Altair RADIOSS Solvers Using NVIDIA GPU

Speedup Altair RADIOSS Solvers Using NVIDIA GPU Innovation Intelligence Speedup Altair RADIOSS Solvers Using NVIDIA GPU Eric LEQUINIOU, HPC Director Hongwei Zhou, Senior Software Developer May 16, 2012 Innovation Intelligence ALTAIR OVERVIEW Altair

More information

PARDISO - PARallel DIrect SOlver to solve SLAE on shared memory architectures

PARDISO - PARallel DIrect SOlver to solve SLAE on shared memory architectures PARDISO - PARallel DIrect SOlver to solve SLAE on shared memory architectures Solovev S. A, Pudov S.G sergey.a.solovev@intel.com, sergey.g.pudov@intel.com Intel Xeon, Intel Core 2 Duo are trademarks of

More information

Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs

Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,

More information

Large Displacement Optical Flow & Applications

Large Displacement Optical Flow & Applications Large Displacement Optical Flow & Applications Narayanan Sundaram, Kurt Keutzer (Parlab) In collaboration with Thomas Brox (University of Freiburg) Michael Tao (University of California Berkeley) Parlab

More information

Multi-GPU Scaling of Direct Sparse Linear System Solver for Finite-Difference Frequency-Domain Photonic Simulation

Multi-GPU Scaling of Direct Sparse Linear System Solver for Finite-Difference Frequency-Domain Photonic Simulation Multi-GPU Scaling of Direct Sparse Linear System Solver for Finite-Difference Frequency-Domain Photonic Simulation 1 Cheng-Han Du* I-Hsin Chung** Weichung Wang* * I n s t i t u t e o f A p p l i e d M

More information

Enhancing faults and axial planes Program fault_enhancement

Enhancing faults and axial planes Program fault_enhancement Enhancing faults and axial planes Program fault_enhancement Overview The fault enhancement attribute is a post-stack attribute which enhances locally planar features within a seismic attribute volume.

More information

Robust Simulation of Sparsely Sampled Thin Features in SPH-Based Free Surface Flows

Robust Simulation of Sparsely Sampled Thin Features in SPH-Based Free Surface Flows Copyright of figures and other materials in the paper belong to original authors. Robust Simulation of Sparsely Sampled Thin Features in SPH-Based Free Surface Flows Xiaowei He et al. ACM SIGGRAPH 2015

More information

Accelerating GPU computation through mixed-precision methods. Michael Clark Harvard-Smithsonian Center for Astrophysics Harvard University

Accelerating GPU computation through mixed-precision methods. Michael Clark Harvard-Smithsonian Center for Astrophysics Harvard University Accelerating GPU computation through mixed-precision methods Michael Clark Harvard-Smithsonian Center for Astrophysics Harvard University Outline Motivation Truncated Precision using CUDA Solving Linear

More information

ESPRESO ExaScale PaRallel FETI Solver. Hybrid FETI Solver Report

ESPRESO ExaScale PaRallel FETI Solver. Hybrid FETI Solver Report ESPRESO ExaScale PaRallel FETI Solver Hybrid FETI Solver Report Lubomir Riha, Tomas Brzobohaty IT4Innovations Outline HFETI theory from FETI to HFETI communication hiding and avoiding techniques our new

More information

FAULT DETECTION IN SEISMIC DATASETS USING HOUGH TRANSFORM. Zhen Wang and Ghassan AlRegib

FAULT DETECTION IN SEISMIC DATASETS USING HOUGH TRANSFORM. Zhen Wang and Ghassan AlRegib 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) FAULT DETECTION IN SEISMIC DATASETS USING HOUGH TRANSFORM Zhen Wang and Ghassan AlRegib Center for Energy and Geo Processing

More information

Anisotropy-preserving 5D interpolation by hybrid Fourier transform

Anisotropy-preserving 5D interpolation by hybrid Fourier transform Anisotropy-preserving 5D interpolation by hybrid Fourier transform Juefu Wang and Shaowu Wang, CGG Summary We present an anisotropy-preserving interpolation method based on a hybrid 5D Fourier transform,

More information

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA NVIDIA and HPC Evolution of GPUs Public, based in Santa Clara, CA ~$4B revenue ~5,500 employees Founded in 1999 with primary business in

More information

Intel MKL Sparse Solvers. Software Solutions Group - Developer Products Division

Intel MKL Sparse Solvers. Software Solutions Group - Developer Products Division Intel MKL Sparse Solvers - Agenda Overview Direct Solvers Introduction PARDISO: main features PARDISO: advanced functionality DSS Performance data Iterative Solvers Performance Data Reference Copyright

More information

Image-guided 3D interpolation of borehole data Dave Hale, Center for Wave Phenomena, Colorado School of Mines

Image-guided 3D interpolation of borehole data Dave Hale, Center for Wave Phenomena, Colorado School of Mines Image-guided 3D interpolation of borehole data Dave Hale, Center for Wave Phenomena, Colorado School of Mines SUMMARY A blended neighbor method for image-guided interpolation enables resampling of borehole

More information

Matrix-free IPM with GPU acceleration

Matrix-free IPM with GPU acceleration Matrix-free IPM with GPU acceleration Julian Hall, Edmund Smith and Jacek Gondzio School of Mathematics University of Edinburgh jajhall@ed.ac.uk 29th June 2011 Linear programming theory Primal-dual pair

More information

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,

More information

Automated Finite Element Computations in the FEniCS Framework using GPUs

Automated Finite Element Computations in the FEniCS Framework using GPUs Automated Finite Element Computations in the FEniCS Framework using GPUs Florian Rathgeber (f.rathgeber10@imperial.ac.uk) Advanced Modelling and Computation Group (AMCG) Department of Earth Science & Engineering

More information

Multi-GPU simulations in OpenFOAM with SpeedIT technology.

Multi-GPU simulations in OpenFOAM with SpeedIT technology. Multi-GPU simulations in OpenFOAM with SpeedIT technology. Attempt I: SpeedIT GPU-based library of iterative solvers for Sparse Linear Algebra and CFD. Current version: 2.2. Version 1.0 in 2008. CMRS format

More information

GPU-based Parallel Reservoir Simulators

GPU-based Parallel Reservoir Simulators GPU-based Parallel Reservoir Simulators Zhangxin Chen 1, Hui Liu 1, Song Yu 1, Ben Hsieh 1 and Lei Shao 1 Key words: GPU computing, reservoir simulation, linear solver, parallel 1 Introduction Nowadays

More information

Report of Linear Solver Implementation on GPU

Report of Linear Solver Implementation on GPU Report of Linear Solver Implementation on GPU XIANG LI Abstract As the development of technology and the linear equation solver is used in many aspects such as smart grid, aviation and chemical engineering,

More information

NEW ADVANCES IN GPU LINEAR ALGEBRA

NEW ADVANCES IN GPU LINEAR ALGEBRA GTC 2012: NEW ADVANCES IN GPU LINEAR ALGEBRA Kyle Spagnoli EM Photonics 5/16/2012 QUICK ABOUT US» HPC/GPU Consulting Firm» Specializations in:» Electromagnetics» Image Processing» Fluid Dynamics» Linear

More information

Mixed Precision Methods

Mixed Precision Methods Mixed Precision Methods Mixed precision, use the lowest precision required to achieve a given accuracy outcome " Improves runtime, reduce power consumption, lower data movement " Reformulate to find correction

More information

Matrix-free multi-gpu Implementation of Elliptic Solvers for strongly anisotropic PDEs

Matrix-free multi-gpu Implementation of Elliptic Solvers for strongly anisotropic PDEs Iterative Solvers Numerical Results Conclusion and outlook 1/18 Matrix-free multi-gpu Implementation of Elliptic Solvers for strongly anisotropic PDEs Eike Hermann Müller, Robert Scheichl, Eero Vainikko

More information

Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs

Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs M. J. McNenly and R. A. Whitesides GPU Technology Conference March 27, 2014 San Jose, CA LLNL-PRES-652254! This work performed under

More information

ME964 High Performance Computing for Engineering Applications

ME964 High Performance Computing for Engineering Applications ME964 High Performance Computing for Engineering Applications Outlining Midterm Projects Topic 3: GPU-based FEA Topic 4: GPU Direct Solver for Sparse Linear Algebra March 01, 2011 Dan Negrut, 2011 ME964

More information

High Performance Computing: Tools and Applications

High Performance Computing: Tools and Applications High Performance Computing: Tools and Applications Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology Lecture 15 Numerically solve a 2D boundary value problem Example:

More information

Volumetric Attributes: Constrained least-squares spectral analysis Program spec_clssa

Volumetric Attributes: Constrained least-squares spectral analysis Program spec_clssa CONSTRAINED LEAST-SQUARES SPECTRAL ANALYSIS PROGRAM spec_clssa Alternative Spectral Decomposition Algorithms Spectral decomposition methods can be divided into three classes: those that use quadratic forms,

More information

Performance and accuracy of hardware-oriented native-, solvers in FEM simulations

Performance and accuracy of hardware-oriented native-, solvers in FEM simulations Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations Dominik Göddeke Angewandte Mathematik und Numerik, Universität Dortmund Acknowledgments Joint

More information

Fault Detection Using Color Blending and Color Transformations

Fault Detection Using Color Blending and Color Transformations Fault Detection Using Color Blending and Color Transformations Zhen Wang, Dogancan Temel, and Ghassan AlRegib Center for Energy and Geo Processing - CeGP School of Electrical and Computer Engineering Georgia

More information

3-D vertical cable processing using EOM

3-D vertical cable processing using EOM Carlos Rodriguez-Suarez, John C. Bancroft, Yong Xu and Robert R. Stewart ABSTRACT Three-dimensional seismic data using vertical cables was modeled and processed using equivalent offset migration (EOM),

More information

Modern GPUs (Graphics Processing Units)

Modern GPUs (Graphics Processing Units) Modern GPUs (Graphics Processing Units) Powerful data parallel computation platform. High computation density, high memory bandwidth. Relatively low cost. NVIDIA GTX 580 512 cores 1.6 Tera FLOPs 1.5 GB

More information

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation Ray Browell nvidia Technology Theater SC12 1 2012 ANSYS, Inc. nvidia Technology Theater SC12 HPC Revolution Recent

More information

AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015

AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015 AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015 Agenda Introduction to AmgX Current Capabilities Scaling V2.0 Roadmap for the future 2 AmgX Fast, scalable linear solvers, emphasis on iterative

More information

EFFICIENT SOLVER FOR LINEAR ALGEBRAIC EQUATIONS ON PARALLEL ARCHITECTURE USING MPI

EFFICIENT SOLVER FOR LINEAR ALGEBRAIC EQUATIONS ON PARALLEL ARCHITECTURE USING MPI EFFICIENT SOLVER FOR LINEAR ALGEBRAIC EQUATIONS ON PARALLEL ARCHITECTURE USING MPI 1 Akshay N. Panajwar, 2 Prof.M.A.Shah Department of Computer Science and Engineering, Walchand College of Engineering,

More information

Volumetric curvature attributes adding value to 3D seismic data interpretation

Volumetric curvature attributes adding value to 3D seismic data interpretation Satinder Chopra* and Kurt J. Marfurt + *rcis Corporation, Calgary; + University of Houston, Houston Summary During the last few years, curvature attributes have been found to be useful for delineating

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 20: Sparse Linear Systems; Direct Methods vs. Iterative Methods Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 26

More information

MAGMA a New Generation of Linear Algebra Libraries for GPU and Multicore Architectures

MAGMA a New Generation of Linear Algebra Libraries for GPU and Multicore Architectures MAGMA a New Generation of Linear Algebra Libraries for GPU and Multicore Architectures Stan Tomov Innovative Computing Laboratory University of Tennessee, Knoxville OLCF Seminar Series, ORNL June 16, 2010

More information

Accelerating the Iterative Linear Solver for Reservoir Simulation

Accelerating the Iterative Linear Solver for Reservoir Simulation Accelerating the Iterative Linear Solver for Reservoir Simulation Wei Wu 1, Xiang Li 2, Lei He 1, Dongxiao Zhang 2 1 Electrical Engineering Department, UCLA 2 Department of Energy and Resources Engineering,

More information

CUDA Accelerated Compute Libraries. M. Naumov

CUDA Accelerated Compute Libraries. M. Naumov CUDA Accelerated Compute Libraries M. Naumov Outline Motivation Why should you use libraries? CUDA Toolkit Libraries Overview of performance CUDA Proprietary Libraries Address specific markets Third Party

More information

A Comparison of Algebraic Multigrid Preconditioners using Graphics Processing Units and Multi-Core Central Processing Units

A Comparison of Algebraic Multigrid Preconditioners using Graphics Processing Units and Multi-Core Central Processing Units A Comparison of Algebraic Multigrid Preconditioners using Graphics Processing Units and Multi-Core Central Processing Units Markus Wagner, Karl Rupp,2, Josef Weinbub Institute for Microelectronics, TU

More information

Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU. Robert Strzodka NVAMG Project Lead

Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU. Robert Strzodka NVAMG Project Lead Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU Robert Strzodka NVAMG Project Lead A Parallel Success Story in Five Steps 2 Step 1: Understand Application ANSYS Fluent Computational Fluid Dynamics

More information

Efficient multigrid solvers for strongly anisotropic PDEs in atmospheric modelling

Efficient multigrid solvers for strongly anisotropic PDEs in atmospheric modelling Iterative Solvers Numerical Results Conclusion and outlook 1/22 Efficient multigrid solvers for strongly anisotropic PDEs in atmospheric modelling Part II: GPU Implementation and Scaling on Titan Eike

More information

Benchmarking of the linear solver PASTIX for integration in LESCAPE. Lucas Lestandi

Benchmarking of the linear solver PASTIX for integration in LESCAPE. Lucas Lestandi Benchmarking of the linear solver PASTIX for integration in LESCAPE Lucas Lestandi Supervisor: Héloïse Beaugendre March 9, 2015 Contents 1 The test case 2 2 Fast Fourier Transform using le Intel MKL library

More information

COMPUTING APPARENT DIP AND AMPLITUDE GRADIENTS PROGRAM apparent_cmpt. Apparent curvature computation flow charts

COMPUTING APPARENT DIP AND AMPLITUDE GRADIENTS PROGRAM apparent_cmpt. Apparent curvature computation flow charts COMPUTING APPARENT DIP AND AMPLITUDE GRADIENTS PROGRAM apparent_cmpt Apparent curvature computation flow charts inline dip crossline dip apparent_cmpt p(φ=-90 0 ) p(φ=0 0 ) p(φ=+30 0 ) p(φ=-60 0 ) p(φ=+60

More information

Parallelization of Shortest Path Graph Kernels on Multi-Core CPUs and GPU

Parallelization of Shortest Path Graph Kernels on Multi-Core CPUs and GPU Parallelization of Shortest Path Graph Kernels on Multi-Core CPUs and GPU Lifan Xu Wei Wang Marco A. Alvarez John Cavazos Dongping Zhang Department of Computer and Information Science University of Delaware

More information

Seismic Attributes on Frequency-enhanced Seismic Data

Seismic Attributes on Frequency-enhanced Seismic Data Seismic Attributes on Frequency-enhanced Seismic Data Satinder Chopra* Arcis Corporation, Calgary, Canada schopra@arcis.com Kurt J. Marfurt The University of Oklahoma, Norman, US and Somanath Misra Arcis

More information

FLATTENING AND GENERATING STRATAL SLICES PROGRAMS flatten, vector_flatten, stratal_slice, and vector_stratal_slice

FLATTENING AND GENERATING STRATAL SLICES PROGRAMS flatten, vector_flatten, stratal_slice, and vector_stratal_slice FLATTENING AND GENERATING STRATAL SLICES PROGRAMS flatten, vector_flatten, stratal_slice, and vector_stratal_slice Extracting phantom horizon and stratal slices is one of the more common interpretation

More information

DIFFERENTIAL. Tomáš Oberhuber, Atsushi Suzuki, Jan Vacata, Vítězslav Žabka

DIFFERENTIAL. Tomáš Oberhuber, Atsushi Suzuki, Jan Vacata, Vítězslav Žabka USE OF FOR Tomáš Oberhuber, Atsushi Suzuki, Jan Vacata, Vítězslav Žabka Faculty of Nuclear Sciences and Physical Engineering Czech Technical University in Prague Mini workshop on advanced numerical methods

More information

GPU DEVELOPMENT & FUTURE PLAN OF MIDAS NFX

GPU DEVELOPMENT & FUTURE PLAN OF MIDAS NFX GPU DEVELOPMENT & FUTURE PLAN OF MIDAS NFX September 22 2015 Noh-hoon Lee lnh0702@midasit.com SOFTWARE ENGINEER / CFD DEVELOPMENT TEAM MIDASIT CONTENTS 1. Introduction to MIDASIT 2. Computing Procedure

More information

Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms

Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms H. Anzt, V. Heuveline Karlsruhe Institute of Technology, Germany

More information

Performance and accuracy of hardware-oriented. native-, solvers in FEM simulations

Performance and accuracy of hardware-oriented. native-, solvers in FEM simulations Robert Strzodka, Stanford University Dominik Göddeke, Universität Dortmund Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations Number of slices

More information

Block Distributed Schur Complement Preconditioners for CFD Computations on Many-Core Systems

Block Distributed Schur Complement Preconditioners for CFD Computations on Many-Core Systems Block Distributed Schur Complement Preconditioners for CFD Computations on Many-Core Systems Dr.-Ing. Achim Basermann, Melven Zöllner** German Aerospace Center (DLR) Simulation- and Software Technology

More information

Efficient 3D Gravity and Magnetic Modeling

Efficient 3D Gravity and Magnetic Modeling Efficient 3D Gravity and Magnetic Modeling X. Li Fugro Gravity & Magnetic Services Inc., Houston, Texas, USA Summary There are many different spatial-domain algorithms for 3D gravity and magnetic forward

More information

CUDA Toolkit 5.0 Performance Report. January 2013

CUDA Toolkit 5.0 Performance Report. January 2013 CUDA Toolkit 5.0 Performance Report January 2013 CUDA Math Libraries High performance math routines for your applications: cufft Fast Fourier Transforms Library cublas Complete BLAS Library cusparse Sparse

More information

Structure-oriented smoothing and semblance

Structure-oriented smoothing and semblance CWP-635 Structure-oriented smoothing and semblance Dave Hale Center for Wave Phenomena, Colorado School of Mines, Golden CO 80401, USA (c) Figure 1. A seismic image after applying structure-oriented smoothing

More information

PARDISO Version Reference Sheet Fortran

PARDISO Version Reference Sheet Fortran PARDISO Version 5.0.0 1 Reference Sheet Fortran CALL PARDISO(PT, MAXFCT, MNUM, MTYPE, PHASE, N, A, IA, JA, 1 PERM, NRHS, IPARM, MSGLVL, B, X, ERROR, DPARM) 1 Please note that this version differs significantly

More information

Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations

Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations Hartwig Anzt 1, Marc Baboulin 2, Jack Dongarra 1, Yvan Fournier 3, Frank Hulsemann 3, Amal Khabou 2, and Yushan Wang 2 1 University

More information

PRESTACK STRUCTURE-ORIENTED FILTERING PROGRAM sof_prestack

PRESTACK STRUCTURE-ORIENTED FILTERING PROGRAM sof_prestack PRESTAK STRUTURE-ORIENTED FILTERING PROGRAM sof_prestack omputation Flow hart Program sof_prestack is a generalization of program sof3d For this reason, the input parameters and workflow of the two algorithms

More information

Iterative Methods for Solving Linear Problems

Iterative Methods for Solving Linear Problems Iterative Methods for Solving Linear Problems When problems become too large (too many data points, too many model parameters), SVD and related approaches become impractical. Iterative Methods for Solving

More information

Accelerating the Hessian-free Gauss-Newton Full-waveform Inversion via Preconditioned Conjugate Gradient Method

Accelerating the Hessian-free Gauss-Newton Full-waveform Inversion via Preconditioned Conjugate Gradient Method Accelerating the Hessian-free Gauss-Newton Full-waveform Inversion via Preconditioned Conjugate Gradient Method Wenyong Pan 1, Kris Innanen 1 and Wenyuan Liao 2 1. CREWES Project, Department of Geoscience,

More information

Lessons Learned in Developing the Linear Algebra Library ViennaCL

Lessons Learned in Developing the Linear Algebra Library ViennaCL Lessons Learned in Developing the Linear Algebra Library ViennaCL Florian Rudolf 1, Karl Rupp 1,2, Josef Weinbub 1 http://karlrupp.net/ 1 Institute for Microelectronics 2 Institute for Analysis and Scientific

More information

Iterative Sparse Triangular Solves for Preconditioning

Iterative Sparse Triangular Solves for Preconditioning Euro-Par 2015, Vienna Aug 24-28, 2015 Iterative Sparse Triangular Solves for Preconditioning Hartwig Anzt, Edmond Chow and Jack Dongarra Incomplete Factorization Preconditioning Incomplete LU factorizations

More information

STRUCTURE-ORIENTED FILTERING OF POSTSTACK DATA Program sof3d

STRUCTURE-ORIENTED FILTERING OF POSTSTACK DATA Program sof3d STRUCTURE-ORIENTED FILTERING OF POSTSTACK DATA Program sof3d Computation flow chart Program sof3d provides the application and blending of anisotropic diffusion filtering described by Fehmers and Höecker

More information

P. Bilsby (WesternGeco), D.F. Halliday* (Schlumberger Cambridge Research) & L.R. West (WesternGeco)

P. Bilsby (WesternGeco), D.F. Halliday* (Schlumberger Cambridge Research) & L.R. West (WesternGeco) I040 Case Study - Residual Scattered Noise Attenuation for 3D Land Seismic Data P. Bilsby (WesternGeco), D.F. Halliday* (Schlumberger Cambridge Research) & L.R. West (WesternGeco) SUMMARY We show that

More information

Available online at ScienceDirect. Parallel Computational Fluid Dynamics Conference (ParCFD2013)

Available online at  ScienceDirect. Parallel Computational Fluid Dynamics Conference (ParCFD2013) Available online at www.sciencedirect.com ScienceDirect Procedia Engineering 61 ( 2013 ) 81 86 Parallel Computational Fluid Dynamics Conference (ParCFD2013) An OpenCL-based parallel CFD code for simulations

More information

Multi-GPU Acceleration of Algebraic Multigrid Preconditioners

Multi-GPU Acceleration of Algebraic Multigrid Preconditioners Multi-GPU Acceleration of Algebraic Multigrid Preconditioners Christian Richter 1, Sebastian Schöps 2, and Markus Clemens 1 Abstract A multi-gpu implementation of Krylov subspace methods with an algebraic

More information

Applications of Berkeley s Dwarfs on Nvidia GPUs

Applications of Berkeley s Dwarfs on Nvidia GPUs Applications of Berkeley s Dwarfs on Nvidia GPUs Seminar: Topics in High-Performance and Scientific Computing Team N2: Yang Zhang, Haiqing Wang 05.02.2015 Overview CUDA The Dwarfs Dynamic Programming Sparse

More information

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances) HPC and IT Issues Session Agenda Deployment of Simulation (Trends and Issues Impacting IT) Discussion Mapping HPC to Performance (Scaling, Technology Advances) Discussion Optimizing IT for Remote Access

More information

GPU Implementation of Elliptic Solvers in NWP. Numerical Weather- and Climate- Prediction

GPU Implementation of Elliptic Solvers in NWP. Numerical Weather- and Climate- Prediction 1/8 GPU Implementation of Elliptic Solvers in Numerical Weather- and Climate- Prediction Eike Hermann Müller, Robert Scheichl Department of Mathematical Sciences EHM, Xu Guo, Sinan Shi and RS: http://arxiv.org/abs/1302.7193

More information

newfasant Periodical Structures User Guide

newfasant Periodical Structures User Guide newfasant Periodical Structures User Guide Software Version: 6.2.10 Date: February 23, 2018 Index 1. FILE MENU 2. EDIT MENU 3. VIEW MENU 4. GEOMETRY MENU 5. MATERIALS MENU 6. CELL MENU 6.1. DEFINE CELL

More information

SeisSpace Software. SeisSpace enables the processor to be able focus on the science instead of being a glorified data manager.

SeisSpace Software. SeisSpace enables the processor to be able focus on the science instead of being a glorified data manager. SeisSpace Software OVERVIEW Delivering large volumes of data quickly and accurately remains the hallmark of any seismic processing system. With intuitive analysis tools, state-of-the-art geophysical algorithms

More information

3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs

3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs 3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs H. Knibbe, C. W. Oosterlee, C. Vuik Abstract We are focusing on an iterative solver for the three-dimensional

More information

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011 ANSYS HPC Technology Leadership Barbara Hutchings barbara.hutchings@ansys.com 1 ANSYS, Inc. September 20, Why ANSYS Users Need HPC Insight you can t get any other way HPC enables high-fidelity Include

More information

Highly Parallel Multigrid Solvers for Multicore and Manycore Processors

Highly Parallel Multigrid Solvers for Multicore and Manycore Processors Highly Parallel Multigrid Solvers for Multicore and Manycore Processors Oleg Bessonov (B) Institute for Problems in Mechanics of the Russian Academy of Sciences, 101, Vernadsky Avenue, 119526 Moscow, Russia

More information

Parallel solution for finite element linear systems of. equations on workstation cluster *

Parallel solution for finite element linear systems of. equations on workstation cluster * Aug. 2009, Volume 6, No.8 (Serial No.57) Journal of Communication and Computer, ISSN 1548-7709, USA Parallel solution for finite element linear systems of equations on workstation cluster * FU Chao-jiang

More information

Efficient Imaging Algorithms on Many-Core Platforms

Efficient Imaging Algorithms on Many-Core Platforms Efficient Imaging Algorithms on Many-Core Platforms H. Köstler Dagstuhl, 22.11.2011 Contents Imaging Applications HDR Compression performance of PDE-based models Image Denoising performance of patch-based

More information

We G High-resolution Tomography Using Offsetdependent Picking and Inversion Conditioned by Image-guided Interpolation

We G High-resolution Tomography Using Offsetdependent Picking and Inversion Conditioned by Image-guided Interpolation We G103 05 High-resolution Tomography Using Offsetdependent Picking and Inversion Conditioned by Image-guided Interpolation G. Hilburn* (TGS), Y. He (TGS), Z. Yan (TGS) & F. Sherrill (TGS) SUMMARY An approach

More information