GPU ACCELERATED TOTAL FOCUSING METHOD IN CIVA
|
|
- Pierce Merritt
- 5 years ago
- Views:
Transcription
1 OPARUS GPU ACCELERATED TOTAL FOCUSING METHOD IN CIVA Authors: Gilles ROUGERON, Jason LAMBERT, Ekaterina IAKOVLEVA, L. LACASSAGNE Presenter: Nicolas DOMINGUEZ QNDE 2013 Baltimore, Md, USA, 24/07/2013 CEA 10 AVRIL 2012 PAGE 1
2 APPLICATION CONTEXT Phased Array Inspection in Industry: Examples AIRCRAFT INDUSTRY PIPE INDUSTRY NUCLEAR INDUSTRY Faster acquisition / Finer resolution / Limited Access Compensation QNDE July 2013 PAGE 2
3 APPLICATION CONTEXT Trade-off elements for practical implementation of phased array inspections NDT PERFORMANCE Good detection Law false alarms Sizing capability (resolution) INDUSTRIAL THROUGHPUT Amount (and type) of acquired data Scanning speed Processing speed Analysis speed (processing + diagnosis) Versatility (ability to cover areas) THE CHOICE OF A PHASED ARRAY SOLUTION IS A COMBINATION OF Device Acquisition settings Processing TFM QNDE July 2013 PAGE 3
4 APPLICATION CONTEXT Phased Array UT & Reconstructions Application of reconstruction algorithms can improve Contrast (Signal to Noise Ratio) Spatial resolution Limited Access Compensation But penalizes inspection speeds Acquisition is usually heavier more shots, more data to upload (ex. FMC) Processing needs to be applied. Work on processing acceleration QNDE July 2013 PAGE 4
5 APPLICATION CONTEXT Challenges with the Total Focusing Method performances TFM is preferentially processed from FMC acquisitions or a subset (SMC) Data acquisition Uploading data flow Data management / Memories Hardware/ Firmware Computation cost: Ultra-fast analysis (post-processing) Real-time reconstruction (with data flow) Firmware/ Software QNDE July 2013 PAGE 5
6 GPU ACCELERATED TFM IN CIVA Synthetic Focusing & Total Focusing Method QNDE July 2013 PAGE 6
7 COMPUTATION TASKS A 2 STEPS PROCESS 1. Time of flight computation 2. Amplitudes summation QNDE July 2013 PAGE 7
8 GPU ACCELERATED TFM IN CIVA Optimization approach Algorithmic optimizations Time of flights in emission and reception are equal => computed only once Time of flights are computed on a coarser grid of the reconstruction area and then interpolated on the refined grid. Time of flights computation Coarser grid = tiles on GPU Pixels Implementation optimization Parallelization Use of GPU devices Comparison GPU (CUDA / OpenCL) & CPU-multi-cores (OpenMP) QNDE July 2013 PAGE 8
9 TIME OF FLIGHT COMPUTATION (x i x i ) 2 +z 2 i Minimizing time of flight T Eip= + (x i x) 2 +z 2 c 1 Requires to solve x i x i c1 x i x i 2 +z i 2 Solved as a 4 th degree polynomial (flat surface). = c 2 x i x c 2 x i x 2 +z 2 (Snell-Law). Curved surfaces implies solving higher degree polynomials depending on the surface (6 th, 10 th or 16 th in the most general case). Laguerre s Method is used: iterative method, finds a real root or two complex conjugate roots at a time, requires deflation. QNDE July 2013 PAGE 9
10 PARALLEL STRATEGY AND MEMORY ACCESS GPU (Cuda/OpenCL) Blocks of threads correspond to tiles of pixels. Step 1 Time of flight at each 4 tile corners are computed by multiple threads. Results are stored in shared memory. Step 2 Simultaneous ToF interpolation + amplitude summation for same couple of E i R j elements for multiple pixels (one thread per pixel). Due to pixel locality and slowly spatially varying time of flights, memory access are enhanced (use of cache). Parallel CPU (OpenMP) Step 1 Parallel computation of time of flights for coarser grid (one CPU thread per grid point). Step 2 Parallel ToF interpolation + amplitude summation (one CPU thread per pixel) QNDE July 2013 PAGE 10
11 PERFORMANCE BENCHMARK Planar specimen Cylindrical specimen Phased array elements: 128 Central frequency: 2 MHz Number of samples per signal: 1031 FMC data : 130 MB Reconstruction zone (mm) : Image definition (pixels) : Phased array elements: 128 Central frequency: 2 MHz Number of samples per signal: 2007 FMC data : 250 MB Reconstruction zone (mm) : Image definition (pixels) : Hardware CPU 2x Xeon X5690 (6 GPU Nvidia: GTX580 (1.5 GB), Tesla C2070 (6 GB), AMD: HD6970 (2 GB) Software CPU OpenMP / Visual C OpenCL / Intel SDK 1.5 & AMD APP SDK 2.7 GPU Cuda 4.2 / NVCC on NVIDIA GTX580, Tesla C2070 OpenCL / AMD APP SDK 2.7 & NVIDIA OpenCL QNDE July 2013 PAGE 11
12 PERFORMANCE BENCHMARK Time (s) CPU GPU OpenMP OpenCL CUDA OpenCL Mono Multi AMD Intel GTX580 C2070 GTX580 C2070 HD6970 Planar Cylindrical Best acceleration performances on GPU with Cuda Good acceleration performances on GPU with OpenCL Good acceleration with OpenMP on CPU OpenCL on CPU performs less well (algorithm optimised for GPU memory access) QNDE July 2013 PAGE 12
13 A CIVA PERSPECTIVE CIVA implementation strategy OpenCL: emerging standard for massively parallel architecture processors programming (CPU/MIC/GPU ). Portability: Same source code executable on different hardwares. Good/acceptable performances compared with native platforms languages/library : i.e. Cuda on Nvidia, OpenMP on CPU. OpenCL is improving on CPU (see latest Intel SDK 2013 with automatic parallelisation and code vectorization) TFM accelerated code in CIVA developed with OpenCL QNDE July 2013 PAGE 13
14 A CIVA PERSPECTIVE Features for TFM on GPU in CIVA Specimen: Flat Cylindrical CAD 2.5D with planar extrusion Isotropic materials Multiple probe positions 2D and 3D ROI Direct Modes with conversion QNDE July 2013 PAGE 14
15 A CIVA PERSPECTIVE Examples of TFM on GPU in CIVA Complex surface specimen 64 elements ROI (60x40mm / 300x200 pixels) Computation time : 0,54s on Nvidia Tesla C2070 Scanning 10 positions Computation time : 4.2s on Nvidia Tesla C2070 QNDE July 2013 PAGE 15
16 A CIVA PERSPECTIVE Examples of TFM on GPU in CIVA Flat surface specimen 128 elements ROI (30x30mm / 400x400 pixels) LL LT = TL TT Complex surface specimen 128 elements 3D ROI (30x30x30mm3 / 80x80x80 voxels) Computation time : 1.8 s on Nvidia Tesla C2070 QNDE July 2013 PAGE 16
17 A CIVA PERSPECTIVE TFM on GPU will be available in a CIVA 11.X Enlarge range of applications Specimen geometries: conical, torical surfaces (polynomial degree up to 16) TFM with skips (corner and indirect echoes) GPU memory limits => «Out of core» A solution when FMC data exceeds GPU memory Accelerate Specialization for optimization: use Newton method for simple surfaces (ex. planar) less generic but more efficient. See also: Friday Session 43 A fast ultrasonic simulation based on GPU QNDE July 2013 PAGE 17
18 Thank you for your attention. Questions? CIVA 11 the new MAJOR version of CIVA is released! Ask for your DVD CEA 10 AVRIL 2012 PAGE 18
Implementation of a GPU accelerated total focusing reconstruction method within CIVA software
Implementation of a GPU accelerated total focusing reconstruction method within CIVA software Gilles Rougeron, Jason Lambert, Ekaterina Iakovleva, Lionel Lacassagne, Nicolas Dominguez To cite this version:
More informationMeasurement of Residual Thickness in Case of Corrosion Close to the Welds with an Adaptive Total Focusing Method
19 th World Conference on Non-Destructive Testing 2016 Measurement of Residual Thickness in Case of Corrosion Close to the Welds with an Adaptive Total Focusing Method Olivier ROY 1, hilippe BENOIST 1,
More informationREAL-TIME ADAPTIVE IMAGING FOR ULTRASONIC NONDESTRUCTIVE TESTING OF STRUCTURES WITH IRREGULAR SHAPES
REAL-TIME ADATIVE IMAGING FOR ULTRASONIC NONDESTRUCTIVE TESTING OF STRUCTURES WITH IRREGULAR SHAES Sébastien Robert, Léonard Le Jeune, Vincent Saint-Martin CEA-LIST, 91191 Gif-sur-Yvette Cedex, France
More informationPORTABLE PHASED-ARRAY ULTRASOUND FULL-FEATURED SYSTEM
PORTABLE PHASED-ARRAY ULTRASOUND FULL-FEATURED SYSTEM GEKKO GEKKO not only offers the features of standard phased-array portable systems (angular scanning, electronic scanning, TOFD, etc.), new advanced
More informationAdvanced ultrasonic 2D Phased-array probes
Advanced ultrasonic 2D Phased-array probes Frédéric REVERDY 1, G. ITHURRALDE 2, Nicolas DOMINGUEZ 1,2 1 CEA, LIST, F-91191, Gif-sur-Yvette cedex, France frederic.reverdy@cea.fr, nicolas.dominguez@cea.fr
More informationUT phased array inspection of turbines
UT phased array inspection of turbines components : experiments and simulation Bruno ISSENMANN Gilles ROUGERON Steve MAHAUT (CEA/LIST) Stéphane LABORDE (ALSTOM Power Service) 1 Industrial context (1) In
More informationULTRASONIC NONDESTRUCTIVE TESTING OF COMPLEX COMPONENTS WITH FLEXIBLE PHASED-ARRAY TRANSDUCERS
ULTRASONIC NONDESTRUCTIVE TESTING OF COMPLEX COMPONENTS WITH FLEXIBLE PHASED-ARRAY TRANSDUCERS 1 CEA-LIST, Gif-sur-Yvette, France 2 M2M, Les Ulis, France 3 IMASONIC, Voray-sur-l Ognon, France O. Casula
More informationApplications of Phased Array Techniques to NDT of Industrial Structures
The 2 nd International Conference on Technical Inspection and NDT (TINDT2008)- October 2008 - Tehran, Iran Applications of Phased Array Techniques to NDT of Industrial Structures Laurent Le ber 1, Olivier
More informationULTRAVISION 3.7R21. Product Bulletin. UltraVision, a complete UT and Phased Array inspection package!
ULTRAVISION 3.7R21 Product Bulletin UltraVision, a complete UT and Phased Array inspection package! www.zetec.com Table of Content Table of Content... 2 Purpose of UltraVision 3.7R21... 3 Corrosion...
More informationReal-time full matrix capture with auto-focussing of known geometry through dual layered media
Real-time full matrix capture with auto-focussing of known geometry through dual layered media Mark Sutcliffe, Miles Weston, Ben Dutton and Ian Cooper TWI NDT Validation Centre Heol Cefn Gwrgan, Margam,
More informationCorrosion detection and measurement improvement using advanced ultrasonic tools
19 th World Conference on Non-Destructive Testing 2016 Corrosion detection and measurement improvement using advanced ultrasonic tools Laurent LE BER 1, Grégoire BENOIST 1, Pascal DAINELLI 2 1 M2M-NDT,
More informationARIA Software. Total Focusing Method. - Real-Time TFM Imaging - Acquire all FMC data - FMC/TFM Wizard - TFM Viewer - Analysis Mode
ARIA Software Total Focusing Method - Real-Time TFM Imaging - Acquire all FMC data - FMC/TFM Wizard - TFM Viewer - Analysis Mode Several Implementations Standard TFM Migration TFM Advanced TFM TFMp Adaptive
More informationMULTI-MODE TFM IMAGING WITH ARTEFACT FILTERING USING CIVA UT FORWARD MODELS
MUI-MODE FM IMAGING WIH AREFAC FIERING USING CIVA U FORWARD MODES Ekaterina IAKOVEVA Sylvain CHAION hilippe BREDIF Steve MAHAU CEA-IS, Saclay OUINE 1. Imaging algorithm otal Focusing Method (FM) 1.1. Full
More informationSimulation of ultrasonic guided wave inspection in CIVA software platform
Simulation of ultrasonic guided wave inspection in CIVA software platform B. CHAPUIS, K. JEZZINE, V. BARONIAN, D. SEGUR and A. LHEMERY 18 April 2012 CIVA: Software for NDT Generalities WHY USING SIMULATION
More informationFast total focusing method for ultrasonic imaging
Fast total focusing method for ultrasonic imaging Ewen Carcreff,a), Gavin Dao 2 and Dominique Braconnier The Phased Array Company, 9365 Allen road, West Chester, Ohio, USA 2 Advanced OEM Solutions, 844
More informationOn Level Scheduling for Incomplete LU Factorization Preconditioners on Accelerators
On Level Scheduling for Incomplete LU Factorization Preconditioners on Accelerators Karl Rupp, Barry Smith rupp@mcs.anl.gov Mathematics and Computer Science Division Argonne National Laboratory FEMTEC
More informationHigh Resolution Phased Array Imaging using the Total Focusing Method
High Resolution Phased Array Imaging using the Total Focusing Method S. Kierspel, Wolfram A. Karl Deutsch, Helge Rast, Philippe Benoist 1, Venkat A 2 KARL DEUTSCH Pruef- und Messgeraetebau GmbH + Co KG
More informationHigh Resolution Phased Array Imaging using the Total Focusing Method
19 th World Conference on Non-Destructive Testing 2016 High Resolution Phased Array Imaging using the Total Focusing Method Wolfram A. Karl DEUTSCH 1, Werner ROYE 1, Helge RAST 1, Philippe BENOIST 2 1
More informationM2M GEKKO. State-of-the-art phased-array flaw detector with TFM
M2M GEKKO State-of-the-art phased-array flaw detector with TFM USER FRIENDLY FLAW DETECTOR Gekko is the only compact phased-array ultrasound testing (PAUT) instrumentation offering intuitive PA features
More informationHARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES. Cliff Woolley, NVIDIA
HARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES Cliff Woolley, NVIDIA PREFACE This talk presents a case study of extracting parallelism in the UMT2013 benchmark for 3D unstructured-mesh
More informationInfluence of Stress Corrosion Crack Morphology on Ultrasonic Examination Performances
Influence of Stress Corrosion Crack Morphology on Ultrasonic Examination Performances - 7th ICNDE in Relation to Structural Integrity for Nuclear and Pressurized Components - O. Dupond EDF R&D/Materials
More informationFull-Matrix Capture with a Customizable Phased Array Instrument
Full-Matrix Capture with a Customizable Phased Array Instrument Gavin Dao 1, a), Dominique Braconnier 2, b) 2, c), and Matt Gruber 1 Advanced OEM Solutions 8044 Montgomery Road #700 Cincinnati OH, 45236,
More informationApplication of the Total Focusing Method for Improved Defect Characterization in the Production of Steel Tubes, Pipes and Plates
19 th World Conference on Non-Destructive Testing 2016 Application of the Total Focusing Method for Improved Defect Characterization in the Production of Steel Tubes, Pipes and Plates Till SCHMITTE 1,
More informationNEW FEATURES FOR PHASED ARRAY TECHNIQUES INSPECTIONS : SIMULATION AND EXPERIMENTS
NEW FEATURES FOR PHASED ARRAY TECHNIQUES INSPECTIONS : SIMULATION AND EXPERIMENTS S. Mahaut 1, S. Chatillon 1, E. Kerbrat 1, J. Porre 1, P. Calmon 1 and O. Roy 2 1 CEA/LIST, Saclay, France; 2 M2M, Saint-Rémy,
More informationMANTIS Compact Phased Array Ultrasonic (PAUT) Flaw Detector featuring TFM, TOFD and Conventional UT
MANTIS Compact Phased Array Ultrasonic (PAUT) Flaw Detector featuring TFM, TOFD and Conventional UT MANTIS - Compact Phased Array Ultrasonic (PAUT) Flaw Detector The MANTIS is a cost-efficient, compact
More informationA FLEXIBLE PHASED ARRAY TRANSDUCER FOR CONTACT EXAMINATION OF COMPONENTS WITH COMPLEX GEOMETRY
A FLEXIBLE PHASED ARRAY TRANSDUCER FOR CONTACT EXAMINATION OF COMPONENTS WITH COMPLEX GEOMETRY O. Casula 1, C. Poidevin 1, G. Cattiaux 2 and G. Fleury 3 1 CEA/LIST, Saclay, France; 2 IRSN/DES, Fontenay-aux-Roses,
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More informationCSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller
Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,
More informationCUDA and OpenCL Implementations of 3D CT Reconstruction for Biomedical Imaging
CUDA and OpenCL Implementations of 3D CT Reconstruction for Biomedical Imaging Saoni Mukherjee, Nicholas Moore, James Brock and Miriam Leeser September 12, 2012 1 Outline Introduction to CT Scan, 3D reconstruction
More informationSIMULATION OF THE UT INSPECTION OF PLANAR DEFECTS USING A GENERIC GTD-KIRCHHOFF APPROACH
ound M athematics L td. SIMULATION OF THE UT INSPECTION OF PLANAR DEFECTS USING A GENERIC GTD-KIRCHHOFF APPROACH V. DORVAL, M. DARMON, S. CHATILLON, L. FRADKIN presented by A. LHEMERY CEA, LIST, France
More informationFinite Element Integration and Assembly on Modern Multi and Many-core Processors
Finite Element Integration and Assembly on Modern Multi and Many-core Processors Krzysztof Banaś, Jan Bielański, Kazimierz Chłoń AGH University of Science and Technology, Mickiewicza 30, 30-059 Kraków,
More informationRECENT MODELLING ADVANCES FOR ULTRASONIC TOFD INSPECTIONS
RECENT MODELLING ADVANCES FOR ULTRASONIC TOFD INSPECTIONS Michel DARMON 1, Adrien FERRAND 1, Vincent DORVAL 1, Sylvain CHATILLON 1 CEA LIST, Gif-sur-Yvette, France michel.darmon@cea.fr QNDE July 2014 OUTLINE
More informationDevelopments in Ultrasonic Phased Array Inspection III
Developments in Ultrasonic Inspection III Ultrasonic Inspection of Welded Pipes Using Wave Mode-Converted at the Inner Surface of the Pipe R. Long, P. Cawley, Imperial College, UK; J. Russell, Rolls-Royce,
More informationAdvanced Ultrasonic 2D Phased-Array Probes
18th World Conference on Nondestructive Testing, 16-20 April 2012, Durban, South Africa Advanced Ultrasonic 2D Phased-Array Probes Frédéric REVERDY 1, G. ITHURRALDE 2, Nicolas DOMINGUEZ 1,2 1 CEA, LIST,
More informationAutomatic analysis and weld indications classification on TFM acquisitions
Automatic analysis and weld indications classification on TFM acquisitions More info about this article: http://www.ndt.net/?id=22984 R. FERNANDEZ 1, S. LONNE 1 Y. RADOVCIC 2, S. LE BERRE 3, F. GRASSIN
More informationNDT OF SPECIMEN OF COMPLEX GEOMETRY USING ULTRASONIC ADAPTIVE
NDT OF SPECIMEN OF COMPLEX GEOMETRY USING ULTRASONIC ADAPTIVE TECHNIQUES - THE F.A.U.S.T. SYSTEM INTRODUCTION O. Roy, S. Mahaut, M. Serre Commissariat a I'Energie Atomique CEAlCEREM, CE Saclay France Phased
More informationDevelopment of a robotic nozzle inspection with a flexible transducer array
19 th World Conference on Non-Destructive Testing 2016 Development of a robotic nozzle inspection with a flexible transducer array Blandine DOBIGNY 1, Olivier WATTIAU 1, Sébastien BEY 2, Arnaud VANHOYE
More informationA Fast GPU-Based Approach to Branchless Distance-Driven Projection and Back-Projection in Cone Beam CT
A Fast GPU-Based Approach to Branchless Distance-Driven Projection and Back-Projection in Cone Beam CT Daniel Schlifske ab and Henry Medeiros a a Marquette University, 1250 W Wisconsin Ave, Milwaukee,
More informationDEVELOPMENT AND VALIDATION OF A FULL MATRIX CAPTURE SOLUTION. Patrick Tremblay, Daniel Richard ZETEC, Canada
DEVELOPMENT AND VALIDATION OF A FULL MATRIX CAPTURE SOLUTION Patrick Tremblay, Daniel Richard ZETEC, Canada ABSTRACT For the last 15 years, phased array has completely changed the face of ultrasonic non-destructive
More informationInternational Supercomputing Conference 2009
International Supercomputing Conference 2009 Implementation of a Lattice-Boltzmann-Method for Numerical Fluid Mechanics Using the nvidia CUDA Technology E. Riegel, T. Indinger, N.A. Adams Technische Universität
More informationHigh performance Computing and O&G Challenges
High performance Computing and O&G Challenges 2 Seismic exploration challenges High Performance Computing and O&G challenges Worldwide Context Seismic,sub-surface imaging Computing Power needs Accelerating
More informationInteractive Ultrasonic Field Simulation For Non Destructive Testing
Interactive Ultrasonic Field Simulation For Non Destructive Testing Jason Lambertab, Gilles Rougerona, Sylvain Chatillona and Lionel Lacassagneb a CEA LIST, CEA Saclay Digiteo Labs, PC120, 91191 Gif-sur-Yvette
More informationNext Generation Phased Array UT Total Focusing Method (TFM)
Next Generation Phased Array UT Total Focusing Method (TFM) Gavin Dao and Alan Caulder AOS, Cincinnati, OH Dominique Braconnier and John Lorenz TPAC, West Chester, OH Introduction Preview of Advanced Imaging
More informationUT inspection of welding joins: issues and contribution of CIVA simulation tools
UT inspection of welding joins: issues and contribution of CIVA simulation tools GENERAL CONTEXT: ISSUES AND PROBLEMS OF WELD INSPECTION BY ULTRASONIC METHODS 67 th IIW Annual Assembly and International
More informationSimulation in NDT. Online Workshop in in September Software Tools for the Design of Phased Array UT Inspection Techniques
Simulation in NDT Online Workshop in www.ndt.net in September 2010 Software Tools for the Design of Phased Array UT Inspection Techniques Daniel RICHARD, David REILLY, Johan BERLANGER and Guy MAES Zetec,
More informationCME 213 S PRING Eric Darve
CME 213 S PRING 2017 Eric Darve Summary of previous lectures Pthreads: low-level multi-threaded programming OpenMP: simplified interface based on #pragma, adapted to scientific computing OpenMP for and
More informationHigh Performance Computing with Accelerators
High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing
More informationUsing GPUs to compute the multilevel summation of electrostatic forces
Using GPUs to compute the multilevel summation of electrostatic forces David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of
More informationOverview of Project's Achievements
PalDMC Parallelised Data Mining Components Final Presentation ESRIN, 12/01/2012 Overview of Project's Achievements page 1 Project Outline Project's objectives design and implement performance optimised,
More informationGeneral Purpose GPU Computing in Partial Wave Analysis
JLAB at 12 GeV - INT General Purpose GPU Computing in Partial Wave Analysis Hrayr Matevosyan - NTC, Indiana University November 18/2009 COmputationAL Challenges IN PWA Rapid Increase in Available Data
More informationVALIDATION OF THE SIMULATION SOFTWARE CIVA UT IN SEPARATED TRANSMIT/RECEIVE CONFIGURATIONS
VALIDATION OF THE SIMULATION SOFTWARE CIVA UT IN SEPARATED TRANSMIT/RECEIVE CONFIGURATIONS Fabrice FOUCHER 1, Sébastien LONNE 1, Gwénaël TOULLELAN 2, Steve MAHAUT 2, Sylvain CHATILLON 2, Erica SCHUMACHER
More informationINTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp.
INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp. Computer Vision in Mobile Tegra K1 It s time! AGENDA Use cases categories
More informationNumerical Algorithms on Multi-GPU Architectures
Numerical Algorithms on Multi-GPU Architectures Dr.-Ing. Harald Köstler 2 nd International Workshops on Advances in Computational Mechanics Yokohama, Japan 30.3.2010 2 3 Contents Motivation: Applications
More informationLarge scale Imaging on Current Many- Core Platforms
Large scale Imaging on Current Many- Core Platforms SIAM Conf. on Imaging Science 2012 May 20, 2012 Dr. Harald Köstler Chair for System Simulation Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen,
More informationInstruction Set Architecture ( ISA ) 1 / 28
Instruction Set Architecture ( ISA ) 1 / 28 instructions 2 / 28 Instruction Set Architecture Also called (computer) architecture Implementation --> actual realisation of ISA ISA can have multiple implementations
More informationValidation of aspects of BeamTool
Vol.19 No.05 (May 2014) - The e-journal of Nondestructive Testing - ISSN 1435-4934 www.ndt.net/?id=15673 Validation of aspects of BeamTool E. GINZEL 1, M. MATHESON 2, P. CYR 2, B. BROWN 2 1 Materials Research
More informationHigh Performance Video Artifact Detection Enhanced with CUDA. Atul Ravindran Digimetrics
High Performance Video Artifact Detection Enhanced with CUDA Atul Ravindran Digimetrics Goals & Challenges Provide automated QC for digital video files with accuracy and minimum false positives Provide
More informationACCELERATING THE PRODUCTION OF SYNTHETIC SEISMOGRAMS BY A MULTICORE PROCESSOR CLUSTER WITH MULTIPLE GPUS
ACCELERATING THE PRODUCTION OF SYNTHETIC SEISMOGRAMS BY A MULTICORE PROCESSOR CLUSTER WITH MULTIPLE GPUS Ferdinando Alessi Annalisa Massini Roberto Basili INGV Introduction The simulation of wave propagation
More informationRECONSTRUCTION OF PHASED ARRAY TECHNIQUES FROM THE FULL MATRIX CAPTURE DATA SET
More Info at Open Access Database www.ndt.net/?id=18486 RECONSTRUCTION OF PHASED ARRAY TECHNIQUES FROM THE FULL MATRIX CAPTURE DATA SET R.L. Ten Grotenhuis, J.X. Zhang, A Sakuta, A. Hong Ontario Power
More informationJ. Blair Perot. Ali Khajeh-Saeed. Software Engineer CD-adapco. Mechanical Engineering UMASS, Amherst
Ali Khajeh-Saeed Software Engineer CD-adapco J. Blair Perot Mechanical Engineering UMASS, Amherst Supercomputers Optimization Stream Benchmark Stag++ (3D Incompressible Flow Code) Matrix Multiply Function
More informationUsing CUDA to Accelerate Radar Image Processing
Using CUDA to Accelerate Radar Image Processing Aaron Rogan Richard Carande 9/23/2010 Approved for Public Release by the Air Force on 14 Sep 2010, Document Number 88 ABW-10-5006 Company Overview Neva Ridge
More informationOP2 FOR MANY-CORE ARCHITECTURES
OP2 FOR MANY-CORE ARCHITECTURES G.R. Mudalige, M.B. Giles, Oxford e-research Centre, University of Oxford gihan.mudalige@oerc.ox.ac.uk 27 th Jan 2012 1 AGENDA OP2 Current Progress Future work for OP2 EPSRC
More informationEXPERIMENTAL VALIDATION OF AN 8 ELEMENT EMAT PHASED ARRAY PROBE FOR LONGITUDINAL WAVE GENERATION
EXPERIMENTAL VALIDATION OF AN 8 ELEMENT EMAT PHASED ARRAY PROBE FOR LONGITUDINAL WAVE GENERATION QNDE 2014 Florian Le Bourdais and Benoît Marchand CEA LIST, Centre de Saclay F-91191 Gif-sur-Yvette, France
More informationNVIDIA GTX200: TeraFLOPS Visual Computing. August 26, 2008 John Tynefield
NVIDIA GTX200: TeraFLOPS Visual Computing August 26, 2008 John Tynefield 2 Outline Execution Model Architecture Demo 3 Execution Model 4 Software Architecture Applications DX10 OpenGL OpenCL CUDA C Host
More informationADVANCED PHASED ARRAY TECHNOLOGIES
3CNEND- 3ª Conferência Nacional em Ensaios Não Destrutivos ADVANCED PHASED ARRAY TECHNOLOGIES Dr.-Ing. Werner Roye Karl Deutsch Pruef- und Messgeraetebau GmbH + Co KG, Wuppertal, Germany Email: roye@karldeutsch.de
More informationEvaluation and Exploration of Next Generation Systems for Applicability and Performance Volodymyr Kindratenko Guochun Shi
Evaluation and Exploration of Next Generation Systems for Applicability and Performance Volodymyr Kindratenko Guochun Shi National Center for Supercomputing Applications University of Illinois at Urbana-Champaign
More informationUsing GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation
Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation GPU Technology Conference 2012 May 15, 2012 Thomas M. Benson, Daniel P. Campbell, Daniel A. Cook thomas.benson@gtri.gatech.edu
More informationAccelerating Financial Applications on the GPU
Accelerating Financial Applications on the GPU Scott Grauer-Gray Robert Searles William Killian John Cavazos Department of Computer and Information Science University of Delaware Sixth Workshop on General
More informationUltraVision TOUCH 3.8R11
UltraVision TOUCH 3.8R11 (PC and TOPAZ versions) Product Bulletin www.zetec.com Table of Content Table of Content... 2 UltraVision Touch 3.8R11... 3 Purpose of UltraVision Touch 3.8R11... 3 Important information
More informationSimulation platform for NDE
Description Simulation platform for NDE The CIVA software is an expertise platform dedicated to non destructive testing. It is composed of simulation, imaging and analysis modules, enabling the user to
More informationTrends in HPC (hardware complexity and software challenges)
Trends in HPC (hardware complexity and software challenges) Mike Giles Oxford e-research Centre Mathematical Institute MIT seminar March 13th, 2013 Mike Giles (Oxford) HPC Trends March 13th, 2013 1 / 18
More informationAPPLICATION OF ULTRASONIC BEAM MODELING TO PHASED ARRAY
APPLICATION OF ULTRASONIC BEAM MODELING TO PHASED ARRAY TESTING OF COMPLEX GEOMETRY COMPONENTS INTRODUCflON O. Roy, S. Mahaut, M. Serre, Commissariat 11 l'energie Atomique, CEAlCEREM, CE Saclay France
More informationATS-GPU Real Time Signal Processing Software
Transfer A/D data to at high speed Up to 4 GB/s transfer rate for PCIe Gen 3 digitizer boards Supports CUDA compute capability 2.0+ Designed to work with AlazarTech PCI Express waveform digitizers Optional
More informationGPUs and Emerging Architectures
GPUs and Emerging Architectures Mike Giles mike.giles@maths.ox.ac.uk Mathematical Institute, Oxford University e-infrastructure South Consortium Oxford e-research Centre Emerging Architectures p. 1 CPUs
More informationMPEXS benchmark results
MPEXS benchmark results - phase space data - Akinori Kimura 14 February 2017 Aim To validate results of MPEXS with phase space data by comparing with Geant4 results Depth dose and lateral dose distributions
More informationTMTO[dot]ORG: Hardware Comparison 8 x GTX580 vs. 4 x HD6990. Author: Jason R. Davis Site: TMTO[dot]ORG. Table of Contents
TMTO[dot]ORG: Hardware Comparison 8 x GTX580 vs. 4 x HD6990 Author: Jason R. Davis Site: TMTO[dot]ORG Table of Contents Foreword Page 2 Chapter 1 Building: The Machine Page 3 Chapter 2 8 x GTX580 Page
More informationOptimization solutions for the segmented sum algorithmic function
Optimization solutions for the segmented sum algorithmic function ALEXANDRU PÎRJAN Department of Informatics, Statistics and Mathematics Romanian-American University 1B, Expozitiei Blvd., district 1, code
More informationPhased Array inspection system applied to complex geometry Carbon Fibre Reinforced Polymer parts
Phased Array inspection system applied to complex geometry Carbon Fibre Reinforced Polymer parts André Cereja andre.cereja@tecnico.ulisboa.pt Instituto Superior Técnico, Lisboa, Portugal May 2015 Abstract
More informationREDUCING BEAMFORMING CALCULATION TIME WITH GPU ACCELERATED ALGORITHMS
BeBeC-2014-08 REDUCING BEAMFORMING CALCULATION TIME WITH GPU ACCELERATED ALGORITHMS Steffen Schmidt GFaI ev Volmerstraße 3, 12489, Berlin, Germany ABSTRACT Beamforming algorithms make high demands on the
More informationIntroduction to GPGPU and GPU-architectures
Introduction to GPGPU and GPU-architectures Henk Corporaal Gert-Jan van den Braak http://www.es.ele.tue.nl/ Contents 1. What is a GPU 2. Programming a GPU 3. GPU thread scheduling 4. GPU performance bottlenecks
More informationAdaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics
Adaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics H. Y. Schive ( 薛熙于 ) Graduate Institute of Physics, National Taiwan University Leung Center for Cosmology and Particle Astrophysics
More informationHow GPUs can find your next hit: Accelerating virtual screening with OpenCL. Simon Krige
How GPUs can find your next hit: Accelerating virtual screening with OpenCL Simon Krige ACS 2013 Agenda > Background > About blazev10 > What is a GPU? > Heterogeneous computing > OpenCL: a framework for
More informationSummary of CERN Workshop on Future Challenges in Tracking and Trigger Concepts. Abdeslem DJAOUI PPD Rutherford Appleton Laboratory
Summary of CERN Workshop on Future Challenges in Tracking and Trigger Concepts Abdeslem DJAOUI PPD Rutherford Appleton Laboratory Background to Workshop Organised by CERN OpenLab and held in IT Auditorium,
More informationCS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology
CS8803SC Software and Hardware Cooperative Computing GPGPU Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology Why GPU? A quiet revolution and potential build-up Calculation: 367
More informationSAFT-Reconstruction in ultrasonic immersion technique using phased array transducers
SAFT-Reconstruction in ultrasonic immersion technique using phased array transducers J. Kitze, J. Prager, R. Boehm, U. Völz, H.-J. Montag Federal Institute for Materials Research and Testing Berlin, Division
More informationX-TRACT: software for simulation and reconstruction of X-ray phase-contrast CT
X-TRACT: software for simulation and reconstruction of X-ray phase-contrast CT T.E.Gureyev, Ya.I.Nesterets, S.C.Mayo, A.W.Stevenson, D.M.Paganin, G.R.Myers and S.W.Wilkins CSIRO Materials Science and Engineering
More informationGPGPU Applications. for Hydrological and Atmospheric Simulations. and Visualizations on the Web. Ibrahim Demir
GPGPU Applications for Hydrological and Atmospheric Simulations and Visualizations on the Web Ibrahim Demir Big Data We are collecting and generating data on a petabyte scale (1Pb = 1,000 Tb = 1M Gb) Data
More informationphysics on screen Supported platforms and minimum system requirements
physics on screen Supported platforms and minimum system requirements Minimum system requirements for BETA_LM_TOOLS v6.4 or later Linux 32bit 64bit glibc 2.3 glibc 2.3.3 MacOS Mac OS X v10.4 Mac OS X v10.6
More informationPresenting: Comparing the Power and Performance of Intel's SCC to State-of-the-Art CPUs and GPUs
Presenting: Comparing the Power and Performance of Intel's SCC to State-of-the-Art CPUs and GPUs A paper comparing modern architectures Joakim Skarding Christian Chavez Motivation Continue scaling of performance
More informationEfficient AMG on Hybrid GPU Clusters. ScicomP Jiri Kraus, Malte Förster, Thomas Brandes, Thomas Soddemann. Fraunhofer SCAI
Efficient AMG on Hybrid GPU Clusters ScicomP 2012 Jiri Kraus, Malte Förster, Thomas Brandes, Thomas Soddemann Fraunhofer SCAI Illustration: Darin McInnis Motivation Sparse iterative solvers benefit from
More informationAccelerating image registration on GPUs
Accelerating image registration on GPUs Harald Köstler, Sunil Ramgopal Tatavarty SIAM Conference on Imaging Science (IS10) 13.4.2010 Contents Motivation: Image registration with FAIR GPU Programming Combining
More informationGPUfs: Integrating a file system with GPUs
GPUfs: Integrating a file system with GPUs Mark Silberstein (UT Austin/Technion) Bryan Ford (Yale), Idit Keidar (Technion) Emmett Witchel (UT Austin) 1 Building systems with GPUs is hard. Why? 2 Goal of
More informationHigh-Order Finite-Element Earthquake Modeling on very Large Clusters of CPUs or GPUs
High-Order Finite-Element Earthquake Modeling on very Large Clusters of CPUs or GPUs Gordon Erlebacher Department of Scientific Computing Sept. 28, 2012 with Dimitri Komatitsch (Pau,France) David Michea
More informationHashcash Parallelization on GPGPU using OpenCL
Hashcash Parallelization on GPGPU using OpenCL Etienne Servais MINES ParisTech, Centre de Recherche en Informatique April 19, 2011 Table of Contents 1 Hashcash overview The hashcash protocol Stamp format
More informationGeorgia Institute of Technology, August 17, Justin W. L. Wan. Canada Research Chair in Scientific Computing
Real-Time Rigid id 2D-3D Medical Image Registration ti Using RapidMind Multi-Core Platform Georgia Tech/AFRL Workshop on Computational Science Challenge Using Emerging & Massively Parallel Computer Architectures
More informationNVIDIA s Compute Unified Device Architecture (CUDA)
NVIDIA s Compute Unified Device Architecture (CUDA) Mike Bailey mjb@cs.oregonstate.edu Reaching the Promised Land NVIDIA GPUs CUDA Knights Corner Speed Intel CPUs General Programmability 1 History of GPU
More informationNVIDIA s Compute Unified Device Architecture (CUDA)
NVIDIA s Compute Unified Device Architecture (CUDA) Mike Bailey mjb@cs.oregonstate.edu Reaching the Promised Land NVIDIA GPUs CUDA Knights Corner Speed Intel CPUs General Programmability History of GPU
More informationResearch on Correction and Optimization of Post-processing Imaging of Structure with Non-planar Interface Using Full Matrix Data of Ultrasonic Array
19 th World Conference on Non-Destructive Testing 2016 Research on Correction and Optimiation of Post-processing Imaging of Structure with Non-planar Interface Using Full Matrix Data of Ultrasonic Array
More informationad-heap: an Efficient Heap Data Structure for Asymmetric Multicore Processors
ad-heap: an Efficient Heap Data Structure for Asymmetric Multicore Processors Weifeng Liu and Brian Vinter Niels Bohr Institute University of Copenhagen Denmark {weifeng, vinter}@nbi.dk March 1, 2014 Weifeng
More informationPLB-HeC: A Profile-based Load-Balancing Algorithm for Heterogeneous CPU-GPU Clusters
PLB-HeC: A Profile-based Load-Balancing Algorithm for Heterogeneous CPU-GPU Clusters IEEE CLUSTER 2015 Chicago, IL, USA Luis Sant Ana 1, Daniel Cordeiro 2, Raphael Camargo 1 1 Federal University of ABC,
More information