Accelerating GATE simulations
|
|
- Kristina Lucas
- 6 years ago
- Views:
Transcription
1 GATE Simulations of Preclinical andclinical Scans in Emission Tomography, Transmission Tomography and Radiation Therapy Accelerating GATE simulations Parallel computing and GPU GATE Training, INSTN-Saclay, October 2015 Albertine Dubois IMNC CNRS Orsay, France Simon Stute SHFJ CEA Orsay, France Sebastien Jan SHFJ CEA Orsay, France
2 GATE Simulations of Preclinical andclinical Scans in Emission Tomography, Transmission Tomography and Radiation Therapy Running GATE using distributed architectures GATE Training 2014, Saclay, France 2
3 Introduction Parallel computing Simultaneous use of multiple computing ressources to achieve one computational problem Different usual approaches Multi-thread = shared memory (e.g. OpenMP, pthread) Multi-CPU = split memory (e.g. MPI) Intrinsic parallelism = cut the job into smaller jobs Monte Carlo method is most of the time intrinsically parallel 3
4 Monte Carlo parallelism Event splitting Assume events are independent to one another Works for radiotherapy applications Just split the job into pieces according to the number of events to be simulated and generate scripts for the batch queuing system Time splitting When events can interact between one another Works for imaging applications (pile-up, dead times, etc) Just split the job into pieces according to the total length of the simulation and generate scripts for the batch queuing system Finally merge the results from all pieces 4
5 GATE cluster tools GATE provides tools for convenient use with clusters Jobsplitter: to split the main simulation into pieces Filemerger: to merge the root outputs from all pieces Job splitter Batch queuing system Condor, OpenMosix, OpenPBS, Xgrid Job merger 5
6 Example using condor platform Easy to use tools with simple command lines Split the job (pieces are located in a.gate directory) gjs -numberofsplits 4 -clusterplatform condor condorscript condor.scpt macro.mac Submit the job condor_submit macro.submit Merge the results (works only for root files) gjm.gate/macro/macro.split Works well on multi-threaded desktop computers too 6
7 GATE Simulations of Preclinical andclinical Scans in Emission Tomography, Transmission Tomography and Radiation Therapy Running GATE using NVIDIA GPU GATE Training 2014, Saclay, France 7
8 Simulation strategy Using GPU needs specific language transcription Long, tedious, and needs thorough validation Move only high-cost parts of the simulation on GPU Hybrid simulation: GPU + CPU Current GATE strategy Phantom side on GPU One particule per thread CUDA kernels for specific physics and Monte Carlo navigation Detector side on CPU Phase space between phantom and detector Go back to GATE navigation and hits processing 8
9 GATE GPU modules Current GATE GPU modules Only for voxelized phantom (very time consuming) Only for PET and CT applications (validated in Bert et al 2013) GATE GATE-GPU GATE GATE-GPU GATE-GPU X 60 GATE GATE-GPU X 70 GATE 9
10 The PET module Only few commands related to the source change Example: /gate/source/addsource my_source GPUEmisTomo /gate/source/my_source/attachphantomto my_voxelized_phantom /gate/source/my_source/setgpubuffersize /gate/source/my_source/setgpudeviceid 1... Just declare your source as GPUEmisTomo Then attach the source to your voxelized phantom Set the size of phase space buffer (sets when CPU relieves GPU) Choose your GPU device (in case you have several) The rest is unchanged 10
11 The CT module Only few commands to add: Example: /gate/actor/addactor GPUTransTomoActor my_gpuactor /gate/actor/my_gpuactor/attachto my_voxelized_phantom /gate/actor/my_gpuactor/setgpubuffersize /gate/actor/my_gpuactor/setgpudeviceid 1... Just add a dedicated actor that do the link between the CPU and GPU Then attach the actor to your voxelized phantom Set the size of phase space buffer (sets when CPU relieves GPU) Choose your GPU device (in case you have several) The rest is unchanged 11
Customizable and Advanced Software for Tomographic Reconstruction
Customizable and Advanced Software for Tomographic Reconstruction 1 What is CASToR? Open source toolkit for 4D emission (PET/SPECT) and transmission (CT) tomographic reconstruction Focus on generic, modular
More informationThe OpenGATE Collaboration
The OpenGATE Collaboration GATE developments and future releases GATE Users meeting, IEEE MIC 2015, San Diego Sébastien JAN CEA - IMIV sebastien.jan@cea.fr Outline Theranostics modeling Radiobiology Optics
More informationGATE users meeting. Introduction. IEEE MIC 2015, San Diego
GATE users meeting Introduction IEEE MIC 2015, San Diego Irène Buvat IMIV, Inserm CEA CNRS Université Paris Sud Université Paris Saclay, Orsay, France irene.buvat@u-psud.fr Program 12:40 12:48 13:00 Quick
More informationThe new version of the GATE simulation platform
The new version of the GATE simulation platform Thibault Frisson Université de Lyon Léon Bérard anti-cancer center, CREATIS, CNRS On behalf of the OpenGATE Collaboration Outline GATE overview Main features
More informationGATE-RT Applications in Radiation Therapy
GATE-RT Applications in Radiation Therapy GATE Training, Orsay, April 2011 David SARRUT Creatis - CNRS Lyon - France GATE-RT: Radiation Therapy applications Related to dose distribution Radiotherapy: therapy
More informationGPU Debugging Made Easy. David Lecomber CTO, Allinea Software
GPU Debugging Made Easy David Lecomber CTO, Allinea Software david@allinea.com Allinea Software HPC development tools company Leading in HPC software tools market Wide customer base Blue-chip engineering,
More informationSimulations in emission tomography using GATE
Simulations in emission tomography using GATE Irène Buvat buvat@imed.jussieu.fr Laboratory of Functional Imaging, U678 INSERM, Paris, France Outline Emission tomography and need for simulations GATE short
More informationGPU-based high-performance computing for radiotherapy applications
GPU-based high-performance computing for radiotherapy applications Julien Bert, PhD CHRU de Brest LaTIM INSERM UMR1101 Radiotherapy Irradiation of the tumor: Maximum dose to the tumor Healthy surrounding
More informationParallel Programming Languages 1 - OpenMP
some slides are from High-Performance Parallel Scientific Computing, 2008, Purdue University & CSCI-UA.0480-003: Parallel Computing, Spring 2015, New York University Parallel Programming Languages 1 -
More informationSTARTING THE DDT DEBUGGER ON MIO, AUN, & MC2. (Mouse over to the left to see thumbnails of all of the slides)
STARTING THE DDT DEBUGGER ON MIO, AUN, & MC2 (Mouse over to the left to see thumbnails of all of the slides) ALLINEA DDT Allinea DDT is a powerful, easy-to-use graphical debugger capable of debugging a
More informationHPC Middle East. KFUPM HPC Workshop April Mohamed Mekias HPC Solutions Consultant. Agenda
KFUPM HPC Workshop April 29-30 2015 Mohamed Mekias HPC Solutions Consultant Agenda 1 Agenda-Day 1 HPC Overview What is a cluster? Shared v.s. Distributed Parallel v.s. Massively Parallel Interconnects
More informationCentre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules. Singularity overview. Vanessa HAMAR
Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules Singularity overview Vanessa HAMAR Disclaimer } The information in this presentation was compiled from different
More informationMPEXS benchmark results
MPEXS benchmark results - phase space data - Akinori Kimura 14 February 2017 Aim To validate results of MPEXS with phase space data by comparing with Geant4 results Depth dose and lateral dose distributions
More informationgpmc: GPU-Based Monte Carlo Dose Calculation for Proton Radiotherapy Xun Jia 8/7/2013
gpmc: GPU-Based Monte Carlo Dose Calculation for Proton Radiotherapy Xun Jia xunjia@ucsd.edu 8/7/2013 gpmc project Proton therapy dose calculation Pencil beam method Monte Carlo method gpmc project Started
More informationCOMP528: Multi-core and Multi-Processor Computing
COMP528: Multi-core and Multi-Processor Computing Dr Michael K Bane, G14, Computer Science, University of Liverpool m.k.bane@liverpool.ac.uk https://cgi.csc.liv.ac.uk/~mkbane/comp528 2X So far Why and
More informationImage-based Monte Carlo calculations for dosimetry
Image-based Monte Carlo calculations for dosimetry Irène Buvat Imagerie et Modélisation en Neurobiologie et Cancérologie UMR 8165 CNRS Universités Paris 7 et Paris 11 Orsay, France buvat@imnc.in2p3.fr
More informationXRAY Grid TO BE OR NOT TO BE?
XRAY Grid TO BE OR NOT TO BE? 1 I was not always a Grid sceptic! I started off as a grid enthusiast e.g. by insisting that Grid be part of the ESRF Upgrade Program outlined in the Purple Book : In this
More informationREAL-TIME ADAPTIVITY IN HEAD-AND-NECK AND LUNG CANCER RADIOTHERAPY IN A GPU ENVIRONMENT
REAL-TIME ADAPTIVITY IN HEAD-AND-NECK AND LUNG CANCER RADIOTHERAPY IN A GPU ENVIRONMENT Anand P Santhanam Assistant Professor, Department of Radiation Oncology OUTLINE Adaptive radiotherapy for head and
More informationA fast and accurate GPU-based proton transport Monte Carlo simulation for validating proton therapy treatment plans
A fast and accurate GPU-based proton transport Monte Carlo simulation for validating proton therapy treatment plans H. Wan Chan Tseung 1 J. Ma C. Beltran PTCOG 2014 13 June, Shanghai 1 wanchantseung.hok@mayo.edu
More informationParallelism paradigms
Parallelism paradigms Intro part of course in Parallel Image Analysis Elias Rudberg elias.rudberg@it.uu.se March 23, 2011 Outline 1 Parallelization strategies 2 Shared memory 3 Distributed memory 4 Parallelization
More informationX10 specific Optimization of CPU GPU Data transfer with Pinned Memory Management
X10 specific Optimization of CPU GPU Data transfer with Pinned Memory Management Hideyuki Shamoto, Tatsuhiro Chiba, Mikio Takeuchi Tokyo Institute of Technology IBM Research Tokyo Programming for large
More informationBright Cluster Manager Advanced HPC cluster management made easy. Martijn de Vries CTO Bright Computing
Bright Cluster Manager Advanced HPC cluster management made easy Martijn de Vries CTO Bright Computing About Bright Computing Bright Computing 1. Develops and supports Bright Cluster Manager for HPC systems
More informationCUDA and OpenCL Implementations of 3D CT Reconstruction for Biomedical Imaging
CUDA and OpenCL Implementations of 3D CT Reconstruction for Biomedical Imaging Saoni Mukherjee, Nicholas Moore, James Brock and Miriam Leeser September 12, 2012 1 Outline Introduction to CT Scan, 3D reconstruction
More informationHybrid Model Parallel Programs
Hybrid Model Parallel Programs Charlie Peck Intermediate Parallel Programming and Cluster Computing Workshop University of Oklahoma/OSCER, August, 2010 1 Well, How Did We Get Here? Almost all of the clusters
More informationImproving the Productivity of Scalable Application Development with TotalView May 18th, 2010
Improving the Productivity of Scalable Application Development with TotalView May 18th, 2010 Chris Gottbrath Principal Product Manager Rogue Wave Major Product Offerings 2 TotalView Technologies Family
More informationChapter 3 Parallel Software
Chapter 3 Parallel Software Part I. Preliminaries Chapter 1. What Is Parallel Computing? Chapter 2. Parallel Hardware Chapter 3. Parallel Software Chapter 4. Parallel Applications Chapter 5. Supercomputers
More informationOpenACC Course. Office Hour #2 Q&A
OpenACC Course Office Hour #2 Q&A Q1: How many threads does each GPU core have? A: GPU cores execute arithmetic instructions. Each core can execute one single precision floating point instruction per cycle
More informationA dedicated tool for PET scanner simulations using FLUKA
A dedicated tool for PET scanner simulations using FLUKA P. G. Ortega FLUKA meeting June 2013 1 Need for in-vivo treatment monitoring Particles: The good thing is that they stop... Tumour Normal tissue/organ
More informationOptical Modeling of Scintillation Detectors Using GATE
GATE Simulations of Preclinical and Clinical Scans in Emission Tomography, Transmission Tomography and Radiation Therapy Optical Modeling of Scintillation Detectors Using GATE Emilie Roncali Department
More informationS COMPUTING E M C A T G LINUX N GATE. Andrew Robinson
S COMPUTING E M C A T G LINUX N GATE 1. Linux (in a shell) 2. Our Cluster 3. Gateway to GATE 4. The roots of ROOT 5. Pick your tools 6. Resources LINUX Linux (in a shell) A(nother) Computer A Computer
More informationPerformance Evaluation of radionuclide imaging systems
Performance Evaluation of radionuclide imaging systems Nicolas A. Karakatsanis STIR Users meeting IEEE Nuclear Science Symposium and Medical Imaging Conference 2009 Orlando, FL, USA Geant4 Application
More informationParallel computation performances of Serpent and Serpent 2 on KTH Parallel Dator Centrum
KTH ROYAL INSTITUTE OF TECHNOLOGY, SH2704, 9 MAY 2018 1 Parallel computation performances of Serpent and Serpent 2 on KTH Parallel Dator Centrum Belle Andrea, Pourcelot Gregoire Abstract The aim of this
More informationIntroduction to Parallel and Distributed Computing. Linh B. Ngo CPSC 3620
Introduction to Parallel and Distributed Computing Linh B. Ngo CPSC 3620 Overview: What is Parallel Computing To be run using multiple processors A problem is broken into discrete parts that can be solved
More informationHPC with GPU and its applications from Inspur. Haibo Xie, Ph.D
HPC with GPU and its applications from Inspur Haibo Xie, Ph.D xiehb@inspur.com 2 Agenda I. HPC with GPU II. YITIAN solution and application 3 New Moore s Law 4 HPC? HPC stands for High Heterogeneous Performance
More informationANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation
ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation Ray Browell nvidia Technology Theater SC12 1 2012 ANSYS, Inc. nvidia Technology Theater SC12 HPC Revolution Recent
More informationHigh-Performance and Parallel Computing
9 High-Performance and Parallel Computing 9.1 Code optimization To use resources efficiently, the time saved through optimizing code has to be weighed against the human resources required to implement
More informationParallel Programming Models. Parallel Programming Models. Threads Model. Implementations 3/24/2014. Shared Memory Model (without threads)
Parallel Programming Models Parallel Programming Models Shared Memory (without threads) Threads Distributed Memory / Message Passing Data Parallel Hybrid Single Program Multiple Data (SPMD) Multiple Program
More informationHybrid Implementation of 3D Kirchhoff Migration
Hybrid Implementation of 3D Kirchhoff Migration Max Grossman, Mauricio Araya-Polo, Gladys Gonzalez GTC, San Jose March 19, 2013 Agenda 1. Motivation 2. The Problem at Hand 3. Solution Strategy 4. GPU Implementation
More informationAddressing Heterogeneity in Manycore Applications
Addressing Heterogeneity in Manycore Applications RTM Simulation Use Case stephane.bihan@caps-entreprise.com Oil&Gas HPC Workshop Rice University, Houston, March 2008 www.caps-entreprise.com Introduction
More informationGeant4 v9.5. Kernel III. Makoto Asai (SLAC) Geant4 Tutorial Course
Geant4 v9.5 Kernel III Makoto Asai (SLAC) Geant4 Tutorial Course Contents Fast simulation (Shower parameterization) Multi-threading Computing performance Kernel III - M.Asai (SLAC) 2 Fast simulation (shower
More informationECMWF Workshop on High Performance Computing in Meteorology. 3 rd November Dean Stewart
ECMWF Workshop on High Performance Computing in Meteorology 3 rd November 2010 Dean Stewart Agenda Company Overview Rogue Wave Product Overview IMSL Fortran TotalView Debugger Acumem ThreadSpotter 1 Copyright
More informationParallel Programming (1)
Parallel Programming (1) p. 1/?? Parallel Programming (1) Introduction and Scripting Nick Maclaren nmm1@cam.ac.uk February 2014 Parallel Programming (1) p. 2/?? Introduction (1) This is a single three--session
More informationIntroduction to Parallel Programming
Introduction to Parallel Programming January 14, 2015 www.cac.cornell.edu What is Parallel Programming? Theoretically a very simple concept Use more than one processor to complete a task Operationally
More informationDeutscher Wetterdienst
Accelerating Work at DWD Ulrich Schättler Deutscher Wetterdienst Roadmap Porting operational models: revisited Preparations for enabling practical work at DWD My first steps with the COSMO on a GPU First
More informationCUDA. Matthew Joyner, Jeremy Williams
CUDA Matthew Joyner, Jeremy Williams Agenda What is CUDA? CUDA GPU Architecture CPU/GPU Communication Coding in CUDA Use cases of CUDA Comparison to OpenCL What is CUDA? What is CUDA? CUDA is a parallel
More informationGraham vs legacy systems
New User Seminar Graham vs legacy systems This webinar only covers topics pertaining to graham. For the introduction to our legacy systems (Orca etc.), please check the following recorded webinar: SHARCNet
More informationWMS overview and Proposal for Job Status
WMS overview and Proposal for Job Status Author: V.Garonne, I.Stokes-Rees, A. Tsaregorodtsev. Centre de physiques des Particules de Marseille Date: 15/12/2003 Abstract In this paper, we describe briefly
More informationA Fast GPU-Based Approach to Branchless Distance-Driven Projection and Back-Projection in Cone Beam CT
A Fast GPU-Based Approach to Branchless Distance-Driven Projection and Back-Projection in Cone Beam CT Daniel Schlifske ab and Henry Medeiros a a Marquette University, 1250 W Wisconsin Ave, Milwaukee,
More informationOptimization of Cone Beam CT Reconstruction Algorithm Based on CUDA
Sensors & Transducers 2013 by IFSA http://www.sensorsportal.com Optimization of Cone Beam CT Reconstruction Algorithm Based on CUDA 1 Wang LI-Fang, 2 Zhang Shu-Hai 1 School of Electronics and Computer
More informationSampling Using GPU Accelerated Sparse Hierarchical Models
Sampling Using GPU Accelerated Sparse Hierarchical Models Miroslav Stoyanov Oak Ridge National Laboratory supported by Exascale Computing Project (ECP) exascaleproject.org April 9, 28 Miroslav Stoyanov
More informationModern Processor Architectures. L25: Modern Compiler Design
Modern Processor Architectures L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant minimising the number of instructions
More informationarxiv: v1 [hep-lat] 12 Nov 2013
Lattice Simulations using OpenACC compilers arxiv:13112719v1 [hep-lat] 12 Nov 2013 Indian Association for the Cultivation of Science, Kolkata E-mail: tppm@iacsresin OpenACC compilers allow one to use Graphics
More informationPROGRAMOVÁNÍ V C++ CVIČENÍ. Michal Brabec
PROGRAMOVÁNÍ V C++ CVIČENÍ Michal Brabec PARALLELISM CATEGORIES CPU? SSE Multiprocessor SIMT - GPU 2 / 17 PARALLELISM V C++ Weak support in the language itself, powerful libraries Many different parallelization
More informationUsing GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation
Using GPUs to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation GPU Technology Conference 2012 May 15, 2012 Thomas M. Benson, Daniel P. Campbell, Daniel A. Cook thomas.benson@gtri.gatech.edu
More informationhigh performance medical reconstruction using stream programming paradigms
high performance medical reconstruction using stream programming paradigms This Paper describes the implementation and results of CT reconstruction using Filtered Back Projection on various stream programming
More informationPedraforca: a First ARM + GPU Cluster for HPC
www.bsc.es Pedraforca: a First ARM + GPU Cluster for HPC Nikola Puzovic, Alex Ramirez We ve hit the power wall ALL computers are limited by power consumption Energy-efficient approaches Multi-core Fujitsu
More informationNVJPEG. DA _v0.2.0 October nvjpeg Libary Guide
NVJPEG DA-06762-001_v0.2.0 October 2018 Libary Guide TABLE OF CONTENTS Chapter 1. Introduction...1 Chapter 2. Using the Library... 3 2.1. Single Image Decoding... 3 2.3. Batched Image Decoding... 6 2.4.
More informationMaking Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010
Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010 Windows HPC Server 2008 R2 Windows HPC Server 2008 R2 makes supercomputing
More informationTOOLS FOR IMPROVING CROSS-PLATFORM SOFTWARE DEVELOPMENT
TOOLS FOR IMPROVING CROSS-PLATFORM SOFTWARE DEVELOPMENT Eric Kelmelis 28 March 2018 OVERVIEW BACKGROUND Evolution of processing hardware CROSS-PLATFORM KERNEL DEVELOPMENT Write once, target multiple hardware
More informationCOMP 605: Introduction to Parallel Computing Quiz 4: Module 4 Quiz: Comparing CUDA and MPI Matrix-Matrix Multiplication
COMP 605: Introduction to Parallel Computing Quiz 4: Module 4 Quiz: Comparing CUDA and MPI Matrix-Matrix Multiplication Mary Thomas Department of Computer Science Computational Science Research Center
More informationDesign and performance characteristics of a Cone Beam CT system for Leksell Gamma Knife Icon
Design and performance characteristics of a Cone Beam CT system for Leksell Gamma Knife Icon WHITE PAPER Introduction Introducing an image guidance system based on Cone Beam CT (CBCT) and a mask immobilization
More informationDistributing Computation to Large GPU Clusters
Distributing Computation to Large GPU Clusters What is this about? DiCE: Software library for writing applications scaling to many GPUs and CPUs in a cluster What is this about? DiCE: Software library
More informationThe Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Alan Humphrey, Qingyu Meng, Martin Berzins Scientific Computing and Imaging Institute & University of Utah I. Uintah Overview
More informationMulticore Computer, GPU 및 Cluster 환경에서의 MATLAB Parallel Computing 기능
Multicore Computer, GPU 및 Cluster 환경에서의 MATLAB Parallel Computing 기능 성호현 MathWorks Korea 2012 The MathWorks, Inc. 1 A Question to Consider Do you want to speed up your algorithms? If so Do you have a multi-core
More informationParallel Programming Libraries and implementations
Parallel Programming Libraries and implementations Partners Funding Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License.
More informationParallel Computing with MATLAB
Parallel Computing with MATLAB CSCI 4850/5850 High-Performance Computing Spring 2018 Tae-Hyuk (Ted) Ahn Department of Computer Science Program of Bioinformatics and Computational Biology Saint Louis University
More informationTable of Contents. Table of Contents Job Manager for local execution of ATK scripts Serial execution Threading MPI parallelization Machine Manager
Table of Contents Table of Contents Job Manager for local execution of ATK scripts Serial execution Threading MPI parallelization Machine Manager 1 2 2 8 10 12 QuantumWise TRY IT! COMPANY CONTACT Docs»
More informationACCELERATING THE PRODUCTION OF SYNTHETIC SEISMOGRAMS BY A MULTICORE PROCESSOR CLUSTER WITH MULTIPLE GPUS
ACCELERATING THE PRODUCTION OF SYNTHETIC SEISMOGRAMS BY A MULTICORE PROCESSOR CLUSTER WITH MULTIPLE GPUS Ferdinando Alessi Annalisa Massini Roberto Basili INGV Introduction The simulation of wave propagation
More informationMicrosoft Windows HPC Server 2008 R2 for the Cluster Developer
50291B - Version: 1 02 May 2018 Microsoft Windows HPC Server 2008 R2 for the Cluster Developer Microsoft Windows HPC Server 2008 R2 for the Cluster Developer 50291B - Version: 1 5 days Course Description:
More informationThis is an author-deposited version published in : Eprints ID : 15075
Open Archive TOULOUSE Archive Ouverte (OATAO) OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. This is an author-deposited
More informationRHRK-Seminar. High Performance Computing with the Cluster Elwetritsch - II. Course instructor : Dr. Josef Schüle, RHRK
RHRK-Seminar High Performance Computing with the Cluster Elwetritsch - II Course instructor : Dr. Josef Schüle, RHRK Overview Course I Login to cluster SSH RDP / NX Desktop Environments GNOME (default)
More informationDVCS software and analysis tutorial
DVCS software and analysis tutorial Carlos Muñoz Camacho Institut de Physique Nucléaire, Orsay, IN2P3/CNRS DVCS Collaboration Meeting January 16 17, 2017 Carlos Muñoz Camacho (IPNO) DVCS Software Jan 16,
More informationPerformance and Accuracy of Lattice-Boltzmann Kernels on Multi- and Manycore Architectures
Performance and Accuracy of Lattice-Boltzmann Kernels on Multi- and Manycore Architectures Dirk Ribbrock, Markus Geveler, Dominik Göddeke, Stefan Turek Angewandte Mathematik, Technische Universität Dortmund
More informationNVJPEG. DA _v0.1.4 August nvjpeg Libary Guide
NVJPEG DA-06762-001_v0.1.4 August 2018 Libary Guide TABLE OF CONTENTS Chapter 1. Introduction...1 Chapter 2. Using the Library... 3 2.1. Single Image Decoding... 3 2.3. Batched Image Decoding... 6 2.4.
More informationPROOF-Condor integration for ATLAS
PROOF-Condor integration for ATLAS G. Ganis,, J. Iwaszkiewicz, F. Rademakers CERN / PH-SFT M. Livny, B. Mellado, Neng Xu,, Sau Lan Wu University Of Wisconsin Condor Week, Madison, 29 Apr 2 May 2008 Outline
More informationAddressing the Increasing Challenges of Debugging on Accelerated HPC Systems. Ed Hinkel Senior Sales Engineer
Addressing the Increasing Challenges of Debugging on Accelerated HPC Systems Ed Hinkel Senior Sales Engineer Agenda Overview - Rogue Wave & TotalView GPU Debugging with TotalView Nvdia CUDA Intel Phi 2
More informationFaster Simulations of the National Airspace System
Faster Simulations of the National Airspace System PK Menon Monish Tandale Sandy Wiraatmadja Optimal Synthesis Inc. Joseph Rios NASA Ames Research Center NVIDIA GPU Technology Conference 2010, San Jose,
More informationHigh Performance Ocean Modeling using CUDA
using CUDA Chris Lupo Computer Science Cal Poly Slide 1 Acknowledgements Dr. Paul Choboter Jason Mak Ian Panzer Spencer Lines Sagiv Sheelo Jake Gardner Slide 2 Background Joint research with Dr. Paul Choboter
More informationA GPU-based Approximate SVD Algorithm Blake Foster, Sridhar Mahadevan, Rui Wang
A GPU-based Approximate SVD Algorithm Blake Foster, Sridhar Mahadevan, Rui Wang University of Massachusetts Amherst Introduction Singular Value Decomposition (SVD) A: m n matrix (m n) U, V: orthogonal
More informationSuperMike-II Launch Workshop. System Overview and Allocations
: System Overview and Allocations Dr Jim Lupo CCT Computational Enablement jalupo@cct.lsu.edu SuperMike-II: Serious Heterogeneous Computing Power System Hardware SuperMike provides 442 nodes, 221TB of
More informationSoftware and Performance Engineering for numerical codes on GPU clusters
Software and Performance Engineering for numerical codes on GPU clusters H. Köstler International Workshop of GPU Solutions to Multiscale Problems in Science and Engineering Harbin, China 28.7.2010 2 3
More informationCIVA Computed Tomography Modeling
CIVA Computed Tomography Modeling R. FERNANDEZ, EXTENDE, France S. LEGOUPIL, M. COSTIN, D. TISSEUR, A. LEVEQUE, CEA-LIST, France page 1 Summary Context From CIVA RT to CIVA CT Reconstruction Methods Applications
More informationIntroducing Computer-Assisted Surgery into combined PET/CT image based Biopsy
Introducing Computer-Assisted Surgery into combined PET/CT image based Biopsy Santos TO(1), Weitzel T(2), Klaeser B(2), Reyes M(1), Weber S(1) 1 - Artorg Center, University of Bern, Bern, Switzerland 2
More informationarxiv: v1 [physics.ins-det] 11 Jul 2015
GPGPU for track finding in High Energy Physics arxiv:7.374v [physics.ins-det] Jul 5 L Rinaldi, M Belgiovine, R Di Sipio, A Gabrielli, M Negrini, F Semeria, A Sidoti, S A Tupputi 3, M Villa Bologna University
More informationGPU-based Fast Cone Beam CT Reconstruction from Undersampled and Noisy Projection Data via Total Variation
GPU-based Fast Cone Beam CT Reconstruction from Undersampled and Noisy Projection Data via Total Variation 5 10 15 20 25 30 35 Xun Jia Department of Radiation Oncology, University of California San Diego,
More informationParallelization of Tau-Leap Coarse-Grained Monte Carlo Simulations on GPUs
Parallelization of Tau-Leap Coarse-Grained Monte Carlo Simulations on GPUs Lifan Xu, Michela Taufer, Stuart Collins, Dionisios G. Vlachos Global Computing Lab University of Delaware Multiscale Modeling:
More informationBatch Systems & Parallel Application Launchers Running your jobs on an HPC machine
Batch Systems & Parallel Application Launchers Running your jobs on an HPC machine Partners Funding Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike
More informationCPU-GPU Heterogeneous Computing
CPU-GPU Heterogeneous Computing Advanced Seminar "Computer Engineering Winter-Term 2015/16 Steffen Lammel 1 Content Introduction Motivation Characteristics of CPUs and GPUs Heterogeneous Computing Systems
More informationParallel and Distributed Computing with MATLAB The MathWorks, Inc. 1
Parallel and Distributed Computing with MATLAB 2018 The MathWorks, Inc. 1 Practical Application of Parallel Computing Why parallel computing? Need faster insight on more complex problems with larger datasets
More informationGPU ACCELERATED TOTAL FOCUSING METHOD IN CIVA
OPARUS GPU ACCELERATED TOTAL FOCUSING METHOD IN CIVA Authors: Gilles ROUGERON, Jason LAMBERT, Ekaterina IAKOVLEVA, L. LACASSAGNE Presenter: Nicolas DOMINGUEZ QNDE 2013 Baltimore, Md, USA, 24/07/2013 CEA
More informationCS 470 Spring Other Architectures. Mike Lam, Professor. (with an aside on linear algebra)
CS 470 Spring 2016 Mike Lam, Professor Other Architectures (with an aside on linear algebra) Parallel Systems Shared memory (uniform global address space) Primary story: make faster computers Programming
More informationSupercomputing resources at the IAC
Supercomputing resources at the IAC Ángel de Vicente angelv@iac.es SIE de Investigación y Enseñanza http://www.iac.es/sieinvens/sinfin/ Burros (Workstations with plenty of RAM) esel User room, 4GB, 420GB,
More informationClustering. Research and Teaching Unit
Clustering Research and Teaching Unit Disclaimer...though it cannot hope to be useful or informative on all matters, it does at least make the reassuring claim, that where it is inaccurate it is at least
More informationHeadline in Arial Bold 30pt. Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008
Headline in Arial Bold 30pt Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008 Agenda Visualisation Today User Trends Technology Trends Grid Viz Nodes Software Ecosystem
More informationOur Workshop Environment
Our Workshop Environment John Urbanic Parallel Computing Scientist Pittsburgh Supercomputing Center Copyright 2015 Our Environment Today Your laptops or workstations: only used for portal access Blue Waters
More informationMERCED CLUSTER BASICS Multi-Environment Research Computer for Exploration and Discovery A Centerpiece for Computational Science at UC Merced
MERCED CLUSTER BASICS Multi-Environment Research Computer for Exploration and Discovery A Centerpiece for Computational Science at UC Merced Sarvani Chadalapaka HPC Administrator University of California
More informationKohinoor queuing document
List of SGE Commands: qsub : Submit a job to SGE Kohinoor queuing document qstat : Determine the status of a job qdel : Delete a job qhost : Display Node information Some useful commands $qstat f -- Specifies
More informationModern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design
Modern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant
More informationradiotherapy Andrew Godley, Ergun Ahunbay, Cheng Peng, and X. Allen Li NCAAPM Spring Meeting 2010 Madison, WI
GPU-Accelerated autosegmentation for adaptive radiotherapy Andrew Godley, Ergun Ahunbay, Cheng Peng, and X. Allen Li agodley@mcw.edu NCAAPM Spring Meeting 2010 Madison, WI Overview Motivation Adaptive
More informationOp#miza#on of CUDA- based Monte Carlo Simula#on for Radia#on Therapy. GTC 2014 N. Henderson & K. Murakami
Op#miza#on of CUDA- based Monte Carlo Simula#on for Radia#on Therapy GTC 2014 N. Henderson & K. Murakami The collabora#on Geant4 @ Special thanks to the CUDA Center of Excellence Program Makoto Asai, SLAC
More information