Early Experiences Porting the NAMD and VMD Molecular Simulation and Analysis Software to GPU-Accelerated OpenPOWER Platforms

Size: px
Start display at page:

Download "Early Experiences Porting the NAMD and VMD Molecular Simulation and Analysis Software to GPU-Accelerated OpenPOWER Platforms"

Transcription

1 Early Experiences Porting the NAMD and VMD Molecular Simulation and Analysis Software to GPU-Accelerated OpenPOWER Platforms John E. Stone, Antti-Pekka Hynninen, James C. Phillips, Klaus Schulten Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Center for Accelerated Application Readiness, Oak Ridge National Laboratory International Workshop on OpenPOWER for HPC Frankfurt, Germany, June 23, 2016

2 Goal: A Computational Microscope Study the molecular machines in living cells Ribosome: target for antibiotics Poliovirus

3 Number of atoms Computational Biology s Insatiable Demand for Processing Power HIV capsid Ribosome 10 6 ATP Synthase STMV 10 5 Lysozyme ApoA

4 Our Interest in GPU-Accelerated OpenPOWER Platforms OpenPOWER roadmap includes: NVLink interconnect to GPUs, 5x-12x PCIe 3.0 perf Support for a variety of advanced DRAM and persistent memory systems Performance of NAMD and VMD on current petascale systems such as Blue Waters and Titan benefits greatly from GPU accelerators DOE Summit and Sierra systems based on Power9 + Volta in late 2017

5 Porting and Testing NAMD and VMD on Crest Power8 systems at ORNL CAAR Crest test systems at the ORNL Center for Accelerated Application Readiness IBM S822L nodes: Dual 10-core IBM POWER8 CPUs, support up to 8-way SMT per physical CPU core 256GB RAM NUMA arch. w/ 4 NUMA-nodes, 384GB/sec peak mem. bandwidth 4x NVIDIA Tesla K40m GPUs, w/ PCIe 3.0 interconnect Testbed to prepare for Summit ORNL Summit

6 CDIMMs P8 CPU IBM S822L CDIMMs SMP A Bus SMP X Bus SMP A Bus P8 CPU PHB PHB PHB PCIe 3.0 x16 PCIe 3.0 x16 PCIe 3.0 x8 GPU GPU IB NIC CDIMMs P8 CPU SMP A Bus SMP X Bus SMP A Bus PHB PHB PHB PCIe 3.0 x8 PCIe 3.0 x16 PCIe 3.0 x16 IB NIC GPU GPU CDIMMs P8 CPU

7

8 NAMD Performance Improvements and OpenPOWER Evaluation For Blue Waters and Titan, CUDA kernels for nonbonded force calculation provide the greatest fraction of GPU speedup New kernels developed for explicit solvent and implicit solvent non-bonded force calculation Performance of new kernels evaluated on POWER8+K40m (Crest) vs. Cray XK7 (BW, Titan) NAMD performance is contingent on many factors, so much work still remains to be done

9 NAMD Non-bonded Force Optimizations for the GPU Use warp-synchronous programming to exchange data between GPU threads using registers rather than shared memory Improve average work efficiency with improved cutoff test tile organization and by exploiting Newton s third law

10 Explicit Solvent Speedup

11 Implicit Solvent (GBIS) Speedup

12 Comparison of Crest vs. roughly comparable Titan nodes Evaluated performance of new NAMD kernels and overall simulation runtime Single node of Crest vs. four nodes of Titan: Crest node: dual POWER8 DCMs and four Tesla K40m GPUs Titan node: single AMD Opteron 6276 and one Tesla K20X Relative performance shown for Crest normalized by Titan Tesla K40m should run ~7% faster than Tesla K20X

13 POWER8+K40m vs. Cray XK7 (1-node Crest vs. 4-nodes Titan)

14 VMD OpenPOWER Performance Evaluation VMD performance characteristics are not dominated by any one kernel or algorithm VMD makes extensive use of GPUs, but there are still many routines that remain CPU-only code paths at present We evaluated performance of several visualization and analysis algorithms with differing characteristics to obtain a broad view of OpenPOWER s performance traits

15 Compiler Evaluation Many compiler choices will be available for OpenPOWER platforms We did some VMD microbenchmark comparisons between GCC and XLC to evaluate their ability to vectorize simple loops etc. IBM XLC was generally more performant, ranging from 7% to 50% faster than GCC One spectacularly bad case for GCC generated VSX vector code that ran 12x slower than XLC, and 11x slower than GCC generating scalar code All subsequent tests were performed with XLC

16 Tachyon Ray Tracing Performance VMD supports high performance ray tracing with built-in RT engines Tachyon: multi-platform hardware-neutral OptiX: NVIDIA GPUs, not available on POWER yet Tachyon characteristics: No hand-coded or other explicit vectorization Construction and traversal of linked lists, spatial acceleration data structures, lots of scalar floating point arithmetic Potential memory latency hiding benefit from SMT

17

18

19 Detailed Tachyon Double-Precision Performance w/ Varying SMT

20 GPU-Accelerated MDFF Cross Correlation GPUs enable laptops and desktop workstations to handle tasks that would have previously required a cluster, or a very long wait GPU-accelerated petascale supercomputers enable analyses were previously impractical, allowing detailed study of very large structures such as viruses GPU-accelerated MDFF Cross Correlation Timeline Regions with poor fit Regions with good fit

21

22

23 MDFF Cross Correlation Hand-coded POWER8 VSX yielded 20% performance gain vs. XLC vectorization Hand-coded x86 SSE yielded only 12% performance gain vs. ICC vectorization POWER8 gained nothing from SMT w/ hand-coded VSX

24 Hand-Coded VSX Instructions. vector float xdelta = *(( vector float *) &sxdelta[0]); // aligned load vector float xdist = vec_splats(sxdist); xdist = vec_add(xdist, xdelta); vector float ydist = vec_splats(sydist); vector float zdist = vec_splats(szdist); vector float xdist2 = vec_mul(xdist, xdist); vector float ydist2 = vec_mul(ydist, ydist); vector float zdist2 = vec_mul(zdist, zdist); vector float dist2 = vec_splats(yzdist2); dist2 = vec_add(dist2, xdist2);.

25 Molecular Orbital Performance Hand-coded POWER8 VSX yielded a speedup of ~2x for MO kernel We saw that SMT benefits scalar C++ by up to 1.85x, but not the hand-coded VSX versions of the same loops

26 Limitations/Issues Encountered No support for GPU peer-to-peer DMAs over PCIe (yet?) POWER CPU core/thread indexing schemes are the transpose of what is done on x86 platforms NUMA thread mapping on POWER8 is complex due to variable SMT modes, dynamic CPU states, etc. No API for querying SMT modes! Code to query SMT state is a few hundred lines, reading /proc /sys etc. as implemented in IBM processor management utilities HPC users need clean APIs that make NUMA pinning and thread indexing much simpler

27 Future Work Continued tuning of NAMD kernels, identification of OpenPOWER-specific overheads for GPU-accelerated kernels VMD molecular dynamics trajectory clustering analysis Preliminary result: POWER8 is ~2.5x faster than dual Xeon 2687W-v3 due to much greater peak memory bandwidth Evaluate OpenACC on POWER Support for GPU hardware OpenGL acceleration, EGL, OptiX GPU ray tracing, and NVENC GPU hardware video encoding

28 Acknowledgements Theoretical and Computational Biophysics Group, University of Illinois at Urbana-Champaign Center for Accelerated Application Readiness, Oak Ridge National Laboratory Funding: DOE INCITE, ORNL Titan: DE-AC05-00OR22725 NSF Blue Waters: NSF OCI , PRAC The Computational Microscope, ACI , ACI NIH support: 9P41GM104601, 5R01GM

29

30 Related Publications Research/gpu/ Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering. John E. Stone, William R. Sherman, and Klaus Schulten.High Performance Data Analysis and Visualization Workshop, IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW), (In-press) High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL. John E. Stone, Peter Messmer, Robert Sisneros, and Klaus Schulten.High Performance Data Analysis and Visualization Workshop, IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW), (In-press) Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads. John E. Stone, Michael J. Hallock, James C. Phillips, Joseph R. Peterson, Zaida Luthey-Schulten, and Klaus Schulten.25th International Heterogeneity in Computing Workshop, IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW), (In-press) Atomic Detail Visualization of Photosynthetic Membranes with GPU-Accelerated Ray Tracing. J. E. Stone, M. Sener, K. L. Vandivort, A. Barragan, A. Singharoy, I. Teo, J. V. Ribeiro, B. Isralewitz, B. Liu, B.-C. Goh, J. C. Phillips, C. MacGregor-Chatwin, M. P. Johnson, L. F. Kourkoutis, C. Neil Hunter, and K. Schulten. J. Parallel Computing, (In-press) Chemical Visualization of Human Pathogens: the Retroviral Capsids. Juan R. Perilla, Boon Chong Goh, John E. Stone, and Klaus SchultenSC'15 Visualization and Data Analytics Showcase, 2015.

31 Related Publications Research/gpu/ Visualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail. M. Sener, J. E. Stone, A. Barragan, A. Singharoy, I. Teo, K. L. Vandivort, B. Isralewitz, B. Liu, B. Goh, J. C. Phillips, L. F. Kourkoutis, C. N. Hunter, and K. Schulten. SC'14 Visualization and Data Analytics Showcase, ***Winner of the SC'14 Visualization and Data Analytics Showcase Runtime and Architecture Support for Efficient Data Exchange in Multi-Accelerator Applications. J. Cabezas, I. Gelado, J. E. Stone, N. Navarro, D. B. Kirk, and W. Hwu. IEEE Transactions on Parallel and Distributed Systems, (In press) Unlocking the Full Potential of the Cray XK7 Accelerator. M. D. Klein and J. E. Stone. Cray Users Group, Lugano Switzerland, May GPU-Accelerated Analysis and Visualization of Large Structures Solved by Molecular Dynamics Flexible Fitting. J. E. Stone, R. McGreevy, B. Isralewitz, and K. Schulten. Faraday Discussions, 169: , Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-gpu workstations. M. J. Hallock, J. E. Stone, E. Roberts, C. Fry, and Z. Luthey-Schulten. Journal of Parallel Computing, 40:86-99, 2014.

32 Related Publications Research/gpu/ GPU-Accelerated Molecular Visualization on Petascale Supercomputing Platforms. J. Stone, K. L. Vandivort, and K. Schulten. UltraVis'13: Proceedings of the 8th International Workshop on Ultrascale Visualization, pp. 6:1-6:8, Early Experiences Scaling VMD Molecular Visualization and Analysis Jobs on Blue Waters. J. Stone, B. Isralewitz, and K. Schulten. In proceedings, Extreme Scaling Workshop, Lattice Microbes: High performance stochastic simulation method for the reaction diffusion master equation. E. Roberts, J. Stone, and Z. Luthey Schulten. J. Computational Chemistry 34 (3), , Fast Visualization of Gaussian Density Surfaces for Molecular Dynamics and Particle System Trajectories. M. Krone, J. Stone, T. Ertl, and K. Schulten. EuroVis Short Papers, pp , Immersive Out-of-Core Visualization of Large-Size and Long-Timescale Molecular Dynamics Trajectories. J. Stone, K. L. Vandivort, and K. Schulten. G. Bebis et al. (Eds.): 7th International Symposium on Visual Computing (ISVC 2011), LNCS 6939, pp. 1-12, Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units Radial Distribution Functions. B. Levine, J. Stone, and A. Kohlmeyer. J. Comp. Physics, 230(9): , 2011.

33 Related Publications Research/gpu/ Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters. J. Enos, C. Steffen, J. Fullop, M. Showerman, G. Shi, K. Esler, V. Kindratenko, J. Stone, J Phillips. International Conference on Green Computing, pp , GPU-accelerated molecular modeling coming of age. J. Stone, D. Hardy, I. Ufimtsev, K. Schulten. J. Molecular Graphics and Modeling, 29: , OpenCL: A Parallel Programming Standard for Heterogeneous Computing. J. Stone, D. Gohara, G. Shi. Computing in Science and Engineering, 12(3):66-73, An Asymmetric Distributed Shared Memory Model for Heterogeneous Computing Systems. I. Gelado, J. Stone, J. Cabezas, S. Patel, N. Navarro, W. Hwu. ASPLOS 10: Proceedings of the 15 th International Conference on Architectural Support for Programming Languages and Operating Systems, pp , 2010.

34 Related Publications Research/gpu/ GPU Clusters for High Performance Computing. V. Kindratenko, J. Enos, G. Shi, M. Showerman, G. Arnold, J. Stone, J. Phillips, W. Hwu. Workshop on Parallel Programming on Accelerator Clusters (PPAC), In Proceedings IEEE Cluster 2009, pp. 1-8, Aug Long time-scale simulations of in vivo diffusion using GPU hardware. E. Roberts, J. Stone, L. Sepulveda, W. Hwu, Z. Luthey-Schulten. In IPDPS 09: Proceedings of the 2009 IEEE International Symposium on Parallel & Distributed Computing, pp. 1-8, High Performance Computation and Interactive Display of Molecular Orbitals on GPUs and Multi-core CPUs. J. Stone, J. Saam, D. Hardy, K. Vandivort, W. Hwu, K. Schulten, 2nd Workshop on General-Purpose Computation on Graphics Pricessing Units (GPGPU-2), ACM International Conference Proceeding Series, volume 383, pp. 9-18, Probing Biomolecular Machines with Graphics Processors. J. Phillips, J. Stone. Communications of the ACM, 52(10):34-41, Multilevel summation of electrostatic potentials using graphics processing units. D. Hardy, J. Stone, K. Schulten. J. Parallel Computing, 35: , 2009.

35 Related Publications Research/gpu/ Adapting a message-driven parallel application to GPU-accelerated clusters. J. Phillips, J. Stone, K. Schulten. Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, IEEE Press, GPU acceleration of cutoff pair potentials for molecular modeling applications. C. Rodrigues, D. Hardy, J. Stone, K. Schulten, and W. Hwu. Proceedings of the 2008 Conference On Computing Frontiers, pp , GPU computing. J. Owens, M. Houston, D. Luebke, S. Green, J. Stone, J. Phillips. Proceedings of the IEEE, 96: , Accelerating molecular modeling applications with graphics processors. J. Stone, J. Phillips, P. Freddolino, D. Hardy, L. Trabuco, K. Schulten. J. Comp. Chem., 28: , Continuous fluorescence microphotolysis and correlation spectroscopy. A. Arkhipov, J. Hüve, M. Kahms, R. Peters, K. Schulten. Biophysical Journal, 93: , 2007.

S5386 Publication-Quality Ray Tracing of Molecular Graphics with OptiX

S5386 Publication-Quality Ray Tracing of Molecular Graphics with OptiX S5386 Publication-Quality Ray Tracing of Molecular Graphics with OptiX John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

In-Situ Visualization and Analysis of Petascale Molecular Dynamics Simulations with VMD

In-Situ Visualization and Analysis of Petascale Molecular Dynamics Simulations with VMD In-Situ Visualization and Analysis of Petascale Molecular Dynamics Simulations with VMD John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

GPU-Accelerated Visualization and Analysis of Petascale Molecular Dynamics Simulations

GPU-Accelerated Visualization and Analysis of Petascale Molecular Dynamics Simulations GPU-Accelerated Visualization and Analysis of Petascale Molecular Dynamics Simulations John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

S6261 VMD+OptiX: Streaming Interactive Ray Tracing from Remote GPU Clusters to Your VR Headset

S6261 VMD+OptiX: Streaming Interactive Ray Tracing from Remote GPU Clusters to Your VR Headset NIH BTRC for Macromolecular Modeling and Bioinformatics S6261 VMD+OptiX: Streaming Interactive Ray Tracing from Remote GPU Clusters to Your VR Headset John E. Stone Theoretical and Computational Biophysics

More information

Harnessing GPUs to Probe Biomolecular Machines at Atomic Detail

Harnessing GPUs to Probe Biomolecular Machines at Atomic Detail Harnessing GPUs to Probe Biomolecular Machines at Atomic Detail John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

GPU-Accelerated Analysis of Large Biomolecular Complexes

GPU-Accelerated Analysis of Large Biomolecular Complexes GPU-Accelerated Analysis of Large Biomolecular Complexes John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL

High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL High Performance Molecular Visualization: In-Situ and Parallel Rendering with EGL John E. Stone, Peter Messmer, Robert Sisneros, Klaus Schulten Theoretical and Computational Biophysics Group Beckman Institute

More information

VMD: Immersive Molecular Visualization and Interactive Ray Tracing for Domes, Panoramic Theaters, and Head Mounted Displays

VMD: Immersive Molecular Visualization and Interactive Ray Tracing for Domes, Panoramic Theaters, and Head Mounted Displays VMD: Immersive Molecular Visualization and Interactive Ray Tracing for Domes, Panoramic Theaters, and Head Mounted Displays John E. Stone Theoretical and Computational Biophysics Group Beckman Institute

More information

Interactive Supercomputing for State-of-the-art Biomolecular Simulation

Interactive Supercomputing for State-of-the-art Biomolecular Simulation Interactive Supercomputing for State-of-the-art Biomolecular Simulation John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of

More information

Faster, Cheaper, Better: Biomolecular Simulation with NAMD, VMD, and CUDA

Faster, Cheaper, Better: Biomolecular Simulation with NAMD, VMD, and CUDA Faster, Cheaper, Better: Biomolecular Simulation with NAMD, VMD, and CUDA John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

Visualization and Analysis of Petascale Molecular Dynamics Simulations

Visualization and Analysis of Petascale Molecular Dynamics Simulations Visualization and Analysis of Petascale Molecular Dynamics Simulations John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

Using Accelerator Directives to Adapt Science Applications for State-of-the-Art HPC Architectures

Using Accelerator Directives to Adapt Science Applications for State-of-the-Art HPC Architectures Using Accelerator Directives to Adapt Science Applications for State-of-the-Art HPC Architectures John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and

More information

S4410: Visualization and Analysis of Petascale Molecular Simulations with VMD

S4410: Visualization and Analysis of Petascale Molecular Simulations with VMD S4410: Visualization and Analysis of Petascale Molecular Simulations with VMD John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

Proteins and Mesoscale Data: Visualization of Molecular Dynamics

Proteins and Mesoscale Data: Visualization of Molecular Dynamics Proteins and Mesoscale Data: Visualization of Molecular Dynamics John E. Stone Theoretical and Computational Biophysics Group Beckman Institute, University of Illinois at Urbana-Champaign http://www.ks.uiuc.edu/research/vmd/

More information

S6253 VMD: Petascale Molecular Visualization and Analysis with Remote Video Streaming

S6253 VMD: Petascale Molecular Visualization and Analysis with Remote Video Streaming S6253 VMD: Petascale Molecular Visualization and Analysis with Remote Video Streaming John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

GPU-Accelerated Analysis of Petascale Molecular Dynamics Simulations

GPU-Accelerated Analysis of Petascale Molecular Dynamics Simulations GPU-Accelerated Analysis of Petascale Molecular Dynamics Simulations John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

Using AWS EC2 GPU Instances for Computational Microscopy and Biomolecular Simulation

Using AWS EC2 GPU Instances for Computational Microscopy and Biomolecular Simulation Using AWS EC2 GPU Instances for Computational Microscopy and Biomolecular Simulation John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR

S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and

More information

John E. Stone. S03: High Performance Computing with CUDA. Heterogeneous GPU Computing for Molecular Modeling

John E. Stone. S03: High Performance Computing with CUDA. Heterogeneous GPU Computing for Molecular Modeling S03: High Performance Computing with CUDA Heterogeneous GPU Computing for Molecular Modeling John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR

S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR S8665 VMD: Biomolecular Visualization from Atoms to Cells Using Ray Tracing, Rasterization, and VR John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and

More information

Early Experiences Scaling VMD Molecular Visualization and Analysis Jobs on Blue Waters

Early Experiences Scaling VMD Molecular Visualization and Analysis Jobs on Blue Waters Early Experiences Scaling VMD Molecular Visualization and Analysis Jobs on Blue Waters John E. Stone, Barry Isralewitz, Klaus Schulten Theoretical and Computational Biophysics Group Beckman Institute for

More information

Frontiers of Molecular Visualization: Interactive Ray Tracing, Panoramic Displays, VR HMDs, and Remote Visualization. John E.

Frontiers of Molecular Visualization: Interactive Ray Tracing, Panoramic Displays, VR HMDs, and Remote Visualization. John E. Frontiers of Molecular Visualization: Interactive Ray Tracing, Panoramic Displays, VR HMDs, and Remote Visualization John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced

More information

VMD: Interactive Publication-Quality Molecular Ray Tracing with OptiX

VMD: Interactive Publication-Quality Molecular Ray Tracing with OptiX VMD: Interactive Publication-Quality Molecular Ray Tracing with OptiX John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

Accelerating Molecular Modeling Applications with Graphics Processors

Accelerating Molecular Modeling Applications with Graphics Processors Accelerating Molecular Modeling Applications with Graphics Processors John Stone Theoretical and Computational Biophysics Group University of Illinois at Urbana-Champaign Research/gpu/ SIAM Conference

More information

Molecular Visualization and Simulation in VR

Molecular Visualization and Simulation in VR Molecular Visualization and Simulation in VR John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

S5371 VMD: Visualization and Analysis of Biomolecular Complexes with GPU Computing

S5371 VMD: Visualization and Analysis of Biomolecular Complexes with GPU Computing S5371 VMD: Visualization and Analysis of Biomolecular Complexes with GPU Computing John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

High Performance Molecular Visualization and Analysis on GPUs

High Performance Molecular Visualization and Analysis on GPUs High Performance Molecular Visualization and Analysis on GPUs John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

Using GPUs to Supercharge Visualization and Analysis of Molecular Dynamics Simulations with VMD

Using GPUs to Supercharge Visualization and Analysis of Molecular Dynamics Simulations with VMD Using GPUs to Supercharge Visualization and Analysis of Molecular Dynamics Simulations with VMD John E. Stone http://www.ks.uiuc.edu/research/vmd/ VMD Visual Molecular Dynamics Visualization and analysis

More information

S8709 Accelerating Molecular Modeling on Desktop and Pre-Exascale Supercomputers

S8709 Accelerating Molecular Modeling on Desktop and Pre-Exascale Supercomputers S8709 Accelerating Molecular Modeling on Desktop and Pre-Exascale Supercomputers John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

High Performance Molecular Simulation, Visualization, and Analysis on GPUs

High Performance Molecular Simulation, Visualization, and Analysis on GPUs High Performance Molecular Simulation, Visualization, and Analysis on GPUs John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of

More information

Visualization Challenges and Opportunities Posed by Petascale Molecular Dynamics Simulations

Visualization Challenges and Opportunities Posed by Petascale Molecular Dynamics Simulations Visualization Challenges and Opportunities Posed by Petascale Molecular Dynamics Simulations John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

Accelerating Molecular Modeling Applications with GPU Computing

Accelerating Molecular Modeling Applications with GPU Computing Accelerating Molecular Modeling Applications with GPU Computing John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

GPU-Accelerated Molecular Visualization and Analysis with VMD

GPU-Accelerated Molecular Visualization and Analysis with VMD GPU-Accelerated Molecular Visualization and Analysis with VMD John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

Analysis and Visualization Algorithms in VMD

Analysis and Visualization Algorithms in VMD 1 Analysis and Visualization Algorithms in VMD David Hardy Research/~dhardy/ NAIS: State-of-the-Art Algorithms for Molecular Dynamics (Presenting the work of John Stone.) VMD Visual Molecular Dynamics

More information

Programming for Hybrid Architectures Today and in the Future

Programming for Hybrid Architectures Today and in the Future Programming for Hybrid Architectures Today and in the Future John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

Visualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail

Visualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail Visualization of Energy Conversion Processes in a Light Harvesting Organelle at Atomic Detail Theoretical and Computational Biophysics Group Center for the Physics of Living Cells Beckman Institute for

More information

Improving NAMD Performance on Volta GPUs

Improving NAMD Performance on Volta GPUs Improving NAMD Performance on Volta GPUs David Hardy - Research Programmer, University of Illinois at Urbana-Champaign Ke Li - HPC Developer Technology Engineer, NVIDIA John Stone - Senior Research Programmer,

More information

Experiences Developing and Maintaining Scientific Applications on GPU-Accelerated Platforms

Experiences Developing and Maintaining Scientific Applications on GPU-Accelerated Platforms Experiences Developing and Maintaining Scientific Applications on GPU-Accelerated Platforms John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

Advances In Biomolecular Simulation with NAMD and VMD

Advances In Biomolecular Simulation with NAMD and VMD Advances In Biomolecular Simulation with NAMD and VMD John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

Accelerating Molecular Modeling Applications with GPU Computing

Accelerating Molecular Modeling Applications with GPU Computing Accelerating Molecular Modeling Applications with GPU Computing John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

Broadening the Use of Scalable Kernels in NAMD/VMD

Broadening the Use of Scalable Kernels in NAMD/VMD Broadening the Use of Scalable Kernels in NAMD/VMD John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

Visualizing Biomolecular Complexes on x86 and KNL Platforms: Integrating VMD and OSPRay

Visualizing Biomolecular Complexes on x86 and KNL Platforms: Integrating VMD and OSPRay Visualizing Biomolecular Complexes on x86 and KNL Platforms: Integrating VMD and OSPRay John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

Accelerating Scientific Applications with GPUs

Accelerating Scientific Applications with GPUs Accelerating Scientific Applications with GPUs John Stone Theoretical and Computational Biophysics Group University of Illinois at Urbana-Champaign Research/gpu/ Workshop on Programming Massively Parallel

More information

High Performance Molecular Visualization and Analysis with GPU Computing

High Performance Molecular Visualization and Analysis with GPU Computing High Performance Molecular Visualization and Analysis with GPU Computing John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois

More information

Case Study: Molecular Modeling Applications John E. Stone

Case Study: Molecular Modeling Applications John E. Stone S04: High Performance Computing with CUDA Case Study: Molecular Modeling Applications John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

Scaling in a Heterogeneous Environment with GPUs: GPU Architecture, Concepts, and Strategies

Scaling in a Heterogeneous Environment with GPUs: GPU Architecture, Concepts, and Strategies Scaling in a Heterogeneous Environment with GPUs: GPU Architecture, Concepts, and Strategies John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology

More information

Immersive Out-of-Core Visualization of Large-Size and Long-Timescale Molecular Dynamics Trajectories

Immersive Out-of-Core Visualization of Large-Size and Long-Timescale Molecular Dynamics Trajectories Immersive Out-of-Core Visualization of Large-Size and Long-Timescale Molecular Dynamics Trajectories J. Stone, K. Vandivort, K. Schulten Theoretical and Computational Biophysics Group Beckman Institute

More information

Accelerating Computational Biology by 100x Using CUDA. John Stone Theoretical and Computational Biophysics Group, University of Illinois

Accelerating Computational Biology by 100x Using CUDA. John Stone Theoretical and Computational Biophysics Group, University of Illinois Accelerating Computational Biology by 100x Using CUDA John Stone Theoretical and Computational Biophysics Group, University of Illinois GPU Computing Commodity devices, omnipresent in modern computers

More information

GPU Acceleration of Molecular Modeling Applications

GPU Acceleration of Molecular Modeling Applications GPU Acceleration of Molecular Modeling Applications James Phillips John Stone Research/gpu/ NAMD: Practical Supercomputing 25,000 users can t all be computer experts. 18% are NIH-funded; many in other

More information

Multilevel Summation of Electrostatic Potentials Using GPUs

Multilevel Summation of Electrostatic Potentials Using GPUs Multilevel Summation of Electrostatic Potentials Using GPUs David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at

More information

Simulating Life at the Atomic Scale

Simulating Life at the Atomic Scale Simulating Life at the Atomic Scale James Phillips Beckman Institute, University of Illinois Research/namd/ Beckman Institute University of Illinois at Urbana-Champaign Theoretical and Computational Biophysics

More information

Using GPUs to compute the multilevel summation of electrostatic forces

Using GPUs to compute the multilevel summation of electrostatic forces Using GPUs to compute the multilevel summation of electrostatic forces David J. Hardy Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of

More information

High Performance Computing with Accelerators

High Performance Computing with Accelerators High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing

More information

ECE 498AL. Lecture 21-22: Application Performance Case Studies: Molecular Visualization and Analysis

ECE 498AL. Lecture 21-22: Application Performance Case Studies: Molecular Visualization and Analysis ECE 498AL Lecture 21-22: Application Performance Case Studies: Molecular Visualization and Analysis Guest Lecture by John Stone Theoretical and Computational Biophysics Group NIH Resource for Macromolecular

More information

CUDA Experiences: Over-Optimization and Future HPC

CUDA Experiences: Over-Optimization and Future HPC CUDA Experiences: Over-Optimization and Future HPC Carl Pearson 1, Simon Garcia De Gonzalo 2 Ph.D. candidates, Electrical and Computer Engineering 1 / Computer Science 2, University of Illinois Urbana-Champaign

More information

OPTIMIZING HPC SIMULATION AND VISUALIZATION CODE USING NVIDIA NSIGHT SYSTEMS

OPTIMIZING HPC SIMULATION AND VISUALIZATION CODE USING NVIDIA NSIGHT SYSTEMS OPTIMIZING HPC SIMULATION AND VISUALIZATION CODE USING NVIDIA NSIGHT SYSTEMS Daniel Horowitz Director of Platform Developer Tools, NVIDIA, Robert (Bob) Knight Principal System Software Engineer, NVIDIA

More information

NAMD, CUDA, and Clusters: Taking GPU Molecular Dynamics Beyond the Deskop

NAMD, CUDA, and Clusters: Taking GPU Molecular Dynamics Beyond the Deskop NAMD, CUDA, and Clusters: Taking GPU Molecular Dynamics Beyond the Deskop James Phillips Research/gpu/ Beckman Institute University of Illinois at Urbana-Champaign Theoretical and Computational Biophysics

More information

Graphics Processor Acceleration and YOU

Graphics Processor Acceleration and YOU Graphics Processor Acceleration and YOU James Phillips Research/gpu/ Goals of Lecture After this talk the audience will: Understand how GPUs differ from CPUs Understand the limits of GPU acceleration Have

More information

GPU Accelerated Visualization and Analysis in VMD

GPU Accelerated Visualization and Analysis in VMD GPU Accelerated Visualization and Analysis in VMD John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

IBM J. Res. & Dev.Acun et al.: Scalable Molecular Dynamics with NAMD on the Summit System Page 1

IBM J. Res. & Dev.Acun et al.: Scalable Molecular Dynamics with NAMD on the Summit System Page 1 IBM J. Res. & Dev.Acun et al.: Scalable Molecular Dynamics with NAMD on the Summit System Page 1 Scalable Molecular Dynamics with NAMD on the Summit System B. Acun, D. J. Hardy, L. V. Kale, K. Li, J. C.

More information

IBM CORAL HPC System Solution

IBM CORAL HPC System Solution IBM CORAL HPC System Solution HPC and HPDA towards Cognitive, AI and Deep Learning Deep Learning AI / Deep Learning Strategy for Power Power AI Platform High Performance Data Analytics Big Data Strategy

More information

CUDA Applications I. John E. Stone

CUDA Applications I. John E. Stone CUDA Applications I John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Research/gpu/ Cape Town

More information

S05: High Performance Computing with CUDA. Case Study: Molecular Visualization and Analysis

S05: High Performance Computing with CUDA. Case Study: Molecular Visualization and Analysis Case Study: Molecular Visualization and Analysis John Stone NIH Resource for Macromolecular Modeling and Bioinformatics http://www.ks.uiuc.edu/research/gpu/ Outline What speedups can be expected Fluorescence

More information

Lecture 1: Introduction and Computational Thinking

Lecture 1: Introduction and Computational Thinking PASI Summer School Advanced Algorithmic Techniques for GPUs Lecture 1: Introduction and Computational Thinking 1 Course Objective To master the most commonly used algorithm techniques and computational

More information

Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters

Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters Craig Steffen csteffen@ncsa.uiuc.edu NCSA Innovative Systems Lab First International Green Computing Conference Workshop

More information

ENABLING NEW SCIENCE GPU SOLUTIONS

ENABLING NEW SCIENCE GPU SOLUTIONS ENABLING NEW SCIENCE TESLA BIO Workbench The NVIDIA Tesla Bio Workbench enables biophysicists and computational chemists to push the boundaries of life sciences research. It turns a standard PC into a

More information

NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems

NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems Carl Pearson 1, I-Hsin Chung 2, Zehra Sura 2, Wen-Mei Hwu 1, and Jinjun Xiong 2 1 University of Illinois Urbana-Champaign, Urbana

More information

NVIDIA Update and Directions on GPU Acceleration for Earth System Models

NVIDIA Update and Directions on GPU Acceleration for Earth System Models NVIDIA Update and Directions on GPU Acceleration for Earth System Models Stan Posey, HPC Program Manager, ESM and CFD, NVIDIA, Santa Clara, CA, USA Carl Ponder, PhD, Applications Software Engineer, NVIDIA,

More information

Bigger GPUs and Bigger Nodes. Carl Pearson PhD Candidate, advised by Professor Wen-Mei Hwu

Bigger GPUs and Bigger Nodes. Carl Pearson PhD Candidate, advised by Professor Wen-Mei Hwu Bigger GPUs and Bigger Nodes Carl Pearson (pearson@illinois.edu) PhD Candidate, advised by Professor Wen-Mei Hwu 1 Outline Experiences from working with domain experts to develop GPU codes on Blue Waters

More information

Preparing GPU-Accelerated Applications for the Summit Supercomputer

Preparing GPU-Accelerated Applications for the Summit Supercomputer Preparing GPU-Accelerated Applications for the Summit Supercomputer Fernanda Foertter HPC User Assistance Group Training Lead foertterfs@ornl.gov This research used resources of the Oak Ridge Leadership

More information

GPU Particle-Grid Methods: Electrostatics

GPU Particle-Grid Methods: Electrostatics GPU Particle-Grid Methods: Electrostatics John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Research/gpu/

More information

NAMD at Extreme Scale. Presented by: Eric Bohm Team: Eric Bohm, Chao Mei, Osman Sarood, David Kunzman, Yanhua, Sun, Jim Phillips, John Stone, LV Kale

NAMD at Extreme Scale. Presented by: Eric Bohm Team: Eric Bohm, Chao Mei, Osman Sarood, David Kunzman, Yanhua, Sun, Jim Phillips, John Stone, LV Kale NAMD at Extreme Scale Presented by: Eric Bohm Team: Eric Bohm, Chao Mei, Osman Sarood, David Kunzman, Yanhua, Sun, Jim Phillips, John Stone, LV Kale Overview NAMD description Power7 Tuning Support for

More information

The Parallel Revolution in Computational Science and Engineering

The Parallel Revolution in Computational Science and Engineering The Parallel Revolution in Computational Science and Engineering applications, education, tools, and impact Wen-mei Hwu University of Illinois, Urbana-Champaign The Energy Behind Parallel Revolution Calculation:

More information

GPU Histogramming: Radial Distribution Functions

GPU Histogramming: Radial Distribution Functions GPU Histogramming: Radial Distribution Functions John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign

More information

GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory

GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Presentation Outline NVIDIA

More information

VSCSE Summer School Lecture 8: Application Case Study Accelerating Molecular Dynamics Experimentation

VSCSE Summer School Lecture 8: Application Case Study Accelerating Molecular Dynamics Experimentation VSCSE Summer School 2009 Many-core Processors for Science and Engineering Applications Lecture 8: Application Case Study Accelerating Molecular Dynamics Experimentation Guest Lecture by John Stone Theoretical

More information

Recent Advances in Heterogeneous Computing using Charm++

Recent Advances in Heterogeneous Computing using Charm++ Recent Advances in Heterogeneous Computing using Charm++ Jaemin Choi, Michael Robson Parallel Programming Laboratory University of Illinois Urbana-Champaign April 12, 2018 1 / 24 Heterogeneous Computing

More information

What does Heterogeneity bring?

What does Heterogeneity bring? What does Heterogeneity bring? Ken Koch Scientific Advisor, CCS-DO, LANL LACSI 2006 Conference October 18, 2006 Some Terminology Homogeneous Of the same or similar nature or kind Uniform in structure or

More information

Trends in HPC (hardware complexity and software challenges)

Trends in HPC (hardware complexity and software challenges) Trends in HPC (hardware complexity and software challenges) Mike Giles Oxford e-research Centre Mathematical Institute MIT seminar March 13th, 2013 Mike Giles (Oxford) HPC Trends March 13th, 2013 1 / 18

More information

The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System

The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Alan Humphrey, Qingyu Meng, Martin Berzins Scientific Computing and Imaging Institute & University of Utah I. Uintah Overview

More information

OpenPOWER Performance

OpenPOWER Performance OpenPOWER Performance Alex Mericas Chief Engineer, OpenPOWER Performance IBM Delivering the Linux ecosystem for Power SOLUTIONS OpenPOWER IBM SOFTWARE LINUX ECOSYSTEM OPEN SOURCE Solutions with full stack

More information

Abhinav Bhatele, Laxmikant V. Kale University of Illinois at Urbana Champaign Sameer Kumar IBM T. J. Watson Research Center

Abhinav Bhatele, Laxmikant V. Kale University of Illinois at Urbana Champaign Sameer Kumar IBM T. J. Watson Research Center Abhinav Bhatele, Laxmikant V. Kale University of Illinois at Urbana Champaign Sameer Kumar IBM T. J. Watson Research Center Motivation: Contention Experiments Bhatele, A., Kale, L. V. 2008 An Evaluation

More information

Accelerating NAMD with Graphics Processors

Accelerating NAMD with Graphics Processors Accelerating NAMD with Graphics Processors James Phillips John Stone Klaus Schulten Research/namd/ NAMD: Practical Supercomputing 24,000 users can t all be computer experts. 18% are NIH-funded; many in

More information

GPU Accelerated Visualization and Analysis in VMD and Recent NAMD Developments

GPU Accelerated Visualization and Analysis in VMD and Recent NAMD Developments GPU Accelerated Visualization and Analysis in VMD and Recent NAMD Developments John Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University

More information

Introduction to GPU hardware and to CUDA

Introduction to GPU hardware and to CUDA Introduction to GPU hardware and to CUDA Philip Blakely Laboratory for Scientific Computing, University of Cambridge Philip Blakely (LSC) GPU introduction 1 / 35 Course outline Introduction to GPU hardware

More information

VMD Key Features of Recent Release

VMD Key Features of Recent Release VMD 1.8.7 Key Features of Recent Release John Stone Research/vmd/ Quick Summary of Features Broader platform support Updated user interfaces Accelerated analysis, rendering, display: multi-core CPUs, GPUs

More information

GPU Computing: Development and Analysis. Part 1. Anton Wijs Muhammad Osama. Marieke Huisman Sebastiaan Joosten

GPU Computing: Development and Analysis. Part 1. Anton Wijs Muhammad Osama. Marieke Huisman Sebastiaan Joosten GPU Computing: Development and Analysis Part 1 Anton Wijs Muhammad Osama Marieke Huisman Sebastiaan Joosten NLeSC GPU Course Rob van Nieuwpoort & Ben van Werkhoven Who are we? Anton Wijs Assistant professor,

More information

NAMD GPU Performance Benchmark. March 2011

NAMD GPU Performance Benchmark. March 2011 NAMD GPU Performance Benchmark March 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory

More information

19. prosince 2018 CIIRC Praha. Milan Král, IBM Radek Špimr

19. prosince 2018 CIIRC Praha. Milan Král, IBM Radek Špimr 19. prosince 2018 CIIRC Praha Milan Král, IBM Radek Špimr CORAL CORAL 2 CORAL Installation at ORNL CORAL Installation at LLNL Order of Magnitude Leap in Computational Power Real, Accelerated Science ACME

More information

Experience with NAMD on GPU-Accelerated Clusters

Experience with NAMD on GPU-Accelerated Clusters Experience with NAMD on GPU-Accelerated Clusters James Phillips John Stone Klaus Schulten Research/gpu/ Outline Motivational images o NAMD simulations Why all the uss about GPUs? What is message-driven

More information

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Sep 2009 Gilad Shainer, Tong Liu (Mellanox); Jeffrey Layton (Dell); Joshua Mora (AMD) High Performance Interconnects for

More information

Mapping MPI+X Applications to Multi-GPU Architectures

Mapping MPI+X Applications to Multi-GPU Architectures Mapping MPI+X Applications to Multi-GPU Architectures A Performance-Portable Approach Edgar A. León Computer Scientist San Jose, CA March 28, 2018 GPU Technology Conference This work was performed under

More information

NAMD Performance Benchmark and Profiling. January 2015

NAMD Performance Benchmark and Profiling. January 2015 NAMD Performance Benchmark and Profiling January 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

PERFORMANCE PORTABILITY WITH OPENACC. Jeff Larkin, NVIDIA, November 2015

PERFORMANCE PORTABILITY WITH OPENACC. Jeff Larkin, NVIDIA, November 2015 PERFORMANCE PORTABILITY WITH OPENACC Jeff Larkin, NVIDIA, November 2015 TWO TYPES OF PORTABILITY FUNCTIONAL PORTABILITY PERFORMANCE PORTABILITY The ability for a single code to run anywhere. The ability

More information

Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering

Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering Immersive Molecular Visualization with Omnidirectional Stereoscopic Ray Tracing and Remote Rendering Klaus Schulten William R. Sherman John E. Stone Department of Physics Pervasive Technology Institute

More information

CSCI 402: Computer Architectures. Parallel Processors (2) Fengguang Song Department of Computer & Information Science IUPUI.

CSCI 402: Computer Architectures. Parallel Processors (2) Fengguang Song Department of Computer & Information Science IUPUI. CSCI 402: Computer Architectures Parallel Processors (2) Fengguang Song Department of Computer & Information Science IUPUI 6.6 - End Today s Contents GPU Cluster and its network topology The Roofline performance

More information

Advanced CUDA: Application Examples

Advanced CUDA: Application Examples Advanced CUDA: Application Examples John E. Stone Theoretical and Computational Biophysics Group Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Research/gpu/

More information

NAMD Performance Benchmark and Profiling. February 2012

NAMD Performance Benchmark and Profiling. February 2012 NAMD Performance Benchmark and Profiling February 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource -

More information

INTRODUCTION TO OPENACC. Analyzing and Parallelizing with OpenACC, Feb 22, 2017

INTRODUCTION TO OPENACC. Analyzing and Parallelizing with OpenACC, Feb 22, 2017 INTRODUCTION TO OPENACC Analyzing and Parallelizing with OpenACC, Feb 22, 2017 Objective: Enable you to to accelerate your applications with OpenACC. 2 Today s Objectives Understand what OpenACC is and

More information

Petascale Multiscale Simulations of Biomolecular Systems. John Grime Voth Group Argonne National Laboratory / University of Chicago

Petascale Multiscale Simulations of Biomolecular Systems. John Grime Voth Group Argonne National Laboratory / University of Chicago Petascale Multiscale Simulations of Biomolecular Systems John Grime Voth Group Argonne National Laboratory / University of Chicago About me Background: experimental guy in grad school (LSCM, drug delivery)

More information