Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale

Size: px
Start display at page:

Download "Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale"

Transcription

1 Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale GPU Technology Conference March 26, 2014 Kenneth Moreland Sandia Na5onal Laboratories Robert Maynard Kitware, Inc. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy s National Nuclear Security Administration under contract DE-AC04-94AL SAND NO P

2 A Toolkit for Scien5fic Visualiza5on

3 Sci Vis a Conglomera5on of Many Geometric and Other Tools

4 Mo#va#on

5 Slide of Doom System Parameter Factor Change System Peak 2 PetaFLOPS 1 ExaFLOP 500 Power 6 MW 20 MW 3 System Memory 0.3 PB PB Total Concurrency 225K 1B 10 1B , ,000 Node Performance 125 GF 1 TF 10 TF 8 80 Node Concurrency 12 1,000 10, Network BW 1.5 KB/s 100 GB/s 1000 GB/s System Size (nodes) 18,700 1,000, , I/O Capacity 15 PB PB I/O BW 0.2 TB/s TB/s 10 30

6 Slide of Doom System Parameter Factor Change System Peak 2 PetaFLOPS 1 ExaFLOP 500 Power 6 MW 20 MW 3 System Memory 0.3 PB PB Total Concurrency 225K 1B 10 1B , ,000 Node Performance 125 GF 1 TF 10 TF 8 80 Node Concurrency 12 1,000 10, Network BW 1.5 KB/s 100 GB/s 1000 GB/s System Size (nodes) 18,700 1,000, , I/O Capacity 15 PB PB I/O BW 0.2 TB/s TB/s 10 30

7 Extreme Scale is Threads, Threads, Threads! Jaguar XT5 Titan XK7 Exascale* Cores 224, ,008 and 18,688 gpu 1 billion Concurrency 224,256 way million way billion way Memory 300 Terabytes 700 Terabytes 128 Petabytes To succeed at extreme scale, you need to consider the finest possible level of concurrency Expect each thread to process exactly one element Disallow communica5on among threads *Source: Scien5fic Discovery at the Exascale, Ahern, Shoshani, Ma, et al.

8 Project Goals Reduce the challenges of wri5ng highly concurrent algorithms. Everybody who learns concurrency thinks they understand it, ends up finding mysterious races they thought weren t possible, and discovers that they didn t actually understand it yet aher all. Herb Suker

9 Approach

10 Functor Mapping [Baker, et al. 2010] functor()

11 Applied to Topologies functor()

12 Applied to Topologies functor()

13 Framework

14 Dax Framework Control Environment Execu5on Environment dax::cont dax::exec

15 Dax Framework Control Environment Execu5on Environment Grid Topology Array Handle Invoke dax::cont dax::exec

16 Dax Framework Control Environment Grid Topology Array Handle Invoke Execu5on Environment Cell Opera5ons Field Opera5ons Basic Math Make Cells Worklet dax::cont dax::exec

17 Dax Framework Control Environment Grid Topology Array Handle Invoke Execu5on Environment Cell Opera5ons Field Opera5ons Basic Math Make Cells Worklet dax::cont dax::exec

18 Dax Framework Control Environment Grid Topology Array Handle Invoke Device Adapter Allocate Transfer Schedule Sort Execu5on Environment Cell Opera5ons Field Opera5ons Basic Math Make Cells Worklet dax::cont dax::exec

19 Device Adapter Contents Tag (struct DeviceAdapterFoo { };) Execu5on Array Manager Control Environment Transfer Execu5on Environment Schedule functor Schedule worklet worklet worklet worklet worklet worklet worklet functor Compute Scan Compute Sort Other Support algorithms Stream compact, copy, parallel find, unique

20 Anatomy of a Worklet

21 struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

22 struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

23 struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

24 struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

25 struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

26 dax::cont::arrayhandle<dax::scalar> inputhandle = dax::cont::make_arrayhandle(input); dax::cont::arrayhandle<dax::scalar> sineresult; Control Environment Execu5on Environment dax::cont::dispatchermapfield<sine> dispatcher; dispatcher.invoke(inputhandle, sineresult); struct Sine: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(Out)); typedef _2 ExecutionSignature(_1); DAX_EXEC_EXPORT dax::scalar operator()(dax::scalar v) const { return dax::math::sin(v); } };

27 struct Zip2: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(In), Field(Out)); typedef _3 ExecutionSignature(_1, _2); DAX_EXEC_EXPORT dax::vector2 operator()(dax::scalar x, dax::scalar y) const { return dax::make_vector2(x, y); } };

28 struct ImagToPolar: public dax::exec::workletmapfield { typedef void ControlSignature(Field(In), Field(In), Field(Out), Field(Out)); typedef void ExecutionSignature(_1, _2, _3, _4); DAX_EXEC_EXPORT void operator()(dax::scalar real, dax::scalar imaginary, dax::scalar &magnitude, dax::scalar &phase) const { magnitude = dax::math::magnitude( dax::make_vector2(real, imaginary)); phase = dax::math::atan2(imaginary, real); }

29 struct Advect: public dax::exec::workletmapfield { typedef void ControlSignature( Field(In), Field(In), Field(In), Field(Out), Field(Out), Field(Out), Field(Out)); typedef void ExecutionSignature( _1, _2, _3, _4, _5, _6, _7); DAX_EXEC_EXPORT void operator()(dax::vector3 startposition, dax::vector3 startvelocity, dax::vector3 acceleration, dax::vector3 &endposition, dax::vector3 &endvelocity, dax::scalar &rotation, dax::scalar &angularvelocity) const {... }

30 struct Contour: public dax::exec::workletinterpolatedcell { typedef void ControlSignature(Topology, Geometry(Out), Field(Point,In)); typedef void ExecutionSignature( Vertices(_1), _2, _3, VisitIndex); template<typename CellTag> DAX_EXEC_EXPORT void operator()( const CellVertices<CellTag>& verts, InterpolatedCellPoints<CellTagTriangle>& outcell, const CellField<dax::Scalar,CellTag> &values, dax::id inputcellvisitindex) const {...

31 Example Code Mandelbulb Fractal

32 Mandelbulb nth power of 3D Vector where

33

34 Mandelbulb Itera5on Fractal created by itera5ng on power opera5on, offseqng by input coordinate. Our implementa5on counts how many itera5ons to escape.

35

36

37

38 Acknowledgements This work was supported by the DOE Office of Science, Advanced Scien5fic Compu5ng Research, under award number , program manager Lucy Nowell. Addi5onal support by the Director, Office of Advanced Scien5fic Compu5ng Research, Office of Science, of the U.S. Department of Energy under Contract No , through the Scien5fic Discovery through Advanced Compu5ng (SciDAC) Ins5tute of Scalable Data Management, Analysis and Visualiza5on. Sandia Na5onal Laboratories is a mul5- program laboratory managed and operated by Sandia Corpora5on, a wholly owned subsidiary of Lockheed Mar5n Corpora5on, for the U.S. Department of Energy s Na5onal Nuclear Security Administra5on under contract DE- AC04-94AL85000.

Robert Maynard Kitware, Inc. Kenneth Moreland Sandia National Laboratories

Robert Maynard Kitware, Inc. Kenneth Moreland Sandia National Laboratories Robert Maynard Kitware, Inc. Kenneth Moreland Sandia National Laboratories Data Analysis at Extreme Data Analysis and Visualization library Multi- and Many-core processor support Allows users to create

More information

A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard

A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard Sandia Na*onal Laboratories Kitware, Inc. University of California at Davis

More information

VTK-m: Uniting GPU Acceleration Successes. Robert Maynard Kitware Inc.

VTK-m: Uniting GPU Acceleration Successes. Robert Maynard Kitware Inc. VTK-m: Uniting GPU Acceleration Successes Robert Maynard Kitware Inc. VTK-m Project Supercomputer Hardware Advances Everyday More and more parallelism High-Level Parallelism The Free Lunch Is Over (Herb

More information

Oh, Exascale! The effect of emerging architectures on scien1fic discovery. Kenneth Moreland, Sandia Na1onal Laboratories

Oh, Exascale! The effect of emerging architectures on scien1fic discovery. Kenneth Moreland, Sandia Na1onal Laboratories Photos placed in horizontal posi1on with even amount of white space between photos and header Oh, $#*@! Exascale! The effect of emerging architectures on scien1fic discovery Ultrascale Visualiza1on Workshop,

More information

In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m

In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m GTC, San Jose, CA March, 2015 Robert Maynard Marcus D. Hanwell 1 Agenda Introduction to ParaView Catalyst Run example Catalyst Script

More information

Parallel Graph Coloring For Many- core Architectures

Parallel Graph Coloring For Many- core Architectures Parallel Graph Coloring For Many- core Architectures Mehmet Deveci, Erik Boman, Siva Rajamanickam Sandia Na;onal Laboratories Sandia National Laboratories is a multi-program laboratory managed and operated

More information

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales

Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales Using a Robust Metadata Management System to Accelerate Scientific Discovery at Extreme Scales Margaret Lawson, Jay Lofstead Sandia National Laboratories is a multimission laboratory managed and operated

More information

Motivation Goal Idea Proposition for users Study

Motivation Goal Idea Proposition for users Study Exploring Tradeoffs Between Power and Performance for a Scientific Visualization Algorithm Stephanie Labasan Computer and Information Science University of Oregon 23 November 2015 Overview Motivation:

More information

Overview. Idea: Reduce CPU clock frequency This idea is well suited specifically for visualization

Overview. Idea: Reduce CPU clock frequency This idea is well suited specifically for visualization Exploring Tradeoffs Between Power and Performance for a Scientific Visualization Algorithm Stephanie Labasan & Matt Larsen (University of Oregon), Hank Childs (Lawrence Berkeley National Laboratory) 26

More information

Titan - Early Experience with the Titan System at Oak Ridge National Laboratory

Titan - Early Experience with the Titan System at Oak Ridge National Laboratory Office of Science Titan - Early Experience with the Titan System at Oak Ridge National Laboratory Buddy Bland Project Director Oak Ridge Leadership Computing Facility November 13, 2012 ORNL s Titan Hybrid

More information

Using the Cray Gemini Performance Counters

Using the Cray Gemini Performance Counters Photos placed in horizontal position with even amount of white space between photos and header Using the Cray Gemini Performance Counters 0 1 2 3 4 5 6 7 Backplane Backplane 8 9 10 11 12 13 14 15 Backplane

More information

What is DARMA? DARMA is a C++ abstraction layer for asynchronous many-task (AMT) runtimes.

What is DARMA? DARMA is a C++ abstraction layer for asynchronous many-task (AMT) runtimes. DARMA Janine C. Bennett, Jonathan Lifflander, David S. Hollman, Jeremiah Wilke, Hemanth Kolla, Aram Markosyan, Nicole Slattengren, Robert L. Clay (PM) PSAAP-WEST February 22, 2017 Sandia National Laboratories

More information

Preparing GPU-Accelerated Applications for the Summit Supercomputer

Preparing GPU-Accelerated Applications for the Summit Supercomputer Preparing GPU-Accelerated Applications for the Summit Supercomputer Fernanda Foertter HPC User Assistance Group Training Lead foertterfs@ornl.gov This research used resources of the Oak Ridge Leadership

More information

Integrating Analysis and Computation with Trios Services

Integrating Analysis and Computation with Trios Services October 31, 2012 Integrating Analysis and Computation with Trios Services Approved for Public Release: SAND2012-9323P Ron A. Oldfield Scalable System Software Sandia National Laboratories Albuquerque,

More information

The Cielo Capability Supercomputer

The Cielo Capability Supercomputer The Cielo Capability Supercomputer Manuel Vigil (LANL), Douglas Doerfler (SNL), Sudip Dosanjh (SNL) and John Morrison (LANL) SAND 2011-3421C Unlimited Release Printed February, 2011 Sandia is a multiprogram

More information

Intel Many Integrated Core (MIC) Architecture

Intel Many Integrated Core (MIC) Architecture Intel Many Integrated Core (MIC) Architecture Karl Solchenbach Director European Exascale Labs BMW2011, November 3, 2011 1 Notice and Disclaimers Notice: This document contains information on products

More information

Massively Parallel Graph Analytics

Massively Parallel Graph Analytics Massively Parallel Graph Analytics Manycore graph processing, distributed graph layout, and supercomputing for graph analytics George M. Slota 1,2,3 Kamesh Madduri 2 Sivasankaran Rajamanickam 1 1 Sandia

More information

Oak Ridge National Laboratory Computing and Computational Sciences

Oak Ridge National Laboratory Computing and Computational Sciences Oak Ridge National Laboratory Computing and Computational Sciences OFA Update by ORNL Presented by: Pavel Shamis (Pasha) OFA Workshop Mar 17, 2015 Acknowledgments Bernholdt David E. Hill Jason J. Leverman

More information

Eureka! Task Teams! Kyle Wheeler SC 12 Chapel Lightning Talk SAND: P

Eureka! Task Teams! Kyle Wheeler SC 12 Chapel Lightning Talk SAND: P GO 08012011 Eureka! Task Teams! Kyle Wheeler SC 12 Chapel Lightning Talk Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary

More information

Visual Analysis of Lagrangian Particle Data from Combustion Simulations

Visual Analysis of Lagrangian Particle Data from Combustion Simulations Visual Analysis of Lagrangian Particle Data from Combustion Simulations Hongfeng Yu Sandia National Laboratories, CA Ultrascale Visualization Workshop, SC11 Nov 13 2011, Seattle, WA Joint work with Jishang

More information

EMPRESS Extensible Metadata PRovider for Extreme-scale Scientific Simulations

EMPRESS Extensible Metadata PRovider for Extreme-scale Scientific Simulations EMPRESS Extensible Metadata PRovider for Extreme-scale Scientific Simulations Photos placed in horizontal position with even amount of white space between photos and header Margaret Lawson, Jay Lofstead,

More information

High performance computing and numerical modeling

High performance computing and numerical modeling High performance computing and numerical modeling Volker Springel Plan for my lectures Lecture 1: Collisional and collisionless N-body dynamics Lecture 2: Gravitational force calculation Lecture 3: Basic

More information

High Performance Data Analytics for Numerical Simulations. Bruno Raffin DataMove

High Performance Data Analytics for Numerical Simulations. Bruno Raffin DataMove High Performance Data Analytics for Numerical Simulations Bruno Raffin DataMove bruno.raffin@inria.fr April 2016 About this Talk HPC for analyzing the results of large scale parallel numerical simulations

More information

Irregular Graph Algorithms on Parallel Processing Systems

Irregular Graph Algorithms on Parallel Processing Systems Irregular Graph Algorithms on Parallel Processing Systems George M. Slota 1,2 Kamesh Madduri 1 (advisor) Sivasankaran Rajamanickam 2 (Sandia mentor) 1 Penn State University, 2 Sandia National Laboratories

More information

Developing Integrated Data Services for Cray Systems with a Gemini Interconnect

Developing Integrated Data Services for Cray Systems with a Gemini Interconnect Developing Integrated Data Services for Cray Systems with a Gemini Interconnect Ron A. Oldfield Scalable System So4ware Sandia Na9onal Laboratories Albuquerque, NM, USA raoldfi@sandia.gov Cray User Group

More information

Implementing Many-Body Potentials for Molecular Dynamics Simulations

Implementing Many-Body Potentials for Molecular Dynamics Simulations Official Use Only Implementing Many-Body Potentials for Molecular Dynamics Simulations Using large scale clusters for higher accuracy simulations. Christian Trott, Aidan Thompson Unclassified, Unlimited

More information

Simple Parallel Biconnectivity Algorithms for Multicore Platforms

Simple Parallel Biconnectivity Algorithms for Multicore Platforms Simple Parallel Biconnectivity Algorithms for Multicore Platforms George M. Slota Kamesh Madduri The Pennsylvania State University HiPC 2014 December 17-20, 2014 Code, presentation available at graphanalysis.info

More information

Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM

Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM 25th March, GTC 2014, San Jose CA AnE- Pekka Hynninen ane.pekka.hynninen@nrel.gov NREL is a na*onal laboratory of the U.S. Department of Energy,

More information

Combinatorial Mathema/cs and Algorithms at Exascale: Challenges and Promising Direc/ons

Combinatorial Mathema/cs and Algorithms at Exascale: Challenges and Promising Direc/ons Combinatorial Mathema/cs and Algorithms at Exascale: Challenges and Promising Direc/ons Assefaw Gebremedhin Purdue University (Star/ng August 2014, Washington State University School of Electrical Engineering

More information

HPC Algorithms and Applications

HPC Algorithms and Applications HPC Algorithms and Applications Intro Michael Bader Winter 2015/2016 Intro, Winter 2015/2016 1 Part I Scientific Computing and Numerical Simulation Intro, Winter 2015/2016 2 The Simulation Pipeline phenomenon,

More information

CUDA. Matthew Joyner, Jeremy Williams

CUDA. Matthew Joyner, Jeremy Williams CUDA Matthew Joyner, Jeremy Williams Agenda What is CUDA? CUDA GPU Architecture CPU/GPU Communication Coding in CUDA Use cases of CUDA Comparison to OpenCL What is CUDA? What is CUDA? CUDA is a parallel

More information

Responsive Large Data Analysis and Visualization with the ParaView Ecosystem. Patrick O Leary, Kitware Inc

Responsive Large Data Analysis and Visualization with the ParaView Ecosystem. Patrick O Leary, Kitware Inc Responsive Large Data Analysis and Visualization with the ParaView Ecosystem Patrick O Leary, Kitware Inc Hybrid Computing Attribute Titan Summit - 2018 Compute Nodes 18,688 ~3,400 Processor (1) 16-core

More information

Extreme-scale Graph Analysis on Blue Waters

Extreme-scale Graph Analysis on Blue Waters Extreme-scale Graph Analysis on Blue Waters 2016 Blue Waters Symposium George M. Slota 1,2, Siva Rajamanickam 1, Kamesh Madduri 2, Karen Devine 1 1 Sandia National Laboratories a 2 The Pennsylvania State

More information

Improving Uintah s Scalability Through the Use of Portable

Improving Uintah s Scalability Through the Use of Portable Improving Uintah s Scalability Through the Use of Portable Kokkos-Based Data Parallel Tasks John Holmen1, Alan Humphrey1, Daniel Sunderland2, Martin Berzins1 University of Utah1 Sandia National Laboratories2

More information

Op#mizing PGAS overhead in a mul#-locale Chapel implementa#on of CoMD

Op#mizing PGAS overhead in a mul#-locale Chapel implementa#on of CoMD Op#mizing PGAS overhead in a mul#-locale Chapel implementa#on of CoMD Riyaz Haque and David F. Richards This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore

More information

Managing HPC Active Archive Storage with HPSS RAIT at Oak Ridge National Laboratory

Managing HPC Active Archive Storage with HPSS RAIT at Oak Ridge National Laboratory Managing HPC Active Archive Storage with HPSS RAIT at Oak Ridge National Laboratory Quinn Mitchell HPC UNIX/LINUX Storage Systems ORNL is managed by UT-Battelle for the US Department of Energy U.S. Department

More information

It s a Multicore World. John Urbanic Pittsburgh Supercomputing Center

It s a Multicore World. John Urbanic Pittsburgh Supercomputing Center It s a Multicore World John Urbanic Pittsburgh Supercomputing Center Waiting for Moore s Law to save your serial code start getting bleak in 2004 Source: published SPECInt data Moore s Law is not at all

More information

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data

More information

PULP: Fast and Simple Complex Network Partitioning

PULP: Fast and Simple Complex Network Partitioning PULP: Fast and Simple Complex Network Partitioning George Slota #,* Kamesh Madduri # Siva Rajamanickam * # The Pennsylvania State University *Sandia National Laboratories Dagstuhl Seminar 14461 November

More information

Microgrid Design Toolkit (MDT)

Microgrid Design Toolkit (MDT) SAND2016-8151 C Microgrid Design Toolkit (MDT) 24 October 2016 John Eddy, Ph.D. Microgrid Design Toolkit (MDT) Principal Investigator System Sustainment & Readiness Technologies Department Sandia National

More information

Extreme-scale Graph Analysis on Blue Waters

Extreme-scale Graph Analysis on Blue Waters Extreme-scale Graph Analysis on Blue Waters 2016 Blue Waters Symposium George M. Slota 1,2, Siva Rajamanickam 1, Kamesh Madduri 2, Karen Devine 1 1 Sandia National Laboratories a 2 The Pennsylvania State

More information

Efficient Memory and Bandwidth Management for Industrial Strength Kirchhoff Migra<on

Efficient Memory and Bandwidth Management for Industrial Strength Kirchhoff Migra<on Efficient Memory and Bandwidth Management for Industrial Strength Kirchhoff Migra

More information

War Stories : Graph Algorithms in GPUs

War Stories : Graph Algorithms in GPUs SAND2014-18323PE War Stories : Graph Algorithms in GPUs Siva Rajamanickam(SNL) George Slota, Kamesh Madduri (PSU) FASTMath Meeting Exceptional service in the national interest is a multi-program laboratory

More information

Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN

Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN Venkatram Vishwanath, Mark Hereld and Michael E. Papka Argonne Na

More information

China's supercomputer surprises U.S. experts

China's supercomputer surprises U.S. experts China's supercomputer surprises U.S. experts John Markoff Reproduced from THE HINDU, October 31, 2011 Fast forward: A journalist shoots video footage of the data storage system of the Sunway Bluelight

More information

Por$ng Monte Carlo Algorithms to the GPU. Ryan Bergmann UC Berkeley Serpent Users Group Mee$ng 9/20/2012 Madrid, Spain

Por$ng Monte Carlo Algorithms to the GPU. Ryan Bergmann UC Berkeley Serpent Users Group Mee$ng 9/20/2012 Madrid, Spain Por$ng Monte Carlo Algorithms to the GPU Ryan Bergmann UC Berkeley Serpent Users Group Mee$ng 9/20/2012 Madrid, Spain 1 Outline Introduc$on to GPUs Why they are interes$ng How they operate Pros and cons

More information

IBM CORAL HPC System Solution

IBM CORAL HPC System Solution IBM CORAL HPC System Solution HPC and HPDA towards Cognitive, AI and Deep Learning Deep Learning AI / Deep Learning Strategy for Power Power AI Platform High Performance Data Analytics Big Data Strategy

More information

Hashing Strategies for the Cray XMT MTAAP 2010

Hashing Strategies for the Cray XMT MTAAP 2010 Hashing Strategies for the Cray XMT MTAAP 2010 Eric Goodman (SNL) David Haglin (PNNL) Chad Scherrer (PNNL) Daniel Chavarría-Miranda (PNNL) Jace Mogill (PNNL) John Feo (PNNL) Sandia is a multiprogram laboratory

More information

GPU Cluster Computing. Advanced Computing Center for Research and Education

GPU Cluster Computing. Advanced Computing Center for Research and Education GPU Cluster Computing Advanced Computing Center for Research and Education 1 What is GPU Computing? Gaming industry and high- defini3on graphics drove the development of fast graphics processing Use of

More information

Perspectives form US Department of Energy work on parallel programming models for performance portability

Perspectives form US Department of Energy work on parallel programming models for performance portability Perspectives form US Department of Energy work on parallel programming models for performance portability Jeremiah J. Wilke Sandia National Labs Livermore, CA IMPACT workshop at HiPEAC Sandia National

More information

Aggregation of Real-Time System Monitoring Data for Analyzing Large-Scale Parallel and Distributed Computing Environments

Aggregation of Real-Time System Monitoring Data for Analyzing Large-Scale Parallel and Distributed Computing Environments Aggregation of Real-Time System Monitoring Data for Analyzing Large-Scale Parallel and Distributed Computing Environments Swen Böhm 1,2, Christian Engelmann 2, and Stephen L. Scott 2 1 Department of Computer

More information

Enabling the Smart Grid through Big Data

Enabling the Smart Grid through Big Data Enabling the Smart Grid through Big Data Paul A. Navrá;l, Ph.D. Manager Scalable Visualiza;on Technologies Texas Advanced Compu;ng Center TACC Booth @ SC12 November 14, 2012 The Age of Big Data Records

More information

Distributed State Es.ma.on Algorithms for Electric Power Systems

Distributed State Es.ma.on Algorithms for Electric Power Systems Distributed State Es.ma.on Algorithms for Electric Power Systems Ariana Minot, Blue Waters Graduate Fellow Professor Na Li, Professor Yue M. Lu Harvard University, School of Engineering and Applied Sciences

More information

Large Scale Visualization on the Cray XT3 Using ParaView

Large Scale Visualization on the Cray XT3 Using ParaView Large Scale Visualization on the Cray XT3 Using ParaView Cray User s Group 2008 May 8, 2008 Kenneth Moreland David Rogers John Greenfield Sandia National Laboratories Alexander Neundorf Technical University

More information

Atos announces the Bull sequana X1000 the first exascale-class supercomputer. Jakub Venc

Atos announces the Bull sequana X1000 the first exascale-class supercomputer. Jakub Venc Atos announces the Bull sequana X1000 the first exascale-class supercomputer Jakub Venc The world is changing The world is changing Digital simulation will be the key contributor to overcome 21 st century

More information

Lecture Topic Projects

Lecture Topic Projects Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, data types 3 Data sources and preparation Project 1 out 4 Data reduction, similarity & distance, data augmentation

More information

The Stampede is Coming Welcome to Stampede Introductory Training. Dan Stanzione Texas Advanced Computing Center

The Stampede is Coming Welcome to Stampede Introductory Training. Dan Stanzione Texas Advanced Computing Center The Stampede is Coming Welcome to Stampede Introductory Training Dan Stanzione Texas Advanced Computing Center dan@tacc.utexas.edu Thanks for Coming! Stampede is an exciting new system of incredible power.

More information

Computational Challenges and Opportunities for Nuclear Astrophysics

Computational Challenges and Opportunities for Nuclear Astrophysics Computational Challenges and Opportunities for Nuclear Astrophysics Bronson Messer Acting Group Leader Scientific Computing Group National Center for Computational Sciences Theoretical Astrophysics Group

More information

In situ visualization is the coupling of visualization

In situ visualization is the coupling of visualization Visualization Viewpoints Editor: Theresa-Marie Rhyne The Tensions of In Situ Visualization Kenneth Moreland Sandia National Laboratories In situ is the coupling of software with a simulation or other data

More information

Op#miza#on of CUDA- based Monte Carlo Simula#on for Radia#on Therapy. GTC 2014 N. Henderson & K. Murakami

Op#miza#on of CUDA- based Monte Carlo Simula#on for Radia#on Therapy. GTC 2014 N. Henderson & K. Murakami Op#miza#on of CUDA- based Monte Carlo Simula#on for Radia#on Therapy GTC 2014 N. Henderson & K. Murakami The collabora#on Geant4 @ Special thanks to the CUDA Center of Excellence Program Makoto Asai, SLAC

More information

NERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber

NERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber NERSC Site Update National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory Richard Gerber NERSC Senior Science Advisor High Performance Computing Department Head Cori

More information

Outline. In Situ Data Triage and Visualiza8on

Outline. In Situ Data Triage and Visualiza8on In Situ Data Triage and Visualiza8on Kwan- Liu Ma University of California at Davis Outline In situ data triage and visualiza8on: Issues and strategies Case study: An earthquake simula8on Case study: A

More information

MPICH: A High-Performance Open-Source MPI Implementation. SC11 Birds of a Feather Session

MPICH: A High-Performance Open-Source MPI Implementation. SC11 Birds of a Feather Session MPICH: A High-Performance Open-Source MPI Implementation SC11 Birds of a Feather Session Schedule MPICH2 status and plans Presenta

More information

Procurement Guidance for Energy Storage Projects: Help with RFIs, RFQs and RFPs

Procurement Guidance for Energy Storage Projects: Help with RFIs, RFQs and RFPs Procurement Guidance for Energy Storage Projects: Help with RFIs, RFQs and RFPs April 20, 2016 Hosted by Todd Olinsky-Paul Project Director Clean Energy Group/ Clean Energy States Alliance Housekeeping

More information

Hybrid Architectures Why Should I Bother?

Hybrid Architectures Why Should I Bother? Hybrid Architectures Why Should I Bother? CSCS-FoMICS-USI Summer School on Computer Simulations in Science and Engineering Michael Bader July 8 19, 2013 Computer Simulations in Science and Engineering,

More information

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations October 29, 2014 Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations XSEDE for HPC Users What is XSEDE? XSEDE mo/va/on and goals XSEDE Resources XSEDE for HPC Users: Before

More information

Present and Future Leadership Computers at OLCF

Present and Future Leadership Computers at OLCF Present and Future Leadership Computers at OLCF Al Geist ORNL Corporate Fellow DOE Data/Viz PI Meeting January 13-15, 2015 Walnut Creek, CA ORNL is managed by UT-Battelle for the US Department of Energy

More information

Resources Current and Future Systems. Timothy H. Kaiser, Ph.D.

Resources Current and Future Systems. Timothy H. Kaiser, Ph.D. Resources Current and Future Systems Timothy H. Kaiser, Ph.D. tkaiser@mines.edu 1 Most likely talk to be out of date History of Top 500 Issues with building bigger machines Current and near future academic

More information

Path to Exascale Computing

Path to Exascale Computing Path to Exascale Computing Brad Benton IBM Linux Technology Center Date: April 15, 2010 Jul 2009 Legal Trademarks and disclaimers The following are trademarks of the International Business Machines Corporation

More information

Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on

Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on 1 Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on Ziyuan Cai and Ming Yu Electrical and Computer Eng., Florida State

More information

EXASCALE COMPUTING: WHERE OPTICS MEETS ELECTRONICS

EXASCALE COMPUTING: WHERE OPTICS MEETS ELECTRONICS EXASCALE COMPUTING: WHERE OPTICS MEETS ELECTRONICS Overview of OFC Workshop: Organizers: Norm Jouppi HP Labs, Moray McLaren HP Labs, Madeleine Glick Intel Labs March 7, 2011 1 AGENDA Introduction. Moray

More information

HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers

HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers George M. Slota 1, Siva Rajamanickam 2, Kamesh Madduri 3 1 Rensselaer Polytechnic Institute 2 Sandia National Laboratories a 3 The Pennsylvania

More information

Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng

Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng Economic Viability of Hardware Overprovisioning in Power- Constrained High Performance Compu>ng Energy Efficient Supercompu1ng, SC 16 November 14, 2016 This work was performed under the auspices of the U.S.

More information

Aras Innovator Active Directory Sync

Aras Innovator Active Directory Sync Photos placed in horizontal position with even amount of white space between photos and header Aras Innovator Active Directory Sync Sandia National Laboratories is a multi-program laboratory managed and

More information

Exascale: Parallelism gone wild!

Exascale: Parallelism gone wild! IPDPS TCPP meeting, April 2010 Exascale: Parallelism gone wild! Craig Stunkel, Outline Why are we talking about Exascale? Why will it be fundamentally different? How will we attack the challenges? In particular,

More information

Application Sensitivity to Link and Injection Bandwidth on a Cray XT4 System

Application Sensitivity to Link and Injection Bandwidth on a Cray XT4 System Application Sensitivity to Link and Injection Bandwidth on a Cray XT4 System Cray User Group Conference Helsinki, Finland May 8, 28 Kevin Pedretti, Brian Barrett, Scott Hemmert, and Courtenay Vaughan Sandia

More information

Asynchronous Termination Detection Module User s Guide

Asynchronous Termination Detection Module User s Guide Asynchronous Termination Detection Module User s Guide William McLon III Sandia National Laboratories wcmclen@sandia.gov 1 Introduction Interprocessor communications are performed through point-to-point

More information

Network Coding: Theory and Applica7ons

Network Coding: Theory and Applica7ons Network Coding: Theory and Applica7ons PhD Course Part IV Tuesday 9.15-12.15 18.6.213 Muriel Médard (MIT), Frank H. P. Fitzek (AAU), Daniel E. Lucani (AAU), Morten V. Pedersen (AAU) Plan Hello World! Intra

More information

NIA CFD Futures Conference Hampton, VA; August 2012

NIA CFD Futures Conference Hampton, VA; August 2012 Petascale Computing and Similarity Scaling in Turbulence P. K. Yeung Schools of AE, CSE, ME Georgia Tech pk.yeung@ae.gatech.edu NIA CFD Futures Conference Hampton, VA; August 2012 10 2 10 1 10 4 10 5 Supported

More information

The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System

The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Alan Humphrey, Qingyu Meng, Martin Berzins Scientific Computing and Imaging Institute & University of Utah I. Uintah Overview

More information

An Introduc+on to OpenACC Part II

An Introduc+on to OpenACC Part II An Introduc+on to OpenACC Part II Wei Feinstein HPC User Services@LSU LONI Parallel Programming Workshop 2015 Louisiana State University 4 th HPC Parallel Programming Workshop An Introduc+on to OpenACC-

More information

Initial Performance Evaluation of the Cray SeaStar Interconnect

Initial Performance Evaluation of the Cray SeaStar Interconnect Initial Performance Evaluation of the Cray SeaStar Interconnect Ron Brightwell Kevin Pedretti Keith Underwood Sandia National Laboratories Scalable Computing Systems Department 13 th IEEE Symposium on

More information

Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao

Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI Presented by Xiang Gao Bigtable: A Distributed Storage System for Structured Data By Fay Chang, et al. OSDI 2006 Presented by Xiang Gao 2014-11-05 Outline Motivation Data Model APIs Building Blocks Implementation Refinement

More information

Introduc)on to High Performance Compu)ng Advanced Research Computing

Introduc)on to High Performance Compu)ng Advanced Research Computing Introduc)on to High Performance Compu)ng Advanced Research Computing Outline What cons)tutes high performance compu)ng (HPC)? When to consider HPC resources What kind of problems are typically solved?

More information

DHS S&T supports National Level Exercise 2011 using SUMMIT

DHS S&T supports National Level Exercise 2011 using SUMMIT Fall 2011 STANDARD UNIFIED MODELING & MAPPING INTEGRATION TOOLKIT DHS S&T supports National Level Exercise 2011 using SUMMIT SUMMIT provides next-generation exercise capabilities to NLE 11 The Department

More information

Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R

Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R Ron Brightwell, Ron Oldfield Sandia Na,onal Laboratories Arthur B. Maccabe, David E. Bernholdt Oak Ridge Na,onal Laboratory

More information

Breakthrough Science via Extreme Scalability. Greg Clifford Segment Manager, Cray Inc.

Breakthrough Science via Extreme Scalability. Greg Clifford Segment Manager, Cray Inc. Breakthrough Science via Extreme Scalability Greg Clifford Segment Manager, Cray Inc. clifford@cray.com Cray s focus The requirement for highly scalable systems Cray XE6 technology The path to Exascale

More information

Measurements on (Complete) Graphs: The Power of Wedge and Diamond Sampling

Measurements on (Complete) Graphs: The Power of Wedge and Diamond Sampling Measurements on (Complete) Graphs: The Power of Wedge and Diamond Sampling Tamara G. Kolda plus Grey Ballard, Todd Plantenga, Ali Pinar, C. Seshadhri Workshop on Incomplete Network Data Sandia National

More information

Sandia National Laboratories Solutions for Aras Innovator

Sandia National Laboratories Solutions for Aras Innovator Photos placed in horizontal position with even amount of white space between photos and header Sandia National Laboratories Solutions for Aras Innovator Sandia National Laboratories is a multi-program

More information

HPC Saudi Jeffrey A. Nichols Associate Laboratory Director Computing and Computational Sciences. Presented to: March 14, 2017

HPC Saudi Jeffrey A. Nichols Associate Laboratory Director Computing and Computational Sciences. Presented to: March 14, 2017 Creating an Exascale Ecosystem for Science Presented to: HPC Saudi 2017 Jeffrey A. Nichols Associate Laboratory Director Computing and Computational Sciences March 14, 2017 ORNL is managed by UT-Battelle

More information

TiDA: High Level Programming Abstrac8ons for Data Locality Management

TiDA: High Level Programming Abstrac8ons for Data Locality Management h#p://parcorelab.ku.edu.tr TiDA: High Level Programming Abstrac8ons for Data Locality Management Didem Unat, Muhammed Nufail Farooqi, Burak Bastem Koç University, Turkey Tan Nguyen, Weiqun Zhang, George

More information

Tesla GPU Computing A Revolution in High Performance Computing

Tesla GPU Computing A Revolution in High Performance Computing Tesla GPU Computing A Revolution in High Performance Computing Gernot Ziegler, Developer Technology (Compute) (Material by Thomas Bradley) Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction

More information

Tesla GPU Computing A Revolution in High Performance Computing

Tesla GPU Computing A Revolution in High Performance Computing Tesla GPU Computing A Revolution in High Performance Computing Mark Harris, NVIDIA Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction to Tesla CUDA Architecture Programming & Memory

More information

Green Supercomputing

Green Supercomputing Green Supercomputing On the Energy Consumption of Modern E-Science Prof. Dr. Thomas Ludwig German Climate Computing Centre Hamburg, Germany ludwig@dkrz.de Outline DKRZ 2013 and Climate Science The Exascale

More information

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA

Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA Kazuhiko Komatsu, S. Momose, Y. Isobe, O. Watanabe, A. Musa, M. Yokokawa, T. Aoyama, M. Sato, H. Kobayashi Tohoku University 14 November,

More information

Distributed Energy Resource (DER) Cyber Security Working Group

Distributed Energy Resource (DER) Cyber Security Working Group Distributed Energy Resource (DER) Cyber Security Working Group Kickoff Meeting, 24 Aug 2017 Dr. Kemal Celik, U.S. Department of Energy Jay Johnson, Sandia National Laboratories Cedric Carter, Sandia National

More information

ECP VTK-m: Updating HPC Visualization Software for Exascale-Era Processors

ECP VTK-m: Updating HPC Visualization Software for Exascale-Era Processors ECP VTK-m: Updating HPC Visualization Software for Exascale-Era Processors Kenneth Moreland Sandia National Laboratories kmorel@sandia.gov David Pugmire Oak Ridge National Laboratory pugmire@ornl.gov Christopher

More information

Physis: An Implicitly Parallel Framework for Stencil Computa;ons

Physis: An Implicitly Parallel Framework for Stencil Computa;ons Physis: An Implicitly Parallel Framework for Stencil Computa;ons Naoya Maruyama RIKEN AICS (Formerly at Tokyo Tech) GTC12, May 2012 1 è Good performance with low programmer produc;vity Mul;- GPU Applica;on

More information

SDS: A Framework for Scientific Data Services

SDS: A Framework for Scientific Data Services SDS: A Framework for Scientific Data Services Bin Dong, Suren Byna*, John Wu Scientific Data Management Group Lawrence Berkeley National Laboratory Finding Newspaper Articles of Interest Finding news articles

More information

Microgrid System Design and Economic Analysis Tools

Microgrid System Design and Economic Analysis Tools Microgrid System Design and Economic Analysis Tools DOE Microgrid Workshop 30 August 2011 Jason Stamp, Ph.D. (Sandia National Laboratories) Michael Clark (Encorp) 1 Sandia National Laboratories is a multi-program

More information