Parallel Visualization, Data Formatting, Software Overview

Size: px
Start display at page:

Download "Parallel Visualization, Data Formatting, Software Overview"

Transcription

1 Parallel Visualization, Data Formatting, Software Overview Sean Ahern Remote Data Analysis and Visualization Center Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation

2 Who are you? Director of the University of Tennessee Center for Remote Data Analysis and Visualization Visualization lead at ORNL s Leadership Computing Facility ORNL co-i for the SciDAC Visualization and Analysis Center for Enabling Technologies (VACET)

3 How do we handle the hard cases? But what about large, distributed data? Or distributed rendering? Or distributed displays? Or all three?

4 Applications are pushing data set sizes toward Exascale Spatial resolution Astrophysics, Radiation transport, Combustion, Chemistry, Molecular dynamics, Fusion, Multivariate Climate, Radiation transport, V&V, Temporal resolution Molecular dynamics, Climate, Fusion, Ensembles Climate, V&V,

5 Data scale limits scientific understanding Spatial resolution increasing in many domains Too many zones to see on screen (with or without high-res Powerwall) Temporal complexity increasing Climate simulations will have 100,000s of time steps Manually finding temporal patterns is tedious and error-prone Multivariate overload Climate simulations have variables Issues of data models and domain-specific data E.g. Multigroup radiation fields

6 End-to-end schematic Result nput HPC Simulation System Data generation! Query

7 Hero Increasing Dataset Size Large Medium Small Dataset size growth

8 Data size categories Small: Data is small enough to easily move anywhere Analysis/vis is generally done on local workstations Medium: Hero Large Data won t fit on local workstations Generally have to process on fat SMP systems Large: Medium Rather painful to move Requires distributed parallelism Hero: Small Functionally impossible to move Only approachable on largest computational platforms

9 Large data remote visualization Largest datasets require use of institutional resources Reduces data movement issues Allows exploitation of multiple GPUs Provides visualization to remote users Exploited by VisIt, ParaView, EnSight Hero Large Mediu Smal

10 Hero data Purchasing separate analysis systems at the petascale can be prohibitively expensive: $5-20 million Working to move largest vis/analysis tools to HPC architecture VisIt on Cray XT4/5 See Cray User s Group 08 ParaView on Cray XT4/5 Hero Large Medium See Cray User s Group 09 Parallel R on Cray XT5 This year (hopefully) Small

11 A lot of success has been through data flow networks (pipelines)

12 A lot of success has been through data flow networks (pipelines) Different data formats: NetCDF, HDF, text, CSV, PDB, Different types of data operations: Slicing, resampling, mesh transforms, filtering, Different ways of plotting: Pseudocolor, isosurfaces, volume rendering These are independent of each other!

13 Make these independent modules Data reading Data operations Data plotting Read data Filter Filter Plot

14 Make these independent modules Data reading Data operations Read data Read data Data plotting Filter Filter Filter This is a data flow network Plot Plot

15 Many software systems use this method VTK AVS/Express SCIRun

16 Fine for small to medium sized data Hero Increasing Dataset Size Large Medium Small

17 How do we handle the hard cases? But what about large, distributed data? Or distributed rendering? Or distributed displays? Or all three?

18 A good solution: data parallelization for all components Parallel Simulation Code P1 P0 P2 P3 Identical data flow networks on each processor. Networks differentiated by portion of data they operate on. I/O P0 P1 P2 P3 Minimal synchronization Scattered/gather Parallel Analysis Code No distribution (i.e. scatter), because scatter is done through choice of what data to read. Gather: done when rendering Proc 0 Proc 1 Proc 2 Distribution of data is key Rendering/compositing

19 Decomposition of data is key issue Parallelization strategy: each processor works on piece of data Key: Must have good decomposition of data Two I/O situations: Reading from file, file format enforces decomposition Reading from file, file format allows for arbitrary decomposition Three processing situations: P1 P0 P2 P3 Pre-calculated decomposition Must do overloading of chunks when more chunks than processors. Dynamic decomposition at I/O Must understand how to read out arbitrary chunks of data On-the-fly redistribution Processing-dependent

20 This can take us a long way Weak scaling study (isocontouring, volume rendering): ~63M cells/core Machine Type Problem Size # cores Jaguar Cray XT5 2T 32k Franklin Cray XT4 1T, 2T 16k, 32k Dawn BG/P 4T 64k Juno Linux 1T 16k, 32k Ranger Sun Linux 1T 16k Purple AIX 0.5T 8k 2T cells, 32K procs

21 This can take us a long way Weak scaling study (isocontouring, volume rendering): ~63M cells/core Machine Type Problem Size # cores Jaguar Cray XT5 2T 32k Franklin Cray XT4 1T, 2T 16k, 32k Dawn BG/P 4T 64k Juno Linux 1T 16k, 32k Ranger Sun Linux 1T 16k Purple AIX 0.5T 8k Since this work, people have reached 4 trillion and 8 trillion cells. 2T cells, 32K procs

22 This is only part of the puzzle Parallel Simulation Code P1 P0 P2 P3 I/O P0 P1 P2 P3 Parallel Analysis Code Proc 0 Proc 1 Proc 2 Rendering/compositing

23 The job of turning into imagery is complex One or more generators Fed to one or more displays display display display display display display display display display

24 Rendering to a single tile (sort last) Depth sort Only good for opaque For each set of, generate image and depth When merging, compare depth and choose pixel that s closest to the viewer Alpha blending Parallel volume rendering / transparent Process all in back-to-front order (not always easy to do in parallel) Blend with transparency, creating a final image Compositing algorithms: Binary tree Direct send Binary swap / n-way swap 2-3 swap Radix-swap display

25 Rendering to a single tile (sort last) Depth sort Only good for opaque For each set of, generate image and depth When merging, compare depth and choose pixel that s closest to the viewer Alpha blending Parallel volume rendering / transparent Process all in back-to-front order (not always easy to do in parallel) Blend with transparency, creating a final image Compositing algorithms: Binary tree Direct send Binary swap / n-way swap 2-3 swap Radix-swap display

26 Rendering to multiple tiles (sort first) Break up based upon camera frustum Select only that corresponds to each tile Send subset of over network to be rendered display display display display

27 Rendering to multiple tiles (sort first) Break up based upon camera frustum Select only that corresponds to each tile Send subset of over network to be rendered display display display display Used for many Powerwalls

28 Both at the same time Complicated combinations of sort first and sort last Some libraries to make this easier: Chromium IceT Equalizer Paracomp SAGE display display display display

29 Performance is highly dependent upon I/O rates Reading data into memory takes the most time Much more than data processing Much more than rendering and compositing Parallel filesystems are essential for large data Data formats designed for parallel filesystems are essential for large data

30 Joule metric run Weak scaling study (isocontouring, volume rendering): 103M-321M cells on 4k-12k cores Superlinear scaling in processing But wallclock time is dominated by I/O Isocontour % time spent in I/O 103 M cells 321 M cells 2.48 s 87.9% s 94.9%

31 Trillion cell study confirms that I/O dominates Weak scaling study (isocontouring, volume rendering): ~63M cells/core Machine Type Problem Size # cores Jaguar Cray XT5 2T 32k Franklin Cray XT4 1T, 2T 16k, 32k Dawn BG/P 4T 64k Juno Linux 1T 16k, 32k Ranger Sun Linux 1T 16k Purple AIX 0.5T 8k -Approx I/O time: 2-5 minutes -Approx processing time: 10 seconds 2T cells, 32K procs

32 VisIt: an end user visualization and analysis tool for extremely large data 5 major use cases: data exploration, data analysis, visual debugging, comparison, and presentation Demonstrated scaling to >100k cores Strong emphasis on building a usable, robust tool for scientists. Plugin model allows for wide range of capabilities Open source project, with direct support from DOE (SciDAC, NEAMS, ASC, BER) and NSF (TeraGrid XD) visitusers.org wiki with 400+ pages of documentation and over 80,000 page views Over 100,000 downloads R&D100 Award in 2005

33 ParaView: open-source, multi-platform data analysis and visualization application Based on VTK: Deep pipeline modification Quantitative and qualitative use cases Developed to process large datasets using distributed memory computing resources: laptops to supercomputers Superb, best-in-class rendering infrastructure Customizable GUI Access grid integration for collaboration Commercial support from Kitware

34 EnSight: leading commercial parallel visualization tool Specialized for engineering analysis CFD, FEA, etc. Commercial support from CEI Distributed rendering infrastruture Direct Powerwall support Virtual reality Distributed parallel data processing and rendering

35 Rmpi: Parallel statistical analysis Lustre file system Node 0 Node 1 Node 2 Node 0 produces graphics files MPI (via modified Rmpi) Write data reader to bring data from parallel file system Develop data-parallel analysis with full capability of R on every node

36 Central file store Scalable parallel throughput Lustre, GPFS, Panassas,... Shared between source and destination Provides zero copy access to simulation results Allows smart deployment of remote visualization capabilities

37 Data formatting is important to data understanding The purpose of computing is insight, not numbers. Richard Hamming The most exciting phrase to hear in science, the one that heralds new discoveries, is not 'Eureka!' (I found it!) but 'That's funny...' Isaac Asimov

38 We want our data model to reflect how the scientist thinks of the data coordinate atoms collections multidimensional fields materials arrays mesh arrays bytes named scalars vectors refinement bits words arrays meshes interpolation bonds time Simple Complex Each step gets us closer to the ideal. But each step is more domain-specific.

39 Meshes All data lives on a mesh Discretizes space into points and cells 1D, 2D, 3D All of these over time (up to 4D) Can have lower-dimensional meshes in a higher-dimensional space (e.g. 2D surface in 3D space) Provides a place for data to be located Defines how data is interpolated

40 Mesh structure 1D Curves 2D/3D meshes Rectilinear Curvilinear Unstructured Curve Rectilinear Points AMR Curvilinear Unstructured Molecular Molecular Domain decomposed Ghost zones/nodes Points AMR

41 Variables Scalars, Vectors, Tensors Sits on points or cells of a mesh Points: linear interpolation Cells: piecewise constant Could have different dimensionality than the mesh (e.g. 3D vector data on a 2D mesh)

42 Materials Sometimes used in engineering codes Describes disjoint spatial regions at a sub-grid level Volume/area fractions

43 Parallel meshes Provides aggregation for meshes A mesh may be composed of hundreds of thousands of mesh blocks. Allows data parallelism

44 AMR meshes Mesh blocks can be associated with patches and levels. Allows for aggregation of meshes into AMR hierarchy levels.

45 Data model goals Rich Encompasses computational domain as much as possible Sharability Being able to easily move data between tools Being able to give data to other people Rapid I/O Minimize time dumping out data Minimize time reading data Parallel I/O Support access by multiple threads at once on parallel platforms. More Self description Changes to data shouldn t require a change in the format New data should be discoverable without human intervention

46 Data Storage OpenFOAM Fluent CGNS NetCDF XDMF Flash EnSight Gold ADIOS VTK S3D STL Silo ViSUS HDF5 Tecplot PDB Chombo

47 Underlying libraries NetCDF: Named array/scalar storage Generalized attributes Parallel access HDF5: Named array/scalar storage Hierarchical storage Parallel read and write ASCII Simple for humans to read and write Horrible for computers to read, especially in parallel

48 Higher-level libraries and formats ADIOS: Used for very fast I/O off HPC systems HDF5 and native storage Silo: Very rich (parallel) data model on top of HDF5 PDB (Protein Data Bank) Used for storage of molecular data Based on ASCII VTK: Native storage for the VTK pipeline visualization system Both ASCII and binary formats

49 Higher-level libraries and formats (cont) XDMF: Neat format that splits data and metadata ASCII XML file for metadata HDF5 file for heavyweight data.

50 Community standards NetCDF Climate and Forecast (CF) convention: Promotes sharing of data among climate researchers Standard naming Coordinate systems Dimensions, missing data, units, etc. Many communities build formats on top of others: S3D is over NetCDF GTC, Chombo, Enzo, PFLOTRAN, Flash, etc. are over HDF5

51 Summary Parallel visualization is an important capability for gaining insight from petascale (and soon to be exascale) data. It s as difficult of a job as doing the original HPC simulation itself. Rendering is not yet a solved problem. Data storage and data models are critical to high performance and scientific understanding.

Introduction to 3D Scientific Visualization. Training in Visualization for PRACE Summer of HPC 2013 Leon Kos, University of Ljubljana, Slovenia

Introduction to 3D Scientific Visualization. Training in Visualization for PRACE Summer of HPC 2013 Leon Kos, University of Ljubljana, Slovenia Introduction to 3D Scientific Visualization Training in Visualization for PRACE Summer of HPC 2013 Leon Kos, University of Ljubljana, Slovenia Motto Few correctly put words is worth hundreds of images.

More information

Experiments in Pure Parallelism

Experiments in Pure Parallelism Experiments in Pure Parallelism Dave Pugmire, ORNL Hank Childs, LBNL/ UC Davis Brad Whitlock, LLNL Mark Howison, LBNL Prabhat, LBNL Sean Ahern, ORNL Gunther Weber, LBNL Wes Bethel LBNL The story behind

More information

VisIt Overview. VACET: Chief SW Engineer ASC: V&V Shape Char. Lead. Hank Childs. Supercomputing 2006 Tampa, Florida November 13, 2006

VisIt Overview. VACET: Chief SW Engineer ASC: V&V Shape Char. Lead. Hank Childs. Supercomputing 2006 Tampa, Florida November 13, 2006 VisIt Overview Hank Childs VACET: Chief SW Engineer ASC: V&V Shape Char. Lead Supercomputing 2006 Tampa, Florida November 13, 2006 27B element Rayleigh-Taylor Instability (MIRANDA, BG/L) This is UCRL-PRES-226373

More information

VisIt. Hank Childs October 10, IEEE Visualization Tutorial

VisIt. Hank Childs October 10, IEEE Visualization Tutorial VisIt IEEE Visualization Tutorial Hank Childs October 10, 2004 The VisIt Team: Eric Brugger (project leader), Kathleen Bonnell, Hank Childs, Jeremy Meredith, Mark Miller, and Brad Gas bubble subjected

More information

PRACE/LinkSCEEM 2011 Winter School Scientific Visualization with VisIt

PRACE/LinkSCEEM 2011 Winter School Scientific Visualization with VisIt PRACE/LinkSCEEM 2011 Winter School Scientific Visualization with VisIt All the material is graciously offered by the VisIt Developpers, and is found on the Web Hank Childs very courteously allowed me to

More information

SCIENTIFIC VISUALIZATION ON GPU CLUSTERS PETER MESSMER, NVIDIA

SCIENTIFIC VISUALIZATION ON GPU CLUSTERS PETER MESSMER, NVIDIA SCIENTIFIC VISUALIZATION ON GPU CLUSTERS PETER MESSMER, NVIDIA Visualization Rendering Visualization Isosurfaces, Isovolumes Field Operators (Gradient, Curl,.. ) Coordinate transformations Feature extraction

More information

Center for Scalable Application Development Software: Application Engagement. Ewing Lusk (ANL) Gabriel Marin (Rice)

Center for Scalable Application Development Software: Application Engagement. Ewing Lusk (ANL) Gabriel Marin (Rice) Center for Scalable Application Development Software: Application Engagement Ewing Lusk (ANL) Gabriel Marin (Rice) CScADS Midterm Review April 22, 2009 1 Application Engagement Workshops (2 out of 4) for

More information

Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing

Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing Valerio Pascucci Director, Center for Extreme Data Management Analysis and Visualization

More information

Advanced Visualization Techniques

Advanced Visualization Techniques Advanced Visualization Techniques Kelly Gaither Texas Advanced Computing Center UT/Portugal Summer Institute Coimbra, Portugal July 17, 2008 Topics Covered Remote and Collaborative Visualization EnVision

More information

HPC Visualization with EnSight

HPC Visualization with EnSight HPC Visualization with EnSight Beijing 2010.10.27 Aric Meyer Marketing Director, Asia & Pacific CEI Computational Engineering International, Inc. Founded in 1994 out of Cray Research Headquarters in Raleigh,

More information

Introduction to scientific visualization with ParaView

Introduction to scientific visualization with ParaView Introduction to scientific visualization with ParaView Paul Melis SURFsara Visualization group paul.melis@surfsara.nl (some slides courtesy of Robert Belleman, UvA) Outline Introduction, pipeline and data

More information

Parallel Visualization At TACC. Greg Abram

Parallel Visualization At TACC. Greg Abram Parallel Visualization At TACC Greg Abram Visualization Problems * With thanks to Sean Ahern for the metaphor Huge problems: Data cannot be moved off system where it is computed Large Visualization problems:

More information

Visualization Of A Deterministic Radiation Transport Model Using Standard Visualization Tools

Visualization Of A Deterministic Radiation Transport Model Using Standard Visualization Tools Visualization Of A Deterministic Radiation Transport Model Using Standard Visualization Tools James A. Galbraith and L. Eric Greenwade, Idaho National Engineering and Environmental Laboratory ABSTRACT:

More information

Parallel Visualization At TACC. Greg Abram

Parallel Visualization At TACC. Greg Abram Parallel Visualization At TACC Greg Abram Visualization Problems * With thanks to Sean Ahern for the metaphor Huge problems: Data cannot be moved off system where it is computed Large Visualization problems:

More information

Overview and Introduction to Scientific Visualization. Texas Advanced Computing Center The University of Texas at Austin

Overview and Introduction to Scientific Visualization. Texas Advanced Computing Center The University of Texas at Austin Overview and Introduction to Scientific Visualization Texas Advanced Computing Center The University of Texas at Austin Scientific Visualization The purpose of computing is insight not numbers. -- R. W.

More information

VisIt: Visualization and Analysis using Python. VisIt developers from Brad Whitlock NNSA/ASC, Office of Science/SciDAC, Cyrus Harrison

VisIt: Visualization and Analysis using Python. VisIt developers from Brad Whitlock NNSA/ASC, Office of Science/SciDAC, Cyrus Harrison VisIt: Visualization and Analysis using Python Hank Childs VisIt developers from Brad Whitlock NNSA/ASC, Office of Science/SciDAC, Cyrus Harrison Office of Nuclear Energy/AFCI Presented at: SIAM CSE09

More information

Introduction to Visualization on Stampede

Introduction to Visualization on Stampede Introduction to Visualization on Stampede Aaron Birkland Cornell CAC With contributions from TACC visualization training materials Parallel Computing on Stampede June 11, 2013 From data to Insight Data

More information

Damaris. In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations. KerData Team. Inria Rennes,

Damaris. In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations. KerData Team. Inria Rennes, Damaris In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations KerData Team Inria Rennes, http://damaris.gforge.inria.fr Outline 1. From I/O to in-situ visualization 2. Damaris approach

More information

Parallel I/O Libraries and Techniques

Parallel I/O Libraries and Techniques Parallel I/O Libraries and Techniques Mark Howison User Services & Support I/O for scientifc data I/O is commonly used by scientific applications to: Store numerical output from simulations Load initial

More information

Insight VisREU Site. Agenda. Introduction to Scientific Visualization Using 6/16/2015. The purpose of visualization is insight, not pictures.

Insight VisREU Site. Agenda. Introduction to Scientific Visualization Using 6/16/2015. The purpose of visualization is insight, not pictures. 2015 VisREU Site Introduction to Scientific Visualization Using Vetria L. Byrd, Director Advanced Visualization VisREU Site Coordinator REU Site Sponsored by NSF ACI Award 1359223 Introduction to SciVis(High

More information

Scalable and Distributed Visualization using ParaView

Scalable and Distributed Visualization using ParaView Scalable and Distributed Visualization using ParaView Eric A. Wernert, Ph.D. Senior Manager & Scientist, Advanced Visualization Lab Pervasive Technology Institute, Indiana University Big Data for Science

More information

Parallel Algorithms: Adaptive Mesh Refinement (AMR) method and its implementation

Parallel Algorithms: Adaptive Mesh Refinement (AMR) method and its implementation Parallel Algorithms: Adaptive Mesh Refinement (AMR) method and its implementation Massimiliano Guarrasi m.guarrasi@cineca.it Super Computing Applications and Innovation Department AMR - Introduction Solving

More information

Bridging the Gap Between High Quality and High Performance for HPC Visualization

Bridging the Gap Between High Quality and High Performance for HPC Visualization Bridging the Gap Between High Quality and High Performance for HPC Visualization Rob Sisneros National Center for Supercomputing Applications University of Illinois at Urbana Champaign Outline Why am I

More information

Splotch: High Performance Visualization using MPI, OpenMP and CUDA

Splotch: High Performance Visualization using MPI, OpenMP and CUDA Splotch: High Performance Visualization using MPI, OpenMP and CUDA Klaus Dolag (Munich University Observatory) Martin Reinecke (MPA, Garching) Claudio Gheller (CSCS, Switzerland), Marzia Rivi (CINECA,

More information

Experiences with ENZO on the Intel Many Integrated Core Architecture

Experiences with ENZO on the Intel Many Integrated Core Architecture Experiences with ENZO on the Intel Many Integrated Core Architecture Dr. Robert Harkness National Institute for Computational Sciences April 10th, 2012 Overview ENZO applications at petascale ENZO and

More information

Architectural Challenges and Solutions for Petascale Visualization and Analysis. Hank Childs Lawrence Livermore National Laboratory June 27, 2007

Architectural Challenges and Solutions for Petascale Visualization and Analysis. Hank Childs Lawrence Livermore National Laboratory June 27, 2007 Architectural Challenges and Solutions for Petascale Visualization and Analysis Hank Childs Lawrence Livermore National Laboratory June 27, 2007 Work performed under the auspices of the U.S. Department

More information

Scientific Visualization Services at RZG

Scientific Visualization Services at RZG Scientific Visualization Services at RZG Klaus Reuter, Markus Rampp klaus.reuter@rzg.mpg.de Garching Computing Centre (RZG) 7th GOTiT High Level Course, Garching, 2010 Outline 1 Introduction 2 Details

More information

Visualization with ParaView

Visualization with ParaView Visualization with Before we begin Make sure you have 3.10.1 installed so you can follow along in the lab section http://paraview.org/paraview/resources/software.html http://www.paraview.org/ Background

More information

VTK-m: Uniting GPU Acceleration Successes. Robert Maynard Kitware Inc.

VTK-m: Uniting GPU Acceleration Successes. Robert Maynard Kitware Inc. VTK-m: Uniting GPU Acceleration Successes Robert Maynard Kitware Inc. VTK-m Project Supercomputer Hardware Advances Everyday More and more parallelism High-Level Parallelism The Free Lunch Is Over (Herb

More information

Introduction to scientific visualization with ParaView

Introduction to scientific visualization with ParaView Introduction to scientific visualization with ParaView Tijs de Kler SURFsara Visualization group Tijs.dekler@surfsara.nl (some slides courtesy of Robert Belleman, UvA) Outline Pipeline and data model (10

More information

Experiences with ENZO on the Intel R Many Integrated Core (Intel MIC) Architecture

Experiences with ENZO on the Intel R Many Integrated Core (Intel MIC) Architecture Experiences with ENZO on the Intel R Many Integrated Core (Intel MIC) Architecture 1 Introduction Robert Harkness National Institute for Computational Sciences Oak Ridge National Laboratory The National

More information

Visualization Support at RZG

Visualization Support at RZG Visualization Support at RZG Markus Rampp (RZG) mjr@rzg.mpg.de MPA Computer Seminar, Jan 14, 2009 Outline Topics Overview Existing services Some example projects Software overview & demo Remote visualization

More information

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences

NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences NetCDF-4: : Software Implementing an Enhanced Data Model for the Geosciences Russ Rew, Ed Hartnett, and John Caron UCAR Unidata Program, Boulder 2006-01-31 Acknowledgments This work was supported by the

More information

Introduction to serial HDF5

Introduction to serial HDF5 Introduction to serial HDF Matthieu Haefele Saclay, - March 201, Parallel filesystems and parallel IO libraries PATC@MdS Matthieu Haefele Training outline Day 1: AM: Serial HDF (M. Haefele) PM: Parallel

More information

Hank Childs, Mark Miller Lawrence Livermore National Laboratory 7000 East Avenue, Livermore, Ca, {childs3

Hank Childs, Mark Miller Lawrence Livermore National Laboratory 7000 East Avenue, Livermore, Ca, {childs3 Beyond Meat Grinders: An Analysis Framework Addressing the Scale and Complexity of Large Data Sets Hank Childs, Mark Miller Lawrence Livermore National Laboratory 7000 East Avenue, Livermore, Ca, 94550

More information

ISC 09 Poster Abstract : I/O Performance Analysis for the Petascale Simulation Code FLASH

ISC 09 Poster Abstract : I/O Performance Analysis for the Petascale Simulation Code FLASH ISC 09 Poster Abstract : I/O Performance Analysis for the Petascale Simulation Code FLASH Heike Jagode, Shirley Moore, Dan Terpstra, Jack Dongarra The University of Tennessee, USA [jagode shirley terpstra

More information

GPU Debugging Made Easy. David Lecomber CTO, Allinea Software

GPU Debugging Made Easy. David Lecomber CTO, Allinea Software GPU Debugging Made Easy David Lecomber CTO, Allinea Software david@allinea.com Allinea Software HPC development tools company Leading in HPC software tools market Wide customer base Blue-chip engineering,

More information

Large Scale Data Visualization. CSC 7443: Scientific Information Visualization

Large Scale Data Visualization. CSC 7443: Scientific Information Visualization Large Scale Data Visualization Large Datasets Large datasets: D >> 10 M D D: Hundreds of gigabytes to terabytes and even petabytes M D : 1 to 4 GB of RAM Examples: Single large data set Time-varying data

More information

Scalable Parallel Building Blocks for Custom Data Analysis

Scalable Parallel Building Blocks for Custom Data Analysis Scalable Parallel Building Blocks for Custom Data Analysis Tom Peterka, Rob Ross (ANL) Attila Gyulassy, Valerio Pascucci (SCI) Wes Kendall (UTK) Han-Wei Shen, Teng-Yok Lee, Abon Chaudhuri (OSU) Morse-Smale

More information

In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m

In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m In-Situ Data Analysis and Visualization: ParaView, Calalyst and VTK-m GTC, San Jose, CA March, 2015 Robert Maynard Marcus D. Hanwell 1 Agenda Introduction to ParaView Catalyst Run example Catalyst Script

More information

7/27/10. CScADS : Scientific Data and Analytics for Petascale Computing Workshop. Outline. About ParaView. Short Tutorial Current Work in Progress

7/27/10. CScADS : Scientific Data and Analytics for Petascale Computing Workshop. Outline. About ParaView. Short Tutorial Current Work in Progress About ParaView CScADS : Scientific Data and Analytics for Petascale Computing Workshop Outline About ParaView Short Tutorial Current Work in Progress 1 History (http://www.paraview.org/wiki/paraview_release_notes

More information

Peta-Scale Simulations with the HPC Software Framework walberla:

Peta-Scale Simulations with the HPC Software Framework walberla: Peta-Scale Simulations with the HPC Software Framework walberla: Massively Parallel AMR for the Lattice Boltzmann Method SIAM PP 2016, Paris April 15, 2016 Florian Schornbaum, Christian Godenschwager,

More information

Introduction to High Performance Parallel I/O

Introduction to High Performance Parallel I/O Introduction to High Performance Parallel I/O Richard Gerber Deputy Group Lead NERSC User Services August 30, 2013-1- Some slides from Katie Antypas I/O Needs Getting Bigger All the Time I/O needs growing

More information

The Cray Rainier System: Integrated Scalar/Vector Computing

The Cray Rainier System: Integrated Scalar/Vector Computing THE SUPERCOMPUTER COMPANY The Cray Rainier System: Integrated Scalar/Vector Computing Per Nyberg 11 th ECMWF Workshop on HPC in Meteorology Topics Current Product Overview Cray Technology Strengths Rainier

More information

Three-Dimensional Graphics. as a Tool for Studying Dynamics

Three-Dimensional Graphics. as a Tool for Studying Dynamics Three-Dimensional Graphics as a Tool for Studying Dynamics (1) (3) (2) 3D Graphics: Where to begin? (5) (4) Images: Andrew Grace (1), David Deepwell (2), Laura Chandler (3-4), Aaron Coutino (5) 3D Graphics:

More information

Integrated Analysis and Visualization for Data Intensive Science: Challenges and Opportunities. Attila Gyulassy speaking for Valerio Pascucci

Integrated Analysis and Visualization for Data Intensive Science: Challenges and Opportunities. Attila Gyulassy speaking for Valerio Pascucci Integrated Analysis and Visualization for Data Intensive Science: Challenges and Opportunities Attila Gyulassy speaking for Valerio Pascucci Massive Simulation and Sensing Devices Generate Great Challenges

More information

Co-existence: Can Big Data and Big Computation Co-exist on the Same Systems?

Co-existence: Can Big Data and Big Computation Co-exist on the Same Systems? Co-existence: Can Big Data and Big Computation Co-exist on the Same Systems? Dr. William Kramer National Center for Supercomputing Applications, University of Illinois Where these views come from Large

More information

Generic finite element capabilities for forest-of-octrees AMR

Generic finite element capabilities for forest-of-octrees AMR Generic finite element capabilities for forest-of-octrees AMR Carsten Burstedde joint work with Omar Ghattas, Tobin Isaac Institut für Numerische Simulation (INS) Rheinische Friedrich-Wilhelms-Universität

More information

Scalable Software Components for Ultrascale Visualization Applications

Scalable Software Components for Ultrascale Visualization Applications Scalable Software Components for Ultrascale Visualization Applications Wes Kendall, Tom Peterka, Jian Huang SC Ultrascale Visualization Workshop 2010 11-15-2010 Primary Collaborators Jian Huang Tom Peterka

More information

Performance database technology for SciDAC applications

Performance database technology for SciDAC applications Performance database technology for SciDAC applications D Gunter 1, K Huck 2, K Karavanic 3, J May 4, A Malony 2, K Mohror 3, S Moore 5, A Morris 2, S Shende 2, V Taylor 6, X Wu 6, and Y Zhang 7 1 Lawrence

More information

EnSight October 2013 Japan EnSight User Meeting Tokyo, Japan. Darin McKinnis VP Sales and Marketing

EnSight October 2013 Japan EnSight User Meeting Tokyo, Japan. Darin McKinnis VP Sales and Marketing EnSight 10.1 October 2013 Japan EnSight User Meeting Tokyo, Japan Darin McKinnis VP Sales and Marketing darin@ceisoftware.com Business Reports 2012 was our best year in company history > 95% renewal rate

More information

A Study of High Performance Computing and the Cray SV1 Supercomputer. Michael Sullivan TJHSST Class of 2004

A Study of High Performance Computing and the Cray SV1 Supercomputer. Michael Sullivan TJHSST Class of 2004 A Study of High Performance Computing and the Cray SV1 Supercomputer Michael Sullivan TJHSST Class of 2004 June 2004 0.1 Introduction A supercomputer is a device for turning compute-bound problems into

More information

Debugging CUDA Applications with Allinea DDT. Ian Lumb Sr. Systems Engineer, Allinea Software Inc.

Debugging CUDA Applications with Allinea DDT. Ian Lumb Sr. Systems Engineer, Allinea Software Inc. Debugging CUDA Applications with Allinea DDT Ian Lumb Sr. Systems Engineer, Allinea Software Inc. ilumb@allinea.com GTC 2013, San Jose, March 20, 2013 Embracing GPUs GPUs a rival to traditional processors

More information

Adaptable IO System (ADIOS)

Adaptable IO System (ADIOS) Adaptable IO System (ADIOS) http://www.cc.gatech.edu/~lofstead/adios Cray User Group 2008 May 8, 2008 Chen Jin, Scott Klasky, Stephen Hodson, James B. White III, Weikuan Yu (Oak Ridge National Laboratory)

More information

Multicore Computing and Scientific Discovery

Multicore Computing and Scientific Discovery scientific infrastructure Multicore Computing and Scientific Discovery James Larus Dennis Gannon Microsoft Research In the past half century, parallel computers, parallel computation, and scientific research

More information

Lecture Topic Projects

Lecture Topic Projects Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, data types 3 Data sources and preparation Project 1 out 4 Data reduction, similarity & distance, data augmentation

More information

Introduction to HPC Parallel I/O

Introduction to HPC Parallel I/O Introduction to HPC Parallel I/O Feiyi Wang (Ph.D.) and Sarp Oral (Ph.D.) Technology Integration Group Oak Ridge Leadership Computing ORNL is managed by UT-Battelle for the US Department of Energy Outline

More information

Collaborators from SDM Center, CPES, GPSC, GSEP, Sandia, ORNL

Collaborators from SDM Center, CPES, GPSC, GSEP, Sandia, ORNL Fourth Workshop on Ultrascale Visualization 10/28/2009 Scott A. Klasky klasky@ornl.gov Collaborators from SDM Center, CPES, GPSC, GSEP, Sandia, ORNL H. Abbasi, J. Cummings, C. Docan,, S. Ethier, A Kahn,

More information

Interactive HPC: Large Scale In-Situ Visualization Using NVIDIA Index in ALYA MultiPhysics

Interactive HPC: Large Scale In-Situ Visualization Using NVIDIA Index in ALYA MultiPhysics www.bsc.es Interactive HPC: Large Scale In-Situ Visualization Using NVIDIA Index in ALYA MultiPhysics Christopher Lux (NV), Vishal Mehta (BSC) and Marc Nienhaus (NV) May 8 th 2017 Barcelona Supercomputing

More information

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development Jeremy Fischer Indiana University 9 September 2014 Citation: Fischer, J.L. 2014. ACCI Recommendations on Long Term

More information

SciSpark 201. Searching for MCCs

SciSpark 201. Searching for MCCs SciSpark 201 Searching for MCCs Agenda for 201: Access your SciSpark & Notebook VM (personal sandbox) Quick recap. of SciSpark Project What is Spark? SciSpark Extensions scitensor: N-dimensional arrays

More information

Introduction to Paraview GUI. Raffaele Ponzini SuperComputing Applications and Innovation Department

Introduction to Paraview GUI. Raffaele Ponzini SuperComputing Applications and Innovation Department Introduction to Paraview GUI Raffaele Ponzini r.ponzini@cineca.it SuperComputing Applications and Innovation Department OUTLINE PART A PART B What is Paraview The GUI Sources Loading Data Text annotation

More information

Introduction to Grid Computing

Introduction to Grid Computing Milestone 2 Include the names of the papers You only have a page be selective about what you include Be specific; summarize the authors contributions, not just what the paper is about. You might be able

More information

A Scalable Adaptive Mesh Refinement Framework For Parallel Astrophysics Applications

A Scalable Adaptive Mesh Refinement Framework For Parallel Astrophysics Applications A Scalable Adaptive Mesh Refinement Framework For Parallel Astrophysics Applications James Bordner, Michael L. Norman San Diego Supercomputer Center University of California, San Diego 15th SIAM Conference

More information

Introduction to Scientific Visualization

Introduction to Scientific Visualization Introduction to Scientific Visualization Aaron Birkland Cornell Center for Advanced Computing Data Analysis on Ranger January 2012 A lab-intensive workshop Start off with basic concepts Data, transformations,

More information

Project Name. The Eclipse Integrated Computational Environment. Jay Jay Billings, ORNL Parent Project. None selected yet.

Project Name. The Eclipse Integrated Computational Environment. Jay Jay Billings, ORNL Parent Project. None selected yet. Project Name The Eclipse Integrated Computational Environment Jay Jay Billings, ORNL 20140219 Parent Project None selected yet. Background The science and engineering community relies heavily on modeling

More information

Working with Scientific Data in ArcGIS Platform

Working with Scientific Data in ArcGIS Platform Working with Scientific Data in ArcGIS Platform Sudhir Raj Shrestha sshrestha@esri.com Hong Xu hxu@esri.com Esri User Conference, San Diego, CA. July 11, 2017 What we will cover today Scientific Multidimensional

More information

CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman)

CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) Parallel Programming with Message Passing and Directives 2 MPI + OpenMP Some applications can

More information

Tuning I/O Performance for Data Intensive Computing. Nicholas J. Wright. lbl.gov

Tuning I/O Performance for Data Intensive Computing. Nicholas J. Wright. lbl.gov Tuning I/O Performance for Data Intensive Computing. Nicholas J. Wright njwright @ lbl.gov NERSC- National Energy Research Scientific Computing Center Mission: Accelerate the pace of scientific discovery

More information

Forest-of-octrees AMR: algorithms and interfaces

Forest-of-octrees AMR: algorithms and interfaces Forest-of-octrees AMR: algorithms and interfaces Carsten Burstedde joint work with Omar Ghattas, Tobin Isaac, Georg Stadler, Lucas C. Wilcox Institut für Numerische Simulation (INS) Rheinische Friedrich-Wilhelms-Universität

More information

Scientific Visualization Example exam questions with commented answers

Scientific Visualization Example exam questions with commented answers Scientific Visualization Example exam questions with commented answers The theoretical part of this course is evaluated by means of a multiple- choice exam. The questions cover the material mentioned during

More information

Stream Processing for Remote Collaborative Data Analysis

Stream Processing for Remote Collaborative Data Analysis Stream Processing for Remote Collaborative Data Analysis Scott Klasky 146, C. S. Chang 2, Jong Choi 1, Michael Churchill 2, Tahsin Kurc 51, Manish Parashar 3, Alex Sim 7, Matthew Wolf 14, John Wu 7 1 ORNL,

More information

Utilisation de VTK/ParaView pour la visualization de gros volumes de donnees

Utilisation de VTK/ParaView pour la visualization de gros volumes de donnees Outline Some strategies to deal with large data Utilisation de VTK/ParaView pour la visualization de gros volumes de donnees Jean M. Favre Responsable du groupe de visualisation How do VTK and ParaView

More information

Enabling In Situ Viz and Data Analysis with Provenance in libmesh

Enabling In Situ Viz and Data Analysis with Provenance in libmesh Enabling In Situ Viz and Data Analysis with Provenance in libmesh Vítor Silva Jose J. Camata Marta Mattoso Alvaro L. G. A. Coutinho (Federal university Of Rio de Janeiro/Brazil) Patrick Valduriez (INRIA/France)

More information

Visualization Challenges for Large Scale Astrophysical Simulation Data. Ultrascale Visualization Workshop

Visualization Challenges for Large Scale Astrophysical Simulation Data. Ultrascale Visualization Workshop Visualization Challenges for Large Scale Astrophysical Simulation Data Ralf Kähler (KIPAC/SLAC) Tom Abel (KIPAC/Stanford) Marcelo Alvarez (CITA) Oliver Hahn (Stanford) Hans-Christian Hege (ZIB) Ji-hoon

More information

PERFORMANCE OF PARALLEL IO ON LUSTRE AND GPFS

PERFORMANCE OF PARALLEL IO ON LUSTRE AND GPFS PERFORMANCE OF PARALLEL IO ON LUSTRE AND GPFS David Henty and Adrian Jackson (EPCC, The University of Edinburgh) Charles Moulinec and Vendel Szeremi (STFC, Daresbury Laboratory Outline Parallel IO problem

More information

Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center

Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center Big data for big river science: data intensive tools, techniques, and projects at the USGS/Columbia Environmental Research Center Ed Bulliner U.S. Geological Survey, Columbia Environmental Research Center

More information

Smart Trading with Cray Systems: Making Smarter Models + Better Decisions in Algorithmic Trading

Smart Trading with Cray Systems: Making Smarter Models + Better Decisions in Algorithmic Trading Smart Trading with Cray Systems: Making Smarter Models + Better Decisions in Algorithmic Trading Smart Trading with Cray Systems Agenda: Cray Overview Market Trends & Challenges Mitigating Risk with Deeper

More information

Advanced Visualization

Advanced Visualization 320581 Advanced Visualization Prof. Lars Linsen Fall 2011 0 Introduction 0.1 Syllabus and Organization Course Website Link in CampusNet: http://www.faculty.jacobsuniversity.de/llinsen/teaching/320581.htm

More information

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis

Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis Lecture Topic Projects 1 Intro, schedule, and logistics 2 Applications of visual analytics, basic tasks, data types 3 Introduction to D3, basic vis techniques for non-spatial data Project #1 out 4 Data

More information

Parallel I/O. and split communicators. David Henty, Fiona Ried, Gavin J. Pringle

Parallel I/O. and split communicators. David Henty, Fiona Ried, Gavin J. Pringle Parallel I/O and split communicators David Henty, Fiona Ried, Gavin J. Pringle Dr Gavin J. Pringle Applications Consultant gavin@epcc.ed.ac.uk +44 131 650 6709 4x4 array on 2x2 Process Grid Parallel IO

More information

What is visualization? Why is it important?

What is visualization? Why is it important? What is visualization? Why is it important? What does visualization do? What is the difference between scientific data and information data Visualization Pipeline Visualization Pipeline Overview Data acquisition

More information

Fast Forward I/O & Storage

Fast Forward I/O & Storage Fast Forward I/O & Storage Eric Barton Lead Architect 1 Department of Energy - Fast Forward Challenge FastForward RFP provided US Government funding for exascale research and development Sponsored by 7

More information

Extreme I/O Scaling with HDF5

Extreme I/O Scaling with HDF5 Extreme I/O Scaling with HDF5 Quincey Koziol Director of Core Software Development and HPC The HDF Group koziol@hdfgroup.org July 15, 2012 XSEDE 12 - Extreme Scaling Workshop 1 Outline Brief overview of

More information

Chris Sewell Li-Ta Lo James Ahrens Los Alamos National Laboratory

Chris Sewell Li-Ta Lo James Ahrens Los Alamos National Laboratory Portability and Performance for Visualization and Analysis Operators Using the Data-Parallel PISTON Framework Chris Sewell Li-Ta Lo James Ahrens Los Alamos National Laboratory Outline! Motivation Portability

More information

Enzo-P / Cello. Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology. San Diego Supercomputer Center. Department of Physics and Astronomy

Enzo-P / Cello. Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology. San Diego Supercomputer Center. Department of Physics and Astronomy Enzo-P / Cello Scalable Adaptive Mesh Refinement for Astrophysics and Cosmology James Bordner 1 Michael L. Norman 1 Brian O Shea 2 1 University of California, San Diego San Diego Supercomputer Center 2

More information

The Fusion Distributed File System

The Fusion Distributed File System Slide 1 / 44 The Fusion Distributed File System Dongfang Zhao February 2015 Slide 2 / 44 Outline Introduction FusionFS System Architecture Metadata Management Data Movement Implementation Details Unique

More information

Matlab Advanced Programming. Matt Wyant University of Washington

Matlab Advanced Programming. Matt Wyant University of Washington Matlab Advanced Programming Matt Wyant University of Washington Matlab as a programming Language Strengths (as compared to C/C++/Fortran) Fast to write -no type declarations needed Memory allocation/deallocation

More information

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast

cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast cdo Data Processing (and Production) Luis Kornblueh, Uwe Schulzweida, Deike Kleberg, Thomas Jahns, Irina Fast Max-Planck-Institut für Meteorologie, DKRZ September 24, 2014 MAX-PLANCK-GESELLSCHAFT Data

More information

Principle Of Parallel Algorithm Design (cont.) Alexandre David B2-206

Principle Of Parallel Algorithm Design (cont.) Alexandre David B2-206 Principle Of Parallel Algorithm Design (cont.) Alexandre David B2-206 1 Today Characteristics of Tasks and Interactions (3.3). Mapping Techniques for Load Balancing (3.4). Methods for Containing Interaction

More information

ACGV 2008, Lecture 1 Tuesday January 22, 2008

ACGV 2008, Lecture 1 Tuesday January 22, 2008 Advanced Computer Graphics and Visualization Spring 2008 Ch 1: Introduction Ch 4: The Visualization Pipeline Ch 5: Basic Data Representation Organization, Spring 2008 Stefan Seipel Filip Malmberg Mats

More information

Big Data in Scientific Domains

Big Data in Scientific Domains Big Data in Scientific Domains Arie Shoshani Lawrence Berkeley National Laboratory BES Workshop August 2012 Arie Shoshani 1 The Scalable Data-management, Analysis, and Visualization (SDAV) Institute 2012-2017

More information

Feature Tracking & Visualization in VisIt

Feature Tracking & Visualization in VisIt 1 ABSTRACT OF THE THESIS Feature Tracking & Visualization in VisIt by Naveen Atmakuri Thesis Director: Professor Deborah Silver The study and analysis of huge experimental or simulation datasets in the

More information

ECE7995 (7) Parallel I/O

ECE7995 (7) Parallel I/O ECE7995 (7) Parallel I/O 1 Parallel I/O From user s perspective: Multiple processes or threads of a parallel program accessing data concurrently from a common file From system perspective: - Files striped

More information

Introduction to Parallel Computing

Introduction to Parallel Computing Introduction to Parallel Computing Bootcamp for SahasraT 7th September 2018 Aditya Krishna Swamy adityaks@iisc.ac.in SERC, IISc Acknowledgments Akhila, SERC S. Ethier, PPPL P. Messina, ECP LLNL HPC tutorials

More information

Accelerate your SAS analytics to take the gold

Accelerate your SAS analytics to take the gold Accelerate your SAS analytics to take the gold A White Paper by Fuzzy Logix Whatever the nature of your business s analytics environment we are sure you are under increasing pressure to deliver more: more

More information

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011 ANSYS HPC Technology Leadership Barbara Hutchings barbara.hutchings@ansys.com 1 ANSYS, Inc. September 20, Why ANSYS Users Need HPC Insight you can t get any other way HPC enables high-fidelity Include

More information

Large Scale Remote Interactive Visualization

Large Scale Remote Interactive Visualization Large Scale Remote Interactive Visualization Kelly Gaither Director of Visualization Senior Research Scientist Texas Advanced Computing Center The University of Texas at Austin March 1, 2012 Visualization

More information

Introduction to Visualization: ParaView. Dan Mazur, McGill HPC Aug 20, 2013

Introduction to Visualization: ParaView. Dan Mazur, McGill HPC Aug 20, 2013 Introduction to Visualization: ParaView Dan Mazur, McGill HPC daniel.mazur@mcgill.ca Aug 20, 2013 1 Outline What is scientific visualization? ParaView and visualization pipelines data import 1D, 2D, 3D

More information

Data Management. Parallel Filesystems. Dr David Henty HPC Training and Support

Data Management. Parallel Filesystems. Dr David Henty HPC Training and Support Data Management Dr David Henty HPC Training and Support d.henty@epcc.ed.ac.uk +44 131 650 5960 Overview Lecture will cover Why is IO difficult Why is parallel IO even worse Lustre GPFS Performance on ARCHER

More information