Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN

Size: px
Start display at page:

Download "Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN"

Transcription

1 Simulation-time data analysis and I/O acceleration at extreme scale with GLEAN Venkatram Vishwanath, Mark Hereld and Michael E. Papka Argonne Na<onal Laboratory

2 Simulation-time Analysis Opportunities on the Argonne Leadership Computing Facility Intrepid BG/P Compute Resource Eureka Analysis Cluster 1 40K Nodes 160K Cores 557 TFlops 4.3 Tb/s I/O Nodes 6.4 Tb/s Myrinet Switch Complex 900+ ports 1 Tb/s 1.3 Tb/s Nodes 200 GPUs 110 TFlops 128 File Servers Tb/s We need to perform the right computa<on at the right place and <me taking into account the characteris<cs of the simula<on, resources and analysis Storage System 2

3 Our approach - GLEAN Supercompu-ng Facility Home Ins-tu-on Compute resource Analysis Cluster Worksta<on Simula<on Analysis / Staging View GLEAN is a flexible and extensible framework for simula<on- <me data analysis and I/O accelera<on taking into account applica<on, analy<cs and system characteris<cs to perform the right analysis at the right place and -me.

4 Tradi<onal Mode Mode with GLEAN Applica<on Applica<on I/O Library (hdf5, pnetcdf) Compute Resource I/O Library (hdf5, pnetcdf) GLEAN I/O Network I/O Network File server GLEAN File server Analysis/Staging Nodes Analysis/Staging/Transforma<on

5 Key features of GLEAN Exploit the underlying network topology to speed data movement Leverage data seman<cs of applica<ons Provide non- intrusive integra<on with exis<ng applica<ons Enable simula<on- <me data analysis, transforma<on and reduc<on by providing a flexible and extensible API Provide asynchronous data I/O via staging nodes Provide transparent integra<on with na<ve applica<on data formats.

6 Strong scaling performance to write 1GiB By leveraging the topology of BG/P, we can achieve both weak scaling as well as strong scaling for data movement Topology- aware data movement and staging for I/O accelera<on for IBM Blue Gene/P supercompu<ng applica<ons, V. Vishwanath et. al. (To appear SC 2011)

7 Performance for FLASH checkpoints For weak scaling at 32,768 cores, GLEAN sustains 31 GiBps and achieves an observed speedup of 10- fold over pnetcdf and hdf5 For strong scaling at 32,768 cores, GLEAN sustains 27 GiBps and achieves an observed speedup of 15- fold over pnetcdf and hdf GiBps to Storage at 32K cores

8 in situ analysis of FLASH using GLEAN ALCF Facility Intrepid Compute Resource FLASH G L E A N Analysis Analysis in situ analysis to compute fractal dimension for 5 variables of a FLASH simula<on on 2048 BG/P processors Fractal Dimension illustrates the degree of turbulence in a par<cular <me step as well as within a sub- region of the domain 14- Fold improvement Analysis using GLEAN required no code changes to FLASH

9 Simulation-time analysis of PHASTA on 160K Intrepid BG/P cores Isosurface of ver<cal velocity colored by velocity and cut plane through the synthe<c jet (both on 3.3 Billion element mesh). Image Courtesy: Ken Jansen Visualiza<on of a PHASTA simula<on running on 160K cores of Intrepid using ParaView on 100 Eureka nodes enabled by GLEAN GLEAN achieves 48 GiBps sustained throughput for data movement enabling simula<on- <me analysis

10 GLEAN- Enabling simulation-time data analysis and I/O acceleration A flexible and extensible data analysis framework taking into account applica<on, analy<cs and system characteris<cs to perform the right analysis at the right place and -me Provides I/O accelera<on by asynchronous data staging Scaled to en-re ALCF infrastructure (160K BG/P cores Eureka Nodes) Infrastructure Simula-on Analysis Co- analysis PHASTA Visualiza<on using Paraview Staging FLASH, S3D I/O Accelera<on In situ FLASH Fractal Dimension, Surface Area, Histograms In flight MADBench2 Histogram Leverages data models of applica<ons including adap<ve mesh refinement and unstructured meshes Generic design to enable deployment on any plaeorm

11 Acknowledgements DOE Office of Advanced Scien<fic Compu<ng Research ANL Director s Fellow Award Argonne Leadership Compu<ng (ALCF) Resources ANL - Mike Papka, Mark Hereld, Joseph Insley, Eric Olson, Aaron Knoll, Tom Uram, Rob Ross, Tom Peterka, Rob Latham, Phil Carns, Kevin Harms, Kamil Iskra, Vitali Morozov, Susan Coughlan, Ray Loy, and the ALCF team FLASH Center Chris Daley, George Jordan, Anshu Dubey, John Norris, Randy Hudson and Don Lamb Kitware - Pat Marion and Berk Geveci PHASTA Ken Jansen, Michel Rasquin and the PHASTA team venkatv@mcs.anl.gov

12 Summary Exploi<ng topology, data seman<cs and asynchronous data staging is cri<cal as we scale to future systems GLEAN is a flexible and extensible framework for data analysis and I/O accelera<on taking into account applica<on, analy<cs and system characteris<cs Demonstrated GLEAN successfully with DOE INCITE and ESP applica<ons for simula<on- <me data analysis and I/O accelera<on at scale on leadership compu<ng systems venkatv@mcs.anl.gov

Oh, Exascale! The effect of emerging architectures on scien1fic discovery. Kenneth Moreland, Sandia Na1onal Laboratories

Oh, Exascale! The effect of emerging architectures on scien1fic discovery. Kenneth Moreland, Sandia Na1onal Laboratories Photos placed in horizontal posi1on with even amount of white space between photos and header Oh, $#*@! Exascale! The effect of emerging architectures on scien1fic discovery Ultrascale Visualiza1on Workshop,

More information

Trends in HPC I/O and File Systems

Trends in HPC I/O and File Systems Trends in HPC I/O and File Systems Rob Ross, Phil Carns, Dave Goodell, Kevin Harms, Kamil Iskra, Dries Kimpe, Rob Latham, Tom Peterka, Rajeev Thakur, and Venkat Vishwanath MathemaBcs and Computer Science

More information

Accelerating I/O Forwarding in IBM Blue Gene/P Systems

Accelerating I/O Forwarding in IBM Blue Gene/P Systems Accelerating I/O Forwarding in IBM Blue Gene/P Systems Venkatram Vishwanath, Mark Hereld, Kamil Iskra, Dries Kimpe, Vitali Morozov, Michael E. Papka, Robert Ross, and Kazutomo Yoshii Argonne National Laboratory

More information

ALCF Argonne Leadership Computing Facility

ALCF Argonne Leadership Computing Facility ALCF Argonne Leadership Computing Facility ALCF Data Analytics and Visualization Resources William (Bill) Allcock Leadership Computing Facility Argonne Leadership Computing Facility Established 2006. Dedicated

More information

On the Role of Burst Buffers in Leadership- Class Storage Systems

On the Role of Burst Buffers in Leadership- Class Storage Systems On the Role of Burst Buffers in Leadership- Class Storage Systems Ning Liu, Jason Cope, Philip Carns, Christopher Carothers, Robert Ross, Gary Grider, Adam Crume, Carlos Maltzahn Contact: liun2@cs.rpi.edu,

More information

Examples of In Transit Visualization

Examples of In Transit Visualization Examples of In Transit Visualization Kenneth Moreland Sandia National Laboratories kmorel@sandia.gov Sebastien Jourdain Kitware, Inc. sebastien.jourdain@kitware.com Nathan Fabian Sandia National Laboratories

More information

Scientific Computing at Million-way Parallelism - Blue Gene/Q Early Science Program

Scientific Computing at Million-way Parallelism - Blue Gene/Q Early Science Program Scientific Computing at Million-way Parallelism - Blue Gene/Q Early Science Program Implementing Hybrid Parallelism in FLASH Christopher Daley 1 2 Vitali Morozov 1 Dongwook Lee 2 Anshu Dubey 1 2 Jonathon

More information

Outline. In Situ Data Triage and Visualiza8on

Outline. In Situ Data Triage and Visualiza8on In Situ Data Triage and Visualiza8on Kwan- Liu Ma University of California at Davis Outline In situ data triage and visualiza8on: Issues and strategies Case study: An earthquake simula8on Case study: A

More information

The Fusion Distributed File System

The Fusion Distributed File System Slide 1 / 44 The Fusion Distributed File System Dongfang Zhao February 2015 Slide 2 / 44 Outline Introduction FusionFS System Architecture Metadata Management Data Movement Implementation Details Unique

More information

Results from the Early Science High Speed Combus:on and Detona:on Project

Results from the Early Science High Speed Combus:on and Detona:on Project Results from the Early Science High Speed Combus:on and Detona:on Project Alexei Khokhlov, University of Chicago Joanna Aus:n, University of Illinois Charles Bacon, Argonne Na:onal Laboratory Andrew Knisely,

More information

Transport Simulations beyond Petascale. Jing Fu (ANL)

Transport Simulations beyond Petascale. Jing Fu (ANL) Transport Simulations beyond Petascale Jing Fu (ANL) A) Project Overview The project: Peta- and exascale algorithms and software development (petascalable codes: Nek5000, NekCEM, NekLBM) Science goals:

More information

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago

Big Data, Big Compute, Big Interac3on Machines for Future Biology. Rick Stevens. Argonne Na3onal Laboratory The University of Chicago Assembly Annota3on Modeling Design Big Data, Big Compute, Big Interac3on Machines for Future Biology Rick Stevens stevens@anl.gov Argonne Na3onal Laboratory The University of Chicago There are no solved

More information

A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard

A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard A Classifica*on of Scien*fic Visualiza*on Algorithms for Massive Threading Kenneth Moreland Berk Geveci Kwan- Liu Ma Robert Maynard Sandia Na*onal Laboratories Kitware, Inc. University of California at Davis

More information

Petascale Adaptive Computational Fluid Dyanamics

Petascale Adaptive Computational Fluid Dyanamics Petascale Adaptive Computational Fluid Dyanamics K.E. Jansen, M. Rasquin Aerospace Engineering Sciences University of Colorado at Boulder O. Sahni, A. Ovcharenko, M.S. Shephard, M. Zhou, J. Fu, N. Liu,

More information

Cloud Computing WSU Dr. Bahman Javadi. School of Computing, Engineering and Mathematics

Cloud Computing WSU Dr. Bahman Javadi. School of Computing, Engineering and Mathematics Cloud Computing Research @ WSU Dr. Bahman Javadi School of Computing, Engineering and Mathematics Research Team and Research Interests Team 4 Academic Staff 5 PhD Students 1 Master Student Resource Scheduling

More information

Massively Parallel I/O for Partitioned Solver Systems

Massively Parallel I/O for Partitioned Solver Systems Parallel Processing Letters c World Scientific Publishing Company Massively Parallel I/O for Partitioned Solver Systems NING LIU, JING FU, CHRISTOPHER D. CAROTHERS Department of Computer Science, Rensselaer

More information

The ParaView Coprocessing Library: A Scalable, General Purpose In Situ Visualization Library

The ParaView Coprocessing Library: A Scalable, General Purpose In Situ Visualization Library The ParaView Coprocessing Library: A Scalable, General Purpose In Situ Visualization Library Nathan Fabian Sandia National Laboratories Pat Marion Kitware Inc. Berk Geveci Kitware Inc. Ken Moreland Sandia

More information

Damaris. In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations. KerData Team. Inria Rennes,

Damaris. In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations. KerData Team. Inria Rennes, Damaris In-Situ Data Analysis and Visualization for Large-Scale HPC Simulations KerData Team Inria Rennes, http://damaris.gforge.inria.fr Outline 1. From I/O to in-situ visualization 2. Damaris approach

More information

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations Argonne National Laboratory Argonne National Laboratory is located on 1,500

More information

Software Engineering and the Parallel Climate Analysis Library (ParCAL)

Software Engineering and the Parallel Climate Analysis Library (ParCAL) Software Engineering and the Parallel Climate Analysis Library (ParCAL) Robert Jacob and Xiabing Xu Mathema4cs and Computer Science Division Argonne Na4onal Laboratory SEA SoAware Engineering Conference

More information

ECP Alpine: Algorithms and Infrastructure for In Situ Visualization and Analysis

ECP Alpine: Algorithms and Infrastructure for In Situ Visualization and Analysis ECP Alpine: Algorithms and Infrastructure for In Situ Visualization and Analysis Presented By: Matt Larsen LLNL-PRES-731545 This work was performed under the auspices of the U.S. Department of Energy by

More information

Covisualization of full data and in situ data extracts from unstructured grid CFD at 160k cores

Covisualization of full data and in situ data extracts from unstructured grid CFD at 160k cores Covisualization of full data and in situ data extracts from unstructured grid CFD at 160k cores Michel Rasquin 1 michel.rasquin@colorado.edu Andrew C. Bauer 2 andy.bauer@kitware.com Jing Fu 6 fuj@cs.rpi.edu

More information

Improving Data Movement Performance for Sparse Data Patterns on the Blue Gene/Q Supercomputer

Improving Data Movement Performance for Sparse Data Patterns on the Blue Gene/Q Supercomputer Improving Data Movement Performance for Sparse Data Patterns on the Blue Gene/Q Supercomputer Huy Bui, Jason Leigh Electronic Visualization Laboratory (EVL) University of Illinois at Chicago Chicago, IL,

More information

Software-Defined Visualization Updates

Software-Defined Visualization Updates Software-Defined Visualization Updates IXPUG SC18 BOF PRESENTED BY: Chris Johnson SCI @ Univ. Utah Paul Navrátil TACC @ Univ. Texas November 15, 2018 Valerio Pascucci SCI @ Univ. Utah Guido Reina VRC @

More information

Bridging the Gap Between High Quality and High Performance for HPC Visualization

Bridging the Gap Between High Quality and High Performance for HPC Visualization Bridging the Gap Between High Quality and High Performance for HPC Visualization Rob Sisneros National Center for Supercomputing Applications University of Illinois at Urbana Champaign Outline Why am I

More information

Introducing ParVis. Robert Jacob. 16th Annual CESM Workshop June 23rd, 2011

Introducing ParVis. Robert Jacob. 16th Annual CESM Workshop June 23rd, 2011 Introducing ParVis Robert Jacob 16th Annual CESM Workshop June 23rd, 2011 ParVis (Parallel Analysis Tools and New Visualization Techniques for Ultra-Large Climate Data Sets) Motivation: Ability to gain

More information

High Performance Data Analytics for Numerical Simulations. Bruno Raffin DataMove

High Performance Data Analytics for Numerical Simulations. Bruno Raffin DataMove High Performance Data Analytics for Numerical Simulations Bruno Raffin DataMove bruno.raffin@inria.fr April 2016 About this Talk HPC for analyzing the results of large scale parallel numerical simulations

More information

Foundations of Data-Parallel Particle Advection!

Foundations of Data-Parallel Particle Advection! Foundations of Data-Parallel Particle Advection! Early stages of Rayleigh-Taylor Instability flow! Tom Peterka, Rob Ross Argonne National Laboratory! Boonth Nouanesengsey, Teng-Yok Lee, Abon Chaudhuri,

More information

Efficient Data Restructuring and Aggregation for I/O Acceleration in PIDX

Efficient Data Restructuring and Aggregation for I/O Acceleration in PIDX Efficient Data Restructuring and Aggregation for I/O Acceleration in PIDX Sidharth Kumar, Venkatram Vishwanath, Philip Carns, Joshua A. Levine, Robert Latham, Giorgio Scorzelli, Hemanth Kolla, Ray Grout,

More information

Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing

Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing Multi-Resolution Streams of Big Scientific Data: Scaling Visualization Tools from Handheld Devices to In-Situ Processing Valerio Pascucci Director, Center for Extreme Data Management Analysis and Visualization

More information

Charm++ for Productivity and Performance

Charm++ for Productivity and Performance Charm++ for Productivity and Performance A Submission to the 2011 HPC Class II Challenge Laxmikant V. Kale Anshu Arya Abhinav Bhatele Abhishek Gupta Nikhil Jain Pritish Jetley Jonathan Lifflander Phil

More information

Abstract Storage Moving file format specific abstrac7ons into petabyte scale storage systems. Joe Buck, Noah Watkins, Carlos Maltzahn & ScoD Brandt

Abstract Storage Moving file format specific abstrac7ons into petabyte scale storage systems. Joe Buck, Noah Watkins, Carlos Maltzahn & ScoD Brandt Abstract Storage Moving file format specific abstrac7ons into petabyte scale storage systems Joe Buck, Noah Watkins, Carlos Maltzahn & ScoD Brandt Introduc7on Current HPC environment separates computa7on

More information

Parallel File Systems. IIT Course: Data- Intensive Compu4ng Guest Lecture Samuel Lang September 20, 2010

Parallel File Systems. IIT Course: Data- Intensive Compu4ng Guest Lecture Samuel Lang September 20, 2010 Parallel File Systems IIT Course: Data- Intensive Compu4ng Guest Lecture Samuel Lang September 20, 2010 What are Parallel File Systems? 2 Parallel File Systems Store applica4on data persistently usually

More information

Visualiza(on So-ware and Hardware for In- Silico Brain Research. Stefan Eilemann Visualiza0on Team Lead Blue Brain Project, EPFL

Visualiza(on So-ware and Hardware for In- Silico Brain Research. Stefan Eilemann Visualiza0on Team Lead Blue Brain Project, EPFL Visualiza(on So-ware and Hardware for In- Silico Brain Research Stefan Eilemann Visualiza0on Team Lead Blue Brain Project, EPFL Blue Brain Project / Human Brain Project BBP: Swiss na0onal research project

More information

Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis

Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis Elif Dede, Madhusudhan Govindaraju Lavanya Ramakrishnan, Dan Gunter, Shane Canon Department of Computer Science, Binghamton

More information

CSE6230 Fall Parallel I/O. Fang Zheng

CSE6230 Fall Parallel I/O. Fang Zheng CSE6230 Fall 2012 Parallel I/O Fang Zheng 1 Credits Some materials are taken from Rob Latham s Parallel I/O in Practice talk http://www.spscicomp.org/scicomp14/talks/l atham.pdf 2 Outline I/O Requirements

More information

UMAMI: A Recipe for Generating Meaningful Metrics through Holistic I/O Performance Analysis

UMAMI: A Recipe for Generating Meaningful Metrics through Holistic I/O Performance Analysis UMAMI: A Recipe for Generating Meaningful Metrics through Holistic I/O Performance Analysis Glenn K. Lockwood, Shane Snyder, Wucherl Yoo, Kevin Harms, Zachary Nault, Suren Byna, Philip Carns, Nicholas

More information

A Case Study for Scientific I/O: Improving the FLASH Astrophysics Code

A Case Study for Scientific I/O: Improving the FLASH Astrophysics Code A Case Study for Scientific I/O: Improving the FLASH Astrophysics Code Rob Latham 2, Chris Daley 1, Wei-keng Liao 3, Kui Gao 3, Rob Ross 2, Anshu Dubey 1, Alok Choudhary 3 1 DOE NNSA/ASCR Flash Center,

More information

ALCF Operations Best Practices and Highlights. Mark Fahey The 6 th AICS International Symposium Feb 22-23, 2016 RIKEN AICS, Kobe, Japan

ALCF Operations Best Practices and Highlights. Mark Fahey The 6 th AICS International Symposium Feb 22-23, 2016 RIKEN AICS, Kobe, Japan ALCF Operations Best Practices and Highlights Mark Fahey The 6 th AICS International Symposium Feb 22-23, 2016 RIKEN AICS, Kobe, Japan Overview ALCF Organization and Structure IBM BG/Q Mira Storage/Archive

More information

Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on

Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on 1 Combining Real Time Emula0on of Digital Communica0ons between Distributed Embedded Control Nodes with Real Time Power System Simula0on Ziyuan Cai and Ming Yu Electrical and Computer Eng., Florida State

More information

Assessing Medical Device. Cyber Risks in a Healthcare. Environment

Assessing Medical Device. Cyber Risks in a Healthcare. Environment Assessing Medical Device Medical Devices Security Cyber Risks in a Healthcare Phil Englert Director Technology Operations Environment Catholic Health Ini

More information

Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM

Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM Heterogeneous CPU+GPU Molecular Dynamics Engine in CHARMM 25th March, GTC 2014, San Jose CA AnE- Pekka Hynninen ane.pekka.hynninen@nrel.gov NREL is a na*onal laboratory of the U.S. Department of Energy,

More information

Center for Scalable Application Development Software: Application Engagement. Ewing Lusk (ANL) Gabriel Marin (Rice)

Center for Scalable Application Development Software: Application Engagement. Ewing Lusk (ANL) Gabriel Marin (Rice) Center for Scalable Application Development Software: Application Engagement Ewing Lusk (ANL) Gabriel Marin (Rice) CScADS Midterm Review April 22, 2009 1 Application Engagement Workshops (2 out of 4) for

More information

Storwize in IT Environments Market Overview

Storwize in IT Environments Market Overview Storwize in IT Environments Market Overview Topic Challenges in Tradi,onal IT Environment Types of informa,on Storage systems required Storage for private clouds where tradi,onal IT is involved Storwize

More information

Big Data in Scientific Domains

Big Data in Scientific Domains Big Data in Scientific Domains Arie Shoshani Lawrence Berkeley National Laboratory BES Workshop August 2012 Arie Shoshani 1 The Scalable Data-management, Analysis, and Visualization (SDAV) Institute 2012-2017

More information

Integrating Analysis and Computation with Trios Services

Integrating Analysis and Computation with Trios Services October 31, 2012 Integrating Analysis and Computation with Trios Services Approved for Public Release: SAND2012-9323P Ron A. Oldfield Scalable System Software Sandia National Laboratories Albuquerque,

More information

Short Talk: System abstractions to facilitate data movement in supercomputers with deep memory and interconnect hierarchy

Short Talk: System abstractions to facilitate data movement in supercomputers with deep memory and interconnect hierarchy Short Talk: System abstractions to facilitate data movement in supercomputers with deep memory and interconnect hierarchy François Tessier, Venkatram Vishwanath Argonne National Laboratory, USA July 19,

More information

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System

A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System A Distributed Data- Parallel Execu3on Framework in the Kepler Scien3fic Workflow System Ilkay Al(ntas and Daniel Crawl San Diego Supercomputer Center UC San Diego Jianwu Wang UMBC WorDS.sdsc.edu Computa3onal

More information

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay

Submitted to: Dr. Sunnie Chung. Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunnie Chung Presented by: Sonal Deshmukh Jay Upadhyay Submitted to: Dr. Sunny Chung Presented by: Sonal Deshmukh Jay Upadhyay What is Apache Survey shows huge popularity spike for Apache

More information

Fault Tolerant Runtime ANL. Wesley Bland Joint Lab for Petascale Compu9ng Workshop November 26, 2013

Fault Tolerant Runtime ANL. Wesley Bland Joint Lab for Petascale Compu9ng Workshop November 26, 2013 Fault Tolerant Runtime Research @ ANL Wesley Bland Joint Lab for Petascale Compu9ng Workshop November 26, 2013 Brief History of FT Checkpoint/Restart (C/R) has been around for quite a while Guards against

More information

Towards Large Scale Predictive Flow Simulations

Towards Large Scale Predictive Flow Simulations Towards Large Scale Predictive Flow Simulations using ITAPS Tools and Services Onkar Sahni a a Present address: Center for Predictive Engineering and Computational Sciences (PECOS), ICES, UT Austin ollaborators:

More information

System Modeling Environment

System Modeling Environment System Modeling Environment Requirements, Architecture and Implementa

More information

ECE7995 (7) Parallel I/O

ECE7995 (7) Parallel I/O ECE7995 (7) Parallel I/O 1 Parallel I/O From user s perspective: Multiple processes or threads of a parallel program accessing data concurrently from a common file From system perspective: - Files striped

More information

Collaborative data-driven science. Collaborative data-driven science

Collaborative data-driven science. Collaborative data-driven science Alex Szalay ! Started with the SDSS SkyServer! Built very quickly in 2001! Goal: instant access to rich content! Idea: bring the analysis to the data! Interac

More information

CHARACTERIZING HPC I/O: FROM APPLICATIONS TO SYSTEMS

CHARACTERIZING HPC I/O: FROM APPLICATIONS TO SYSTEMS erhtjhtyhy CHARACTERIZING HPC I/O: FROM APPLICATIONS TO SYSTEMS PHIL CARNS carns@mcs.anl.gov Mathematics and Computer Science Division Argonne National Laboratory April 20, 2017 TU Dresden MOTIVATION FOR

More information

In situ visualization is the coupling of visualization

In situ visualization is the coupling of visualization Visualization Viewpoints Editor: Theresa-Marie Rhyne The Tensions of In Situ Visualization Kenneth Moreland Sandia National Laboratories In situ is the coupling of software with a simulation or other data

More information

Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems

Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems Enabling Scalable Data Analysis for Large Computa9onal Structural Biology Datasets on Distributed Memory Systems Michela Taufer Global Compu9ng Laboratory Computer and Informa9on Sciences University of

More information

TiDA: High Level Programming Abstrac8ons for Data Locality Management

TiDA: High Level Programming Abstrac8ons for Data Locality Management h#p://parcorelab.ku.edu.tr TiDA: High Level Programming Abstrac8ons for Data Locality Management Didem Unat, Muhammed Nufail Farooqi, Burak Bastem Koç University, Turkey Tan Nguyen, Weiqun Zhang, George

More information

Scalable Parallel Building Blocks for Custom Data Analysis

Scalable Parallel Building Blocks for Custom Data Analysis Scalable Parallel Building Blocks for Custom Data Analysis Tom Peterka, Rob Ross (ANL) Attila Gyulassy, Valerio Pascucci (SCI) Wes Kendall (UTK) Han-Wei Shen, Teng-Yok Lee, Abon Chaudhuri (OSU) Morse-Smale

More information

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations October 29, 2014 Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations XSEDE for HPC Users What is XSEDE? XSEDE mo/va/on and goals XSEDE Resources XSEDE for HPC Users: Before

More information

Preliminary Evalua.on of ABAQUS, FLUENT, and in- house GPU code Performance on Blue Waters

Preliminary Evalua.on of ABAQUS, FLUENT, and in- house GPU code Performance on Blue Waters Blue Water Symposium, University of Illinois, Urbana, IL, May 12-15, 2014 Preliminary Evalua.on of ABAQUS, FLUENT, and in- house GPU code Performance on Blue Waters B. G. Thomas 1, L. C. Hibbeler 1, K.

More information

Preparing GPU-Accelerated Applications for the Summit Supercomputer

Preparing GPU-Accelerated Applications for the Summit Supercomputer Preparing GPU-Accelerated Applications for the Summit Supercomputer Fernanda Foertter HPC User Assistance Group Training Lead foertterfs@ornl.gov This research used resources of the Oak Ridge Leadership

More information

Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R

Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R Hobbes: Composi,on and Virtualiza,on as the Founda,ons of an Extreme- Scale OS/R Ron Brightwell, Ron Oldfield Sandia Na,onal Laboratories Arthur B. Maccabe, David E. Bernholdt Oak Ridge Na,onal Laboratory

More information

Jitter-Free Co-Processing on a Prototype Exascale Storage Stack

Jitter-Free Co-Processing on a Prototype Exascale Storage Stack Jitter-Free Co-Processing on a Prototype Exascale Storage Stack John Bent john.bent@emc.com John Patchett patchett@lanl.gov Sorin Faibish sfaibish@emc.com Percy Tzelnic tzelnic@emc.com Jim Ahrens ahrens@lanl.gov

More information

FORTISSIMO HPGA HIGH PERFORMANCE GEAR ANALYZER. Ing. Antonio Zippo, PhD PULSAR DYNAMICS - UniMORE

FORTISSIMO HPGA HIGH PERFORMANCE GEAR ANALYZER. Ing. Antonio Zippo, PhD PULSAR DYNAMICS - UniMORE FORTISSIMO HPGA HIGH PERFORMANCE GEAR ANALYZER Ing. Antonio Zippo, PhD PULSAR DYNAMICS - UniMORE PARTNERS Applica?on Expert: Vibra?ons and PowerTrain Lab UNIMORE HPC Expert: CINECA Galileo Supercomputer

More information

Why Spectrum Storage Suite and Flash Systems for storage makes perfect sense

Why Spectrum Storage Suite and Flash Systems for storage makes perfect sense Why Storage Suite and Flash Systems for storage makes perfect sense Nick Harris Steve Ward 1 Summary: Things we covered #1 Simplify today s traditional storage environment The challenge Storage is siloed

More information

HACC: Extreme Scaling and Performance Across Diverse Architectures

HACC: Extreme Scaling and Performance Across Diverse Architectures HACC: Extreme Scaling and Performance Across Diverse Architectures Salman Habib HEP and MCS Divisions Argonne National Laboratory Vitali Morozov Nicholas Frontiere Hal Finkel Adrian Pope Katrin Heitmann

More information

Western Michigan University

Western Michigan University CS-6030 Cloud compu;ng Google App engine Sepideh Mohammadi Summer II 2017 Western Michigan University content Categories of cloud compu;ng Google cloud plaborm Google App Engine Storage technologies Datastore

More information

LLVM and Clang on the Most Powerful Supercomputer in the World

LLVM and Clang on the Most Powerful Supercomputer in the World LLVM and Clang on the Most Powerful Supercomputer in the World Hal Finkel November 7, 2012 The 2012 LLVM Developers Meeting Hal Finkel (Argonne National Laboratory) LLVM and Clang on the BG/Q November

More information

Architectures, and Protocol Design Issues for Mobile Social Networks: A Survey

Architectures, and Protocol Design Issues for Mobile Social Networks: A Survey Applica@ons, Architectures, and Protocol Design Issues for Mobile Social Networks: A Survey N. Kayastha,D. Niyato, P. Wang and E. Hossain, Proceedings of the IEEEVol. 99, No. 12, Dec. 2011. Sabita Maharjan

More information

Distributed State Es.ma.on Algorithms for Electric Power Systems

Distributed State Es.ma.on Algorithms for Electric Power Systems Distributed State Es.ma.on Algorithms for Electric Power Systems Ariana Minot, Blue Waters Graduate Fellow Professor Na Li, Professor Yue M. Lu Harvard University, School of Engineering and Applied Sciences

More information

Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale

Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale Dax: A Massively Threaded Visualiza5on and Analysis Toolkit for Extreme Scale GPU Technology Conference March 26, 2014 Kenneth Moreland Sandia Na5onal Laboratories Robert Maynard Kitware, Inc. Sandia National

More information

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill Introduction to FREE National Resources for Scientific Computing Dana Brunson Oklahoma State University High Performance Computing Center Jeff Pummill University of Arkansas High Peformance Computing Center

More information

Direct Numerical Simulation of Turbulent Boundary Layers at High Reynolds Numbers.

Direct Numerical Simulation of Turbulent Boundary Layers at High Reynolds Numbers. Direct Numerical Simulation of Turbulent Boundary Layers at High Reynolds Numbers. G. Borrell, J.A. Sillero and J. Jiménez, Corresponding author: guillem@torroja.dmt.upm.es School of Aeronautics, Universidad

More information

aginfra: High Performance Compu8ng einfrastructure for Agriculture

aginfra: High Performance Compu8ng einfrastructure for Agriculture aginfra: High Performance Compu8ng einfrastructure for Agriculture Antun Balaz Ins,tute of Physics Belgrade What is aginfra? A 3- years project, co- funded by the European Union, developing data infrastructure

More information

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson

DataONE Cyberinfrastructure. Ma# Jones Dave Vieglais Bruce Wilson DataONE Cyberinfrastructure Ma# Jones Dave Vieglais Bruce Wilson Foremost a Federa9on Member Nodes (MNs) Heart of the federa9on Harness the power of local cura9on Coordina9ng Nodes (CNs) Services to link

More information

Scalable Bank of Filters for High Throughput Image Analysis

Scalable Bank of Filters for High Throughput Image Analysis Scalable Bank of Filters for High Throughput Image Analysis Shilpika1, Nicola Ferrier2, Venkatram Vishwanath1 shilpika@anl.gov, nferrier@anl.gov, venkat@anl.gov 1 Leadership Computing Facility, Argonne

More information

Concurrency-Optimized I/O For Visualizing HPC Simulations: An Approach Using Dedicated I/O Cores

Concurrency-Optimized I/O For Visualizing HPC Simulations: An Approach Using Dedicated I/O Cores Concurrency-Optimized I/O For Visualizing HPC Simulations: An Approach Using Dedicated I/O Cores Ma#hieu Dorier, Franck Cappello, Marc Snir, Bogdan Nicolae, Gabriel Antoniu 4th workshop of the Joint Laboratory

More information

Introduc)on to High Performance Compu)ng Advanced Research Computing

Introduc)on to High Performance Compu)ng Advanced Research Computing Introduc)on to High Performance Compu)ng Advanced Research Computing Outline What cons)tutes high performance compu)ng (HPC)? When to consider HPC resources What kind of problems are typically solved?

More information

Responsive Large Data Analysis and Visualization with the ParaView Ecosystem. Patrick O Leary, Kitware Inc

Responsive Large Data Analysis and Visualization with the ParaView Ecosystem. Patrick O Leary, Kitware Inc Responsive Large Data Analysis and Visualization with the ParaView Ecosystem Patrick O Leary, Kitware Inc Hybrid Computing Attribute Titan Summit - 2018 Compute Nodes 18,688 ~3,400 Processor (1) 16-core

More information

Johannes Günther, Senior Graphics Software Engineer. Intel Data Center Group, HPC Visualization

Johannes Günther, Senior Graphics Software Engineer. Intel Data Center Group, HPC Visualization Johannes Günther, Senior Graphics Software Engineer Intel Data Center Group, HPC Visualization Data set provided by Florida International University: Simulated fluid flow through a porous medium Large

More information

Toward portable I/O performance by leveraging system abstractions of deep memory and interconnect hierarchies

Toward portable I/O performance by leveraging system abstractions of deep memory and interconnect hierarchies Toward portable I/O performance by leveraging system abstractions of deep memory and interconnect hierarchies François Tessier, Venkatram Vishwanath, Paul Gressier Argonne National Laboratory, USA Wednesday

More information

Today (2010): Multicore Computing 80. Near future (~2018): Manycore Computing Number of Cores Processing

Today (2010): Multicore Computing 80. Near future (~2018): Manycore Computing Number of Cores Processing Number of Cores Manufacturing Process 300 250 200 150 100 50 0 2004 2006 2008 2010 2012 2014 2016 2018 100 Today (2010): Multicore Computing 80 1~12 cores commodity architectures 70 60 80 cores proprietary

More information

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky

BIG CPU, BIG DATA. Solving the World s Toughest Computational Problems with Parallel Computing. Alan Kaminsky BIG CPU, BIG DATA Solving the World s Toughest Computational Problems with Parallel Computing Alan Kaminsky Department of Computer Science B. Thomas Golisano College of Computing and Information Sciences

More information

Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects?

Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects? Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects? N. S. Islam, X. Lu, M. W. Rahman, and D. K. Panda Network- Based Compu2ng Laboratory Department of Computer

More information

CORPORATE PRESENTATION

CORPORATE PRESENTATION CORPORATE PRESENTATION Background on device detec/on (1/2) Identifying the capabilities of a device accessing web contents has been an extensively explored issue in the past years, in particular in the

More information

MLlib. Distributed Machine Learning on. Evan Sparks. UC Berkeley

MLlib. Distributed Machine Learning on. Evan Sparks.  UC Berkeley MLlib & ML base Distributed Machine Learning on Evan Sparks UC Berkeley January 31st, 2014 Collaborators: Ameet Talwalkar, Xiangrui Meng, Virginia Smith, Xinghao Pan, Shivaram Venkataraman, Matei Zaharia,

More information

SIGHT. Benjamin Hernandez, PhD Advanced Data and Workflow(s) Group

SIGHT. Benjamin Hernandez, PhD Advanced Data and Workflow(s) Group SIGHT Benjamin Hernandez, PhD Advanced Data and Workflow(s) Group hernandezarb@ornl.gov ORNL is managed by UT-Battelle for the US Department of Energy name 1 Presentation This research used resources of

More information

UAB Research Compu1ng Resources and Ac1vi1es

UAB Research Compu1ng Resources and Ac1vi1es UAB Research Compu1ng Resources and Ac1vi1es Research Compu1ng Day September 13, 2012 UAB IT Research Compu1ng UAB IT Research Compu1ng Team Bob Cloud Execu1ve Director Infrastructure Services UAB IT Mike

More information

A More Realistic Way of Stressing the End-to-end I/O System

A More Realistic Way of Stressing the End-to-end I/O System A More Realistic Way of Stressing the End-to-end I/O System Verónica G. Vergara Larrea Sarp Oral Dustin Leverman Hai Ah Nam Feiyi Wang James Simmons CUG 2015 April 29, 2015 Chicago, IL ORNL is managed

More information

SKA Computing and Software

SKA Computing and Software SKA Computing and Software Nick Rees 18 May 2016 Summary Introduc)on System overview Compu)ng Elements of the SKA Telescope Manager Low Frequency Aperture Array Central Signal Processor Science Data Processor

More information

libis: A Lightweight Library for Flexible In Transit Visualization

libis: A Lightweight Library for Flexible In Transit Visualization libis: A Lightweight Library for Flexible In Transit Visualization Will Usher 1,2,Silvio Rizzi 3,Ingo Wald 2,4,JeffersonAmstutz 2,Joseph Insley 3,Venkatram Vishwanath 3,NicolaFerrier 3,Michael E. Papka

More information

A Holistic Approach for Performance Measurement and Analysis for Petascale Applications

A Holistic Approach for Performance Measurement and Analysis for Petascale Applications A Holistic Approach for Performance Measurement and Analysis for Petascale Applications Heike Jagode 1,2, Jack Dongarra 1,2 Sadaf Alam 2, Jeffrey Vetter 2 Wyatt Spear 3, Allen D. Malony 3 1 The University

More information

On-demand Research Computing: the European Grid Infrastructure

On-demand Research Computing: the European Grid Infrastructure EGI- InSPIRE On-demand Research Computing: the European Grid Infrastructure Gergely Sipos EGI.eu, Amsterdam gergely.sipos@egi.eu The Milky Way: Stars, Gas, Dust and Magnetic Fields in 3D 19-06-2012 Heidelberg,

More information

Christopher Sewell Katrin Heitmann Li-ta Lo Salman Habib James Ahrens

Christopher Sewell Katrin Heitmann Li-ta Lo Salman Habib James Ahrens LA-UR- 14-25437 Approved for public release; distribution is unlimited. Title: Portable Parallel Halo and Center Finders for HACC Author(s): Christopher Sewell Katrin Heitmann Li-ta Lo Salman Habib James

More information

A One-Sided View of HPC: Global-View Models and Portable Runtime Systems

A One-Sided View of HPC: Global-View Models and Portable Runtime Systems A One-Sided View of HPC: Global-View Models and Portable Runtime Systems James Dinan James Wallace Gives Postdoctoral Fellow Argonne Na9onal Laboratory Why Global-View? Proc 0 Proc 1 Proc n Global address

More information

Hybrid OpenMP-MPI Turbulent boundary Layer code over 32k cores

Hybrid OpenMP-MPI Turbulent boundary Layer code over 32k cores Hybrid OpenMP-MPI Turbulent boundary Layer code over 32k cores T/NT INTERFACE y/ x/ z/ 99 99 Juan A. Sillero, Guillem Borrell, Javier Jiménez (Universidad Politécnica de Madrid) and Robert D. Moser (U.

More information

Taming Parallel I/O Complexity with Auto-Tuning

Taming Parallel I/O Complexity with Auto-Tuning Taming Parallel I/O Complexity with Auto-Tuning Babak Behzad 1, Huong Vu Thanh Luu 1, Joseph Huchette 2, Surendra Byna 3, Prabhat 3, Ruth Aydt 4, Quincey Koziol 4, Marc Snir 1,5 1 University of Illinois

More information

Do You Know What Your I/O Is Doing? (and how to fix it?) William Gropp

Do You Know What Your I/O Is Doing? (and how to fix it?) William Gropp Do You Know What Your I/O Is Doing? (and how to fix it?) William Gropp www.cs.illinois.edu/~wgropp Messages Current I/O performance is often appallingly poor Even relative to what current systems can achieve

More information

IME (Infinite Memory Engine) Extreme Application Acceleration & Highly Efficient I/O Provisioning

IME (Infinite Memory Engine) Extreme Application Acceleration & Highly Efficient I/O Provisioning IME (Infinite Memory Engine) Extreme Application Acceleration & Highly Efficient I/O Provisioning September 22 nd 2015 Tommaso Cecchi 2 What is IME? This breakthrough, software defined storage application

More information