Automatically Encapsulating HPC Best Practices Into Data Transfers

Size: px
Start display at page:

Download "Automatically Encapsulating HPC Best Practices Into Data Transfers"

Transcription

1 National Aeronautics and Space Administration Automatically Encapsulating HPC Best Practices Into Data Transfers Paul Z. Kolano NASA Advanced Supercomputing Division

2 Introduction Outline of Presentation Transport tuning and selection Global resource management File system optimization Conclusions 2

3 Introduction Data transfers are part of life in HPC environments - Finite storage capacity Transfer to cheaper tape storage - Back up existing data - Make room for new data Transfer from tape storage to reprocess old data Transfer between file systems to fix imbalances - Finite computational capacity Transfer from off-site systems with cheaper pre-processing Transfer to off-site systems for cheaper post-processing 3

4 User transfer concerns Introduction (cont.) - Ease to use, integrity, turnaround time Administrator and owner transfer concerns - Environment stability, cost effectiveness These items can conflict with each other - Easy to use tools or those ensuring integrity may not be fast - Easiest file structure may degrade tape performance - Fastest turnaround time may lead to resource exhaustion Takes HPC expert to reconcile conflicts - Understands and applies accepted best practices to achieve fast and efficient verified transfers without impact on stability 4

5 Goal Let scientists focus on science without wading through documentation on transfer best practices - Specify transfers in simplest, naive fashion Source and destination Provide tool to perform transfer as if scientist were HPC expert - Choose appropriate tools and optimize for best performance - Fully utilize available resources without starving other users - Manage files appropriately by file system type to ensure efficient access by later and/or behind the scenes processes 5

6 Shift: Self-Healing Independent File Transfer Satisfies user requirements - Simple cp/scp syntax for local/remote transfers - End-to-end integrity via checksums and sanity checks - High speed via transport selection/tuning and automatic parallelization Satisfies administrator and owner requirements - Helps prevent resource exhaustion that leads to environment instability Global throttling to allocate resources fairly Load balancing to avoid highly loaded hosts Automatic striping to avoid imbalanced disk utilization - Helps prevent wasted resources that impact cost effectiveness Allows easy utilization of idle resources Reduces wasted CPU cycles during jobs due to inefficient disk I/O Prevents issues leading to inefficient tape I/O 6

7 Shift: Self-Healing Independent File Transfer (cont.) Used in production at NASA's Advanced Supercomputing Facility for over 3.5 years - User transfers across local/lan/wan - Disaster recovery backups to/from remote organizations - Rebalancing entire multi-pb Lustre file systems - etc. Facilitated transfers of over 14 PB in past year - 8 PB local transfers - 4 PB LAN transfers - 2 PB WAN transfers "I used to hate archiving data - now I almost look for a reason to archive something" Shift user 7

8 Shift Interface > shiftc --create-tar /nobackup/user1/dataset1 archive1:dataset1.tar Shift id is 36 Detaching process (use --status option to monitor progress) > shiftc --status id state dirs files file size date run rate sums attrs sum size time left error 0/ / TB/39.5TB 10/02 2d14h32m5s 175MB/s 46222/ / TB/79TB 10:26 35 done 1/1 5131/ GB/303GB 10/05 1m35s 3.19GB/s 10262/ / GB/605GB 12:28 36 run 24/ / TB/1.78TB 10/06 2h48m37s 176MB/s 15463/ / TB/3.56TB 12:11 1h47m55s 8

9 Shift Components Command-line client - Performs file operations and reports results to manager Command-line manager - Invoked by clients to track operations and parcel out work Client Host Shift Client App C1 Remote Host App R1 Client File System OS App C2 Interconnect App R2 OS Remote File System App Cj Shift Manager(s) App Rk 9

10 Introduction Outline of Presentation Transport tuning and selection Global resource management File system optimization Conclusions 10

11 Transport Tuning and Selection Shift includes built-in local/remote transports and checksum capabilities - Fully functional out of the box - Perl-based equivalents of cp, sftp, fish, m(d5)sum Shift calls higher performance tools when available - bbcp, bbftp, gridftp, mcp, rsync, msum - Knows how to construct command-lines and parse output Tune transports for optimal performance Select transports based on transfer characteristics 11

12 Transport Tuning TCP-based transports - bbcp, bbftp, gridftp - Choose TCP window size Transports with internal parallelism - TCP streams (bbcp, bbftp, gridftp) or threads (mcp, msum) - Choose appropriate level of parallelism SSH-based transports - fish, rsync, sftp-perl - Choose fastest SSH cipher and MAC algorithm 12

13 TCP Window Size Tuning TCP window is amount of data sender or receiver willing to buffer while waiting for acknowledgment Optimal value is bandwidth delay product (BDP) - bandwidth * round-trip time Constrained by configured operating system limits - e.g. Linux net.core.[wr]mem_max - Single stream only achieves bandwidth if limit at least BDP 13

14 TCP Window Size Tuning (cont.) LAN Transfer Performance (MB/s) bbcp bbftp gridftp 0 Default WAN Transfer Performance (MB/s) bbcp (get) bbftp (get) gridftp (put) 0 Default TCP Window Size (MB) TCP Window Size (MB) Shift determines latency using icmp/echo/syn ping Shift guesses bandwidth based on network type and client hardware if not given via --bandwidth - Bandwidth difficult to compute a priori Chooses window size up to operating system limit 14

15 Transport Parallelism Tuning Number of streams in TCP-based transports - Overcome improperly configured TCP window maximums - Overcome improperly specified TCP window - Overcome interference by cross traffic Number of threads in mcp and msum - Take advantage of excess resource capacity on one host 15

16 Transport Parallelism Tuning (cont.) LAN Transfer Performance (MB/s) bbcp bbftp gridftp WAN Transfer Performance (MB/s) bbcp (get) 100 bbftp (get) gridftp (put) TCP Streams TCP Streams Shift chooses streams based on bandwidth available beyond operating system window limit A minimum value can be configured for LAN/WAN to help overcome cross traffic 16

17 Transport Parallelism Tuning (cont.) Local Copy Performance (MB/s) Haswell (12-core, 4x FDR) Ivy Bridge (10-core, 4x FDR) Sandy Bridge (8-core, 4x FDR) Westmere (6-core, 4x QDR) Mcp Threads Threads can be centrally configured on the manager High thread counts can induce high load on shared resources - Intentionally set lower than optimal at NAS due to high load on archive front-ends 17

18 SSH Cipher and MAC Algorithm Tuning SSH-based transports use SSH pipe to communicate - Performance directly correlated to SSH performance SSH does not expose TCP window settings - HPN SSH patches can be used for better window handling Main SSH tuning parameters available - Encryption algorithm - Message authentication code (MAC) algorithm 18

19 SSH Cipher and MAC Algorithm Tuning (cont.) LAN Transfer Performance (MB/s) (fish with umac-64 MAC) des-cbc chacha20 case128-cbc blowfish-cbc arcfour256 arcfour128 arcfour aes256-ctr aes192-ctr aes128-ctr aes256-cbc aes192-cbc aes128-cbc LAN Transfer Performance (MB/s) (fish with arcfour256 cipher) hmac-sha1 hmac-ripemd160 hmac-md5-96 hmac-md5 umac-128 umac-64 hmac-sha2-512 hmac-sha2-256 hmac-sha1-96 SSH Cipher SSH MAC Algorithm Shift allows preferred cipher/mac order to be centrally configured Availability checked on client host before transfer 19

20 Transport Selection Different transports have different strengths and weaknesses Supporting multiple transports provides opportunity to vary transport across each batch of files within transfer 20

21 Transport Selection (cont.) LAN Transfer Performance (MB/s) bbcp bbftp fish gridftp rsync sftp-perl SSH/Non-SSH Break-Even Point WAN Transfer Performance (MB/s) bbcp bbftp fish gridftp rsync sftp-perl SSH/Non-SSH Break-Even Point File Size (MB) File Size (MB) Using a single transport does not achieve maximum performance Shift's support of multiple transports allows it to use the optimum transport for each batch of files 21

22 Transport Selection (cont.) Local Transfer Performance (MB/s) bbcp bbftp cp-perl fish gridftp mcp rsync Traditionally remote transports also perform well for local transfers File Size (MB) 22

23 Transport Selection (cont.) 64 x 1GB Performance (MB/s) Local LAN WAN 1000 x 4MB Performance (MB/s) Local LAN WAN 0 bbcp bbftp cp/sftp-perl fish gridftp mcp rsync 0 bbcp bbftp cp/sftp-perl fish gridftp mcp rsync Transport Transport Shift has configurable small file thresholds - Preferred local/lan/wan transports above/below Transport chosen using avg. size of each batch 23

24 Introduction Outline of Presentation Transport tuning and selection Global resource management File system optimization Conclusions 24

25 Global Resource Management Transfer parallelization - The only way to maximize performance in HPC environments is to use multiple resources at once Global throttling - If everyone tries to run at the maximum rate possible, everyone loses Load balancing - Avoid resource exhaustion on individual hosts 25

26 Transfer Parallelization Single host may have excess capacity - Transport does not have its own parallelism - Single client cannot fully utilize host's resources Full HPC environment may have excess capacity - Single system bottlenecks - Aggregate resources of many hosts Two complementary forms of parallelization - Multiple clients running on a single host - One or more clients running on multiple hosts 26

27 Transfer Parallelization (cont.) Local Transfer Performance (MB/s) bbcp bbftp cp-perl fish gridftp mcp rsync Local Transfer Performance (MB/s) bbcp bbftp cp-perl fish gridftp mcp rsync Parallel Clients Parallel Hosts Shift derives file system equivalency from mount information to determine parallelization candidates Shift can automatically parallelize all transfers using centrally configured clients/hosts values Can trivially run across allocated compute nodes - e.g. --host-file=$pbs_nodefile 27

28 Global Throttling Support transfers at full speed when excess capacity exists Prevent resource exhaustion when systems are busy 0 Local Write Performance (MB/s) Configured Limit Time Unthrottled (5.34 GB/s) Throttled (3.62 GB/s) Shift supports throttling limits on CPU %, disk usage, I/O rate, and network rate - Limits can be global, per user, per host, per file system Transfers directed to sleep until average reaches (transfer's fair share of) user's fair share of limit 28

29 Load Balancing Single host resource limitations impact maximum transfer speed of all processes on that host 3000 Less loaded hosts have 2000 more resources available 1000 for new transfers 0 I/O Bandwidth Utilization (MB/s) way dd write iperf IPoIB Time 32 Hosts 16 Hosts 8 Hosts 4 Hosts 2 Hosts 1 Host 29

30 Load Balancing (cont.) LAN Transfer Performance (MB/s) Disjoint remote hosts Same remote host Client Hosts Shift load balances during host parallelization - Picks least loaded hosts when spawning clients Shift load balances during remote transfers - Remote host switched to least loaded - Overlap prevented between parallel clients 30

31 Introduction Outline of Presentation Transport tuning and selection Global resource management File system optimization Conclusions 31

32 File System Optimization Different file systems have different idiosyncrasies - Can impact performance, stability, and operating cost - Not always directly visible to users Tape optimization - File extents impact tape write speed - Sequential retrieval results in inefficient tape movement Tar creation/extraction - Tape I/O is inefficient with small files Lustre striping - Too few stripes creates later I/O inefficiencies 32

33 Tape Optimization Tape-backed file systems have limited parallelism due to large slow-moving physical components Inefficient access by one user can have major impact on all others - Large number of file extents degrades write speed - Sequential file retrieval inhibits minimization of internal tape movement 33

34 Tape Optimization (cont.) Performance (MB/s) Tape Write Buffered dd Read Direct dd Read Extents Produced bbcp bbftp cp-perl fish gridftp mcp rsync Tape Writes Degrade Source File Extents File Size (GB) Shift can be configured to preallocate files below a given sparsity - Minimize extents for most common regular files - Minimize disk usage for large sparse files 34

35 Tape Optimization (cont.) Tape Retrieval Time (s) Sequential Batched Files automatically retrieved from tape if offline when accessed May be accessed in different order than stored on tape GB Files Shift automatically initiates a retrieval of all source files on SGI DMF file systems Retrieval initiated again for files in each batch in case files pushed offline before being transferred 35

36 Tar Creation/Extraction Users prefer archiving data in original hierarchy - Each file accessed as needed Small files impact stability - Not migrated below certain size so consume limited disk - Tape I/O is inefficient Normally use tar files - Tar is very slow (100 MB/s) - Must retrieve to view contents - No assurance of integrity - Retrieve entire tar for one file - May not have quota to create Tape Performance (MB/s) Write Read Source File Size (MB) 36

37 Tar Creation/Extraction (cont.) Tar Creation Performance (MB/s) Local (mcp) LAN (fish) WAN (fish) GNU tar Tar Extraction Performance (MB/s) Local (mcp) LAN (fish) WAN (fish) GNU tar Parallel Hosts Parallel Hosts Shift has built-in tar creation/extraction - Uses high performance transports and parallelism - Automatic creation of index files to see contents - Integrity verification of every file added/extracted - Automatic split of tar files at configurable size - Direct creation/extraction over network without use of quota 37

38 Lustre Striping Striping must be set before a file is first written Higher stripe count - More I/O bandwidth available for large files - More balanced distribution of large files over OSTs Lower stripe count - Less contention during metadata operations - Less wasted space for small files Striping can only be changed by copying file 38

39 Lustre Striping (cont.) Noncontiguous Write Performance (MB/s) Nodes 32 Nodes 16 Nodes 8 Nodes 4 Nodes 2 Nodes Noncontiguous Read Performance (MB/s) Nodes 32 Nodes 16 Nodes 8 Nodes 4 Nodes 2 Nodes Lustre Stripes Lustre Stripes Shift automatically stripes files according to size - Increases write performance during parallel transfers - Increases read performance during later job access Reduces wasted CPU cycles due to I/O Preserves non-default striping when applicable 39

40 Conclusion Shift is an automated transfer tool that encapsulates HPC best practices to achieve better performance while preserving stability of HPC environments - Centralized configuration in manager component allows policies to be changed globally across all clients - New best practices can be incorporated transparently without user in the loop Shift is open source and available for download - Contact info - paul.kolano@nasa.gov Questions? 40

Enabling High Performance Bulk Data Transfers With SSH

Enabling High Performance Bulk Data Transfers With SSH Enabling High Performance Bulk Data Transfers With SSH Chris Rapier Benjamin Bennett TIP 08 Moving Data Still crazy after all these years Multiple solutions exist Protocols UDT, SABUL, etc Implementations

More information

SSH Bulk Transfer Performance. Allan Jude --

SSH Bulk Transfer Performance. Allan Jude -- SSH Bulk Transfer Performance Allan Jude -- allanjude@freebsd.org Introduction 15 Years as FreeBSD Server Admin FreeBSD src/doc committer (ZFS, bhyve, ucl, xo) FreeBSD Core Team (July 2016-2018) Co-Author

More information

File Transfer: Basics and Best Practices. Joon Kim. Ph.D. PICSciE. Research Computing 09/07/2018

File Transfer: Basics and Best Practices. Joon Kim. Ph.D. PICSciE. Research Computing 09/07/2018 File Transfer: Basics and Best Practices Joon Kim. Ph.D. PICSciE Research Computing Workshop @Chemistry 09/07/2018 Our goal today Learn about data transfer basics Pick the right tool for your job Know

More information

IBM V7000 Unified R1.4.2 Asynchronous Replication Performance Reference Guide

IBM V7000 Unified R1.4.2 Asynchronous Replication Performance Reference Guide V7 Unified Asynchronous Replication Performance Reference Guide IBM V7 Unified R1.4.2 Asynchronous Replication Performance Reference Guide Document Version 1. SONAS / V7 Unified Asynchronous Replication

More information

Securing the Frisbee Multicast Disk Loader

Securing the Frisbee Multicast Disk Loader Securing the Frisbee Multicast Disk Loader Robert Ricci, Jonathon Duerig University of Utah 1 What is Frisbee? 2 Frisbee is Emulab s tool to install whole disk images from a server to many clients using

More information

Data Staging: Moving large amounts of data around, and moving it close to compute resources

Data Staging: Moving large amounts of data around, and moving it close to compute resources Data Staging: Moving large amounts of data around, and moving it close to compute resources PRACE advanced training course on Data Staging and Data Movement Helsinki, September 10 th 2013 Claudio Cacciari

More information

Data Staging: Moving large amounts of data around, and moving it close to compute resources

Data Staging: Moving large amounts of data around, and moving it close to compute resources Data Staging: Moving large amounts of data around, and moving it close to compute resources Digital Preserva-on Advanced Prac--oner Course Glasgow, July 19 th 2013 c.cacciari@cineca.it Definition Starting

More information

Data Movement and Storage. 04/07/09 1

Data Movement and Storage. 04/07/09  1 Data Movement and Storage 04/07/09 www.cac.cornell.edu 1 Data Location, Storage, Sharing and Movement Four of the seven main challenges of Data Intensive Computing, according to SC06. (Other three: viewing,

More information

A GPFS Primer October 2005

A GPFS Primer October 2005 A Primer October 2005 Overview This paper describes (General Parallel File System) Version 2, Release 3 for AIX 5L and Linux. It provides an overview of key concepts which should be understood by those

More information

An Overview of Fujitsu s Lustre Based File System

An Overview of Fujitsu s Lustre Based File System An Overview of Fujitsu s Lustre Based File System Shinji Sumimoto Fujitsu Limited Apr.12 2011 For Maximizing CPU Utilization by Minimizing File IO Overhead Outline Target System Overview Goals of Fujitsu

More information

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed ASPERA HIGH-SPEED TRANSFER Moving the world s data at maximum speed ASPERA HIGH-SPEED FILE TRANSFER 80 GBIT/S OVER IP USING DPDK Performance, Code, and Architecture Charles Shiflett Developer of next-generation

More information

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed ASPERA HIGH-SPEED TRANSFER Moving the world s data at maximum speed ASPERA HIGH-SPEED FILE TRANSFER Aspera FASP Data Transfer at 80 Gbps Elimina8ng tradi8onal bo

More information

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments *

Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Scalable, Reliable Marshalling and Organization of Distributed Large Scale Data Onto Enterprise Storage Environments * Joesph JaJa joseph@ Mike Smorul toaster@ Fritz McCall fmccall@ Yang Wang wpwy@ Institute

More information

EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning

EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning Abstract This white paper describes how to configure the Celerra IP storage system

More information

Veeam Cloud Connect. Version 8.0. Administrator Guide

Veeam Cloud Connect. Version 8.0. Administrator Guide Veeam Cloud Connect Version 8.0 Administrator Guide June, 2015 2015 Veeam Software. All rights reserved. All trademarks are the property of their respective owners. No part of this publication may be reproduced,

More information

TGCC OVERVIEW. 13 février 2014 CEA 10 AVRIL 2012 PAGE 1

TGCC OVERVIEW. 13 février 2014 CEA 10 AVRIL 2012 PAGE 1 STORAGE @ TGCC OVERVIEW CEA 10 AVRIL 2012 PAGE 1 CONTEXT Data-Centric Architecture Centralized storage, accessible from every TGCC s compute machines Make cross-platform data sharing possible Mutualized

More information

Data Management. Parallel Filesystems. Dr David Henty HPC Training and Support

Data Management. Parallel Filesystems. Dr David Henty HPC Training and Support Data Management Dr David Henty HPC Training and Support d.henty@epcc.ed.ac.uk +44 131 650 5960 Overview Lecture will cover Why is IO difficult Why is parallel IO even worse Lustre GPFS Performance on ARCHER

More information

New User Seminar: Part 2 (best practices)

New User Seminar: Part 2 (best practices) New User Seminar: Part 2 (best practices) General Interest Seminar January 2015 Hugh Merz merz@sharcnet.ca Session Outline Submitting Jobs Minimizing queue waits Investigating jobs Checkpointing Efficiency

More information

VERITAS Volume Replicator. Successful Replication and Disaster Recovery

VERITAS Volume Replicator. Successful Replication and Disaster Recovery VERITAS Volume Replicator Successful Replication and Disaster Recovery V E R I T A S W H I T E P A P E R Table of Contents Introduction.................................................................................1

More information

Network Design Considerations for Grid Computing

Network Design Considerations for Grid Computing Network Design Considerations for Grid Computing Engineering Systems How Bandwidth, Latency, and Packet Size Impact Grid Job Performance by Erik Burrows, Engineering Systems Analyst, Principal, Broadcom

More information

Backup the System. Backup Overview. Backup Prerequisites

Backup the System. Backup Overview. Backup Prerequisites Backup Overview Backup Overview, on page 1 Backup Prerequisites, on page 1 Backup Task Flow, on page 2 Backup Interactions and Restrictions, on page 7 Cisco recommends performing regular backups. You can

More information

Introduction to High-Performance Computing (HPC)

Introduction to High-Performance Computing (HPC) Introduction to High-Performance Computing (HPC) Computer components CPU : Central Processing Unit cores : individual processing units within a CPU Storage : Disk drives HDD : Hard Disk Drive SSD : Solid

More information

Isilon Scale Out NAS. Morten Petersen, Senior Systems Engineer, Isilon Division

Isilon Scale Out NAS. Morten Petersen, Senior Systems Engineer, Isilon Division Isilon Scale Out NAS Morten Petersen, Senior Systems Engineer, Isilon Division 1 Agenda Architecture Overview Next Generation Hardware Performance Caching Performance SMB 3 - MultiChannel 2 OneFS Architecture

More information

Storage Area Network (SAN)

Storage Area Network (SAN) Storage Area Network (SAN) 1 Outline Shared Storage Architecture Direct Access Storage (DAS) SCSI RAID Network Attached Storage (NAS) Storage Area Network (SAN) Fiber Channel and Fiber Channel Switch 2

More information

Storage Area Networks: Performance and Security

Storage Area Networks: Performance and Security Storage Area Networks: Performance and Security Presented by Matthew Packard July 27, 2003 SAN Architecture - Definition & DAS Limitations Storage Area Network (SAN) universal storage connectivity free

More information

PERFORMANCE TUNING TECHNIQUES FOR VERITAS VOLUME REPLICATOR

PERFORMANCE TUNING TECHNIQUES FOR VERITAS VOLUME REPLICATOR PERFORMANCE TUNING TECHNIQUES FOR VERITAS VOLUME REPLICATOR Tim Coulter and Sheri Atwood November 13, 2003 VERITAS ARCHITECT NETWORK TABLE OF CONTENTS Introduction... 3 Overview of VERITAS Volume Replicator...

More information

DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE

DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE WHITEPAPER DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE A Detailed Review ABSTRACT This white paper introduces Dell EMC Data Domain Extended Retention software that increases the storage scalability

More information

Announcements. Reading. Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) CMSC 412 S14 (lect 5)

Announcements. Reading. Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) CMSC 412 S14 (lect 5) Announcements Reading Project #1 due in 1 week at 5:00 pm Scheduling Chapter 6 (6 th ed) or Chapter 5 (8 th ed) 1 Relationship between Kernel mod and User Mode User Process Kernel System Calls User Process

More information

Computer Science Section. Computational and Information Systems Laboratory National Center for Atmospheric Research

Computer Science Section. Computational and Information Systems Laboratory National Center for Atmospheric Research Computer Science Section Computational and Information Systems Laboratory National Center for Atmospheric Research My work in the context of TDD/CSS/ReSET Polynya new research computing environment Polynya

More information

Enabling a SuperFacility with Software Defined Networking

Enabling a SuperFacility with Software Defined Networking Enabling a SuperFacility with Software Defined Networking Shane Canon Tina Declerck, Brent Draney, Jason Lee, David Paul, David Skinner May 2017 CUG 2017-1 - SuperFacility - Defined Combining the capabilities

More information

A Crash Course In Wide Area Data Replication. Jacob Farmer, CTO, Cambridge Computer

A Crash Course In Wide Area Data Replication. Jacob Farmer, CTO, Cambridge Computer A Crash Course In Wide Area Data Replication Jacob Farmer, CTO, Cambridge Computer SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals

More information

Data Movement & Tiering with DMF 7

Data Movement & Tiering with DMF 7 Data Movement & Tiering with DMF 7 Kirill Malkin Director of Engineering April 2019 Why Move or Tier Data? We wish we could keep everything in DRAM, but It s volatile It s expensive Data in Memory 2 Why

More information

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments Torben Kling-Petersen, PhD Presenter s Name Principle Field Title andengineer Division HPC &Cloud LoB SunComputing Microsystems

More information

Rio-2 Hybrid Backup Server

Rio-2 Hybrid Backup Server A Revolution in Data Storage for Today s Enterprise March 2018 Notices This white paper provides information about the as of the date of issue of the white paper. Processes and general practices are subject

More information

The Design and Implementation of AQuA: An Adaptive Quality of Service Aware Object-Based Storage Device

The Design and Implementation of AQuA: An Adaptive Quality of Service Aware Object-Based Storage Device The Design and Implementation of AQuA: An Adaptive Quality of Service Aware Object-Based Storage Device Joel Wu and Scott Brandt Department of Computer Science University of California Santa Cruz MSST2006

More information

DASH COPY GUIDE. Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 31

DASH COPY GUIDE. Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 31 DASH COPY GUIDE Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 31 DASH Copy Guide TABLE OF CONTENTS OVERVIEW GETTING STARTED ADVANCED BEST PRACTICES FAQ TROUBLESHOOTING DASH COPY PERFORMANCE TUNING

More information

File Systems Management and Examples

File Systems Management and Examples File Systems Management and Examples Today! Efficiency, performance, recovery! Examples Next! Distributed systems Disk space management! Once decided to store a file as sequence of blocks What s the size

More information

G-NET: Effective GPU Sharing In NFV Systems

G-NET: Effective GPU Sharing In NFV Systems G-NET: Effective Sharing In NFV Systems Kai Zhang*, Bingsheng He^, Jiayu Hu #, Zeke Wang^, Bei Hua #, Jiayi Meng #, Lishan Yang # *Fudan University ^National University of Singapore #University of Science

More information

SMB Direct Update. Tom Talpey and Greg Kramer Microsoft Storage Developer Conference. Microsoft Corporation. All Rights Reserved.

SMB Direct Update. Tom Talpey and Greg Kramer Microsoft Storage Developer Conference. Microsoft Corporation. All Rights Reserved. SMB Direct Update Tom Talpey and Greg Kramer Microsoft 1 Outline Part I Ecosystem status and updates SMB 3.02 status SMB Direct applications RDMA protocols and networks Part II SMB Direct details Protocol

More information

VERITAS Volume Replicator Successful Replication and Disaster Recovery

VERITAS Volume Replicator Successful Replication and Disaster Recovery VERITAS Replicator Successful Replication and Disaster Recovery Introduction Companies today rely to an unprecedented extent on online, frequently accessed, constantly changing data to run their businesses.

More information

Using file systems at HC3

Using file systems at HC3 Using file systems at HC3 Roland Laifer STEINBUCH CENTRE FOR COMPUTING - SCC KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz Association www.kit.edu Basic Lustre

More information

Lustre* is designed to achieve the maximum performance and scalability for POSIX applications that need outstanding streamed I/O.

Lustre* is designed to achieve the maximum performance and scalability for POSIX applications that need outstanding streamed I/O. Reference Architecture Designing High-Performance Storage Tiers Designing High-Performance Storage Tiers Intel Enterprise Edition for Lustre* software and Intel Non-Volatile Memory Express (NVMe) Storage

More information

Efficiently Backing up Terabytes of Data with pgbackrest

Efficiently Backing up Terabytes of Data with pgbackrest Efficiently Backing up Terabytes of Data with pgbackrest David Steele Crunchy Data PGDay Russia 2017 July 6, 2017 Agenda 1 Why Backup? 2 Living Backups 3 Design 4 Features 5 Performance 6 Changes to Core

More information

EMC Data Domain for Archiving Are You Kidding?

EMC Data Domain for Archiving Are You Kidding? EMC Data Domain for Archiving Are You Kidding? Bill Roth / Bob Spurzem EMC EMC 1 Agenda EMC Introduction Data Domain Enterprise Vault Integration Data Domain NetBackup Integration Q & A EMC 2 EMC Introduction

More information

Comparing TCP performance of tunneled and non-tunneled traffic using OpenVPN. Berry Hoekstra Damir Musulin OS3 Supervisor: Jan Just Keijser Nikhef

Comparing TCP performance of tunneled and non-tunneled traffic using OpenVPN. Berry Hoekstra Damir Musulin OS3 Supervisor: Jan Just Keijser Nikhef Comparing TCP performance of tunneled and non-tunneled traffic using OpenVPN Berry Hoekstra Damir Musulin OS3 Supervisor: Jan Just Keijser Nikhef Outline Introduction Approach Research Results Conclusion

More information

High-Performance Lustre with Maximum Data Assurance

High-Performance Lustre with Maximum Data Assurance High-Performance Lustre with Maximum Data Assurance Silicon Graphics International Corp. 900 North McCarthy Blvd. Milpitas, CA 95035 Disclaimer and Copyright Notice The information presented here is meant

More information

Efficiently Backing up Terabytes of Data with pgbackrest. David Steele

Efficiently Backing up Terabytes of Data with pgbackrest. David Steele Efficiently Backing up Terabytes of Data with pgbackrest PGConf US 2016 David Steele April 20, 2016 Crunchy Data Solutions, Inc. Efficiently Backing up Terabytes of Data with pgbackrest 1 / 22 Agenda 1

More information

Chapter 4 Data Movement Process

Chapter 4 Data Movement Process Chapter 4 Data Movement Process 46 - Data Movement Process Understanding how CommVault software moves data within the production and protected environment is essential to understanding how to configure

More information

Andreas Dilger. Principal Lustre Engineer. High Performance Data Division

Andreas Dilger. Principal Lustre Engineer. High Performance Data Division Andreas Dilger Principal Lustre Engineer High Performance Data Division Focus on Performance and Ease of Use Beyond just looking at individual features... Incremental but continuous improvements Performance

More information

Coordinating Parallel HSM in Object-based Cluster Filesystems

Coordinating Parallel HSM in Object-based Cluster Filesystems Coordinating Parallel HSM in Object-based Cluster Filesystems Dingshan He, Xianbo Zhang, David Du University of Minnesota Gary Grider Los Alamos National Lab Agenda Motivations Parallel archiving/retrieving

More information

The Google File System

The Google File System October 13, 2010 Based on: S. Ghemawat, H. Gobioff, and S.-T. Leung: The Google file system, in Proceedings ACM SOSP 2003, Lake George, NY, USA, October 2003. 1 Assumptions Interface Architecture Single

More information

CrashPlan Pro Client Introduction: Install on any computer you want to backup

CrashPlan Pro Client Introduction: Install on any computer you want to backup CrashPlan Pro Client Introduction: Install on any computer you want to backup CrashPlan Pro Client Features: Bullet-Proof Backup Continuous or scheduled Offsite and onsite destinations Incremental versioning

More information

The Case for Informed Transport Protocols

The Case for Informed Transport Protocols The Case for Informed Transport Protocols Stefan Savage Neal Cardwell Tom Anderson University of Washington Our position Wide-area network performance: is important is limited by inefficient congestion

More information

Performance of relational database management

Performance of relational database management Building a 3-D DRAM Architecture for Optimum Cost/Performance By Gene Bowles and Duke Lambert As systems increase in performance and power, magnetic disk storage speeds have lagged behind. But using solidstate

More information

Implementing a Hierarchical Storage Management system in a large-scale Lustre and HPSS environment

Implementing a Hierarchical Storage Management system in a large-scale Lustre and HPSS environment Implementing a Hierarchical Storage Management system in a large-scale Lustre and HPSS environment Brett Bode, Michelle Butler, Sean Stevens, Jim Glasgow National Center for Supercomputing Applications/University

More information

USING ISCSI AND VERITAS BACKUP EXEC 9.0 FOR WINDOWS SERVERS BENEFITS AND TEST CONFIGURATION

USING ISCSI AND VERITAS BACKUP EXEC 9.0 FOR WINDOWS SERVERS BENEFITS AND TEST CONFIGURATION WHITE PAPER Maximize Storage Networks with iscsi USING ISCSI AND VERITAS BACKUP EXEC 9.0 FOR WINDOWS SERVERS BENEFITS AND TEST CONFIGURATION For use with Windows 2000 VERITAS Software Corporation 03/05/2003

More information

Boundary control : Access Controls: An access control mechanism processes users request for resources in three steps: Identification:

Boundary control : Access Controls: An access control mechanism processes users request for resources in three steps: Identification: Application control : Boundary control : Access Controls: These controls restrict use of computer system resources to authorized users, limit the actions authorized users can taker with these resources,

More information

TCP and BBR. Geoff Huston APNIC

TCP and BBR. Geoff Huston APNIC TCP and BBR Geoff Huston APNIC Computer Networking is all about moving data The way in which data movement is controlled is a key characteristic of the network architecture The Internet protocol passed

More information

Software-defined Storage: Fast, Safe and Efficient

Software-defined Storage: Fast, Safe and Efficient Software-defined Storage: Fast, Safe and Efficient TRY NOW Thanks to Blockchain and Intel Intelligent Storage Acceleration Library Every piece of data is required to be stored somewhere. We all know about

More information

Symantec Backup Exec Blueprints

Symantec Backup Exec Blueprints Symantec Backup Exec Blueprints Blueprint for Large Installations Backup Exec Technical Services Backup & Recovery Technical Education Services Symantec Backup Exec Blueprints 1 Symantec Backup Exec Blueprints

More information

Lustre on ZFS. At The University of Wisconsin Space Science and Engineering Center. Scott Nolin September 17, 2013

Lustre on ZFS. At The University of Wisconsin Space Science and Engineering Center. Scott Nolin September 17, 2013 Lustre on ZFS At The University of Wisconsin Space Science and Engineering Center Scott Nolin September 17, 2013 Why use ZFS for Lustre? The University of Wisconsin Space Science and Engineering Center

More information

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT. Chapter 4:- Introduction to Grid and its Evolution Prepared By:- Assistant Professor SVBIT. Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies

More information

White paper ETERNUS CS800 Data Deduplication Background

White paper ETERNUS CS800 Data Deduplication Background White paper ETERNUS CS800 - Data Deduplication Background This paper describes the process of Data Deduplication inside of ETERNUS CS800 in detail. The target group consists of presales, administrators,

More information

How to Create a TINA VPN Tunnel between F- Series Firewalls

How to Create a TINA VPN Tunnel between F- Series Firewalls How to Create a TINA VPN Tunnel between F- Series Firewalls As the TINA protocol offers significant advantages over IPsec, it is the main protocol that is used for VPN connections between F-Series Firewalls.

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Silberschatz, Galvin and Gagne 2009 Chapter 1: Introduction What Operating Systems Do Computer-System Organization Computer-System Architecture Operating-System Structure Operating-System

More information

Chapter 1: Introduction. Operating System Concepts 8th Edition,

Chapter 1: Introduction. Operating System Concepts 8th Edition, Chapter 1: Introduction, Administrivia Project 0 due Monday. Reading: 2.1 2.7. Next Time: Operating system structure. 1.2 Outline Process management. Storage management and characteristics. Miscellaneous

More information

Cloud FastPath: Highly Secure Data Transfer

Cloud FastPath: Highly Secure Data Transfer Cloud FastPath: Highly Secure Data Transfer Tervela helps companies move large volumes of sensitive data safely and securely over network distances great and small. Tervela has been creating high performance

More information

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 19: Networks and Distributed Systems

CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 19: Networks and Distributed Systems S 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring 2004 Lecture 19: Networks and Distributed Systems 19.0 Main Points Motivation for distributed vs. centralized systems

More information

LUG 2012 From Lustre 2.1 to Lustre HSM IFERC (Rokkasho, Japan)

LUG 2012 From Lustre 2.1 to Lustre HSM IFERC (Rokkasho, Japan) LUG 2012 From Lustre 2.1 to Lustre HSM Lustre @ IFERC (Rokkasho, Japan) Diego.Moreno@bull.net From Lustre-2.1 to Lustre-HSM - Outline About Bull HELIOS @ IFERC (Rokkasho, Japan) Lustre-HSM - Basis of Lustre-HSM

More information

The Future of High-Performance Networking (The 5?, 10?, 15? Year Outlook)

The Future of High-Performance Networking (The 5?, 10?, 15? Year Outlook) Workshop on New Visions for Large-Scale Networks: Research & Applications Vienna, VA, USA, March 12-14, 2001 The Future of High-Performance Networking (The 5?, 10?, 15? Year Outlook) Wu-chun Feng feng@lanl.gov

More information

EMC CLARiiON Backup Storage Solutions

EMC CLARiiON Backup Storage Solutions Engineering White Paper Backup-to-Disk Guide with Computer Associates BrightStor ARCserve Backup Abstract This white paper describes how to configure EMC CLARiiON CX series storage systems with Computer

More information

OASIS: Self-tuning Storage for Applications

OASIS: Self-tuning Storage for Applications OASIS: Self-tuning Storage for Applications Kostas Magoutis, Prasenjit Sarkar, Gauri Shah 14 th NASA Goddard- 23 rd IEEE Mass Storage Systems Technologies, College Park, MD, May 17, 2006 Outline Motivation

More information

IBM Tivoli Storage Manager V5.1 Implementation Exam # Sample Test

IBM Tivoli Storage Manager V5.1 Implementation Exam # Sample Test Note: This sample test is designed to give the candidate an idea of the content and format of the questions that will be on the certification exam. Performance on the sample test is NOT an indicator of

More information

The Google File System

The Google File System The Google File System Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung SOSP 2003 presented by Kun Suo Outline GFS Background, Concepts and Key words Example of GFS Operations Some optimizations in

More information

Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage

Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage Evaluation of Lustre File System software enhancements for improved Metadata performance Wojciech Turek, Paul Calleja,John

More information

Application Layer Network Window Management in the SSH Protocol Chris Rapier Michael A. Stevens

Application Layer Network Window Management in the SSH Protocol Chris Rapier Michael A. Stevens Application Layer Network Window Management in the SSH Protocol Chris Rapier rapier@psc.edu Michael A. Stevens mstevens@andrew.cmu.edu Abstract Multiplexed network applications, such as SSH, require the

More information

Chapter Topics Part 1. Network Definitions. Behind the Scenes: Networking and Security

Chapter Topics Part 1. Network Definitions. Behind the Scenes: Networking and Security Chapter Topics Part 1 Behind the Scenes: Networking and Security CS10001 Computer Literacy Business Networks Network Advantages Client/Server Networks Network Classifications Servers Toplogies Chapter

More information

CS268: Beyond TCP Congestion Control

CS268: Beyond TCP Congestion Control TCP Problems CS68: Beyond TCP Congestion Control Ion Stoica February 9, 004 When TCP congestion control was originally designed in 1988: - Key applications: FTP, E-mail - Maximum link bandwidth: 10Mb/s

More information

TCP so far Computer Networking Outline. How Was TCP Able to Evolve

TCP so far Computer Networking Outline. How Was TCP Able to Evolve TCP so far 15-441 15-441 Computer Networking 15-641 Lecture 14: TCP Performance & Future Peter Steenkiste Fall 2016 www.cs.cmu.edu/~prs/15-441-f16 Reliable byte stream protocol Connection establishments

More information

An Introduction to Key Management for Secure Storage. Walt Hubis, LSI Corporation

An Introduction to Key Management for Secure Storage. Walt Hubis, LSI Corporation An Introduction to Key Management for Secure Storage Walt Hubis, LSI Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may

More information

Optimizing Parallel Access to the BaBar Database System Using CORBA Servers

Optimizing Parallel Access to the BaBar Database System Using CORBA Servers SLAC-PUB-9176 September 2001 Optimizing Parallel Access to the BaBar Database System Using CORBA Servers Jacek Becla 1, Igor Gaponenko 2 1 Stanford Linear Accelerator Center Stanford University, Stanford,

More information

designed. engineered. results. Parallel DMF

designed. engineered. results. Parallel DMF designed. engineered. results. Parallel DMF Agenda Monolithic DMF Parallel DMF Parallel configuration considerations Monolithic DMF Monolithic DMF DMF Databases DMF Central Server DMF Data File server

More information

File Protection using rsync. Setup guide

File Protection using rsync. Setup guide File Protection using rsync Setup guide Contents 1. Introduction... 2 Documentation... 2 Licensing... 2 Overview... 2 2. Rsync technology... 3 Terminology... 3 Implementation... 3 3. Rsync data hosts...

More information

Chapter 1: Introduction

Chapter 1: Introduction Chapter 1: Introduction Chapter 1: Introduction What Operating Systems Do Computer-System Organization Computer-System Architecture Operating-System Structure Operating-System Operations Process Management

More information

CS 471 Operating Systems. Yue Cheng. George Mason University Fall 2017

CS 471 Operating Systems. Yue Cheng. George Mason University Fall 2017 CS 471 Operating Systems Yue Cheng George Mason University Fall 2017 Outline o Process concept o Process creation o Process states and scheduling o Preemption and context switch o Inter-process communication

More information

X1 StorNext SAN. Jim Glidewell Information Technology Services Boeing Shared Services Group

X1 StorNext SAN. Jim Glidewell Information Technology Services Boeing Shared Services Group X1 StorNext SAN Jim Glidewell Information Technology Services Boeing Shared Services Group Current HPC Systems Page 2 Two Cray T-90's A 384 CPU Origin 3800 Three 256-CPU Linux clusters Cray X1 X1 SAN -

More information

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance 11 th International LS-DYNA Users Conference Computing Technology LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton

More information

COS 318: Operating Systems

COS 318: Operating Systems COS 318: Operating Systems File Systems: Abstractions and Protection Jaswinder Pal Singh Computer Science Department Princeton University (http://www.cs.princeton.edu/courses/cos318/) Topics What s behind

More information

PAC485 Managing Datacenter Resources Using the VirtualCenter Distributed Resource Scheduler

PAC485 Managing Datacenter Resources Using the VirtualCenter Distributed Resource Scheduler PAC485 Managing Datacenter Resources Using the VirtualCenter Distributed Resource Scheduler Carl Waldspurger Principal Engineer, R&D This presentation may contain VMware confidential information. Copyright

More information

Lustre usages and experiences

Lustre usages and experiences Lustre usages and experiences at German Climate Computing Centre in Hamburg Carsten Beyer High Performance Computing Center Exclusively for the German Climate Research Limited Company, non-profit Staff:

More information

Lustre2.5 Performance Evaluation: Performance Improvements with Large I/O Patches, Metadata Improvements, and Metadata Scaling with DNE

Lustre2.5 Performance Evaluation: Performance Improvements with Large I/O Patches, Metadata Improvements, and Metadata Scaling with DNE Lustre2.5 Performance Evaluation: Performance Improvements with Large I/O Patches, Metadata Improvements, and Metadata Scaling with DNE Hitoshi Sato *1, Shuichi Ihara *2, Satoshi Matsuoka *1 *1 Tokyo Institute

More information

Extraordinary HPC file system solutions at KIT

Extraordinary HPC file system solutions at KIT Extraordinary HPC file system solutions at KIT Roland Laifer STEINBUCH CENTRE FOR COMPUTING - SCC KIT University of the State Roland of Baden-Württemberg Laifer Lustre and tools for ldiskfs investigation

More information

Ultra high-speed transmission technology for wide area data movement

Ultra high-speed transmission technology for wide area data movement Ultra high-speed transmission technology for wide area data movement Michelle Munson, president & co-founder Aspera Outline Business motivation Moving ever larger file sets over commodity IP networks (public,

More information

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM Note: Before you use this information

More information

Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures. 13 November 2016

Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures. 13 November 2016 National Aeronautics and Space Administration Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures 13 November 2016 Carrie Spear (carrie.e.spear@nasa.gov) HPC Architect/Contractor

More information

Reduces latency and buffer overhead. Messaging occurs at a speed close to the processors being directly connected. Less error detection

Reduces latency and buffer overhead. Messaging occurs at a speed close to the processors being directly connected. Less error detection Switching Operational modes: Store-and-forward: Each switch receives an entire packet before it forwards it onto the next switch - useful in a general purpose network (I.e. a LAN). usually, there is a

More information

The MOSIX Scalable Cluster Computing for Linux. mosix.org

The MOSIX Scalable Cluster Computing for Linux.  mosix.org The MOSIX Scalable Cluster Computing for Linux Prof. Amnon Barak Computer Science Hebrew University http://www. mosix.org 1 Presentation overview Part I : Why computing clusters (slide 3-7) Part II : What

More information

Slurm at the George Washington University Tim Wickberg - Slurm User Group Meeting 2015

Slurm at the George Washington University Tim Wickberg - Slurm User Group Meeting 2015 Slurm at the George Washington University Tim Wickberg - wickberg@gwu.edu Slurm User Group Meeting 2015 September 16, 2015 Colonial One What s new? Only major change was switch to FairTree Thanks to BYU

More information

Chapter 11: Implementing File Systems

Chapter 11: Implementing File Systems Chapter 11: Implementing File Systems Operating System Concepts 99h Edition DM510-14 Chapter 11: Implementing File Systems File-System Structure File-System Implementation Directory Implementation Allocation

More information

PASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year

PASS4TEST. IT Certification Guaranteed, The Easy Way!   We offer free update service for one year PASS4TEST \ http://www.pass4test.com We offer free update service for one year Exam : E20-329 Title : Technology Architect Backup and Recovery Solutions Design Exam Vendor : EMC Version : DEMO Get Latest

More information