Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments
|
|
- Mervyn Walters
- 6 years ago
- Views:
Transcription
1 LCI HPC Revolution April 2005 Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments Matthew Woitaszek
2 Collaborators Organizations National Center for Atmospheric Research (NCAR) University of Colorado, Boulder (CU) Researchers Jason Cope Michael Oberg Henry Tufo 26 April
3 Outline Motivation Parallel Filesystem Products and Experiences PVFS2, Lustre, GPFS, TerraGrid FS Results Single-Node Performance Parallel Bandwidth and Metadata Performance Future Work 26 April
4 LLNL NCSA SDSC Related Work Directly involved with Cluster File Systems and Lustre Goal: Create a filesystem not restricted to specific hardware Exploring the breadth of available parallel filesystems Integration with Mass Storage and TeraGrid systems Examined GPFS, GFS, Panasas, SamFS, SGI CXFS, an ADIC solution, Lustre, and IBRIX LCI 2004: Examined PVFS, Lustre, and GPFS on IA-64 Focused on a homogeneous architecture with fibre-channel equipped storage servers 26 April
5 Motivation NCAR Storage Systems Supercomputers and Clusters and local working storage Archival Storage Tape silo system Archive Management and disk cache controller Visualization Systems and local working storage Grid Gateway GridFTP Server DataMover Server Shared Storage Cluster with shared filesystem 26 April
6 Motivation Current CU Boulder Systems NFS Servers Home Directories Shared Software Working Space Compute Clusters Xeon Cluster (64) PPC970 Cluster (28) 26 April
7 Motivation Future CU Boulder Systems CU Storage Cluster NCAR Experimental Platforms CU Compute Clusters Xeon Cluster (64) PPC970 Cluster (28) Opteron Cluster (128) 26 April
8 Parallel Filesystem Features Typical desired features Availability (no downtime) Reliability (no data loss) Performance (no waiting) Scalability (no limits) Affordability (no cost) We re starting small, but we want our filesystem to grow with us! Two storage servers at the present time Support expansion and external connectivity 26 April
9 Research Objectives Find a high performance parallel filesystem Minimum of specialized hardware Commodity servers with directly attached disk Filesystem access over Ethernet Support heterogeneous cluster client environment Servers: Xeon, Xeon EMT64 Clients: PPC970, Xeon, and eventually Opteron Examine features and requirements Functionality Performance Administrative overhead 26 April
10 Filesystems Overview and Experience We examined cluster-based filesystems PVFS2 Lustre GPFS TerraFS We did not examine SAN solutions SAN solutions require expensive hardware NCAR uses SANs as collective storage among hosts but not between supercomputers Separate NCAR evaluation team At SC2004, neither GPFS nor Lustre supported Xeon and PPC970 heterogeneous client environments 26 April
11 Experience PVFS2 Installation and configuration Compile kernel module on clients only, no restrictions Two storage servers and one metadata server Very easy to install and configure Worked on our original systems with no kernel changes Stable and reliable parallel filesystem in our environment 26 April
12 Experience in phases Experience Lustre Phase 1: Trying to build our own kernel patches Phase 2: Using the pre-built Lustre kernels Phase 3: Using a custom Lustre kernel Final configuration Required changing Xeon cluster to SLES9 Custom Lustre PPC-enabled kernels using SLES 9 Two object store targets and one metadata server Final phase worked on all systems in our environment Very reliable on Xeon Less reliable with performance variances on PPC April
13 Experience GPFS Installation and Configuration Compile kernel module on all machines, very restricted Quick and pleasant out-of-box experience Exceptionally well documented and robust Final Configuration Required removing LVM on storage servers Required changing Xeon cluster to SLES9 Two NSD storage servers Worked on all of the clusters in our environment 26 April
14 TerraGrid (TerraFS) iscsi Linux VFS Linux md iscsi Components iscsi initiator on clients Cache coherent iscsi target daemon Linux md (multi-device) software SCSI Kernel level extz derivative filesystem TerraFS Daemon Linux file system Word sense disambiguation Official product name is TerraGrid Frequently abbreviated as TerraFS 26 April
15 Experience TerraFS Installation and configuration Initial install performed by TerraScale engineers Replicated on additional nodes Documentation and software differ slightly (md RAID support) Final configuration Required custom TerraScale built kernel Two storage targets, no metadata server Final phase worked only on the Xeon cluster Generates lots of error messages in failure conditions No current support for PPC April
16 Table of Administrator Pain and Agony Intel x86-64 Metadata server Intel Xeon Storage server PPC970 Client Intel Xeon Client GPFS 2.3 Lustre PVFS2 TerraFS Not Used Restricted SLES.111 Restricted SLES.111 Restricted SLES.111 Restricted SLES.141 Restricted SLES.141 Restricted SLES.141 Restricted SLES.141 No Change No Change All (Module Only) All (Module Only) Our Xeon cluster is only 2.5 years old Not Used No Change N/A Custom Patch GPFS required a commercial OS and a specific kernel version Lustre required a commercial OS and a specific kernel patch TerraFS required a custom kernel Systems already running SuSE required less effort Original goal was to fit filesystem in environment 26 April
17 Performance Experimental Setup Storage Servers Dual Xeon 3.06 GHz 2.5 GB RAM SCSI320 disk array 4 400GB LVM partitions Metadata Server (optional) Dual Xeon EMT GHz 8 GB RAM Xeon Cluster (7 or 14) Core Switch Dual 1Gbps Trunked/Bonded PPC970 Cluster (14 or 27) Other impromptu independent variables Impact of Linux channel bonding servers (PVFS2, Lustre) Impact of Linux Logical Volume Management (Lustre) One disclaimer: GPFS was not run with LVM 26 April
18 Performance Results I Single Node Bandwidth CU workload characteristics Clusters utilized as a compute farm 75% jobs are serial (33% compute time) Used iozone to measure single node performance 26 April
19 Single Node Read Performance NFS PVFS2 Lustre TerraFS GPFS Local 26 April
20 Single Node Write Performance NFS PVFS2 Lustre TerraFS GPFS Local 26 April
21 Performance Results II Aggregate Bandwidth NCAR caggreio benchmark Used by NCAR for previous procurements Writes 30 x 128MB files: separate file per process Does not measure concurrent writer performance Measures average aggregate bandwidth Each process runs independently and is timed Average time is used to produce bandwidth Examined channel bonding variants Lustre No improvement PVFS2 Substantial improvement 26 April
22 Xeon Cluster Aggregate Read Rate 26 April
23 Xeon Cluster Aggregate Write Rate 26 April
24 PPC970 Cluster Aggregate Read Rate 26 April
25 PPC970 Cluster Aggregate Write Rate 26 April
26 Performance Results III - Metadata Testing NCAR metarates benchmark Used by NCAR for previous procurements Writes 10,000 files per task Places file in a single directory or unique directories Measures average file creation rate No GPFS results on the PPC Cluster GPFS was functional and was tested Unable to select balanced nodes for testing 26 April
27 Metadata Creation Rate Same Directory NFS PVFS2 Lustre TerraFS GPFS 26 April
28 Metadata Creation Rate Unique Directories NFS PVFS2 Lustre TerraFS GPFS 26 April
29 GPFS Metarates File creations per second in a unique directory for each task 26 April
30 Linux Logical Volume Management (LVM) There s always something GPFS was the last system we tested GPFS cannot run on top of LVM devices We used LVM with every other filesystem Lustre and GPFS demonstrated close bandwidth results Conclusion Did Linux Logical Volume Management affect Lustre s performance? LVM has no statistically significant impact on Lustre reads (the 95% confidence intervals overlap) Xeon cluster writes are faster without LVM on servers PPC970 cluster writes are inconclusive 26 April
31 Future Work Production filesystem installation Dedicate 2/3 server space to GPFS Reserve 1/3 server space for Lustre Subject filesystem to our user community MPI-IO concurrent write performance testing Wide area network filesystem access Examine higher performance and heterogeneous interconnects Infiniband, 10Gbps Ethernet, Gigabit Ethernet Single network solution not possible 26 April
32 Desired Features in a Production Filesystem Remain responsive even in failure conditions Filesystem failure should not interrupt standard UNIX commands used by administrators ls la /mnt or df should not hang the console Zombies should respond to kill s 9 Support clean normal and abnormal termination Support both service start and shutdown commands Provide an Emergency Stop feature Never hang Linux reboot command Cut losses and let the administrators fix things 26 April
33 Conclusions Heterogenous client support is a recent feature Expect full out-of-box capabilities in the next calendar year with GPFS, PVFS2, and Lustre Specific kernel dependencies and custom kernel patch implementations are a substantial inconvenience Parallel filesystem selection depends on individual site requirements and capabilities. Increased cost (operating system support contracts) Decreased research flexibility Delay when applying security patches Looking forward to Steve Woods Lustre presentation 26 April
34 Acknowledgements Cluster File Systems (Lustre) Jeffrey Denworth, Phil Schwan, and Jacob Berkman IBM (GPFS) Ray Paden, Gautam Shah, Barry Bolding, and Rajiv Bendale NCAR Bill Anderson, Pam Gillman, George Fuentes, and Rich Loft Terrascale Technologies (TerraFS) Tim Wilcox and Dave Jensen University of Colorado, Boulder Theron Voran 26 April
35 Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments Questions? Matthew Woitaszek
Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments
Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments Jason Cope*, Michael Oberg*, Henry M. Tufo*, and Matthew Woitaszek* * University of Colorado, Boulder National Center for Atmospheric
More informationParallel File Systems Compared
Parallel File Systems Compared Computing Centre (SSCK) University of Karlsruhe, Germany Laifer@rz.uni-karlsruhe.de page 1 Outline» Parallel file systems (PFS) Design and typical usage Important features
More informationParallel File Systems for HPC
Introduction to Scuola Internazionale Superiore di Studi Avanzati Trieste November 2008 Advanced School in High Performance and Grid Computing Outline 1 The Need for 2 The File System 3 Cluster & A typical
More informationExploration of Parallel Storage Architectures for a Blue Gene/L on the TeraGrid
Exploration of Parallel Storage Architectures for a Blue Gene/L on the TeraGrid Michael Oberg 2, Henry M. Tufo 1,2, and Matthew Woitaszek 2 oberg@ucar.edu 1 University of Colorado, Boulder, CO 2 National
More informationRAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System
RAIDIX Data Storage Solution Clustered Data Storage Based on the RAIDIX Software and GPFS File System 2017 Contents Synopsis... 2 Introduction... 3 Challenges and the Solution... 4 Solution Architecture...
More informationComputer Science Section. Computational and Information Systems Laboratory National Center for Atmospheric Research
Computer Science Section Computational and Information Systems Laboratory National Center for Atmospheric Research My work in the context of TDD/CSS/ReSET Polynya new research computing environment Polynya
More informationLS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance
11 th International LS-DYNA Users Conference Computing Technology LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton
More informationWrite a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical
Identify a problem Review approaches to the problem Propose a novel approach to the problem Define, design, prototype an implementation to evaluate your approach Could be a real system, simulation and/or
More informationStorage and Storage Access
Rainer Többicke CERN/IT 1 Introduction Data access Raw data, analysis data, software repositories, calibration data Small files, large files Frequent access Sequential access, random access Large variety
More informationAn Introduction to GPFS
IBM High Performance Computing July 2006 An Introduction to GPFS gpfsintro072506.doc Page 2 Contents Overview 2 What is GPFS? 3 The file system 3 Application interfaces 4 Performance and scalability 4
More informationHPC In The Cloud? Michael Kleber. July 2, Department of Computer Sciences University of Salzburg, Austria
HPC In The Cloud? Michael Kleber Department of Computer Sciences University of Salzburg, Austria July 2, 2012 Content 1 2 3 MUSCLE NASA 4 5 Motivation wide spread availability of cloud services easy access
More informationFeedback on BeeGFS. A Parallel File System for High Performance Computing
Feedback on BeeGFS A Parallel File System for High Performance Computing Philippe Dos Santos et Georges Raseev FR 2764 Fédération de Recherche LUmière MATière December 13 2016 LOGO CNRS LOGO IO December
More informationMission-Critical Enterprise Linux. April 17, 2006
Mission-Critical Enterprise Linux April 17, 2006 Agenda Welcome Who we are & what we do Steve Meyers, Director Unisys Linux Systems Group (steven.meyers@unisys.com) Technical Presentations Xen Virtualization
More informationSystem input-output, performance aspects March 2009 Guy Chesnot
Headline in Arial Bold 30pt System input-output, performance aspects March 2009 Guy Chesnot Agenda Data sharing Evolution & current tendencies Performances: obstacles Performances: some results and good
More informationACCRE High Performance Compute Cluster
6 중 1 2010-05-16 오후 1:44 Enabling Researcher-Driven Innovation and Exploration Mission / Services Research Publications User Support Education / Outreach A - Z Index Our Mission History Governance Services
More informationData Management. Parallel Filesystems. Dr David Henty HPC Training and Support
Data Management Dr David Henty HPC Training and Support d.henty@epcc.ed.ac.uk +44 131 650 5960 Overview Lecture will cover Why is IO difficult Why is parallel IO even worse Lustre GPFS Performance on ARCHER
More informationBeyond Petascale. Roger Haskin Manager, Parallel File Systems IBM Almaden Research Center
Beyond Petascale Roger Haskin Manager, Parallel File Systems IBM Almaden Research Center GPFS Research and Development! GPFS product originated at IBM Almaden Research Laboratory! Research continues to
More informationCoordinating Parallel HSM in Object-based Cluster Filesystems
Coordinating Parallel HSM in Object-based Cluster Filesystems Dingshan He, Xianbo Zhang, David Du University of Minnesota Gary Grider Los Alamos National Lab Agenda Motivations Parallel archiving/retrieving
More informationSun Lustre Storage System Simplifying and Accelerating Lustre Deployments
Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments Torben Kling-Petersen, PhD Presenter s Name Principle Field Title andengineer Division HPC &Cloud LoB SunComputing Microsystems
More informationA Comparative Experimental Study of Parallel File Systems for Large-Scale Data Processing
A Comparative Experimental Study of Parallel File Systems for Large-Scale Data Processing Z. Sebepou, K. Magoutis, M. Marazakis, A. Bilas Institute of Computer Science (ICS) Foundation for Research and
More informationCrossing the Chasm: Sneaking a parallel file system into Hadoop
Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large
More informationInfoBrief. Platform ROCKS Enterprise Edition Dell Cluster Software Offering. Key Points
InfoBrief Platform ROCKS Enterprise Edition Dell Cluster Software Offering Key Points High Performance Computing Clusters (HPCC) offer a cost effective, scalable solution for demanding, compute intensive
More informationUnleashing Clustered ECMWF-Workshop 2004
Terrascale Technologies, Inc. Unleashing Clustered Computing @ ECMWF-Workshop 2004 Gautham Sastri / Kolja Kuse 1 Company snapshot Founded in Nov/2002 by Gautham Sastri and Iain Findleton Headquarters in
More informationLinux Clustering Technologies. Mark Spencer November 8, 2005
Linux Clustering Technologies Mark Spencer November 8, 2005 Presentation Topics Business Drivers Clustering Methods High Availability High Performance Cluster Filesystems Volume Managers Business Drivers
More informationSystem that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files
System that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files Addressable by a filename ( foo.txt ) Usually supports hierarchical
More informationAccelerating Parallel Analysis of Scientific Simulation Data via Zazen
Accelerating Parallel Analysis of Scientific Simulation Data via Zazen Tiankai Tu, Charles A. Rendleman, Patrick J. Miller, Federico Sacerdoti, Ron O. Dror, and David E. Shaw D. E. Shaw Research Motivation
More informationpnfs, POSIX, and MPI-IO: A Tale of Three Semantics
Dean Hildebrand Research Staff Member PDSW 2009 pnfs, POSIX, and MPI-IO: A Tale of Three Semantics Dean Hildebrand, Roger Haskin Arifa Nisar IBM Almaden Northwestern University Agenda Motivation pnfs HPC
More informationStructuring PLFS for Extensibility
Structuring PLFS for Extensibility Chuck Cranor, Milo Polte, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University What is PLFS? Parallel Log Structured File System Interposed filesystem b/w
More informationHPC File Systems and Storage. Irena Johnson University of Notre Dame Center for Research Computing
HPC File Systems and Storage Irena Johnson University of Notre Dame Center for Research Computing HPC (High Performance Computing) Aggregating computer power for higher performance than that of a typical
More informationIBM Storwize V7000 Unified
IBM Storwize V7000 Unified Pavel Müller IBM Systems and Technology Group Storwize V7000 Position Enterprise Block DS8000 For clients requiring: Advanced disaster recovery with 3-way mirroring and System
More informationCrossing the Chasm: Sneaking a parallel file system into Hadoop
Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large
More informationComparing Linux Clusters for the Community Climate System Model
Comparing Linux Clusters for the Community Climate System Model Matthew Woitaszek, Michael Oberg, and Henry M. Tufo Department of Computer Science University of Colorado, Boulder {matthew.woitaszek, michael.oberg}@colorado.edu,
More informationAssessing performance in HP LeftHand SANs
Assessing performance in HP LeftHand SANs HP LeftHand Starter, Virtualization, and Multi-Site SANs deliver reliable, scalable, and predictable performance White paper Introduction... 2 The advantages of
More informationThe Hadoop Distributed File System Konstantin Shvachko Hairong Kuang Sanjay Radia Robert Chansler
The Hadoop Distributed File System Konstantin Shvachko Hairong Kuang Sanjay Radia Robert Chansler MSST 10 Hadoop in Perspective Hadoop scales computation capacity, storage capacity, and I/O bandwidth by
More informationRed Hat Global File System
Red Hat Global File System HP User Society / DECUS 18. Mai 2006 Joachim Schröder Red Hat GmbH Two Key Industry Trends Clustering (scale-out) is happening 20% of all servers shipped will be clustered by
More informationIBM Scale Out Network Attached Storage (SONAS) using the Acuo Universal Clinical Platform
IBM Scale Out Network Attached Storage (SONAS) using the Acuo Universal Clinical Platform A vendor-neutral medical-archive offering Dave Curzio IBM Systems and Technology Group ISV Enablement February
More informationThe Optimal CPU and Interconnect for an HPC Cluster
5. LS-DYNA Anwenderforum, Ulm 2006 Cluster / High Performance Computing I The Optimal CPU and Interconnect for an HPC Cluster Andreas Koch Transtec AG, Tübingen, Deutschland F - I - 15 Cluster / High Performance
More informationSGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012
SGI Overview HPC User Forum Dearborn, Michigan September 17 th, 2012 SGI Market Strategy HPC Commercial Scientific Modeling & Simulation Big Data Hadoop In-memory Analytics Archive Cloud Public Private
More informationThe MOSIX Scalable Cluster Computing for Linux. mosix.org
The MOSIX Scalable Cluster Computing for Linux Prof. Amnon Barak Computer Science Hebrew University http://www. mosix.org 1 Presentation overview Part I : Why computing clusters (slide 3-7) Part II : What
More informationThe Last Bottleneck: How Parallel I/O can improve application performance
The Last Bottleneck: How Parallel I/O can improve application performance HPC ADVISORY COUNCIL STANFORD WORKSHOP; DECEMBER 6 TH 2011 REX TANAKIT DIRECTOR OF INDUSTRY SOLUTIONS AGENDA Panasas Overview Who
More informationOvercoming Obstacles to Petabyte Archives
Overcoming Obstacles to Petabyte Archives Mike Holland Grau Data Storage, Inc. 609 S. Taylor Ave., Unit E, Louisville CO 80027-3091 Phone: +1-303-664-0060 FAX: +1-303-664-1680 E-mail: Mike@GrauData.com
More informationShared File System Requirements for SAS Grid Manager. Table Talk #1546 Ben Smith / Brian Porter
Shared File System Requirements for SAS Grid Manager Table Talk #1546 Ben Smith / Brian Porter About the Presenters Main Presenter: Ben Smith, Technical Solutions Architect, IBM smithbe1@us.ibm.com Brian
More informationLustre A Platform for Intelligent Scale-Out Storage
Lustre A Platform for Intelligent Scale-Out Storage Rumi Zahir, rumi. May 2003 rumi.zahir@intel.com Agenda Problem Statement Trends & Current Data Center Storage Architectures The Lustre File System Project
More informationCyberStore DSS. Multi Award Winning. Broadberry. CyberStore DSS. Open-E DSS v7 based Storage Appliances. Powering these organisations
Multi Award Winning CyberStore DSS CyberStore DSS CyberStore DSS Storage Appliances: Open-E DSS v7 based Storage Appliances The CyberStore DSS range of iscsi SAN / NAS storage appliances include 1U-4U
More informationSymantec NetBackup PureDisk Compatibility Matrix Created August 26, 2010
Symantec NetBackup PureDisk 6.6.1 Compatibility Matrix Created August 26, 2010 Copyright 2010 Symantec Corporation. All rights reserved. Symantec, the Symantec Logo, and Backup Exec are trademarks or registered
More informationToward An Integrated Cluster File System
Toward An Integrated Cluster File System Adrien Lebre February 1 st, 2008 XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 Outline Context Kerrighed and root file
More informationIBM Emulex 16Gb Fibre Channel HBA Evaluation
IBM Emulex 16Gb Fibre Channel HBA Evaluation Evaluation report prepared under contract with Emulex Executive Summary The computing industry is experiencing an increasing demand for storage performance
More informationShared Object-Based Storage and the HPC Data Center
Shared Object-Based Storage and the HPC Data Center Jim Glidewell High Performance Computing BOEING is a trademark of Boeing Management Company. Computing Environment Cray X1 2 Chassis, 128 MSPs, 1TB memory
More informationDVS, GPFS and External Lustre at NERSC How It s Working on Hopper. Tina Butler, Rei Chi Lee, Gregory Butler 05/25/11 CUG 2011
DVS, GPFS and External Lustre at NERSC How It s Working on Hopper Tina Butler, Rei Chi Lee, Gregory Butler 05/25/11 CUG 2011 1 NERSC is the Primary Computing Center for DOE Office of Science NERSC serves
More informationREMEM: REmote MEMory as Checkpointing Storage
REMEM: REmote MEMory as Checkpointing Storage Hui Jin Illinois Institute of Technology Xian-He Sun Illinois Institute of Technology Yong Chen Oak Ridge National Laboratory Tao Ke Illinois Institute of
More informationSAN, HPSS, Sam-QFS, and GPFS technology in use at SDSC
SAN, HPSS, Sam-QFS, and GPFS technology in use at SDSC Bryan Banister, San Diego Supercomputing Center bryan@sdsc.edu Manager, Storage Systems and Production Servers Production Services Department Big
More informationSami Saarinen Peter Towers. 11th ECMWF Workshop on the Use of HPC in Meteorology Slide 1
Acknowledgements: Petra Kogel Sami Saarinen Peter Towers 11th ECMWF Workshop on the Use of HPC in Meteorology Slide 1 Motivation Opteron and P690+ clusters MPI communications IFS Forecast Model IFS 4D-Var
More informationSCS Distributed File System Service Proposal
SCS Distributed File System Service Proposal Project Charter: To cost effectively build a Distributed networked File Service (DFS) that can grow to Petabyte scale, customized to the size and performance
More informationExperiences with HP SFS / Lustre in HPC Production
Experiences with HP SFS / Lustre in HPC Production Computing Centre (SSCK) University of Karlsruhe Laifer@rz.uni-karlsruhe.de page 1 Outline» What is HP StorageWorks Scalable File Share (HP SFS)? A Lustre
More informationHigh-Performance Lustre with Maximum Data Assurance
High-Performance Lustre with Maximum Data Assurance Silicon Graphics International Corp. 900 North McCarthy Blvd. Milpitas, CA 95035 Disclaimer and Copyright Notice The information presented here is meant
More informationMAHA. - Supercomputing System for Bioinformatics
MAHA - Supercomputing System for Bioinformatics - 2013.01.29 Outline 1. MAHA HW 2. MAHA SW 3. MAHA Storage System 2 ETRI HPC R&D Area - Overview Research area Computing HW MAHA System HW - Rpeak : 0.3
More informationEmerging Technologies for HPC Storage
Emerging Technologies for HPC Storage Dr. Wolfgang Mertz CTO EMEA Unstructured Data Solutions June 2018 The very definition of HPC is expanding Blazing Fast Speed Accessibility and flexibility 2 Traditional
More informationp5 520 server Robust entry system designed for the on demand world Highlights
Robust entry system designed for the on demand world IBM p5 520 server _` p5 520 rack system with I/O drawer Highlights Innovative, powerful, affordable, open and adaptable UNIX and Linux environment system
More informationINFOBrief. Dell-IBRIX Cluster File System Solution. Key Points
INFOBrief Dell-IBRIX Cluster File System Solution High-performance parallel, segmented file system for scale-out clusters, grid computing, and enterprise applications Capable of delivering linear scalability
More informationManufacturing Bringing New Levels of Performance to CAE Applications
Solution Brief: Manufacturing Bringing New Levels of Performance to CAE Applications Abstract Computer Aided Engineering (CAE) is used to help manufacturers bring products to market faster while maintaining
More informationINTEGRATING HPFS IN A CLOUD COMPUTING ENVIRONMENT
INTEGRATING HPFS IN A CLOUD COMPUTING ENVIRONMENT Abhisek Pan 2, J.P. Walters 1, Vijay S. Pai 1,2, David Kang 1, Stephen P. Crago 1 1 University of Southern California/Information Sciences Institute 2
More informationA Global Operating System for HPC Clusters
A Global Operating System Emiliano Betti 1 Marco Cesati 1 Roberto Gioiosa 2 Francesco Piermaria 1 1 System Programming Research Group, University of Rome Tor Vergata 2 BlueGene Software Division, IBM TJ
More informationThe RAMDISK Storage Accelerator
The RAMDISK Storage Accelerator A Method of Accelerating I/O Performance on HPC Systems Using RAMDISKs Tim Wickberg, Christopher D. Carothers wickbt@rpi.edu, chrisc@cs.rpi.edu Rensselaer Polytechnic Institute
More informationThe Last Bottleneck: How Parallel I/O can attenuate Amdahl's Law
The Last Bottleneck: How Parallel I/O can attenuate Amdahl's Law ERESEARCH AUSTRALASIA, NOVEMBER 2011 REX TANAKIT DIRECTOR OF INDUSTRY SOLUTIONS AGENDA Parallel System Parallel processing goes mainstream
More informationBlueGene/L. Computer Science, University of Warwick. Source: IBM
BlueGene/L Source: IBM 1 BlueGene/L networking BlueGene system employs various network types. Central is the torus interconnection network: 3D torus with wrap-around. Each node connects to six neighbours
More informationAn introduction to GPFS Version 3.3
IBM white paper An introduction to GPFS Version 3.3 Scott Fadden, IBM Corporation Contents 1 Overview 2 What is GPFS? 2 The file system 2 Application interfaces 3 Performance and scalability 3 Administration
More informationThe Leading Parallel Cluster File System
The Leading Parallel Cluster File System www.thinkparq.com www.beegfs.io ABOUT BEEGFS What is BeeGFS BeeGFS (formerly FhGFS) is the leading parallel cluster file system, developed with a strong focus on
More informationDELL EMC ISILON F800 AND H600 I/O PERFORMANCE
DELL EMC ISILON F800 AND H600 I/O PERFORMANCE ABSTRACT This white paper provides F800 and H600 performance data. It is intended for performance-minded administrators of large compute clusters that access
More informationBeoLink.org. Design and build an inexpensive DFS. Fabrizio Manfredi Furuholmen. FrOSCon August 2008
Design and build an inexpensive DFS Fabrizio Manfredi Furuholmen FrOSCon August 2008 Agenda Overview Introduction Old way openafs New way Hadoop CEPH Conclusion Overview Why Distributed File system? Handle
More informationHorizontal Scaling Solution using Linux Environment
Systems Software for the Next Generation of Storage Horizontal Scaling Solution using Linux Environment December 14, 2001 Carter George Vice President, Corporate Development PolyServe, Inc. PolyServe Goal:
More informationScaling a Global File System to the Greatest Possible Extent, Performance, Capacity, and Number of Users
Scaling a Global File System to the Greatest Possible Extent, Performance, Capacity, and Number of Users Phil Andrews, Bryan Banister, Patricia Kovatch, Chris Jordan San Diego Supercomputer Center University
More informationQLogic TrueScale InfiniBand and Teraflop Simulations
WHITE Paper QLogic TrueScale InfiniBand and Teraflop Simulations For ANSYS Mechanical v12 High Performance Interconnect for ANSYS Computer Aided Engineering Solutions Executive Summary Today s challenging
More informationHPC Architectures. Types of resource currently in use
HPC Architectures Types of resource currently in use Reusing this material This work is licensed under a Creative Commons Attribution- NonCommercial-ShareAlike 4.0 International License. http://creativecommons.org/licenses/by-nc-sa/4.0/deed.en_us
More informationECE7995 (7) Parallel I/O
ECE7995 (7) Parallel I/O 1 Parallel I/O From user s perspective: Multiple processes or threads of a parallel program accessing data concurrently from a common file From system perspective: - Files striped
More informationMicrosoft Office SharePoint Server 2007
Microsoft Office SharePoint Server 2007 Enabled by EMC Celerra Unified Storage and Microsoft Hyper-V Reference Architecture Copyright 2010 EMC Corporation. All rights reserved. Published May, 2010 EMC
More informationOutline 1 Motivation 2 Theory of a non-blocking benchmark 3 The benchmark and results 4 Future work
Using Non-blocking Operations in HPC to Reduce Execution Times David Buettner, Julian Kunkel, Thomas Ludwig Euro PVM/MPI September 8th, 2009 Outline 1 Motivation 2 Theory of a non-blocking benchmark 3
More informationData storage services at KEK/CRC -- status and plan
Data storage services at KEK/CRC -- status and plan KEK/CRC Hiroyuki Matsunaga Most of the slides are prepared by Koichi Murakami and Go Iwai KEKCC System Overview KEKCC (Central Computing System) The
More informationExperiences with the Parallel Virtual File System (PVFS) in Linux Clusters
Experiences with the Parallel Virtual File System (PVFS) in Linux Clusters Kent Milfeld, Avijit Purkayastha, Chona Guiang Texas Advanced Computing Center The University of Texas Austin, Texas USA Abstract
More informationBuilding an HPC Watering Hole for Boulder Area Computational Science
Building an HPC Watering Hole for Boulder Area Computational Science E. R. Jessup 1,H.M.Tufo 2, and M. S. Woitaszek 3 1 University of Colorado, Boulder, CO 80309, USA {jessup, tufo}@cs.colorado.edu http://www.cs.colorado.edu/
More informationComparing File (NAS) and Block (SAN) Storage
Comparing File (NAS) and Block (SAN) Storage January 2014 Contents Abstract... 3 Introduction... 3 Network-Attached Storage... 3 Storage Area Network... 4 Networks and Storage... 4 Network Roadmaps...
More informationIntroducing SUSE Enterprise Storage 5
Introducing SUSE Enterprise Storage 5 1 SUSE Enterprise Storage 5 SUSE Enterprise Storage 5 is the ideal solution for Compliance, Archive, Backup and Large Data. Customers can simplify and scale the storage
More informationCisco Prime Home 6.X Minimum System Requirements: Standalone and High Availability
White Paper Cisco Prime Home 6.X Minimum System Requirements: Standalone and High Availability White Paper August 2014 2014 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public
More informationSPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER. NAS Controller Should be rack mounted with a form factor of not more than 2U
SPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER S.No. Features Qualifying Minimum Requirements No. of Storage 1 Units 2 Make Offered 3 Model Offered 4 Rack mount 5 Processor 6 Memory
More informationData Movement & Storage Using the Data Capacitor Filesystem
Data Movement & Storage Using the Data Capacitor Filesystem Justin Miller jupmille@indiana.edu http://pti.iu.edu/dc Big Data for Science Workshop July 2010 Challenges for DISC Keynote by Alex Szalay identified
More informationAn ESS implementation in a Tier 1 HPC Centre
An ESS implementation in a Tier 1 HPC Centre Maximising Performance - the NeSI Experience José Higino (NeSI Platforms and NIWA, HPC Systems Engineer) Outline What is NeSI? The National Platforms Framework
More informationGFS: The Google File System
GFS: The Google File System Brad Karp UCL Computer Science CS GZ03 / M030 24 th October 2014 Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one
More informationReadme for Platform Open Cluster Stack (OCS)
Readme for Platform Open Cluster Stack (OCS) Version 4.1.1-2.0 October 25 2006 Platform Computing Contents What is Platform OCS? What's New in Platform OCS 4.1.1-2.0? Supported Architecture Distribution
More informationTOSS - A RHEL-based Operating System for HPC Clusters
TOSS - A RHEL-based Operating System for HPC Clusters Supercomputing 2017 Red Hat Booth November 14, 2017 Ned Bass System Software Development Group Leader Livermore Computing Division LLNL-PRES-741473
More informationIntel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage
Intel Enterprise Edition Lustre (IEEL-2.3) [DNE-1 enabled] on Dell MD Storage Evaluation of Lustre File System software enhancements for improved Metadata performance Wojciech Turek, Paul Calleja,John
More informationInfiniBand based storage target
Philippe BRUIANT Business Development Manager EMEA OpenFabrics Workshop Paris June 22-23, 2006 InfiniBand based storage target Tuesday, 27 June 2006 At-A-Glance Founded in Feb. 2000 by Veterans of Cheyenne
More informationThe benefits of. Clustered storage offers advantages in both performance and scalability, but users need to evaluate three different architectures.
The benefits of clustered block storage Clustered storage offers advantages in both performance and scalability, but users need to evaluate three different architectures. By Ray Lucchesi Today s data centers
More informationRedbooks Paper. GPFS/NSD on Linux/xSeries using a 100 Megabit Ethernet Network. Raymond L. Paden
Redbooks Paper Raymond L. Paden GPFS/NSD on Linux/xSeries using a 100 Megabit Ethernet Network A series of four benchmark tests were completed during 22 26 September 2003 at the IBM Benchmarking Center
More informationSONAS Best Practices and options for CIFS Scalability
COMMON INTERNET FILE SYSTEM (CIFS) FILE SERVING...2 MAXIMUM NUMBER OF ACTIVE CONCURRENT CIFS CONNECTIONS...2 SONAS SYSTEM CONFIGURATION...4 SONAS Best Practices and options for CIFS Scalability A guide
More informationProvisioning with SUSE Enterprise Storage. Nyers Gábor Trainer &
Provisioning with SUSE Enterprise Storage Nyers Gábor Trainer & Consultant @Trebut gnyers@trebut.com Managing storage growth and costs of the software-defined datacenter PRESENT Easily scale and manage
More informationRAIDIX 4.5. Product Features. Document revision 1.0
RAIDIX 4.5 Product Features Document revision 1.0 2017 Table of Content TABLE OF CONTENT... 2 RAIDIX 4.5... 3 HOW IT WORKS... 3 DUPLICATION OF HARDWARE COMPONENTS... 4 NETWORK ATTACHED STORAGE... 5 DISTINCTIVE
More informationHow то Use HPC Resources Efficiently by a Message Oriented Framework.
How то Use HPC Resources Efficiently by a Message Oriented Framework www.hp-see.eu E. Atanassov, T. Gurov, A. Karaivanova Institute of Information and Communication Technologies Bulgarian Academy of Science
More informationThe advantages of architecting an open iscsi SAN
Storage as it should be The advantages of architecting an open iscsi SAN Pete Caviness Lefthand Networks, 5500 Flatiron Parkway, Boulder CO 80301, Ph: +1-303-217-9043, FAX: +1-303-217-9020 e-mail: pete.caviness@lefthandnetworks.com
More informationDell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance
Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for Simulia
More informationFuture Trends in Hardware and Software for use in Simulation
Future Trends in Hardware and Software for use in Simulation Steve Feldman VP/IT, CD-adapco April, 2009 HighPerformanceComputing Building Blocks CPU I/O Interconnect Software General CPU Maximum clock
More informationEnterprise Volume Management System Project. April 2002
Enterprise Volume Management System Project April 2002 Mission Statement To create a state-of-the-art, enterprise level volume management system for Linux which will also reduce the costs associated with
More information