HPCMP Requirements for Metadata and Archiving at Scale

Similar documents
UNCLASSIFIED. UNCLASSIFIED R-1 Line Item #49 Page 1 of 10

Tape Data Storage in Practice Minnesota Supercomputing Institute

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

Computer Science Section. Computational and Information Systems Laboratory National Center for Atmospheric Research

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

Preservation at scale

PRODUCT BROCHURE Mailbox Shuttle and Archive Shuttle:

At a Glance. Employees 1,500. Office Locations. 16 across the U.S.

HPSS Treefrog Summary MARCH 1, 2018

Headquarters U.S. Air Force

Organizational Update: December 2015

Digital Preservation and The Digital Repository Infrastructure

Moving From Reactive to Proactive Storage Management with an On-demand Cloud Solution

Government IT Modernization and the Adoption of Hybrid Cloud

Storage for HPC, HPDA and Machine Learning (ML)

Engineered Resilient Systems Advanced Analytics and Modeling in Support of Acquisition

Advancing the Role of DT&E in the Systems Engineering Process:

Tape Sucks for Long-Term Retention Time to Move to the Cloud. How Cloud is Transforming Legacy Data Strategies

Object storage platform How it can help? Martin Lenk, Specialist Senior Systems Engineer Unstructured Data Solution, Dell EMC

AUTOMATING IBM SPECTRUM SCALE CLUSTER BUILDS IN AWS PROOF OF CONCEPT

The Hangover 2.5: Waking Up to Exchange 2010 Archiving and ediscovery Realities. Vision Session IM B29

An Introduction to GPFS

AFRL-ML-WP-TM

TECHNICAL GUIDELINES FOR APPLICANTS TO PRACE 13 th CALL (T ier-0)

An introduction to GPFS Version 3.3

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

Web-based secure high performance remote visualization

The SHARED hosting plan is designed to meet the advanced hosting needs of businesses who are not yet ready to move on to a server solution.

Automated Storage Tiering on Infortrend s ESVA Storage Systems

Novetta Cyber Analytics

Test Resource Management Center Directed Energy T&E Conference A Joint DEPS ITEA Event

New research on Key Technologies of unstructured data cloud storage

RAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System

Copyright 2013, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12

DoD Information Technology Security Certification and Accreditation Process (DITSCAP) A presentation by Lawrence Feinstein, CISSP

Ohio Supercomputer Center

Bringing Core-Level Data Protection Solutions to the Tactical Field. January 2018

National Aeronautics and Space Administration Jet Propulsion Laboratory California Institute of Technology WISE Archive.

Electronic Records Archives: Philadelphia Federal Executive Board

Edge for All Business

High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Advanced Technical Exploitation Contract

GPFS Experiences from the Argonne Leadership Computing Facility (ALCF) William (Bill) E. Allcock ALCF Director of Operations

Symantec Enterprise Vault

Data storage services at KEK/CRC -- status and plan

Active Archive and the State of the Industry

WHITE PAPER. Operationalizing Threat Intelligence Data: The Problems of Relevance and Scale

Architecting for Resiliency Army s Common Operating Environment (COE) SERC

Overcoming Business Challenges in WAN infrastructure

Exchange 2010 & 2013 Archiving & ediscovery Realities

The Perfect Storm Cyber RDT&E

DISA CLOUD CLOUD SYMPOSIUM

TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD

Baseline Configuration

GlobalSearch Security Definition Guide

SEMI-DEDICATED SERVERS WITH CREATE A ONLINE STORE 123 FREE

NARA s Electronic Records Archives Program

ARCHIVE ESSENTIALS: Key Considerations When Moving to Office 365 DISCUSSION PAPER

Introduction to FREE National Resources for Scientific Computing. Dana Brunson. Jeff Pummill

CSC Operating Systems Spring Lecture - XIX Storage and I/O - II. Tevfik Koşar. Louisiana State University.

RAID Structure. RAID Levels. RAID (cont) RAID (0 + 1) and (1 + 0) Tevfik Koşar. Hierarchical Storage Management (HSM)

CSD3 The Cambridge Service for Data Driven Discovery. A New National HPC Service for Data Intensive science

File Storage Management Systems (FSMS) and ANSI/AIIM MS66

McKesson mixes SSDs with HDDs for Optimal Performance and ROI. Bob Fine, Dir., Product Marketing

I/O Challenges: Todays I/O Challenges for Big Data Analysis. Henry Newman CEO/CTO Instrumental, Inc. April 30, 2013

UBIQUITIOUS, RESILIENT, SECURE CONNECTIVITY IN THE NEAR-PEER THREAT ENVIRONMENT

The Success of the AMRAAM DBMS/DAS

An End User s Perspective of Central Administration

Openness, Growth, Evolution, and Closure in Archival Information Systems

Information Infrastructure Forum

Towards a Federated SOA Model in Achieving Data Interoperability in DoD. Nick Duan, Ph.D. ManTech MBI AFCEA/GMU C4I Symposium May 20, 2008

Parallel File Systems Compared

Selecting the Right Method

Mitigating Risk of Data Loss in Preservation Environments

NextGen Interagency Experimentation Hub

Abstract. The Challenges. ESG Lab Review InterSystems IRIS Data Platform: A Unified, Efficient Data Platform for Fast Business Insight

White paper 200 Camera Surveillance Video Vault Solution Powered by Fujitsu

KeyBanc Industrial, Automotive and Transportation Conference

Improved Solutions for I/O Provisioning and Application Acceleration

Portal Development for High Performance Computing (HPC) at Maui High Performance Computing Center (MHPCC)

Data Movement & Tiering with DMF 7

XtreemStore A SCALABLE STORAGE MANAGEMENT SOFTWARE WITHOUT LIMITS YOUR DATA. YOUR CONTROL

Providing a first class, enterprise-level, backup and archive service for Oxford University

Five-Year Strategic Plan

Brooke Roecker, Kristen Ward, Chris Mickle, Sarah Wright & Shauna McKellar

Cyber, Command, Control, Communications, and Computers Assessments Division (C5AD)

irods at TACC: Secure Infrastructure for Open Science Chris Jordan

Total Cost of Ownership: Benefits of the OpenText Cloud

Utilizing Network Emulation to Provide Cost Effective Large Scale System of Systems Test and Evaluation

enhance the network transform performance

Domestic Nuclear Detection Office (DNDO) DNDO Overview

Product Overview Archive2Anywhere. From Archive360

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

Enabling Hybrid Cloud Transformation

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

Archive exchange Format AXF

Hortonworks DataFlow. Accelerating Big Data Collection and DataFlow Management. A Hortonworks White Paper DECEMBER Hortonworks DataFlow

IBM Spectrum Scale IO performance

Evolving Storage Technology

Transcription:

CMP Requirements for Metadata and Archiving at Scale Amanda Tumminello, Navy DSRC April 2019 DoD High Performance Computing Modernization Program ; distribution is unlimited

CMP Ecosystem Results Acquisition Science and Engineering Technology Decision Support Acquisition Engineering DoD DoD Supercomputing Resource Centers (DSRCs) (DSRCs) U.S. Air Force Research Laboratory DSRC A technology-led, innovation-focused program providing the computational environments to solve the Department's critical mission challenges. Decision Support Acquisition Engineering Networking and and Security Defense Research & Engineering Network (DREN) Decision Support Acquisition Engineering Software Applications Core Software Decision Test and Support Evaluation U.S. Army Research Laboratory DSRC Computational Environments Maui High Performance Computing Center DSRC U.S. Army Engineer Research and Development Center DSRC Computer Network Defense, Security R&D, and Security Integration Education and Training Decision Acquisition Support Engineering U.S. Navy DSRC C User Support Page-2

DoD Supercomputing Resource Centers US Army Research Lab DSRC US Air Force Maui C Center DSRC US Air Force Research Laboratory DSRC US Army Engineer Research and Development Center DSRC https://centers.hpc.mil US Navy DSRC Page-3

By the numbers Decision Support Acquisition Engineering DoD Supercomputing Resource Centers (DSRCs) Resource Centers (DSRCs) Five DoD Supercomputing Resource Centers (DSRCs) in four states U.S. Army Research Laboratory DSRC Maui High Performance Computing Center DSRC U.S. Air Force Research Laboratory DSRC U.S. Army Engineer Research and Development Center DSRC U.S. Navy DSRC 350 staff ~2000 users from 3 DoD Services and additional DoD agencies 22 C systems from four manufacturers 995,896 cores Over 700 GPUs Over 700 accelerators (Phi and KNL) 45.6 Petaflops aggregate compute capability Over seven billion compute hours delivered annually 120 Petabytes of data stored 40 Gb interconnect between DSRCs Page-4

DoD Supercomputing Resource Centers Four Allocated DSRCs provide allocated resources to all CMP users Air Force Research Laboratory (AFRL) DSRC Wright-Patterson Air Force Base, Dayton, OH Army Research Laboratory (ARL) DSRC Aberdeen Proving Grounds, Aberdeen, MD Engineer Research and Development Center (ERDC) DSRC Information Technology Laboratory, Vicksburg, MS Navy DSRC Stennis Space Center, MS Today, C Centers supports 22 C systems with 995,896 cores and 45.6 PetaFLOPS capability One Vanguard Center provides exploratory architectures and technology evaluation to CMP and select users Maui High Performance Computing Center DSRC Kihei, Maui, HI Page-5

CMP Current Data Environment DSRC1 C1 C2 DSRC2 C1 C3 C2 DSRC3 C1 C3 C2 HSMC3 DSRC4 C1 C2 HSMC3 DSRC5 C1 C2 HSM C3 HSM Each DSRC is made up of the following: C systems File System (Lustre/GPFS) Center-Wide File system (GPFS) Archival Storage (SAM-FS/Tape) HSM Most of our applications currently require data access via POSIX interface Page-6

Current Customer Workflow We have two completely different types of workflows Research Time Sensitive Page-7

Research Workflow Users ingest data to desired DSRC C system (or systems) Users run applications on C systems, store data on regularly scrubbed parallel file system Three options for saving data semi-permanently or permanently: Copy data to remote are outside of the program Copy data to the Center-Wide File System Copy data to the HSM at the site of their choice. Page-8

Time Sensitive I/O variability hampers processing I/O tends to be a mix of large block sequential and small block random Ensured performance is a desired characteristic Page-9

Current Limitations Duplication of data across sites Staging data to the worksite Migrating data from old technology to new technology Quotas Ability to query data holding based on user, group, and data type Inability to profile user I/O behavior in real time Page-10

If I had three wishes (or 30) When dealing with wishes for metadata and archival at scale there are three perspectives. That of the user, the administrator, and the acquisition team (show me the money). Let s look into the wishes of each perspective Page-11

What a user wants Metadata Ability to find physical location of all copies of data. How many files and total capacity being used (how much is left?) I/O characteristics of the physical residence Staging of data to specific resources Chain of custody (user and digital access) Extended attributes that are easily searchable Ability to enhance/add/change attributes as environment and science change Archive Ability to create transportable archives (physical/cloud) Page-12

What an admin needs Real time I/O characteristics Easily queried sensitivity levels for data Encryption capabilities and key management Data at Rest Encryption Data in Flight Encryption Data Encryption per Object User defined Encryption Group defined Encryption Page-13

What and admin also needs Chain of Custody User, Project, Organizational level reporting for utilization and data location Data collection and movement for user/project/organization Data curation for lifetime of project Heat maps of metadata operations during processing Efficient means to purge data Ability to prove Qos of I/O subsystem Page-14

Balancing needs within an acquisition Detailed reports (how/what/when/where) User level reports, System level reports (high level), Physical reports on data at rest (low level), Program level reports (enterprise level), Reports on data in flight (how is data moving) Auditing of lifecycle of data Analysis of metadata types to data capacity for cost analysis Amount of time file/object spends on media type (SSD, HDD, tape) Heat maps of I/O traffic and utilization across the Enterprise Page-15

Page-16