High Performance Computing Data Management. Philippe Trautmann BDM High Performance Computing Global Research


High Performance Computing Data Management
Philippe Trautmann, BDM High Performance Computing, Global Education & Research

HPC Market and Trends
High Performance Computing: Availability/Sharing is key
European Digital Preservation of Research Output initiative
Conclusions
Sun Confidential: Partner Under NDA Only

HPC Market Overview

IDC HPC Application/Industry Forecast

                                  Servers                                     Storage
Application Segment               2009 ($K)      2013 ($K)      CAGR (07-13)  2009 ($K)     % of Mkt
University Academic               $1,800,235     $2,337,419      6.75%        $571,344       16.27%
Govt. Lab                         $1,425,431     $1,863,896      6.93%        $433,087       12.33%
Bio Sciences                      $1,217,297     $1,781,031      9.98%        $652,271       18.57%
CAE                                 $952,761     $1,562,311     13.16%        $455,087       12.96%
Defense                             $871,585     $1,186,212      8.01%        $414,288       11.80%
EDA                                 $613,729       $948,920     11.51%        $173,687        4.94%
DCC & Distribution                  $576,228       $835,046      9.72%        $269,913        7.68%
Geosciences & Geo Engineering       $529,772       $807,039     11.10%        $222,042        6.32%
Weather                             $371,260       $545,329     10.09%        $119,956        3.42%
Economics/Financial                 $261,750       $421,115     12.62%         $64,663        1.84%
Chemical Engineering                $223,468       $260,900      3.95%         $88,262        2.51%
Other                               $182,756       $140,644     -6.34%         $20,227        0.58%
Mechanical Design & Drafting        $106,400        $98,205     -1.98%         $27,568        0.78%
Total Revenue                     $9,132,672    $12,788,067      4.10%      $3,512,395      100.00%

[Chart: IDC Server Revenue by Vendor 2008; vendors shown: HP, IBM, Dell, Sun, Other]

IDC estimates that for every $ spent on servers:
> An additional $.39 is spent on storage
> An additional $.25 is spent on services

Server Revenue by IDC Competitive Segments

Segment         Price Range      2009 TAM ($B)   CAGR (07-13)   Downside CAGR
Supercomputer   $500K and up     $2.58           3.20%           1.50%
Division        $250K - $500K    $1.30           1.60%          -0.70%
Department      $100K - $250K    $3.62           7.10%          -0.04%
Workgroup       <$100K           $1.73           1.90%          -0.06%
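The IDC rule of thumb on this slide (an extra $.39 on storage and $.25 on services per server dollar) can be applied to the 2009 server total as a quick arithmetic check. The following is a minimal sketch, not an IDC method: the function name `attached_spend` is hypothetical, and only the two ratios and the revenue totals come from the slide.

```python
# Ratios quoted on the slide (IDC estimate).
STORAGE_PER_SERVER_DOLLAR = 0.39
SERVICES_PER_SERVER_DOLLAR = 0.25


def attached_spend(server_revenue_k):
    """Return (storage, services, total) in $K for a server revenue in $K."""
    storage = server_revenue_k * STORAGE_PER_SERVER_DOLLAR
    services = server_revenue_k * SERVICES_PER_SERVER_DOLLAR
    return storage, services, server_revenue_k + storage + services


server_2009_k = 9_132_672   # total 2009 server revenue from the slide, $K
storage_2009_k = 3_512_395  # total 2009 storage revenue from the slide, $K

storage_k, services_k, total_k = attached_spend(server_2009_k)
print(f"estimated storage spend:  ${storage_k:,.0f}K")
print(f"estimated services spend: ${services_k:,.0f}K")
print(f"servers + storage + services: ${total_k:,.0f}K")

# The rounded $.39 ratio lands close to the storage total on the slide.
ratio = storage_k / storage_2009_k
print(f"estimate / slide storage total = {ratio:.3f}")
```

The estimate comes out within about 1.5% of the storage total shown on the slide, which is what one would expect if the $.39 figure is simply the rounded ratio of the two totals.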

The Importance of HPC
Organizations are Under Pressure
> Reduce costs and increase efficiency
> Improve quality and be first to market
> Make better and faster decisions
Applications becoming increasingly computationally intensive
Required to run more and more of these applications
Need to analyze more and more data
HPC can solve these problems and is now a required technology to stay competitive

Barriers to High Performance Computing
The P in HPC
> Technical limitations: system, storage, interconnect, complexity
Exploding Requirements
> Increasing fidelity of modeling and simulation
> Instruments that spit out PetaBytes of data
> Requirement for collaborative research
Complexity of Use
> Need reliable solutions that are easy to architect, deploy and use
> Space, power and cooling issues

Barriers to HPC Access
[Figure: Time to Load, Time to Compute and Time to Store compared for 2009 and 2011, under Exponential Growth]
You can only compute as fast as you can move the data
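The closing line of this slide can be illustrated with a simple serial model in which a job loads its input, computes, then stores its output. This is a sketch under stated assumptions: no overlap between I/O and compute, one shared bandwidth figure for load and store, and entirely hypothetical job sizes. The function name `job_time` is illustrative and does not come from the presentation.

```python
def job_time(input_gb, output_gb, compute_s, bandwidth_gb_s):
    """Serial model: load, then compute, then store (no overlap assumed).

    Returns (total seconds, fraction of the total spent moving data).
    """
    load_s = input_gb / bandwidth_gb_s
    store_s = output_gb / bandwidth_gb_s
    total_s = load_s + compute_s + store_s
    io_fraction = (load_s + store_s) / total_s
    return total_s, io_fraction


# Hypothetical job: 1,000 GB in, 1,000 GB out, 2 GB/s to storage.
baseline = job_time(1000, 1000, compute_s=600, bandwidth_gb_s=2)
faster_cpu = job_time(1000, 1000, compute_s=300, bandwidth_gb_s=2)
faster_io = job_time(1000, 1000, compute_s=600, bandwidth_gb_s=4)

for label, (total_s, io_fraction) in [
    ("baseline", baseline),
    ("compute twice as fast", faster_cpu),
    ("I/O twice as fast", faster_io),
]:
    print(f"{label:>22}: {total_s:6.0f} s total, {io_fraction:.0%} in I/O")
```

In this made-up case, doubling compute speed saves 300 s while doubling storage bandwidth saves 500 s, because data movement already dominates the run.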

Barriers to HPC: I/O Bottlenecks
Application Enemy #1
> Prevents applications from scaling
> Leads to poor overall application performance
Complex
> CPU? Memory? Storage? Interconnect? Application?
Removing I/O bottlenecks requires an end-to-end approach

A European survey (May 2009)
Where do researchers store their data
[Bar chart, share of respondents on a scale from 0 to 0.9; categories as listed on the slide:]
> External web service
> Other
> Don't store data
> Digital archive of discipline
> Digital archive of organisation
> Journal
> Computer at home
> Organisational server
> Portable storage carrier
> Computer at work
Source: PARSE Insight Interim report, May 2009
PARSE: Permanent Access to the Records of Science in Europe, EU funded project

The Information Infrastructure
The researcher acts through ingest and access
[Diagram labels: Creation, Archival, Access, Curation Services, The Body of Knowledge, Virtual Research Environment; Information Infrastructure: Network, Storage, Compute]
The researcher shouldn't have to worry about the information infrastructure

Current view: Distinct Infrastructures / Distinct User Experiences
[Diagram: Facility 1, Facility 2 and Facility 3 each run a separate chain: Raw, Analysis, Analysed, Publication, Publications]

Future view: Common Infrastructure / Common User Experience
[Diagram: Facility 1, Facility 2 and Facility 3 each follow the chain Raw, Analysis, Analysed, Publication, Publications, linked by shared catalogues (Raw Catalogue, Analysed Catalogue, Publication Catalogue, Publications Catalogue) over common layers: Capacity Storage, Standards/Converters, Repositories, Publications Repositories]

PARSE: Permanent Access to the Records of Science in Europe
European funded project, 2 years from 2008-2010
Closely linked with European Alliance for Permanent Access
Roadmap of Science Infrastructure
Based on UK's Digital Curation Centre
There is a need for a common European Storage Standard
UK's UK Research Storage Service: pilot funding just agreed

Data management and sharing: Big issues stated by the EU
Energy requirements
Who protects the data ad aeternum, as publications are linked
Terrorism
Nation speaking unto nation, or project interlinking with project
Lack of true large scale project management experience
Protectionism

Flash Technologies accelerate applications
Flash Facts
> CPU & memory ~ 260 times faster than disk drives
> One SSD provides IOPS equal to 100 disk drives at less than 1/500th of the power
Flash accelerates I/O, reduces job times and enables more work with less hardware...

Sun end-to-end infrastructure optimized to accelerate data centric HPC workflows
M9 QDR InfiniBand Network
Network Storage
> Sun Storage 7000 Unified Storage System
> High Availability, Manageability, Shared Access
> Home Directories, Application Code
Parallel Storage
> Sun Lustre Storage System
> High Performance Parallel File System
> Input, Results Files, Ongoing Computation
Archive Storage
> Sun StorageTek Tape Archives
> Economic, Green, Long Term Retention, Protection of IP Assets
> SAM Storage Archive Manager HSM

Petascale projects in the real world
TACC Ranger @ 579 TFLOPS
World's Largest General Purpose Compute Cluster
Sun Constellation System @
> X4500: 1.7 Petabytes, 72 GB/sec total bandwidth
> X4600: 25 systems, 800 cores
> Sun Blade 6048: 3,936 blades, 15,744 CPUs, 62,976 cores, 125 TB RAM
> Switch: 3,456, dual redundant, 110 Tb/sec bisectional bandwidth
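The Ranger figures on this slide are internally consistent, and a few divisions make the per-component scale easier to read. The sketch below uses only numbers from the slide and assumes decimal units (1 TB = 1,000 GB). The memory-dump time is an illustrative lower bound that treats the quoted 72 GB/sec as fully usable write bandwidth, which the slide does not state.

```python
# Figures quoted on the slide for TACC Ranger.
blades = 3_936
cpus = 15_744
cores = 62_976
peak_tflops = 579
ram_tb = 125
storage_bw_gb_s = 72

cpus_per_blade = cpus // blades               # sockets per blade
cores_per_cpu = cores // cpus                 # cores per socket
gflops_per_core = peak_tflops * 1000 / cores  # peak GFLOPS per core
ram_per_blade_gb = ram_tb * 1000 / blades     # decimal GB per blade

# Assumption: all memory is written once at the full quoted bandwidth.
dump_s = ram_tb * 1000 / storage_bw_gb_s

print(f"{cpus_per_blade} CPUs per blade, {cores_per_cpu} cores per CPU")
print(f"{gflops_per_core:.2f} peak GFLOPS per core")
print(f"{ram_per_blade_gb:.1f} GB RAM per blade")
print(f"full-memory dump: {dump_s:.0f} s (about {dump_s / 60:.0f} min)")
```

Even at the full quoted bandwidth, writing out all of memory once would take roughly half an hour, which is the "you can only compute as fast as you can move the data" point in concrete terms.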

INNOVATION MATTERS!!
Peta FLOP Computing key points
Compute density
> Flops/watt, TB/watt, GB/sec
I/O technologies
> In CPUs, on mother boards, on systems (Flash, SSDs, etc.)
> Data management technologies
> I/O technologies
Power and Cooling
> Density and efficiency
Management
> Hardware (provisioning, upgrade & monitoring)
> Software (OS, application and patching)
> Job (scheduling and monitoring)
> Data services (Scratch, archival, multi-site, etc.)
> People and procedures
Serviceability
Sun Confidential CDA Required

philippe.trautmann@sun.com
Sun Microsystems, Inc.
High Performance Computing Data Management: A European Perspective