Kinetic Open Storage Platform: Enabling Break-through Economics in Scale-out Object Storage PRESENTATION TITLE GOES HERE Ali Fenn & James Hughes

Size: px
Start display at page:

Download "Kinetic Open Storage Platform: Enabling Break-through Economics in Scale-out Object Storage PRESENTATION TITLE GOES HERE Ali Fenn & James Hughes"

Transcription

1 Kinetic Open Storage Platform: Enabling Break-through Economics in Scale-out Object Storage PRESENTATION TITLE GOES HERE Ali Fenn & James Hughes Seagate Technology

2 2020: 7.3 Zettabytes 56% of total = in the cloud 90% Unstructured data 6.5 Zettabytes of unstructured data stored in the cloud 2010: 100 Exabytes 2

3 How big is 6.5 Zettabytes? 1 inch If you stacked 4TB hard drives side by side, they would circle the earth 3

4 6.5 Zettabytes Using today s architecture, it could cost cloud data centers: >$240,000,000,000/ye ar in CAPEX + 1 Year OPEX 4

5 The Opportunity for Change How do we get there? Ecosystem = Open and Software- Defined HDFS CEP H Where do we go from here? 5

6 Look at Legacy Opportunity Server Storage Application File System DB POSIX File System Volume Manager Driver RAID 1986 POSIX 1988 NTFS s to 2000s XFS 1993 Storage Server RAID Battery Backed RAM CACHE FC SAS Devices SAS Interface SMR, Mapping Cylinder, Head, Sector Drive HDA 6

7 Storage to Enable New World Use Cases Application File System DB Standard Device Recording POSIX File System Volume Manager Driver Storage Server RAID Battery Backed RAM CACHE F C Devices SAS Interface SMR, Mapping Cylinder, Head, Sector Drive HDA SAS Key/Value Interface Ethernet Connectivity 7

8 The Kinetic Open Storage Platform Open Source Key/Value API and libraries Open Source Interface Specification Object storage software partners Systems partners Storage now fully disaggregated from compute 8

9 Simplifying Storage Advancements 4K Sector Transitions = Greater Agility Shingled Magnetic Recording Advanced Management Data Security 9

10 Performance Opportunities Performance Raw throughput IO utilization Data streams to drive as written Drive handles space mgmt. (no file system metadata) 10

11 TCO Impact Deploying a Kinetic based architecture could deliver: Up to 50% lower TCO 11

12 Single Drive 1 Additional Chip to Start 12

13 Libraries, API enable Applications Application Clustering Management Proprietary to System Vendor C++, Java, Python, Erlang, DIY Interconnect ProtoBuf TCP/IP/GbE GPL Standard Storage Proprietary to Seagate 13

14 Multiple Masters Application Clustering Management Proprietary to System Vendor Interconnect ProtoBuf TCP/IP/GbE GPL Standard Storage Proprietary to Seagate 14

15 Multiple Drives, P2P Operations Application Clustering Management Proprietary to System Vendor Interconnect ProtoBuf TCP/IP/GbE GPL Standard Storage Proprietary to Seagate 15

16 Goals of the Kinetic API Data movement Get/put/delete/getnext/getprevious Versioned (== for success), options Multiple masters Authentication/Integrity/Authorization Cluster-able Simple cluster configuration version enforcement 3rd party copy Management 16

17 Management (System Vendor) Configures the drive Network Authorized clients Monitors Health Statistics Logs Initiates recovery Change cluster version 3rdPartyCopy 17 17

18 Standard HDD Form Factor Connector re-pinned Two Ethernet connections Connector 18

19 System Implications No new ports Ethernet v. SAS switch 19

20 Network Implications No impact to Data Center Networking Traditional Architecture Kinetic Architecture 20

21 Performance Opportunities 21

22 Map of Operations 22

23 Performance Expectations Same normal performance expectations Sequential Write: 50 MB/s Random Write: 50 MB/s Sequential Read: 50 MB/s Random Read: 1.2x slower than traditional drives 23

24 Write Performance Results [PRELIMINARY] 24

25 Swift - Traditional 2 25

26 Swift - Kinetic 2 26

27 Swift - Kinetic 2 27

28 Basho Riak 2 28

29 Basho Riak - Kinetic 2 29

30 HDFS - Kinetic 30 30

31 Scality Kinetic Model Direct data path from clients to kinetic drives Native support for file, object and block Geo distribution across multiple sites Mix of replication and erasure coding Geo distributed metadata cluster 31

32 Kinetic Fits All Scale-Out Storage Object Storage Cloud Storage, Cloud backup, Cloud Archive / Cold Storage (Open Stack Swift, S3, Riak CS) Distributed File System Architectures Hadoop Distributed File System (HDFS), Google File System (GFS), Ceph, Windows Distributed File System (DFS), FhGFS, GlusterFS, Lustre Distributed Database and Memory Systems No SQL: Cassandra, Voldemort, Riak Memory: Memcached 32

33 Summary The Kinetic Open Storage Platform: Lowers TCO Disaggregates storage from compute Improves performance Increases innovation agility and efficiency More info at: developers.seagate.com 33

Seagate Kinetic Open Storage Platform. Mayur Shetty - Senior Solutions Architect

Seagate Kinetic Open Storage Platform. Mayur Shetty - Senior Solutions Architect Seagate Kinetic Open Storage Platform Mayur Shetty - Senior Solutions Architect 2 Application Clustering Management Interconnect App App LibKinetic App A D App No. 77103, LibKinetic effective Jan. 18,

More information

Kinetic drive. Bingzhe Li

Kinetic drive. Bingzhe Li Kinetic drive Bingzhe Li Consumption has changed It s an object storage world, unprecedented growth and scale In total, a complete redefinition of the storage stack https://www.openstack.org/summit/openstack-summit-atlanta-2014/session-videos/presentation/casestudy-seagate-kinetic-platform-in-action

More information

At-Scale Data Centers & Demand for New Architectures

At-Scale Data Centers & Demand for New Architectures Allen Samuels At-Scale Data Centers & Demand for New Architectures Software Architect, Software and Systems Solutions June 4, 2015 1 Forward-Looking Statements During our meeting today we may make forward-looking

More information

Storage for HPC, HPDA and Machine Learning (ML)

Storage for HPC, HPDA and Machine Learning (ML) for HPC, HPDA and Machine Learning (ML) Frank Kraemer, IBM Systems Architect mailto:kraemerf@de.ibm.com IBM Data Management for Autonomous Driving (AD) significantly increase development efficiency by

More information

Quobyte The Data Center File System QUOBYTE INC.

Quobyte The Data Center File System QUOBYTE INC. Quobyte The Data Center File System QUOBYTE INC. The Quobyte Data Center File System All Workloads Consolidate all application silos into a unified highperformance file, block, and object storage (POSIX

More information

November 7, DAN WILSON Global Operations Architecture, Concur. OpenStack Summit Hong Kong JOE ARNOLD

November 7, DAN WILSON Global Operations Architecture, Concur. OpenStack Summit Hong Kong JOE ARNOLD November 7, 2013 DAN WILSON Global Operations Architecture, Concur dan.wilson@concur.com @tweetdanwilson OpenStack Summit Hong Kong JOE ARNOLD CEO, SwiftStack joe@swiftstack.com @joearnold Introduction

More information

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu

Database Architecture 2 & Storage. Instructor: Matei Zaharia cs245.stanford.edu Database Architecture 2 & Storage Instructor: Matei Zaharia cs245.stanford.edu Summary from Last Time System R mostly matched the architecture of a modern RDBMS» SQL» Many storage & access methods» Cost-based

More information

Introducing SUSE Enterprise Storage 5

Introducing SUSE Enterprise Storage 5 Introducing SUSE Enterprise Storage 5 1 SUSE Enterprise Storage 5 SUSE Enterprise Storage 5 is the ideal solution for Compliance, Archive, Backup and Large Data. Customers can simplify and scale the storage

More information

Deep Storage for Exponential Data. Nathan Thompson CEO, Spectra Logic

Deep Storage for Exponential Data. Nathan Thompson CEO, Spectra Logic Deep Storage for Exponential Data Nathan Thompson CEO, Spectra Logic HISTORY Partnered with Fujifilm on a variety of projects HQ in Boulder, 35 years of business Customers in 54 countries Spectra builds

More information

SMORE: A Cold Data Object Store for SMR Drives

SMORE: A Cold Data Object Store for SMR Drives SMORE: A Cold Data Object Store for SMR Drives Peter Macko, Xiongzi Ge, John Haskins Jr.*, James Kelley, David Slik, Keith A. Smith, and Maxim G. Smith Advanced Technology Group NetApp, Inc. * Qualcomm

More information

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage Silverton Consulting, Inc. StorInt Briefing 2017 SILVERTON CONSULTING, INC. ALL RIGHTS RESERVED Page 2 Introduction Unstructured data has

More information

GlusterFS Architecture & Roadmap

GlusterFS Architecture & Roadmap GlusterFS Architecture & Roadmap Vijay Bellur GlusterFS co-maintainer http://twitter.com/vbellur Agenda What is GlusterFS? Architecture Integration Use Cases Future Directions Challenges Q&A What is GlusterFS?

More information

Hadoop An Overview. - Socrates CCDH

Hadoop An Overview. - Socrates CCDH Hadoop An Overview - Socrates CCDH What is Big Data? Volume Not Gigabyte. Terabyte, Petabyte, Exabyte, Zettabyte - Due to handheld gadgets,and HD format images and videos - In total data, 90% of them collected

More information

SCS Distributed File System Service Proposal

SCS Distributed File System Service Proposal SCS Distributed File System Service Proposal Project Charter: To cost effectively build a Distributed networked File Service (DFS) that can grow to Petabyte scale, customized to the size and performance

More information

EMC ISILON HARDWARE PLATFORM

EMC ISILON HARDWARE PLATFORM EMC ISILON HARDWARE PLATFORM Three flexible product lines that can be combined in a single file system tailored to specific business needs. S-SERIES Purpose-built for highly transactional & IOPSintensive

More information

Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale

Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale What You Will Learn Cisco and Scality provide a joint solution for storing and protecting file, object, and OpenStack data at

More information

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO Agenda Technical challenge Custom product Growth of aspirations Enterprise requirements Making an enterprise cold storage product 2 Technical Challenge

More information

Why software defined storage matters? Sergey Goncharov Solution Architect, Red Hat

Why software defined storage matters? Sergey Goncharov Solution Architect, Red Hat Why software defined storage matters? Sergey Goncharov Solution Architect, Red Hat sgonchar@redhat.com AGENDA Storage and Datacenter evolution Red Hat Storage portfolio Red Hat Gluster Storage Red Hat

More information

An Exploration into Object Storage for Exascale Supercomputers. Raghu Chandrasekar

An Exploration into Object Storage for Exascale Supercomputers. Raghu Chandrasekar An Exploration into Object Storage for Exascale Supercomputers Raghu Chandrasekar Agenda Introduction Trends and Challenges Design and Implementation of SAROJA Preliminary evaluations Summary and Conclusion

More information

Emerging Technologies for HPC Storage

Emerging Technologies for HPC Storage Emerging Technologies for HPC Storage Dr. Wolfgang Mertz CTO EMEA Unstructured Data Solutions June 2018 The very definition of HPC is expanding Blazing Fast Speed Accessibility and flexibility 2 Traditional

More information

CEPH APPLIANCE Take a leap into the next generation of enterprise storage

CEPH APPLIANCE Take a leap into the next generation of enterprise storage CEPH APPLIANCE Take a leap into the next generation of enterprise storage 1 Software Defined Storage An unstoppable trend Riding on the mega wave of cloud computing, big data analytics, and Internet of

More information

Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures. 13 November 2016

Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures. 13 November 2016 National Aeronautics and Space Administration Data Analytics and Storage System (DASS) Mixing POSIX and Hadoop Architectures 13 November 2016 Carrie Spear (carrie.e.spear@nasa.gov) HPC Architect/Contractor

More information

IP-Based Object Drives Now Have a Management Standard

IP-Based Object Drives Now Have a Management Standard IP-Based Object Drives Now Have a Management Standard Live Webcast April 20, 2017 10:00 am PT Today s Presenters Enrico Signoretti OpenIO David Slik Co-Chair, SNIA Cloud TWG NetApp Erik Riedel Dell EMC

More information

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017

UNIFY DATA AT MEMORY SPEED. Haoyuan (HY) Li, Alluxio Inc. VAULT Conference 2017 UNIFY DATA AT MEMORY SPEED Haoyuan (HY) Li, CEO @ Alluxio Inc. VAULT Conference 2017 March 2017 HISTORY Started at UC Berkeley AMPLab In Summer 2012 Originally named as Tachyon Rebranded to Alluxio in

More information

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics

Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Increase Value from Big Data with Real-Time Data Integration and Streaming Analytics Cy Erbay Senior Director Striim Executive Summary Striim is Uniquely Qualified to Solve the Challenges of Real-Time

More information

BIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE Bertrand OUNANIAN: Advisory System Engineer

BIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE Bertrand OUNANIAN: Advisory System Engineer BIG DATA READY WITH ISILON JEUDI 19 NOVEMBRE 2015 Bertrand OUNANIAN: Advisory System Engineer Unstructured Data Growth Total Capacity Shipped Worldwide % of Unstructured Data 67% 74% 80% 2013 37 EB 2015

More information

EsgynDB Enterprise 2.0 Platform Reference Architecture

EsgynDB Enterprise 2.0 Platform Reference Architecture EsgynDB Enterprise 2.0 Platform Reference Architecture This document outlines a Platform Reference Architecture for EsgynDB Enterprise, built on Apache Trafodion (Incubating) implementation with licensed

More information

How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera,

How Apache Hadoop Complements Existing BI Systems. Dr. Amr Awadallah Founder, CTO Cloudera, How Apache Hadoop Complements Existing BI Systems Dr. Amr Awadallah Founder, CTO Cloudera, Inc. Twitter: @awadallah, @cloudera 2 The Problems with Current Data Systems BI Reports + Interactive Apps RDBMS

More information

HDFS: Hadoop Distributed File System. Sector: Distributed Storage System

HDFS: Hadoop Distributed File System. Sector: Distributed Storage System GFS: Google File System Google C/C++ HDFS: Hadoop Distributed File System Yahoo Java, Open Source Sector: Distributed Storage System University of Illinois at Chicago C++, Open Source 2 System that permanently

More information

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018 Cloud Computing and Hadoop Distributed File System UCSB CS70, Spring 08 Cluster Computing Motivations Large-scale data processing on clusters Scan 000 TB on node @ 00 MB/s = days Scan on 000-node cluster

More information

BeoLink.org. Design and build an inexpensive DFS. Fabrizio Manfredi Furuholmen. FrOSCon August 2008

BeoLink.org. Design and build an inexpensive DFS. Fabrizio Manfredi Furuholmen. FrOSCon August 2008 Design and build an inexpensive DFS Fabrizio Manfredi Furuholmen FrOSCon August 2008 Agenda Overview Introduction Old way openafs New way Hadoop CEPH Conclusion Overview Why Distributed File system? Handle

More information

Ceph Intro & Architectural Overview. Abbas Bangash Intercloud Systems

Ceph Intro & Architectural Overview. Abbas Bangash Intercloud Systems Ceph Intro & Architectural Overview Abbas Bangash Intercloud Systems About Me Abbas Bangash Systems Team Lead, Intercloud Systems abangash@intercloudsys.com intercloudsys.com 2 CLOUD SERVICES COMPUTE NETWORK

More information

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype?

Nowcasting. D B M G Data Base and Data Mining Group of Politecnico di Torino. Big Data: Hype or Hallelujah? Big data hype? Big data hype? Big Data: Hype or Hallelujah? Data Base and Data Mining Group of 2 Google Flu trends On the Internet February 2010 detected flu outbreak two weeks ahead of CDC data Nowcasting http://www.internetlivestats.com/

More information

Crossing the Chasm: Sneaking a parallel file system into Hadoop

Crossing the Chasm: Sneaking a parallel file system into Hadoop Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large

More information

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples

Topics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?

More information

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING THE EASIEST WAY TO INCREASE PERFORMANCE AND LOWER STORAGE COSTS Bruce Kornfeld, Chief Marketing Officer, StorMagic Luke Pruen, Technical

More information

BIG DATA AND HADOOP ON THE ZFS STORAGE APPLIANCE

BIG DATA AND HADOOP ON THE ZFS STORAGE APPLIANCE BIG DATA AND HADOOP ON THE ZFS STORAGE APPLIANCE BRETT WENINGER, MANAGING DIRECTOR 10/21/2014 ADURANT APPROACH TO BIG DATA Align to Un/Semi-structured Data Instead of Big Scale out will become Big Greatest

More information

Data Movement & Tiering with DMF 7

Data Movement & Tiering with DMF 7 Data Movement & Tiering with DMF 7 Kirill Malkin Director of Engineering April 2019 Why Move or Tier Data? We wish we could keep everything in DRAM, but It s volatile It s expensive Data in Memory 2 Why

More information

RAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System

RAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System RAIDIX Data Storage Solution Clustered Data Storage Based on the RAIDIX Software and GPFS File System 2017 Contents Synopsis... 2 Introduction... 3 Challenges and the Solution... 4 Solution Architecture...

More information

New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO

New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO Agenda Company profile and background Business and Users needs OpenIO approach Competition Conclusion Company profile

More information

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu Deploying Software Defined Storage for the Enterprise with Ceph PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu Agenda Yet another attempt to define SDS Quick Overview of Ceph from a SDS perspective

More information

Distributed Storage with GlusterFS

Distributed Storage with GlusterFS Distributed Storage with GlusterFS Dr. Udo Seidel Linux-Strategy @ Amadeus OSDC 2013 1 Agenda Introduction High level overview Storage inside Use cases Summary OSDC 2013 2 Introduction OSDC 2013 3 Me ;-)

More information

Next Generation Storage for The Software-Defned World

Next Generation Storage for The Software-Defned World ` Next Generation Storage for The Software-Defned World John Hofer Solution Architect Red Hat, Inc. BUSINESS PAINS DEMAND NEW MODELS CLOUD ARCHITECTURES PROPRIETARY/TRADITIONAL ARCHITECTURES High up-front

More information

TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1

TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1 TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1 ABSTRACT This introductory white paper provides a technical overview of the new and improved enterprise grade features introduced

More information

Scale-Out backups with Bareos and Gluster. Niels de Vos Gluster co-maintainer Red Hat Storage Developer

Scale-Out backups with Bareos and Gluster. Niels de Vos Gluster co-maintainer Red Hat Storage Developer Scale-Out backups with Bareos and Gluster Niels de Vos Gluster co-maintainer Red Hat Storage Developer ndevos@redhat.com Agenda Gluster integration in Bareos Introduction into GlusterFS Quick Start Example

More information

DataDirect Networks steps up object storage push with WOS refresh

DataDirect Networks steps up object storage push with WOS refresh DataDirect Networks steps up object storage push with WOS refresh Analyst: Simon Robinson 10 Jul, 2015 Next month, DataDirect Networks (DDN) will release the latest version of its Web Object Scalar (WOS)

More information

Flash Storage with 24G SAS Leads the Way in Crunching Big Data

Flash Storage with 24G SAS Leads the Way in Crunching Big Data Flash Storage with 24G SAS Leads the Way in Crunching Big Data SCSI Trade Association August 8th, 2018 1 Today s Panel Dennis Martin Founder and President Demartek Mohamad El-Batal Sr. Director of Architecture,

More information

Enterprise Architectures The Pace Accelerates Camberley Bates Managing Partner & Analyst

Enterprise Architectures The Pace Accelerates Camberley Bates Managing Partner & Analyst Enterprise Architectures The Pace Accelerates Camberley Bates Managing Partner & Analyst Change is constant in IT.But some changes alter forever the way we do things Inflections & Architectures Solid State

More information

HPE Storage news. Mauro Colombo Hybrid IT sales & presales manager 23 rd May 2018

HPE Storage news. Mauro Colombo Hybrid IT sales & presales manager 23 rd May 2018 HPE Storage news Mauro Colombo Hybrid IT sales & presales manager 23 rd May 2018 1 Agenda 1. It s all about flash 2. HPE Primary storage news Nimble Gen5 3PAR OS 3.3.1 HPE Infosight for 3PAR 3. HPE solutions

More information

INTRODUCTION TO CEPH. Orit Wasserman Red Hat August Penguin 2017

INTRODUCTION TO CEPH. Orit Wasserman Red Hat August Penguin 2017 INTRODUCTION TO CEPH Orit Wasserman Red Hat August Penguin 2017 CEPHALOPOD A cephalopod is any member of the molluscan class Cephalopoda. These exclusively marine animals are characterized by bilateral

More information

TITLE: PRE-REQUISITE THEORY. 1. Introduction to Hadoop. 2. Cluster. Implement sort algorithm and run it using HADOOP

TITLE: PRE-REQUISITE THEORY. 1. Introduction to Hadoop. 2. Cluster. Implement sort algorithm and run it using HADOOP TITLE: Implement sort algorithm and run it using HADOOP PRE-REQUISITE Preliminary knowledge of clusters and overview of Hadoop and its basic functionality. THEORY 1. Introduction to Hadoop The Apache Hadoop

More information

VMware Virtual SAN Technology

VMware Virtual SAN Technology VMware Virtual SAN Technology Today s Agenda 1 Hyper-Converged Infrastructure Architecture & Vmware Virtual SAN Overview 2 Why VMware Hyper-Converged Software? 3 VMware Virtual SAN Advantage Today s Agenda

More information

BIG DATA STRATEGY FOR TODAY AND TOMORROW RYAN SAYRE, EMC ISILON CTO-AT-LARGE, EMEA

BIG DATA STRATEGY FOR TODAY AND TOMORROW RYAN SAYRE, EMC ISILON CTO-AT-LARGE, EMEA BIG DATA STRATEGY FOR TODAY AND TOMORROW RYAN SAYRE, EMC ISILON CTO-AT-LARGE, EMEA 1 Unstructured Data Growth 67% 74% 80% 2013 2015 2017 37 EB 71 EB 133 EB Total Capacity Shipped, Worldwide Unstructured

More information

BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved.

BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved. BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST 1 UNSTRUCTURED DATA GROWTH 75% 78% 80% 2015 71 EB 2016 106 EB 2017 133 EB Total Capacity Shipped, Worldwide % of Unstructured Data

More information

Big Data in OpenStack Storage

Big Data in OpenStack Storage Big Data in OpenStack Storage Ivan Tomašić, Aleksandra Rashkovska, Matjaž Depolli, Roman Trobec Department of Communication Systems Jožef Stefan Institute, Ljubljana, Slovenia Outline Introduction Swift

More information

Crossing the Chasm: Sneaking a parallel file system into Hadoop

Crossing the Chasm: Sneaking a parallel file system into Hadoop Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large

More information

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools

Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools SAP Technical Brief Data Warehousing SAP HANA Data Warehousing Combine Native SQL Flexibility with SAP HANA Platform Performance and Tools A data warehouse for the modern age Data warehouses have been

More information

Backtesting with Spark

Backtesting with Spark Backtesting with Spark Patrick Angeles, Cloudera Sandy Ryza, Cloudera Rick Carlin, Intel Sheetal Parade, Intel 1 Traditional Grid Shared storage Storage and compute scale independently Bottleneck on I/O

More information

HPE Synergy HPE SimpliVity 380

HPE Synergy HPE SimpliVity 380 HPE Synergy HPE SimpliVity 0 Pascal.Moens@hpe.com, Solutions Architect Technical Partner Lead February 0 HPE Synergy Composable infrastructure at HPE CPU Memory Local Storage LAN I/O SAN I/O Power Cooling

More information

CA485 Ray Walshe Google File System

CA485 Ray Walshe Google File System Google File System Overview Google File System is scalable, distributed file system on inexpensive commodity hardware that provides: Fault Tolerance File system runs on hundreds or thousands of storage

More information

DEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies!

DEMYSTIFYING BIG DATA WITH RIAK USE CASES. Martin Schneider Basho Technologies! DEMYSTIFYING BIG DATA WITH RIAK USE CASES Martin Schneider Basho Technologies! Agenda Defining Big Data in Regards to Riak A Series of Trade-Offs Use Cases Q & A About Basho & Riak Basho Technologies is

More information

5 Fundamental Strategies for Building a Data-centered Data Center

5 Fundamental Strategies for Building a Data-centered Data Center 5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse

More information

NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory

NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory Dhananjoy Das, Sr. Systems Architect SanDisk Corp. 1 Agenda: Applications are KING! Storage landscape (Flash / NVM)

More information

Functional Testing of SQL Server on Kaminario K2 Storage

Functional Testing of SQL Server on Kaminario K2 Storage Functional Testing of SQL Server on Kaminario K2 Storage September 2016 TABLE OF CONTENTS 2 3 4 11 12 14 Executive Summary Introduction to Kaminario K2 Functionality Tests for SQL Server Summary Appendix:

More information

Introduction to Scientific Data Management

Introduction to Scientific Data Management Introduction to Scientific Data Management damien.francois@uclouvain.be November 2017 http://www.cism.ucl.ac.be/training 1 http://www.cism.ucl.ac.be/training Goal of this session: Share tools, tips and

More information

Introduction to Scientific Data Management

Introduction to Scientific Data Management Introduction to Scientific Data Management damien.francois@uclouvain.be October 2015 http://www.cism.ucl.ac.be/training 1 http://www.cism.ucl.ac.be/training Goal of this session: Share tools, tips and

More information

Provisioning with SUSE Enterprise Storage. Nyers Gábor Trainer &

Provisioning with SUSE Enterprise Storage. Nyers Gábor Trainer & Provisioning with SUSE Enterprise Storage Nyers Gábor Trainer & Consultant @Trebut gnyers@trebut.com Managing storage growth and costs of the software-defined datacenter PRESENT Easily scale and manage

More information

IBM Storwize V7000, Storwize V5000 and IBM Storwize V5000F

IBM Storwize V7000, Storwize V5000 and IBM Storwize V5000F IBM Storwize V7000, Storwize V5000 and IBM Storwize V5000F Presenter s Name Presenter s Title DD Month Year Multiply your possibilities Changing Nature of Business and Data Support Data-Aware Applications

More information

New Approach to Unstructured Data

New Approach to Unstructured Data Innovations in All-Flash Storage Deliver a New Approach to Unstructured Data Table of Contents Developing a new approach to unstructured data...2 Designing a new storage architecture...2 Understanding

More information

Universal Storage. Innovation to Break Decades of Tradeoffs VASTDATA.COM

Universal Storage. Innovation to Break Decades of Tradeoffs VASTDATA.COM Universal Storage Innovation to Break Decades of Tradeoffs F e b r u a r y 2 0 1 9 AN END TO DECADES OF STORAGE COMPLEXITY AND COMPROMISE SUMMARY When it s possible to store all of your data in a single

More information

Warehouse- Scale Computing and the BDAS Stack

Warehouse- Scale Computing and the BDAS Stack Warehouse- Scale Computing and the BDAS Stack Ion Stoica UC Berkeley UC BERKELEY Overview Workloads Hardware trends and implications in modern datacenters BDAS stack What is Big Data used For? Reports,

More information

Modernize Your Storage

Modernize Your Storage Modernize Your Storage Kurt Bilderback VP Global Markets Software Defined Storage Slovenia Think CityName / DOC ID / Month XX, 2018 / 2018 IBM Corporation We have for the first time, an economy based on

More information

Software Defined Storage for the Evolving Data Center

Software Defined Storage for the Evolving Data Center Software Defined Storage for the Evolving Data Center Petter Sveum Information Availability Solution Lead EMEA Technology Practice ATTENTION Forward-looking Statements: Any forward-looking indication of

More information

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside. MODERNISE WITH ALL-FLASH Intel Inside. Powerful Data Centre Outside. MODERNISE WITHOUT COMPROMISE In today s lightning-fast digital world, it s critical for businesses to make their move to the Modern

More information

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public Data Protection for Cisco HyperFlex with Veeam Availability Suite 1 2017 2017 Cisco Cisco and/or and/or its affiliates. its affiliates. All rights All rights reserved. reserved. Highlights Is Cisco compatible

More information

Stages of Data Processing

Stages of Data Processing Data processing can be understood as the conversion of raw data into a meaningful and desired form. Basically, producing information that can be understood by the end user. So then, the question arises,

More information

A BigData Tour HDFS, Ceph and MapReduce

A BigData Tour HDFS, Ceph and MapReduce A BigData Tour HDFS, Ceph and MapReduce These slides are possible thanks to these sources Jonathan Drusi - SCInet Toronto Hadoop Tutorial, Amir Payberah - Course in Data Intensive Computing SICS; Yahoo!

More information

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block

More information

Cisco UCS B440 M1High-Performance Blade Server

Cisco UCS B440 M1High-Performance Blade Server Cisco UCS B440 M1 High-Performance Blade Server Product Overview The Cisco UCS B440 M1 High-Performance Blade Server delivers the performance and reliability to power compute-intensive, enterprise-critical

More information

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team Isilon: Raising The Bar On Performance & Archive Use Cases John Har Solutions Product Manager Unstructured Data Storage Team What we ll cover in this session Isilon Overview Streaming workflows High ops/s

More information

DDN s Vision for the Future of Lustre LUG2015 Robert Triendl

DDN s Vision for the Future of Lustre LUG2015 Robert Triendl DDN s Vision for the Future of Lustre LUG2015 Robert Triendl 3 Topics 1. The Changing Markets for Lustre 2. A Vision for Lustre that isn t Exascale 3. Building Lustre for the Future 4. Peak vs. Operational

More information

SwiftStack and python-swiftclient

SwiftStack and python-swiftclient SwiftStack and python-swiftclient Summary Storage administrators around the world are turning to object storage and particularly SwiftStack s enterprise-ready object storage built on OpenStack Swift for

More information

朱义普. Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration. Director, North Asia, HPC

朱义普. Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration. Director, North Asia, HPC October 28, 2013 Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration 朱义普 Director, North Asia, HPC DDN Storage Vendor for HPC & Big Data

More information

SAS Technical Update Connectivity Roadmap and MultiLink SAS Initiative Jay Neer Molex Corporation Marty Czekalski Seagate Technology LLC

SAS Technical Update Connectivity Roadmap and MultiLink SAS Initiative Jay Neer Molex Corporation Marty Czekalski Seagate Technology LLC SAS Technical Update Connectivity Roadmap and MultiLink SAS Initiative Jay Neer Molex Corporation Marty Czekalski Seagate Technology LLC SAS Connectivity Roadmap Background Connectivity Objectives Converged

More information

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini White Paper Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini February 2015 2015 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 1 of 9 Contents

More information

Cisco and Cloudera Deliver WorldClass Solutions for Powering the Enterprise Data Hub alerts, etc. Organizations need the right technology and infrastr

Cisco and Cloudera Deliver WorldClass Solutions for Powering the Enterprise Data Hub alerts, etc. Organizations need the right technology and infrastr Solution Overview Cisco UCS Integrated Infrastructure for Big Data and Analytics with Cloudera Enterprise Bring faster performance and scalability for big data analytics. Highlights Proven platform for

More information

Software Development Using Full System Simulation with Freescale QorIQ Communications Processors

Software Development Using Full System Simulation with Freescale QorIQ Communications Processors Patrick Keliher, Simics Field Application Engineer Software Development Using Full System Simulation with Freescale QorIQ Communications Processors 1 2013 Wind River. All Rights Reserved. Agenda Introduction

More information

HCI: Hyper-Converged Infrastructure

HCI: Hyper-Converged Infrastructure Key Benefits: Innovative IT solution for high performance, simplicity and low cost Complete solution for IT workloads: compute, storage and networking in a single appliance High performance enabled by

More information

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director

Webinar Series: Triangulate your Storage Architecture with SvSAN Caching. Luke Pruen Technical Services Director Webinar Series: Triangulate your Storage Architecture with SvSAN Caching Luke Pruen Technical Services Director What can you expect from this webinar? To answer a simple question How can I create the perfect

More information

A product by CloudFounders. Wim Provoost Open vstorage

A product by CloudFounders. Wim Provoost Open vstorage A product by CloudFounders Wim Provoost (@wimpers_be) Open vstorage (@openvstorage) http://www.openvstorage.com CloudFounders vrun Converged infrastructure that combines the benefits of the hyperconverged

More information

2014 年 3 月 13 日星期四. From Big Data to Big Value Infrastructure Needs and Huawei Best Practice

2014 年 3 月 13 日星期四. From Big Data to Big Value Infrastructure Needs and Huawei Best Practice 2014 年 3 月 13 日星期四 From Big Data to Big Value Infrastructure Needs and Huawei Best Practice Data-driven insight Making better, more informed decisions, faster Raw Data Capture Store Process Insight 1 Data

More information

How to solve your backup problems with HP StoreOnce

How to solve your backup problems with HP StoreOnce How to solve your backup problems with HP StoreOnce Andrew Dickerson Senior Manager Backup, Recovery and Archive June 25, 2014 Time is running out for legacy storage Cloud Volume 2005 2010 2012 2015 50X

More information

Software Defined Storage

Software Defined Storage Software Defined Storage Riyadh Forum 2016 By: Ayman El Marazky Sr. System Engineer software defined solutions The CIO Dilemma IT Budget $ By 2020, the digital universe of data will go to 44 ZB Modern

More information

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd.

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd. A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd. 1 Agenda Introduction Background and Motivation Hybrid Key-Value Data Store Architecture Overview Design details Performance

More information

System that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files

System that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files System that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files Addressable by a filename ( foo.txt ) Usually supports hierarchical

More information

High-Performance and Large-Capacity Storage: A Winning Combination for Future Data Centers. Phil Brace August 12, 2015

High-Performance and Large-Capacity Storage: A Winning Combination for Future Data Centers. Phil Brace August 12, 2015 High-Performance and Large-Capacity Storage: A Winning Combination for Future Data Centers Phil Brace August 12, 2015 Data is Changing Bigger Different $ Constrained Zettabytes 45 40 35 30 25 20 15 10

More information

Automating Information Lifecycle Management with

Automating Information Lifecycle Management with Automating Information Lifecycle Management with Oracle Database 2c The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated

More information

Introduction to MapReduce

Introduction to MapReduce Basics of Cloud Computing Lecture 4 Introduction to MapReduce Satish Srirama Some material adapted from slides by Jimmy Lin, Christophe Bisciglia, Aaron Kimball, & Sierra Michels-Slettvet, Google Distributed

More information

The Datacentered Future Greg Huff CTO, LSI Corporation

The Datacentered Future Greg Huff CTO, LSI Corporation The Datacentered Future Greg Huff CTO, LSI Corporation 1 Tremendous Growth in Connected Data Sources, Consumption Devices, and Services 2 Nearly limitless data depth and breadth needed Execution of millions

More information

The Google File System. Alexandru Costan

The Google File System. Alexandru Costan 1 The Google File System Alexandru Costan Actions on Big Data 2 Storage Analysis Acquisition Handling the data stream Data structured unstructured semi-structured Results Transactions Outline File systems

More information