SNIA Tutorial 3 EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE: Part Teal Queues, Caches and Buffers

Similar documents
Everything You Wanted To Know About Storage: Part Teal The Buffering Pod

SNIA Tutorial 1 A CASE FOR FLASH STORAGE HOW TO CHOOSE FLASH STORAGE FOR YOUR APPLICATIONS

SNIA Tutorial 2 EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK:

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Part Cyan Storage Management. September 28, :00 am PT

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Where Does My Data Go? August 1, :00 am PT

Extending RDMA for Persistent Memory over Fabrics. Live Webcast October 25, 2018

Mark Rogov, Dell EMC Chris Conniff, Dell EMC. Feb 14, 2018

Architectural Principles for Networked Solid State Storage Access

Use Cases for iscsi and FCoE: Where Each Makes Sense

Expert Insights: Expanding the Data Center with FCoE

Everything You Wanted To Know About Storage (But Were Too Proud To Ask) The Basics

iscsi : A loss-less Ethernet fabric with DCB Jason Blosil, NetApp Gary Gumanow, Dell

Virtualization Practices:

Fibre Channel vs. iscsi. January 31, 2018

Jeff Dodson / Avago Technologies

Multi-Cloud Storage: Addressing the Need for Portability and Interoperability

ETHERNET ENHANCEMENTS FOR STORAGE. Sunil Ahluwalia, Intel Corporation

Virtualization Practices: Providing a Complete Virtual Solution in a Box

Ron Emerick, Oracle Corporation

Overview and Current Topics in Solid State Storage

Overview and Current Topics in Solid State Storage

The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

Notes & Lessons Learned from a Field Engineer. Robert M. Smith, Microsoft

Adrian Proctor Vice President, Marketing Viking Technology

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

NVMe Over Fabrics (NVMe-oF)

Trends in Worldwide Media and Entertainment Storage

LTFS Bulk Transfer Standard PRESENTATION TITLE GOES HERE

Apples to Apples, Pears to Pears in SSS performance Benchmarking. Esther Spanjer, SMART Modular

Tiered File System without Tiers. Laura Shepard, Isilon

SRM: Can You Get What You Want? John Webster, Evaluator Group.

Planning For Persistent Memory In The Data Center. Sarah Jelinek/Intel Corporation

What NVMe /TCP Means for Networked Storage. Live Webcast January 22, 2019

RoCE vs. iwarp A Great Storage Debate. Live Webcast August 22, :00 am PT

Design and Implementations of FCoE for the DataCenter. Mike Frase, Cisco Systems

Facing an SSS Decision? SNIA Efforts to Evaluate SSS Performance. Ray Lucchesi Silverton Consulting, Inc.

Storage Performance Management Overview. Brett Allison, IntelliMagic, Inc.

SRM: Can You Get What You Want? John Webster Principal IT Advisor, Illuminata

The Role of WAN Optimization in Cloud Infrastructures. Josh Tseng, Riverbed

Mobile and Secure Healthcare: Encrypted Objects and Access Control Delegation

How to create a synthetic workload test. Eden Kim, CEO Calypso Systems, Inc.

ADVANCED DATA REDUCTION CONCEPTS

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Data Deduplication Methods for Achieving Data Efficiency

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

ETHERNET ENHANCEMENTS FOR STORAGE. Sunil Ahluwalia, Intel Corporation Errol Roberts, Cisco Systems Inc.

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

Felix Xavier CloudByte Inc.

PRESENTATION TITLE GOES HERE. Understanding Architectural Trade-offs in Object Storage Technologies

Tutorial. A New Standard for IP Based Drive Management. Mark Carlson SNIA Technical Council Co-Chair

Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

Storage Virtualization II Effective Use of Virtualization - focusing on block virtualization -

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

Felix Xavier CloudByte Inc.

WHAT HAPPENS WHEN THE FLASH INDUSTRY GOES TO TLC? Luanne M. Dauber, Pure Storage

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

Optimize Storage Efficiency & Performance with Erasure Coding Hardware Offload. Dror Goldenberg VP Software Architecture Mellanox Technologies

Deploying Public, Private, and Hybrid. Storage Cloud Environments

The File Systems Evolution. Christian Bandulet, Sun Microsystems

Analysis and Optimization. Carl Waldspurger Irfan Ahmad CloudPhysics, Inc.

SCSI Security Nuts and Bolts. Ralph Weber, ENDL Texas

Performance and Innovation of Storage. Advances through SCSI Express

InfiniBand Networked Flash Storage

Baidu s Best Practice with Low Latency Networks

Application Recovery. Andreas Schwegmann / HP

Storage Virtualization II. - focusing on block virtualization -

in Transition to the Cloud David A. Chapa, CTE EVault, a Seagate Company

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM

ASPECTS OF DEDUPLICATION. Dominic Kay, Oracle Mark Maybee, Oracle

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

Persistent Memory over Fabrics

NVMe over Fabrics. High Performance SSDs networked over Ethernet. Rob Davis Vice President Storage Technology, Mellanox

Server and Storage Consolidation with iscsi Arrays. David Dale, NetApp Suzanne Morgan, Microsoft

Data Center Networks. Joseph L White, Juniper Networks

Evolution of Fibre Channel. Mark Jones FCIA / Emulex Corporation

Blockchain Beyond Bitcoin. Mark O Connell

SAS: Today s Fast and Flexible Storage Fabric. Rick Kutcipal President, SCSI Trade Association Product Planning and Architecture, Broadcom Limited

pnfs, parallel storage for grid and enterprise computing Joshua Konkle, NetApp, Inc.

Cloud Archive and Long Term Preservation Challenges and Best Practices

Cisco Data Center Ethernet

How NAS Systems Participate in Data Protection. Paul Massiglia, Symantec Corporation

SAS: Today s Fast and Flexible Storage Fabric

FIBRE CHANNEL OVER ETHERNET

The Evolution of File Systems

Architecting For Availability, Performance & Networking With ScaleIO

Revisiting Network Support for RDMA

PMEM Software and Applications

Chapter 13 TRANSPORT. Mobile Computing Winter 2005 / Overview. TCP Overview. TCP slow-start. Motivation Simple analysis Various TCP mechanisms

SNIA Developers Conference - Growth of the iscsi RDMA (iser) Ecosystem

Persistent Memory Over Fabrics. Paul Grun, Cray Inc Stephen Bates, Eideticom Rob Davis, Mellanox Technologies

Storage in combined service/product data infrastructures. Craig Dunwoody CTO, GraphStream Incorporated

Restoration Technologies

Router s Queue Management

OpenStack Manila An Overview of Manila Liberty & Mitaka

Network Performance: Queuing

Audience This paper is targeted for IT managers and architects. It showcases how to utilize your network efficiently and gain higher performance using

EXPERIENCES WITH NVME OVER FABRICS

Dell EMC Networking Deploying Data Center Bridging (DCB)

NVMe SSDs with Persistent Memory Regions

Configuring Queuing and Flow Control

Transcription:

SNIA Tutorial 3 EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE: Part Teal Queues, Caches and Buffers John Kim, Mellanox - @Tier1Storage J Metz, Cisco - @drjmetz 2018 Flash Memory Summit

Welcome to SNIA Education Afternoon at Flash Memory Summit 2018

Agenda 1:00 pm 1:50 pm SNIA Tutorial 1 A Case for Flash Storage Dejan Kocic, NetApp 1:50 pm 2:45 pm SNIA Tutorial 2 What if Programming and Networking Had a Storage Baby Pod? John Kim, Mellanox Technologies and J Metz, Cisco Systems 2:45 pm 3:00 pm Break 3:00 pm 3:50 pm SNIA Tutorial 3 Buffers, Queues, and Caches John Kim, Mellanox Technologies and J Metz, Cisco Systems 4:00 pm 5:00 pm SNIA Tutorial 4 Birds-of-a-Feather Persistent Memory Futures Jeff Chang, SNIA Persistent Memory and NVDIMM SIG Co-Chair 3

June 2018 @SNIA

Join SNIA at These Upcoming Events SDC discount registration cards in FMS bags & at SNIA booth 820 Complimentary registration now open at snia.org/pmsummit 5

SNIA Legal Notice The material contained in this presentation is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 6

Special Thanks Mark Rogov Dell EMC David Minturn Intel Rob Davis Mellanox 7

Today s Agenda Queuing Buffering Caching Flow Control 8

QUEUING

Definitions- Queue Depth IO Operation (aka IOP ) Storage operation issued by a host (initiator) to a storage device/system (target) Example: Host issues a READ Operation of 100 blocks from a storage device IO Queue A queue which holds one or more outstanding IO Operations IO Queue Depth Maximum number of outstanding IO Operations that the IO Queue can hold 10

IO Queue Depth Example Example: IO Queue with a IO Queue Depth of three 11

IO Queue Depth Considerations (Storage Device Performance) Larger Queue Depth allows IO Operations to be serviced in parallel or batched resulting in higher total IOPS and bandwidth Example Results: @Queue Depth =1; IOPS are 10K (1/100usec) @Queue Depth =3; IOPS are 25K (3/120usec) 9

IO Queue Depth Considerations (End to End Queue Depth) Host systems typically have multiple Autonomous Processes simultaneously issuing IO Operations (Application Queue Depth) Host O/S storage stacks have internal queues to accommodate oversubscribed Storage Device IO Queues (O/S Stack Queue Depth) Queue Depth must be looked at End to End; App->O/S->Storage Device->Media 13

IO Queue Depth Considerations (Multiple IO Queues) Modern Storage Devices use a multi IO queue model for efficiency, typically one IO Queue per host CPU Effective Storage Device IO Queue Depth equals: # of IO Queues * individual IOQ Depth 14

IO Queue Depth Considerations (Memory Usage) Memory buffers (termed Transfer Buffers) are used to exchange IO Operation data between the Host System and Storage Device Transfer buffer resources are committed until the IO Operation completes Resources may be large; example 64K IO requires 64K of memory IO Queue Memory = IO Queue Depth * size of (IOP Descriptor + Transfer Buffer) 15

CACHE

Cache in the US Cache Cache Cache 1 7

Definition Cache (aka Cache Memory) /ˈkaSH/ An auxiliary memory from which high-speed retrieval is possible 1 8

Buffer Use once and throw it away + allows blocks re-arranging BUFFER 19

Cache Cache implies multiple use of blocks CACHE BUFFERS 20

Reads with Cache Read Hit: First reason for Caches to exist CACHE A REPLY REQUEST 21

Reads with Cache Read Miss CACHE A READ FROM DISK REPLY REQUEST 22

Writes with Cache Write-through: Write data to Cache and Disk, then confirm completion CACHE WRITE A A ACK WRITE 23

Writes with Cache Write-around: Bypass cache CACHE A ACK WRITE 24

Writes with Cache Write-back: Write data to Cache and confirm completion; write to disk later CACHE A WRITE LATER A ACK WRITE 25

Performance in a File Oriented World 25 Listing the contents of a directory 20 uncached ls Seconds to Complete 15 10 5 0 cached ls 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 180,000 200,000 220,000 240,000 260,000 280,000 300,000 320,000 340,000 360,000 380,000 400,000 420,000 440,000 460,000 480,000 500,000 520,000 540,000 Files In Directory Uncached ls had a USB unmount just prior to the ls command execution 2 7

Where Do Caches Exist? PHYSICAL BACK-END STORAGE FRONT-END CLIENTS / STORAGE CONNECT CONTROLLER CONNECT HOSTS IO IO CACHE 2 8

FLOW CONTROL 29

What is Flow Control? Flow control is a mechanism for temporarily stopping the transmission of data on computer network to avoid buffer overflows 30

What is Flow Control? No Flow Control 31

What is Flow Control? No Flow Control Flow Control 32

Buffers are Everywhere switch switch node node All computer networking devices have some buffers to facilitate speed matching 33

Buffers are Everywhere switch switch node node But these buffers seem to never be big enough. 34

Buffers are Everywhere switch switch node node This can lead to a Buffer Overflow resulting in Data Packet Drops forcing error recovery delays. Buffer Overflow 35

Buffer Overflows are Bad 36

Why Do Overflows Happen 25Gb 10Gb Data Buffer Overflows happen when more data is coming into a networking device then is going out. 37

Flow Control Prevents Overflows 25Gb 10Gb 10Gb Data IEEE 802.3x standard defines a flow control mechanism for Ethernet called the pause frame. 38

Flow Control Prevents Overflows 25Gb 10Gb 10Gb Data When the data in a buffer gets to a certain level the pause frame is sent causing the upstream device to stop sending data for a specified amount of time. Threshold 39

Fibre Channel and InfiniBand ANSI INCITS T11 IBTA With credit based flow control the sending device knows how much buffer space the receiving device has eliminating buffer overflows. 40

Explicit Congestion Notification Explicit Congestion Notification (ECN) slows down a explicit device s data rate that is believed to be overflowing another devices buffer. 41

Explicit Congestion Notification The data rate of the device slowed down then increases in increments over time based on preset parameters. RFC 3168 - Explicit Congestion Notification (ECN) 42

Priority Flow Control 43

Priority Flow Control 44

Priority Flow Control Priority Flow Control (PFC) is similar to 802.3x Pause, except eight priority levels are added. When the data in any of the eight buffers gets to a certain level a pause is sent causing the upstream device to stop sending data only for that priority level for a specified amount of time. 802.1Qbb - Priority-based Flow Control 45

Overall Summary Queues Line Up Work Processes or Requests Buffers absorb traffic bursts and smooth out data flow Caches store data closer to the user to accelerate access Flow Control Modules the Rate of Data or Requests to prevent buffer overflow 46

Other Storage Terms Got Your Pride? This is a Series! Check out previously recorded webcasts: http://sniaesfblog.org/everything-you-wanted-to-know-about-storage-but-weretoo-proud-to-ask/ Teal Buffers, Queues and Caches Rosé - All things iscsi Chartreuse The Basics: Initiator, Target, Storage Controller, RAID, Volume Manager and more Mauve Architecture: Channel vs. Bus, Control Plane vs. Data Plane, Fabric vs. Network Sepia Getting from Here to There Turquoise Where Does My Data Go? Cyan Storage Management Aqua Storage Controllers 47

Speaking of Series Check out Storage Performance Benchmarking Storage Performance Benchmarking: 1. Introduction and Fundamentals 2. Solution under Test 3. Block Components 4. File Components Watch them all on-demand at: http://www.snia.org/forums/esf/knowledge/webcasts-topics 48

Thank You! Visit snia.org/education for more webcasts, videos, and educational materials