Exploring System Challenges of Ultra-Low Latency Solid State Drives
Transcription
1 Exploring System Challenges of Ultra-Low Latency Solid State Drives
Sungjoon Koh, Changrim Lee, Miryeong Kwon, and Myoungsoo Jung
Computer Architecture and Memory systems Lab
2 Executive Summary
Motivation. Ultra-low latency (ULL) SSDs are emerging, but they have not been characterized so far.
Contributions.
- Characterizing the performance behaviors of ULL SSD.
- Studying several system-level challenges of the current storage stack.
Key Observations.
- ULL SSD minimizes I/O interference (interleaved reads and writes).
- NVMe queue mechanisms need to be optimized for ULL SSDs.
- The polling-based I/O completion routine isn't effective for current NVMe SSDs.
3 Architectural Change of SSD
[Figure: a SATA SSD attaches over SATA to the ICH (South Bridge), which sits behind the MCH (North Bridge) and DRAM; an NVMe SSD attaches to the CPU over PCI Express, giving direct access and high bandwidth.]
4 Evolution of SSDs
- SATA SSD: Read 0.5 GB/s, Write 0.5 GB/s
- NVMe SSD: Read 2.4 GB/s, Write 1.2 GB/s
Bandwidth almost reaches the maximum performance. Still, latency is long (far from DRAM).
Changes: new flash memory, called Z-NAND.
5 New Flash Memory
Existing 3D NAND: Read: μs, Write: μs
Z-NAND [1]:
- Technology: SLC based 3D NAND, 48 stacked word-line layers
- Capacity: 64Gb
- Page Size: 2kB/Page
- Read: 3μs (15~20x)
- Write: 100μs (6~7x)
Z-SSD: the Z-NAND based drive.
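The absolute latencies for existing 3D NAND did not survive in this transcription, but the speedup factors did. A minimal sketch of the implied arithmetic follows, assuming the factors in parentheses are ratios of existing 3D NAND latency to Z-NAND latency. The function name is illustrative, not from the slides.

```python
def implied_baseline_us(znand_latency_us, speedup_low, speedup_high):
    """Return the (low, high) baseline latency implied by a speedup range."""
    return (znand_latency_us * speedup_low, znand_latency_us * speedup_high)


# Z-NAND read: 3 us at 15~20x; Z-NAND write: 100 us at 6~7x (values from the slide).
read_range_us = implied_baseline_us(3, 15, 20)
write_range_us = implied_baseline_us(100, 6, 7)
print("implied 3D NAND read latency (us):", read_range_us)
print("implied 3D NAND write latency (us):", write_range_us)
```

Under that assumption, existing 3D NAND would read in roughly 45 to 60 μs and program in roughly 600 to 700 μs.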
6 Characterization Categories
Performance Analysis.
- Average latency.
- Long-tail latency.
- Bandwidth.
- I/O interference impact.
Polling vs. Interrupt.
- Overall latency comparison.
- CPU utilization analysis.
- Memory requirement.
- Five-nines latency.
7 Evaluation Settings
- OS: Linux
- CPU: Intel Core i7-4790k (4-core, 4.00GHz)
- Memory: DDR4 DRAM (16GB)
- SSD
  - ULL SSD: Z-SSD Prototype (800GB)
  - NVMe SSD: Intel SSD 750 Series (400GB)
- Benchmark: Flexible I/O Tester (FIO v2.99)
[Figure: our testbed w/ Z-SSDs]
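The slides name FIO as the benchmark but do not show the job parameters. The sketch below only builds an fio argument list for one test point and executes nothing. The device path, job name, runtime, and the choice of `libaio` for interrupt-driven runs and `pvsync2` with `--hipri` for polled runs are assumptions for illustration, not the authors' configuration.

```python
def fio_command(device, rw, block_size="4k", iodepth=1, polling=False):
    """Build an fio argument list for one test point; nothing is executed."""
    # Assumption: polled runs use the synchronous pvsync2 engine with hipri,
    # interrupt-driven runs use libaio so that iodepth > 1 takes effect.
    engine = "pvsync2" if polling else "libaio"
    cmd = [
        "fio",
        "--name=ull-test",          # hypothetical job name
        f"--filename={device}",
        "--direct=1",               # bypass the page cache
        f"--rw={rw}",               # read, write, randread, or randwrite
        f"--bs={block_size}",
        f"--ioengine={engine}",
        f"--iodepth={iodepth}",
        "--time_based",
        "--runtime=30",             # hypothetical duration in seconds
        "--output-format=json",
    ]
    if polling:
        cmd.append("--hipri")
    return cmd


# /dev/nvme0n1 is a placeholder device path.
print(" ".join(fio_command("/dev/nvme0n1", "randread", iodepth=16)))
print(" ".join(fio_command("/dev/nvme0n1", "randread", polling=True)))
```

Because `pvsync2` is a synchronous engine, a polled run of this kind effectively issues one request at a time regardless of `--iodepth`.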
8 Performance Analysis
9 Overview
[Figure: the host fills a request queue with 4KB reads and writes at increasing queue depth; requests pass through the NVMe driver to the NVMe controller in the SSD.]
1. Average latency & long-tail latency
2. Bandwidth
3. Read latency under read & write intermixed workload
10 Average Latency of ULL SSD
[Figure: average latency (μsec) versus I/O depth for sequential and random reads and writes (SeqRd, SeqWr, RndRd, RndWr), NVMe SSD versus ULL SSD.]
Annotations on the ULL SSD read path: 11 μs; t_R = 3 μs; 4KB DMA: t_DMA = 8 μs.
Enabling techniques: Split-DMA & Super-Channel.
11 Split-DMA & Super-Channel
Reference: Cheong, Woosung et al., "A flash memory controller for 15μs ultra-low-latency SSD using high-speed 3D NAND flash with 3μs read time," ISSCC, 2018.
[Figure: inside the Z-SSD, the split DMA engine splits a 4KB request into two 2KB halves and sends them over a super channel, a pair of channels operating together (channels 0 to 5 shown).]
Result: t_DMA = 4μs.
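The effect of splitting can be sketched with simple arithmetic. The example below assumes DMA time scales linearly with the bytes moved per channel and that the two halves transfer fully in parallel; the 8 μs figure for an unsplit 4KB DMA is taken from the previous slide, and the function name is illustrative.

```python
def split_dma(request_bytes, full_dma_us, channels_in_super_channel):
    """Return (bytes per channel, DMA time in us) for an evenly split request."""
    chunk_bytes = request_bytes // channels_in_super_channel
    # Assumption: transfer time is proportional to the bytes each channel moves,
    # and all channels of the super channel transfer at the same time.
    dma_us = full_dma_us * chunk_bytes / request_bytes
    return chunk_bytes, dma_us


chunk_bytes, t_dma_us = split_dma(4096, 8.0, 2)
print(f"{chunk_bytes} bytes per channel, t_DMA = {t_dma_us} us")
```

With two channels per super channel this reproduces the 2KB split and the 4 μs transfer time shown on the slide.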
12 Long-tail Latency of ULL SSD
[Figure: tail-percentile latency (msec) versus I/O depth for SeqRd, RndRd, SeqWr, and RndWr, ULL SSD versus NVMe SSD.]
- NVMe SSD: resource conflicts, insufficient internal buffer, internal tasks.
- ULL SSD: Split DMA & Suspend/Resume.
13 Suspend/Resume DMA Technique
Reference: Cheong, Woosung et al., "A flash memory controller for 15μs ultra-low-latency SSD using high-speed 3D NAND flash with 3μs read time," ISSCC, 2018.
[Figure, without suspend/resume: Way 1 runs a DMA for a write request; the read on Way 2 waits for it, then proceeds through CMD, t_R, and Data Out.]
[Figure, with suspend/resume [1]: the write DMA on Way 1 is suspended so that the read on Way 2 can proceed through CMD, t_R, and Data Out; the write DMA then resumes.]
Effect: reduces read latency and increases QoS.
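A toy timing model makes the benefit concrete. Only t_R = 3 μs comes from the slides; the command-issue time, the remaining write DMA time, and the reuse of the 4 μs split-DMA figure as the read data-out time are assumptions for illustration. The model also ignores any cost of the suspend and resume operations themselves.

```python
T_R_US = 3.0     # Z-NAND read time, from the slides
T_CMD_US = 0.5   # hypothetical command-issue time
T_OUT_US = 4.0   # assumed read data-out time (reuses the split-DMA figure)


def read_latency_us(write_dma_remaining_us, suspend_resume):
    """Read latency when a write DMA is in flight on the same channel."""
    # Without suspend/resume the read waits for the write DMA to finish.
    # With it, the write DMA is paused and the read starts immediately.
    wait_us = 0.0 if suspend_resume else write_dma_remaining_us
    return wait_us + T_CMD_US + T_R_US + T_OUT_US


blocked = read_latency_us(20.0, suspend_resume=False)
suspended = read_latency_us(20.0, suspend_resume=True)
print(f"read latency without suspend/resume: {blocked} us")
print(f"read latency with suspend/resume:    {suspended} us")
```

In this model the read latency no longer depends on how much of the write transfer remains, which is the property that helps the tail.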
14 I/O Interference
I/O interference is a great performance bottleneck of conventional SSDs.
[Figure: average read latency (μsec) versus write fraction (%), NVMe SSD versus ULL SSD.]
- NVMe SSD: significant performance degradation in intermixed workloads. Flush operations and metadata writes in the file system are intermixed with user requests.
- How about ULL SSD? Read latency remains almost constant (suspend/resume [1]).
- ULL SSD can be applied to a real-life storage stack w/o performance degradation.
15 Queue Analysis
[Figure: normalized bandwidth versus I/O depth for SeqRd, RndRd, SeqWr, and RndWr, one panel for the NVMe SSD and one for the ULL SSD.]
NVMe SSD:
- Only 50% of max BW.
- Too long write latency.
- Requires more than 100 entries and I/O request rescheduling within the queue.
- Light queue mechanisms (ex. NCQ) are not sufficient; a rich queue mechanism is required.
ULL SSD:
- Almost max BW.
- Short write latency.
- Only 6 entries required.
- Well-aligned with light queue mechanisms (ex. NCQ); NVMe needs to be lightened.
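Why a faster device needs fewer queue entries follows from Little's law: the number of requests in flight equals throughput times latency. The sketch below uses illustrative inputs, not measurements from the slides: a target bandwidth of 2400 bytes/μs (2.4 GB/s, the NVMe read figure quoted earlier, reused here as an assumption) and two hypothetical per-request latencies.

```python
import math


def required_queue_depth(target_bw_bytes_per_us, latency_us, block_bytes=4096):
    """Queue entries needed to sustain a bandwidth at a given request latency."""
    # Little's law: bytes in flight = bandwidth * latency.
    in_flight_bytes = target_bw_bytes_per_us * latency_us
    return math.ceil(in_flight_bytes / block_bytes)


# Hypothetical latencies chosen for illustration only.
print("depth at  10 us per request:", required_queue_depth(2400, 10))
print("depth at 200 us per request:", required_queue_depth(2400, 200))
```

With these assumed inputs a 10 μs device saturates with a handful of entries while a 200 μs device needs more than a hundred, the same shape as the contrast on the slide.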
16 Polling vs. Interrupt Two different I/O completion methods
17 Interrupt / Polling
Systems with short waiting times adopt a polling-based waiting strategy, even though it incurs a lot of overhead. For example, spin locks and network message passing apply polling-based waiting.
Polling is now implemented in the NVMe storage stack. Is it really needed for current NVMe SSDs?
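18 Interrupt / Polling
Interrupt. Submit request, context switch (CS), sleep while the SSD executes the command. (1) The NVMe controller finishes, (2) it raises an IRQ, (3) the ISR wakes the sleeping thread. After another CS, the request completes.
Polling. Submit request, then poll ("Done?") while the SSD executes the command, and complete the request without the context switches. The saved time is the gain.
Low latency SSD. Command execution is shorter, so the context switch and ISR overhead is a larger portion of the total latency.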
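The two timelines can be compared with a small additive model. It assumes the interrupt path pays two context switches plus one ISR on top of the device time and that polling pays none of them; the 2 μs and 1 μs costs and the device times are hypothetical values, not measurements from the slides.

```python
def completion_latency_us(device_us, mode, cs_us=2.0, isr_us=1.0):
    """End-to-end latency of one request under a simple additive model."""
    if mode == "polling":
        return device_us                        # the core spins until done
    if mode == "interrupt":
        return device_us + 2 * cs_us + isr_us   # sleep, IRQ, ISR, wake-up
    raise ValueError(f"unknown mode: {mode}")


def polling_gain(device_us):
    """Fraction of interrupt-mode latency that polling removes."""
    irq = completion_latency_us(device_us, "interrupt")
    poll = completion_latency_us(device_us, "polling")
    return (irq - poll) / irq


for device_us in (100.0, 10.0):   # hypothetical slow and fast devices
    print(f"device {device_us:5.1f} us -> polling gain {polling_gain(device_us):.1%}")
```

The fixed software overhead is the same in both cases, so its share of the total, and therefore the gain from polling, grows as the device gets faster.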
19 Overall Performance
[Figure: average latency (μsec) of interrupt-based versus polling-based completion for 4KB, 8KB, 16KB, and 32KB reads and writes, on the NVMe SSD and on the ULL SSD.]
NVMe SSD: polling decreases latency by only 0.9% (read) and 8.2% (write). Polling-based I/O services are not effective for current NVMe SSDs.
Does polling-based I/O work on ULL SSD? Polling decreases latency by 7.5% (read) and 13.2% (write). Future lower-latency SSDs can achieve remarkable performance improvement with the polling-based I/O completion routine.
20 System Challenges
Polling-based I/O services incur significant system-level overheads.
[Figures: five-nines latency (msec) for ULL writes, CPU utilization (%), and memory bound (%), interrupt versus polling, for 4KB to 32KB requests.]
- CPU utilization: with polling the core is always working, while interrupts release the CPU. Polling leads to high CPU utilization.
- Memory bound = fraction of slots where the pipeline could be stalled due to load/store. High memory bound = frequent memory access: the polling core repeatedly checks for CQ updates in the memory space shared with the NVMe controller.
- Synchronization: spin lock for the SQ/CQ head and tail pointers (SQ tail doorbell, CQ head doorbell).
These overheads need to be addressed.
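The source of the repeated memory accesses can be illustrated with a simplified model of completion-queue polling. NVMe completion entries carry a phase bit that the controller flips on each pass through the queue, and the host detects a new entry by re-reading the entry at the CQ head. The class below is a toy simulation under that model, not driver code: all names are illustrative, and it omits the doorbell write, locking, and overflow protection.

```python
from dataclasses import dataclass


@dataclass
class CqEntry:
    command_id: int = 0
    phase: int = 0   # flipped by the controller on each pass through the queue


class CompletionQueue:
    def __init__(self, size):
        self.entries = [CqEntry() for _ in range(size)]
        self.head = 0         # host-side consume index
        self.tail = 0         # controller-side produce index
        self.host_phase = 1   # phase value the host expects for a new entry
        self.ctrl_phase = 1   # phase value the controller currently writes
        self.polls = 0        # memory reads issued by the polling core

    def controller_post(self, command_id):
        """Controller side: write a completion entry at the tail."""
        entry = self.entries[self.tail]
        entry.command_id = command_id
        entry.phase = self.ctrl_phase
        self.tail += 1
        if self.tail == len(self.entries):
            self.tail = 0
            self.ctrl_phase ^= 1

    def host_poll(self):
        """Host side: one read of the head entry; returns an id or None."""
        self.polls += 1
        entry = self.entries[self.head]
        if entry.phase != self.host_phase:
            return None       # nothing new yet, the core keeps spinning
        self.head += 1
        if self.head == len(self.entries):
            self.head = 0
            self.host_phase ^= 1
        # A real driver would now write the new head to the CQ head doorbell.
        return entry.command_id


cq = CompletionQueue(4)
empty = [cq.host_poll() for _ in range(3)]   # device still busy
cq.controller_post(42)
done = cq.host_poll()
print(empty, done, "polls:", cq.polls)
```

Every unsuccessful poll is a load from the shared queue memory, so the longer the device takes, the more of these reads the spinning core issues.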
21 Conclusion
Motivation. Ultra-low latency (ULL) SSDs are emerging, but they have not been characterized so far.
Contributions.
- Characterizing the performance behaviors of ULL SSD.
- Studying several system-level challenges of the current storage stack.
Key Insights.
- ULL SSDs can be effectively applied to a real-life storage stack (read/write mixed workloads).
- NVMe queue mechanisms need to be optimized for ULL SSDs.
- The polling-based I/O completion routine isn't effective for current NVMe SSDs.
22 Thank you Q&A