Low-Latency Datacenters. John Ousterhout Platform Lab Retreat May 29, 2015

Similar documents
Low-Latency Datacenters. John Ousterhout

RAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University

RAMCloud: A Low-Latency Datacenter Storage System Ankita Kejriwal Stanford University

RAMCloud: Scalable High-Performance Storage Entirely in DRAM John Ousterhout Stanford University

An Implementation of the Homa Transport Protocol in RAMCloud. Yilong Li, Behnam Montazeri, John Ousterhout

What s Wrong with the Operating System Interface? Collin Lee and John Ousterhout

Lightning Talks. Platform Lab Students Stanford University

CGAR: Strong Consistency without Synchronous Replication. Seo Jin Park Advised by: John Ousterhout

FaRM: Fast Remote Memory

RAMCloud. Scalable High-Performance Storage Entirely in DRAM. by John Ousterhout et al. Stanford University. presented by Slavik Derevyanko

Pricing Intra-Datacenter Networks with

IsoStack Highly Efficient Network Processing on Dedicated Cores

The NE010 iwarp Adapter

6.9. Communicating to the Outside World: Cluster Networking

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

Arachne. Core Aware Thread Management Henry Qin Jacqueline Speiser John Ousterhout

Scaling Internet TV Content Delivery ALEX GUTARIN DIRECTOR OF ENGINEERING, NETFLIX

Advanced Computer Networks. End Host Optimization

Why Your Application only Uses 10Mbps Even the Link is 1Gbps?

Data Centers. Tom Anderson

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

Messaging Overview. Introduction. Gen-Z Messaging

Networks and distributed computing

Fastpass A Centralized Zero-Queue Datacenter Network

CS 856 Latency in Communication Systems

Exploiting Commutativity For Practical Fast Replication. Seo Jin Park and John Ousterhout

Storage Protocol Offload for Virtualized Environments Session 301-F

Computer Systems C S Cynthia Lee Today s materials adapted from Kevin Webb at Swarthmore College

Packet Scheduling in Data Centers. Lecture 17, Computer Networks (198:552)

NIC TEAMING IEEE 802.3ad

Initial Performance Evaluation of the Cray SeaStar Interconnect

Network Processors and their memory

Themes. The Network 1. Energy in the DC: ~15% network? Energy by Technology

NVMe over Fabrics. High Performance SSDs networked over Ethernet. Rob Davis Vice President Storage Technology, Mellanox

FaSST: Fast, Scalable, and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs

FPGA Augmented ASICs: The Time Has Come

Introduction to the OpenCAPI Interface

Exploiting Commutativity For Practical Fast Replication. Seo Jin Park and John Ousterhout

Storage. Hwansoo Han

Network Design Considerations for Grid Computing

Multifunction Networking Adapters

Fast packet processing in the cloud. Dániel Géhberger Ericsson Research

Low-Cost Traffic Analysis of Tor

OpenOnload. Dave Parry VP of Engineering Steve Pope CTO Dave Riddoch Chief Software Architect

Programmable NICs. Lecture 14, Computer Networks (198:552)

Storage Speed and Human Behavior. PRESENTATION TITLE GOES HERE Eric Herzog CMO and Senior VP of Business Development Violin Memory

I/O, Disks, and RAID Yi Shi Fall Xi an Jiaotong University

N V M e o v e r F a b r i c s -

A Disseminated Distributed OS for Hardware Resource Disaggregation Yizhou Shan

NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications

Got Loss? Get zovn! Daniel Crisan, Robert Birke, Gilles Cressier, Cyriel Minkenberg, and Mitch Gusat. ACM SIGCOMM 2013, August, Hong Kong, China

HIGH-PERFORMANCE NETWORKING :: USER-LEVEL NETWORKING :: REMOTE DIRECT MEMORY ACCESS

Arrakis: The Operating System is the Control Plane

RDMA and Hardware Support

Link Aggregation: A Server Perspective

Tales of the Tail Hardware, OS, and Application-level Sources of Tail Latency

Everything You Wanted To Know About Storage: Part Teal The Buffering Pod

The Google File System

Application Advantages of NVMe over Fabrics RDMA and Fibre Channel

The Google File System

Application Acceleration Beyond Flash Storage

Impact of Cache Coherence Protocols on the Processing of Network Traffic

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

How Might Recently Formed System Interconnect Consortia Affect PM? Doug Voigt, SNIA TC

Benn Thomsen. Microsoft Research

iscsi or iser? Asgeir Eiriksson CTO Chelsio Communications Inc

Gateware Defined Networking (GDN) for Ultra Low Latency Trading and Compliance

Where We Are in This Course Right Now. ECE 152 Introduction to Computer Architecture Input/Output (I/O) Copyright 2012 Daniel J. Sorin Duke University

Today: Protection! Protection!

Linux Plumbers Conference TCP-NV Congestion Avoidance for Data Centers

Titan: Fair Packet Scheduling for Commodity Multiqueue NICs. Brent Stephens, Arjun Singhvi, Aditya Akella, and Mike Swift July 13 th, 2017

The Missing Piece of Virtualization. I/O Virtualization on 10 Gb Ethernet For Virtualized Data Centers

Platform Lab Overview and Update. John Ousterhout Faculty Director

Utilizing Datacenter Networks: Centralized or Distributed Solutions?

DeTail Reducing the Tail of Flow Completion Times in Datacenter Networks. David Zats, Tathagata Das, Prashanth Mohan, Dhruba Borthakur, Randy Katz

Low-latency Network Monitoring via Oversubscribed Port Mirroring

Throughput & Latency Control in Ethernet Backplane Interconnects. Manoj Wadekar Gary McAlpine. Intel

Flash: an efficient and portable web server

Survey of ETSI NFV standardization documents BY ABHISHEK GUPTA FRIDAY GROUP MEETING FEBRUARY 26, 2016

Chapter 6. Storage and Other I/O Topics

High-resolution Measurement of Data Center Microbursts

Dynamic Partitioned Global Address Spaces for Power Efficient DRAM Virtualization

Evolution of Rack Scale Architecture Storage

Today: Protection. Sermons in Computer Science. Domain Structure. Protection

Basic Reliable Transport Protocols

Data Center Networks and Switching and Queueing and Covert Timing Channels

EECS 482 Introduction to Operating Systems

Computer Architecture Computer Science & Engineering. Chapter 6. Storage and Other I/O Topics BK TP.HCM

Learn Your Alphabet - SRIOV, NPIV, RoCE, iwarp to Pump Up Virtual Infrastructure Performance

The Case for Informed Transport Protocols

Oracle 1Z Exalogic Elastic Cloud X2-2 Essentials.

RoCE vs. iwarp Competitive Analysis

Commodity Converged Fabrics for Global Address Spaces in Accelerator Clouds

Indexing in RAMCloud. Ankita Kejriwal, Ashish Gupta, Arjun Gopalan, John Ousterhout. Stanford University

Internet Technology. 06. Exam 1 Review Paul Krzyzanowski. Rutgers University. Spring 2016

LECTURE 1. Introduction

Half full or half empty? William Gropp Mathematics and Computer Science

Internet Technology 3/2/2016

Memory management. Last modified: Adaptation of Silberschatz, Galvin, Gagne slides for the textbook Applied Operating Systems Concepts

The Future of High-Performance Networking (The 5?, 10?, 15? Year Outlook)

Transcription:

Low-Latency Datacenters John Ousterhout Platform Lab Retreat May 29, 2015

Datacenters: Scale and Latency Scale: 1M+ cores 1-10 PB memory 200 PB disk storage Latency: < 0.5 µs speed-of-light delay Most work so far has focused on scale: One app, many resources Map-Reduce, etc. Latency potential unrealized: High-latency hardware/software Most apps designed to tolerate latency (communication via large blocks) May 29, 2015 Low-Latency Datacenter Slide 2

Latency Round-trip times (100K servers): Today: 100-500 µs best case Often much worse because of congestion Hardware limit: ~2 µs Storage latency dropping: Disk Flash DRAM Can we create a new platform that makes the hardware limit accessible to applications? If so, will it enable important new applications? May 29, 2015 Low-Latency Datacenter Slide 3

Clean-Slate Low-Latency Datacenter New switching architecture (30 ns per switch) NIC fused with CPU cores; on-chip routing User-level networking, polling instead of interrupts New transport protocol Storage systems based primarily in DRAM New software stack May 29, 2015 Low-Latency Datacenter Slide 4

Low-Latency Storage: RAMCloud New class of datacenter storage: All data in DRAM at all times (disk/flash for backup only) Large scale: aggregate 1000 s of servers Low latency: 5-10µs remote access 1000x improvements over disk in Performance Energy/op Goal: enable a new class of data-intensive applications 1000 100,000 Application Servers Appl. Library Master Backup Appl. Library Master Backup Appl. Library Datacenter Network Master Backup 1000 10,000 Storage Servers Appl. Library Master Backup Coordinator 32-256 GB per server May 29, 2015 Low-Latency Datacenter Slide 5

New Transport Protocol TCP protocol optimized for: Throughput, not latency Long-haul networks (high latency) Congestion throughout Modest # connections/server Future datacenters: High performance networking fabric: Low latency Multi-path Congestion primarily at edges Little congestion in core Many connections/server (1M?) Need new transport protocol Top-of-rack switches Congested link Perfect core fabric May 29, 2015 Low-Latency Datacenter Slide 6

New Transport Protocol, cont d Greatest obstacle to low latency: Congestion at receiver s link Large messages delay small ones Solution: drive congestion control from receiver Schedule incoming traffic Prioritize small messages Behnam Montazeri will present work in progress May 29, 2015 Low-Latency Datacenter Slide 7

Low-Latency Software Stacks? Today s stacks: highly layered Good for structuring software Each layer solves one problem Bad for performance Each layer adds latency Example: Thrift RPC system Handles several problems: marshalling, threading, etc. General-purpose: re-pluggable components Adds 7 µs latency For low latency, must replace the entire software stack May 29, 2015 Low-Latency Datacenter Slide 8

Reducing Software Stack Latency High Latency 3. Bypass layers 1. Optimize layers (specialize?) 2. Eliminate layers May 29, 2015 Low-Latency Datacenter Slide 9

Integrate NIC Into CPU Chip? Per- Mini-NIC OS controls routing tables for incoming packets Switch Network May 29, 2015 Low-Latency Datacenter Slide 10

Low Latency => New Applications? Does 2 µs latency matter? Use low latency for collecting data? Small chunks of data Random access Dependencies serialize accesses Need a lot of chunks in a small amount of time: 20K chunks in 50 ms? Use low latency for new computational models? Independent compute-storage elements Low latency allows high coherency May 29, 2015 Low-Latency Datacenter Slide 11

Discussion Topics What are the key elements of a low-latency platform for datacenters? What will a new software stack look like? What applications could make use of a low-latency datacenter? May 29, 2015 Low-Latency Datacenter Slide 12

Palette May 29, 2015 Low-Latency Datacenter Slide 13