Lightning Talks. Platform Lab Students Stanford University

Similar documents
Platform Lab Overview and Update. John Ousterhout Faculty Director

CGAR: Strong Consistency without Synchronous Replication. Seo Jin Park Advised by: John Ousterhout

Exploiting Commutativity For Practical Fast Replication. Seo Jin Park and John Ousterhout

Low-Latency Datacenters. John Ousterhout Platform Lab Retreat May 29, 2015

Low-Latency Datacenters. John Ousterhout

Making RAMCloud Writes Even Faster

An Implementation of the Homa Transport Protocol in RAMCloud. Yilong Li, Behnam Montazeri, John Ousterhout

Exploiting Commutativity For Practical Fast Replication. Seo Jin Park and John Ousterhout

Arachne. Core Aware Thread Management Henry Qin Jacqueline Speiser John Ousterhout

2018 Spring Retreat: Lab Overview and Update. John Ousterhout Faculty Director

RAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University

What s Wrong with the Operating System Interface? Collin Lee and John Ousterhout

2017 Winter Review: Lab Overview and Update. John Ousterhout Faculty Director

High-Level Data Models on RAMCloud

COLLIN LEE INITIAL DESIGN THOUGHTS FOR A GRANULAR COMPUTING PLATFORM

RAMCloud: A Low-Latency Datacenter Storage System Ankita Kejriwal Stanford University

Hello, my name is Stephen Yang and I m a PhD Candidate from Stanford University. Today, I ll be giving a talk on NanoLog, a nanosecond scale logging

DeTail Reducing the Tail of Flow Completion Times in Datacenter Networks. David Zats, Tathagata Das, Prashanth Mohan, Dhruba Borthakur, Randy Katz

RAMCloud: Scalable High-Performance Storage Entirely in DRAM John Ousterhout Stanford University

Distributed Geometric Data Structures. Philip Levis Stanford Platform Lab Review Feb 9, 2017

DAL: A Locality-Optimizing Distributed Shared Memory System

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

Fastpass A Centralized Zero-Queue Datacenter Network

Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities

Arachne: Core-Aware Thread Management

Toward a Reliable Data Transport Architecture for Optical Burst-Switched Networks

Exploiting Commutativity For Practical Fast Replication

Millisort: An Experiment in Granular Computing. Seo Jin Park with Yilong Li, Collin Lee and John Ousterhout

Application of SDN: Load Balancing & Traffic Engineering

At-most-once Algorithm for Linearizable RPC in Distributed Systems

A Generalized Blind Scheduling Policy

Pocket: Elastic Ephemeral Storage for Serverless Analytics

Memory-Based Cloud Architectures

Towards Deadline Guaranteed Cloud Storage Services Guoxin Liu, Haiying Shen, and Lei Yu

Packet Scheduling in Data Centers. Lecture 17, Computer Networks (198:552)

Tailwind: Fast and Atomic RDMA-based Replication. Yacine Taleb, Ryan Stutsman, Gabriel Antoniu, Toni Cortes

FaRM: Fast Remote Memory

CPS 512 midterm exam #1, 10/7/2016

Using the SDACK Architecture to Build a Big Data Product. Yu-hsin Yeh (Evans Ye) Apache Big Data NA 2016 Vancouver

Tales of the Tail Hardware, OS, and Application-level Sources of Tail Latency

Designing Distributed Systems using Approximate Synchrony in Data Center Networks

Kernel Bypass. Sujay Jayakar (dsj36) 11/17/2016

Starling: A Scheduler Architecture for High Performance Cloud Computing

ReFlex: Remote Flash Local Flash

Scalable Low-Latency Indexes for a Key-Value Store Ankita Kejriwal

Indexing in RAMCloud. Ankita Kejriwal, Ashish Gupta, Arjun Gopalan, John Ousterhout. Stanford University

VoltDB vs. Redis Benchmark

High-Performance Data Loading and Augmentation for Deep Neural Network Training

Rocksteady: Fast Migration for Low-Latency In-memory Storage. Chinmay Kulkarni, Aniraj Kesavan, Tian Zhang, Robert Ricci, Ryan Stutsman

HPX. High Performance ParalleX CCT Tech Talk Series. Hartmut Kaiser

Execu&on Templates: Caching Control Plane Decisions for Strong Scaling of Data Analy&cs

Asymmetry-aware execution placement on manycore chips

Implementing Linearizability at Large Scale and Low Latency

DIBS: Just-in-time congestion mitigation for Data Centers

Data Analytics on RAMCloud

RAMCloud. Scalable High-Performance Storage Entirely in DRAM. by John Ousterhout et al. Stanford University. presented by Slavik Derevyanko

Datacenter replication solution with quasardb

Real-Time Internet of Things

Making Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari

TCP Nice: A Mechanism for Background Transfers

Automatic Scaling Iterative Computations. Aug. 7 th, 2012

SOFTWARE DEFINED NETWORKS. Jonathan Chu Muhammad Salman Malik

I/O Buffering and Streaming

Variable Step Fluid Simulation for Communication Network

Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services

ParalleX. A Cure for Scaling Impaired Parallel Applications. Hartmut Kaiser

No Tradeoff Low Latency + High Efficiency

DCRoute: Speeding up Inter-Datacenter Traffic Allocation while Guaranteeing Deadlines

Ambry: LinkedIn s Scalable Geo- Distributed Object Store

Open-Channel SSDs Offer the Flexibility Required by Hyperscale Infrastructure Matias Bjørling CNEX Labs

Essential Features of an Integration Solution

Goals. Facebook s Scaling Problem. Scaling Strategy. Facebook Three Layer Architecture. Workload. Memcache as a Service.

FlexNIC: Rethinking Network DMA

Developing Enterprise Cloud Solutions with Azure

ReFlex: Remote Flash Local Flash

Mobile Edge Computing for 5G: The Communication Perspective

Advanced Computer Networks. Datacenter TCP

Storage Networking Strategy for the Next Five Years

Wedge A New Frontier for Pull-based Graph Processing. Samuel Grossman and Christos Kozyrakis Platform Lab Retreat June 8, 2018

Networking Recap Storage Intro. CSE-291 (Cloud Computing), Fall 2016 Gregory Kesden

Diffusing Your Mobile Apps: Extending In-Network Function Virtualisation to Mobile Function Offloading

WAN Application Infrastructure Fueling Storage Networks

TDT Appendix E Interconnection Networks

Module 17: "Interconnection Networks" Lecture 37: "Introduction to Routers" Interconnection Networks. Fundamentals. Latency and bandwidth

Application-defined Multipath TCP Scheduling

Data Center Networks and Switching and Queueing and Covert Timing Channels

EbbRT: A Framework for Building Per-Application Library Operating Systems

How Cisco IT Deployed Enterprise Messaging on Cisco UCS

P51: High Performance Networking

Memshare: a Dynamic Multi-tenant Key-value Cache

Performance and Scalability with Griddable.io

Topic 6: SDN in practice: Microsoft's SWAN. Student: Miladinovic Djordje Date:

Router s Queue Management

Advanced Computer Networks. Flow Control

Cisco I/O Accelerator Deployment Guide

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

Disks and their Cloudy Future

Infiniswap. Efficient Memory Disaggregation. Mosharaf Chowdhury. with Juncheng Gu, Youngmoon Lee, Yiwen Zhang, and Kang G. Shin

Xen and the Art of Virtualization. CSE-291 (Cloud Computing) Fall 2016

Venugopal Ramasubramanian Emin Gün Sirer SIGCOMM 04

Transcription:

Lightning Talks Platform Lab Students Stanford University

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

MC Switch: Application-Layer Load Balancing on a Switch

MC Switch: Application-Layer Load Balancing on a Switch

MC Switch: Application-Layer Load Balancing on a Switch

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Goals: Round-trip latency < 10µs for short messages Support high network load Simulation results: 99th percentile tail latency for short messages within 2x of ideal, at 80% load Can support 90% bandwidth utilization for variety of workloads Outperforms both pfabric and phost

Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Key ideas: Manage congestion from receiver: receiver schedules incoming packets Minimize buffer occupancy Small amount of unscheduled packets per message Use a few network priorities to prioritize small messages and resolve bipartite matching problem

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Distributing Simulations on Adaptive Grids OpenVDB is a hierarchical, sparse, geometric data structure, for representing 3D volumes with varying levels of details. Resembles B+ trees. Store data at multiple levels. Uses geometric information for fast accessors. Partitioning dynamic geometric structures and computations over them is challenging: Varying topology Varying partition sizes

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Building Blocks of a Modular Notification System Collin Lee and John Ousterhout Motivation Popular notification systems are full-featured but that means applications pay for features they don t need. New notification system focused on modularity, flexibility, and composability. Avoids unnecessary overhead in a unified system to improve performance for today s applications or enable granular computing. Want to chat? Notifications, Applications, Granular Computing, Drones, and much much more!

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Parallelizing Computational Applications Requires High Task Throughput Parallelize an application to more cores: 1. 2. 3. More tasks to keep cores busy. Individual tasks are shorter. More than 1 task per core gives scheduling flexibility. A limited task throughput hurts parallelization.

Existing Solutions Spark (centralized) TensorFlow (distributed, but static)

Our Solution A scheduling abstraction that enables nodes to spawn and executed tasks in parallel. A lightweight control plane which pushes scheduling decisions to nodes, while nodes redistribute tasks accordingly.

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

NumFabric: Fast and Flexible Bandwidth Allocation in Datacenters Several proposals for bandwidth allocation frameworks in last 5 years Each to achieve a specific flow-level objective Which one does the operator chose if he wants flexibility? NUMFabric provides a flexible data center fabric that is also fast

NumFabric: Flexibility Network Utility Maximization (NUM) NUMFabric can flexibly implement several operator objective functions like minimize average flow completion time, weighted max-min, MPTCP objective, etc.

Why are existing NUM approaches not fast?

NUMFabric : Fast

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Grazelle Hardware-Optimized In-Memory Graph Processing

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Don't Wait for Sync to Achieve Strong Consistency! Primary-backup is widely used because of its simplicity For strong consistency, updates had to wait for replication to backup Primary-Backup Async Replication (eg. Redis, TAO) Ø? Strong consistency (eg. MySQL,RAMCloud) Use RPC retry + async backup for consistent recovery Result: Consistency Strongest (linearizability) RAMCloud write Durability Same as synchronous backup Latency No delay by backup 14.6 µs 7.8 µs Throughput Maximum batching for backup 138 kops 500 kops (~3x)

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

NanoLog: A Nanosecond Scale Logging System 10-100x faster than its competitors such as Log4j2 or spdlog Key Concept: Shift work out of Runtime Achieves a throughput of over 60M log messages/second at a 12.5ns median latency Extract static log information at Compile-Time Only log dynamic information in binary format at Runtime Postpone message formatting until Post-Execution Talk later today

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

TorcDB: TinkerPop on RAMCloud

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Arachne: Make Multi-Core Great Again It is time for applications to take scheduling back from the kernel! Schedule threads in user land. Allocate cores in user land.

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

Self Driving Networks SILC: a System for Sensing, Inference, Learning and Control for the Cloud Infrastructure

Lightning Talks 1. MC Switch: Application-Layer Load Balancing on a Switch Eyal Cidon and Sean Choi 2. ReFlex: Remote Flash == Local Flash Ana Klimovic 3. Flashield: a key-value cache that minimizes write to flash Assaf Eisenman 4. Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri 5. Distributing Simulations on Adaptive Grids Chinmayee Shah 6. Building Blocks of a Modular Notification System Collin Lee 7. Dynamic, High-throughput Task Scheduling with Task Recipes Hang Qu 8. NumFabric - Fast and Flexible Bandwidth Allocation in Datacenters Kanthi Nagaraj 9. Grazelle: Hardware-Optimized In-Memory Graph Processing Samuel Robert Grossman 10. Don't wait for sync to achieve strong consistency Seo Jin Park 11. NanoLog: A Nanosecond Scale Logging System Stephen Yang 12. TorcDB: TinkerPop on RAMCloud Jonathan Ellithorpe 13. Arachne: Make Multi-Core Great Again Henry Qin and Jacqueline Speiser 14. SILC: A Sensing, Inference, Learning and Control System for a Self Driving Network Zi Yin and Shiyu Liu 15. High Speed Networks Need Proactive Congestion Control Lavanya Jose and Steve Ibanez

High Speed Networks Need Proactive Congestion Control

High Speed Networks Need Proactive Congestion Control Synchronous Scheme in NetFPGA SUME No Per Flow State. Static Flows. Holy Grail?? - Asynchronous Scheme - Simple to implement - No Per Flow State - Dynamic Flows.