Cisco Ultra Packet Core: High Performance AND Features. Aeneas Dodd-Noble, Principal Engineer; Daniel Walton, Director of Engineering. October 18, 2018



2 The World's Top Networks Rely On Cisco Ultra. M 600M 200G: Deployments, Total Subscribers, Total Sessions, Gbps / System (at par with physical)

3 Market Evolution. Delivering the 1's: 1 Gbps at edge, 1 ms latency, 1 billion connections. Transition to Virtual; Performance; 4G; Ultra Services Platform; UGP, USF, UPP; ASR 5500; Ultra; Performance parity; Functional parity; Scale; Distributed Architecture; Slicing; Low Latency; Gig-Speed; Automation; Containers; Micro-Services; Cloud Native; 5G; Ultra; Ultra Platform with CUPS; 5G Network Functions; Massive IoT; Multi-Access; Automated Lifecycle Management; Dynamic Distributed Slices; Micro-Services Architecture.

4 Capacity Growth - Fact PB. 2.5 Tbps peak hour throughput (N/A Tier 1); x1.8 peak hour throughput YoY (APJC Tier 2); 8 PB total daily volume (N/A Tier 1); x2.5 peak hour throughput YoY (APJC Tier 1).

5 5G Core is Distributed by Design. Centralized services and connectivity; highly distributed and fragmented network and services. [Figure: 5G logical layout and actual layout, with SMFs serving a red slice and an orange slice.]

6 Performance

7 Background. Despite being best known for hardware forwarding, Cisco has always built high-performance packet forwarding software: exception path processing and CPU-centric products. VPP (Vector Packet Processor) began ~2002. VPP has been incorporated into many hardware and software products and more recently has been open-sourced as part of FD.io.

8 What is VPP/FD.io?

9 FD.io is a project at the Linux Foundation: multi-party, multi-project. Software dataplane: high throughput, low latency, feature rich, resource efficient; bare metal/VM/container; multiplatform. FD.io scope: Dataplane Management Agents (control plane); Packet Processing (classify/transform/prioritize/forward/terminate); Network IO (NIC/vNIC, cores/threads).

10 Multiparty: FD.io Members. Service providers, network vendors, chip vendors, integrators.

11 Multiparty: Contributor/Committer Diversity. Yandex, Qiniu, Universitat Politècnica de Catalunya (UPC).

12 Read more at

13 How does VPP work?

14 Memory is the enemy. A single 10GbE port is capable of about 14 Mpps (14.88 Mpps with minimum-size 64-byte frames). On a 3.5GHz CPU core, that leaves roughly 250 cycles per packet: each packet must be processed in 67ns. Main (DDR) memory is 70ns away, which is >100 clock cycles. On Intel Sandy Bridge CPUs the caches are 4/12/30 clock cycles away. If we are serious about performance, we must optimize the code for cache and memory operations. Programming paradigm shift: scalar to vector.
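The per-packet budget on this slide can be reproduced with a few lines of arithmetic. The sketch below assumes minimum-size 64-byte frames plus the standard 20 bytes of Ethernet preamble and inter-frame gap; the constant and function names are illustrative only. With the exact 14.88 Mpps rate the budget comes out near 235 cycles; the slide's 250 corresponds to rounding the rate down to 14 Mpps.

```python
# Per-packet time and cycle budget for a saturated 10GbE port.
LINK_BPS = 10e9        # 10GbE line rate in bits per second
FRAME_BYTES = 64       # minimum Ethernet frame size (assumption: worst case)
OVERHEAD_BYTES = 20    # preamble + start delimiter (8) and inter-frame gap (12)
CPU_HZ = 3.5e9         # core clock quoted on the slide
DRAM_NS = 70           # main-memory latency quoted on the slide


def packets_per_second(link_bps, frame_bytes, overhead_bytes=OVERHEAD_BYTES):
    """Maximum packet rate of a link for a given frame size."""
    bits_on_wire = (frame_bytes + overhead_bytes) * 8
    return link_bps / bits_on_wire


pps = packets_per_second(LINK_BPS, FRAME_BYTES)
ns_per_packet = 1e9 / pps              # time available per packet
cycles_per_packet = CPU_HZ / pps       # CPU cycles available per packet
dram_cycles = DRAM_NS * 1e-9 * CPU_HZ  # cost of one trip to main memory

print(f"packet rate:       {pps / 1e6:.2f} Mpps")
print(f"time per packet:   {ns_per_packet:.1f} ns")
print(f"cycles per packet: {cycles_per_packet:.0f}")
print(f"one DRAM access:   {dram_cycles:.0f} cycles")
```

Under these assumptions a single access to main memory costs about as much as the entire per-packet budget, which is the point the slide is making.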

15 Primer: Instruction and Data caches. Instruction cache (I-cache): stores only CPU instructions, holds branch prediction information, and helps pre-fetch incoming instructions for the instruction pipeline. Data cache (D-cache): a fast buffer that contains application data. The processor operates on data loaded from memory into the data cache and then from the cache into the CPU registers. Results are stored in registers, then in the cache, and finally in main memory.

16 Scalar Packet Processing. Packet processing path: Ethernet-Input, IPv4 Input, IPv4 lookup, IPv4 transmit, ECMP processing, LAG processing, Transmit. Processes only a single packet at a time. In scalar processing the whole code path cannot fit into the instruction cache, so each module is loaded into the instruction cache as it processes the packet. For example, with 7 modules processing each packet, 4 packets will cause 7*4=28 cache misses. This is a high performance hit; the workaround is bigger caches.

17 Vector Packet Processing. Processes more than one packet at a time. Grabs all available packets from the Rx device, forms a vector of packets (a "frame"), and processes the frame (vector) using a directed graph of nodes.

18 VPP Architecture. Input graph nodes (hw-accel-input from a hardware plugin, dpdk-input, vhost-user-input, af-packet-input) hand a vector of n packets to the packet processing graph. Graph nodes: ethernet-input, ip4-input, ip4-lookup, ip4-local, ip4-rewrite, ip6-input, ip6-lookup, ip6-local, ip6-rewrite, mpls-input, arp-input, plus plugin nodes custom-1, custom-2, custom-3. Software nodes are skipped where the work is already done by hardware. Plugins are first-class citizens that can add graph nodes, add API, and rearrange the graph, and they can be built independently of the VPP source tree.

19 How does vector packet processing work? It exploits the probability that most packets will follow the same path through the graph. This fixes I-cache thrashing: the I-cache is reloaded only after all packets have finished a node.

20 How does vector packet processing work? For example, here 4 packets will cause only 7 I-cache reloads, compared to 28 in scalar packet processing. The primary problems VPP is solving: reducing I-cache misses and reducing D-cache misses.
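The 28-versus-7 comparison can be checked with a toy model. The sketch below is only an illustration, not VPP code: it assumes an instruction cache that holds exactly one node's code at a time, so every switch to a different node costs one reload. The node names follow the scalar processing slide; the function names are made up for this example.

```python
# Toy I-cache model: the cache holds the code of exactly one node (assumption).
NODES = [
    "ethernet-input", "ipv4-input", "ipv4-lookup", "ipv4-transmit",
    "ecmp", "lag", "transmit",
]
PACKETS = ["p0", "p1", "p2", "p3"]


def icache_loads(schedule):
    """Count reloads: one every time the executing node changes."""
    loads, resident = 0, None
    for node, _packet in schedule:
        if node != resident:
            loads += 1
            resident = node
    return loads


def scalar_schedule(nodes, packets):
    """Run one packet through every node before touching the next packet."""
    return [(node, pkt) for pkt in packets for node in nodes]


def vector_schedule(nodes, packets):
    """Run every packet through one node before moving to the next node."""
    return [(node, pkt) for node in nodes for pkt in packets]


scalar_loads = icache_loads(scalar_schedule(NODES, PACKETS))
vector_loads = icache_loads(vector_schedule(NODES, PACKETS))
print("scalar:", scalar_loads)  # 7 nodes * 4 packets
print("vector:", vector_loads)  # 7 nodes, once each
```

In this model the scalar cost grows with the number of packets while the vector cost depends only on the number of nodes, so larger vectors amortize each reload over more packets.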

21 What happens when processing diverges? The same process applies, but to a subset of the packets. Each node still processes the whole set of packets that belong to that node. The scheduler takes care of node execution.
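Divergence can be sketched as a small graph dispatcher. This is a simplified illustration, not VPP's scheduler or API: each node is a hypothetical Python function that receives a list of packets and returns the packets grouped by next node, and the dispatcher keeps one pending vector per node. The node name "tx" and the packet dictionaries are assumptions made for this example.

```python
# Minimal graph dispatcher: each node processes all packets pending for it.

def ethernet_input(frame):
    """Split the frame by ethertype; this is where processing diverges."""
    out = {}
    for pkt in frame:
        nxt = "ip4-input" if pkt["ethertype"] == 0x0800 else "ip6-input"
        out.setdefault(nxt, []).append(pkt)
    return out


def forward_to(next_node):
    """Build a trivial node that hands its whole vector to next_node."""
    return lambda frame: {next_node: frame}


GRAPH = {
    "ethernet-input": ethernet_input,
    "ip4-input": forward_to("ip4-lookup"),
    "ip6-input": forward_to("ip6-lookup"),
    "ip4-lookup": forward_to("tx"),
    "ip6-lookup": forward_to("tx"),
    "tx": lambda frame: {},  # end of the graph
}


def run_graph(frame, start="ethernet-input"):
    """Dispatch until no node has pending packets; return (node, count) trace."""
    pending = {start: list(frame)}
    trace = []
    while pending:
        node = next(iter(pending))  # oldest pending node first
        batch = pending.pop(node)
        trace.append((node, len(batch)))
        for nxt, pkts in GRAPH[node](batch).items():
            pending.setdefault(nxt, []).extend(pkts)
    return trace


frame = [{"ethertype": 0x0800}] * 3 + [{"ethertype": 0x86DD}]
trace = run_graph(frame)
for node, count in trace:
    print(f"{node}: {count} packet(s)")
```

With three IPv4 packets and one IPv6 packet, the IPv4 and IPv6 nodes each run once on their own subset, and the two subsets merge again into a single vector of four at the hypothetical "tx" node.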

22 Sounds good. How fast?

23 VPP Performance at Scale. Phy-VS-Phy zero-packet-loss throughput, charted in Gbps and Mpps at 1518B and 64B frame sizes: IPv6 using 24 of 72 cores; IPv4 + 2k whitelist using 36 of 72 cores. Zero frame loss at 200 Mpps. Regular performance characterizations online, for 12 port 40GE: IMIX => 342 Gbps, 1518B => 462 Gbps, 64B => 238 Mpps. Hardware: Cisco UCS C460 M4, Intel C610 series chipset, 4 x Intel Xeon Processor E v3 (18 cores, 2.5GHz, 45MB cache), 2133 MHz, 512 GB total, 9 x 2p40GE Intel XL; 18 x 40GE = 720GE!! Latency: 7.7 trillion packets soak test; average latency <23 usec; min latency 7-10 usec; max latency 3.5 ms. Headroom: average vector size ~24-27; max vector size 255; headroom for much more throughput/features; the NIC/PCI bus is the limit, not VPP.

24 VPP Benefits from Intel Xeon Processor Developments: Increased Processor I/O Improves Packet Forwarding Rates. FD.io software + Intel Xeon hardware = Terabit Service SP platform. YESTERDAY: Broadwell server CPUs (Intel Xeon E5-2699v4, 22 cores, 2.2 GHz, 55MB cache), sockets linked by QPI. Network I/O: 160 Gbps. Core ALU: 4-wide parallel µops. Memory: 4 channels, 2400 MHz. Max power: 145W (TDP). TODAY: Skylake server CPUs (Intel Xeon Platinum 8180; Intel Xeon Platinum at 2.7 GHz with 33MB cache), sockets linked by UPI, Lewisburg PCH. Network I/O: 280 Gbps. Core ALU: 5-wide parallel µops. Memory: 6 channels, 2666 MHz. Max power: 205W (TDP). Gain shown: +75%. Packet forwarding rate: 1,120* Gbps. FD.io takes full advantage of faster Intel Xeon Scalable Processors with no code change required. Breaking the barrier of software defined network services: 1 terabit services on a single Intel Xeon server! * On compute platforms with all PCIe lanes from the processors routed to PCIe slots.

25 Features define your customers' experience, define how you charge/monetize, identify fraud, protect your RAN assets, and provide visibility into your mobile network. Do % of features follow from 4G to 5G? How, and when, does Slicing change this?

26 Is hardware still needed?

27 Hardware. Cisco has unrivaled expertise in packet forwarding in silicon: Cisco ASICs, FPGAs, GPUs, merchant silicon (NPUs, ASICs), SmartNICs. Feature/performance tradeoffs are limiting. Experience with the new software architecture is making software stronger than ever before. Continue to investigate/prototype.

28 Summary

29 Summary. Cisco Ultra Packet Core: feature rich; unmatched for feature/performance; many IP services beyond standards. Packet core user planes are changing/adapting: demand for cost/performance; new form factors (physical, virtual, containers). 4G data is growing fast ( x per year): the CUPS User Plane is an opportunity for SW deployments. 5G is coming: much higher data rates demand multithreaded solutions; needs small, distributed and public cloud UPF.

30 Thank You!


A Universal Terabit Network Dataplane

A Universal Terabit Network Dataplane A Universal Terabit Network Dataplane FastData.io Project th of July 07: material developed for Intel Purley Scalable Processor launch. Video clip version: https://www.youtube.com/watch?v=alj0xlevv. Accompanying

More information

A Universal Dataplane. FastData.io Project

A Universal Dataplane. FastData.io Project A Universal Dataplane FastData.io Project : A Universal Dataplane Platform for Native Cloud Network Services EFFICIENCY Most Efficient on the Planet Superior Performance PERFORMANCE Flexible and Extensible

More information

Ed Warnicke, Cisco. Tomasz Zawadzki, Intel

Ed Warnicke, Cisco. Tomasz Zawadzki, Intel Ed Warnicke, Cisco Tomasz Zawadzki, Intel Agenda SPDK iscsi target overview FD.io and VPP SPDK iscsi VPP integration Q&A 2 Notices & Disclaimers Intel technologies features and benefits depend on system

More information

Benchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1. Maciek Konstantynowicz

Benchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1. Maciek Konstantynowicz Benchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1 March 7 th, 2019 Georgii Tkachuk georgii.tkachuk@intel.com Maciek Konstantynowicz mkonstan@cisco.com Shrikant M. Shah shrikant.m.shah@intel.com

More information

High Performance Cloud-native Networking K8s Unleashing FD.io

High Performance Cloud-native Networking K8s Unleashing FD.io High Performance Cloud-native Networking K8s Unleashing FD.io Giles Heron Principal Engineer, Cisco giheron@cisco.com Maciek Konstantynowicz FD.io CSIT Project Lead Distinguished Engineer, Cisco mkonstan@cisco.com

More information

VPP The Universal Fast Dataplane

VPP The Universal Fast Dataplane VPP The Universal Fast Dataplane FD.io: The Universal Fast Dataplane Project at Linux Foundation Multi-party Multi-project Software Dataplane High throughput Low Latency Feature Rich Resource Efficient

More information

Empower Diverse Open Transport Layer Protocols in Cloud Networking GEORGE ZHAO DIRECTOR OSS & ECOSYSTEM, HUAWEI

Empower Diverse Open Transport Layer Protocols in Cloud Networking GEORGE ZHAO DIRECTOR OSS & ECOSYSTEM, HUAWEI Empower Diverse Open Transport Layer Protocols in Cloud Networking GEORGE ZHAO DIRECTOR OSS & ECOSYSTEM, HUAWEI Agenda FD.io Introduction Challenges in Container & Cloud Native Apps Proposed Solutions

More information

Recent Advances in Software Router Technologies

Recent Advances in Software Router Technologies Recent Advances in Software Router Technologies KRNET 2013 2013.6.24-25 COEX Sue Moon In collaboration with: Sangjin Han 1, Seungyeop Han 2, Seonggu Huh 3, Keon Jang 4, Joongi Kim, KyoungSoo Park 5 Advanced

More information

OpenFlow Software Switch & Intel DPDK. performance analysis

OpenFlow Software Switch & Intel DPDK. performance analysis OpenFlow Software Switch & Intel DPDK performance analysis Agenda Background Intel DPDK OpenFlow 1.3 implementation sketch Prototype design and setup Results Future work, optimization ideas OF 1.3 prototype

More information

Building high performance network functions in VPP. Ole Trøan, VPP contributor FOSDEM 2018

Building high performance network functions in VPP. Ole Trøan, VPP contributor FOSDEM 2018 Building high performance network functions in VPP Ole Trøan, ot@cisco.com, VPP contributor FOSDEM 2018 1 2 This talk? Goal: Make you into VPP developers Agenda: VPP architecture An example decomposed

More information

fd.io Intro Mark Gray fd.io Foundation 1

fd.io Intro Mark Gray fd.io Foundation 1 fd.io Intro Mark Gray fd.io Foundation 1 Evolution of Programmable Networking Many industries are transitioning to a more dynamic model to deliver network services The great unsolved problem is how to

More information

CICN Community Information-Centric Networking

CICN Community Information-Centric Networking CICN Community Information-Centric Networking : The Universal Dataplane Project at Linux Foundation Multi-party Multi-project Software Dataplane High throughput Low Latency Feature Rich Resource Efficient

More information

Copyright 2017 Intel Corporation

Copyright 2017 Intel Corporation Agenda Intel Xeon Scalable Platform Overview Architectural Enhancements 2 Platform Overview 3x16 PCIe* Gen3 2 or 3 Intel UPI 3x16 PCIe Gen3 Capabilities Details 10GbE Skylake-SP CPU OPA DMI Intel C620

More information

High Performance Cloud-native Networking K8s Unleashing FD.io

High Performance Cloud-native Networking K8s Unleashing FD.io High Performance Cloud-native Networking K8s Unleashing FD.io Giles Heron Principal Engineer, Cisco giheron@cisco.com Maciek Konstantynowicz FD.io CSIT Project Lead Distinguished Engineer, Cisco mkonstan@cisco.com

More information

OCP Engineering Workshop - Telco

OCP Engineering Workshop - Telco OCP Engineering Workshop - Telco Low Latency Mobile Edge Computing Trevor Hiatt Product Management, IDT IDT Company Overview Founded 1980 Workforce Approximately 1,800 employees Headquarters San Jose,

More information

PUSHING THE LIMITS, A PERSPECTIVE ON ROUTER ARCHITECTURE CHALLENGES

PUSHING THE LIMITS, A PERSPECTIVE ON ROUTER ARCHITECTURE CHALLENGES PUSHING THE LIMITS, A PERSPECTIVE ON ROUTER ARCHITECTURE CHALLENGES Greg Hankins APRICOT 2012 2012 Brocade Communications Systems, Inc. 2012/02/28 Lookup Capacity and Forwarding

More information

Pactron FPGA Accelerated Computing Solutions

Pactron FPGA Accelerated Computing Solutions Pactron FPGA Accelerated Computing Solutions Intel Xeon + Altera FPGA 2015 Pactron HJPC Corporation 1 Motivation for Accelerators Enhanced Performance: Accelerators compliment CPU cores to meet market

More information

100 GBE AND BEYOND. Diagram courtesy of the CFP MSA Brocade Communications Systems, Inc. v /11/21

100 GBE AND BEYOND. Diagram courtesy of the CFP MSA Brocade Communications Systems, Inc. v /11/21 100 GBE AND BEYOND 2011 Brocade Communications Systems, Inc. Diagram courtesy of the CFP MSA. v1.4 2011/11/21 Current State of the Industry 10 Electrical Fundamental 1 st generation technology constraints

More information

Extreme Networks Session Director

Extreme Networks Session Director Data Sheet Highlights Designed for 4G/LTE, 5G Mobile Network Operators, and IoT scale Maximizes utilization of existing monitoring solutions with subscriberaware network traffic load balancing, filtering,

More information

Benchmarking and Analysis of Software Network Data Planes

Benchmarking and Analysis of Software Network Data Planes Benchmarking and Analysis of Software Network Data Planes Maciek Konstantynowicz Distinguished Engineer, Cisco (FD.io CSIT Project Lead) Patrick Lu Performance Engineer, Intel Corporation, (FD.io pma_tools

More information

VPP Host Stack. TCP and Session Layers. Florin Coras, Dave Barach, Keith Burns, Dave Wallace

VPP Host Stack. TCP and Session Layers. Florin Coras, Dave Barach, Keith Burns, Dave Wallace Host Stack and Layers Florin Coras, Dave Barach, Keith Burns, Dave Wallace - A Universal Terabit Network Platform For Native Cloud Network Services Most Efficient on the Planet EFFICIENCY Superior Performance

More information

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Introduction: Modern computer architecture The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Motivation: Multi-Cores where and why Introduction: Moore s law Intel

More information

Dell EMC Networking: the Modern Infrastructure Platform

Dell EMC Networking: the Modern Infrastructure Platform Dell EMC Networking: the Modern Infrastructure Platform From Core to Edge to Campus. Fabio Bellini Network Sales Engineer CIO challenge: next-gen infrastructures are needed Traditional Open-Networking

More information

PacketShader as a Future Internet Platform

PacketShader as a Future Internet Platform PacketShader as a Future Internet Platform AsiaFI Summer School 2011.8.11. Sue Moon in collaboration with: Joongi Kim, Seonggu Huh, Sangjin Han, Keon Jang, KyoungSoo Park Advanced Networking Lab, CS, KAIST

More information

Fast packet processing in the cloud. Dániel Géhberger Ericsson Research

Fast packet processing in the cloud. Dániel Géhberger Ericsson Research Fast packet processing in the cloud Dániel Géhberger Ericsson Research Outline Motivation Service chains Hardware related topics, acceleration Virtualization basics Software performance and acceleration

More information

SmartNICs: Giving Rise To Smarter Offload at The Edge and In The Data Center

SmartNICs: Giving Rise To Smarter Offload at The Edge and In The Data Center SmartNICs: Giving Rise To Smarter Offload at The Edge and In The Data Center Jeff Defilippi Senior Product Manager Arm #Arm Tech Symposia The Cloud to Edge Infrastructure Foundation for a World of 1T Intelligent

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

Industry Collaboration and Innovation

Industry Collaboration and Innovation Industry Collaboration and Innovation Industry Landscape Key changes occurring in our industry Historical microprocessor technology continues to deliver far less than the historical rate of cost/performance

More information

OCP-T Spec Open Pod Update

OCP-T Spec Open Pod Update OCP-T Spec Open Pod Update Defining the next generation of OCP and RSD Open Pods Jeff Sharpe January, 2017 Building Forward Together Building Forward Together 1 Scope for OCP-T Pod for Frame/appliance

More information

Building a Platform Optimized for the Network Edge

Building a Platform Optimized for the Network Edge Building a Platform Optimized for the Network Edge MPLS + SDN + NFV WORLD 2018 Nicolas Bouthors, Enea Innovation Agenda Software Virtualization - Key Requirements Leveraging DPDK Multi-Function VNFs at

More information

Toward a Memory-centric Architecture

Toward a Memory-centric Architecture Toward a Memory-centric Architecture Martin Fink EVP & Chief Technology Officer Western Digital Corporation August 8, 2017 1 SAFE HARBOR DISCLAIMERS Forward-Looking Statements This presentation contains

More information

A Low Latency Solution Stack for High Frequency Trading. High-Frequency Trading. Solution. White Paper

A Low Latency Solution Stack for High Frequency Trading. High-Frequency Trading. Solution. White Paper A Low Latency Solution Stack for High Frequency Trading White Paper High-Frequency Trading High-frequency trading has gained a strong foothold in financial markets, driven by several factors including

More information

Enterprise Cloud Computing. Eddie Toh Platform Marketing Manager, APAC Data Centre Group Cisco Summit 2010, Kuala Lumpur

Enterprise Cloud Computing. Eddie Toh Platform Marketing Manager, APAC Data Centre Group Cisco Summit 2010, Kuala Lumpur 1 Enterprise Cloud Computing Eddie Toh Platform Marketing Manager, APAC Data Centre Group Cisco Summit 2010, Kuala Lumpur Agenda 2 Fundamentals of Enterprise Cloud Computing IT & Cloud Computing Requirements

More information

DELL EMC POWER EDGE R940 MAKES DE NOVO ASSEMBLY EASIER

DELL EMC POWER EDGE R940 MAKES DE NOVO ASSEMBLY EASIER DELL EMC POWER EDGE R940 MAKES DE NOVO ASSEMBLY EASIER Genome Assembly on Deep Sequencing data with SOAPdenovo2 ABSTRACT De novo assemblies are memory intensive since the assembly algorithms need to compare

More information

The Power of Batching in the Click Modular Router

The Power of Batching in the Click Modular Router The Power of Batching in the Click Modular Router Joongi Kim, Seonggu Huh, Keon Jang, * KyoungSoo Park, Sue Moon Computer Science Dept., KAIST Microsoft Research Cambridge, UK * Electrical Engineering

More information

Facilitating IP Development for the OpenCAPI Memory Interface Kevin McIlvain, Memory Development Engineer IBM. Join the Conversation #OpenPOWERSummit

Facilitating IP Development for the OpenCAPI Memory Interface Kevin McIlvain, Memory Development Engineer IBM. Join the Conversation #OpenPOWERSummit Facilitating IP Development for the OpenCAPI Memory Interface Kevin McIlvain, Memory Development Engineer IBM Join the Conversation #OpenPOWERSummit Moral of the Story OpenPOWER is the best platform to

More information

Maximizing heterogeneous system performance with ARM interconnect and CCIX

Maximizing heterogeneous system performance with ARM interconnect and CCIX Maximizing heterogeneous system performance with ARM interconnect and CCIX Neil Parris, Director of product marketing Systems and software group, ARM Teratec June 2017 Intelligent flexible cloud to enable

More information

CPU Project in Western Digital: From Embedded Cores for Flash Controllers to Vision of Datacenter Processors with Open Interfaces

CPU Project in Western Digital: From Embedded Cores for Flash Controllers to Vision of Datacenter Processors with Open Interfaces CPU Project in Western Digital: From Embedded Cores for Flash Controllers to Vision of Datacenter Processors with Open Interfaces Zvonimir Z. Bandic, Sr. Director Robert Golla, Sr. Fellow Dejan Vucinic,

More information

Topic & Scope. Content: The course gives

Topic & Scope. Content: The course gives Topic & Scope Content: The course gives an overview of network processor cards (architectures and use) an introduction of how to program Intel IXP network processors some ideas of how to use network processors

More information

NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications

NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications Outline RDMA Motivating trends iwarp NFS over RDMA Overview Chelsio T5 support Performance results 2 Adoption Rate of 40GbE Source: Crehan

More information

Intel s Architecture for NFV

Intel s Architecture for NFV Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

SERVER. Samuli Toivola Lead HW Architect Nokia

SERVER. Samuli Toivola Lead HW Architect Nokia SERVER AirFrame Open Rack Server with Integrated HW Acceleration. Samuli Toivola Lead HW Architect Nokia Nokia in Open Compute Project Nokia is a Platinum Member of the Open Compute Project and an OCP

More information

Accelerate Cloud Native with FD.io

Accelerate Cloud Native with FD.io Accelerate Cloud Native with FDio Naoyuki Mori, Ping Yu, Kinsella Ray, Hongjun Ni Intel Agenda FDio*: Cloud native acceleration framework Acceleration of Envoy with FDio* TCP and QAT Acceleration of Load

More information

Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server

Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server White Paper Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server Executive Summary This document describes the network I/O performance characteristics of the Cisco UCS S3260 Storage

More information

NetCache: Balancing Key-Value Stores with Fast In-Network Caching

NetCache: Balancing Key-Value Stores with Fast In-Network Caching NetCache: Balancing Key-Value Stores with Fast In-Network Caching Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soulé Jeongkeun Lee, Nate Foster, Changhoon Kim, Ion Stoica NetCache is a rack-scale key-value

More information

FD.io VPP & Ligato Use Cases. Contiv-VPP CNI plugin for Kubernetes IPSEC VPN gateway

FD.io VPP & Ligato Use Cases. Contiv-VPP CNI plugin for Kubernetes IPSEC VPN gateway FD.io & Ligato Use Cases Contiv- CNI plugin for Kubernetes IPSEC VPN gateway Project at Linux Foundation Multi-party Multi-project Software Dataplane High throughput Low Latency Feature Rich Resource Efficient

More information

NetCache: Balancing Key-Value Stores with Fast In-Network Caching

NetCache: Balancing Key-Value Stores with Fast In-Network Caching NetCache: Balancing Key-Value Stores with Fast In-Network Caching Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soulé Jeongkeun Lee, Nate Foster, Changhoon Kim, Ion Stoica NetCache is a rack-scale key-value

More information

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence TM Artificial Intelligence Server for Fraud Date: Q2 2017 Application: Artificial Intelligence Tags: Artificial intelligence, GPU, GTX 1080 TI HM Revenue & Customs The UK s tax, payments and customs authority

More information

Intel Workstation Technology

Intel Workstation Technology Intel Workstation Technology Turning Imagination Into Reality November, 2008 1 Step up your Game Real Workstations Unleash your Potential 2 Yesterday s Super Computer Today s Workstation = = #1 Super Computer

More information

Ultimate Workstation Performance

Ultimate Workstation Performance Product brief & COMPARISON GUIDE Intel Scalable Processors Intel W Processors Ultimate Workstation Performance Intel Scalable Processors and Intel W Processors for Professional Workstations Optimized to

More information

An FPGA-Based Optical IOH Architecture for Embedded System

An FPGA-Based Optical IOH Architecture for Embedded System An FPGA-Based Optical IOH Architecture for Embedded System Saravana.S Assistant Professor, Bharath University, Chennai 600073, India Abstract Data traffic has tremendously increased and is still increasing

More information

Broadcom Trident 3 Programmable, Varied And Volume

Broadcom Trident 3 Programmable, Varied And Volume Broadcom Trident 3 Programmable, Varied And Volume Greg FerroJune 14, 2017 Broadcom is announcing the Trident 3 ASIC. Takeaways: Trident 3 ASICs have increased performance. 5 versions of the Trident 3

More information

100 Gbps Open-Source Software Router? It's Here. Jim Thompson, CTO, Netgate

100 Gbps Open-Source Software Router? It's Here. Jim Thompson, CTO, Netgate 100 Gbps Open-Source Software Router? It's Here. Jim Thompson, CTO, Netgate @gonzopancho Agenda Edge Router Use Cases Need for Speed Cost, Flexibility, Control, Evolution The Engineering Challenge Solution

More information

What is 5g? Next generation of wireless networks Will provide higher speeds, greater capacity, and lower latency Will be capable of supporting billions of connected devices and things Distributes intelligence

More information

The Next Opportunity in the Data Centre

The Next Opportunity in the Data Centre The Next Opportunity in the Data Centre Application Centric Infrastructure Soni Jiandani Senior Vice President, Cisco THE NETWORK IS THE INFORMATION BROKER FOR ALL APPLICATIONS Applications Are Changing

More information

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020 Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct-2017 4:45 p.m. - 5:30 p.m. Moscone West - Room 3020 Big Data Talk Exploring New SSD Usage Models to Accelerate Cloud Performance

More information

Service Edge Virtualization - Hardware Considerations for Optimum Performance

Service Edge Virtualization - Hardware Considerations for Optimum Performance Service Edge Virtualization - Hardware Considerations for Optimum Performance Executive Summary This whitepaper provides a high level overview of Intel based server hardware components and their impact

More information

NFV Infrastructure for Media Data Center Applications

NFV Infrastructure for Media Data Center Applications NFV Infrastructure for Media Data Center Applications Today s Presenters Roger Sherwood Global Strategy & Business Development, Cisco Systems Damion Desai Account Manager for Datacenter, SDN, NFV and Mobility,

More information

FPGAs and Networking

FPGAs and Networking FPGAs and Networking Marc Kelly & Richard Hughes-Jones University of Manchester 12th July 27 1 Overview of Work Looking into the usage of FPGA's to directly connect to Ethernet for DAQ readout purposes.

More information

ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters

ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters ANIC Adapters Accolade s ANIC line of FPGA-based adapters/nics help accelerate security and networking

More information

Tile Processor (TILEPro64)

Tile Processor (TILEPro64) Tile Processor Case Study of Contemporary Multicore Fall 2010 Agarwal 6.173 1 Tile Processor (TILEPro64) Performance # of cores On-chip cache (MB) Cache coherency Operations (16/32-bit BOPS) On chip bandwidth

More information

Agenda. Introduction Network functions virtualization (NFV) promise and mission cloud native approach Where do we want to go with NFV?

Agenda. Introduction Network functions virtualization (NFV) promise and mission cloud native approach Where do we want to go with NFV? August, 2018 Agenda Introduction Network functions virtualization (NFV) promise and mission cloud native approach Where do we want to go with NFV? 2 Miroslaw Walukiewicz I m from Gdansk, Poland. 25 years

More information

Supra-linear Packet Processing Performance with Intel Multi-core Processors

Supra-linear Packet Processing Performance with Intel Multi-core Processors White Paper Dual-Core Intel Xeon Processor LV 2.0 GHz Communications and Networking Applications Supra-linear Packet Processing Performance with Intel Multi-core Processors 1 Executive Summary Advances

More information

PVPP: A Programmable Vector Packet Processor. Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim

PVPP: A Programmable Vector Packet Processor. Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim PVPP: A Programmable Vector Packet Processor Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim Fixed Set of Protocols Fixed-Function Switch Chip TCP IPv4 IPv6

More information

Highly Scalable, Non-RDMA NVMe Fabric. Bob Hansen,, VP System Architecture

Highly Scalable, Non-RDMA NVMe Fabric. Bob Hansen,, VP System Architecture A Cost Effective,, High g Performance,, Highly Scalable, Non-RDMA NVMe Fabric Bob Hansen,, VP System Architecture bob@apeirondata.com Storage Developers Conference, September 2015 Agenda 3 rd Platform

More information

Accelerating Contrail vrouter

Accelerating Contrail vrouter WHITE PAPER Accelerating Contrail vrouter WHEN DEPLOYED WITH THE JUNIPER NETWORKS CONTRAIL CLOUD NETWORKING PLATFORM, THE NETRONOME AGILIO VROUTER SOLUTION DELIVERS ACCELERATED PERFORMANCE THAT ENABLES

More information

Improve Performance of Kube-proxy and GTP-U using VPP

Improve Performance of Kube-proxy and GTP-U using VPP Improve Performance of Kube-proxy and GTP-U using VPP Hongjun Ni (hongjun.ni@intel.com) Danny Zhou (danny.zhou@intel.com) Johnson Li (johnson.li@intel.com) Network Platform Group, DCG, Intel Acknowledgement:

Backend for Software Data Planes

The Case for a Flexible Low-Level Backend for Software Data Planes. Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim. Why software data

LS-DYNA Performance Benchmark and Profiling. October 2017

Note: The following research was performed under the HPC Advisory Council activities. Participating vendors: LSTC, Huawei, Mellanox. Compute resource

Toward a unified architecture for LAN/WAN/WLAN/SAN switches and routers

Silvano Gai. The sellable HPSR Seamless LAN/WLAN/SAN/WAN Network as a platform System-wide network intelligence as platform for

CIT 668: System Architecture. Computer Systems Architecture

Topics: 1. System Components 2. Bandwidth and Latency 3. Processor 4. Memory 5. Storage 6. Network 7. Operating System 8. Performance Implications

HW Trends and Architectures

Pavel Tvrdík, Jiří Kašpar (ČVUT FIT). MI-POA, 2011, Lecture 1. prof. Ing. Pavel Tvrdík CSc., Ing. Jiří Kašpar, Department of Computer Systems, Faculty

INT 10011 10 G bit TCP Offload Engine SOC

Product brief, features and benefits summary: Highly customizable hardware IP block. Easily portable to ASIC flow, Xilinx/Altera FPGAs or Structured ASIC flow.

OpenCAPI Technology. Myron Slota Speaker name, Title OpenCAPI Consortium Company/Organization Name. Join the Conversation #OpenPOWERSummit

Industry Collaboration and Innovation. OpenCAPI Topics: Computation

Graph Database and Analytics in a GPU-Accelerated Cloud Offering

Blazegraph GPU @ Cirrascale Cloud. Brad Bebee, CEO, Blazegraph; Dave Driggers, Chief Executive and Technical Officer, Cirrascale Corporation

DynamIQ Processor Designs Using Cortex-A75 & Cortex-A55 for 5G Networks

Jeff Maguire, Senior Product Manager, Infrastructure IP Product Management, Arm. Arm Tech Symposia 2017. Agenda: 5G networks

Disaggregation and Virtualization within the Juniper Networks Mobile Cloud Architecture. White Paper

June 2017. Juniper Networks, Inc., 1133 Innovation Way, Sunnyvale, California 94089 USA. 408-745-2000. www.juniper.net

PacketShader: A GPU-Accelerated Software Router

Sangjin Han. In collaboration with: Keon Jang, KyoungSoo Park, Sue Moon. Advanced Networking Lab, CS, KAIST; Networked and Distributed Computing Systems Lab,

FAST FORWARD TO YOUR <NEXT> CREATION

The ultimate professional workstations powered by Intel Xeon processors. 7 September 2017. What's new: introducing the new Intel Xeon Scalable processor. Breakthrough performance

Catapult: A Reconfigurable Fabric for Petaflop Computing in the Cloud

Doug Burger, Director, Hardware, Devices, & Experiences, MSR NExT. November 15, 2015. The Cloud is a Growing Disruptor for HPC. Moore's

Jakub Cabal et al. CESNET

Configurable FPGA Packet Parser for Terabit Networks with Guaranteed Wire-Speed Throughput. 2018/02/27, FPGA, Monterey, USA. Introduction. Packet parsing: It is among basic operations

Interconnect Your Future

Paving the Path to Exascale. November 2017. Mellanox Accelerates Leading HPC and AI Systems: Summit CORAL System, Sierra CORAL System, Fastest Supercomputer in Japan, Fastest Supercomputer

Best Practices for Setting BIOS Parameters for Performance

White Paper. Cisco UCS E5-based M3 Servers. May 2013.

Altos R320 F3 Specifications. Product overview. Product views. Internal view

The Altos R320 F3 single-socket 1U rack server delivers great performance and enterprise-level scalability in a space-saving design. Proactive management utilities effectively handle SMB

ARISTA: Improving Application Performance While Reducing Complexity

October 2008. Contents: 1.0 Problem Statement #1; 1.1 Problem Statement #2; 1.2 Previous Options: More Servers and I/O Adapters; 1.3

High Performance Ethernet for Grid & Cluster Applications. Adam Filby Systems Engineer, EMEA

Agenda: Drivers & Applications; The Technology; Ethernet Everywhere; Ethernet as a Cluster interconnect; Ethernet

Networking at the Speed of Light

Dror Goldenberg, VP Software Architecture. MaRS Workshop, April 2017. Cloud The Software Defined Data Center Resource virtualization Efficient services VM, Containers uservices

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

FUT3056BU: VMware vSphere Scales on the Amazing Next-Gen Intel Xeon Architecture. VMworld 2017. Content: Not for publication. Tom Adelmeyer, Richard A. Brunner, Principal Engineer, Intel Principal Engineer,

SDA: Software-Defined Accelerator for Large-Scale DNN Systems

Jian Ouyang, Shiding Lin, Wei Qi, Yong Wang, Bo Yu (Baidu, Inc.); Song Jiang (Wayne State University). Introduction of Baidu A

Intel Enterprise Processors Technology

Kosuke Hirano, Enterprise Platforms Group. March 20, 2002. Agenda: Architecture in Enterprise Xeon Processor MP Next Generation Itanium Processor Interconnect Technology

Transformation through Innovation

INSSPG-2921. Sumeet Arora, Senior Vice President/GM, SP Network Systems. Service Providers' Biggest Challenges: Web scale breaks our current cost and design models. I don't

Experiences in Building a 100 Gbps (D)DoS Traffic Generator

DIY with a Single Commodity-off-the-shelf (COTS) Server. March 31, 2018. Umeda Sky Building Escalators. Surasak Sanguanpong, Surasak.S@ku.ac.th. About

Agilio CX 2x40GbE with OVS-TC

Performance report. OVS-TC with an Agilio CX SmartNIC can improve a simple L2 forwarding use case at least 2X. When scaled to real life use cases with complex rules tunneling

Disruptive Innovation in ethernet switching

Lincoln Dale, Principal Engineer, Arista Networks, ltd@aristanetworks.com. AusNOG 2012. Ethernet switches have had a pretty boring existence. The odd speed increase

Accelerating vrouter Contrail

White paper. A virtual router (vRouter) is a software only implementation of Layer 3 Internet Protocol (IP) routing. Routing functions that are traditionally delivered as dedicated

SDA: Software-Defined Accelerator for general-purpose big data analysis system

Jian Ouyang (ouyangjian@baidu.com), Wei Qi, Yong Wang, Yichen Tu, Jing Wang, Bowen Jia. Baidu is beyond a search engine Search

WHITE PAPER SINGLE & MULTI CORE PERFORMANCE OF AN ERASURE CODING WORKLOAD ON AMD EPYC

Introduction: With the EPYC processor line, AMD is expected to take a strong position in the server market including

SmartNIC Programming Models

Johann Tönsing 206--09 206 Open-NFP. Agenda: SmartNIC hardware; Pre-programmed vs. custom (C and/or P4) firmware; Programming models / offload models; Switching on NIC, with SR-IOV

Benchmarking of VPP. Arijit Pramanik RnD Project

Abstract: Vector Packet Processing Technology, currently released under FD.io, is a framework for high-speed packet processing in user-space. VPP is capable

Industry Collaboration and Innovation

OpenCAPI Topics: Industry Background; Technology Overview; Design Enablement; OpenCAPI Consortium. Industry Landscape: Key changes occurring in our industry. Historical microprocessor
