OpenFlow Software Switch & Intel DPDK. performance analysis
|
|
- Leslie Clark
- 5 years ago
- Views:
Transcription
1 OpenFlow Software Switch & Intel DPDK performance analysis
2 Agenda Background Intel DPDK OpenFlow 1.3 implementation sketch Prototype design and setup Results Future work, optimization ideas OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 2
3 Intel dpdk basics Why Intel DPDK? ( kernel space implementation is more restricted harder to develop and debug interrupts are still needed performance issues user space implementation over normal Linux kernel is slow user kernel memory separation, copy is slow - some workarounds exist (e.g. pcap mmap), but they are still not fast enough a similar, but less widespread solution: Main features poll mode driver: avoid using interrupts and scheduling direct I/O: packet or first X bytes is copied to L1 cache directly some details from the Intel DPDK tutorial will follow OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 3
4 Intel DPDK Basic Design Designed to run on any Intel architecture CPU Intel Atom to client cores to Sandy Bridge Essential to the IA value proposition PThread to bind h/w thread to s/w task Literally no scheduler overhead User Level Polled Mode Driver No Kernel Context / Interrupt Context Switching Overhead Huge Pages To Improve Performance 1Gig Huge as well as 2 Meg Page support Co-exists with Linux s 4 K Page Low Latency Cache and Memory Access DDIO - Cache Prefetch and rte_cache_aligned - memory 4
5 Understanding the Choices & Performance Setting the Direction for the Intel DPDK Scheduler (or why not) Hardware threads only No scheduler/task switcher - typical task switch time is between 200+ processor cycles (varies depending on processor architecture) Process bunch of packets at a time Cores process a bunch of packets at a time to amortize some latencies Prefetch Critical to latency hiding since we don t have software threads. Stalls on hardware threads are costly The queue-based model is key to making prefetch effective Locks Generally lockless implementations where-ever possible. A spinlock-unlock pair costs between cycles. Queues are lockless (single producer & multi-producer, single consumer) 5
6 Scheduler or Why not? Primary reason was performance: Task-switch overhead is typically a few hundred cycles FXSAVE/FXRSTOR are 100 and 150 cycles respectively (on Intel NetBurst ) Faster on recent processors, but not significantly Need to add cost of interrupt if pre-emptive To put that in perspective in a 10 GbE environment On a 3 GHz processor, for small (64B) packets, a packet arrives every 67.2 ns = 201 cycles For lower bandwidth environments, an essential thing to think about is the added CPU bandwidth consumed 6
7 Packet Bunching Done on the NIC today NIC Receive descriptors are bunched four to a cache line Writing back partial descriptors has a severe performance penalty Conflicts between CPU and I/O device on the same cache line Increases memory & PCI-E bandwidth usage Needed to overcome PCI-Express latencies All Intel Ethernet* controllers have settings that can be tweaked to control descriptor write-back Coalesce as many descriptors as possible on Receive Transmit side coalescing done as well (software controlled) Timer values can be set to control latency (EITR) Took the paradigm to the next level in having the fast-path process bunches of packets Facilitated by the queue abstraction * Other names and brands may be claimed as the property of others. 7
8 Prefetch Two types of prefetch hardware & software Hardware prefetch is issued by the core L1 DCU prefetcher: Streaming prefetcher triggered by ascending access to recently loaded data L1 IP-based strided prefetcher: triggered on individual loads with a stride L2 DPL: Prefetches data into L2 cache based on DCU requests Adjacent cache line (n, n+2, prefetch n+2) Strided prefetcher (e.g. skipped cache lines) Software prefetch needs to be issued appropriately ahead of time to be effective Too early could cause eviction before use Multiple types of software prefetch 8
9 Paging 1GB super-pages & 2 Meg Huge Page Support Performance implications Primarily due to D-TLB thrashing/replacement Paging performance drop is difficult to gauge really dependent on application Gets significantly worse as memory footprint increases Varies by architecture, but initial measurements suggested ~30% on L3 forwarding Quite often 2-3 D-TLB replacements per packet 9
10 Intel Data Direct I/O Technology (Intel DDIO) 1x SNB-EP 8C B0, 2.0GHz 10
11 Intel DPDK Performance IPv4 Layer 3 Forwarding on an IA Server Platform 64B Throughput Mpps Native Linux Kernel Performance 12 MPPS PS Native Linux Introduction of Integrated Memory Controller + Intel DPDK 42 MPPS Intel DPDK R MPPS PS R1.0 Intel DPDK Release 1.3 GHz 1C/1T = 18.6 Mpps GHz 1C/1T = 23.7 Mpps Intel DPDK Release 1.4 GHz 1C/1T = 23.9 Mpps Introduction of Integrated PCIe* Controller 110 MPPS R MPPS R1.3 1C/2T = 24 Mpps 1C/2T = 28.8 Mpps 1C/2T = 28.5 Mpps 255 MPPS R S Intel Xeon processor E5645 (2x6C Westmere) 2.4GHz S Intel Xeon processor E5540 (2x4C Nehalem) 2.53 GHz S Intel Xeon processor E5645 (2x6C Westmere-EP) 2.40 GHz S Intel Xeon E processors C1 Stepping (1x8C Sandy bridge-ep) 2.1 GHz 8 x 10GbE PCIe Gen S Intel Xeon E processors C1 Stepping (2x8C Sandy bridge-ep) 2.1 GHz 22 x 10GbE PCIe Gen S Intel Xeon E5-2658v2 (2x10C Ivy bridge-ep) 2.4 GHz 22 x 10GbE PCIe Gen2 Massive IA Performance Improvements since 2009, PCIe Gen3 will Offer Even Better Performance.! Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. 11
12 openflow basics Main idea: programmable networking flexibility, programmability together with high performance Problem: OF is either flexible OR fast today flexible rules with many tuples: use TCAM or slow lookup TCAM is expensive and uses a lot of power complex instructions and actions: high overhead for software implementations some solutions limit flexibility to increase performance (e.g. TTP) In theory performance should only depend on the data plane functions the node is implementing in the given scenario it should be irrelevant whether the device is executing a native implementation of the use case, or is executing OF rules programmed by a controller for the same purpose OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 12
13 add/rem/mod flow entry table ID Flow Table wildcard/prio lookup Action Set Execution Actions add/rem/mod group entry data access per packet processing data access by control plane data access by internal control liveliness propagation Flow Entry remove dependent flows on removal meter ID remove assoc. flow entry Instructions Group Entry group ID remove assoc. instructions Apply Actions group ID group ID Buckets bucket liveliness port ID Meter meter ID add/rem/mod meter port ID Action Set Port queue ID port liveliness Queue OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 13
14 Agenda Background OpenFlow 1.3 implementation sketch Intel DPDK Prototype design and setup Results Future work, optimization ideas OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 14
15 why new prototype Software prototypes investigated and not selected OVS: well-established, open source mainly for virtual environment, performance issues - OVS on Intel DPDK (OVDK) is an ongoing activity CPqD softswitch: used by ONF for prototyping new features, open source serious performance limitations Linc: Erlang based softswitch, open source runs in a VM environment, while we are primarily interested in close to the hardware solutions Hardware based prototypes / products they have serious limitations in terms of number of rules usually OF implementations use TCAM which has limited capacity usually hard to program / modify / add new features OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 15
16 configuration Simple MAC based forwarding 1, 10, 100, 1000, 2000 and 5000 DMAC rules currently with linear search always the last rule will match caching is not easy instruction = write action action set = Output (egress port) Intel DPDK based generator station (tgen) generates 15 Mpps (@ 64 Bytes / pkt) on one core OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 16
17 add/rem/mod flow entry table ID Flow Table 1 Flow Entry 2 wildcard/prio lookup Action Set Execution 4 5 Actions add/rem/mod group entry remove dependent flows on removal data access per packet processing data access by control plane data access by internal control liveliness propagation meter ID remove assoc. flow entry Instructions remove assoc. instructions 3 Apply Actions group ID group ID Group Entry Buckets bucket liveliness group ID port ID Meter meter ID add/rem/mod meter port ID Port Queue OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page queue ID Action Set port liveliness
18 measurement setup 3Com (mgmt) /24 1G GENERATOR Intel Xeon E x6 2.3 GHz 8x4 GB DDR3 SDRAM Intel Niantic (82599EB) 2x10 GbE 10G 10G 1G OF-SW Intel Xeon E x6 2.3 GHz 8x4 GB DDR3 SDRAM OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 18
19 Linux OF-SW core0 core3 core4 OF code core5 OF code core1 rx tx rx tx rx tx core2 Linux driver Intel DPDK driver q42 q52 q43 q53 ETH0 (1G) ETH1 (1G) ETH2 (10G) ETH3 (10G) ETH2 (10G) ETH3 (10G) GENERATOR OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 19
20 results Main results: 25% overhead vs. L2FWD (Intel s example) it was more without highly optimizing the software Pkt size L2FWD OF rule Mpps Gbps Mpps Gbps linear with nr. of rules (not surprisingly) Performance (kpps) Processing time (ns) So we began some investigation OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 20
21 some details Processing time per number of rules at small number of rules cache(s) are effectively used note that real traffic would behave better time per rule (ns) OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 21
22 improving further Preliminary results of current code: overhead was completely removed L2FWD OF rule OF OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 22
23 and further Current status: basically removed static OF overhead It s time for improving rule processing speed and implement control plane Basic ideas under discussion high-performance southbound interface minimize the need for locking, timeouts, etc. fast data plane execution flow caching lookup algorithm selection, selective TTP usage prediction OF 1.3 prototype measurement - EWSDN Public Ericsson AB Page 23
24
Fast packet processing in the cloud. Dániel Géhberger Ericsson Research
Fast packet processing in the cloud Dániel Géhberger Ericsson Research Outline Motivation Service chains Hardware related topics, acceleration Virtualization basics Software performance and acceleration
More informationDPDK Performance Report Release Test Date: Nov 16 th 2016
Test Date: Nov 16 th 2016 Revision History Date Revision Comment Nov 16 th, 2016 1.0 Initial document for release 2 Contents Audience and Purpose... 4 Test setup:... 4 Intel Xeon Processor E5-2699 v4 (55M
More informationDPDK Intel NIC Performance Report Release 18.02
DPDK Intel NIC Performance Report Test Date: Mar 14th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment Mar 15th, 2018 1.0 Initial document for release 2 Contents Audience
More informationDPDK Intel NIC Performance Report Release 18.05
DPDK Intel NIC Performance Report Test Date: Jun 1th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment Jun 4th, 2018 1.0 Initial document for release 2 Contents Audience and
More informationDPDK Intel NIC Performance Report Release 17.08
DPDK Intel NIC Performance Report Test Date: Aug 23th 2017 Author: Intel DPDK Validation team Revision History Date Revision Comment Aug 24th, 2017 1.0 Initial document for release 2 Contents Audience
More informationNext Gen Virtual Switch. CloudNetEngine Founder & CTO Jun Xiao
Next Gen Virtual Switch CloudNetEngine Founder & CTO Jun Xiao Agenda Thoughts on next generation virtual switch Technical deep dive on CloudNetEngine virtual switch Q & A 2 Major vswitches categorized
More informationThe Power of Batching in the Click Modular Router
The Power of Batching in the Click Modular Router Joongi Kim, Seonggu Huh, Keon Jang, * KyoungSoo Park, Sue Moon Computer Science Dept., KAIST Microsoft Research Cambridge, UK * Electrical Engineering
More informationIntel s Architecture for NFV
Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION
More informationVALE: a switched ethernet for virtual machines
L < > T H local VALE VALE -- Page 1/23 VALE: a switched ethernet for virtual machines Luigi Rizzo, Giuseppe Lettieri Università di Pisa http://info.iet.unipi.it/~luigi/vale/ Motivation Make sw packet processing
More informationDPDK Roadmap. Tim O Driscoll & Chris Wright Open Networking Summit 2017
DPDK Roadmap Tim O Driscoll & Chris Wright Open Networking Summit 2017 Agenda Overview: What is DPDK? What problems does it solve? Open source community and transition to Linux Foundation: Why is this
More informationFAQ. Release rc2
FAQ Release 19.02.0-rc2 January 15, 2019 CONTENTS 1 What does EAL: map_all_hugepages(): open failed: Permission denied Cannot init memory mean? 2 2 If I want to change the number of hugepages allocated,
More informationDPDK Vhost/Virtio Performance Report Release 18.05
DPDK Vhost/Virtio Performance Report Test Date: Jun 1 2018 Author: Intel DPDK Validation Team Revision History Date Revision Comment Jun 1st, 2018 1.0 Initial document for release 2 Release 18.02 Contents
More informationNVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit
NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit Ben Walker Data Center Group Intel Corporation 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Notices
More informationDPDK Vhost/Virtio Performance Report Release 18.11
DPDK Vhost/Virtio Performance Report Test Date: December 3st 2018 Author: Intel DPDK Validation Team Revision History Date Revision Comment December 3st, 2018 1.0 Initial document for release 2 Contents
More informationSupporting Fine-Grained Network Functions through Intel DPDK
Supporting Fine-Grained Network Functions through Intel DPDK Ivano Cerrato, Mauro Annarumma, Fulvio Risso - Politecnico di Torino, Italy EWSDN 2014, September 1st 2014 This project is co-funded by the
More informationPacketShader: A GPU-Accelerated Software Router
PacketShader: A GPU-Accelerated Software Router Sangjin Han In collaboration with: Keon Jang, KyoungSoo Park, Sue Moon Advanced Networking Lab, CS, KAIST Networked and Distributed Computing Systems Lab,
More informationTotal Cost of Ownership Analysis for a Wireless Access Gateway
white paper Communications Service Providers TCO Analysis Total Cost of Ownership Analysis for a Wireless Access Gateway An analysis of the total cost of ownership of a wireless access gateway running
More informationASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed
ASPERA HIGH-SPEED TRANSFER Moving the world s data at maximum speed ASPERA HIGH-SPEED FILE TRANSFER 80 GBIT/S OVER IP USING DPDK Performance, Code, and Architecture Charles Shiflett Developer of next-generation
More informationBe Fast, Cheap and in Control with SwitchKV. Xiaozhou Li
Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Goal: fast and cost-efficient key-value store Store, retrieve, manage key-value objects Get(key)/Put(key,value)/Delete(key) Target: cluster-level
More informationAn Experimental review on Intel DPDK L2 Forwarding
An Experimental review on Intel DPDK L2 Forwarding Dharmanshu Johar R.V. College of Engineering, Mysore Road,Bengaluru-560059, Karnataka, India. Orcid Id: 0000-0001- 5733-7219 Dr. Minal Moharir R.V. College
More informationHigh Performance Packet Processing with FlexNIC
High Performance Packet Processing with FlexNIC Antoine Kaufmann, Naveen Kr. Sharma Thomas Anderson, Arvind Krishnamurthy University of Washington Simon Peter The University of Texas at Austin Ethernet
More informationData Path acceleration techniques in a NFV world
Data Path acceleration techniques in a NFV world Mohanraj Venkatachalam, Purnendu Ghosh Abstract NFV is a revolutionary approach offering greater flexibility and scalability in the deployment of virtual
More informationSPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation
SPDK China Summit 2018 Ziye Yang Senior Software Engineer Network Platforms Group, Intel Corporation Agenda SPDK programming framework Accelerated NVMe-oF via SPDK Conclusion 2 Agenda SPDK programming
More informationAdvanced Computer Networks. End Host Optimization
Oriana Riva, Department of Computer Science ETH Zürich 263 3501 00 End Host Optimization Patrick Stuedi Spring Semester 2017 1 Today End-host optimizations: NUMA-aware networking Kernel-bypass Remote Direct
More informationLearning with Purpose
Network Measurement for 100Gbps Links Using Multicore Processors Xiaoban Wu, Dr. Peilong Li, Dr. Yongyi Ran, Prof. Yan Luo Department of Electrical and Computer Engineering University of Massachusetts
More informationNew Approach to OVS Datapath Performance. Founder of CloudNetEngine Jun Xiao
New Approach to OVS Datapath Performance Founder of CloudNetEngine Jun Xiao Agenda VM virtual network datapath evolvement Technical deep dive on a new OVS datapath Performance comparisons Q & A 2 VM virtual
More informationThe Transition to PCI Express* for Client SSDs
The Transition to PCI Express* for Client SSDs Amber Huffman Senior Principal Engineer Intel Santa Clara, CA 1 *Other names and brands may be claimed as the property of others. Legal Notices and Disclaimers
More informationProgrammable NICs. Lecture 14, Computer Networks (198:552)
Programmable NICs Lecture 14, Computer Networks (198:552) Network Interface Cards (NICs) The physical interface between a machine and the wire Life of a transmitted packet Userspace application NIC Transport
More informationDPDK Summit China 2017
Summit China 2017 Embedded Network Architecture Optimization Based on Lin Hao T1 Networks Agenda Our History What is an embedded network device Challenge to us Requirements for device today Our solution
More informationZiye Yang. NPG, DCG, Intel
Ziye Yang NPG, DCG, Intel Agenda What is SPDK? Accelerated NVMe-oF via SPDK Conclusion 2 Agenda What is SPDK? Accelerated NVMe-oF via SPDK Conclusion 3 Storage Performance Development Kit Scalable and
More informationBe Fast, Cheap and in Control with SwitchKV Xiaozhou Li
Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li Raghav Sethi Michael Kaminsky David G. Andersen Michael J. Freedman Goal: fast and cost-effective key-value store Target: cluster-level storage for
More informationHigh-Speed Forwarding: A P4 Compiler with a Hardware Abstraction Library for Intel DPDK
High-Speed Forwarding: A P4 Compiler with a Hardware Abstraction Library for Intel DPDK Sándor Laki Eötvös Loránd University Budapest, Hungary lakis@elte.hu Motivation Programmability of network data plane
More informationDPDK Vhost/Virtio Performance Report Release 17.08
DPDK Vhost/Virtio Performance Report Test Date: August 15 th 2017 Author: Intel DPDK Validation team Revision History Date Revision Comment August 15 th, 2017 1.0 Initial document for release 2 Contents
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
NET1343BU NSX Performance Samuel Kommu #VMworld #NET1343BU Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no
More informationIntel Workstation Technology
Intel Workstation Technology Turning Imagination Into Reality November, 2008 1 Step up your Game Real Workstations Unleash your Potential 2 Yesterday s Super Computer Today s Workstation = = #1 Super Computer
More informationMWC 2015 End to End NFV Architecture demo_
MWC 2015 End to End NFV Architecture demo_ March 2015 demonstration @ Intel booth Executive summary The goal is to demonstrate how an advanced multi-vendor implementation of the ETSI ISG NFV architecture
More informationNetworking at the Speed of Light
Networking at the Speed of Light Dror Goldenberg VP Software Architecture MaRS Workshop April 2017 Cloud The Software Defined Data Center Resource virtualization Efficient services VM, Containers uservices
More informationImpact of Cache Coherence Protocols on the Processing of Network Traffic
Impact of Cache Coherence Protocols on the Processing of Network Traffic Amit Kumar and Ram Huggahalli Communication Technology Lab Corporate Technology Group Intel Corporation 12/3/2007 Outline Background
More informationIntel Open Network Platform. Recep Ozdag Intel Networking Division May 8, 2013
Intel Open Network Platform Recep Ozdag Intel Networking Division May 8, 2013 Agenda Traditional Networking vs. SDN Intel Open Network Platform (ONP) Introduction SDN Use Cases Future of ONP Traditional
More informationFlexNIC: Rethinking Network DMA
FlexNIC: Rethinking Network DMA Antoine Kaufmann Simon Peter Tom Anderson Arvind Krishnamurthy University of Washington HotOS 2015 Networks: Fast and Growing Faster 1 T 400 GbE Ethernet Bandwidth [bits/s]
More informationBen Walker Data Center Group Intel Corporation
Ben Walker Data Center Group Intel Corporation Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation.
More informationCisco Ultra Packet Core High Performance AND Features. Aeneas Dodd-Noble, Principal Engineer Daniel Walton, Director of Engineering October 18, 2018
Cisco Ultra Packet Core High Performance AND Features Aeneas Dodd-Noble, Principal Engineer Daniel Walton, Director of Engineering October 18, 2018 The World s Top Networks Rely On Cisco Ultra 90+ 300M
More information6.9. Communicating to the Outside World: Cluster Networking
6.9 Communicating to the Outside World: Cluster Networking This online section describes the networking hardware and software used to connect the nodes of cluster together. As there are whole books and
More informationFast packet processing in linux with af_xdp
Fast packet processing in linux with af_xdp Magnus Karlsson and Björn Töpel, Intel Legal Disclaimer Intel technologies may require enabled hardware, specific software, or services activation. Check with
More informationEd Warnicke, Cisco. Tomasz Zawadzki, Intel
Ed Warnicke, Cisco Tomasz Zawadzki, Intel Agenda SPDK iscsi target overview FD.io and VPP SPDK iscsi VPP integration Q&A 2 Notices & Disclaimers Intel technologies features and benefits depend on system
More informationData Plane Development Kit
Data Plane Development Kit Quality of Service (QoS) Cristian Dumitrescu SW Architect - Intel Apr 21, 2015 1 Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS.
More informationStorage Performance Development Kit (SPDK) Daniel Verkamp, Software Engineer
Storage Performance Development Kit (SPDK) Daniel Verkamp, Software Engineer Agenda Threading model discussion SPDK Environment Layer SPDK Application Framework SPDK Blockdev Layer SPDK Example Apps 2
More informationEvolution of the netmap architecture
L < > T H local Evolution of the netmap architecture Evolution of the netmap architecture -- Page 1/21 Evolution of the netmap architecture Luigi Rizzo, Università di Pisa http://info.iet.unipi.it/~luigi/vale/
More informationXilinx Answer QDMA Performance Report
Xilinx Answer 71453 QDMA Performance Report Important Note: This downloadable PDF of an Answer Record is provided to enhance its usability and readability. It is important to note that Answer Records are
More informationPacketShader as a Future Internet Platform
PacketShader as a Future Internet Platform AsiaFI Summer School 2011.8.11. Sue Moon in collaboration with: Joongi Kim, Seonggu Huh, Sangjin Han, Keon Jang, KyoungSoo Park Advanced Networking Lab, CS, KAIST
More informationDPDK Summit 2016 OpenContrail vrouter / DPDK Architecture. Raja Sivaramakrishnan, Distinguished Engineer Aniket Daptari, Sr.
DPDK Summit 2016 OpenContrail vrouter / DPDK Architecture Raja Sivaramakrishnan, Distinguished Engineer Aniket Daptari, Sr. Product Manager CONTRAIL (MULTI-VENDOR) ARCHITECTURE ORCHESTRATOR Interoperates
More informationTHE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF
14th ANNUAL WORKSHOP 2018 THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF Paul Luse Intel Corporation Apr 2018 AGENDA Storage Performance Development Kit What is SPDK? The SPDK Community Why are so
More informationThe Path to DPDK Speeds for AF XDP
The Path to DPDK Speeds for AF XDP Magnus Karlsson, magnus.karlsson@intel.com Björn Töpel, bjorn.topel@intel.com Linux Plumbers Conference, Vancouver, 2018 Legal Disclaimer Intel technologies may require
More informationDPDK Intel Cryptodev Performance Report Release 18.08
DPDK Intel Cryptodev Performance Report Test Date: August 7th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment August 7th, 2018 1.0 Initial document for release 2 Contents
More informationArrakis: The Operating System is the Control Plane
Arrakis: The Operating System is the Control Plane Simon Peter, Jialin Li, Irene Zhang, Dan Ports, Doug Woos, Arvind Krishnamurthy, Tom Anderson University of Washington Timothy Roscoe ETH Zurich Building
More informationAgilio CX 2x40GbE with OVS-TC
PERFORMANCE REPORT Agilio CX 2x4GbE with OVS-TC OVS-TC WITH AN AGILIO CX SMARTNIC CAN IMPROVE A SIMPLE L2 FORWARDING USE CASE AT LEAST 2X. WHEN SCALED TO REAL LIFE USE CASES WITH COMPLEX RULES TUNNELING
More informationAccelerating NVMe-oF* for VMs with the Storage Performance Development Kit
Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit Jim Harris Principal Software Engineer Intel Data Center Group Santa Clara, CA August 2017 1 Notices and Disclaimers Intel technologies
More informationPVPP: A Programmable Vector Packet Processor. Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim
PVPP: A Programmable Vector Packet Processor Sean Choi, Xiang Long, Muhammad Shahbaz, Skip Booth, Andy Keep, John Marshall, Changhoon Kim Fixed Set of Protocols Fixed-Function Switch Chip TCP IPv4 IPv6
More informationAn FPGA-Based Optical IOH Architecture for Embedded System
An FPGA-Based Optical IOH Architecture for Embedded System Saravana.S Assistant Professor, Bharath University, Chennai 600073, India Abstract Data traffic has tremendously increased and is still increasing
More informationDPDK Intel Cryptodev Performance Report Release 17.11
DPDK Intel Cryptodev Performance Report Test Date: Nov 20th 2017 Author: Intel DPDK Validation team Revision History Date Revision Comment Nov 20th, 2017 1.0 Initial document for release 2 Contents Audience
More informationBenchmarking and Analysis of Software Network Data Planes
Benchmarking and Analysis of Software Network Data Planes Maciek Konstantynowicz Distinguished Engineer, Cisco (FD.io CSIT Project Lead) Patrick Lu Performance Engineer, Intel Corporation, (FD.io pma_tools
More informationIntel Core TM Processor i C Embedded Application Power Guideline Addendum
Intel Core TM Processor i3-2115 C Embedded Application Power Guideline Addendum August 2012 Document Number: 327874-001US INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO
More informationMeasurement-based Analysis of TCP/IP Processing Requirements
Measurement-based Analysis of TCP/IP Processing Requirements Srihari Makineni Ravi Iyer Communications Technology Lab Intel Corporation {srihari.makineni, ravishankar.iyer}@intel.com Abstract With the
More informationReliably Scalable Name Prefix Lookup! Haowei Yuan and Patrick Crowley! Washington University in St. Louis!! ANCS 2015! 5/8/2015!
Reliably Scalable Name Prefix Lookup! Haowei Yuan and Patrick Crowley! Washington University in St. Louis!! ANCS 2015! 5/8/2015! ! My Topic for Today! Goal: a reliable longest name prefix lookup performance
More informationDesign and Implementation of Virtual TAP for Software-Defined Networks
Design and Implementation of Virtual TAP for Software-Defined Networks - Master Thesis Defense - Seyeon Jeong Supervisor: Prof. James Won-Ki Hong Dept. of CSE, DPNM Lab., POSTECH, Korea jsy0906@postech.ac.kr
More informationMeasuring a 25 Gb/s and 40 Gb/s data plane
Measuring a 25 Gb/s and 40 Gb/s data plane Christo Kleu Pervaze Akhtar 1 Contents Preliminaries Equipment Traffic generators Test topologies Host and VM configuration NUMA Architecture CPU allocation BIOS
More informationA Universal Dataplane. FastData.io Project
A Universal Dataplane FastData.io Project : A Universal Dataplane Platform for Native Cloud Network Services EFFICIENCY Most Efficient on the Planet Superior Performance PERFORMANCE Flexible and Extensible
More informationInterrupt Swizzling Solution for Intel 5000 Chipset Series based Platforms
Interrupt Swizzling Solution for Intel 5000 Chipset Series based Platforms Application Note August 2006 Document Number: 314337-002 Notice: This document contains information on products in the design
More informationAchieve Low Latency NFV with Openstack*
Achieve Low Latency NFV with Openstack* Yunhong Jiang Yunhong.Jiang@intel.com *Other names and brands may be claimed as the property of others. Agenda NFV and network latency Why network latency on NFV
More informationXen Network I/O Performance Analysis and Opportunities for Improvement
Xen Network I/O Performance Analysis and Opportunities for Improvement J. Renato Santos G. (John) Janakiraman Yoshio Turner HP Labs Xen Summit April 17-18, 27 23 Hewlett-Packard Development Company, L.P.
More informationChangpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group
Changpeng Liu Cloud Storage Software Engineer Intel Data Center Group Notices & Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software
More informationKeeping up with the hardware
Keeping up with the hardware Challenges in scaling I/O performance Jonathan Davies XenServer System Performance Lead XenServer Engineering, Citrix Cambridge, UK 18 Aug 2015 Jonathan Davies (Citrix) Keeping
More informationBenchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1. Maciek Konstantynowicz
Benchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1 March 7 th, 2019 Georgii Tkachuk georgii.tkachuk@intel.com Maciek Konstantynowicz mkonstan@cisco.com Shrikant M. Shah shrikant.m.shah@intel.com
More informationTOWARDS FAST IP FORWARDING
TOWARDS FAST IP FORWARDING IP FORWARDING PERFORMANCE IMPROVEMENT AND MEASUREMENT IN FREEBSD Nanako Momiyama Keio University 25th September 2016 EuroBSDcon 2016 OUTLINE Motivation Design and implementation
More informationNetronome 25GbE SmartNICs with Open vswitch Hardware Offload Drive Unmatched Cloud and Data Center Infrastructure Performance
WHITE PAPER Netronome 25GbE SmartNICs with Open vswitch Hardware Offload Drive Unmatched Cloud and NETRONOME AGILIO CX 25GBE SMARTNICS SIGNIFICANTLY OUTPERFORM MELLANOX CONNECTX-5 25GBE NICS UNDER HIGH-STRESS
More informationIntel Core TM i7-4702ec Processor for Communications Infrastructure
Intel Core TM i7-4702ec Processor for Communications Infrastructure Application Power Guidelines Addendum May 2014 Document Number: 330009-001US Introduction INFORMATION IN THIS DOCUMENT IS PROVIDED IN
More informationIsoStack Highly Efficient Network Processing on Dedicated Cores
IsoStack Highly Efficient Network Processing on Dedicated Cores Leah Shalev Eran Borovik, Julian Satran, Muli Ben-Yehuda Outline Motivation IsoStack architecture Prototype TCP/IP over 10GE on a single
More informationNetchannel 2: Optimizing Network Performance
Netchannel 2: Optimizing Network Performance J. Renato Santos +, G. (John) Janakiraman + Yoshio Turner +, Ian Pratt * + HP Labs - * XenSource/Citrix Xen Summit Nov 14-16, 2007 2003 Hewlett-Packard Development
More informationUsing (Suricata over) PF_RING for NIC-Independent Acceleration
Using (Suricata over) PF_RING for NIC-Independent Acceleration Luca Deri Alfredo Cardigliano Outlook About ntop. Introduction to PF_RING. Integrating PF_RING with
More informationBringing the Power of ebpf to Open vswitch. Linux Plumber 2018 William Tu, Joe Stringer, Yifeng Sun, Yi-Hung Wei VMware Inc. and Cilium.
Bringing the Power of ebpf to Open vswitch Linux Plumber 2018 William Tu, Joe Stringer, Yifeng Sun, Yi-Hung Wei VMware Inc. and Cilium.io 1 Outline Introduction and Motivation OVS-eBPF Project OVS-AF_XDP
More informationHKG net_mdev: Fast-path userspace I/O. Ilias Apalodimas Mykyta Iziumtsev François-Frédéric Ozog
HKG18-110 net_mdev: Fast-path userspace I/O Ilias Apalodimas Mykyta Iziumtsev François-Frédéric Ozog Why userland I/O Time sensitive networking Developed mostly for Industrial IOT, automotive and audio/video
More informationOpenDataplane project
OpenDataplane project ENOG15 2018, Russia, Moscow, 4-5 June Maxim Uvarov Senior Software engineer, Linaro Networking Group maxim.uvarov@linaro.org ENOG15, Russia, Moscow Data plane refers to all the
More informationMoonGen. A Scriptable High-Speed Packet Generator. Paul Emmerich. January 31st, 2016 FOSDEM Chair for Network Architectures and Services
MoonGen A Scriptable High-Speed Packet Generator Paul Emmerich January 31st, 216 FOSDEM 216 Chair for Network Architectures and Services Department of Informatics Paul Emmerich MoonGen: A Scriptable High-Speed
More informationRecent Advances in Software Router Technologies
Recent Advances in Software Router Technologies KRNET 2013 2013.6.24-25 COEX Sue Moon In collaboration with: Sangjin Han 1, Seungyeop Han 2, Seonggu Huh 3, Keon Jang 4, Joongi Kim, KyoungSoo Park 5 Advanced
More informationIntel Architecture for Software Developers
Intel Architecture for Software Developers 1 Agenda Introduction Processor Architecture Basics Intel Architecture Intel Core and Intel Xeon Intel Atom Intel Xeon Phi Coprocessor Use Cases for Software
More informationAll product specifications are subject to change without notice.
MSI N3000 series is cost-benefit rackmount network security. Basing on Intel Xeon E3-1200 v3/v4/v5 series CPU and Xeon D-1500 series SoC which is to help enterprise to be flexibly applied to various network
More informationAccelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing
Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product
More informationVPP Host Stack. TCP and Session Layers. Florin Coras, Dave Barach, Keith Burns, Dave Wallace
Host Stack and Layers Florin Coras, Dave Barach, Keith Burns, Dave Wallace - A Universal Terabit Network Platform For Native Cloud Network Services Most Efficient on the Planet EFFICIENCY Superior Performance
More informationWORKLOAD CHARACTERIZATION OF INTERACTIVE CLOUD SERVICES BIG AND SMALL SERVER PLATFORMS
WORKLOAD CHARACTERIZATION OF INTERACTIVE CLOUD SERVICES ON BIG AND SMALL SERVER PLATFORMS Shuang Chen*, Shay Galon**, Christina Delimitrou*, Srilatha Manne**, and José Martínez* *Cornell University **Cavium
More informationHigh Performance Solid State Storage Under Linux
High Performance Solid State Storage Under Linux Eric Seppanen, Matthew T. O Keefe, David J. Lilja Electrical and Computer Engineering University of Minnesota April 20, 2010 Motivation SSDs breaking through
More informationAdvanced Caching Techniques (2) Department of Electrical Engineering Stanford University
Lecture 4: Advanced Caching Techniques (2) Department of Electrical Engineering Stanford University http://eeclass.stanford.edu/ee282 Lecture 4-1 Announcements HW1 is out (handout and online) Due on 10/15
More information100 GBE AND BEYOND. Diagram courtesy of the CFP MSA Brocade Communications Systems, Inc. v /11/21
100 GBE AND BEYOND 2011 Brocade Communications Systems, Inc. Diagram courtesy of the CFP MSA. v1.4 2011/11/21 Current State of the Industry 10 Electrical Fundamental 1 st generation technology constraints
More informationCS3350B Computer Architecture
CS335B Computer Architecture Winter 25 Lecture 32: Exploiting Memory Hierarchy: How? Marc Moreno Maza wwwcsduwoca/courses/cs335b [Adapted from lectures on Computer Organization and Design, Patterson &
More informationIntel Speed Select Technology Base Frequency - Enhancing Performance
Intel Speed Select Technology Base Frequency - Enhancing Performance Application Note April 2019 Document Number: 338928-001 You may not use or facilitate the use of this document in connection with any
More informationIntel Open Network Platform Server (Release 1.3) Release Notes
(Release 1.3) Revision 1.1 February 2015 Revision History Date Revision Description February 19, 2015 1.1 Updated for release of 1.3 of Intel Network Platform Server 1.3 December 15, 2014 1.0 Initial release
More informationP51: High Performance Networking
P51: High Performance Networking Lecture 6: Programmable network devices Dr Noa Zilberman noa.zilberman@cl.cam.ac.uk Lent 2017/18 High Throughput Interfaces Performance Limitations So far we discussed
More informationLinux multi-core scalability
Linux multi-core scalability Oct 2009 Andi Kleen Intel Corporation andi@firstfloor.org Overview Scalability theory Linux history Some common scalability trouble-spots Application workarounds Motivation
More informationQorIQ Intelligent Network Interface Card (inic) Solution SDK v1.0 Update
QorIQ Intelligent Network Interface Card (inic) Solution SDK v1.0 Update APF-NET-T0658 Gary Chu A U G. 2 0 1 4 TM External Use The New Virtualized Network Cloud Cloud gaining momentum as the service platform
More informationChangpeng Liu. Senior Storage Software Engineer. Intel Data Center Group
Changpeng Liu Senior Storage Software Engineer Intel Data Center Group Legal Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware,
More informationThe Missing Piece of Virtualization. I/O Virtualization on 10 Gb Ethernet For Virtualized Data Centers
The Missing Piece of Virtualization I/O Virtualization on 10 Gb Ethernet For Virtualized Data Centers Agenda 10 GbE Adapters Built for Virtualization I/O Throughput: Virtual & Non-Virtual Servers Case
More information