MOVING FORWARD WITH FABRIC INTERFACES
|
|
- Dominick Day
- 5 years ago
- Views:
Transcription
1 14th ANNUAL WORKSHOP 2018 MOVING FORWARD WITH FABRIC INTERFACES Sean Hefty, OFIWG co-chair Intel Corporation April, 2018
2 USING THE PAST TO PREDICT THE FUTURE OFI Provider Infrastructure OFI API Exploration Companion APIs (Bonus!) 1.5 API Updates RxM provider SOCK endpoint types Memory registration API optimizations 2017 v v v v1.6.1 v1.6.2 v Provider Enhancements PSM2 native RxM performance SHM shared memory support Persistent memory 1.7 Predictions New providers RxD, multi-rail, new vendors SHM xpmem support API enhancements 2 OpenFabrics Alliance Workshop 2018
3 PROVIDER INFRASTRUCTURE 3 OpenFabrics Alliance Workshop 2018
4 ARUN AND DMITRY S AMAZING RXM PROVIDER High-priority OFI OFI RxM MPI / SHMEM MSG MSG MSG MSG Primary path for HPC apps accessing verbs hardware Verbs NetworkDirect TCP Optimizes for hardware features TCP to replace sockets MSG MSG MSG MSG Connection multiplexing Strong MPI performance Evaluating tighter provider coupling 4 OpenFabrics Alliance Workshop 2018
5 RXD Focus for v1.7 OFI OFI RxD MPI / SHMEM DGRAM DGRAM Offload large transfers Future path for HPC scalability Verbs UD usnic UDP Raw Ethernet Other..? Fast development path for hardware support Extend features of simple provider DGRAM DGRAM DGRAM Reliability, segmentation, and reassembly DGRAM Re-designing for performance and scalability Analyzing provider specific optimizations 5 OpenFabrics Alliance Workshop 2018
6 ALEXIA S FANTASTIC SHARED MEMORY PROVIDER Now available in stores near you! SMR SHM Provider Shared Memory Region SMR One-sided and two-sided transfers SMR CMA (cross-memory attach) for large transfers Version Flags PID Region Size Lock Shared memory primitives Command Queue Response Queue Inject Buffers Peer Address Map Single command queue xpmem support under development 6 OpenFabrics Alliance Workshop 2018
7 MEMORY MONITOR AND REGISTRATION CACHE Provider Driver notification, hook alloc/free, provider specific Internal API Get/put MRs Notification Notification Queue Queue subscribe events Memory Monitor Core Monitor Plug-in A generic solution is desired here Callbacks to add/delete MRs Registration Cache LRU List Custom Limits Usage Stats Merges overlapping regions MR Map MR MR MR Tracks active usage 7 OpenFabrics Alliance Workshop 2018
8 PERFORMANCE MONITORING Ex: Sample CPU instructions for various code paths Performance Data Set Event Data Event Data Count Event Data Count Sum Count Sum Sum Performance Management Unit CPU Cache Linux RDPMC Cycles Instructions Hits Misses Inline performance tracking NIC Performance domains? 8 OpenFabrics Alliance Workshop 2018
9 HOOKING PROVIDER Framework done, needs core integration OFI User OFI Core Always available release and debug builds Zero-impact unless enabled Hook Core/Util Provider Intercept calls to any provider Debugging, performance analysis, feature enhancements, testing 9 OpenFabrics Alliance Workshop 2018
10 API EXPLORATION 10 OpenFabrics Alliance Workshop 2018
11 VARIABLE LENGTH MESSAGES User User send Size unknown until sent receive size =? size = X X > transport msg size X Software layers duplicate feature Eager message rendezvous RMA read or tagged message MTU ack remaining transfer RMA write, tagged send, send RTS CLS transfer Similar wire protocols different implementations 11 OpenFabrics Alliance Workshop 2018
12 VARIABLE LENGTH MESSAGES No change at sender maybe send User Only lowest layer developer needs to figure out how to spell rendezvous! Report ready to receive completion User Claim/ Discard ID size = X ID + X Modeled after tagged message feature Opt-in impacts protocol Provider optimizes around hardware abilities Opportunity: report discard to sender Application flow control and load balancing Dynamically disable receive processing (e.g. EBUSY) 12 OpenFabrics Alliance Workshop 2018
13 MULTI-RAIL PROVIDER Focus for v1.7 Application or admin configured Multiple EPs, ports, NICs, fabrics One fi_info structure per rail OFI OFI mrail EP 1 EP User EP 1 EP 2 EP 2 Increase bandwidth and message rate Failover Rail selection plug-in EP 1 EP 2 Isolate rail selection algorithm Require variable message support TBD: recovery fallback Active Standby 13 OpenFabrics Alliance Workshop 2018
14 PERSISTENT MEMORY High-availability model (v1.6) Documentation limits use case RMA Write User PMEM MR New completion semantic User Register PMEM Persistent Memory Commit complete Work with SNIA (Storage Networking Industry Association) Evolve APIs to support other usage models Exploration Byte addressable or object aware Single or multi-transfer commit Advanced operations (e.g. atomics) Keep implementation agnostic Handle offload and on-load models Support multi-rail Minimize state footprint 14 OpenFabrics Alliance Workshop 2018
15 DATA DOMAINS APIs assume memory mapped regions Same coherency domain CPU load/stores CPU Memory PMEM May need to sync results with CPU Peer Device Device Memory (Smart) NIC Device FPGA Memory May not want to write data through CPU caches Memory regions may not be mapped Programmable offload capabilities and flow processing Results may be cached by NIC for long transactions 15 OpenFabrics Alliance Workshop 2018
16 COMPANION APIS 16 OpenFabrics Alliance Workshop 2018
17 C++ STANDARDIZATION Feedback from C++ community Implement proposal Detail alternatives Justify extensions Add support for fabrics directly to the C++ language Proposal Extend ASIO Implement over libfabric ASIO Model User Program Async Handler e.g. connect Async Handler e.g. transmits IO Object e.g. resolver IO Object e.g. socket Callback driven IO Service (tracks and progresses requests) Maps to all OFI asynchronous reporting objects
18 NOTIFICATION QUEUE Async Handler e.g. connect Extend to allow separation of control and data events Callback completion model Async Handler e.g. transmits IO Service (tracks and progresses requests) Notification Queue Event Handler Transmit Handler Receive Handler Error Handler Concurrency Wait Object Queue Size Signaling Vector Tx Format Rx Format dispatch() poll() post() run() stop() reset() Interfaces modeled after IO service Event Queue(s) Completion Queue(s) Wait Set Poll Set
19 RSOCKETS rsockets (librdmacm) A CM Verbs RC QP UD QP Increase OS & fabric portability Significantly boosts performance versus sockets with HW acceleration Pursuing OpenJDK integration OFI Maintain verbs protocol Verbs SOCK DGRAM EP SOCK STREAM EP Omni Path SOCK DGRAM EP SOCK STREAM EP rsockets (librsockets) TCP SOCK STREAM EP UDP SOCK DGRAM EP Network Direct SOCK STREAM EP Always available 19 OpenFabrics Alliance Workshop 2018
20 14th ANNUAL WORKSHOP 2018 THANK YOU Sean Hefty, President and CEO My Own Little World
Sayantan Sur, Intel. Presenting work done by Arun Ilango, Dmitry Gladkov, Dmitry Durnov and Sean Hefty and others in the OFIWG community
Sayantan Sur, Intel Presenting work done by Arun Ilango, Dmitry Gladkov, Dmitry Durnov and Sean Hefty and others in the OFIWG community 6 th Annual MVAPICH User Group (MUG) 2018 Legal Disclaimer & Optimization
More informationAn Introduction to the OpenFabrics Interface. #OFAUserGroup Paul Grun Cray w/ slides stolen (with pride) from Sean Hefty
An Introduction to the OpenFabrics Interface #OFAUserGroup Paul Grun Cray w/ slides stolen (with pride) from Sean Hefty Agenda Where the OFA is going Forming the OFI WG First Principles Application-centric
More informationOPENFABRICS INTERFACES: PAST, PRESENT, AND FUTURE
12th ANNUAL WORKSHOP 2016 OPENFABRICS INTERFACES: PAST, PRESENT, AND FUTURE Sean Hefty OFIWG Co-Chair [ April 5th, 2016 ] OFIWG: develop interfaces aligned with application needs Open Source Expand open
More informationOpen Fabrics Interfaces Architecture Introduction. Sean Hefty Intel Corporation
Open Fabrics Interfaces Architecture Introduction Sean Hefty Intel Corporation Current State of Affairs OFED software Widely adopted low-level RDMA API Ships with upstream Linux but OFED SW was not designed
More informationUSING OPEN FABRIC INTERFACE IN INTEL MPI LIBRARY
14th ANNUAL WORKSHOP 2018 USING OPEN FABRIC INTERFACE IN INTEL MPI LIBRARY Michael Chuvelev, Software Architect Intel April 11, 2018 INTEL MPI LIBRARY Optimized MPI application performance Application-specific
More informationOpenFabrics Interface WG A brief introduction. Paul Grun co chair OFI WG Cray, Inc.
OpenFabrics Interface WG A brief introduction Paul Grun co chair OFI WG Cray, Inc. OFI WG a brief overview and status report 1. Keep everybody on the same page, and 2. An example of a possible model for
More informationOPENFABRICS INTERFACES: PAST, PRESENT, AND FUTURE
OPENFABRICS INTERFACES: PAST, PRESENT, AND FUTURE Sean Hefty Openfabrics Interfaces Working Group Co-Chair Intel November 2016 OFIWG: develop interfaces aligned with application needs Open Source Expand
More informationFabric Interfaces Architecture. Sean Hefty - Intel Corporation
Fabric Interfaces Architecture Sean Hefty - Intel Corporation Changes v2 Remove interface object Add open interface as base object Add SRQ object Add EQ group object www.openfabrics.org 2 Overview Object
More informationOPENSHMEM AND OFI: BETTER TOGETHER
4th ANNUAL WORKSHOP 208 OPENSHMEM AND OFI: BETTER TOGETHER James Dinan, David Ozog, and Kayla Seager Intel Corporation [ April, 208 ] NOTICES AND DISCLAIMERS Intel technologies features and benefits depend
More informationSayantan Sur, Intel. SEA Symposium on Overlapping Computation and Communication. April 4 th, 2018
Sayantan Sur, Intel SEA Symposium on Overlapping Computation and Communication April 4 th, 2018 Legal Disclaimer & Benchmark results were obtained prior to implementation of recent software patches and
More informationPersistent Memory over Fabrics
Persistent Memory over Fabrics Rob Davis, Mellanox Technologies Chet Douglas, Intel Paul Grun, Cray, Inc Tom Talpey, Microsoft Santa Clara, CA 1 Agenda The Promise of Persistent Memory over Fabrics Driving
More informationRemote Persistent Memory SNIA Nonvolatile Memory Programming TWG
Remote Persistent Memory SNIA Nonvolatile Memory Programming TWG Tom Talpey Microsoft 2018 Storage Developer Conference. SNIA. All Rights Reserved. 1 Outline SNIA NVMP TWG activities Remote Access for
More informationAdvanced Computer Networks. End Host Optimization
Oriana Riva, Department of Computer Science ETH Zürich 263 3501 00 End Host Optimization Patrick Stuedi Spring Semester 2017 1 Today End-host optimizations: NUMA-aware networking Kernel-bypass Remote Direct
More information2017 Storage Developer Conference. Mellanox Technologies. All Rights Reserved.
Ethernet Storage Fabrics Using RDMA with Fast NVMe-oF Storage to Reduce Latency and Improve Efficiency Kevin Deierling & Idan Burstein Mellanox Technologies 1 Storage Media Technology Storage Media Access
More informationSayantan Sur, Intel. ExaComm Workshop held in conjunction with ISC 2018
Sayantan Sur, Intel ExaComm Workshop held in conjunction with ISC 2018 Legal Disclaimer & Optimization Notice Software and workloads used in performance tests may have been optimized for performance only
More informationScalable Fabric Interfaces
Scalable Fabric Interfaces Sean Hefty Intel Corporation OFI software will be backward compatible OFI WG Charter Develop an extensible, open source framework and interfaces aligned with ULP and application
More informationRavindra Babu Ganapathi
14 th ANNUAL WORKSHOP 2018 INTEL OMNI-PATH ARCHITECTURE AND NVIDIA GPU SUPPORT Ravindra Babu Ganapathi Intel Corporation [ April, 2018 ] Intel MPI Open MPI MVAPICH2 IBM Platform MPI SHMEM Intel MPI Open
More informationADVANCED PGAS CENTRIC USAGE OF THE OPENFABRICS INTERFACE
13 th ANNUAL WORKSHOP 2017 ADVANCED PGAS CENTRIC USAGE OF THE OPENFABRICS INTERFACE Erik Paulson, Kayla Seager, Sayantan Sur, James Dinan, Dave Ozog: Intel Corporation Collaborators: Howard Pritchard:
More informationHIGH-PERFORMANCE NETWORKING :: USER-LEVEL NETWORKING :: REMOTE DIRECT MEMORY ACCESS
HIGH-PERFORMANCE NETWORKING :: USER-LEVEL NETWORKING :: REMOTE DIRECT MEMORY ACCESS CS6410 Moontae Lee (Nov 20, 2014) Part 1 Overview 00 Background User-level Networking (U-Net) Remote Direct Memory Access
More informationWelcome to the IBTA Fall Webinar Series
Welcome to the IBTA Fall Webinar Series A four-part webinar series devoted to making I/O work for you Presented by the InfiniBand Trade Association The webinar will begin shortly. 1 September 23 October
More informationRDMA programming concepts
RDMA programming concepts Robert D. Russell InterOperability Laboratory & Computer Science Department University of New Hampshire Durham, New Hampshire 03824, USA 2013 Open Fabrics Alliance,
More informationThe Exascale Architecture
The Exascale Architecture Richard Graham HPC Advisory Council China 2013 Overview Programming-model challenges for Exascale Challenges for scaling MPI to Exascale InfiniBand enhancements Dynamically Connected
More informationNTRDMA v0.1. An Open Source Driver for PCIe NTB and DMA. Allen Hubbe at Linux Piter 2015 NTRDMA. Messaging App. IB Verbs. dmaengine.h ntb.
Messaging App IB Verbs NTRDMA dmaengine.h ntb.h DMA DMA DMA NTRDMA v0.1 An Open Source Driver for PCIe and DMA Allen Hubbe at Linux Piter 2015 1 INTRODUCTION Allen Hubbe Senior Software Engineer EMC Corporation
More informationPersistent Memory Over Fabrics. Paul Grun, Cray Inc Stephen Bates, Eideticom Rob Davis, Mellanox Technologies
Persistent Memory Over Fabrics Paul Grun, Cray Inc Stephen Bates, Eideticom Rob Davis, Mellanox Technologies Agenda Persistent Memory as viewed by a consumer, and some guidance to the fabric community
More informationRDMA in Embedded Fabrics
RDMA in Embedded Fabrics Ken Cain, kcain@mc.com Mercury Computer Systems 06 April 2011 www.openfabrics.org 2011 Mercury Computer Systems, Inc. www.mc.com Uncontrolled for Export Purposes 1 Outline Embedded
More informationRemote Persistent Memory With Nothing But Net Tom Talpey Microsoft
Remote Persistent Memory With Nothing But Net Tom Talpey Microsoft 1 Outline Aspiration RDMA NIC as a Persistent Memory storage adapter Steps to there: Flush Write-after-flush Integrity Privacy QoS Some
More informationWindows OpenFabrics (WinOF) Update
Windows OpenFabrics (WinOF) Update Eric Lantz, Microsoft (elantz@microsoft.com) April 2008 Agenda OpenFabrics and Microsoft Current Events HPC Server 2008 Release NetworkDirect - RDMA for Windows 2 OpenFabrics
More informationOPEN MPI AND RECENT TRENDS IN NETWORK APIS
12th ANNUAL WORKSHOP 2016 OPEN MPI AND RECENT TRENDS IN NETWORK APIS #OFADevWorkshop HOWARD PRITCHARD (HOWARDP@LANL.GOV) LOS ALAMOS NATIONAL LAB LA-UR-16-22559 OUTLINE Open MPI background and release timeline
More informationMOM MESSAGE ORIENTED MIDDLEWARE OVERVIEW OF MESSAGE ORIENTED MIDDLEWARE TECHNOLOGIES AND CONCEPTS. MOM Message Oriented Middleware
MOM MESSAGE ORIENTED MOM Message Oriented Middleware MIDDLEWARE OVERVIEW OF MESSAGE ORIENTED MIDDLEWARE TECHNOLOGIES AND CONCEPTS Peter R. Egli 1/25 Contents 1. Synchronous versus asynchronous interaction
More informationIO virtualization. Michael Kagan Mellanox Technologies
IO virtualization Michael Kagan Mellanox Technologies IO Virtualization Mission non-stop s to consumers Flexibility assign IO resources to consumer as needed Agility assignment of IO resources to consumer
More information14th ANNUAL WORKSHOP 2018 NVMF TARGET OFFLOAD. Liran Liss. Mellanox Technologies. April 2018
14th ANNUAL WORKSHOP 2018 NVMF TARGET OFFLOAD Liran Liss Mellanox Technologies April 2018 AGENDA Introduction NVMe NVMf NVMf target driver Offload model Verbs interface Status 2 OpenFabrics Alliance Workshop
More informationA Brief Introduction to the OpenFabrics Interfaces
A Brief Introduction to the OpenFabrics Interfaces A New Network API for Maximizing High Performance Application Efficiency Paul Grun, Sean Hefty, Sayantan Sur, David Goodell, Robert D. Russell, Howard
More informationApplication Acceleration Beyond Flash Storage
Application Acceleration Beyond Flash Storage Session 303C Mellanox Technologies Flash Memory Summit July 2014 Accelerating Applications, Step-by-Step First Steps Make compute fast Moore s Law Make storage
More informationIsoStack Highly Efficient Network Processing on Dedicated Cores
IsoStack Highly Efficient Network Processing on Dedicated Cores Leah Shalev Eran Borovik, Julian Satran, Muli Ben-Yehuda Outline Motivation IsoStack architecture Prototype TCP/IP over 10GE on a single
More informationThe Common Communication Interface (CCI)
The Common Communication Interface (CCI) Presented by: Galen Shipman Technology Integration Lead Oak Ridge National Laboratory Collaborators: Scott Atchley, George Bosilca, Peter Braam, David Dillow, Patrick
More informationInfiniBand Linux Operating System Software Access Layer
Software Architecture Specification (SAS) Revision Draft 2 Last Print Date: 4/19/2002-9:04 AM Copyright (c) 1996-2002 Intel Corporation. All rights reserved. InfiniBand Linux Operating System Software
More informationContaining RDMA and High Performance Computing
Containing RDMA and High Performance Computing Liran Liss ContainerCon 2015 Agenda High Performance Computing (HPC) networking RDMA 101 Containing RDMA Challenges Solution approach RDMA network namespace
More informationLUSTRE NETWORKING High-Performance Features and Flexible Support for a Wide Array of Networks White Paper November Abstract
LUSTRE NETWORKING High-Performance Features and Flexible Support for a Wide Array of Networks White Paper November 2008 Abstract This paper provides information about Lustre networking that can be used
More informationSCTP s Reliability and Fault Tolerance
SCTP s Reliability and Fault Tolerance Brad Penoff, Mike Tsai, and Alan Wagner Department of Computer Science University of British Columbia Vancouver, Canada Distributed Systems Group Seattle Conference
More informationApplication Access to Persistent Memory The State of the Nation(s)!
Application Access to Persistent Memory The State of the Nation(s)! Stephen Bates, Paul Grun, Tom Talpey, Doug Voigt Microsemi, Cray, Microsoft, HPE The Suspects Stephen Bates Microsemi Paul Grun Cray
More informationInterconnect Your Future
Interconnect Your Future Smart Interconnect for Next Generation HPC Platforms Gilad Shainer, August 2016, 4th Annual MVAPICH User Group (MUG) Meeting Mellanox Connects the World s Fastest Supercomputer
More informationPARAVIRTUAL RDMA DEVICE
12th ANNUAL WORKSHOP 2016 PARAVIRTUAL RDMA DEVICE Aditya Sarwade, Adit Ranadive, Jorgen Hansen, Bhavesh Davda, George Zhang, Shelley Gong VMware, Inc. [ April 5th, 2016 ] MOTIVATION User Kernel Socket
More informationExtending RDMA for Persistent Memory over Fabrics. Live Webcast October 25, 2018
Extending RDMA for Persistent Memory over Fabrics Live Webcast October 25, 2018 Today s Presenters John Kim SNIA NSF Chair Mellanox Tony Hurson Intel Rob Davis Mellanox SNIA-At-A-Glance 3 SNIA Legal Notice
More informationDevice-Functionality Progression
Chapter 12: I/O Systems I/O Hardware I/O Hardware Application I/O Interface Kernel I/O Subsystem Transforming I/O Requests to Hardware Operations Incredible variety of I/O devices Common concepts Port
More informationChapter 12: I/O Systems. I/O Hardware
Chapter 12: I/O Systems I/O Hardware Application I/O Interface Kernel I/O Subsystem Transforming I/O Requests to Hardware Operations I/O Hardware Incredible variety of I/O devices Common concepts Port
More informationIntroduction to OpenOnload Building Application Transparency and Protocol Conformance into Application Acceleration Middleware
White Paper Introduction to OpenOnload Building Application Transparency and Protocol Conformance into Application Acceleration Middleware Steve Pope, PhD Chief Technical Officer Solarflare Communications
More informationAdvancing RDMA. A proposal for RDMA on Enhanced Ethernet. Paul Grun SystemFabricWorks
Advancing RDMA A proposal for RDMA on Enhanced Ethernet Paul Grun SystemFabricWorks pgrun@systemfabricworks.com Objective: Accelerate the adoption of RDMA technology Why bother? I mean, who cares about
More informationETHERNET OVER INFINIBAND
14th ANNUAL WORKSHOP 2018 ETHERNET OVER INFINIBAND Evgenii Smirnov and Mikhail Sennikovsky ProfitBricks GmbH April 10, 2018 ETHERNET OVER INFINIBAND: CURRENT SOLUTIONS mlx4_vnic Currently deprecated Requires
More informationDeveloping deterministic networking technology for railway applications using TTEthernet software-based end systems
Developing deterministic networking technology for railway applications using TTEthernet software-based end systems Project n 100021 Astrit Ademaj, TTTech Computertechnik AG Outline GENESYS requirements
More informationLessons learned from MPI
Lessons learned from MPI Patrick Geoffray Opinionated Senior Software Architect patrick@myri.com 1 GM design Written by hardware people, pre-date MPI. 2-sided and 1-sided operations: All asynchronous.
More informationCERN openlab Summer 2006: Networking Overview
CERN openlab Summer 2006: Networking Overview Martin Swany, Ph.D. Assistant Professor, Computer and Information Sciences, U. Delaware, USA Visiting Helsinki Institute of Physics (HIP) at CERN swany@cis.udel.edu,
More informationPERSISTENT MEMORY PROGRAMMING
14th ANNUAL WORKSHOP 2018 PERSISTENT MEMORY PROGRAMMING THE REMOTE ACCESS PERSPECTIVE Tom Talpey, Architect Microsoft April 10, 2018 OUTLINE SNIA NVMP Programming Model PMEM Remote Access considerations
More informationVPP Host Stack. Transport and Session Layers. Florin Coras, Dave Barach
Host Stack Transport and Layers Florin Coras, Dave Barach - A Universal Terabit Network Platform For Native Cloud Network Services Most Efficient on the Planet EFFICIENCY Superior Performance PERFORMANCE
More informationChapter 13: I/O Systems
Chapter 13: I/O Systems I/O Hardware Application I/O Interface Kernel I/O Subsystem Transforming I/O Requests to Hardware Operations Streams Performance I/O Hardware Incredible variety of I/O devices Common
More informationAsynchronous Peer-to-Peer Device Communication
13th ANNUAL WORKSHOP 2017 Asynchronous Peer-to-Peer Device Communication Feras Daoud, Leon Romanovsky [ 28 March, 2017 ] Agenda Peer-to-Peer communication PeerDirect technology PeerDirect and PeerDirect
More informationRapidIO.org Update. Mar RapidIO.org 1
RapidIO.org Update rickoco@rapidio.org Mar 2015 2015 RapidIO.org 1 Outline RapidIO Overview & Markets Data Center & HPC Communications Infrastructure Industrial Automation Military & Aerospace RapidIO.org
More informationDPDK Integration within F5 BIG-IP BRENT BLOOD, SR MANAGER SOFTWARE ENGINEERING VIJAY MANICKAM, SR SOFTWARE ENGINEER
x DPDK Integration within F5 BIG-IP BRENT BLOOD, SR MANAGER SOFTWARE ENGINEERING VIJAY MANICKAM, SR SOFTWARE ENGINEER F5 Company Snapshot Founded: 1996 IPO: June 1999 Employees: 4,395 Headquarters: Seattle,
More informationSNIA NVM Programming Model Workgroup Update. #OFADevWorkshop
SNIA NVM Programming Model Workgroup Update #OFADevWorkshop Persistent Memory (PM) Vision Fast Like Memory PM Brings Storage PM Durable Like Storage To Memory Slots 2 Latency Thresholds Cause Disruption
More informationUnified Communication X (UCX)
Unified Communication X (UCX) Pavel Shamis / Pasha ARM Research SC 18 UCF Consortium Mission: Collaboration between industry, laboratories, and academia to create production grade communication frameworks
More informationHigh-Throughput and Low-Latency Network Communication with NetIO
High-Throughput and Low-Latency Network Communication with NetIO Jörn Schumacher 1,2, Christian Plessl 2 and Wainer Vandelli 1 1 CERN, Geneva, Switzerland 2 Paderborn University, Germany E-mail: jorn.schumacher@cern.ch
More informationAn FPGA-Based Optical IOH Architecture for Embedded System
An FPGA-Based Optical IOH Architecture for Embedded System Saravana.S Assistant Professor, Bharath University, Chennai 600073, India Abstract Data traffic has tremendously increased and is still increasing
More informationTLDK Overview. Transport Layer Development Kit Keith Wiles April Contributions from Ray Kinsella & Konstantin Ananyev
TLDK Overview Transport Layer Development Kit Keith Wiles April 2017 Contributions from Ray Kinsella & Konstantin Ananyev Notices and Disclaimers Intel technologies features and benefits depend on system
More informationINT G bit TCP Offload Engine SOC
INT 10011 10 G bit TCP Offload Engine SOC Product brief, features and benefits summary: Highly customizable hardware IP block. Easily portable to ASIC flow, Xilinx/Altera FPGAs or Structured ASIC flow.
More informationMellanox HPC-X Scalable Software Toolkit README
Mellanox HPC-X Scalable Software Toolkit README Rev 1.0 www.mellanox.com 2 Mellanox Technologies Document Number: MLNX-15-3871 Table of Contents Table of Contents..........................................................
More informationREMOTE PERSISTENT MEMORY ACCESS WORKLOAD SCENARIOS AND RDMA SEMANTICS
13th ANNUAL WORKSHOP 2017 REMOTE PERSISTENT MEMORY ACCESS WORKLOAD SCENARIOS AND RDMA SEMANTICS Tom Talpey Microsoft [ March 31, 2017 ] OUTLINE Windows Persistent Memory Support A brief summary, for better
More informationMessaging Overview. Introduction. Gen-Z Messaging
Page 1 of 6 Messaging Overview Introduction Gen-Z is a new data access technology that not only enhances memory and data storage solutions, but also provides a framework for both optimized and traditional
More informationInfiniBand* Software Architecture Access Layer High Level Design June 2002
InfiniBand* Software Architecture June 2002 *Other names and brands may be claimed as the property of others. THIS SPECIFICATION IS PROVIDED "AS IS" WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY
More informationChapter 13: I/O Systems
Chapter 13: I/O Systems Chapter 13: I/O Systems I/O Hardware Application I/O Interface Kernel I/O Subsystem Transforming I/O Requests to Hardware Operations Streams Performance 13.2 Silberschatz, Galvin
More informationUsing Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology
Using Industry Standards to Exploit the Advantages and Resolve the Challenges of Multicore Technology September 19, 2007 Markus Levy, EEMBC and Multicore Association Enabling the Multicore Ecosystem Multicore
More informationConcurrent Support of NVMe over RDMA Fabrics and Established Networked Block and File Storage
Concurrent Support of NVMe over RDMA Fabrics and Established Networked Block and File Storage Ásgeir Eiriksson CTO Chelsio Communications Inc. August 2016 1 Introduction API are evolving for optimal use
More informationIn-Network Computing. Sebastian Kalcher, Senior System Engineer HPC. May 2017
In-Network Computing Sebastian Kalcher, Senior System Engineer HPC May 2017 Exponential Data Growth The Need for Intelligent and Faster Interconnect CPU-Centric (Onload) Data-Centric (Offload) Must Wait
More informationThe Non-Volatile Memory Verbs Provider (NVP): Using the OFED Framework to access solid state storage
The Non-Volatile Memory Verbs Provider (NVP): Using the OFED Framework to access solid state storage Bernard Metzler 1, Animesh Trivedi 1, Lars Schneidenbach 2, Michele Franceschini 2, Patrick Stuedi 1,
More informationUpdate on Scalable SA Project
Update on Scalable SA Project Hal Rosenstock Mellanox Technologies #OFADevWorkshop The Problem And The Solution n^2 SA load SA queried for every connection Communication between all nodes creates an n
More informationPerformance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms
Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms Sayantan Sur, Matt Koop, Lei Chai Dhabaleswar K. Panda Network Based Computing Lab, The Ohio State
More informationNetwork Adapter Flow Steering
Network Adapter Flow Steering OFA 2012 Author: Tzahi Oved Date: March 2012 Receive Steering Evolution The traditional Single Ring All ingress traffic to land on a single receive ring Kernel threads / DPC
More informationIndirect Communication
Indirect Communication To do q Today q q Space and time (un)coupling Common techniques q Next time: Overlay networks xkdc Direct coupling communication With R-R, RPC, RMI Space coupled Sender knows the
More informationMemory Management Strategies for Data Serving with RDMA
Memory Management Strategies for Data Serving with RDMA Dennis Dalessandro and Pete Wyckoff (presenting) Ohio Supercomputer Center {dennis,pw}@osc.edu HotI'07 23 August 2007 Motivation Increasing demands
More informationTHE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF
14th ANNUAL WORKSHOP 2018 THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF Paul Luse Intel Corporation Apr 2018 AGENDA Storage Performance Development Kit What is SPDK? The SPDK Community Why are so
More informationInformatix Solutions INFINIBAND OVERVIEW. - Informatix Solutions, Page 1 Version 1.0
INFINIBAND OVERVIEW -, 2010 Page 1 Version 1.0 Why InfiniBand? Open and comprehensive standard with broad vendor support Standard defined by the InfiniBand Trade Association (Sun was a founder member,
More informationRoGUE: RDMA over Generic Unconverged Ethernet
RoGUE: RDMA over Generic Unconverged Ethernet Yanfang Le with Brent Stephens, Arjun Singhvi, Aditya Akella, Mike Swift RDMA Overview RDMA USER KERNEL Zero Copy Application Application Buffer Buffer HARWARE
More informationLow latency, high bandwidth communication. Infiniband and RDMA programming. Bandwidth vs latency. Knut Omang Ifi/Oracle 2 Nov, 2015
Low latency, high bandwidth communication. Infiniband and RDMA programming Knut Omang Ifi/Oracle 2 Nov, 2015 1 Bandwidth vs latency There is an old network saying: Bandwidth problems can be cured with
More informationReliable UDP (RDP) Transport for CORBA
OMG Embedded and Real-Time 2002 Workshop Reliable UDP (RDP) Transport for CORBA Voula Fotopoulos Catherine Heaberlin January 10, 2002 (voula.fotopoulos@lmco.com, catherine.t.heaberlin@lmco.com) Naval Electronics
More informationNFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications
NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications Outline RDMA Motivating trends iwarp NFS over RDMA Overview Chelsio T5 support Performance results 2 Adoption Rate of 40GbE Source: Crehan
More information10-Gigabit iwarp Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G
10-Gigabit iwarp Ethernet: Comparative Performance Analysis with InfiniBand and Myrinet-10G Mohammad J. Rashti and Ahmad Afsahi Queen s University Kingston, ON, Canada 2007 Workshop on Communication Architectures
More informationUnderstanding MPI on Cray XC30
Understanding MPI on Cray XC30 MPICH3 and Cray MPT Cray MPI uses MPICH3 distribution from Argonne Provides a good, robust and feature rich MPI Cray provides enhancements on top of this: low level communication
More information6.9. Communicating to the Outside World: Cluster Networking
6.9 Communicating to the Outside World: Cluster Networking This online section describes the networking hardware and software used to connect the nodes of cluster together. As there are whole books and
More informationManagement Scalability. Author: Todd Rimmer Date: April 2014
Management Scalability Author: Todd Rimmer Date: April 2014 Agenda Projected HPC Scalability Requirements Key Challenges Path Record IPoIB Mgmt Security Partitioning Multicast Notices SA interaction Call
More informationThe SNIA NVM Programming Model. #OFADevWorkshop
The SNIA NVM Programming Model #OFADevWorkshop Opportunities with Next Generation NVM NVMe & STA SNIA 2 NVM Express/SCSI Express: Optimized storage interconnect & driver SNIA NVM Programming TWG: Optimized
More informationMultifunction Networking Adapters
Ethernet s Extreme Makeover: Multifunction Networking Adapters Chuck Hudson Manager, ProLiant Networking Technology Hewlett-Packard 2004 Hewlett-Packard Development Company, L.P. The information contained
More informationMPI on the Cray XC30
MPI on the Cray XC30 Aaron Vose 4/15/2014 Many thanks to Cray s Nick Radcliffe and Nathan Wichmann for slide ideas. Cray MPI. MPI on XC30 - Overview MPI Message Pathways. MPI Environment Variables. Environment
More informationToward a Memory-centric Architecture
Toward a Memory-centric Architecture Martin Fink EVP & Chief Technology Officer Western Digital Corporation August 8, 2017 1 SAFE HARBOR DISCLAIMERS Forward-Looking Statements This presentation contains
More informationPaving the Road to Exascale
Paving the Road to Exascale Gilad Shainer August 2015, MVAPICH User Group (MUG) Meeting The Ever Growing Demand for Performance Performance Terascale Petascale Exascale 1 st Roadrunner 2000 2005 2010 2015
More informationXen Network I/O Performance Analysis and Opportunities for Improvement
Xen Network I/O Performance Analysis and Opportunities for Improvement J. Renato Santos G. (John) Janakiraman Yoshio Turner HP Labs Xen Summit April 17-18, 27 23 Hewlett-Packard Development Company, L.P.
More informationFlexible Architecture Research Machine (FARM)
Flexible Architecture Research Machine (FARM) RAMP Retreat June 25, 2009 Jared Casper, Tayo Oguntebi, Sungpack Hong, Nathan Bronson Christos Kozyrakis, Kunle Olukotun Motivation Why CPUs + FPGAs make sense
More informationNVMf based Integration of Non-volatile Memory in a Distributed System - Lessons learned
14th ANNUAL WORKSHOP 2018 NVMf based Integration of Non-volatile Memory in a Distributed System - Lessons learned Jonas Pfefferle, Bernard Metzler, Patrick Stuedi, Animesh Trivedi and Adrian Schuepbach
More informationAccessing NVM Locally and over RDMA Challenges and Opportunities
Accessing NVM Locally and over RDMA Challenges and Opportunities Wendy Elsasser Megan Grodowitz William Wang MSST - May 2018 Emerging NVM A wide variety of technologies with varied characteristics Address
More informationSMB3 Extensions for Low Latency. Tom Talpey Microsoft May 12, 2016
SMB3 Extensions for Low Latency Tom Talpey Microsoft Problem Statement Storage Class Memory A new, disruptive class of storage Nonvolatile medium with RAM-like performance Low latency, high throughput,
More informationDesigning Next Generation Data-Centers with Advanced Communication Protocols and Systems Services
Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services P. Balaji, K. Vaidyanathan, S. Narravula, H. W. Jin and D. K. Panda Network Based Computing Laboratory
More informationThe Case for RDMA. Jim Pinkerton RDMA Consortium 5/29/2002
The Case for RDMA Jim Pinkerton RDMA Consortium 5/29/2002 Agenda What is the problem? CPU utilization and memory BW bottlenecks Offload technology has failed (many times) RDMA is a proven sol n to the
More informationArchitected for Performance. NVMe over Fabrics. September 20 th, Brandon Hoff, Broadcom.
Architected for Performance NVMe over Fabrics September 20 th, 2017 Brandon Hoff, Broadcom Brandon.Hoff@Broadcom.com Agenda NVMe over Fabrics Update Market Roadmap NVMe-TCP The benefits of NVMe over Fabrics
More information