INT G bit TCP Offload Engine SOC

Similar documents
10 G bit TCP Offload Engine + PCIe/DMA SOC IP

INT 1011 TCP Offload Engine (Full Offload)

1G Bit TCP+UDP Offload Engine (TOE+UOE) Hardware IP Core

10G bit-16k Sess TCP+UDP Offload Engine + MAC + Host_IF. IP Cores for FPGAs and SoCs

25G bit-1k Sess TCP+UDP Offload + Host_IF. IP Cores for FPGAs and SoCs

1G bit TCP Offload Engine SOC IP

10G bit UDP Offload Engine (UOE) MAC+ PCIe SOC IP

10 G Bit TCP+UDP Offload Engine (TOE+UOE) Hardware IP Core

INT-1010 TCP Offload Engine

INT Gbit Ethernet MAC Engine

Programmable Logic Design Grzegorz Budzyń Lecture. 15: Advanced hardware in FPGA structures

FPGA Augmented ASICs: The Time Has Come

The world s most reliable and mature full hardware ultra-low latency TCP, MAC and PCS IP Cores.

Intelop. *As new IP blocks become available, please contact the factory for the latest updated info.

Introduction to TCP/IP Offload Engine (TOE)

The Nios II Family of Configurable Soft-core Processors

6.9. Communicating to the Outside World: Cluster Networking

S2C K7 Prodigy Logic Module Series

RiceNIC. Prototyping Network Interfaces. Jeffrey Shafer Scott Rixner

Jakub Cabal et al. CESNET

100% PACKET CAPTURE. Intelligent FPGA-based Host CPU Offload NIC s & Scalable Platforms. Up to 200Gbps

Single Root I/O Virtualization (SR-IOV) and iscsi Uncompromised Performance for Virtual Server Environments Leonid Grossman Exar Corporation

Product Overview. Programmable Network Cards Network Appliances FPGA IP Cores

Networks-on-Chip Router: Configuration and Implementation

NFS/RDMA over 40Gbps iwarp Wael Noureddine Chelsio Communications

Multifunction Networking Adapters

A Next Generation Home Access Point and Router

Extreme TCP Speed on GbE

5051 & 5052 PCIe Card Overview

QuickSpecs. HP Z 10GbE Dual Port Module. Models

HIGH-PERFORMANCE NETWORKING :: USER-LEVEL NETWORKING :: REMOTE DIRECT MEMORY ACCESS

Intel PRO/1000 PT and PF Quad Port Bypass Server Adapters for In-line Server Appliances

TOE10G-IP with CPU reference design

Co-Design and Co-Verification using a Synchronous Language. Satnam Singh Xilinx Research Labs

1GbEth. Access Switch. 1GbEth. Workgroup Switch. 10MbEth. Figure 1: Enterprise LAN Topology Example

Midterm Exam. Solutions

1GbEth. Access Switch. Workgroup Switch. 10MbEth. Figure 1: Enterprise LAN Topology Example

ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters

NetFPGA Hardware Architecture

LogiCORE IP AXI Ethernet v6.0

STATEFUL TCP/UDP traffic generation and analysis

UDP1G-IP reference design manual

Advanced Computer Networks. End Host Optimization

nxtcp Standard Edition 25G/10G/1G TCP/IP + MAC IP Core for FPGAs nxudp Standard Edition 25G/10G/1G UDP/IP + MAC IP Core for FPGAs

The Convergence of Storage and Server Virtualization Solarflare Communications, Inc.

OpenSMART: Single-cycle Multi-hop NoC Generator in BSV and Chisel

Video capture using GigE Vision with MIL. What is GigE Vision

Hardware Design. University of Pannonia Dept. Of Electrical Engineering and Information Systems. MicroBlaze v.8.10 / v.8.20

PowerPC on NetFPGA CSE 237B. Erik Rubow

Designing Embedded Processors in FPGAs

Solace Message Routers and Cisco Ethernet Switches: Unified Infrastructure for Financial Services Middleware

Presentation_ID. 2002, Cisco Systems, Inc. All rights reserved.

Ethernet Switch. WAN Gateway. Figure 1: Switched LAN Example

TOE1G-IP Multisession Reference design manual Rev May-17

Developing deterministic networking technology for railway applications using TTEthernet software-based end systems

Leveraging HyperTransport for a custom high-performance cluster network

IOV hardware and software plumbing for VMware ESX - journey to uncompromised I/O Virtualization and Convergence.

SoC Design Lecture 11: SoC Bus Architectures. Shaahin Hessabi Department of Computer Engineering Sharif University of Technology

Enabling success from the center of technology. Networking with Xilinx Embedded Processors

TOE10G-IP Core. Core Facts

TOE40G-IP Introduction (Xilinx( Realize 40GbE limit speed!

TOE1G-IP Core. Core Facts

IBM POWER8 100 GigE Adapter Best Practices

Extreme Low Latency 10G Ethernet IP Solution Product Brief (HTK-ELL10G-ETH-FPGA)

Multi-Gigabit Transceivers Getting Started with Xilinx s Rocket I/Os

440GX Application Note

Atacama: An Open Experimental Platform for Mixed-Criticality Networking on Top of Ethernet

OCP Engineering Workshop - Telco

Support for Smart NICs. Ian Pratt

SoC Design. Prof. Dr. Christophe Bobda Institut für Informatik Lehrstuhl für Technische Informatik

CPCI-HPDI32ALT High-speed 64 Bit Parallel Digital I/O PCI Board 100 to 400 Mbytes/s Cable I/O with PCI-DMA engine

Network Interface Architecture and Prototyping for Chip and Cluster Multiprocessors

Implementing Ultra Low Latency Data Center Services with Programmable Logic

LegUp: Accelerating Memcached on Cloud FPGAs

NVMe Direct. Next-Generation Offload Technology. White Paper

Applying the Benefits of Network on a Chip Architecture to FPGA System Design

UDP10G-IP reference design manual

An FPGA-Based Optical IOH Architecture for Embedded System

GigaX API for Zynq SoC

vnetwork Future Direction Howie Xu, VMware R&D November 4, 2008

White Paper The Need for a High-Bandwidth Memory Architecture in Programmable Logic Devices

P51: High Performance Networking

Improving DPDK Performance

by Brian Hausauer, Chief Architect, NetEffect, Inc

The RM9150 and the Fast Device Bus High Speed Interconnect

The CoreConnect Bus Architecture

Topic & Scope. Content: The course gives

Simplify System Complexity

Achieving UFS Host Throughput For System Performance

Next Generation Enterprise Solutions from ARM

CN-100 Network Analyzer Product Overview

An Intelligent NIC Design Xin Song

TOE1G-IP Core. Core Facts

OpenOnload. Dave Parry VP of Engineering Steve Pope CTO Dave Riddoch Chief Software Architect

Mike Anderson. TCP/IP in Embedded Systems. CTO/Chief Scientist The PTR Group, Inc.

Module 2 Storage Network Architecture

Zynq-7000 All Programmable SoC Product Overview

Linux Network Tuning Guide for AMD EPYC Processor Based Servers

Simplify System Complexity

Qsys and IP Core Integration

Transcription:

INT 10011 10 G bit TCP Offload Engine SOC Product brief, features and benefits summary: Highly customizable hardware IP block. Easily portable to ASIC flow, Xilinx/Altera FPGAs or Structured ASIC flow. Provides lowest latency and highest bandwidth INT 10011 is the only SOC that integrates 10G TOE + 10 GEMAC + PLB interfaces in the smallest logic footprint. It is highly flexible that is customizable for layer-3, layer 4-7 network infrastructure and network security systems applications. It is recommended for use in, among others, high performance Servers, NICs, SAN/NAS and data center equipment design applications. It provides key IP building blocks for very high performance 10-Giga bit Ethernet ASIC/ASSP/FPGAs. INT 10011 has built in advanced architectural flexibility that provides capability for enterprises to differentiate their Network security and Network infrastructure appliances from others and customize them for their specific design application. INT 10011 can process TCP/IP sessions as client/server in mixed session mode and other protocols for Network equipment and in-line network security appliances, simultaneously, at 10-G-bit rate. This relieves the host CPU from costly TCP/IP software related session setup/tear down, data copying and maintenance tasks thereby delivering 4x to 8x TCP/IP network performance improvement when compared with TCP/IP software. Intilop offers a wide range of TOE processing hardware cores for 10-GE to 1-GE applications using PCI Express or embedded system interfaces. TOE products support full TCP offload as well as conventional NIC mode operation (in TCP Bypass Mode) and feature advanced software support (optional) where applications need no modification to take advantage of TOE acceleration. As an option, it provides easy-to-use frameworks for utilizing the Xilinx Virtex-5/6 and PCIe hardcore enabling rapid and efficient system application development. The 10 G Bit TOE is based upon the proven and mature patent pending 1 G bit TOE architecture from Intilop corporation. The same architecture is scalable to 40 G bit. Intilop Corporation www.intilop.com email: info@intilop.com 1

TOE s design versions- Generic TOE for Network infrastructure design applications: a) 4 Session with Payload FIFO of 8/16 K Bytes. b) 16 Session with Payload FIFO of 16/32 K Bytes c) 64 Session with scalable FIFO of 64K/256K bytes. d) 65+ Sessions depend upon onchip memory or external DDRx (available upon request). e) Optional Very high performance DMA blocks also available to integrate with high performance PCIe Gen 2 interface. TOE with enhanced Security features (available upon request) a) All of the options available in Generic TOE plus; i. Protocol filter block can selectively direct traffic for any known application level protocol to any selected MAC port; e.g. all IM/chat traffic, SMTP (email), Web(http) traffic, VoIP etc. can be filtered and directed to selected ports. ii. iii. iv. IP and Port number filter block Specific IP and Port Filtered traffic routed to optional selected MAC interface/s or PCIe interface or Memory interface directly at line rate without CPU involvement. MAC Filter block, traffic routed to any of the selected interfaces Benefits of Intilop TOE: Featuring APIs at different levels the General TOE allows the application developer to easily migrate from software, to TOE hardware, to custom hardware, to achieve higher performance. Advantages and benefits: 10G application to application throughput Very low application to application latency Software compatibility Scalable solution; 1G, 10G 40G APIs - NIC mode TOE Socket API HW API performance - Latency through 10 G TOE = less than 300 ns Network applications use the Socket API. Typically OS implements the Socket API with a software stack. In its basic mode the Intilop TOE implements a standard Hardware API which allows next higher level applications to fully take advantage of TOEs complete benefits. Intilop Corporation www.intilop.com email: info@intilop.com 2

Optionally, to achieve higher performance, Intilop can implement an equivalent Socket API named TOE Socket API which enables plug and play acceleration through a simple intercept of standard Socket calls. Hardware API: Enables dedicated processing in the FPGA for application specific acceleration Ideal for Very high performance specialized, differentiable ASICs or FPGAs for Network security or Network infrastructure applications Fully verified using comprehensive verification methdology for ASIC ports and Network system tested core. Smallest logic foot print; less than 20,000 Xilinx slices, Altera ALMs or 250,000 ASIC gates + on-chip memory Sliding Window mechanism implemented in hardware allowing total Flow Control Slow start transfer control in hardware Customizable for IP-protocol only. Non-TCP Bypass mode lets all Non TCP/IP related traffic go directly to host interface via user_fifo for TCP/IP software to handle Fully integrated 10 G bit high performance Ethernet MAC. Scalable MAC Rx FIFOs and Tx FIFOs make it ideal for optimizing system performance. Hardware implementation of TCP/IP stacks control plane and data plane. Hardware implementation of ARP protocol processing. Can be deployed behind a gateway which will respond to Gateway-IP request as opposed to ARP request (optional) On-chip DDR or SSRAM memory controller which can address from 4K Bytes to 4 MB Bytes on chip or 256 MB off chip memories (optional) Simple User Side interface for easy hardware integration or a little more complicated for more power full and controlled Streaming data transfers. Extended ARP table creation, deletion management (optional) Adheres to RFCs; 793, 1500, 1700, 813, 791, 2001 Many trade-offs for some functions performed in hardware or software Configurable Packet buffers, session table buffers On-chip or Off-chip memories, Intilop Corporation www.intilop.com email: info@intilop.com 3

attached DDR I/II interface. Depending on system, performance, ASIC/FPGA size requirements-> User Customizable, (optional) Interfaces directly to XGMII, 10 G Bit serial interfaces Architecture can be scaled up to 40-G bits Customizable to handle jumbo frames Integrated PLB interface (Xilinx) Integrated AMBA 2.0 interface or MIPs CPU bus for Local Processor control. (opt) User programmable/ prioritize-able interrupts Performs connection/session management Monitors, Stores, Maintains and processes up to 1024 live TCP sessions. Customizable to implement more, depending upon on-chip memory availability and other FPGA limitations. Extendable to 4K TCP sessions. Internal Memory dependent. TCP + IP check sum generation and check performed in hardware in less than 6 clks (30 ns at 200 MHz) vs 1-2 us by typical software TCP-stack Connection set up and tear down/termination without CPU involvement. User programmable Session table parameters Dedicated set of hardware Timers for each TCP/IP session (opt) or customizable for sharing one set of common timers for all stale sessions. Multiple slot storage for fragmented packets. More slots allocated when more Onchip Memory available. Self-checking available memory logic. (optional) Out of sequence packet detection/storage and Reassembly/Segmentation (optional) Direct Data placement in Applications buffer at full wire speed without CPU-> reduces CPU s buffer copy time and utilization by 95% Support VLAN mode (optional) Wire-speed 20-Gbps performance in full duplex Easily customizable for filtering various IP and TCP traffic Protocols, directed towards any port or IP (Ideal for security appliances) Multiple TOEs can process up to 4K connections per second Implements Full TCP/IP offload or By- Pass mode. (Optional) Intilop Corporation www.intilop.com email: info@intilop.com 4

Future Proof- Flexible implementation of TCP Offload Basic mini API available for easy integration with Linux/windows. Others OSs/CPUs also available Accommodates future Specifications changes. 10-GEMAC XGMII XGMII Rx-I/F Tx-I/F (Opt) D D R (Opt) User Data Bus 32/64/128 bit User Data bus 64/128 bit APB/PLB Interface Regs block PCIe x 4, 8, 16 Interface User Control Bus 32/64 bit Host system Simplified Block Diagram Intilop Corporation www.intilop.com email: info@intilop.com 5

Specifications brief: Original 1 G TOE Functionality Proven in multiple networking equipment Complete header and flag processing of TCP/IP sessions in hardware accelerates by 10x 20x TCP Offload Engine- 20-G b/s Wire-speed performance Scalable to 40 G b/s TCP + IP check sum- hardware TCP segmentation/reassembly in hardware Multiple slot storage for fragmented packets (opt) Out of sequence packet detection/storage/reassembly(opt) TCP port address tracking/automatic DMA MAC Address search logic/filter (opt) IP address search logic/filter (opt) Accelerate security processing, Storage Networking- TCP irdma implementation- Direct Data placement in Applications buffer --> reduces CPU utilization by 90+ % Future Proof- Flexible implementation of TCP Offload Accommodates future Specifications changes. AMBA/PLB CPU interface features; Basic transfers Various Transfer types Master/slave Bus Arbitration (optional) Bus slave transactions Bus master transactions (optional) Address decoder Bus Arbiter (optional) System Endianness: Little-Endian Deliverables: - Verilog source code or NetList. - Test Bench,,vcd files, configuration code/api for easy Linux port - Verilog models for various components e.g. TCP/IP Client and Server models, transaction model (optional) - 10-GEMAC - External memory interface/model (optional). - TCP Model (optional) - Verification suite (optional) - Test packet-traffic suite (optional) Intilop Corporation www.intilop.com email: info@intilop.com 6