Intel Stratix 10 Logic Array Blocks and Adaptive Logic Modules User Guide

Similar documents
Intel Arria 10 Native Floating- Point DSP Intel FPGA IP User Guide

Intel Arria 10 Core Fabric and General Purpose I/Os Handbook

2. Stratix II Architecture


Arria 10 Core Fabric and General Purpose I/Os Handbook

Stratix V Device Handbook

Intel Stratix 10 Variable Precision DSP Blocks User Guide

High Bandwidth Memory (HBM2) Interface Intel FPGA IP Design Example User Guide

Chapter 2. Cyclone II Architecture

Interlaken IP Core (2nd Generation) Design Example User Guide

ASMI Parallel II Intel FPGA IP Core User Guide

Arria V Device Handbook

AN 756: Altera GPIO to Altera PHYLite Design Implementation Guidelines

Stratix. Introduction. Features... Programmable Logic Device Family. Preliminary Information

Altera ASMI Parallel II IP Core User Guide

Stratix V Device Handbook

Intel Stratix 10 Analog to Digital Converter User Guide

4. TriMatrix Embedded Memory Blocks in HardCopy IV Devices

Section I. Cyclone II Device Family Data Sheet

Intel Quartus Prime Software Download and Installation Quick Start Guide

Stratix. Introduction. Features... 10,570 to 114,140 LEs; see Table 1. FPGA Family. Preliminary Information

Cyclone V Device Handbook

Intel Cyclone 10 External Memory Interfaces IP Design Example User Guide

External Memory Interfaces Intel Arria 10 FPGA IP Design Example User Guide

Section I. Cyclone II Device Family Data Sheet

Section I. Cyclone II Device Family Data Sheet

Intel Stratix 10 H-tile Hard IP for Ethernet Design Example User Guide

SerialLite III Streaming IP Core Design Example User Guide for Intel Stratix 10 Devices

Intel Acceleration Stack for Intel Xeon CPU with FPGAs 1.0 Errata

SerialLite III Streaming IP Core Design Example User Guide for Intel Arria 10 Devices

AN 834: Developing for the Intel HLS Compiler with an IDE

Section I. Cyclone FPGA Family Data Sheet

Intel Stratix 10 H-Tile PCIe Link Hardware Validation

Section I. Cyclone FPGA Family Data Sheet

Intel MAX 10 User Flash Memory User Guide

Intel Xeon with FPGA IP Asynchronous Core Cache Interface (CCI-P) Shim

AN 464: DFT/IDFT Reference Design

10. Introduction to UniPHY IP

Section I. Cyclone FPGA Family Data Sheet

Customizable Flash Programmer User Guide

Intel Stratix 10 External Memory Interfaces IP Design Example User Guide

Generic Serial Flash Interface Intel FPGA IP Core User Guide

Intel Quartus Prime Pro Edition Software and Device Support Release Notes

Intel Cyclone 10 LP Core Fabric and General Purpose I/Os Handbook

Remote Update Intel FPGA IP User Guide

Simulating the ASMI Block in Your Design

Intel FPGA Voltage Sensor IP Core User Guide

Altera I/O Phase-Locked Loop (Altera IOPLL) IP Core User Guide

Intel MAX 10 FPGA Device Overview

Stratix II Device Handbook, Volume 1

Low Power Design Techniques

Intel Stratix 10 Low Latency 40G Ethernet Design Example User Guide

Early Power Estimator for Intel Stratix 10 FPGAs User Guide

Intel Arria 10 Native Fixed Point DSP IP Core User Guide

Intel Quartus Prime Pro Edition Software and Device Support Release Notes

Stratix II Device Handbook, Volume 1

PCI Express Multi-Channel DMA Interface

AN 839: Design Block Reuse Tutorial

AN 825: Partially Reconfiguring a Design on Intel Stratix 10 GX FPGA Development Board

Memory Optimization for OpenCL on Intel FPGAs Exercise Manual

Intel Quartus Prime Pro Edition Software and Device Support Release Notes

Intel MAX 10 Embedded Memory User Guide

Section I. MAX II Device Family Data Sheet

MAX 10 FPGA Device Overview

MAX 10 FPGA Device Overview

Intel FPGA Temperature Sensor IP Core User Guide

8. Introduction to UniPHY IP

Timing Analyzer Quick-Start Tutorial

Intel Stratix 10 Clocking and PLL User Guide

25G Ethernet Intel Stratix 10 FPGA IP Design Example User Guide

MAX 10 Embedded Memory User Guide

Altera FLEX 8000 Block Diagram

Quick Start Guide for Intel FPGA Development Tools on the Nimbix Cloud

Low Latency 100G Ethernet Intel Stratix 10 FPGA IP Design Example User Guide

11. SEU Mitigation in Stratix IV Devices

AN 825: Partially Reconfiguring a Design on Intel Stratix 10 GX FPGA Development Board

Intel MAX 10 Clocking and PLL User Guide

AN 830: Intel FPGA Triple-Speed Ethernet and On-Board PHY Chip Reference Design

AN 370: Using the Intel FPGA Serial Flash Loader with the Intel Quartus Prime Software

Nios II Custom Instruction User Guide

Intel MAX 10 Clocking and PLL User Guide

AN 830: Intel FPGA Triple-Speed Ethernet and On-Board PHY Chip Reference Design

Intel Stratix 10 High-Speed LVDS I/O User Guide

AN 836: RapidIO II Reference Design for Avalon-ST Pass-Through Interface

Intel Cyclone 10 LP Device Family Pin Connection Guidelines

Intel Quartus Prime Pro Edition

9. Functional Description Example Designs

Section I. Device Core for Arria II Devices

9. SEU Mitigation in Cyclone IV Devices

Nios II Embedded Design Suite Release Notes

Cyclone Device Handbook, Volume 1

AN 818: Static Update Partial Reconfiguration Tutorial

Stratix 10 Serial Flash Mailbox Client Intel FPGA IP Core User Guide

MAX 10 User Flash Memory User Guide

Low Latency 100G Ethernet Design Example User Guide

AN 447: Interfacing Intel FPGA Devices with 3.3/3.0/2.5 V LVTTL/ LVCMOS I/O Systems

EFEC20 IP Core. Features

Cyclone Device Handbook, Volume 1

Altera I/O Phase-Locked Loop (Altera IOPLL) IP Core User Guide

AN 818: Static Update Partial Reconfiguration Tutorial

Transcription:

Intel Stratix 10 Logic Array Blocks and Adaptive Logic Modules User Guide Subscribe Send Feedback Latest document on the web: PDF HTML

Contents Contents 1 Intel Stratix 10 LAB and Overview... 3 2 HyperFlex Register... 4 3 Intel Stratix 10 LAB and Architecture and Features...5 3.1 LAB... 5 3.1.1 MLAB... 5 3.1.2 Local and Direct Link Interconnects... 6 3.1.3 Carry Chain Interconnects... 7 3.1.4 LAB Control Signals...8 3.2...9 3.2.1 Resources... 9 3.2.2 Output...10 3.2.3 Operating Modes... 11 4 Document Revision History for Intel Stratix 10 LAB and User Guide...18 2

1 Intel Stratix 10 LAB and Overview The logic array block (LAB) is composed of basic building blocks known as adaptive logic modules (s). You can configure the LABs to implement logic functions, arithmetic functions, and register functions. You can use a quarter of the available LABs in the Intel Stratix 10 devices as memory LABs (MLABs). Certain devices may have a higher MLAB ratio. The Intel Quartus Prime software and other supported third-party synthesis tools automatically choose the appropriate mode for common functions such as counters, adders, subtractors, and arithmetic functions. Related Links HyperFlex Core Architecture, Intel Stratix 10 Device Overview Provides more information about Hyper-Registers and the HyperFlex core architecture. Hyper-Registers are additional registers available in every interconnect routing segment throughout the core fabric, including the routing segments connected to the LAB inputs and outputs. Intel Corporation. All rights reserved. Intel, the Intel logo, Altera, Arria, Cyclone, Enpirion, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in accordance with Intel's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Intel. Intel customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. *Other names and brands may be claimed as the property of others. ISO 9001:2008 Registered

2 HyperFlex Register The Intel Stratix 10 device family introduces the Intel Hyperflex core architecture. The Intel Stratix 10 LAB contains Intel Hyperflex registers and other features designed to facilitate retiming. Intel Hyperflex registers are available in s and carry chain. As shown in the Stratix 10 Connection Details figure, the Intel Hyperflex registers are located on the synchronous clear and clock enable inputs to increase or reduce effective path delay. All the Intel Hyperflex registers can be enabled and are controlled by the Intel Quartus Prime software during retiming. Intel Corporation. All rights reserved. Intel, the Intel logo, Altera, Arria, Cyclone, Enpirion, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in accordance with Intel's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Intel. Intel customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. *Other names and brands may be claimed as the property of others. ISO 9001:2008 Registered

3 Intel Stratix 10 LAB and Architecture and Features 3.1 LAB The following sections describe the LAB and for Intel Stratix 10 devices. The LABs are configurable logic blocks that consist of a group of logic resources. Each LAB contains dedicated logic for driving control signals to its s. The MLAB is a superset of the LAB and includes all the LAB features. There are a total of 10 s in each LAB, as shown in the LAB and MLAB Structure for Intel Stratix 10 Devices figure. Figure 1. Intel Stratix 10 LAB Structure and Interconnects Overview This figure shows an overview of the Intel Stratix 10 LAB and MLAB structure with the LAB interconnects. C2/C3/C4 C16 Row Interconnects of Variable Speed and Length R24 R10/R4/R2 s Direct-Link Interconnect from Adjacent Block Direct-Link Interconnect from Adjacent Block Direct-Link Interconnect to Adjacent Block Direct-Link Interconnect to Adjacent Block Local Interconnect LAB MLAB Column Interconnects of Local Interconnect is Driven Variable Speed and Length from Either Side by Column Interconnects and LABs, and from Above by Row Interconnects 3.1.1 MLAB Related Links MLAB on page 5 Each MLAB supports a maximum of 640 bits of simple dual-port SRAM. You can configure each in an MLAB as a 32 (depth) x 2 (width) memory block, resulting in a configuration of 32 (depth) x 20 (width) simple dual-port SRAM block. Intel Corporation. All rights reserved. Intel, the Intel logo, Altera, Arria, Cyclone, Enpirion, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in accordance with Intel's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Intel. Intel customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. *Other names and brands may be claimed as the property of others. ISO 9001:2008 Registered

Figure 2. Intel Stratix 10 LAB and MLAB Structure You can use an MLAB as a regular LAB or configure it as a dual-port SRAM. -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 LAB Control Block LAB Control Block You can use an MLAB as a regular LAB or configure it as a dual-port SRAM. -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 -Based-32 x 2 MLAB LAB 3.1.2 Local and Direct Link Interconnects Each LAB can drive out 40 outputs. Two groups of 20 outputs can drive the adjacent LABs directly through direct-link interconnects. The direct link connection feature minimizes the use of row and column interconnects, providing higher performance and flexibility. The local interconnect drives the inputs. outputs, as well as column and row interconnects drive the local interconnect. Neighboring LABs, MLABs, M20K blocks, or digital signal processing (DSP) blocks from the left or right can also drive the LAB's local interconnect using the direct link connection. 6

Figure 3. Intel Stratix 10 LAB Local and Direct Link Interconnects Direct-Link Interconnect from the Left LAB, MLAB, M20K Memory Block, DSP Block, or IOE Output Direct-Link Interconnect from the Right LAB, MLAB, M20K Memory Block, DSP Block, or IOE Output s s Direct-Link Interconnect to Left Direct-Link Interconnect to Right MLAB 3.1.3 Carry Chain Interconnects Local Interconnect LAB There is a dedicated carry chain path between the s. Intel Stratix 10 devices include an enhanced interconnect structure in LABs for routing carry chains for efficient arithmetic functions. These -to- connections bypass the local interconnect. The Intel Hyperflex registers are added to the carry chain to enable flexible retiming across a chain of LABs and the Intel Quartus Prime Compiler automatically takes advantage of these resources to improve utilization and performance. 7

Figure 4. Carry Chain Interconnects Local Interconnect Routing among s in the LAB 1 Local Interconnect 2 3 4 5 Carry Chain Routing to Adjacent 6 7 8 9 10 3.1.4 LAB Control Signals Each LAB supports a single clock to drive the registers in the LAB. The LAB supports two unique clock enable signals, as well as additional clear signals, for the registers. In addition, each LAB control block drives clock signals for the Hyper-Registers. There is a single clock for the Hyper-Registers on the local interconnect, and additional clocks for the Hyper-Registers located at the inputs. The LAB row clocks [5..0] and LAB local interconnects generate the LAB-wide control signals. A low skew clock network distributes global signals to the row clocks [5..0]. The MultiTrack interconnect consists of continuous, performance-optimized routing lines of different lengths and speeds used for routing efficiency. The Intel Quartus Prime Compiler automatically routes critical design paths on faster interconnects to improve design performance and optimizes the device resources. 8

3.1.4.1 Clear Logic Control LAB-wide signals control the logic for the register's clear signal. The register directly supports both a synchronous and an asynchronous clear. Each LAB supports up to two synchronous clear signals and two asynchronous clear signals, provided that the total number of clear signals is no greater than three. Intel Stratix 10 devices provide a device-wide reset pin (DEV_CLRn) that resets all the registers in the device. You can enable the DEV_CLRn pin in the Intel Quartus Prime software before compilation. The device-wide reset signal overrides all other control signals. Figure 5. Intel Stratix 10 LAB-Wide Control Signals Dedicated Lane LAB Clocks 6 6 6 Local Interconnect synclr0 labclk labclkena0 labclkena1 labclr0/ hyperflex_register_clk hyperflex_register_clk synclr1 labclr1 3.2 The following sections cover the resources, output, and operating modes. 3.2.1 Resources Each contains a variety of -based resources that can be divided between two combinational adaptive s (As), a two-bits full adder, and four registers. 9

With up to eight inputs for the two combinational As, one can implement various combinations of two functions. This adaptability allows an to be completely backward-compatible with four input architectures. One can also implement a subset of eight input functions. One contains four programmable registers. Each register has the following ports: Data in Data out Clock Clock enable Synchronous clear Asynchronous clear Global signals, general-purpose I/O (GPIO) pins, or any internal logic can drive the clock enable signal, clock, and asynchronous or synchronous clear control signals of an register. The clock enable signal has priority over synchronous reset signal. For combinational functions, the registers are bypassed and the output of the look-up table () and adders drives directly to the outputs of an. Figure 6. Intel Stratix 10 High-Level Block Diagram Combinational/ Memory A0 carry_in labclk 6 output 5 output adder0 reg0 6 output reg1 To General Routing datag datah 5 output adder1 reg2 Combinational/ Memory A1 carry_out reg3 3.2.2 Output The general routing outputs in each drive the local, row, and column routing resources. Four outputs can drive column, row, or direct link routing connections. 10

The, adder, or register output can drive the outputs. Both the or adder and the register can drive out of the simultaneously. Register packing improves device utilization by allowing unrelated register and combinational logic to be packed into a single. Another mechanism to improve fitting is to allow the register output to feed back into the of the same so that the register is packed with its own fan-out. The can also drive out registered and unregistered versions of the or adder output. The following figure shows the Intel Stratix 10 connectivity. In the Intel Quartus Prime Resource Property Editor, the entire connection is simplified. Some routing will be routed internally by the Intel Quartus Prime software. Figure 7. Intel Stratix 10 Connection Details carry_in synclr synclr labclkena labclkena labclk labclr 4 aclr D Q 4 aclr D Q To General Routing 4 aclr D Q datag 4 aclr D Q datah Legend: HyperFlex Registers carry_out 3.2.3 Operating Modes The Intel Stratix 10 operates in any of the following modes: Normal mode Extended mode Arithmetic mode 3.2.3.1 Normal Mode Normal mode allows two functions to be implemented in one Intel Stratix 10, or a single function of up to six inputs. Up to eight data inputs from the LAB local interconnect are inputs to the combinational logic. 11

The can support certain combinations of completely independent functions and various combinations of functions that have common inputs. The Intel Quartus Prime Compiler automatically selects the inputs to the. s in normal mode support register packing. The following figure shows a combination of different input connections for the mode. In your design, the Intel Quartus Prime software may assign different input namings during compilation. Figure 8. in Normal Mode datag datah 4-Input 4-Input combout0 combout1 datag datah 5-Input 5-Input combout0 combout1 datag datah 5-Input 3-Input combout0 combout1 6-Input combout0 5-Input combout0 datag datah 4-Input combout1 Combinations of functions with fewer inputs than those shown are also supported. For example, combinations of functions with the following number of inputs are supported. 4 and 3 3 and 3 3 and 2 5 and 2 12

For the packing of two 5-input functions into one, the functions must have at least two common inputs. The common inputs are and. The combination of a 4-input function with a 5-input function requires one common input (either or ). In a sparsely used device, functions that could be placed in one may be implemented in separate s by the Intel Quartus Prime software to achieve the best possible performance. As a device begins to fill up, the Intel Quartus Prime software automatically uses the full potential of the Intel Stratix 10. The Intel Quartus Prime Compiler automatically searches for functions using common inputs or completely independent functions to be placed in one to make efficient use of device resources. In addition, you can manually control resource use by setting location assignments. Figure 9. 6-Input Mode Function in Normal Mode labclk 6-Input reg2 To General Routing 13

Figure 10. 3-Input Mode Function in Normal Mode and are available for register packing. labclk reg0 3-Input reg1 To General Routing reg2 datag datah 3-Input reg3 Note: The 3- atom's inputs are named as,, and regardless of which physical data input that the Intel Quartus Prime software selects to map it to. You can implement any three to six input function using the following inputs: datag datah and whereby and are shared across both s to provide flexibility to implement a different function in each. 14

Both and inputs support the register packing feature. If you enable the register packing feature, both and inputs or either one of the inputs bypass the and directly feed into the register, depending on the packed register mode used. For Intel Stratix 10 devices, the following types of packed register modes are supported: 5-input with 1 packed register path Two 3-input s with 2 packed register paths The 3-input with 2 packed register paths is illustrated in the 3-Input Mode Function in Normal Mode figure. For Intel Stratix 10 devices, the 6-input mode does not support the register packing feature. 3.2.3.2 Extended Mode Figure 11. Supported 8-Input Functions in the Extended Mode datag datah Extended labclk reg2 To General Routing Certain 8-input functions can be implemented in a single using all the inputs: datag datah In the 8-input extended mode, the packed register mode is supported, provided that the packed register shares a or input with the 8-input. 15

3.2.3.3 Arithmetic Mode The in arithmetic mode uses two sets of two 4-input s along with two dedicated full adders. The dedicated adders allow the s to perform pre-adder logic. Therefore, each adder can add the output of two 4-input functions. Arithmetic mode also offers clock enable, counter enable, synchronous up and down control, add and subtract control, and synchronous clear. The clear and clock enable options are LAB-wide signals that affect all registers in the LAB. You can individually disable or enable these signals for each register. The Intel Quartus Prime software automatically places any registers that are not used by the counter into other LABs. Figure 12. Intel Stratix 10 in Arithmetic Mode 4-Input carry_in labclk reg0 4-Input datag 4-Input reg2 To General Routing 4-Input carry_out 3.2.3.3.1 Carry Chain The carry chain provides a fast carry function between the dedicated adders in the arithmetic mode. The 2-bit carry select feature in Intel Stratix 10 devices splits the propagation delay of carry chains with the. Carry chains can begin in either the first or the sixth in a LAB. The final carry-out signal is routed to an, where it is fed to local, row, or column interconnects. To avoid routing congestion in one small area of the device when a high fan-in arithmetic function is implemented, the LAB can support carry chains that only use the bottom half of the LAB before connecting to the next LAB. You can use the available top half of the s in the LAB to implement narrower fan-in functions in the normal 16

mode. Carry chains that use the bottom five s in the first LAB carry into the bottom half of the s in the next LAB within the column. The behavior is the same for both the LAB and the MLAB columns. The Intel Quartus Prime Compiler creates carry chains longer than 20 As (10 s in arithmetic) by linking LABs together. For an enhanced fitting, a long carry chain runs vertically, allowing fast horizontal connections to the TriMatrix memory and DSP blocks. 17

4 Document Revision History for Intel Stratix 10 LAB and User Guide Date Version Changes November 2017 2017.11.06 Added the 6-Input Mode Function in Normal Mode figure. October 2016 2016.10.31 Initial release. Update the Intel Stratix 10 LAB and Overview section. Updated the Carry Chain section. Updated the Intel Stratix 10 High-Level Block Diagram figure. Updated the hyperflex registers in the Intel Stratix 10 Connection Details figure. Intel Corporation. All rights reserved. Intel, the Intel logo, Altera, Arria, Cyclone, Enpirion, MAX, Nios, Quartus and Stratix words and logos are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. Intel warrants performance of its FPGA and semiconductor products to current specifications in accordance with Intel's standard warranty, but reserves the right to make changes to any products and services at any time without notice. Intel assumes no responsibility or liability arising out of the application or use of any information, product, or service described herein except as expressly agreed to in writing by Intel. Intel customers are advised to obtain the latest version of device specifications before relying on any published information and before placing orders for products or services. *Other names and brands may be claimed as the property of others. ISO 9001:2008 Registered