Implementation of SCN Based Content Addressable Memory

Similar documents
Content Addressable Memory Architecture based on Sparse Clustered Networks

DESIGN OF PARAMETER EXTRACTOR IN LOW POWER PRECOMPUTATION BASED CONTENT ADDRESSABLE MEMORY

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume VI /Issue 4 / AUG 2016

Low Power using Match-Line Sensing in Content Addressable Memory S. Nachimuthu, S. Ramesh 1 Department of Electrical and Electronics Engineering,

POWER REDUCTION IN CONTENT ADDRESSABLE MEMORY

Design of low power Precharge-Free CAM design using parity based mechanism

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume 9 /Issue 5 / JAN 2018

A Survey on Design of Content Addressable Memories

Survey on Content Addressable Memory and Sparse Clustered Network

Power Gated Match Line Sensing Content Addressable Memory

Optimized CAM Design

International Journal of Advance Engineering and Research Development LOW POWER AND HIGH PERFORMANCE MSML DESIGN FOR CAM USE OF MODIFIED XNOR CELL

Content Addressable Memory Using Automatic Charge Balancing with Self-Control Mechanism and Master-Slave Match Line Design

Content Addressable Memory performance Analysis using NAND Structure FinFET

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

Design of high speed low power Content Addressable Memory (CAM) using parity bit and gated power matchline sensing

Resource Efficient Multi Ported Sram Based Ternary Content Addressable Memory

Research Scholar, Chandigarh Engineering College, Landran (Mohali), 2

Content Addressable Memory with Efficient Power Consumption and Throughput

Design of Read and Write Operations for 6t Sram Cell

Unique Journal of Engineering and Advanced Sciences Available online: Research Article

Ternary Content Addressable Memory Types And Matchline Schemes

High Performance and Energy Efficient SRAM-based Architecture for TCAM

Filter-Based Dual-Voltage Architecture for Low-Power Long-Word TCAM Design

Implementation of Efficient Ternary Content Addressable Memory by Using Butterfly Technique

Area Efficient Z-TCAM for Network Applications

DESIGN ANDIMPLEMENTATION OF EFFICIENT TERNARY CONTENT ADDRESSABLE MEMORY

[Kalyani*, 4.(9): September, 2015] ISSN: (I2OR), Publication Impact Factor: 3.785

Index Terms- Field Programmable Gate Array, Content Addressable memory, Intrusion Detection system.

Efficient VLSI Huffman encoder implementation and its application in high rate serial data encoding

A Low Power Content Addressable Memory Implemented In Deep Submicron Technology

Design of a High Speed CAVLC Encoder and Decoder with Parallel Data Path

Low Complexity Architecture for Max* Operator of Log-MAP Turbo Decoder

An Energy Improvement in Cache System by Using Write Through Policy

Column decoder using PTL for memory

Performance Analysis and Designing 16 Bit Sram Memory Chip Using XILINX Tool

VLSI Implementation of Daubechies Wavelet Filter for Image Compression

Analysis and Design of Low Voltage Low Noise LVDS Receiver

DESIGN AND SIMULATION OF 1 BIT ARITHMETIC LOGIC UNIT DESIGN USING PASS-TRANSISTOR LOGIC FAMILIES

6T- SRAM for Low Power Consumption. Professor, Dept. of ExTC, PRMIT &R, Badnera, Amravati, Maharashtra, India 1

Novel Hybrid k-d-apriori Algorithm for Web Usage Mining

A High Performance Reconfigurable Data Path Architecture For Flexible Accelerator

Design & Analysis of 16 bit RISC Processor Using low Power Pipelining

A LOSSLESS INDEX CODING ALGORITHM AND VLSI DESIGN FOR VECTOR QUANTIZATION

Design of Low Power Wide Gates used in Register File and Tag Comparator

Analysis of Power Dissipation and Delay in 6T and 8T SRAM Using Tanner Tool

CONTENT ADDRESSABLE MEMORY WITH EFFICIENT POWER OPTIMIZATION OF SPEED

Implementation of FM0 and Manchester Encoder for Efficient Hardware Utilization

Gated-Demultiplexer Tree Buffer for Low Power Using Clock Tree Based Gated Driver

FPGA Based Design and Simulation of 32- Point FFT Through Radix-2 DIT Algorith

A Low Power DDR SDRAM Controller Design P.Anup, R.Ramana Reddy

Power Reduction Techniques in the Memory System. Typical Memory Hierarchy

Design and Simulation of Power Optimized 8 Bit Arithmetic Unit using Gating Techniques in Cadence 90nm Technology

Efficient design and FPGA implementation of JPEG encoder

8Kb Logic Compatible DRAM based Memory Design for Low Power Systems

Modeling and Comparative Analysis of Logic Gates for Adder and Multiplier Applications -A VLSI based approach

STUDY OF SRAM AND ITS LOW POWER TECHNIQUES

Performance Evolution of DDR3 SDRAM Controller for Communication Networks

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume 9 /Issue 3 / OCT 2017

DYNAMIC CIRCUIT TECHNIQUE FOR LOW- POWER MICROPROCESSORS Kuruva Hanumantha Rao 1 (M.tech)

DESIGN AND IMPLEMENTATION ARCHITECTURE FOR RELIABLE ROUTER RKT SWITCH IN NOC

Critical-Path Realization and Implementation of the LMS Adaptive Algorithm Using Verilog-HDL and Cadence-Tool

High Performance Interconnect and NoC Router Design

Viterbi Algorithm for error detection and correction

Simulation and Analysis of SRAM Cell Structures at 90nm Technology

Analysis of 8T SRAM Cell Using Leakage Reduction Technique

Near Optimal Repair Rate Built-in Redundancy Analysis with Very Small Hardware Overhead

Efficient Crosstalk Avoidance Using Modified Redundant Fibonacci Code (MRFC) in VLSI Interconnects

of Soft Core Processor Clock Synchronization DDR Controller and SDRAM by Using RISC Architecture

ISSN Vol.05,Issue.09, September-2017, Pages:

Three DIMENSIONAL-CHIPS

IMPLEMENTATION OF A BIT ERROR RATE TESTER OF A WIRELESS COMMUNICATION SYSTEM ON AN FPGA

Implementation of A Optimized Systolic Array Architecture for FSBMA using FPGA for Real-time Applications

Analysis of 8T SRAM with Read and Write Assist Schemes (UDVS) In 45nm CMOS Technology

Complex test pattern generation for high speed fault diagnosis in Embedded SRAM

A Low-Power Field Programmable VLSI Based on Autonomous Fine-Grain Power Gating Technique

Designing and Characterization of koggestone, Sparse Kogge stone, Spanning tree and Brentkung Adders

A Low Power 32 Bit CMOS ROM Using a Novel ATD Circuit

Pak. J. Biotechnol. Vol. 14 (Special Issue II) Pp (2017) Keerthiga D.S. and S. Bhavani

Survey on Stability of Low Power SRAM Bit Cells

Fast frame memory access method for H.264/AVC

Optimization of Task Scheduling and Memory Partitioning for Multiprocessor System on Chip

DESIGN AND PERFORMANCE ANALYSIS OF CARRY SELECT ADDER

/$ IEEE

Reducing Cache Energy in Embedded Processors Using Early Tag Access and Tag Overflow Buffer

EFFICIENT MEMORY BUILT - IN SELF TEST FOR EMBEDDED SRAM USING PA ALGORITHM

Performance Analysis of Gray Code based Structured Regular Column-Weight Two LDPC Codes

Design and Implementation of VLSI 8 Bit Systolic Array Multiplier

32-bit Signed and Unsigned Advanced Modified Booth Multiplication using Radix-4 Encoding Algorithm

A MULTIBANK MEMORY-BASED VLSI ARCHITECTURE OF DIGITAL VIDEO BROADCASTING SYMBOL DEINTERLEAVER

Design of 8 bit Pipelined Adder using Xilinx ISE

250nm Technology Based Low Power SRAM Memory

DUE to the high computational complexity and real-time

Tree-Based Minimization of TCAM Entries for Packet Classification

ISSN: [Bilani* et al.,7(2): February, 2018] Impact Factor: 5.164

A Low Power Asynchronous FPGA with Autonomous Fine Grain Power Gating and LEDR Encoding

Fused Floating Point Arithmetic Unit for Radix 2 FFT Implementation

DETECTION AND CORRECTION OF CELL UPSETS USING MODIFIED DECIMAL MATRIX

Design For High Performance Flexray Protocol For Fpga Based System

ISSN: [Garade* et al., 6(1): January, 2017] Impact Factor: 4.116

Transcription:

IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 12, Issue 4, Ver. II (Jul.-Aug. 2017), PP 48-52 www.iosrjournals.org Implementation of SCN Based Content Addressable Memory * Shruthi H Shetty 1, Ashwath Rao 1, Megha N 1 1 (Dept. Electronics & Communication Engineering, Sahyadri College of Engineering& Management, India) Corresponding Author: Shruthi H Shetty Abstract: Most memory devices store and retrieve data by addressing specific memory locations. As a result, this path often becomes the limiting factor for systems that rely on fast memory accesses. CAM is introduced employing a new mechanism for associativity between the input tags and the corresponding address of the output data. The proposed CAM is based on a recently developed sparse clustered network (SCN) that onaverage will eliminate most of the parallel comparisons performed during a search. SCN-CAM is suitable for low-power applications, where frequent and parallel look-up operations are required. In this paper the SCN- CAM is implemented in HDL Keywords: Content Addressable Memory, CAM cell, Sparse Clustered Networks ----------------------------------------------------------------------------------------------------------------------------- ---------- Date of Submission: 15-07-2017 Date of acceptance: 26-07-2017 ----------------------------------------------------------------------------------------------------------------------------- ---------- I. Introduction A CAM is a new technology memory device that accelerates any application requiring fast searches. It is a special type of memory structure that can be accessed using its contents rather than an explicit address. Such memory structures are used in diverse applications ranging from branch prediction in a processor to complex pattern recognition; it is particularly well suited to perform any kind of search operations. CAM is also known as associative memory, associative storage or associative array. Unlike standard computer memory (RAM) in which user supplies a memory address and the RAM returns the data word stored at the address, a CAM searches its entire memory to see if that data word is stored anywhere in it. If the data is found, the CAM returns a list of one or more storage addresses where the word was found. In order to access a particular entry in memories, a search data word is compared against previously stored entries in parallel to find a match. Each stored entry is associated with a tag that is used in the comparison process. Once a search data word is applied to the input of a CAM, the matching data word is retrieved within a single clock cycle if it exists. This prominent feature makes CAM a promising candidate for applications where frequent and fast look-up operations are required, such as in translation look-aside buffers (TLBs), network routers, database accelerators, image processing, parametric curve extraction, Hough transformation, Huffman coding/decoding and image coding. Due to the frequent and parallel search operations, CAMs consume a significant amount of energy. The main research objective has been focused on reducing the energy consumption. A new family of associative memories based on sparse clustered networks (SCNs), reducing the unnecessary parallel computation in CAM array. Such memories make it possible to store many short messages instead of few long ones as in the conventional Hopfield networks with significant lower level of computational complexity. The CAM is divided into binary and Ternary CAM. The CAM throughout in this paper refers to Binary CAM. Fig1. Simple CAM Cell The individual CAM cell is as shown in Fig1. The CAM cell operation is divided as bit storage (usually a RAM memory that uses SRAM cell) and bit comparison is unique operation CAM (that uses XNOR logic). The simple block diagram of CAM cell is as shown in the Fig 2. It consists of a search data register that stores the search data (input data), Match Line Search Amplifiers (MLSAs) and an Encoder. CAM performs three modes of operation Read (R), Write (W) and Search (S) operation. Hence the cell is fed with R, W, S three enable signals respectively. The CAM Read/ Write operation is same as the traditional memory. The Search DOI: 10.9790/2834-1204024852 www.iosrjournals.org 48 Page

operation i.e. it performs bit comparison, it compares the each bit of search data with the data input that is present in the CAM. Fig2. Simple Block Diagram of CAM II. SCN-Cam Overview 2.1 CAM Review The Fig. 3 shows CAM array consisting of 4 words. There is a match-line corresponding to each word (ML 0, ML 1, etc.) feeding into match line sense amplifiers (MLSAs), and there is a differential search line pair corresponding to each bit of the search word (SL 0, SL0, SL 1, SL1, etc.). CAM search operation begins with loading the search-data word into the search-data registers followed by precharging all match lines high, putting them all temporarily in the match state. Next, the search line drivers broadcast the search word onto the differential search lines, and each CAM core cell compares its stored bit against the bit on its corresponding search lines. Match lines on which all bits match remain in the precharged-high state. The MLSA then detects whether its match line has a matching condition or miss condition. Finally, the encoder maps the match line of the matching location to its encoded address. Fig3. Conceptual representation of CAM array 2.2 SCN-CAM A new family of associative memory based on sparse clustered network is introduced such that these memories make it possible to store many short messages instead of few long ones, with significant lower level of computational complexity. The proposed architecture (SCN-CAM) consists of an SCN-based classifier coupled to a CAM-array. The CAM array is divided into several equally sized sub-blocks, which can be activated independently. For a trained network and given an input tag, the classifier only uses a small portion of the tag and predicts very few sub-blocks of the CAM to be activated. Once the sub-blocks are activated, the tag is compared against the few entries in them while keeping the rest deactivated and thus lowers the dynamic energy dissipation. This reduces the number of comparisons to only one in average, SCN-CAM uses only a portion of the actual tag to create or recover the association with the corresponding output. SCN-based classifier is for either of the two situations: training or decoding. The values are set during the training process, and are stored in a memory module such that they can later be used to retrieve the address of the target data. When an update is requested in SCN-CAM, retraining the entire SCN-based classifier with the entries is not required. The new entry can therefore be added by adding new connections while keeping the previous connections. Once an input tag is presented to the SCN-based classifier, it predicts which CAM sub-block(s) need to be compareenabled and thus saves the dynamic power by disabling the rest. DOI: 10.9790/2834-1204024852 www.iosrjournals.org 49 Page

Fig4. Block diagram of SCN-CAM III. Simulation Results In this paper the SCN-CAM is implemented using HDL in Xilinx ISE. SCN-CAM employs an SCNbased classifier, which is connected to several independently compare-enabled CAM sub-blocks, some of which are enabled once a tag is presented to the SCN-based classifier. Fig 5 shows the Design flow of system. Once the input is given to SCN classifier, it predicts which CAM block has to be selected from the array. The classifier generates the compare- enable signals to array such that it avoids unnecessary search operations thus reducing the dynamic power. Fig5. Design Flow of SCN-CAM The content addressable memory compare input search data against stored data and return address of matched data indicating match signal bit high as shown in Fig7. The power and timing delay is also analyzed. Fig 6. RTL of SCN-CAM DOI: 10.9790/2834-1204024852 www.iosrjournals.org 50 Page

Table 1 Timing summary of SCN-CAM Timing summary Maximum frequency 1290.073MHz Clock period 0.775ns Maximum combinational path delay 0.930ns Delay 0.775ns Implementation of SCN Based Content Addressable Memory Table 2 Power Analysis of SCN-CAM Device On-Chip Power(W) Family Vertex6 Clocks 0.011 Part Xc6vcx75t Logic 0.003 Package ff484 Signal 0.002 Temp Grade Commercial IO 0.004 Process Typical Leakage 1.291 Speed Grade -2 Total 1.312 Fig7. SCN-CAM Waveform IV. Conclusion For any application that requires a fast memory search, CAM can provide a solution. CAM can be used to speed up any applications ranging from local-area networks, database management, file-storage management, pattern recognition, artificial intelligence, fully associative and processor-specific cache memories, and disk cache memories. Although CAM has many applications, it is particularly well suited to perform any kind of search operations. SCN-CAM employs an SCN-based classifier, which is connected to several independently compare-enabled CAM sub-blocks, some of which are enabled once a tag is presented to the SCN-based classifier. SCN-CAM eliminates most of the comparison operations, thus SCN-CAM is suitable for low-power applications, where frequent and parallel look-up operations are required. References [1] Hooman Jarollahi, Vincent Gripon, Naoya Onizawa and Warren J. Gross, Algorithm and Architecture for a Low-Power Content-Addressable Memory Based on Sparse Clustered Networks IEEE Transactions On Very Large Scale Integration (VLSI) Systems, Vol. 23, No. 4, April 2015 [2] Dejan Georgiev, Low Power Concept For Content Addressable Memory (Cam) Chip Design, International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering Vol. 2, Issue 7, July 2013 [3] Kostas Pagiamtzis and Ali Sheikholeslami, Content-Addressable Memory (CAM) Circuits and Architectures: A Tutorial and Survey, IEEE Journal Of Solid-state Circuits, Vol. 41, No. 3, March 2006 [4] Shanq-Jang Ruan, Chi-Yu Wu, and Jui-Yuan Hsieh, Low Power Design of Precomputation-Based Content-Addressable Memory, IEEE Transactions On Very Large Scale Integration (VLSI) Systems, Vol. 16, No. 3, March 2008 [5] A. Agarwal et al., A 128 128 b high-speed wide-and match-line content addressable memory in 32 nm CMOS, in Proc. ESSCIRC, pp. 83 86, Sep. 2011. [6] Y.-J. Chang and M.-F. Lan, Two new techniques integrated for energy efficient TLB design, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 15, no. 1, pp. 13 23, Jan. 2007. [7] H. Miyatake, M. Tanaka, and Y. Mori, A design for high-speed low power CMOS fully parallel content-addressable memory macros, IEEE J. Solid-State Circuits, vol. 36, no. 6, pp. 956 968, Jun. 2001. DOI: 10.9790/2834-1204024852 www.iosrjournals.org 51 Page

[8] S. Panchanathan and M. Goldberg, A content-addressable memory architecture for image coding using vector quantization, IEEE Trans. Signal Process., vol. 39, no. 9, pp. 2066 2078, Sep. 1991. [9] Y.-J. Chang and Y.-H. Liao, Hybrid-type CAM design for both power and performance efficiency, IEEETrans. Very Large Scale Integr. (VLSI) Syst., vol. 16, no. 8, pp. 965 974, Aug. 2008. [10] H. Jarollahi, V. Gripon, N. Onizawa, and W. J. Gross, A low-power content-addressable memory based on clustered-sparse networks, in Proc. 24th IEEE Int. Conf. ASAP, pp. 305 308, Jun. 2013. Shruthi H Shetty. " Implementation of SCN Based Content Addresable Memory." IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) 12.4 (2017): 48-52. DOI: 10.9790/2834-1204024852 www.iosrjournals.org 52 Page