IJCSIET--International Journal of Computer Science information and Engg., Technologies ISSN

Similar documents
FPGA Based Low Area Motion Estimation with BISCD Architecture

IMPLEMENTATION OF ROBUST ARCHITECTURE FOR ERROR DETECTION AND DATA RECOVERY IN MOTION ESTIMATION ON FPGA

Detecting and Correcting the Multiple Errors in Video Coding System

Detecting and Correcting the Multiple Errors in Video Coding System

Keywords: Processing Element, Motion Estimation, BIST, Error Detection, Error Correction, Residue-Quotient(RQ) Code.

DESIGN FOR TESTABILITY TECHNIQUES FOR VIDEO CODING SYSTEMS

Architecture to Detect and Correct Error in Motion Estimation of Video System Based on RQ Code

VLSI Architecture to Detect/Correct Errors in Motion Estimation Using Biresidue Codes

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

Built In Self Test For Multi Error Detection In Motion Estimation Computing Arrays

A Proposed RAISIN for BISR for RAM s with 2D Redundancy

A VLSI Architecture for H.264/AVC Variable Block Size Motion Estimation

Design of a High Speed CAVLC Encoder and Decoder with Parallel Data Path

MultiFrame Fast Search Motion Estimation and VLSI Architecture

POWER CONSUMPTION AND MEMORY AWARE VLSI ARCHITECTURE FOR MOTION ESTIMATION

IN RECENT years, multimedia application has become more

A COST-EFFICIENT RESIDUAL PREDICTION VLSI ARCHITECTURE FOR H.264/AVC SCALABLE EXTENSION

Fast Wavelet-based Macro-block Selection Algorithm for H.264 Video Codec

Low-Complexity Block-Based Motion Estimation via One-Bit Transforms

Fast Motion Estimation for Shape Coding in MPEG-4

Fast frame memory access method for H.264/AVC

High Performance Hardware Architectures for A Hexagon-Based Motion Estimation Algorithm

3D Memory Formed of Unrepairable Memory Dice and Spare Layer

Using Error Detection Codes to detect fault attacks on Symmetric Key Ciphers

DUE to the high computational complexity and real-time

A Novel Deblocking Filter Algorithm In H.264 for Real Time Implementation

Reducing/eliminating visual artifacts in HEVC by the deblocking filter.

A High-Speed FPGA Implementation of an RSD-Based ECC Processor

A Survey on Early Determination of Zero Quantized Coefficients in Video Coding

VLSI IMPLEMENTATION OF ERROR TOLERANCE ANALYSIS

THE orthogonal frequency-division multiplex (OFDM)

Near Optimal Repair Rate Built-in Redundancy Analysis with Very Small Hardware Overhead

A reversible data hiding based on adaptive prediction technique and histogram shifting

AN EFFICIENT VIDEO WATERMARKING USING COLOR HISTOGRAM ANALYSIS AND BITPLANE IMAGE ARRAYS

High Performance VLSI Architecture of Fractional Motion Estimation for H.264/AVC

Block-based Watermarking Using Random Position Key

Video Compression System for Online Usage Using DCT 1 S.B. Midhun Kumar, 2 Mr.A.Jayakumar M.E 1 UG Student, 2 Associate Professor

Reduced 4x4 Block Intra Prediction Modes using Directional Similarity in H.264/AVC

An Area-Efficient BIRA With 1-D Spare Segments

A SCALABLE COMPUTING AND MEMORY ARCHITECTURE FOR VARIABLE BLOCK SIZE MOTION ESTIMATION ON FIELD-PROGRAMMABLE GATE ARRAYS. Theepan Moorthy and Andy Ye

A Robust Error Resilient Approach for Data Hiding in MPEG Video Files Using Multivariate Regression and Flexible Macroblock Ordering

Improving Energy Efficiency of Block-Matching Motion Estimation Using Dynamic Partial Reconfiguration

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

Enhanced Hexagon with Early Termination Algorithm for Motion estimation

HYBRID DCT-WIENER-BASED INTERPOLATION VIA LEARNT WIENER FILTER. Kwok-Wai Hung and Wan-Chi Siu

Design and Implementation of VLSI 8 Bit Systolic Array Multiplier

POWERFUL BISR DESIGN FOR EMBEDDED SRAM WITH SELECTABLE REDUNDANCY

INTERNATIONAL JOURNAL OF PROFESSIONAL ENGINEERING STUDIES Volume 9 /Issue 3 / OCT 2017

Analysis and Architecture Design of Variable Block Size Motion Estimation for H.264/AVC

VLSI DESIGN FOR CONVOLUTIVE BLIND SOURCE SEPARATION

MOTION estimation is one of the major techniques for

Area Efficient SAD Architecture for Block Based Video Compression Standards

CONTENT ADAPTIVE SCREEN IMAGE SCALING

AN ADJUSTABLE BLOCK MOTION ESTIMATION ALGORITHM BY MULTIPATH SEARCH

A LOW-COMPLEXITY AND LOSSLESS REFERENCE FRAME ENCODER ALGORITHM FOR VIDEO CODING

120 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 61, NO. 2, FEBRUARY 2014

Digit-Level Semi-Systolic and Systolic Structures for the Shifted Polynomial Basis Multiplication Over Binary Extension Fields

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Image Authentication and Recovery Scheme Based on Watermarking Technique

An Efficient Mode Selection Algorithm for H.264

TESTING OF FAULTS IN VLSI CIRCUITS USING ONLINE BIST TECHNIQUE BASED ON WINDOW OF VECTORS

An Efficient VLSI Architecture for Full-Search Block Matching Algorithms

VERY large scale integration (VLSI) design for power

Complexity Reduced Mode Selection of H.264/AVC Intra Coding

An Efficient Intra Prediction Algorithm for H.264/AVC High Profile

A full-pipelined 2-D IDCT/ IDST VLSI architecture with adaptive block-size for HEVC standard

DYNAMIC CIRCUIT TECHNIQUE FOR LOW- POWER MICROPROCESSORS Kuruva Hanumantha Rao 1 (M.tech)

International Journal of Emerging Technology and Advanced Engineering Website: (ISSN , Volume 2, Issue 4, April 2012)

The Serial Commutator FFT

Digital Image Stabilization and Its Integration with Video Encoder

Evaluating the Impact of Carry-Ripple and Carry-Lookahead Adders in Pel Decimation VLSI Implementations

Image Error Concealment Based on Watermarking

DISCRETE COSINE TRANSFORM (DCT) is a widely

Reversible Data Hiding VIA Optimal Code for Image

An Efficient Adaptive Binary Arithmetic Coder and Its Application in Video Coding

RECENTLY, researches on gigabit wireless personal area

A Comparison of Two Algorithms Involving Montgomery Modular Multiplication

A High Quality/Low Computational Cost Technique for Block Matching Motion Estimation

Implementation of A Optimized Systolic Array Architecture for FSBMA using FPGA for Real-time Applications

Hardware Implementation of an Image Interpolation Method with Controllable Sharpness

A NOVEL SECURED BOOLEAN BASED SECRET IMAGE SHARING SCHEME

Very Large Scale Integration (VLSI)

Error Concealment Used for P-Frame on Video Stream over the Internet

II. MOTIVATION AND IMPLEMENTATION

ARITHMETIC operations based on residue number systems

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

HIGH-PERFORMANCE RECONFIGURABLE FIR FILTER USING PIPELINE TECHNIQUE

Estimation and Compensation of Video Motion - A Review

Quality versus Intelligibility: Evaluating the Coding Trade-offs for American Sign Language Video

Design and Implementation of Signed, Rounded and Truncated Multipliers using Modified Booth Algorithm for Dsp Systems.

FAST MOTION ESTIMATION WITH DUAL SEARCH WINDOW FOR STEREO 3D VIDEO ENCODING

VLSI Design and Implementation of High Speed and High Throughput DADDA Multiplier

FOR compressed video, due to motion prediction and

An HEVC Fractional Interpolation Hardware Using Memory Based Constant Multiplication

By Charvi Dhoot*, Vincent J. Mooney &,

OVERVIEW OF IEEE 1857 VIDEO CODING STANDARD

FPGA based High Performance CAVLC Implementation for H.264 Video Coding

A High Sensitive and Fast Motion Estimation for One Bit Transformation Using SSD

Learning based face hallucination techniques: A survey

Reconfigurable Variable Block Size Motion Estimation Architecture for Search Range Reduction Algorithm

Transcription:

Implementation of EDDR Architecture for High Speed Motion Estimation Testing Applications A.MADHAV KUMAR #1, S.RAGAHAVA RAO #2,B.V.RAMANA #3, V.RAMOJI #4 ECE Department, Bonam Venkata Chalamayya Institute of Engineering and Technology, Amalapuram 1 madhavamk07@gmail.com 2 raghu7327@gmail.com 3 bvramana.bvcits@gmail.com 4 ramoji.btech@gmail.com ABSTRACT _Motion Estimation (ME) plays a critical role in a video coder, testing such a module is of priority concern. While focusing on the testing of ME in a video coding system, this work presents an error detection and data recovery (EDDR) design, based on the residue-and-quotient (RQ) code, to embed into ME for video coding testing applications. An error in processing Elements (PEs), i.e. key components of a ME, can be detected and recovered effectively by using the proposed EDDR design. The proposed EDDR design for ME testing can detect errors and recover data with an acceptable area overhead and timing penalty. Index Terms Area overhead, data recovery, error detection, motion estimation, reliability, residueand-quotient (RQ) code. 1. INTRODUCTION Multimedia applications are flexible and reliable when we used advances in semiconductors and communication technologies.h.264 video standard is a good example, it is also known as MPEG-4(Motion Picture Experts Group-4) Part10 Advanced Video Coding, it is widely regarded as the next generation video compression standard. Video Compression reduces the total data amount required for transmitting or storing video data which is necessary in a wide range of applications. Motion estimation (ME) explores the temporal redundancy, it is inherent in video sequences and represents a basis for lossy video compression. Other than video compression, motion estimation can also be used as the basis for powerful video analysis and video processing. A ME (motion estimation) generally consists of PEs (Processing elements) with a size of 4 x 4. Additionally, the visual quality and peak signal-to-noise ratio (PSNR) at a given bit rate are influenced if an error occurred in ME process. As a commercial chip, it is absolutely necessary for the ME to introduce design for testability (DFT). Device testing is increased by using DFT which gives high reliability of a system. DFT methods rely on reconfiguration of a circuit under test (CUT) to improve testability. Systems have meant that built-in self-test (BIST) schemes have rapidly become necessary in the digital world. BIST does not expensive test equipment, ultimately lowering test costs. The built-in testing approaches not only detect faults but also specify their locations for error correcting. Thus, BIST extended scheme referred to as built-in self-diagnosis and built-in self correction have been developed recently. The extended BIST schemes generally focus on testing-related issues, memory circuits of video coding have seldom been addressed. 2. PROPOSED EDDR ARCHITECTURE DESIGN Fig. 1 shows the proposed EDDR scheme, which depends on error detection circuit (EDC) and data recovery circuit (DRC), to detect errors and recover the corresponding data in a specific CUT. The test code generator (TCG) utilizes RQ code to generate the corresponding test codes for error detection and data recovery. DRC is in charge of recovering data from TCG and a selector is enabled to export error-free data or data-recovery results. This work adopts the systolic ME as a CUT to demonstrate the feasibility of the proposed EDDR architecture. A ME consists of many PEs (processing elements) incorporated in a 1-D or 2-D array for video encoding applications. A PE generally consists of two ADDs (i.e. an 8-b ADD and a 12-b ADD) and an accumulator (ACC). Next, the 8-b ADD (a pixel has 8-b data) is used to estimate the addition of the current pixel (Cur pixel) and reference pixel (Ref pixel). Additionally, a 12-b ADD and an ACC are required to accumulate the results from the 8- b ADD in order to determine the sum of absolute difference (SAD) value for video encoding applications [20]. Notably, some registers and latches may exist in ME to complete the data shift and storage. Fig. 2 shows an example of the proposed EDDR circuit design for a specific of a ME. Fig. 1. Conceptual view of the proposed EDDR architecture. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 1

which N is coded as a pair ( ). Notably, is 3. MAIN MODULES OF EDDR ARCHITECTURE the residue N of modulo m. the residue code can detect a bit error and it cannot be recovered effectively. a) PROCESSING ELEMENT Therefore quotient code is derived from residue code to b) RQ CODE GENERATION detect multiple errors and recovering errors. c) TEST CODE GENERATION d) ERROR DETECTION CIRCUIT 3c. TEST CODE GENERATIONS e) D ATA RECOVERY CIRCUIT 3a. PROCESSING ELEMENT (PE) ME consists of many PEs incorporated in a 1-D or 2-D array for video encoding applications.pe generally consists of two adders (i.e. an 8-b ADD and a 12-b ADD) and an accumulator (ACC). Next, the 8-b Adder (a pixel has 8-b data) is used to estimate the addition of the current pixel (Cur pixel) and reference pixel (Ref pixel). Additionally, a 12-b ADD and an ACC are required to accumulate the results from the 8- b ADD in order to determine the sum of absolute difference (SAD) value for video encoding applications. The specific estimates the absolute difference between the Cur pixel and Ref pixel. Thus, by utilizing PEs, SAD shown in as follows, in a macro block with size of N X N can be evaluated: In proposed EDDR architecture TCG is an important component.the design of TCG is based on the RQCG circuit to generate corresponding test codes in order to detect errors and recover data. The specific PE (processing element) estimates the absolute difference between the Cur pixel and the Ref pixel. The residue code generates the RQ code ( ) from TCG. N 1 N 1 i 0 j 0 ( q xij. m r xij ) ( q yij. m r Where, and, denote the corresponding RQ code of and modulo. 3b. RQ CODE GENERATION Coding approaches are like this as Berger code, parity code, and residue code which is destined to detect circuit errors. Residue code is a generally separable arithmetic codes which estimates residue for data and appending it to data. Separate residue code is typically derived through error detection logic for operations, which makes an easy and simply implemented. It can detect only a bit error and additionally an error cannot be recovered effectively by using the residue code. Therefore, this work presents a quotient code, it is derived from the residue code, which detect the multiple errors and recovering errors. For instance, assume that N denotes an integer, N_1and N_2 represent data words, and m refers to the modulus value. A separate residue code of interest is one in yij ) IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 2 And

results from the DRC is selected by a multiplexer (MUX) to pass to the next species for subsequent testing. Fig. 2. A specific testing processes of the proposed EDDR architecture. 4. Numerical Example A numerical example of the 16 pixels for a 4 x 4 macroblock in a specific of a ME is described as follows. Fig. 5 presents an example of pixel values of the Cur_pixel and Ref_pixel.The SAD value of the 4 x 4 macroblock is Fig. 4. Example of pixel values. Fig. 3. Circuit design of the TCG. 3d. ERROR DETECTION CIRCIUT (EDC) The operations of error detection in a specific is achieved by using EDC, based on the outputs between TCG and in order to determine whether errors have occurred. If the values of and/or, then the errors in a specific can be detected. The EDC output is then used to generate a 0/1 signal to indicate that the tested is errorfree/errancy. = (128-1) + (128-1)+ + (128-5) = 2124 3e. DATA RECOVERY CIRCUIT (DRC) DRC generates error free output by the quotient multiply with constant value and add to reminder code. During the data recovery, the circuit DRC plays a significant role in recovering RQ code from TCG. The proposed EDDR design executes the error detection and data recovery operations simultaneously and also error-free data from the tested or data recovery that IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 3

5. CONCLUSION This work presents EDDR architecture for detecting the errors and recovering the data of PEs in a ME. By using RQ code, a RQCG-based TCG design is developed to generate the corresponding test codes to detect errors and recover data. Experimental results indicate that that the proposed EDDR architecture can effectively detect errors and recover data in PEs of a ME with reasonable area overhead and only a slight time penalty. Fig. 5. Proposed EDDR architecture design for a ME. Fig. 6 RTL diagram of proposed EDDR Fig. 7.Simulation results of proposed EDDR TABLE 1 ESTIMATION OF AREA OVERHEAD AND TIME PENALTY OPERATION TIME(ns) 35.137 Number of 954 4 input LUT S REFERENCES [1] Chang-Hsin Cheng, Yu Liu, and Chun-Lung Hsu, Design of an Error Detection and Data Recovery Architecture for Motion Estimation Testing Applications, ieee transactions on very large scale integration (vlsi) systems, vol. 20, no. 4, april 2012 [2] Advanced Video Coding for Generic Audiovisual Services, ISO/IEC 14496-10:2005 (E), Mar.2005, ITU- T Rec. H.264 (E). [3] Information Technology-Coding of Audio-Visual Objects Part 2: Visual, ISO/IEC 14 496-2,1999. [4] Y.W.Huang, B.Y.Hsieh, S.Y.Chien, S.Y.Ma, and L.G.Chen, Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC, IEEE Trans.Circuits Syst. Video Technol., vol. 16, no. 4, pp. 507 522, Apr. 2006. [5] C. Y. Chen, S. Y. Chien, Y. W. Huang, T. C. Chen, T. C. Wang, and L. G. Chen, Analysis and architecture design of variable block-size motion estimation for H.264/AVC, IEEE Trans. Circuits Syst. I, Reg.Papers, vol. 53, no. 3, pp. 578 593, Mar. 2006. [6]T.H.Wu,Y.L.Tsai,andS.J.Chang, An efficient design-for-testability scheme for motionestimation in H.264/AVC, in Proc. Int. Symp.VLSI Design, Autom. Test, Apr. 2007, pp. 1 4. [7] M. Y. Dong, S. H. Yang, and S. K. Lu, Designfor-testability techniques for motion estimation computing arrays, in Proc. Int. Conf.Commun., Circuits Syst., May 2008, pp. 1188 1191. [8] Y. S. Huang, C. J. Yang, and C. L. Hsu, C-testable motion estimation design for video coding systems, J. Electron. Sci. Technol., vol. 7, no.4, pp. 370 374, Dec. 2009. [9] D. Li, M. Hu, and O. A. Mohamed, Built-in selftest design of motion estimation computing array, in Proc. IEEE Northeast Workshop CircuitsSyst., Jun. 2004, pp. 349 352. [10] Y. S. Huang, C. K. Chen, and C. L. Hsu, Efficient built-in self-test for video coding cores: A case study on motion estimation computing array, in Proc. IEEE Asia Pacific Conf. Circuit Syst., Dec. 2008, pp.1751 1754. [11] W. Y Liu, J. Y. Huang, J. H. Hong, and S. K. Lu, Testable design and BIST techniques for systolic motion estimators in the transform domain, in Proc. IEEE Int. Conf. Circuits Syst., Apr. 2009, pp. 1 4. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 4

[12] J. M. Portal, H. Aziza, and D. Nee, EEPROM memory: Threshold voltage built in self diagnosis, in Proc. Int. Test Conf., Sep. 2003, pp.23 28. [13] J. F. Lin, J. C. Yeh, R. F. Hung, and C. W. Wu, A built-in self-repair design for RAMs with 2-D redundancy, IEEE Trans. Vary Large Scale Integr. (VLSI) Syst., vol. 13, no. 6, pp. 742 745, Jun. 2005. [14] C. L. Hsu, C. H. Cheng, and Y. Liu, Built-in selfdetection/correction architecture for motion estimation computing arrays, IEEE Trans.Vary Large Scale Integr. (VLSI) Systs., vol. 18, no. 2, pp. 319 324, Feb.2010. [15] C. H. Cheng, Y.Liu, and C. L.Hsu, Low-cost BISDC design for motion estimation computing array, in Proc. IEEE Circuits Syst. Int.Conf., 2009, pp. 1 4. [16] S. Bayat-Sarmadi and M. A. Hasan, On concurrent detection of errors in polynomial basis multiplication, IEEE Trans. Vary Large Scale Integr.(VLSI) Systs., vol. 15, no. 4, pp. 413 426, Apr. 2007. [17] C. W. Chiou, C. C. Chang, C. Y. Lee, T. W. Hou, and J. M. Lin, Concurrent error detection and correction in Gaussian normal basis multiplier over GF, IEEE Trans. Comput., vol.58, no. 6, pp.851 857, Jun. 2009. [18] L. Breveglieri, P. Maistri, and I. Koren, A note on error detection in an RSA architecture by means of residue codes, in Proc. IEEE Int.Symp.On-Line Testing, Jul. 2006, 176 177. [19] S. J. Piestrak, D. Bakalis, and X. Kavousianos, On the design of selftesting checkers for modified Berger codes, in Proc.IEEE Int. WorkshopOn-Line Testing, Jul. 2001, pp. 153 157. [20] S. Surin and Y. H. Hu, Frame-level pipeline motion estimation array processor, IEEE Trans. Circuits Syst. Video Technol., vol. 11, no. 2,pp. 248 251, Feb. 2001. [21] D. K. Park, H. M. Cho, S. B. Cho, and J. H. Lee, A fast motion estimation algorithm for SAD optimization in sub-pixel, in Proc. Int. Symp.Integr. Circuits, Sep. 2007, pp. 528 531. IJCSIET-ISSUE4-VOLUME3-SERIES3 Page 5