Locating 1-D Bar Codes in DCT-Domain

Similar documents
Region Segmentation for Facial Image Compression

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

Artifacts and Textured Region Detection

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

A Image Comparative Study using DCT, Fast Fourier, Wavelet Transforms and Huffman Algorithm

Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig

Haresh D. Chande #, Zankhana H. Shah *

Biometric Security System Using Palm print

An Accurate Method for Skew Determination in Document Images

Color Local Texture Features Based Face Recognition

Outline Introduction MPEG-2 MPEG-4. Video Compression. Introduction to MPEG. Prof. Pratikgiri Goswami

Copyright Detection System for Videos Using TIRI-DCT Algorithm

DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM

signal-to-noise ratio (PSNR), 2

Scene Text Detection Using Machine Learning Classifiers

OCR For Handwritten Marathi Script

Skin colour based face detection

A LOW COST OPTICAL BARCODE READER USING A WEBCAM

Wavelet Transform (WT) & JPEG-2000

A Texture Feature Extraction Technique Using 2D-DFT and Hamming Distance

Image Compression for Mobile Devices using Prediction and Direct Coding Approach

Wireless Communication

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

EDGE BASED REGION GROWING

Texture Segmentation by Windowed Projection

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

Image Inpainting Using Sparsity of the Transform Domain

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)

Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model

Image Enhancement Techniques for Fingerprint Identification

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

Detecting People in Images: An Edge Density Approach

Implementation and analysis of Directional DCT in H.264

CMPT 365 Multimedia Systems. Media Compression - Image

Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

TEXTURE ANALYSIS USING GABOR FILTERS

Image and Video Watermarking

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

Fractional Discrimination for Texture Image Segmentation

DWT Based Text Localization

A Compressed Representation of Mid-Crack Code with Huffman Code

Compression of Stereo Images using a Huffman-Zip Scheme

Yui-Lam CHAN and Wan-Chi SIU

Footprint Recognition using Modified Sequential Haar Energy Transform (MSHET)

Bit-Plane Decomposition Steganography Using Wavelet Compressed Video

A Kind of Fast Image Edge Detection Algorithm Based on Dynamic Threshold Value

A Miniature-Based Image Retrieval System

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Auto-focusing Technique in a Projector-Camera System

MANY image and video compression standards such as

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

Data Hiding in Binary Text Documents 1. Q. Mei, E. K. Wong, and N. Memon

Mesh Based Interpolative Coding (MBIC)

Short Survey on Static Hand Gesture Recognition

Detection of a Single Hand Shape in the Foreground of Still Images

A new approach to reference point location in fingerprint recognition

Palmprint Recognition Using Transform Domain and Spatial Domain Techniques

An Adaptive Approach for Image Contrast Enhancement using Local Correlation

EE795: Computer Vision and Intelligent Systems

Final Review. Image Processing CSE 166 Lecture 18

Content-Based Image Retrieval of Web Surface Defects with PicSOM

Video compression with 1-D directional transforms in H.264/AVC

IMAGE PROCESSING AND IMAGE REGISTRATION ON SPIRAL ARCHITECTURE WITH salib

Document Text Extraction from Document Images Using Haar Discrete Wavelet Transform

A Very Low Bit Rate Image Compressor Using Transformed Classified Vector Quantization

Video Compression System for Online Usage Using DCT 1 S.B. Midhun Kumar, 2 Mr.A.Jayakumar M.E 1 UG Student, 2 Associate Professor

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

A New Algorithm for QR Code Watermarking Technique For Digital Images Using Wavelet Transformation Alikani Vijaya Durga, S Srividya

EE 5359 MULTIMEDIA PROCESSING SPRING Final Report IMPLEMENTATION AND ANALYSIS OF DIRECTIONAL DISCRETE COSINE TRANSFORM IN H.

Review and Implementation of DWT based Scalable Video Coding with Scalable Motion Coding.

DIGITAL IMAGE WATERMARKING BASED ON A RELATION BETWEEN SPATIAL AND FREQUENCY DOMAINS

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

C E N T E R A T H O U S T O N S C H O O L of H E A L T H I N F O R M A T I O N S C I E N C E S. Image Operations II

The Vehicle Logo Location System based on saliency model

LETTER Improvement of JPEG Compression Efficiency Using Information Hiding and Image Restoration

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

A deblocking filter with two separate modes in block-based video coding

ECE 533 Digital Image Processing- Fall Group Project Embedded Image coding using zero-trees of Wavelet Transform

Face Detection for Skintone Images Using Wavelet and Texture Features

DESIGNING A REAL TIME SYSTEM FOR CAR NUMBER DETECTION USING DISCRETE HOPFIELD NETWORK

Review for the Final

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

CSE237A: Final Project Mid-Report Image Enhancement for portable platforms Rohit Sunkam Ramanujam Soha Dalal

DCT Coefficients Compression Using Embedded Zerotree Algorithm

Adaptive thresholding by variational method. IEEE Transactions on Image Processing, 1998, v. 7 n. 3, p

Critique: Efficient Iris Recognition by Characterizing Key Local Variations

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP ( 1

Multimedia Systems Video II (Video Coding) Mahdi Amiri April 2012 Sharif University of Technology

Efficient SLAM Scheme Based ICP Matching Algorithm Using Image and Laser Scan Information

Multimedia Information Retrieval

Fast Evaluation of the Square Root and Other Nonlinear Functions in FPGA

An Autoassociator for Automatic Texture Feature Extraction

Wavelet Based Image Compression Using ROI SPIHT Coding

Lossless and Lossy Minimal Redundancy Pyramidal Decomposition for Scalable Image Compression Technique

EECS 209A Rendering Techniques for Biomedical Imaging. ( x)

Fingerprint Image Compression

Transcription:

Edith Cowan University Research Online ECU Publications Pre. 2011 2006 Locating 1-D Bar Codes in DCT-Domain Alexander Tropf Edith Cowan University Douglas Chai Edith Cowan University 10.1109/ICASSP.2006.1660449 This article was originally published as: Tropf, A. S., & Chai, D. K. (2006). Locating 1-D Bar Codes in DCT-Domain. Proceedings of 2006 IEEE International Conference on Acoustics, Speech and Signal Processing. (pp. 741-744). Toulouse, France. IEEE. Original article available here 2006 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. This Conference Proceeding is posted at Research Online. http://ro.ecu.edu.au/ecuworks/2129

LOCATING 1-D BAR CODES IN DCT-DOMAIN Alexander Tropf and Douglas Chai Visual Information Processing Research Group School of Engineering and Mathematics, Edith Cowan University 100 Joondalup Drive, Joondalup WA 6027, Perth, Australia ABSTRACT Today s digital cameras and camera phones allow users to capture bar codes, which are used to uniquely identify consumer products. In this paper a fast algorithm is proposed that locates a 1-D bar code in the DCT-domain of a bar code image taken by a digital camera. The algorithm uses the DCT-transform properties to distinguish bar code from other texture, morphological operations to smooth the detected bar code area and the features of the extracted area to detect position and orientation of a bar code in the image. 1. INTRODUCTION In today s modern society, almost every consumer product has a unique 1-D bar code for identification. With the use of a bar code laser scanner, information about a product such as description and price can be easily accessed. To enable consumer to obtain information about a consumer product at home or in a supermarket, a scanning device that can decode the product s bar code, and a communication device that retrieves the information from a consumer product server, are required. Today s camera phones can handle both tasks: they can take a picture of the bar code and connect to a consumer product server to obtain information about the product. The critical part of enabling a camera phone for bar code reading is that it first has to locate the bar code in the image and then it requires to decode the bar code. In this article we shall concentrate on the first part: locating the bar code. Bar code localisation in computer images is not new. There exist numerous approaches for 1-D bar code location using analysis in the spatial domain [1 5], Gabor filtering [6] and analysis in the Wavelet domain [7]. However using the DCTdomain for bar code location has not been investigated yet. JPEG which uses DCT prior to data quantisation is one of the most common compression standards for digital images. Because of this fact, JPEG codecs are usually implemented in a camera phone s hardware [8, 9] which will provide fast access to the DCT-domain of a bar code image. This paper is organised as follows: Section 2 investigates the properties of 1-D bar codes in the DCT-domain. Our algorithm is presented in Section 3. Experimental results are given in Section 4, while concluding remarks can be found in Section 5. 2. PROPERTIES OF 1-D BAR CODES IN THE DCT-DOMAIN Basically, JPEG compression involves the following steps: DCT is performed on each 8 8 block of the image. The DCT coefficients are quantised. The quantised DCT coefficients are encoded with Huffman coding to generate the JPEG bit-stream. The DCT transforms an 8 8 image block into 64 DCT coefficients. The first DCT coefficient is the DC-value since it gives the average value of the image block. The remaining coefficients are the AC-values that give the spatial frequencies of an image block in ascending order. A 1-D bar code can simply be described as an alternating sequence of black and white stripes in one specific direction. Consider two simple cases: 1. The bar code is aligned horizontally and the black and white stripes alternate in x-direction. 2. The bar code is aligned vertically and the black and white stripes alternate in y-direction. Let us have a look at the DCT-coefficient in the bar code region of the first case. There should be AC-coefficients in x- direction of high magnitude, whereas the coefficients that represent alternating y-components should be ideally zero, or at least very small. In the second case there will be strong ACcomponents in y-direction and no or only small components in x-direction. Hence DCT-coefficients of bar code regions not only can be distinguished from non-bar code regions, they also give information about the orientation of the bar code. In case the bar code is aligned under an arbitrary angle, the DCTcoefficients of the bar code area will have AC-components in both directions, where the centre weight of each frequency component should give the approximate angle. For example if the alignment angle is 45, the bar code AC-coefficients will have an equal portion in x- and y-direction. 1 4244 0469 X/06/$20.00 2006 IEEE II 741 ICASSP 2006

DCT-blocks DC-components 0 abs(dct-coefficients) mean(dct-blocks) group G f, c fmax =max(g f ) weight matrix W sum(dct-blocks) morphological closing conversion to binary region labeling region properties bar code region verification G 1 G 2 G 3 G 4 G 5 G 6 G 7 G 8 c 11 c 12 c 13 c 21 c 22 c 23 c 31 c 32 c 33 Fig. 1. Flowchart of bar code location algorithm Fig. 3. Grouping of DCT-coefficients from each frequency range...... Collection of DCT blocks Average DCT block c 11 each frequency range f in the average block. The coefficients c fmax now indicate the coefficients in the bar code area that are strongest. 5. Compute a weight matrix W of dimension 8 8, where each element w i,j is defined as { ke, if c w i,j = ij = c fmax k d, else Fig. 2. Computing average DCT-block from all DCT-blocks 3. BAR CODE LOCATION ALGORITHM Here, we propose a bar code location algorithm that works on the DCT-coefficients on luminance of a bar code image. To be able to locate bar code DCT-coefficients and to assist in the decoding step, we set the constraint that a bar code should cover an area of at least 10% of an image. The ideal outcome is to detect all lines of the bar code, so that a simple algorithm can be devised for decoding. A flow chart of our proposed bar code location algorithm is given in Figure 1. The steps of the algorithm are described as follows: 1. Compute magnitude of DCT-coefficients. Only the magnitude of the AC-values is of interest. 2. Set DC-components to 0. Only the AC-values are required to locate the bar code area. 3. Calculate an average DCT-block from all 8 8 blocks of DCT-coefficients (see Figure 2). 4. Take the average DCT-block and group all DCT-coefficients c ij of one frequency range f together into an array G f, so that G f = c ij, (i = f) j = {1,...,f} (j = f) (i = {1,...,f 1}). For example G 3 = {c 31,c 32,c 33,c 13,c 23 } (see Figure 3). Then calculate the largest DCT-coefficient c fmax =max(g f ), from where k e and k d are the emphasis and deemphasis factors, respectively. A relationship between k e and k d will be determined in Section 4, but generally one has to choose k e >k d to emphasise bar code coefficients more than deemphasising other coefficients. 6. Perform an element multiplication of each 8 8 DCT block with W. Then calculate the sum of each DCTblock. The higher the sum of a DCT block, the higher is the likelihood that it belongs to the bar code region. The DCT-sums make up a subsampled DCT-image by factor 8 in each dimension, which will increase computation time of subsequent steps. Negative values will be set to 0. Positive values will be scaled to the range [0,255], to construct a gray-scale image. 7. Perform morphological closing on the gray-scale image from Step 6. This will smooth the bar code area. In most cases the bar code area now has the highest intensity compared to surrounding regions. 8. Convert gray scale data into binary using the Otsu thresholding technique [10]. 9. Label all 8-connected regions as it is described in [11]. 10. At this stage we know that the bar code region has to be large and of approximate rectangular shape. In a simple case the bar code area is much larger than other areas. Therefore the algorithm will choose the largest region as bar code area and proceed with Step 11. In case there are a few large regions that differ in size by factor 2 or less, the one with the highest solidity is taken. The term solidity here refers to the proportion of pixel II 742

in the convex hull that are also in the region. If both areas have a high solidity, the bar code detection will go back to Stage 8 and set a high threshold for the binary image conversion. Since the bar code area has a high intensity after Step 7, it will be the first one to show up. Therefore Step 10 chooses the largest area as bar code region and checks the solidity. The algorithm will then repeat Steps 8 to 10 until the result is satisfactory, or abort if no bar code region can be distinguished after a few iterations. 11. In the last stage the bar code region has to be verified to be rectangular to a certain degree. The extrema points [11] are used to check whether the area conforms to a rectangular. In the end we obtain a binary image mask, which identifies the bar code region of the image. 4. RESULTS We are using digital bar code images of size 1280 960 pixel to test the algorithm. First of all the emphasis and deemphasis factors have to be obtained. If the ratio k e /k d is too low, the bar code area is not emphasised enough and parts of the bar code can be missing. On the other hand, if the ratio is chosen too high, texture is not deemphasised enough and will show up as high intensity area. Experimental results have shown that choosing k e /k d =3, gives reasonable results. Figure 4 shows the result of locating a horizontally aligned bar code. Since the bar code area has only vertical edges, only the DCT-coefficients that represent changes in x-direction will be emphasised. Vertical edges that appear outside of the bar code area are emphasised as well (see upper right corner in Figure 4b). However they can be easily distinguished from the bar code region in Step 10 of the algorithm. The example also shows that texture is deemphasised and it appears only at low intensity after the morphological operation (see left side of Figure 4b). Note that the inclusion or exclusion of numbers beneath the bar code does not have a significant impact of the subsequent decoding process. Figures 5 and 6 show the results of locating a diagonally and vertically aligned bar code, respectively. It shows that our algorithm is rotation invariant to a high degree. The detection works even if the bar code has a high content of texture (see Figure 7). In most cases the bar code was located correctly. However if the percentage of texture in a bar code image is too high compared to the bar code region, the algorithm fails. 5. CONCLUSION We have shown in this paper that bar code location in the DCT-domain produces good results. Furthermore, the pro- Fig. 4. Result of locating a horizontally aligned bar code: original image; result after morphological operation; binary image mask; located bar code Fig. 5. Result of locating an arbitrarily aligned bar code: original image; result after morphological operation; binary image mask; located bar code II 743

posed algorithm is relatively simple to implement and performs very fast. To increase robustness of the algorithm, the bar code region estimation and bar code region verification in the last two steps of our algorithm should be improved. However the algorithm in its current state successfully detects a high number of bar code areas on consumer goods. The future work will involve an implementation of the bar code location algorithm in JAVA on a camera phone and the implementation of a suitable bar code decoding algorithm. 6. REFERENCES [1] C. Viard-Gaudin, N. Normand, and D. Barba, A bar code location algorithm using a two-dimensional approach, in Proc. IEEE ICDAR 93, Oct. 1993, pp. 45 48. [2] S.-J. Liu, H.-Y. Liao, L.-H. Chen, H.-R. Tyan, and J.- W. Hsieh, Camera-based bar code recognition system using neural net, in Proc. IEEE IJCNN 93, vol. 2, Nagoya, Oct. 1993, pp. 1301 1305. Fig. 6. Result of locating a vertically aligned bar code: original image; result after morphological operation; binary image mask; located bar code Fig. 7. Result of locating a bar code in image with strong surrounding texture: original image; result after morphological operation; binary image mask; located bar code [3] R. Howlett, S. Berthier, and G. Awcock, Determining the location of industrial bar-codes using neural networks, in Proc. IEE IPA 97, vol. 2, July 1997, pp. 511 515. [4] S. Arnould, G. Awcock, and R. Thomas, Remote barcode localisation using mathematical morphology, in Proc. IEE IPA 93, vol. 2, July 1999, pp. 642 646. [5] D. Chai and F. Hock, Locating and decoding EAN-13 barcodes from images captured by digital cameras, in Proc. IEEE ICICS 05, Dec. 2005, pp. 1556 1560. [6] A. K. Jain and Y. Chen, Bar code localization using texture analysis, in Proc. IEEE ICDAR 93, Oct. 1993, pp. 41 44. [7] R. Oktem, Bar code localization in wavelet domain by using binary morphology, in Proc. IEEE SIU 04, Apr. 2004, pp. 499 501. [8] Epson, New Epson mobile graphics engine for enhanced music playing on mobile phones, June 2005. [Online]. Available: http://www.epson.co.jp/e/ newsroom/2005/news_2005_06_27.htm [9] 3G, New mobile media processors for mobile phones, Mar. 2004. [Online]. Available: http://www.3g.co.uk/ PR/March2004/6886.htm [10] N. Otsu, A threshold selection method from gray-level histograms, IEEE Trans. Syst., Man, Cybern., vol. 9, no. 1, pp. 62 66, 1979. [11] R. M. Haralick and L. G. Shapiro, Computer and Robot Vision. Addison-Wesley, 1992, vol. 1. II 744