An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT

Similar documents
An Accurate Method for Skew Determination in Document Images

Segmentation of Characters of Devanagari Script Documents

Skew Detection Technique for Binary Document Images based on Hough Transform

A Theoretical Comparison of Skew Detection Techniques

SCALED WAVELET TRANSFORM VIDEO WATERMARKING METHOD USING HYBRID TECHNIQUE: SWT-SVD-DCT

Hybrid Image Compression Using DWT, DCT and Huffman Coding. Techniques

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Two Phase Data Reduction for Hough Transformation to Detect Skew

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

A Technique for Classification of Printed & Handwritten text

[Kaur*, 5(2): February, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity

Multiple Skew Estimation In Multilingual Handwritten Documents

OCR For Handwritten Marathi Script

Skew Detection and Correction of Document Image using Hough Transform Method

Skew angle Detection and correction using Radon Transform

Feature Based Watermarking Algorithm by Adopting Arnold Transform

Improved Qualitative Color Image Steganography Based on DWT

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Comparison of DCT, DWT Haar, DWT Daub and Blocking Algorithm for Image Fusion

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

Palmprint Recognition Using Transform Domain and Spatial Domain Techniques

Number Plate Extraction using Template Matching Technique

Copy Move Forgery using Hu s Invariant Moments and Log-Polar Transformations

A Review of Skew Detection Techniques for Document

ISSN (ONLINE): , VOLUME-3, ISSUE-1,

SPEECH WATERMARKING USING DISCRETE WAVELET TRANSFORM, DISCRETE COSINE TRANSFORM AND SINGULAR VALUE DECOMPOSITION

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

Optical Character Recognition

LANGUAGE INDEPENDENT ROBUST SKEW DETECTION AND CORRECTION TECHNIQUE FOR DOCUMENT IMAGES

Digital Image Processing. Image Enhancement in the Frequency Domain

An Integrated Face Recognition Algorithm Based on Wavelet Subspace

ADVANCED IMAGE PROCESSING METHODS FOR ULTRASONIC NDE RESEARCH C. H. Chen, University of Massachusetts Dartmouth, N.

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Subpixel Corner Detection Using Spatial Moment 1)

DCT SVD Based Hybrid Transform Coding for Image Compression

Research on QR Code Image Pre-processing Algorithm under Complex Background

Statistical Image Compression using Fast Fourier Coefficients

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

A Hierarchical Pre-processing Model for Offline Handwritten Document Images

FRACTAL IMAGE COMPRESSION OF GRAYSCALE AND RGB IMAGES USING DCT WITH QUADTREE DECOMPOSITION AND HUFFMAN CODING. Moheb R. Girgis and Mohammed M.

Authentication and Secret Message Transmission Technique Using Discrete Fourier Transformation

Advanced Image Processing, TNM034 Optical Music Recognition

DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION

Image Enhancement Techniques for Fingerprint Identification

MORPHOLOGICAL BOUNDARY BASED SHAPE REPRESENTATION SCHEMES ON MOMENT INVARIANTS FOR CLASSIFICATION OF TEXTURES

Estimation of Skew Angle in Binary Document Images Using Hough Transform

Estimation of Skew Angle in Binary Document Images Using Hough Transform

Human Face Detection and Tracking using Skin Color Combined with Optical Flow Based Segmentation

Invisible Watermarking Using Eludician Distance and DWT Technique

Part-Based Skew Estimation for Mathematical Expressions

Biometric Security System Using Palm print

Copyright Detection System for Videos Using TIRI-DCT Algorithm

Spatial, Transform and Fractional Domain Digital Image Watermarking Techniques

Digital Image Steganography Techniques: Case Study. Karnataka, India.

A Novel Field-source Reverse Transform for Image Structure Representation and Analysis

Rotation and Scaling Image Using PCA

DIGITAL WATERMARKING OF VIDEO USING DCT AND EXTRACTION FROM ATTACKED FRAMES

Performance Analysis of Discrete Wavelet Transform based Audio Watermarking on Indian Classical Songs

APPLICATION OF RADON TRANSFORM IN CT IMAGE MATCHING Yufang Cai, Kuan Shen, Jue Wang ICT Research Center of Chongqing University, Chongqing, P.R.

Texture Image Segmentation using FCM

A New Approach to Compressed Image Steganography Using Wavelet Transform

Pixels. Orientation π. θ π/2 φ. x (i) A (i, j) height. (x, y) y(j)

Image Gap Interpolation for Color Images Using Discrete Cosine Transform

JPEG compression of monochrome 2D-barcode images using DCT coefficient distributions

IMAGE COMPRESSION USING HYBRID TRANSFORM TECHNIQUE

Image Segmentation Based on. Modified Tsallis Entropy

Iterative Removing Salt and Pepper Noise based on Neighbourhood Information

International Journal of Advance Engineering and Research Development. Improving the Compression Factor in a Color Image Compression

Robust DWT Based Technique for Digital Watermarking

Comparative Analysis of Image Compression Using Wavelet and Ridgelet Transform

Comparative Analysis of Different Spatial and Transform Domain based Image Watermarking Techniques

An Efficient Iris Recognition Using Correlation Method

A Novel Image Transform Based on Potential field Source Reverse for Image Analysis

Digital Image Watermarking Using DWT and SLR Technique Against Geometric Attacks

FPGA IMPLEMENTATION FOR REAL TIME SOBEL EDGE DETECTOR BLOCK USING 3-LINE BUFFERS

Multi-focus image fusion using de-noising and sharpness criterion

CREATING 3D WRL OBJECT BY USING 2D DATA

A Parallel Reconfigurable Architecture for DCT of Lengths N=32/16/8

A Novel Approach to Correction of a Skew at Document Level Using an Arabic Script

Filtering. -If we denote the original image as f(x,y), then the noisy image can be denoted as f(x,y)+n(x,y) where n(x,y) is a cosine function.

An Adaptive Threshold LBP Algorithm for Face Recognition

Locating 1-D Bar Codes in DCT-Domain

Comparative Analysis on Medical Images using SPIHT, STW and EZW

A Fast and Accurate Eyelids and Eyelashes Detection Approach for Iris Segmentation

Digital Image Processing

Comparison of Wavelet Based Watermarking Techniques for Various Attacks

Scene Text Detection Using Machine Learning Classifiers

Skew Angle Detection of Bangla Script using Radon Transform

Survey on Multi-Focus Image Fusion Algorithms

Adaptive Fingerprint Image Enhancement Techniques and Performance Evaluations

WEINER FILTER AND SUB-BLOCK DECOMPOSITION BASED IMAGE RESTORATION FOR MEDICAL APPLICATIONS

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Cover Page. Abstract ID Paper Title. Automated extraction of linear features from vehicle-borne laser data

Footprint Recognition using Modified Sequential Haar Energy Transform (MSHET)

Reduction of Blocking artifacts in Compressed Medical Images

Tracing and Straightening the Baseline in Handwritten Persian/Arabic Text-line: A New Approach Based on Painting-technique

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014)

Transcription:

An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT Mandip Kaur, Simpel Jindal Abstract: Skew detection and correction is very important task before pre-processing of an image and it is a major problem in scanned documents, if it not detected correctly it will lead wrong result in future during image analysis. During the scanning of the document, skew is being often introduced in the document image. To measure the processed time and speed taken by skew detection algorithm, the Fast Fourier Transform (FFT) technique is applied as it is fast approach for finding the angle of skewed document. This technique is used by firstly applying DCT compression and thresholding on image to reduce timing computation and after that Fourier spectrum is obtained. Further this spectrum is divided into four quadrants and detected skewed angle of each quadrant is measured. And finally Input image is rotated by using bilinear interpolation method. Keywords: Skew correction, Skew detection, Skew angle, Fast Fourier Transform, DCT, Thresholding, Spectrum 1 INTRODUCTION Skew angle estimation and correction of a scanned document is very important task for document analysis. During digitization of documents, it often happens that the document is not aligned correctly and it may lead to skewed image. Therefore, due to skew it can cause further performance degradation of segmentation and recognition stage of any text processing system [1]. Skew in scanned document can be of two types and shown in figure 1 [2]. 1. Clock -wise skew (Positive skew) 2. Anti-clock wise skew (Negative skew) Figure 1: clock -wise skew (Positive skew) and Anti-clock -wise skew (Negative skew). Several techniques have been used to do the analysis of skewed images such as Projection Profile [4], Centre of Gravity [5], Fourier Transform [6], Hough Transform [7] and Geometric Text-line Modeling [9] but the Fourier Transformation is the fastest and easiest to implement in comparison to all other existing techniques. The working of Fast Fourier technique is explained bellow. Mandip Kaur is currently pursuing masters degree program in Computer Engineering in Yadavindra College of Engineering, Talwandi Sabo, India. E-mail: chahal.mandeep621@gmail.com Simpel Jindal is Assistant Professor at Yadavindra college of Engineering, Talwandi Sabo, India. E-mail: simpel_jindal@rediffmail.com Fast Fourier Transform When the image was converted from its Spatial Domain f(x, y) to its frequency domain f(u, v), then by using Fast Fourier Transform technique it is easy to process the image and its further calculations such as skew angle, speed, time etc. because the frequency at each point of the image can be easily calculated. This technique based upon removal of certain range of frequencies from a processed image. This technique mainly contains the following steps: Computes the Fourier Transform of an Image f(x,y). Computes a new transform f(u,v) by setting some values to zeros The FFT of 2D image having spatial domain f(x, y) of size M x N is given by following Fourier equation [3]. M N 1 f u v f x y e MN j2 ( u / M vy / N ),, x 0 y 0 164

FFT as described by above formula calculates the frequency spectrum of the spatial coordinate function f(x, y). The Fourier function f (u, v) may have complex values but for better accuracy, only absolute values of the frequency spectrum have been taken. As the image is rotated, its frequency spectrum also rotates and shifts the energy of the image to the Centre of the image. Here is the example of the input (skewed) and output (De-skewed) images frequency spectrums shown in figures 2 and figure 3. Figure 3: Output-De-skewed Image and Output Spectrum of the De-skew Image. Figure 2: Original input and Fourier Spectrum of input skewed Image. From the above frequency spectrum it concludes that when image rotates to some angle then its frequency spectrum also rotates, this is the basic mantra of finding the skew angle detection with frequency spectrum. In this paper, we used the Fast Fourier technique to calculate the skewed angle of various images by keeping the similar input parameters as used by Manjunath et al [7] for better performance comparison with Hough transform. 2. LITERATURE SURVEY In 2012 Kumar et al. has introduced a new method which reduces the time complexity without compromising with the accuracy of Hough transform. The main advantage of Hough transform [1] is due to its better accuracy and simplicity. In 2002 Lowether et al. [6] has presented a new Averaged Block Directional Spectrum (ABDS) technique for determining the skew angle of digitized documents. It is based on calculating the average 2D Fourier Transform of blocks in a document image and using the Radon transform to find the peak in the directional spectrum. In 2007 Aradhya et al. [7] has proposed a novel skew detection method for binary document images. This method considered some selected characters of the text which may be subjected to thinning and after applying Hough transform skew angle of the documents are estimated. On the other hand, several experiments have been conducted on various types of documents such as English documents, Journals, Text-Books, different languages, documents with different fonts and resolutions etc. to reveal the robustness of the proposed method. The experimental results reveal that the proposed method is accurate in comparison to the results of well-known existing methods. 3 RESEARCH METHODOLOGY Figure 4 shows the pictorial representation of various steps to be taken in proposed technique to implement for detecting skew of the images. 165

3.1 Proposed Algorithm Step 1.Start. Step 2.Input the Image and read the image using imread() function of the MatLab. Step3.Perform the DCT lossy Compression on the image to reduce the timing calculations during processing of the image. Step 4.Apply thresholding on the input image to get grey-scale image because grey-scale image having improved intensity values and then convert it into binary image. Step 5.Convert the image into its Frequency Domain Spectrum and now frequency value Matrix of the image is stored in size [M N] matrix. Step 6.Divide the Fourier spectrum into Four quadrant. Step7. Calculate the angle of tilt of each quadrant corresponding highest transform intensity of each quadrant say angle degree (1), angle degree(2), angle degree(3), angle degree(4) and Sum up the four angles and find average angle to find the Detected angle. Step 8. If Detected_ angle>=45 Then final angle=90- Detected_angle. Else final angle = -Detected_angle. Step 9.Rotate the image through its centre by angle final angle using bilinear interpolation method. Step 10. Display the De-Skew image. Step 11.End 4 EXPERIMENTAL RESULTS Figure 5 and 5 shows the rotation angle of -25 degree and the compressed image. Original Image 100 200 300 400 500 600 700 100 200 300 400 500 600 700 800 900 1000 1100 Figure 4: Data Flow Diagram of the proposed technique. 166

100 200 300 400 500 600 700 Compression Factor 2 detection. For better comparison, the similar documents have been taken as input as taken by Aradhya et al. [7] and compared the results obtained with the existing technique. In these both the tables the time taken, accuracy and detection angle calculated by proposed and existing techniques have been presented. Sr. No. Actual Detecte d Error Accurac y Time Second 1-30 -30.43 0.014 98.58% 0.202 2-20 -20.11 0.005 99.45 % 0.296 100 200 300 400 500 600 700 800 900 1000 1100 Figure 5: rotated and Compressed Skewed Images. Figure 6 and 6 shows the four quadrants of the Fourier spectrum of an image and output image rotated by -25.38 degree using experimental technique. Fig.3: First Quadrant Fig.4: Second Quadrant 3-10 -10.43 0.04 95.88% 0.192 4-5 -4.90 0.02 98% 0.203 5-3 -3.01 0.003 99.66% 0.111 6 1 0.77 0.23 77% 0.154 7 3 2.32 0.23 77% 0.126 8 5 5.02 0.004 99.6% 0.110 9 10 10.95 0.095 91.32% 0.173 10 15 15.26 0.017 98.29% 0.156 11 18 17.32 0.038 96.22% 0.134 12 19 19.39 0.02 98.34% 0.224 13 20 20.30 0.015 98.52% 0.216 Fig.5: Third Quadrant Fig.6: Fourth Quadrant 14 25 25.38 0.015 98.50% 0.257 15 27 27.72 0.027 97.40% 0.277 16 30 29.80 0.006 99.33% 0.205 17 33 32.53 0.014 98.57% 0.347 18 40 40.16 0.04 99.60% 0.187 19 42 41.05 0.023 97.73% 0.257 20 45 44.34 0.015 98.53% 0.239 Sr. No. Actual Detected Error Accuracy Time Elapsed(in Seconds) 1 3 2.89 0.037 96.33% 1.78 2 5 5.08 0.016 98.42% 1.68 3 10 9.86 00.014 98.60% 1.77 4 15 15.12 0.008 99.21% 1.70 Figure 6: four quadrants of Fourier spectrum and Ouput De-skewed Image. Table 1 and shows the results obtained by proposed and existing techniques. In proposed method, some scanned documents have been selected and after that thresholding is applied on them to convert into binary level documents. The main goal of proposed method is to speed up the skew angle 5 20 20.11 0.005 99.45% 1.80 6 30 30 0.00 100% 1.71 7 40 39.89 0.002 99.72% 1.72 8 45 44.98 0.00004 99.75% 1.74 Table 1: Evaluation of proposed and existing technique. 167

From the above tables it has been obtained that the proposed method has almost similar accuracy in comparison to the results of existing method but with less time consuming. Figure 7 shows the comparison between the actual and detected angle of the scanned document by using proposed technique and comparison of existing and proposed techniques. Figure 8 shows the comparison of accuracy rate of existing technique [7] and proposed technique Figure 8: comparison of accuracy. The accuracy results obtained are nearly similar to the results of existing technique. Time comparison Time is defined as the time taken by the algorithm to run and it is directly proportional to number of input elements. Also the speed is inversely proportional to the time. The comparison of time taken for detection of the skew angle of scanned document by existing technique [7] and proposed technique has been compared in figure 9. Figure 7: Detected angle of the proposed technique and Detected angle of the proposed technique. Accuracy comparison Accuracy rate must be improved to achieve good results. It increases with decrease in error rate and increase reliability of algorithm. Figure 9: comparison of Time. 6 CONCLUSION AND FUTURE WORK 6.1 Conclusion In this paper, a new technique is proposed for determining the skew angle of digitized documents. The results obtained are highly accurate and less time consuming with better speed as compare to the other existing methods. The accuracy achieved a skew angle determination within the range of ±45 degree of true skew angle. The advantage of the technique over most other techniques is the ease of detecting skew over ±45 degree skewed angle. This technique calculates the processed time of skewed angle of different handwritten English documents, Journals, Text-Books, different languages, 168

documents with different fonts and printed documents. Accuracy is also near to 95% and time for calculating the skewed angle is less than one second (average) for all scanned images. 6.2 Future Work Performance evaluation has shown that proposed algorithm is quite good result for determining the angle of an image but there is one limitation it does not work with small resolution images and error is considerable in the case of small degree skewed images. In this technique we have taken preprocessed images, In near future work will to detect low resolution noised skewed images angle correctly and to reduce the elapsed time to detect the angle of skewed image. [8]. Zhang ruilin, Hu yan, Fang zhijian and Zhang lei "Skew Detection and Correction Method of Fabric Images Based on Hough Transform Second International Conference on Intelligent Computation Technology and Automation,Vol. 2, October 2009,Pages 340-343. [9]. Joost van Beusekom, Faisal Shafait and Thomas M. Breuel Combined Orientation and Skew Detection Using Geometric Text-Line Modeling Vol. 13 June 2010, Issue 2, Pages 79-92. ACKNOWLEDGEMENT The author Mandip Kaur would like to thank Yadwindra College of Engineering staff for their support and also thank to Amanpreet Singh,Amit Bansal,Sheetal Singla and Shevali Garg for their fruitful discussion during the writing of research paper. REFERENCES [1]. Deepak Kumar and Dalwinder Singh "Modified Approach of Hough Transform for Skew Detection and Correction in Documented Images" International Journal of Research in Computer Science, Vol. 2, Issue 3, April 2012, Pages 37-40. [2]. Shijian Lu, Jie Wang and Chew Lim Tan Fast and Accurate Detection of Document Skew and Orientation 9th International Conference On Document Analysis And Recognition Vol. 2, 2007, Pages 684-688. [3]. Frank Y. Shih "Image processing and mathematical morphology fundamentals and applications, Edition- 2009. [4]. Shutao Li, Qinghua Shen and Jun Sun Skew detection using wavelet decomposition and projection profile analysis Pattern Recognition Letters, Vol. 28, Issue 5, 1 April 2007, Pages 555 562. [5]. Atallah Mahmoud Al-Shatnawi and Khairuddin Omar Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity Journal of Computer Science, Vol. 5, 2009, Pages 363-368. [6]. S. Lowther, V. Chandran and S. Sridharan An Accurate Method for Skew Determination in Document Images Digital Image Computing Techniques and Applications (DICTA), 21-22 January 2002, Pages 1-5. [7]. Manjunath Aradhya V N, Hemantha Kumar G and Shivakumara P Skew Detection technique for Binary Documents Images Based on Hough Transform, International Journal of Information and Communication Engineering, Vol. 3:7, 2007, Pages 493-499. 169