Skew Detection and Correction of Document Image using Hough Transform Method

Similar documents
DATABASE DEVELOPMENT OF HISTORICAL DOCUMENTS: SKEW DETECTION AND CORRECTION

Skew Detection Technique for Binary Document Images based on Hough Transform

An Accurate Method for Skew Determination in Document Images

Estimation of Skew Angle in Binary Document Images Using Hough Transform

Estimation of Skew Angle in Binary Document Images Using Hough Transform

Pixels. Orientation π. θ π/2 φ. x (i) A (i, j) height. (x, y) y(j)

OCR For Handwritten Marathi Script

NEW TECHNIQUE FOR SKEW ANGLE DETECTION OF TEXT IN IMAGE DOCUMENT

Research on QR Code Image Pre-processing Algorithm under Complex Background

Segmentation of Characters of Devanagari Script Documents

Determining Document Skew Using Inter-Line Spaces

A Theoretical Comparison of Skew Detection Techniques

Skew Angle Detection of Bangla Script using Radon Transform

A Review of Skew Detection Techniques for Document

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Skew angle Detection and correction using Radon Transform

Part-Based Skew Estimation for Mathematical Expressions

OBJECT detection in general has many applications

A Hierarchical Pre-processing Model for Offline Handwritten Document Images

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

A Model-based Line Detection Algorithm in Documents

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

Scanner Parameter Estimation Using Bilevel Scans of Star Charts

CIRCULAR MOIRÉ PATTERNS IN 3D COMPUTER VISION APPLICATIONS

An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT

Cover Page. Abstract ID Paper Title. Automated extraction of linear features from vehicle-borne laser data

SKEW DETECTION AND CORRECTION

HCR Using K-Means Clustering Algorithm

Layout Segmentation of Scanned Newspaper Documents

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

Motion Estimation. There are three main types (or applications) of motion estimation:

Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation. Range Imaging Through Triangulation

Computer Assisted Image Analysis TF 3p and MN1 5p Lecture 1, (GW 1, )

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

Subpixel Corner Detection Using Spatial Moment 1)

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

Extraction and Recognition of Alphanumeric Characters from Vehicle Number Plate

Using Web Camera Technology to Monitor Steel Construction

Schedule for Rest of Semester

Optical Character Recognition

Face Detection Using Color Based Segmentation and Morphological Processing A Case Study

A two-stage approach for segmentation of handwritten Bangla word images

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

Image Processing Fundamentals. Nicolas Vazquez Principal Software Engineer National Instruments

Multiple Skew Estimation In Multilingual Handwritten Documents

Based on correlation coefficient in image matching

HOUGH TRANSFORM CS 6350 C V

LANGUAGE INDEPENDENT ROBUST SKEW DETECTION AND CORRECTION TECHNIQUE FOR DOCUMENT IMAGES

Coarse-to-Fine Search Technique to Detect Circles in Images

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT

[Kaur*, 5(2): February, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Recognition of Unconstrained Malayalam Handwritten Numeral

On Segmentation of Documents in Complex Scripts

Handwritten Devanagari Character Recognition Model Using Neural Network

CHANGED DOCUMENTED IMAGES APPROACH OF HOUGH TRANSFORM FOR SKEW DETECTION AND CORRECTION

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Multimedia Technology CHAPTER 4. Video and Animation

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

ABSTRACT I. INTRODUCTION. Dr. J P Patra 1, Ajay Singh Thakur 2, Amit Jain 2. Professor, Department of CSE SSIPMT, CSVTU, Raipur, Chhattisgarh, India

A System towards Indian Postal Automation

Including the Size of Regions in Image Segmentation by Region Based Graph

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

OFFLINE SIGNATURE VERIFICATION

Character Recognition of High Security Number Plates Using Morphological Operator

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A New Algorithm for Detecting Text Line in Handwritten Documents

RESEARCH ON OPTIMIZATION OF IMAGE USING SKELETONIZATION TECHNIQUE WITH ADVANCED ALGORITHM

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

Practical Image and Video Processing Using MATLAB

Auto-Digitizer for Fast Graph-to-Data Conversion

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

The Vehicle Logo Location System based on saliency model

An adaptive container code character segmentation algorithm Yajie Zhu1, a, Chenglong Liang2, b

A Simple Text-line segmentation Method for Handwritten Documents

EE640 FINAL PROJECT HEADS OR TAILS

Scene Text Detection Using Machine Learning Classifiers

Extracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang

Motion Detection Algorithm

Skew Detection for Complex Document Images Using Fuzzy Runlength

CS 223B Computer Vision Problem Set 3

A Painting Based Technique for Skew Estimation of Scanned Documents

Algorithm research of 3D point cloud registration based on iterative closest point 1

A Document Image Analysis System on Parallel Processors

CS 231A Computer Vision (Fall 2012) Problem Set 3

Identifying and Reading Visual Code Markers

[10] Industrial DataMatrix barcodes recognition with a random tilt and rotating the camera

Distance and Angles Effect in Hough Transform for line detection

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

HANDWRITTEN TEXT LINE EXTRACTION BASED ON MINIMUM SPANNING TREE CLUSTERING

IMPLEMENTING ON OPTICAL CHARACTER RECOGNITION USING MEDICAL TABLET FOR BLIND PEOPLE

MATRIX BASED INDEXING TECHNIQUE FOR VIDEO DATA

A Novel Field-source Reverse Transform for Image Structure Representation and Analysis

Types of image feature and segmentation

Detection of Rooftop Regions in Rural Areas Using Support Vector Machine

Transcription:

Skew Detection and Correction of Document Image using Hough Transform Method [1] Neerugatti Varipally Vishwanath, [2] Dr.T. Pearson, [3] K.Chaitanya, [4] MG JaswanthSagar, [5] M.Rupesh [1] Asst.Professor, [2] Professor, [3][4][5] UG students [1][2][3] Department of ECE, St. Peter s Engineering College, Maisammaguda, Medchal, Hyderabad, Telangana, India Abstract - During document scanning, skew is inevitably introduced into the incoming document image. Skew detection is one the first operations to be applied to scanned documents when converting data to a digital format. Its aim is to align an image before processing because text segmentation and recognition methods require properly aligned next lines. In this paper, skew detection and correction of documnet image is performed by hough transform technique. The hough transform algorithm have been implemented and the results of each have been compared for accuracy Key words: Skew Detection, Skew Correction, Document Image, Hough Transform 1. INTRODUCTION In many of OCR readers or applications while developing we should have an algorithm for skew detection and correction at preprocessing section. The OCR readers or applications will accept a binary image as input for that, it will follow. Visual representation of something in the presence of light is an image. Information technology has many uses in many applications. An image can be shown in two dimensional function f(x, y) ; where x and y are coordinates in spatial domain and the amplitude of f is known as the intensity of an image at that point which are denoted by (x, y) [1][2]. A grid of pixels present in rectangular shape can used to represent an image. Every image has particular height and width which can be depended on number of pixels. We all know that pixel is a smallest part of a display which is present in square. If we had smaller pixels in a display we can get more clear information. Every pixel has colors (RGB) and every pixel color can be in a 32-bit integer value. In this 32-bit integer, the redness of a pixel is given by first eight bits and greenness is given by another eight bits and blueness is given by another eight bits and remaining eight bits will help to adjust the transparency in a pixel. Vector graphics and raster graphics help to describe an image. A picture which can be created or copied and stored in an electronic form is called as image. Image which is stored in raster form and it is also called as bitmap. A file containing information of a specified image which is associates with various locations with hypertext links such as a digital image or still images like drawings, pictures, logos, or individual video frames.[3]we can also add text on animate which is captured by a digital camera using some software tools. By using mathematical techniques we can process an image which we call as image processing. In image processing we give an image or series of images as input, we get images or it s parameters as output. There are many applications where the image processing concept is widely used such as medicine, biology, astronomy, satellite imagery, personal image editing, weather reporting, etc.[4] We frequently see quotes in our social networking sites, some text advertisements on boarding which are present besides the roads, or on palm plates. We can found group of characters which can form words by placing adjacently, the words which are placed adjacently with a meaning can form a sentence. The orientation of these text lines, we can determine the skew angle of an image or a script or a report. If the skew angle is not there or if it is zero then we can find lines vertically and horizontally which are drawn on the paper near to the edges such that they seems to be parallel w.r.t to edges. The skew angle with zero and 45 degrees of a synthetic document image. This skew angle of zero degrees helps to get an image or a document which is containing text to get in a straight line w.r.t to the angle of the text. So, when a image or a document is scanned at that time we get text which is parallel with the borders.[5] Page layout analysis and optical character recognition will help to find the skew angle in an image or a document. These two are very sensitive to the orientation of input document or image. The skew detection and correction All Rights Reserved 2018 IJERCSE 163

should be done automatically due to the vast development in the image or document entry and interpretation systems.[6] In this paper the main scope is that text which is not aligned in straight lines, due to skew in the image or document is because rigid rotation. Some of the papers deal with the situation where there is a non-uniform skew of an image or a document.[7] 2. PROPOSED WORK: In this paper we written an algorithm for fast transformation of an skewed image, it doesn t even take one second and the code which we written MATLAB will show the figures of skewed image which has skewed angle and also shows the figure of non skew angle or zero skew angle of an image. The algorithm uses integer operations. Projection profile (pp) and its variations is most popularly used technique. But, in case of text only the do well. For skew detection, we can also use Hough transform to achieve high accuracy in detection of skew. In these two techniques the computational complexity is always proportional to the range and accuracy. So, in most of the PP or HT cases the detection range is typically between - 15 to 15.[8],[9]. Another approach is by using k N N (nearest neighbor) clustering (1 '11/ of the components which are connected. In this approach the computational cost is high but we can get accuracy with high relativity and it is independent to the detection range, C(N2), where N is number of components which are connected. In skew detection the computational cost mainly involves, text lines location and estimation of skew angle using PP, and HT or by using PP and k N N clustering. Based on morphological transforms approaches, we can extract the base lines, we can identify the components which are connected and we can also find the burst image which is created due to black run lengths which have been proposed to improve the locating text lines efficiency. The projection profile and k N N clustering will have more The key feature for characterizing this pre-processing algorithm of an image or a document analysis is the algorithmic complexity. Hough Transform: It gives very good results in any tough conditions. It helps in skew detection and correction for text oriented images. A skew detection and correction process has three parts 1. Block adjany graph (BAG) of simplified used for preprocessing stage 2. The voting process is done by using Hough transform to perform skewing the image by its orientation rotation. 3. For correction phase of skew, the quality of skewed images with respect to the rotation type. Figure 1. step by step simulation of skew detection algorithm. As we all know Hough transform will help to find different shapes and it will mainly focus on detection of lines. For skew detection we can use following methods[9] 1.Hough transform 2.Projection profile 3.Clustering of nearest neighbors 4.Correlation between lines. In this paper we are concentrating on Hough transform. Algorithm: Step 1: skewed image is given as input. Step 2: gray image is converted into binary image. Step 3: the optimal threshold value should be plotted. Step 4: using soble edge method, find the edges. Step 5: for first axis apply Principle Component Analysis (PCA). Step 6: skew should be detected. Step 7: the corrected skew is displayed. All Rights Reserved 2018 IJERCSE 164

3.RESULTS Firstly, the experiment is tested on standard which are skewed. images Figure 3. Input image before simulation, Output Figure 2. Input image before simulation, Output Figure 4. Input image before simulation, Output All Rights Reserved 2018 IJERCSE 165

4. PERFORMANCE ANALYSIS The Hough transform taken with Radon transform for comparisons. This shows that Hough transform works well with the sparse sampling in the parametric domain and malfunctions if sampled densely in the parametric domain. Fig. 3 shows that the Hough transform works well with sparse sampling. And if the sampling rate is kept constant still the Hough transform takes much more time for with increase in the size of the document image. Fig. 4. Chart showing the difference in the time taken by the Hough and Radon transforms with parameters V.CONCLUSION Fig. 3. An image with six lines, Hough transform, (c) Radon transform and (d) Hough transform with parameters Table 1 : Time taken by Hough and Radon transforms with parametric domain values The work in this paper, primarily focuses on skew detection in scanned document images. Hough transform, which can be used for skew angle detection and correction. The Hough transform method was found to be as good as the scan line algorithm in terms of skew detection and has a much lesser time complexity. REFERENCES 1. Daniel, X. L., George, R. T., & Harry, W. (1994)." Automated Page Orientation and Skew AngleDetection for Binary Document Images ". PatternRecognition, 27 (10), 1325-1344. 1.Alireza, A., & et.al. (2011). " A Painting Based Technique for Skew Estimation of Scanned Documents ". International Conference on Document Analysis and Recognition, (pp. 299-303). Beijing, China. Radon transform works in line parameter extraction even in presence of noise. Fig. 5 explains when the parametric domain values have been taken from Table 3 for Hough transform so as to have best possible Hough transform and the Table 5 has been used for getting the Radon transform of Fig. 5 (a noisy image). Radon transform are able to transform each of the lines into peaks positioned corresponding to the parameters of the lines. In this way Radon transform converts a difficult global detection problem to local peak detection problem in the parameter. 2. M. kundu M. Nasipuri S. Basu, C. Chaudhuri and D. K. Basu, Text line extraction from multi-skewed handwritten documents, Patten Recognition, vol. 40, (2007), pp. 1825 1839. 3. Cao, Y., Shuhua, W., & Li, H. (2003). " Skew Detection and Correction in Document Images Based on Straight-Line Fitting ". Pattern Recognition Letters, 24 (12), 1871-1879. 4. Changming, S., & Deyi, S. (1997). " Skew andslant Correction for Document Images UsingGradient Direction ". ICDAR 97 Proceedings ofthe 4th International Conference on DocumentAnalysis and Recognition, 1, pp. 142-146. All Rights Reserved 2018 IJERCSE 166

5. J., S., & M., P. (1995). " Skew Angle DetectionUsing Texture Direction Analysis ". In Proc. of the9th Scandinavian Conference on Image Analysis,(pp. 1099-1106). Sweden. 6. S. N. Srihari and Govindaraju, Analysis of textual images using the Hough transform, Machine Vision Appl., vol. 2, (1989), pp. 141-153. 7. B. B. Chaudhari and U. Pal, Skew angle detection of digitized Indian script document, PAMI, vol. 19, no. 2, (1997), pp. 182-186. 8. N. Nandini, S. Murthy K and G. H. Kumar, Estimation of Skew Angle in Binary Document Images Using Hough Transform, World Academy of Science, Engineering and Technology, vol. 42, (2008), pp. 44-49.International Journal of Advanced Science and Technology Vol. 48, November, 2012. 9. http://en.wikipedia.org/wiki/hough_transform All Rights Reserved 2018 IJERCSE 167