Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Similar documents
A Binarization Algorithm specialized on Document Images and Photos

Detection of an Object by using Principal Component Analysis

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Parallelism for Nested Loops with Non-uniform and Flow Dependences

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

IMAGE FUSION TECHNIQUES

Support Vector Machines

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Novel Fuzzy logic Based Edge Detection Technique

Gender Classification using Interlaced Derivative Patterns

An efficient method to build panoramic image mosaics

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Backpropagation: In Search of Performance Parameters

Classifier Selection Based on Data Complexity Measures *

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

A Computer Vision System for Automated Container Code Recognition

WATERSHED ALGORITHM BASED SEGMENTATION FOR HANDWRITTEN TEXT IDENTIFICATION

CORRELATION ICP ALGORITHM FOR POSE ESTIMATION BASED ON LOCAL AND GLOBAL FEATURES

Combination of Color and Local Patterns as a Feature Vector for CBIR

Edge Detection in Noisy Images Using the Support Vector Machines

SRBIR: Semantic Region Based Image Retrieval by Extracting the Dominant Region and Semantic Learning

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

12. Segmentation. Computer Engineering, i Sejong University. Dongil Han

Cluster Analysis of Electrical Behavior

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Feature Reduction and Selection

Feature-based image registration using the shape context

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

IMPROVING AND EXTENDING THE INFORMATION ON PRINCIPAL COMPONENT ANALYSIS FOR LOCAL NEIGHBORHOODS IN 3D POINT CLOUDS

The Research of Support Vector Machine in Agricultural Data Classification

Algorithm for Human Skin Detection Using Fuzzy Logic

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Collaboratively Regularized Nearest Points for Set Based Recognition

Histogram of Template for Pedestrian Detection

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures

Attributed Relational Graph Based Feature Extraction of Body Poses In Indian Classical Dance Bharathanatyam

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Object Tracking Based on PISC Image and Template Matching

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

Palmprint Feature Extraction Using 2-D Gabor Filters

Pictures at an Exhibition

Implementation of Robust HOG-SVM based Pedestrian Classification

Reducing Frame Rate for Object Tracking

An Image Fusion Approach Based on Segmentation Region

Feature Selection for Target Detection in SAR Images

Vectorization of Image Outlines Using Rational Spline and Genetic Algorithm

An Improved Image Segmentation Algorithm Based on the Otsu Method

PERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM

ESTIMATION OF PROPER PARAMETER VALUES FOR DOCUMENT BINARIZATION

Learning-based License Plate Detection on Edge Features

The Shortest Path of Touring Lines given in the Plane

Editorial Manager(tm) for International Journal of Pattern Recognition and

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

A Modified Median Filter for the Removal of Impulse Noise Based on the Support Vector Machines

The Research of Ellipse Parameter Fitting Algorithm of Ultrasonic Imaging Logging in the Casing Hole

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Wishing you all a Total Quality New Year!

Enhanced Face Detection Technique Based on Color Correction Approach and SMQT Features

Straight Line Detection Based on Particle Swarm Optimization

EDGE DETECTION USING MULTISPECTRAL THRESHOLDING

Recognizing Faces. Outline

An Approach to Real-Time Recognition of Chinese Handwritten Sentences

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

AUTOMATIC ROAD EXTRACTION FROM HIGH RESOLUTION SATELLITE IMAGES USING NEURAL NETWORKS, TEXTURE ANALYSIS, FUZZY CLUSTERING AND GENETIC ALGORITHMS

FACIAL FEATURE EXTRACTION TECHNIQUES FOR FACE RECOGNITION

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM

Local and Global Accessibility Evaluation with Tool Geometry

Machine Learning. Topic 6: Clustering

S1 Note. Basis functions.

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation

RECOGNITION AND AGE PREDICTION WITH DIGITAL IMAGES OF MISSING CHILDREN

Constructing Minimum Connected Dominating Set: Algorithmic approach

TN348: Openlab Module - Colocalization

Detection and Removal of Graphical Components in Pre-Printed Documents

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features

CS 534: Computer Vision Model Fitting

ICDAR2005 Page Segmentation Competition

User Authentication Based On Behavioral Mouse Dynamics Biometrics

Lecture 5: Multilayer Perceptrons

Data Mining: Model Evaluation

Foreground and Background Information in an HMM-based Method for Recognition of Isolated Characters and Numeral Strings

Applying EM Algorithm for Segmentation of Textured Images

Spatially Localized Circular and Overlapped Feature Extraction for Gray Scale Images using Gabor Jets

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

A New Approach For the Ranking of Fuzzy Sets With Different Heights

FACE RECOGNITION USING MAP DISCRIMINANT ON YCBCR COLOR SPACE

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

Line-based Camera Movement Estimation by Using Parallel Lines in Omnidirectional Video

Transcription:

Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research Scholar Research Scholar Bangalore, Karnataka PICT, Pune PICT, Pune Inda Inda Inda Abstract- angle estmaton and correcton of a document page s an mportant task for document analyss and optcal character recognton (OCR) applcatons. Many approaches of skew detecton can process pure textual document mages successfully. But t s a challengng problem to process documents lke handwrtten, large areas of non-textual contents. In ths drecton, a novel approach for textual, large area of non-textual, handwrtten document skew detecton s presented n ths paper. Estmaton of skew angle s based on fndng centrod of each connected component (CC) and plottng the major axs of the ellpse of each connected component. The proposed technque was tested on varety of mages ncludng handwrtten and machne prnted documents. The testng results showed a successful rate more than 98% of the data set. Keywords: detecton, correcton, connected component, Major axs 1.0 Introducton Most of the tmes, document analyss systems requre pror skew detecton and correcton before the mages are forwarded for character recognton, layout analyss, document mage mosacng and many other applcatons. An effcent and accurate method for determnng document mage skew s therefore an essental need. A number of methods have prevously been proposed for dentfyng document mage skew angles. However, most of the approaches are desgned manly for machne prnted documents. They are usually able to deal wth small skew angles wthn ±15 0 falng to manage cases of documents that may exceed ths lmt. Moreover, some of them ental hgh computatonal cost, especally n the case where the Hough Transform s used. Also, certan approaches are font, column, graphcs or border dependent. There s very few skew detecton methods proposed to handle handwrtten documents. Some of them are desgned for specfc applcatons. Ths s manly because of the complexty of handwrtten documents ncludng varatons n wrtng styles szes of characters and touchng or connecton of characters, words and text lnes. A survey n ths regard was reported by Hull [1]. Many approaches of skew detecton can process pure textual document mages successfully. But t s a challengng problem to process documents wth large areas of non-textual contents. Some approaches have been proposed to address ths problem. Le et al. [] select a square regon domnated by text from the document mage and calculate the skew angle by ths area. Avanndra and Subhass Chaudhur [3] dvde the document nto blocks and use the medan of the cross-correlatons of all blocks to determne the skew angle. Xaoyan Zhu [4] proposed an approach based on classfcaton of textual and nontextual areas for skew correcton. The method dvdes the document mage nto blocks and uses Fourer Transform and Support Vector Machnes to determne whether each block s a textual one or not. The approach uses a classfer for classfyng textual and non-textual contents to determne skew detecton. The

approach entals hgh computatonal cost and the best classfer to classfy textual or non-textual regons. In ths paper, we propose a novel method for skew estmaton of complex document mages based on fndng orentaton of each word. The man advantage of the proposed method s the ablty to detect skew angle of document mxed wth non-text elements such as graphcs, bar-codes and forms. The paper s organzed as follows. In secton, skew detecton approach s presented and secton 3 gves analyss of algorthm on dfferent documents. Secton 4 gves evaluaton of proposed algorthm.e. percentage error and secton 5 concludes the work..0 detecton approach The approach s based on dentfcaton of connected components and fndng optmum skew angles of connected components. Intally, document mage (Fg 1) s preprocessed by extractng the edges usng Canny Edge Detector [5]. The extracted edges are dlated usng crcular structurng element for dentfcaton of connected components, whch s depcted n Fg..1 Estmaton of Angle The skew angle s estmated based on fndng centrod of each CC n the document and plottng ellpse on t. Orentaton of a CC s defned as an angle between reference-axs and prncpal axs (Major axs) around whch CC rotates wth mnmum nerta. The condton to be mantaned s that the nd moment of the regon s same as that of the ellpse. The centrod CEN (X, Y) of CCs s deduced as CEN ( X, Y ) = ( OrgX N, OrgY N )...(1) Where (OrgX, OrgY ) are co-ordnates values of pxels wthn respectve CCs and N represents total number of pxels present n each CC. 1 st order moments are calculated as follows ( X ) = ( OrgX X )...() ( Y ) = ( OrgY Y )...(3) Usng above mathematcal equatons, nd moments are order µ = xx µ = yy ( ( X ) N ) ( ( Y ) N )... (4)...(5) Fg.1 Document Image The moments µ = xy µ and xx x and y respectvely. The moment covarance between x and y. ( ( X Y ) N )... (6) µ are thus the varances of yy µ s the xy µ µ + = xx yy 4 µ + xy µ xy...(7) Fg. Connected components Image = µ µ + xx yy µ xy + 4 µ xy...(8)

Orentato n Angle ( ) ( 180 θ = ) tan 1 ( )...( 9) Fg.3 (a) Geometry of plottng ellpse of CC Π Algorthm: detecton Input: ed mage Output: corrected mage Method: Step 1. Usng canny edge detector, extract edges from gven nput mage, dlate t usng dsk as structurng element. Step. Orentaton of each CC s calculated usng equaton 9 by fndng major axs of ellpse drawn around each CC and store these angles n an array. Step 3. From the hstogram determne the orentaton of the Image by consderng the average of the longest spke. To get optmum skew angle calculate the average adjacent spke (whchever s larger) along wth the longest spke. Step 4. Rotate nput mage by the angle obtaned from above step to remove the skew. End of Algorthm. 3.0 EXPERIMENTAL RESULTS Fg.3 (b) Determnng the orentaton angle As shown n fg.3 (a), ellpse s plotted on CC and fg3 (b) gves orentaton angle θ. The proposed methodology s tested on dfferent types of documents namely Marath handwrtten, documents contanng mnmum textual and more non-textual nformaton, Kannada documents and other languages also. Ths automated skew detecton and correcton s carred out usng MATLAB 6.5.0.180913a Release 13.Algorthm s processed on.0 GHZ Pentum-IV processor wth 56 RAM sze. Result 1. Experment s performed on Kannada document wth manual rotaton The skew angle s estmated by takng hstogram of all the orentaton angles n the document and fndng the longest spke n t. Ths gves maxmum number of words orented n that range and average of these orentatons gves the skewed angle. For gettng more optmum results select ether left or rght spke of the longest spke and also consder ts average to get the domnant skewed angle. Fg 4(a) Kannada skewed document

Fg. 4(b) Bar graph whch determnes skewed angle Fg. 5(b) Bar graph whch determnes ed angle Fg. 4(c) corrected mage Fg. 5(c) corrected mage Result. Experment on manually rotated Marath handwrtten document Result 3. Experment on manually rotated large area of Non-textual document Fg.5(a) Marath handwrtten skewed document Fg.6(a) Large areas of Non-Textual skewed document

Fg. 6(b) Bar graph whch determnes skewed angle Fg. 7(b) Bar graph whch determnes ed angle Fg. 6(c) corrected Image Fg. 7(c) corrected Image Result 4. Experment on manually rotated Englsh document 4.0 EVALUATION OF PROPOSED ALGORITHM The effcency of the proposed algorthmc model s evaluated by testng the algorthm for dfferent data sets. Table 1 gves error calculaton for obtaned skewed angle as compared to orgnal skewng. It s observed that the proposed algorthm gves 1.5.0 % of error. Fg. 7(a) Englsh skewed Document

TABLE I: ERROR COST PARAMETER 6.0 REFERENCES Input mage Marath Handwrtten Document wth Text and mage Englsh Text document Kannada Text Document Orgn al (n deg.) Experme ntal (n deg.) Error (%) Total tme (sec) -5-4.915 1.7 4.797 35 35.3188 0.911 1.469 0 0.11 0.605 6.4-7 -6.87 1.88 4.567 [1] J. J. Hull, Document mage skew detecton: survey and notated bblography, Document Analyss Systems II, World Scentfc, 1998. [] D. S. Le, G. R. Thoma and H. Wechsler, Automatc page orentaton and skew angle detecton or bnary document mages, Pattern Recognton, vol. 7, pp.135-1344, 1994. [3] Avanndra and Subhass Chaudhur, Robust Detecton of n Document Images, IEEE Trans on Image Processng, Vol.6, pp. 344-349, 1997. [4] Xaoyan Zhu and Xaoxn Yn, A New Textual/Non- textual Classfer for Document Correcton, Proceedngs of the 16 th Internatonal Conference on Pattern Recognton (ICPR 0), 00 5. CONCLUSION In ths paper, a novel technque for calculatng and correctng the skew of textual/non- textual documents s presented. It s based on fndng the orentaton of the major axs of the ellpse boundng each word. The proposed scheme works on any language ncludng non textual data also. Proposed method s mplemented on 10 document mages wth 60% of textual documents and t s observed that average tme requred s around 4-6 sec. The results provded by ths approach are also satsfactory. The documents were tlted by a prespecfed angle rangng between 0 and ± 45 degree. From our experments t s observed that about 98% of the accuracy s obtaned. Future work encompasses the use of fuzzy logc for takng the decson regardng the orentaton angle. [5] Canny J. F., A computatonal approach to Edge detecton, IEEE Trans on Pattern Analyss and Machne Intellgence, Vol.8 (6), pp. 679-698, 1986. [6] S. Srhar and V.Govndaraju, Analyss of textual mages usng the hough transform, Machne Vson and Applcatons, :141 153, 1989. [7] A. H. W. Chn and A. Jennngs, detecton n handwrtten scrpts, Proc. IEEE on Speech and Image Technologes for Computng and Telecommuncatons, pages 319 3, 1997.