Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Similar documents
Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

PCA-based Offline Handwritten Character Recognition System

A Technique for Offline Handwritten Character Recognition

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

Segmentation of Characters of Devanagari Script Documents

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

Date Field Extraction from Gurmukhi Handwritten Documents

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

A Technique for Classification of Printed & Handwritten text

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script.

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

Handwritten Character Recognition: A Comprehensive Review on Geometrical Analysis

Problems in Extraction of Date Field from Gurmukhi Documents

Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features

Offline Handwritten Gurmukhi Word Recognition Using Deep Neural Networks

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

OCR For Handwritten Marathi Script

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Handwritten Character Recognition A Review

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts

Offline Handwritten Gurmukhi Character Recognition: A Review

Preprocessing of Gurmukhi Strokes in Online Handwriting Recognition

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

Handwritten Numeral Recognition of Kannada Script

LECTURE 6 TEXT PROCESSING

SEGMENTATION OF BROKEN CHARACTERS OF HANDWRITTEN GURMUKHI SCRIPT

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION

Human Face Detection and Tracking using Skin Color Combined with Optical Flow Based Segmentation

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System

A Review on Handwritten Character Recognition

HCR Using K-Means Clustering Algorithm

An Integrated Skew Detection And Correction Using Fast Fourier Transform And DCT

Chapter Review of HCR

A Hierarchical Pre-processing Model for Offline Handwritten Document Images

Handwritten Script Recognition at Block Level

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Image Normalization and Preprocessing for Gujarati Character Recognition

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

Opportunities and Challenges of Handwritten Sanskrit Character Recognition System

A Novel Handwritten Gurmukhi Character Recognition System Based On Deep Neural Networks

Word-wise Script Identification from Video Frames

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

Short Survey on Static Hand Gesture Recognition

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

Hand Written Character Recognition using VNP based Segmentation and Artificial Neural Network

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character

Optical Character Recognition

Machine Learning: Handwritten Character Recognition

Neural network based Numerical digits Recognization using NNT in Matlab

Neural Network Classifier for Isolated Character Recognition

CREATING 3D WRL OBJECT BY USING 2D DATA

SIMULATIVE ANALYSIS OF EDGE DETECTION OPERATORS AS APPLIED FOR ROAD IMAGES

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Hindi Numerals Recognition System

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Keywords Handwritten alphabet recognition, local binary pattern (LBP), feature Descriptor, nearest neighbor classifier.

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

Handwritten character and word recognition using their geometrical features through neural networks

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation.

Character Recognition Using Matlab s Neural Network Toolbox

Classification of objects from Video Data (Group 30)

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network

[Kaur*, 5(2): February, 2016] ISSN: (I2OR), Publication Impact Factor: 3.785

Character Recognition of High Security Number Plates Using Morphological Operator

2. LITERATURE REVIEW

Mixture of Printed and Handwritten Kannada Numeral Recognition Using Normalized Chain Code and Wavelet Transform

International Journal of Advance Research in Engineering, Science & Technology

Recognition of online captured, handwritten Tamil words on Android

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

A Detailed Review of Feature Extraction in Image Processing Systems

Hybrid Image Compression Using DWT, DCT and Huffman Coding. Techniques

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Skin Infection Recognition using Curvelet

Offline Tamil Handwritten Character Recognition using Zone based Hybrid Feature Extraction Technique

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

Review on Optical Character Recognition and Signature Recognition and Verification Technique

Devanagari Isolated Character Recognition by using Statistical features

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Handwritten English Alphabet Recognition Using Bigram Cost Chengshu (Eric) Li Fall 2015, CS229, Stanford University

Image Denoising Using wavelet Transformation and Principal Component Analysis Using Local Pixel Grouping

PCA and KPCA algorithms for Face Recognition A Survey

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

Offline Signature verification and recognition using ART 1

Use of Splines in Handwritten Character Recognition

Recognition of Handwritten Numerals of Manipuri Script

Gabor Features Based Script Identification of Lines within a Bilingual/Trilingual Document

Performance Comparison of Devanagari Handwritten Numerals Recognition

A Simple Text-line segmentation Method for Handwritten Documents

HMM-based Indic Handwritten Word Recognition using Zone Segmentation

Transcription:

International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 6 (2017), pp. 1387-1396 Research India Publications http://www.ripublication.com Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient Sukhpreet Kaur M. Tech. Research Scholar (Computer Science & Engineering), Yadavindra College of Engineering, Punjabi University Guru Kashi Campus, Talwandi Sabo, Bathinda, Punjab, India. Simpel Rani Associate Professor (Computer Science & Engineering), Yadavindra College of Engineering, Punjabi University Guru Kashi Campus, Talwandi Sabo, Bathinda, Punjab, India. Abstract In this paper, we have presented a system for isolated curved Gurmukhi character recognition using projection of gradient. We have used projection of gradient features to extract the features of the character. The method relies upon the application of a projection based feature extraction using Radon transform, on both the original image and a set of produced images corresponding to different gradient orientations of the original image. Principal Component Analysis (PCA) is used for the recognition. The method is evaluated using the database of 525 samples images written by persons which are prepared for experiment. We have achieved recognition accuracy of 95.23% using projection of gradient features with PCA. Keywords: Handwritten character recognition, Feature extraction, Projection of gradient, PCA, Radon Transform

1388 Sukhpreet Kaur and Simpel Rani 1. INTRODUCTION Handwritten Character Recognition (HCR) is the process of changing scanned images of printed and handwritten text into computer processing text such as ASCII code. It is generally used for improve the speed of operations, reduced errors or noise in the documents and decrease storage space needed for papers documents. It is a simple method for fast retrieval, easily searched, saved more compressed data. It is an active field of research in pattern recognition and image processing field. HCR has many applications in 3D objects recognition, receipt processing, postal code recognize and banking sectors. The three main aspects accuracy, speed and flexibility make the best system. In the past of character recognition area, a numerous methods are available for isolated character recognition for particular languages. But research for isolated curved Gurmukhi character recognition is not so wide. Recognition accuracy of text depends upon the feature set. Various feature extraction techniques like zoning, diagonal, directional, geometrical based features are described by different research scholar. Sharma and Jhajj [1] have described isolated characters recognition in Gurmukhi script through zoning features extraction. They have used 3075 images database to train the system by SVM and K-NN classifier and 2050 images for testing. They have achieved 73.02% accuracy by SVM with Polynomial kernel. Kumar et al. [2] have proposed Principal Component Analysis (PCA) with power curve & parabola curve fitting for extracting features for character recognition. For the purpose of classification, they have used k-nn, Linear-SVM, Polynomial-SVM and RBF-SVM based approaches. They have collected 7,000 samples of isolated offline handwritten Gurumukhi characters from 200 different writers and achieved recognition accuracy of 94.8% without PCA and recognition accuracy of 97.7% with PCA. In an another work, Kumar et al. [3] have described a grading system for writers based on off-line handwritten Gurmukhi character recognition using zoning, directional, diagonal, intersection and open end points and Zernike moments features. They have used k-nn, HMM and Bayesian classifiers for classification and compare the handwriting of one writer with other writers by attach a score with handwriting of a writer. Singh et al. [4] have recognized Gurmukhi character using two Gabor filter features extraction methods namely Gabor- GABM and Gabor-GABN. SVM classifier is used for recognition of character. With the help of Gabor-GABN feature extraction technique, they have achieved high recognition accuracy of 94.29% with dimensionality 200 as compared to Gabor- GABM. Retsinas et al. [5] have described projection based feature extraction technique for recognition of isolated characters. Radon transform algorithm is implemented on primary image as well as on a set of produced gradient orientation images of the initial image. SVM classifier is used for recognize accuracy of method and achieved result 94.35%. Siddharth et al. [6] have presented a technique to recognized isolated handwritten Gurumukhi character. They

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient 1389 have extracted two types of characteristics. They have used 16 ZD and 8 BDD features for every 16 zones. For classification SVM classifier with RBF kernel is used. They have obtained 99.93% accuracy for known writers and the average accuracy has achieved 95.07% for known and unknown writers. Aggarwal et al. [7] have presented single isolated handwritten Devanagari character recognition using gradient feature extraction technique. They have collected 200 samples of 36 Devanagari character from 20 different writers writing 10 samples of each 36 characters. All sample of Devanagari character are normalized to 90 90 pixel sizes. For classification, Support Vector Machine with RBF kernel is used and achieved better accuracy of 94%. In this paper, we have proposed a system for isolated curved Gurmukhi character recognition using projection of gradient feature extraction technique. For recognition, we have used Principal Component Analysis. The procedure of HCR involves various steps including image acquisition, preprocessing, segmentation, feature extraction, and classification. In proposed system, we have taken pre-segmented character and hence the segmentation has been done. 2. PROPOSED METHODOLOGY The methodology used has following steps:- Step 1: Create digitized image of handwritten character. Step 2: Change RGB image into gray scale image. Step 3: Trained dataset by extraction of features of character using Gradient feature extraction technique and four gray scale gradient images are generated. Step 4: Apply Radon transform to compute angle and projection features extraction of original image and Gradient images. Step 5: Recognize characters using PCA method. The main interpretation of projection based feature derivation is break-down the gray scale images into various projections under specified angles through the radon transform. 2.1 Data Acquisition There is no standard dataset available for Gurmukhi handwritten character. So database is prepared from handwritten data written by different people on A4 sheet. In this work, we have used 35 basic Gurmukhi consonants. In data acquisition, the input image is scanned Gurmukhi character. The image is acquired through a HP

1390 Sukhpreet Kaur and Simpel Rani scanner in 300 dpi. Then these handwritten characters are converted into electronic forms i.e. bitmap images like.jpg,.png,.gif,.tiff,.bmp,.pcx. We have converted all handwritten character into.jpeg images. 2.2 Pre-Processing Pre- processing is next step, in which RGB images have been converted into gray scale images. In Figure 1(a) and 1(b), the digitized image n has been converted into gray scale image. It has been assumend that images are noise free. The image has been converted into pixel matrix. We have standardized the size of image into 180 200 pixels. (a) Figure 1: Gurmukhi Character n : (a) Digitized image (b) Gray scale image (b) 2.3 Feature Extraction Feature extraction technique is the most important part of recognition system. Feature extraction phase is used to remove redundancy from data. There are various feature extraction techniques like statistical and structural features. We have used Projections of Oriented Gradient (POG) feature extraction technique. POG (Projections of Oriented gradient) feature extraction technique comprises of two stages:- a) Computation of the oriented gradients of the image. b) Extraction of projection based features for each image gradient and for original image. 2.3.1 Gradient Features This step is necessary for capturing information of image, such as directional change in the intensity in small neighborhood of each pixel in an image through presented the gradient orientations as gray scale images.

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient 1391 The computation of two directional gradients Gx and Gy of initial image through horizontal (x-axis) and vertically (y-axis).to calculate gradient orientation at every pixel; a transformation is performed uisng Equations 1 and 2. Only G is performed because values of orientation lie in interval [0, 180 ] instead of [0, 360 ]. The gradient magnitude is then calculated as:- G(x, y) = (Gₓ 2 (x, y) + Gy 2 (x, y)..(1) Gradient direction is calculated as:- G(x, y) = tan 1 ( Gy(x,y) ).... (2) Gₓ(x,y) The gradient filter possible orientations are (0, 45, 90, and 135 ) that four gray scale images are constructed. 2.3.2 Projection Based Feature The Projection based features extraction is to divide the gray scale images into various projections under the selected angles, through Radon transform technique. Total projections are nɵ = 4 and every projection sampled at 180 / nɵ, such that angles of projections are θр = p(180 /nɵ), p [0 nɵ-1] i.e. 0, 45, 90 and 135. The size of image is denotes as (Iₓ, Iy), foreground pixel of image represent by (xᵢ, yᵢ) (i = 1 N), center of image represent by (xc, yc) and θ represent the select angle of projection. In Figure 2(a) and 2(b), we change the digitized image n into gray scale image.the projection pi of every pixel into the chosen angle vector is calculated through Equation 3. Figure 2: (a) Find center of the image (b) Visualization of a projection of angle 45 pi = (xᵢ - xc) cos(θ) + (yᵢ - yc)sin(θ), i [1, N]...(3)

1392 Sukhpreet Kaur and Simpel Rani Finally, the goal is to reduce the redundancy and excessive variability of image. The obtained matrix of radon transform is large to be useful descriptor and variations of same character may convert into distortions of projections. The important information related to smoothed regions of high pixel concentration after calculating the Discrete Fourier transform coefficients cj, j [0, k-1] of the projection vector. 2.4 Final Descriptor The final feature vector is the concatenation of projection features applied in the original image as well as in every four gradient orientation images (G0, G45, G90 and G135) as shown in Figure 3. Total length of descriptor is: 5(images) nɵ (projections) 2nc (Fourier coefficients) = 80. Figure 3: Final descriptor is the concatenation of features of original image and its oriented gradients. 2.5 Classification and Recognition Classification phase is the decision making part of a recognition system and it uses the features extracted in the previous phase for identify character. It improves accuracy & reduces the misclassification in system. In this work, we have used Principal component analysis (PCA) method. PCA is a mathematical process that uses orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components Analysis. The number of principal components may be less than or equal to the number of original variables. PCA is a dimensionality reduction or data compression method. The goal is dimension reduction and there is no guarantee that the dimensions are interpretable.

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient 1393 3. EXPERIMENT RESULTS 3.1 Database For our proposed technique, we have created handwritten Gurmukhi character database. The database comprises of 35 basic Gurmukhi characters written by persons. The resulting 35 classes consist of 525 isolated characters, in which 350 characters are used for training and the remaining 175 characters are used for testing. 3.2 Recognition Results Figure 4 shows the overall recognition results of the Gurmukhi character. Character recognition with PCA method and Projection of Gradient Orientation feature extraction technique is used for feature extraction. We take samples of each character and described how many characters recognize correct, which characters are misclassified with it and show accuracy of each character. The overall accuracy is achieved by divided the recognized characters with total characters. 0% Recognition Rate Accuracy 100% 50% 0% 1 2 3 4 5 6 7 8 9 10 11 12 13 14 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 Figure 4: Recognition results of Gurmukhi characters with PCA 3.3 Character confusion matrix & percentage of recognition In character recognition of Gurmukhi script similar patterns are creating confusion. Table 1 shows confusion matrix of Gurmukhi characters recognized. In this table n gives 93.33% recognition rate. The most confused characters that occur are observed in table:-n with j; u with T, G, y; M with B, N and R with t. These confusions are decreasing the performance of recognition.

1394 Sukhpreet Kaur and Simpel Rani Table 1:- Confusion matrix of handwritten isolated curved Gurmukhi characters of best result with PCA S. Angle Character Test Image No. of Character No. of Samples Recognized Misclassified with Percentage Accuracy 1 T 138 NIL 100% 2 n 137 14 1(j) 93.33% 3 J 137 13 1(;), 1(d) 86.66% 4 ; 137 NIL 100% 5 j 137 14 1 (B) 93.33% 6 e 41 NIL 100% 7 8 r 9 x y 41 NIL 100% 41 14 1(B) 93.33% 137 13 1(e), 1(;) 86.66% 10 C 137 NIL 100% 1(T), 1(G), 11 u 140 12 80% 1(y) 12 S 42 NIL 100% 13 i 137 NIL 100% 14 M 41 R 138 16 N 41 17 m 137 18 v 41 19 Y 138 20 D 42 21 S 41 13 1(B), 1(N) 86.66% 14 1(t) 93.33% NIL 100% NIL 100% 13 1(X),1(s) 86.66% 14 1(D) 93.33% NIL 100% 13 2(G) 86.66%

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient 1395 22 E 40 23 D 44 24 X 40 25 B 42 26 g 41 27 c 137 28 p 42 29 G 41 30 w 40 31 : 137 32 o 42 33 b 40 34 t 40 35 V 42 NIL 100% 14 1(N) 93.33% NIL 100% 14 1(X) 93.33% NIL 100% NIL 100% NIL 100% 14 1(s) 93.33% NIL 100% NIL 100% 13 2(r) 86.66% NIL 100% 13 1(T), 1(R) 86.66% NIL 100% Total 525 500 25 95.23% 4. CONCLUSION In this isolated curved handwritten Gurmukhi character recognition system, an average recognition rate of 95.23% is obtained. The proposed technique has tested on created database in which consist of 35 basic Gurmukhi character writing from different persons. The database is containing 525 isolated characters, among which 350 characters are used for training and the remaining 175 characters for testing. In this research we have used PCA algorithm and Radon transform. Radon transform is used to find out projection angle of curved character. In future, we can extend this work using other languages characters recognition like Devanagari characters, English characters etc. We have used segmentation free isolated characters, so extend up the work we can also consider words and slant correction in characters will be performed in future.

1396 Sukhpreet Kaur and Simpel Rani REFERENCES [1] Sharma, D. and Jhajj, P., 2010. Recognition of Isolated Handwritten Characters in Gurmukhi Script, International Journal of computer Applications, Vol. 4(8), pp. 9-17. [2] Kumar, M., Sharma, R. K. and Jindal, M. K., 2012. Offline Handwritten Gurmukhi Character Recognition: Study of Different Feature-Classifier Combinations, Workshop on Document Analysis and Recognition, pp. 94-99. [3] Kumar, M., Jindal, M. K. and Sharma, R. K., 2011. Classification of Characters and Grading Writers in Offline Handwritten Gurmukhi Script, International Conference on Image Information Processing, IEEE, pp. 1-4. [4] Singh, S., Aggarwal, A. and Dhir, R., 2012. Use of Gabor Filters for Recognition of Handwritten Gurmukhi Character, International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 2(5), pp. 234-240. [5] Retsinas, G., Gatos, B., Stamatopoulos, N. and Louloudis, G., 20. Isolated Character Recognition using Projections of Oriented Gradients, International Conference on Document Analysis and Recognition (ICDAR), pp. 336-340. [6] Siddharth, K. S., Dhir, R. and Rani, R., 2011. Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features, International Journal of Computer Science and Information Technologies, Vol. 2(3), pp. 1036-1041. [7] Aggarwal, A., Rani, R. and Dhir, R., 2012. Handwritten Devnagari Character Recognition Using Gradient Features, International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 2(5), pp. 85-90. [8] Aggarwal, A., Singh, K. and Singh, K., 2014. Use of Gradient Technique for extracting features from Handwritten Gurmukhi Characters and Numerals, International Conference on Information and Communication Technologies (ICICT), pp. 1716-1723. [9] Herekar, R. and Dhotre, S. R., 2014. Handwritten Character Recognition Based on Zoning Using Euler Number for English Alphabets and Numerals, IOSR Journal of Computer Engineering (IOSR-JCE), Vol. 16(4), pp. 75-88. [10] Singh, R. and Kaur, M., 2010. OCR for Telugu Script Using Back- Propagation Based Classifier, International Journal of Information Technology and Knowledge Management, Vol. 2(2), pp.639-643.