Handwritten Devanagari Character Recognition Model Using Neural Network

Similar documents
Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

OCR For Handwritten Marathi Script

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Handwritten character and word recognition using their geometrical features through neural networks

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

Devanagari Handwriting Recognition and Editing Using Neural Network

Handwritten Numeral Recognition of Kannada Script

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

HCR Using K-Means Clustering Algorithm

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT

Devanagari Isolated Character Recognition by using Statistical features

Hand Written Character Recognition using VNP based Segmentation and Artificial Neural Network

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL RESEARCH JOURNAL OF MULTIDISCIPLINARY STUDIES

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION

A Review on Handwritten Character Recognition

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

Offline Signature verification and recognition using ART 1

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

A Novel Approach: Recognition of Devanagari Handwritten Numerals

RECOGNITION OF HANDWRITTEN DEVANAGARI WORDS USING NEURAL NETWORK

Performance Comparison of Devanagari Handwritten Numerals Recognition

An Efficient Character Segmentation Based on VNP Algorithm

Handwritten Marathi Character Recognition on an Android Device

Minimally Segmenting High Performance Bangla Optical Character Recognition Using Kohonen Network

A HYBRID FEATURE EXTRACTION AND RECOGNITION TECHNIQUE FOR OFFLINE DEVNAGRI HADWRITING

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Character Recognition Using Matlab s Neural Network Toolbox

Image Normalization and Preprocessing for Gujarati Character Recognition

Marathi Handwritten Numeral Recognition using Fourier Descriptors and Normalized Chain Code

Handwritten Hindi Numerals Recognition System

Online Handwritten Devnagari Word Recognition using HMM based Technique

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script.

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network

Segmentation of Characters of Devanagari Script Documents

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features

An Improved Zone Based Hybrid Feature Extraction Model for Handwritten Alphabets Recognition Using Euler Number

A two-stage approach for segmentation of handwritten Bangla word images

Opportunities and Challenges of Handwritten Sanskrit Character Recognition System

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks

Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

CHAPTER 1 INTRODUCTION

LECTURE 6 TEXT PROCESSING

Neural network based Numerical digits Recognization using NNT in Matlab

PCA-based Offline Handwritten Character Recognition System

A Technique for Offline Handwritten Character Recognition

Problems in Extraction of Date Field from Gurmukhi Documents

OFFLINE SIGNATURE VERIFICATION

One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition

Recognition of Unconstrained Malayalam Handwritten Numeral

Word-wise Script Identification from Video Frames

Handwritten Script Recognition at Block Level

Recognition of Marathi Handwritten Numerals by Using Support Vector Machine

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

HANDWRITTEN SIGNATURE VERIFICATION USING NEURAL NETWORK & ECLUDEAN APPROACH

Hand Written Telugu Character Recognition Using Bayesian Classifier

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

ISSN: [Mukund* et al., 6(4): April, 2017] Impact Factor: 4.116

OPTICAL CHARACTER RECOGNITION FOR VIETNAMESE SCANNED TEXT

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

BANGLA OPTICAL CHARACTER RECOGNITION. A Thesis. Submitted to the Department of Computer Science and Engineering. BRAC University. S. M.

Multilevel Classifiers in Recognition of Handwritten Kannada Numerals

Convolution Neural Networks for Chinese Handwriting Recognition

TEXT DETECTION AND RECOGNITION FROM IMAGES OF NATURAL SCENE

Offline Tamil Handwritten Character Recognition using Chain Code and Zone based Features

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

Handwritten Character Recognition: A Comprehensive Review on Geometrical Analysis

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

MURDOCH RESEARCH REPOSITORY.

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

Keywords Binary Linked Object, Binary silhouette, Fingertip Detection, Hand Gesture Recognition, k-nn algorithm.

I. INTRODUCTION. Image Acquisition. Denoising in Wavelet Domain. Enhancement. Binarization. Thinning. Feature Extraction. Matching

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM

Spotting Words in Latin, Devanagari and Arabic Scripts

Handwritten Character Recognition System using Chain code and Correlation Coefficient

Mixture of Printed and Handwritten Kannada Numeral Recognition Using Normalized Chain Code and Wavelet Transform

In this assignment, we investigated the use of neural networks for supervised classification

Multi-Layer Perceptron Network For Handwritting English Character Recoginition

A New Algorithm for Detecting Text Line in Handwritten Documents

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique

Khmer Character Recognition using Artificial Neural Network

Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System

A Hierarchical Pre-processing Model for Offline Handwritten Document Images

Time Stamp Detection and Recognition in Video Frames

Neural Network Classifier for Isolated Character Recognition

Cursive Handwriting Recognition

Pattern Recognition Letters

Mobile Application with Optical Character Recognition Using Neural Network

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

Transcription:

Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com Abstract - In this paper, a recognition model is described for recognizing handwritten Devanagari characters. The scanned image database of handwritten Devanagari character form several different writers was used to train and test to this classifier model. This model first preprocess (normalization, binarization, crop) then extracts the feature set. Based on the extracted feature database it classifies the characters. This model achieves the accuracy rate of recognition which range from 75% to 80%. Index Terms Devanagari, Neural Network, Handwritten, classifier I. INTRODUCTION Handwriting recognition is one of the challenging problems of Artificial Intelligence. The main challenges are large variety of handwriting style, large variety of pen type, variety of paper color where they are written. For solving this problem, several types of decision methods (including statistical method, structural method and stochastic processing) have been used along with different types of features[1][3][4][5]. Handwriting recognition for Devanagari script and its continental variation can be found in [2][11]. This paper proposed Neural Network based model to classify (recognize) handwritten Devanagari character. This system consists of three main stages preprocessing, feature extraction and classification [fig. 1]. Input images are preprocessed before passing into the feature extraction stage. Feature vector set is extracted in the feature extraction stage using feature extraction techniques and then feed into the classifier engine. Classifier is implemented using neural network based classification algorithm. II. DEVANAGARI SCRIPT Fig. 1 : Three stage of character recognition Devanagari script is used to write many Indian languages such as Hindi, Marathi, Rajasthani, Sanskrit and Nepali [1][2]. The Devanagari script follows left to right fashion for writing. In Devanagari script, there are 12 vowels and 36 consonant [fig. 2]. Each Devanagari consonant has an inherent vowel (A). Vowels can be written as independent letters, or by using a variety of diacritical marks which are written above, below, before or after the consonant they belong to. Fig. 2 : Vowels of Devanagari Script IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 901

Fig. 3 : Consonant of Devanagari Script Fig. 4 : Numerals of Devanagari Script III. IMAGE PREPROCESSING Image preprocessing is the preliminary stage of this handwritten Devanagari character recognition model. In this stage, the following preprocessing techniques are used. RGB to gray scale conversion RGB image is converted into grayscale image by taking weighted summation threshold of R, G, B component of RGB image. For RGB image f(x,y), corresponding grayscale image is given by g(x,y)=0.2989*f R +0.5870*f G +0.1140*f B (1) Where f R, f G, f B are Red, Green and Blue components of the RGB image f(x; y) respectively. Noise Removal Noisy image pixels are removed with the help of filtering. Here, Non-linear median filtering technique is used for noise removal. For the digital image f(x; y), median filtered image is obtained as, g(x; y) = median{f(i, j) (i, j) ϵ w} (2) Where w is the neighbourhood centered around location (x,y) in the image. Image Segmentation Segmentation is the central problem of distinguishing object from the background. For the grayscale image f(x,y), the segmented image g(x, y) is obtained by the image binarization process as given below. g(x; y) = {1 if f(x; y) >= T {0 if f(x; y) < T (3) Where, T is the threshold value and it can be obtained using the Otsu s threshold selection technique for grayscale image segmentation [10]. Image Inversion Handwritten documents are normally written in white paper with black/blue pen. For the recognition system, we assume black pixels as a background and white pixels as the foreground. So, captured images are inverted before passing into the feature extraction stage. The inverted image of binary image f(x; y) can be obtained by the negative transformation as, g(x, y) = 1 - f(x, y) (4) IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 902

Universe of Discourse Determining universe of discourse of the character image is finding smallest rectangle that encloses the character object. It removes extra pixels outside the convex rectangle of the character image. Size Normalization Size normalization is the technique of converting all the variable size input images to fixed size images. Size normalization is done so that we do not require padding of pixels at the time of feature extraction. All the input images are normalized to the predefined size of 50x70 pixels. Image Skeletonization Skeletonization is a process of reducing object regions in a binary image to a skeletal remainder that largely preserves the extent and connectivity of the original object. It creates single pixel wide connected object boundary that preserves Euler number of the original object. IV. FEATURE EXTRACTION Fig. 5 : Preprocessing step Feature extraction stage is important stage of this model. The most important aspect of handwriting recognition scheme is the selection of good feature set, which is reasonably invariant with respect to shape variations caused by various writing styles. There are several methods for feature extraction [7][8][9]. Here, Zone and count metric based feature extraction algorithm is used. The image (character/numeral) is divided into thirty five (10px*10 px) equal zones. Then count all white pixels (value 1) for each zone and divide by 100, which give average value of white pixels in that zone. We repeated this procedure for all the zones present in the image. There could be some zones that are empty, and then the value of that particular zone image value in the feature vector is zero. Finally 35 such features are used for feature extraction [fig. 6]. Algorithm: Zone and count metric based feature extraction system. Input: Handwritten thinned numeral image Output: Extracted Features for Classification and Recognition Method Begins Step 1: Divide the input image in to n equal zones. Step 2: Compute the count of white pixel (value 1) present in the zone. Step 3: Take average of computed value. Step 4: Repeat this procedure sequentially for the each zone of image. Step 7: Finally, n such features will be obtained for classification and recognition. Method Ends V. CLASSIFICATION Fig. 6 : Plot of Extracted Feature set Artificial Neural networks have emerged as an important tool for classification. The recent vast research activities in neural classification have established that neural networks are a promising alternative to various conventional classification methods [6]. Here, Multilayer feed forward neural network based classifier is used for classification. A multilayer feed forward neural network consists of a layer of input units, one or more layers of hidden units, and one layer of output units. The output from each layer is the weighted linear summation of all input vectors along with the bias term, passed through some activation function. The network weight adjustment is done by back-propagating the error of the network. The learning algorithms used for learning weights in the network are Gradient Descent with Momentum & Adaptive Learning Rate (GDMA). In this system, Input layer have 35 input node which takes extracted feature set as input. Hidden layer have 50 internal nodes and output layer have 58 output nodes (12 vowel class, 36 consonant classes, 10 numeric classes). IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 903

Fig. 7: Multilayer feed forward neural network VI. EXPERIMENTAL RESULT In order to evaluate the performance of the proposed model, we consider handwritten Devanagari characters. We have collected handwriting of 30 individual writers and total of 1740 samples are considered for devanagari characters. Here 1160 samples are used for training purpose and remaining 580 samples are used for testing. This system is simulated in MATLAB using nprtool. Table I shows recognition results for each class of Devanagari handwritten numeral dataset. Table II shows recognition results for each class of Devanagari handwritten vowel dataset. Table III shows recognition results for each class of Devanagari handwritten consonant dataset. TABLE I RECOGNITION RESULTS FOR EACH CLASS OF NUMERALS. Numerals Accuracy rate(%) 94.4 83.7 82.4 83.1 82.5 82.9 88.1 97.2 81.4 97.8 TABLE III RECOGNITION RESULTS FOR EACH CLASS OF NUMERALS. Vowels Accuracy rate(%) 79.8 93.5 84.7 88.5 76.8 85.3 76.4 82.8 72.4 64.7 IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 904

66.2 86.3 TABLE IIIII RECOGNITION RESULTS FOR EACH CLASS OF NUMERALS. Consonant Accuracy rate(%) Consonant Accuracy rate(%) 94.3 74.9 80.2 56.2 80.8 85.9 58.2 74.3 59.8 73.5 84.3 62.8 65.2 65.7 78.4 53.1 64.6 89.3 57.2 68.2 95.8 74.6 82.5 81.3 74.9 72.8 87.1 57.3 86.3 86.5 93.7 63.0 60.2 72.7 65.3 63.9 VI. CONCLUSION Off-line handwriting recognition with neural network is presented and evaluated on Devanagari handwritten character datasets. Recognition system evaluated on numeral dataset gives best accuracy of 87.35% over all other datasets. In other datasets, due to high variations on writing styles, shapes and cursive nature of characters, the recognition accuracy is decreased. For vowel dataset, recognition accuracy of 79.78% is obtained and for consonant dataset, recognition accuracy of 73.47% is obtained. The total accuracy rate of this system is 77.16%. REFERENCES [1] Magdi A. Mohamed, and Paul Gader, Generalized Hidden Markov Models-Part II: Application to Handwritten Word Recognition, IEEE Trans. On Fuzzy System, Vol. 8. No.1, February 2000. [2] U. Pal and B. B. Chaudhari, Indian Script Character Recogntion: a Survey, Pattern Recogntion, 37:1887-1899, 2004. [3] R. Bajaj, L. Dey and S. Chaudhary, Devanagari Numeral Recognition by Combining Decision of Multiple Connectionist Classifiers, Sadhana, 27:59-72, 2002. [4] K. C. Santosh and C. Nattee, Template-based nepali handwritten alphanumeric character recognition, Thammasat International Journal of Science and Technology (TIJSAT), Thammasat University, Thailand, vol. 12, no. 1, 2007. [5] S. K. Parui and B. Shaw. Offline handwritten devanagari word recognition: An hmm based approach. Proc. PReMI- 2007(Springer), LNCS-4815:528 535, December 2007. [6] Guoqiang Peter Zhang, Neural Networks for Classification: A Survey. IEEE Transactions on systems, man, and cybernetics - part c: applications and reviews, vol. 30, no. 4, november 2000 [7] Anil.K.Jain and Torfinn Taxt, ''Feature extraction methods for character recognition-a Survey'', Pattern Recognition, vol. 29, no. 4, pp. 641-662., 1996, [8] K. Jain, R. P. W. Duin, and J. Mao, Statistical pattern recognition: A review, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 4 37, Jan. 2000. [9] D. Zhang and G. Lu, Review of shape representation and description techniques, Pattern Recognition, vol. 37, no. 1, pp. 1 19, 2004. IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 905

[10] N. Otsu. A threshold selection method from gray-level histograms. IEEE Trans. on Systems, Man, and cybernetics, 9(1):62 66, March 1979. [11] P. S. Deshpande, L. Malik, and S. Arora, Fine classification & recognition of hand written devnagari characters with regular expressions & minimum edit distance method, JCP, vol. 3, no. 5, pp. 11 17, 2008. IJEDR1401162 International Journal of Engineering Development and Research (www.ijedr.org) 906