A two-stage approach for segmentation of handwritten Bangla word images

Size: px
Start display at page:

Download "A two-stage approach for segmentation of handwritten Bangla word images"

Transcription

1 A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department, Jadavpur University, Kolkata , India. # Corresponding Author; nasipuri@vsnl.com Abstract. Segmentation of handwritten Bangla word images is a challenging problem for the researchers. Discontinuity or absence of Matra, an important feature of Bangla script, may lead to inherent segmentation within the word images. Around 55% of these inherently segmented connected sub-images do not require further segmentation. In the present work, we have designed a novel two-stage approach for segmentation of isolated Bangla word images. In the first stage, a feature based approach is designed to classify the connected word segments into either of the two classes, namely, Segment further and Do not Segment using a multi-layer perceptron based classifier. In the second stage, fuzzy segmentation features are designed to identify the Matra region and the potential segmentation points on the Matra of the connected word segments that belong to Segment further class. Using the current technique, the overall successful segmentation accuracy achieved after two stages is 95.87%. Keywords: Two-stage segmentation, handwritten Bangla words, multi-layer perceptron, fuzzy segmentation features. 1. Introduction Segmentation of isolated word images, extracted from optically scanned document images of handwritten text, is one of the major problems of optical character recognition (OCR). Segmentation and identification of word components make a decisive contribution towards the overall performance of an OCR system. The better is the segmentation process, the lesser is the ambiguity encountered in recognition of candidate characters or word pieces. Vertical pixel density histograms of word images are often used for segmenting the word images into constituent characters. This can be done by identifying the valleys of the histograms as the terminal points of the said characters. The technique has applications in OCR of English text. But it cannot be effective for segmenting words of Bangla script. Appearance of consecutive characters overlapped in column positions makes the problem of Bangla word segmentation more complex compared to segmentation of English words. The problem becomes compounded with handwritten Bangla words because of variation in sizes and shapes of handwritten characters. Bangla is an important East Asian script widely used in India and Bangladesh. Popularity wise, Bangla ranks fifth in the world, both as a script and a language. Some of the significant contributions made so far for OCR of handwritten texts include English texts [1-6]. A contour based segmentation technique [5] and a Bayesian knowledge based SVM [6] were designed for segmentation of handwritten English word images. The work relating to OCR of Bangla script is found to have limited references in the literature. Two such instances, [7] and [8], the former focusing on recognition of isolated handwritten characters based on stroke features and the latter on a multistage approach based on different topological features, have not addressed the problem of Bangla text segmentation. The problem of Bangla text segmentation has been addressed in [9-13]. The technique of word segmentation, as described in [9], has shown a high success rate by properly segmenting nearly 98.6% of characters of printed text. The technique is based on detection of an important feature of Bangla text, called the Matra. A Matra is a horizontal line, which passes touching the upper part of many characters of Bangla script as shown in Fig. 1(a). Depending on the characters, it covers at most the entire character width. The consecutive characters, in a Bangla word, which have Matras, are joined through a common Matra formed by joining the Matras of individual characters as shown in Fig. 1(b). This line may have some discontinuity over the positions where the characters in the word appear without Matras. The said technique [9] mainly works by successfully identifying and removing the Matra. The technique will not be effective for handwritten text, where the Matras are not horizontal as strictly as those of printed words. Another work related to segmentation of touching characters in printed Bangla and Devnagri text is presented in [10]. A. Bishnu et al. had developed a recursive contour following technique [11] for segmentation of handwritten Bangla word images. In another work, U. Pal et al. had used water reservoir principle [12] for the same purpose. Prior to the present work, a fuzzy technique [13] was developed for segmentation of handwritten Bangla word images by these authors. However, due to unavailability of

2 standard Bangla word datasets, the performances of these works can not be compared. In the light of the above discussion, the problem of segmentation of Bangla words still remains as an active area of research. shapes of characters or their subparts. Some or all of these sub-images may require further segmentation for extracting individual characters or modified shapes. In a typical survey it has been found that only around 18% handwritten Bangla words are written as a single connected segment. Around 74% of the words generate 2-4 connected segments and around 8% word images generate more than 4 connected segments. (a) An illustration of Matras of individual characters word in a Fig. 2. Internal segmentation within a handwritten Bangla word image. (b) An illustration of the common Matra of a word (c) An illustration of the three zones and region boundaries of a word Fig. 1(a-c). Illustration of some important features of Bangla script Most of the aforesaid techniques for segmentation of handwritten Bangla word images depend heavily on the presence of the Matra feature. But in reality, due to the variety of writing styles of individuals, the Matra in a word image often appear as wavy or non-horizontal, discontinuous and even completely missing in continuous writing. These wavy, discontinuous or missing Matras make the problem of word segmentation more difficult. The existing segmentation algorithms, as reported in [9-13], often identify potential segmentation points on the common Matra of the word image. However, the performances of such techniques may degrade significantly in case of discontinuous or missing Matra in the word images. Fig. 2 shows a sample word image with discontinuous Matras, where the technique described in [13] fails to identify potential segmentation points on the Matra. In such cases, the word image appears internally segmented into number of sub-images or segments, containing a collection of connected (4-connected or 8- connected) black pixels as shown in Fig. 2. Such subimages may contain one or more characters, modified The major motivation behind the present work is to identify only those connected segments from handwritten Bangla word images that need to be segmented further and subsequently segment them using a fuzzy feature based segmentation algorithm. The segments which need no further segmentation may be left alone, as an input to the recognition module for subsequent processing. 2. The present work Choice of suitable features for pattern classes is a domain specific design issue. In the first stage of the present work we have designed a set of features that can classify the noise-free connected segments into one of the two classes, namely Segment Further and Do not Segment, using a MLP classifier. In the second stage we have designed a non-linear fuzzy feature-set, to identify the Matra of the word segments and subsequently segment them into potential word components. The following sub-sections briefly discuss the methodology involved in the present work Preprocessing of word images Preprocessing is an important task in document image processing. In the present work, we have used several computing metrics based on spatial attributes of pixels of the binary image. Therefore, noise pixels appearing at the background and along the contour of the word image may affect the segmentation accuracy. To remove a noisy pixel and to smooth the contours of data, we have used a sequence of erosion and dilation, two basic mathematical morphological operators [14], on the input handwritten word images Connected component analysis In the present work we have used a simple technique for identifying the connected segments within the word image. To label all connected pixels in a word image identically, the connected component labeling algorithm [14] scans the

3 image pixel by pixel from left to right and from top to bottom. During scanning, it considers all 8 neighbours of each pixel. For each of the connected segments, all its member pixels appearing in the sub-image are replaced by a single distinct symbol. This is done to complete labellings of the connected pixels in the image and to generate uniquely coded connected segments. Each of such connected segments is subsequently extracted for segment analysis Design of the segment classification features For identifying segments, that require further segmentation, we have designed a 7-element feature vector based on the morphological attributes of different connected segments. Table.1 shows brief descriptions of these features, selected for the present work. All the feature values are normalized within the range (0,1) using the following formula. Normalized feature value = (actual feature value / maximum possible feature value) The height/width ratio of a connected segment gives a rough idea about the possible structure of the sub-image. This feature can have the maximum value as the height of the word image. Lower is this ratio, more is the chance of the sub-image to be segmented further. Table 1. Different feature vectors used for classification of the word segments Sl. Description of the feature vector No. 1 The height/width ratio of each connected segment 2 Width of each connected segment 3 Proportion of black pixels in each connected segment 4 Offset in number of rows of each connected segment from the starting row of the original word image 5 Maximum horizontalness of black pixels in each connected segment 6 Count of Matra pixels, as discussed in [13], for each connected segment 7 Count of segmentation pixels, as discussed in [13], for each connected segment Width of a connected segment, alone, is also an important attribute for segment classification. Often, due to the presence of ascendants and descendants within a word image, the height/width ratio may result into misleading information. In such cases this feature can be of use for true classification of segments. More is the width of a connected segment, higher is its chance of belongingness in the Segment Further class. The proportion of black pixels within the sub-image of given size is also used as one of the features to estimate the length of pen strokes within it. Higher is the proportion within the connected segment, more is its chance to be further segmented. As shown in Fig. 1(c), any handwritten Bangla word image may be hypothetically segmented three horizontal zones, namely the upper zone containing the ascendants, the lower zone containing the descendants and the middle zone, containing most of the characters, modified shapes of characters and their sub-parts. Connected segments generated close to the middle zone have higher chances of belongingness to the Segment Further class. On the contrary, connected segments generated close to the upper and lower zones have more chances to be in the Do not Segment class. To approximate this observation, we have computed the starting row offset of each connected segment and the original word image. This feature gives an estimate of belongingness of the segment in any of the three aforementioned zones. Segmentation of any sub-image depends significantly on the Matra feature. As mentioned earlier, this Matra, within a word image, appears to be horizontal in nature. Any connected segment with significant presence of Matra may further be segmented into component characters or modified shapes or their sub-parts. To estimate the presence of Matra pixels within a sub-image, a horizontal longest-run count of each pixel, as discussed in [13], is computed within the sub-image. The maximum value of this longest-run count is used as a feature after suitable normalization. Any connected segment may further be segmented using existing Matra based segmentation algorithm, as discussed in [13]. In doing so, each of the segments may generate a set of approximate Matra pixels and some potential segmentation points on the Matra. Any connected segment that needs further segmentation identifies more number of Matra pixels and potential segmentation points on the Matra in comparison to the segments that require no further segmentation. In the present work, we have used these features as the counts of Matra pixels and potential segmentation points with suitable normalization. Fig. 3 shows a sample word image and the two possible categories of connected word segments in the same. Fig. 3. A sample word image and its two classes of connected segments 2.4. Design of the MLP classifier in first stage In the present work, an MLP classifier is used for classification of connected word segments, generated from the word image, into either of the two output classes to

4 decide whether the given segmented sub-image needs to be further segmented or not, using the above mentioned feature set. The MLP classifier designed for this work is trained with the Back Propagation (BP) algorithm. It minimizes the sum of the squared errors for the training samples by conducting a gradient descent search in the weight space. The number of neurons in a hidden layer in the same is also adjusted during its training Fuzzy headline estimation in second stage The common headline or Matra of a connected word segment may be identified as the continuous horizontal stripe of black pixels appearing at the top of most of the characters and some of modified shapes in the word segment. In a cursive handwriting the appearance of a Matra is often disjoint and wavy. This makes the identification of potential Matra pixels a challenging task. In the present work, we have developed two fuzzy measures to identify the membership value of each pixel for its potential of belongingness to Matra Horizontalness feature This horizontalness property of the Matra may be extracted from the row wise sum of continuous run of black pixels, as shown in Fig. 4. This value is normalized with respect to the maximum longest run value of any pixel within the word image. with respect to the maximum longest run value of any pixel within the word image. Fig 5: Word images and the corresponding vertical longest run components that exceeds the mean verticalness of the respective words Design of the fuzzy membership function In the present work, we have designed a bell shaped membership functions to map the horizontalness feature values of each row to determine its belongingness in the Matra region. The generalized bell function depends on three parameters a, b, and c as given by: Where, the parameter b is usually positive. The parameter c locates the center of the curve, i.e., R 2 and x is the row index for any black pixel P x y in the word image. For computation of the fuzzy feature values, we have designed a fuzzy function, viz, f h (x h,f(x;a,b,c)) for horizontalness feature respectively. Such that, f h (x h, f(x;a,b,c)) = x h * f(x;a,b,c) Fig. 4. Word images and the corresponding horizontal longest run components that exceeds the mean horizontalness of the respective words Verticalness feature Many characters and modified shapes in Bangla script have vertical stripe of black pixels, as a part of their shapes. This vertical stripe often appears at the right side, middle or left side of the characters. These stripes touch the Matra of a word image and often extend till the bottom of the respective characters or modified shapes. In the present work, we have developed a technique to identify prominent vertical stripes in word image and identify their average top and bottom rows within the principal segments. This verticalness property of the Matra may be extracted from the column wise count of continuous run of black pixels, as shown in Fig. 5. This value is normalized Where, x h is normalized horizontalness component of each pixel P x y under consideration and 0 x h 1.. Fig. 6 shows a diagramatic representation of the bell shaped fuzzy membership function, designed for the present work. A pixel P x y is identified as a headline pixel, if its value exceeds the mean of all such f h (P x y ) values within the region R 1 -R 3. Fig. 6. Fuzzy Bell-shape memberships function for Matra determination.

5 2.6. Design of fuzzy segmentation features Once the black pixels constituting the Matra of a word segment are identified the next task becomes to identify certain column positions on the Matra from where the word segment can be vertically segmented into constituent characters. Such column positions are called terminal points of segments. One of the prominent features for identifying terminal points of segments is the number of black pixels along each vertical column position on the Matra. The less is the number of black pixels along a vertical column position on the Matra, the higher is its degree of belongingness (µ 1 ) to the set of terminal segment-points. On this basis a bell-shaped fuzzy membership function (µ 1 ), as discussed in previous section, is designed. Another feature (F 2 ), is considered here within the region (R 2 - R 4 ). Here again the more is the distance, the less is the degree of belongingness (µ 2 ) of the associated point to the set of segment terminal points. A third feature (F 3 ), similar to (F 2 ), is considered here by extending the region (R 2 - R 3 ), previously considered for computing F 2, to (R 2 - R 4 ). Detailed description of these three features is already given in [1]. The necessary membership functions (µ 1, µ 2, µ 3 ) for these features are shown in Fig. 7. Fig. 7. Fuzzy membership functions µ 1, µ 2 and µ 3 To determine finally whether a black pixel on the Matra can be considered as a segment terminal point, the average of all the three feature values exceed certain predetermined threshold, are finally considered as segment terminal points. The threshold is fixed up by taking the average of all the three feature values of all the black pixel positions over the Matra of a word segment. 3. Results and discussion In the present work, we have collected isolated handwritten Bangla word images from different persons of varying age groups. Word images are assumed to be slant and slope corrected and written in black ink with uniform pressure. Each such image is digitized using a flatbed scanner with 300 dpi resolution. 250 such word images were randomly selected for the current experimentation. As discussed earlier, around 82% of such word images generate more than one connected segment after the connected component analysis, and around 52% of such connected segments need no further segmentation. In the first stage, to classify the connected segments into one of the two classes, namely, Segment Further and Do not Segment, an MLP based classifier is designed with Back Propagation learning algorithm. For preparation of the training and the test sets, a collection of 600 such connected segments of Bangla word images is formed by taking 300 segments each from the two aforesaid classes. For cross validation of results, three different folds of test sets are formed by dividing the original dataset of 600 samples into three equal mutually disjoint parts. For each fold of the test set, the corresponding training set is formed with the rest of the dataset. Thus three pairs of the test and the training sets are formed for three fold cross validation of results. In each of these pairs, the training and the test sets are of sizes 400 samples and 200 samples respectively. For the present work, a single layer MLP, i.e., an MLP with one hidden layer is chosen. This is mainly to keep the computational requirement of the same low without affecting its function approximation capability. According to Universal Approximation theorem [15], a single hidden layer is sufficient to compute a uniform approximation to a given training set. To design an MLP for classification of handwritten alphabetic characters, several runs of BP algorithm with learning rate (η) = 0.8 and momentum term (α) = 0.7 are executed for different numbers of neurons in its hidden layer. The maximum recognition performances of the MLP, as achieved through three-fold cross validation of results, are 94.5%, 93% and 94.5%. Finally, the average success rate of these three sets of experiments is computed as 94%. Fig. 8(a-c) shows the images of some test samples successfully classified through this experimentation. Fig. 9 shows some of the images where our technique fails to classify the segments in the desired classes. (a) (b) (c) Fig. 8. Some of the correctly classified test samples (a) Successfully classified into Do not Segment class (b-c) Successfully classified into Segment Further class Fig. 9. Some sample test images misclassified into Segment Further class In the second stage, for designing of the fuzzy function, the values of two positive constants a and b were chosen as 1. As discussed earlier, the row index of the lower

6 boundary of the upper zone (R 2 ) is assigned to the third constant c in the said fuzzy function. Fig. 10(a-b) shows some of the sample word images, where segment classification algorithm of first stage is not applied, resulting in incorrect segmentation results in different parts of the word image. However, using the current two stage algorithm, connected word segments belonging to Do not Segment class are successfully extracted and only the segments that need further segmentation are segmented properly, as shown in Fig. 11(a-b) respectively. (a) (b) Fig. 10. Sample word images with segmentation errors shown in encircled regions. (b) Fig. 11. Sample word images with successfully classified word segments and subsequent segmentation result. After two stages, the overall segmentation accuracy, as observed manually with respect to potential segmentation points on each word image of the 250 word dataset, is evaluated as 95.87%. When only the second stage segmentation algorithm is employed on the same dataset, the successful segmentation accuracy becomes 91.86%. Thus we could significantly improve over the conventional single stage approach to identify potential segmentation points in handwritten Bangla word images. This technique may significantly reduce the cases of under-segmentation. However, there are further scopes of improvements. An iterative implementation of the present technique, along with the existing segmentation algorithm, may further improve the overall segmentation performance of handwritten Bangla word images in future. (a) Acknowledgements Authors are thankful to the Center for Microprocessor Application for Training Education and Research, Project on Storage Retrieval and Understanding of Video for Multimedia of Computer Science & Engineering Department, Jadavpur University, for providing infrastructure facilities during progress of the work. References 1. R.G. Casey et.al. A Survey of Methods and Strategies in Character Segmentation, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 18,pp , R.M. Bozinovic et.al. Off-line Cursive Script Word Recognition, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 11,pp 68-83, J.T. Faveta, Offline General Handwritten Word Recognition Using an Approximate BEAM Matching Algorithm, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23,pp , A.W. Senior et.al. An Off-line Cursive Handwriting Recognition System, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20,pp , B. Verma, A contour code feature based segmentation for handwriting recognition, in Proc. 7 th ICDAR, pp M. Maragoudisakis, et.al., Improving handwritten character segmentation by incorporating Bayesian knowledge with support vector machines, in Proc. ICASSP 2002, vol. 4, pp. IV A. F. R. Rahman, R. Rahman, M.C. Fairhurst, Recognition of Handwritten Bengali Characters: a Novel Multistage Approach, Pattern Recognition, vol. 35, p.p , T. K. Bhowmik, U. Bhattacharya and S. K. Parui, Recognition of Bangla Handwritten Characters Using an MLP Classifier Based on Stroke Features, in Proc. ICONIP, Kolkata, India, pp , B. B. Chaudhuri and U. Pal, A Complete Printed Bangla OCR System, Pattern Recognition, vol. 31, No. 5. pp , A. Bishnu, B. B. Chaudhuri, Segmentation of Bangla Handwritten Text into Characters by Recursive Contour Following, in Proc. 5 th ICDAR, pp , U. Pal, S. Datta, Segmentation of Bangla Unconstrained Handwritten text, in Proc. 7 th ICDAR, pp , U. Garain, B. B. Chaudhuri, Segmentation of touching characters in printed Devnagri and Bangla scripts using fuzzy multifactorial analysis, IEEE Trans. On Systems, Man and Cybernetics Part C: Applications and Reviews, vol. 22, pp , S.Basu, R.Sarkar, N. Das, M.Kundu, M.Nasipuri, D.K.Basu, A Fuzzy Technique for Segmentation of Handwritten Bangla Word Images, iccta, pp , International Conference on Computing: Theory and Applications (ICCTA'07), R.C. Gonzalez and R.E. Woods, Digital Image Processing, Prentice-Hall India, First Edition, (1992). 15. S. Haykin, Neural Networks: A Comprehensive Foundation, Second Edition, Pearson Education Asia, pp (2001).

Recognition of handwritten Bangla basic characters and digits using convex hull based feature set

Recognition of handwritten Bangla basic characters and digits using convex hull based feature set Recognition of handwritten Bangla basic characters and digits using convex hull based feature set Nibaran Das *, Sandip Pramanik *, Subhadip Basu *, Punam Kumar Saha +, Ram Sarkar *, Mahantapas Kundu *,

More information

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script HTTPS://SITES.GOOGLE.COM/SITE/JOURNALOFCOMPUTING/ Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas

More information

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network International Journal of Computer Science & Communication Vol. 1, No. 1, January-June 2010, pp. 91-95 Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network Raghuraj

More information

OCR For Handwritten Marathi Script

OCR For Handwritten Marathi Script International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,

More information

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,

More information

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in

More information

Recognition of Unconstrained Malayalam Handwritten Numeral

Recognition of Unconstrained Malayalam Handwritten Numeral Recognition of Unconstrained Malayalam Handwritten Numeral U. Pal, S. Kundu, Y. Ali, H. Islam and N. Tripathy C VPR Unit, Indian Statistical Institute, Kolkata-108, India Email: umapada@isical.ac.in Abstract

More information

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character Journal of Information Hiding and Multimedia Signal Processing 2011 ISSN 2073-4212 Ubiquitous International Volume 2, Number 1, January 2011 Complementary Features Combined in a MLP-based System to Recognize

More information

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION Binod Kumar Prasad * * Bengal College of Engineering and Technology, Durgapur, W.B., India. Rajdeep Kundu 2 2 Bengal College

More information

CHARACTER Segmentation and Ground truth preparation for handwritten Bangla word images

CHARACTER Segmentation and Ground truth preparation for handwritten Bangla word images CHARACTER Segmentation and Ground truth preparation for handwritten Bangla word images Submitted by SANCHITA MAITY Exam. Roll No. : MCA-3212027 of 2011-12 University Regn. No. : 108560 of 2009-10 Under

More information

Segmentation of Bangla Handwritten Text

Segmentation of Bangla Handwritten Text Thesis Report Segmentation of Bangla Handwritten Text Submitted By: Sabbir Sadik ID:09301027 Md. Numan Sarwar ID: 09201027 CSE Department BRAC University Supervisor: Professor Dr. Mumit Khan Date: 13 th

More information

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text Abstract In this paper we present a system towards the of Hindi Handwritten Devnagari Text. Segmentation of script is essential for handwritten script recognition. This system deals with of (matras) and

More information

Segmentation of Characters of Devanagari Script Documents

Segmentation of Characters of Devanagari Script Documents WWJMRD 2017; 3(11): 253-257 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manpreet Kaur Research

More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

Word-wise Hand-written Script Separation for Indian Postal automation

Word-wise Hand-written Script Separation for Indian Postal automation Word-wise Hand-written Script Separation for Indian Postal automation K. Roy U. Pal Dept. of Comp. Sc. & Engg. West Bengal University of Technology, Sector 1, Saltlake City, Kolkata-64, India Abstract

More information

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features

More information

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 27-37 TJPRC Pvt. Ltd., HANDWRITTEN GURMUKHI

More information

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Design of a novel convex hull based feature set for recognition of isolated handwritten Roman numerals

Design of a novel convex hull based feature set for recognition of isolated handwritten Roman numerals Design of a novel convex hull based feature set for recognition of isolated handwritten Roman numerals N. Das *, S. Pramanik *, S. Basu *, P. K. Saha +, R. Sarkar *, M. Kundu * *Computer Sc. & Engg. Dept.,

More information

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT International Journal of Information Technology, Modeling and Computing (IJITMC) Vol. 2, No. 1, February 2014 FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT Shuchi Kapoor 1 and Vivek

More information

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language I.J. Information Technology and Computer Science, 2013, 05, 38-43 Published Online April 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2013.05.05 A New Technique for Segmentation of Handwritten

More information

Online Bangla Handwriting Recognition System

Online Bangla Handwriting Recognition System 1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern

More information

II. WORKING OF PROJECT

II. WORKING OF PROJECT Handwritten character Recognition and detection using histogram technique Tanmay Bahadure, Pranay Wekhande, Manish Gaur, Shubham Raikwar, Yogendra Gupta ABSTRACT : Cursive handwriting recognition is a

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

Character Recognition Using Matlab s Neural Network Toolbox

Character Recognition Using Matlab s Neural Network Toolbox Character Recognition Using Matlab s Neural Network Toolbox Kauleshwar Prasad, Devvrat C. Nigam, Ashmika Lakhotiya and Dheeren Umre B.I.T Durg, India Kauleshwarprasad2gmail.com, devnigam24@gmail.com,ashmika22@gmail.com,

More information

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features

Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Off-line Recognition of Hand-written Bengali Numerals using Morphological Features Pulak Purkait and Bhabatosh Chanda ECSU, Indian Statistical Institute, Kolkata, India {pulak r, chanda}@isical.ac.in Abstract

More information

আ ঐ হ ফ. An Enhanced Harmony Search Method for Bangla Handwritten Character Recognition Using Region Sampling

আ ঐ হ ফ. An Enhanced Harmony Search Method for Bangla Handwritten Character Recognition Using Region Sampling An Enhanced Harmony Search Method for Bangla Handwritten Character Recognition Using Region Sampling Ritesh Sarkhel 1, Amit K Saha 1, Nibaran Das 1 1 Computer Science and Engineering Department, Jadavpur

More information

A System towards Indian Postal Automation

A System towards Indian Postal Automation A System towards Indian Postal Automation K. Roy C.V.P.R Unit, I.S.I, Kolkata-108; India, Kaushik_mrg@hotmail.com S. Vajda LORIA Research Center, B.P. 239 54506, Nancy, France Szilard.Vajda@loria.fr U.

More information

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts WWJMRD2017; 3(10): 162-166 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manas Kaur Research

More information

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

Segmentation Based Optical Character Recognition for Handwritten Marathi characters Segmentation Based Optical Character Recognition for Handwritten Marathi characters Madhav Vaidya 1, Yashwant Joshi 2,Milind Bhalerao 3 Department of Information Technology 1 Department of Electronics

More information

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts 25 A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts Rohit Sachdeva, Asstt. Prof., Computer Science Department, Multani Mal

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Devanagari Character Recognition Model Using Neural Network Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com

More information

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier N. Sharma, U. Pal*, F. Kimura**, and S. Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute

More information

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS Sushilkumar N. Holambe Dr. Ulhas B. Shinde Shrikant D. Mali Persuing PhD at Principal

More information

Handwriting segmentation of unconstrained Oriya text

Handwriting segmentation of unconstrained Oriya text Sādhanā Vol. 31, Part 6, December 2006, pp. 755 769. Printed in India Handwriting segmentation of unconstrained Oriya text N TRIPATHY and U PAL Computer Vision and Pattern Recognition Unit, Indian Statistical

More information

LECTURE 6 TEXT PROCESSING

LECTURE 6 TEXT PROCESSING SCIENTIFIC DATA COMPUTING 1 MTAT.08.042 LECTURE 6 TEXT PROCESSING Prepared by: Amnir Hadachi Institute of Computer Science, University of Tartu amnir.hadachi@ut.ee OUTLINE Aims Character Typology OCR systems

More information

Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition

Multiple Classifier Combination for Off-line Handwritten Devnagari Character Recognition Multiple Combination for Off-line Handwritten Devnagari Character Recognition Sandhya Arora Department of CSE & T Meghnad Saha nstitute of Technology Kolkata-700107 sandhyabhagat@yahoo.com Debotosh Bhattacharjee,

More information

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques 1 Lohitha B.J, 2 Y.C Kiran 1 M.Tech. Student Dept. of ISE, Dayananda Sagar College

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

A Technique for Offline Handwritten Character Recognition

A Technique for Offline Handwritten Character Recognition A Technique for Offline Handwritten Character Recognition 1 Shilpy Bansal, 2 Mamta Garg, 3 Munish Kumar 1 Lecturer, Department of Computer Science Engineering, BMSCET, Muktsar, Punjab 2 Assistant Professor,

More information

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network

Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Handwritten Hindi Character Recognition System Using Edge detection & Neural Network Tanuja K *, Usha Kumari V and Sushma T M Acharya Institute of Technology, Bangalore, India Abstract Handwritten recognition

More information

Handwritten Hindi Numerals Recognition System

Handwritten Hindi Numerals Recognition System CS365 Project Report Handwritten Hindi Numerals Recognition System Submitted by: Akarshan Sarkar Kritika Singh Project Mentor: Prof. Amitabha Mukerjee 1 Abstract In this project, we consider the problem

More information

Layout Segmentation of Scanned Newspaper Documents

Layout Segmentation of Scanned Newspaper Documents , pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms

More information

Line and Word Segmentation Approach for Printed Documents

Line and Word Segmentation Approach for Printed Documents Line and Word Segmentation Approach for Printed Documents Nallapareddy Priyanka Computer Vision and Pattern Recognition Unit Indian Statistical Institute, 203 B.T. Road, Kolkata-700108, India Srikanta

More information

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT ABSTRACT Rupak Bhattacharyya et al. (Eds) : ACER 2013, pp. 11 24, 2013. CS & IT-CSCP 2013 Fakruddin Ali Ahmed Department of Computer

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK HANDWRITTEN DEVANAGARI CHARACTERS RECOGNITION THROUGH SEGMENTATION AND ARTIFICIAL

More information

Handwritten Numeral Recognition of Kannada Script

Handwritten Numeral Recognition of Kannada Script Handwritten Numeral Recognition of Kannada Script S.V. Rajashekararadhya Department of Electrical and Electronics Engineering CEG, Anna University, Chennai, India svr_aradhya@yahoo.co.in P. Vanaja Ranjan

More information

HCR Using K-Means Clustering Algorithm

HCR Using K-Means Clustering Algorithm HCR Using K-Means Clustering Algorithm Meha Mathur 1, Anil Saroliya 2 Amity School of Engineering & Technology Amity University Rajasthan, India Abstract: Hindi is a national language of India, there are

More information

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script.

LITERATURE REVIEW. For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. LITERATURE REVIEW For Indian languages most of research work is performed firstly on Devnagari script and secondly on Bangla script. The study of recognition for handwritten Devanagari compound character

More information

A FEATURE BASED CHAIN CODE METHOD FOR IDENTIFYING PRINTED BENGALI CHARACTERS

A FEATURE BASED CHAIN CODE METHOD FOR IDENTIFYING PRINTED BENGALI CHARACTERS A FEATURE BASED CHAIN CODE METHOD FOR IDENTIFYING PRINTED BENGALI CHARACTERS Ankita Sikdar 1, Payal Roy 1, Somdeep Mukherjee 1, Moumita Das 1 and Sreeparna Banerjee 2 1 Department of Computer Science and

More information

Available online at ScienceDirect. Procedia Computer Science 45 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 45 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 45 (2015 ) 205 214 International Conference on Advanced Computing Technologies and Applications (ICACTA- 2015) Automatic

More information

A Technique for Classification of Printed & Handwritten text

A Technique for Classification of Printed & Handwritten text 123 A Technique for Classification of Printed & Handwritten text M.Tech Research Scholar, Computer Engineering Department, Yadavindra College of Engineering, Punjabi University, Guru Kashi Campus, Talwandi

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 5, ISSUE OPTICAL HANDWRITTEN DEVNAGARI CHARACTER RECOGNITION USING ARTIFICIAL NEURAL NETWORK APPROACH JYOTI A.PATIL Ashokrao Mane Group of Institution, Vathar Tarf Vadgaon, India. DR. SANJAY R. PATIL Ashokrao Mane

More information

PCA-based Offline Handwritten Character Recognition System

PCA-based Offline Handwritten Character Recognition System Smart Computing Review, vol. 3, no. 5, October 2013 346 Smart Computing Review PCA-based Offline Handwritten Character Recognition System Munish Kumar 1, M. K. Jindal 2, and R. K. Sharma 3 1 Computer Science

More information

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS 8.1 Introduction The recognition systems developed so far were for simple characters comprising of consonants and vowels. But there is one

More information

Automatic License Plate Recognition ALPRfor Bangladeshi Vehicles

Automatic License Plate Recognition ALPRfor Bangladeshi Vehicles Global Journal of Computer Science and Technology Volume 11 Issue 21 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc. (USA) Online ISSN: 0975-4172

More information

An ICA based Approach for Complex Color Scene Text Binarization

An ICA based Approach for Complex Color Scene Text Binarization An ICA based Approach for Complex Color Scene Text Binarization Siddharth Kherada IIIT-Hyderabad, India siddharth.kherada@research.iiit.ac.in Anoop M. Namboodiri IIIT-Hyderabad, India anoop@iiit.ac.in

More information

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition Applied Computational Intelligence and Soft Computing Volume 2012, Article ID 871834, 5 pages doi:10.1155/2012/871834 Research Article Development of Comprehensive Devnagari Numeral and Character base

More information

A New Algorithm for Detecting Text Line in Handwritten Documents

A New Algorithm for Detecting Text Line in Handwritten Documents A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer

More information

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features Hamidreza Alvari, Seyed Mehdi Hazrati Fard, and Bahar Salehi Abstract Automatic recognition

More information

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad

More information

Handwritten Devanagari Character Recognition

Handwritten Devanagari Character Recognition Handwritten Devanagari Character Recognition Akhil Deshmukh, Rahul Meshram, Sachin Kendre, Kunal Shah Department of Computer Engineering Sinhgad Institute of Technology (SIT) Lonavala University of Pune,

More information

Identifying Layout Classes for Mathematical Symbols Using Layout Context

Identifying Layout Classes for Mathematical Symbols Using Layout Context Rochester Institute of Technology RIT Scholar Works Articles 2009 Identifying Layout Classes for Mathematical Symbols Using Layout Context Ling Ouyang Rochester Institute of Technology Richard Zanibbi

More information

Neural Network Classifier for Isolated Character Recognition

Neural Network Classifier for Isolated Character Recognition Neural Network Classifier for Isolated Character Recognition 1 Ruby Mehta, 2 Ravneet Kaur 1 M.Tech (CSE), Guru Nanak Dev University, Amritsar (Punjab), India 2 M.Tech Scholar, Computer Science & Engineering

More information

Robust line segmentation for handwritten documents

Robust line segmentation for handwritten documents Robust line segmentation for handwritten documents Kamal Kuzhinjedathu, Harish Srinivasan and Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) University at Buffalo, State

More information

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Shamim Ahmed 1, Mohammod Abul Kashem 2 1 M.S. Student, Department of Computer Science and Engineering, Dhaka University of Engineering

More information

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text Available online at www.sciencedirect.com Procedia Computer Science 17 (2013 ) 434 440 Information Technology and Quantitative Management (ITQM2013) A New Approach to Detect and Extract Characters from

More information

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

Character Segmentation for Telugu Image Document using Multiple Histogram Projections Global Journal of Computer Science and Technology Graphics & Vision Volume 13 Issue 5 Version 1.0 Year 2013 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.

More information

An Efficient Character Segmentation Based on VNP Algorithm

An Efficient Character Segmentation Based on VNP Algorithm Research Journal of Applied Sciences, Engineering and Technology 4(24): 5438-5442, 2012 ISSN: 2040-7467 Maxwell Scientific organization, 2012 Submitted: March 18, 2012 Accepted: April 14, 2012 Published:

More information

Online Handwritten Devnagari Word Recognition using HMM based Technique

Online Handwritten Devnagari Word Recognition using HMM based Technique Online Handwritten Devnagari Word using HMM based Technique Prachi Patil Master of Engineering Dept. of Electronics & Telecommunication Dr. D. Y. Patil SOE, Pune, India Saniya Ansari Professor Dept. of

More information

Optical Character Recognition For Bangla Documents Using HMM

Optical Character Recognition For Bangla Documents Using HMM Optical Character Recognition For Bangla Documents Using HMM Md. Sheemam Monjel and Mumit Khan Dept. of CSE, BRAC University, Dhaka, Bangladesh. sheemam@bracuniversity.net, mumit@bracuniversity.net Abstract

More information

A Recognition System for Devnagri and English Handwritten Numerals

A Recognition System for Devnagri and English Handwritten Numerals A Recognition System for Devnagri and English Handwritten Numerals G S Lehal 1 and Nivedan Bhatt 2 1 Department of Computer Science & Engineering, Thapar Institute of Engineering & Technology, Patiala,

More information

Skew Detection and Correction of Document Image using Hough Transform Method

Skew Detection and Correction of Document Image using Hough Transform Method Skew Detection and Correction of Document Image using Hough Transform Method [1] Neerugatti Varipally Vishwanath, [2] Dr.T. Pearson, [3] K.Chaitanya, [4] MG JaswanthSagar, [5] M.Rupesh [1] Asst.Professor,

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015 Offline Handwritten Signature Verification using Neural Network Pallavi V. Hatkar Department of Electronics Engineering, TKIET Warana, India Prof.B.T.Salokhe Department of Electronics Engineering, TKIET

More information

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network

Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network Review of Automatic Handwritten Kannada Character Recognition Technique Using Neural Network 1 Mukesh Kumar, 2 Dr.Jeeetendra Sheethlani 1 Department of Computer Science SSSUTMS, Sehore Abstract Data processing

More information

Chapter 2. Literature Survey and Objectives. 2.1 Literature Survey

Chapter 2. Literature Survey and Objectives. 2.1 Literature Survey Chapter 2 Literature Survey and Objectives 2.1 Literature Survey In India, there are 18 official (Indian constitution accepted) languages. Two or more of these languages may be written in one script. Twelve

More information

Minimally Segmenting High Performance Bangla Optical Character Recognition Using Kohonen Network

Minimally Segmenting High Performance Bangla Optical Character Recognition Using Kohonen Network Minimally Segmenting High Performance Bangla Optical Character Recognition Using Kohonen Network Adnan Mohammad Shoeb Shatil and Mumit Khan Computer Science and Engineering, BRAC University, Dhaka, Bangladesh

More information

Prototype Selection for Handwritten Connected Digits Classification

Prototype Selection for Handwritten Connected Digits Classification 2009 0th International Conference on Document Analysis and Recognition Prototype Selection for Handwritten Connected Digits Classification Cristiano de Santana Pereira and George D. C. Cavalcanti 2 Federal

More information

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach

Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach Segmentation of Isolated and Touching characters in Handwritten Gurumukhi Word using Clustering approach Akashdeep Kaur Dr.Shaveta Rani Dr. Paramjeet Singh M.Tech Student (Associate Professor) (Associate

More information

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

Keywords Connected Components, Text-Line Extraction, Trained Dataset. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Language Independent

More information

Devanagari Isolated Character Recognition by using Statistical features

Devanagari Isolated Character Recognition by using Statistical features Devanagari Isolated Character Recognition by using Statistical features ( Foreground Pixels Distribution, Zone Density and Background Directional Distribution feature and SVM Classifier) Mahesh Jangid

More information

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition Dinesh Mandalapu, Sridhar Murali Krishna HP Laboratories India HPL-2007-109 July

More information

Enhanced Image. Improved Dam point Labelling

Enhanced Image. Improved Dam point Labelling 3rd International Conference on Multimedia Technology(ICMT 2013) Video Text Extraction Based on Stroke Width and Color Xiaodong Huang, 1 Qin Wang, Kehua Liu, Lishang Zhu Abstract. Video text can be used

More information

Hand Written Telugu Character Recognition Using Bayesian Classifier

Hand Written Telugu Character Recognition Using Bayesian Classifier Hand Written Telugu Character Recognition Using Bayesian Classifier K.Mohana Lakshmi 1,K.Venkatesh 2,G.Sunaina 3, D.Sravani 4, P.Dayakar 5 ECE Deparment, JNTUH, CMR Technical Campus, Hyderabad, India.

More information

Skeletonization Algorithm for Numeral Patterns

Skeletonization Algorithm for Numeral Patterns International Journal of Signal Processing, Image Processing and Pattern Recognition 63 Skeletonization Algorithm for Numeral Patterns Gupta Rakesh and Kaur Rajpreet Department. of CSE, SDDIET Barwala,

More information

K S Prasanna Kumar et al,int.j.computer Techology & Applications,Vol 3 (1),

K S Prasanna Kumar et al,int.j.computer Techology & Applications,Vol 3 (1), Optical Character Recognition (OCR) for Kannada numerals using Left Bottom 1/4 th segment minimum features extraction K.S. Prasanna Kumar Research Scholar, JJT University, Jhunjhunu, Rajasthan, India prasannakumarks@acharya.ac.in

More information

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM RAMZI AHMED HARATY and HICHAM EL-ZABADANI Lebanese American University P.O. Box 13-5053 Chouran Beirut, Lebanon 1102 2801 Phone: 961 1 867621 ext.

More information

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE Rupali Vitthalrao Suryawanshi Department of Electronics Engineering, Bharatratna Indira Gandhi College,

More information

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of

More information

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique

An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique An Improvement Study for Optical Character Recognition by using Inverse SVM in Image Processing Technique I Dinesh KumarVerma, II Anjali Khatri I Assistant Professor (ECE) PDM College of Engineering, Bahadurgarh,

More information

A Novel Feature Extraction and Classification Methodology for the Recognition of Historical Documents

A Novel Feature Extraction and Classification Methodology for the Recognition of Historical Documents 2009 10th International Conference on Document Analysis and Recognition A Novel Feature Eraction and Classification Methodology for the Recognition of Historical Documents G. Vamvakas, B. Gatos and S.

More information

AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE

AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE AN EFFICIENT BINARIZATION TECHNIQUE FOR FINGERPRINT IMAGES S. B. SRIDEVI M.Tech., Department of ECE sbsridevi89@gmail.com 287 ABSTRACT Fingerprint identification is the most prominent method of biometric

More information

Pattern Recognition Letters

Pattern Recognition Letters Pattern Recognition Letters 32 (2011) 1836 1842 Contents lists available at SciVerse ScienceDirect Pattern Recognition Letters journal homepage: www.elsevier.com/locate/patrec An improved contour-based

More information

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION S. M. Mali Department of Computer Science, MAEER S Arts, Commerce and Science College, Pune Shankarmali007@gmail.com Abstract In this paper,

More information

Neural Network based textural labeling of images in multimedia applications

Neural Network based textural labeling of images in multimedia applications Neural Network based textural labeling of images in multimedia applications S.A. Karkanis +, G.D. Magoulas +, and D.A. Karras ++ + University of Athens, Dept. of Informatics, Typa Build., Panepistimiopolis,

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

OFF-LINE HANDWRITTEN JAWI CHARACTER SEGMENTATION USING HISTOGRAM NORMALIZATION AND SLIDING WINDOW APPROACH FOR HARDWARE IMPLEMENTATION

OFF-LINE HANDWRITTEN JAWI CHARACTER SEGMENTATION USING HISTOGRAM NORMALIZATION AND SLIDING WINDOW APPROACH FOR HARDWARE IMPLEMENTATION OFF-LINE HANDWRITTEN JAWI CHARACTER SEGMENTATION USING HISTOGRAM NORMALIZATION AND SLIDING WINDOW APPROACH FOR HARDWARE IMPLEMENTATION Zaidi Razak 1, Khansa Zulkiflee 2, orzaily Mohamed or 3, Rosli Salleh

More information

Handwritten Marathi Character Recognition on an Android Device

Handwritten Marathi Character Recognition on an Android Device Handwritten Marathi Character Recognition on an Android Device Tanvi Zunjarrao 1, Uday Joshi 2 1MTech Student, Computer Engineering, KJ Somaiya College of Engineering,Vidyavihar,India 2Associate Professor,

More information

Hand Written Character Recognition using VNP based Segmentation and Artificial Neural Network

Hand Written Character Recognition using VNP based Segmentation and Artificial Neural Network International Journal of Emerging Engineering Research and Technology Volume 4, Issue 6, June 2016, PP 38-46 ISSN 2349-4395 (Print) & ISSN 2349-4409 (Online) Hand Written Character Recognition using VNP

More information