Online Bangla Handwriting Recognition System

Similar documents
Recognition of Off-Line Handwritten Devnagari Characters Using Quadratic Classifier

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Word-wise Hand-written Script Separation for Indian Postal automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Recognition of Unconstrained Malayalam Handwritten Numeral

SEGMENTATION OF CHARACTERS WITHOUT MODIFIERS FROM A PRINTED BANGLA TEXT

A two-stage approach for segmentation of handwritten Bangla word images

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

Line and Word Segmentation Approach for Printed Documents

Recognition of online captured, handwritten Tamil words on Android

Segmentation of Characters of Devanagari Script Documents

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Preprocessing of Gurmukhi Strokes in Online Handwriting Recognition

Handwritten Marathi Character Recognition on an Android Device

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

PCA-based Offline Handwritten Character Recognition System

Skew Angle Detection of Bangla Script using Radon Transform

Complementary Features Combined in a MLP-based System to Recognize Handwritten Devnagari Character

FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT

Segmentation of Bangla Handwritten Text

Opportunities and Challenges of Handwritten Sanskrit Character Recognition System

HMM-based Indic Handwritten Word Recognition using Zone Segmentation

Online Handwritten Devnagari Word Recognition using HMM based Technique

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Handwriting segmentation of unconstrained Oriya text

Comparative Performance Analysis of Feature(S)- Classifier Combination for Devanagari Optical Character Recognition System

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language

Handwritten Numeral Recognition of Kannada Script

Segmentation Based Optical Character Recognition for Handwritten Marathi characters

A Review on Different Character Segmentation Techniques for Handwritten Gurmukhi Scripts

Word-wise Script Identification from Video Frames

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Devanagari Handwriting Recognition and Editing Using Neural Network

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

On-line handwriting recognition using Chain Code representation

Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting

Gradient-Angular-Features for Word-Wise Video Script Identification

Recognition of handwritten Bangla basic characters and digits using convex hull based feature set

Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

Robust line segmentation for handwritten documents

OCR For Handwritten Marathi Script

Paper ID: NITETE&TC05 THE HANDWRITTEN DEVNAGARI NUMERALS RECOGNITION USING SUPPORT VECTOR MACHINE

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

A System towards Indian Postal Automation

Speeding up Online Character Recognition

Learning and Adaptation for Improving Handwritten Character Recognizers

CASIA-OLHWDB1: A Database of Online Handwritten Chinese Characters

CHARACTER Segmentation and Ground truth preparation for handwritten Bangla word images

Handwritten Devanagari Character Recognition

Structure in On-line Documents

A Statistical approach to line segmentation in handwritten documents

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM

A Technique for Offline Handwritten Character Recognition

Handwritten Devanagari Character Recognition Model Using Neural Network

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN

Devanagari Isolated Character Recognition by using Statistical features

A Complete Handwritten Numeral Database of Bangla A Major Indic Script

Explicit fuzzy modeling of shapes and positioning for handwritten Chinese character recognition

Research Article Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition

Convolution Neural Networks for Chinese Handwriting Recognition

Problems in Extraction of Date Field from Gurmukhi Documents

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

CHAPTER 1 INTRODUCTION

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

Multilevel Classifiers in Recognition of Handwritten Kannada Numerals

Handwritten Gurumukhi Character Recognition Using Zoning Density and Background Directional Distribution Features

Chapter Review of HCR

On-Line Recognition of Mathematical Expressions Using Automatic Rewriting Method

LEKHAK [MAL]: A System for Online Recognition of Handwritten Malayalam Characters

A Segmentation Free Approach to Arabic and Urdu OCR

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models

A Technique for Classification of Printed & Handwritten text

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

A Recognition System for Devnagri and English Handwritten Numerals

Image Normalization and Preprocessing for Gujarati Character Recognition

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script

Devnagari Handwriting Recognition System using Dynamic Time Warping Algorithm

Morphological Approach for Segmentation of Scanned Handwritten Devnagari Text

account the distribution of black pixels around every pixel of a connected component. In this work we have considered document images in portrait orie

A Dynamic Programming Method for Segmentation of Online Cursive Uyghur Handwritten Words into Basic Recognizable Units

Optical Character Recognition For Bangla Documents Using HMM

Hidden Loop Recovery for Handwriting Recognition

Character Recognition Experiments using Unipen Data

A Segmentation-free Approach to Recognise Printed Sinhala Script

Automatic Detection of Change in Address Blocks for Reply Forms Processing

A semi-incremental recognition method for on-line handwritten Japanese text

Signature Based Document Retrieval using GHT of Background Information

On Segmentation of Documents in Complex Scripts

Improved Optical Recognition of Bangla Characters

A FEATURE BASED CHAIN CODE METHOD FOR IDENTIFYING PRINTED BENGALI CHARACTERS

with Profile's Amplitude Filter

Transcription:

1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata-108, India Handwriting recognition is a difficult task because of the variability involved in the writing styles of different individuals. This paper presents a scheme for the online handwriting recognition of Bangla script. Online handwriting recognition refers to the problem of interpretation of handwriting input captured as a stream of pen positions using a digitizer or other pen position sensor. The sequential and dynamical information obtained from the pen movements on the writing pads are used as features in our proposed scheme. These features are then fed to the quadratic classifier for recognition. We tested our system on 2500 Bangla numeral data and 12500 Bangla character data and obtained 98.42% accuracy on numeral data and 91.13% accuracy on character data from the proposed system. Keywords: Online Recognition, Indian Script, Bangla, Modified quadratic discriminant function 1. Introduction Data entry using pen-based devices is gaining popularity in recent times. This is so because machines are getting smaller in size and keyboards are becoming more difficult to use in these smaller devices. Also, data entry for scripts having large alphabet size is difficult using keyboard. Moreover, there is an attempt to mimic the pen and paper metaphor by automatic processing of online characters. However, wide variation of human writing style makes online handwriting recognition a challenging pattern recognition problem. Work on online character recognition started gaining momentum about forty years ago. Numerous approaches have been proposed in the literature [1 5] and the existing approaches can be grouped into three classes namely, (i) structural analysis methods where each character is classified by its stroke structures, (ii) statistical approaches where various features extracted from character strokes are matched against a set of templates using statistical tools and (iii) motor function models that explicitly use trajectory information where the time evaluation of the pen co-ordinates plays an important role. Many techniques are available for online recognition of English,Arabic, Japanese and Chinese [1 4, 6 9] characters but there are only a few pieces of work [10 13] available towards Indian characters although India is a multi-lingual and multi-script country. Connell et al. [11] presented a preliminary study on online Devnagari character recognition. Joshi et al. [12] also proposed a work on Devnagari online character recognition. Later Joshi et al. [13] proposed an elastic matching based scheme for online recognition of Tamil character recognition. Although there are some work towards online recognition of Devnagari and Tamil scripts but the online recognition work towards other Indian languages is very few. In this paper we propose a system for the online recognition of Bangla character. Recognition of Indian character is very difficult with compare to English because of its shape variability of the characters as well as larger number of character classes. See Figure 1 where samples of four Bangla characters are shown to get an idea of handwriting variability. There are twelve scripts in India and in most of these scripts the number of alphabets (basic and compound characters) is more than 250, which makes keyboard design and subsequent data entry a difficult job. Hence, online recognition of such scripts has a commercial demand. Although a number of studies [14 16] have been done for offline recognition of a few printed Indian scripts like Devnagari, Bangla, Gurumukhi, Oriya, etc. with commercial level accuracy, but to the best of our knowledge no system is commercially available for online recognition of any Indian script. In this paper we propose a scheme for online Bangla handwritten character recognition and the scheme is robust against stroke connections as well as shape variation while maintaining reasonable robustness

2 against stroke order-variations. A quadratic classifier is used here for recognition purpose. The rest of the paper is organized as follows. In Section 2 we discuss about the Bangla language and data collection. Feature extraction process is presented in Section 3. Section 4 details the classifier used for recognition. The experimental results are discussed in Section 5. Finally, conclusion of the paper is given in Section 6. Fig. 1. Examples of some Bangla online characters. First three columns show samples of handwritten characters and last column shows samples of a numeral. 2. Bangla Language and online data collection Bangla, the second most popular language in India and the fifth most popular language in the world, is an ancient Indo-Aryans language. About 200 million people in the eastern part of Indian subcontinent speak in this language. Bangla script alphabets are used in texts of Bangla, Assamese and Manipuri languages. Also, Bangla is the national language of Bangladesh. The alphabet of the modern Bangla script consists of 11 vowels and 40 consonants. These characters are called as basic characters. Writing style in Bangla is from left to right and the concept of upper/lower case is absent in this script. It can be seen that most of the characters of Bangla have a horizontal line (Matra) at the upper part. From a statistical analysis we notice that the probability that a Bangla word will have horizontal line is 0.994 [14]. In Bangla script a vowel following a consonant takes a modified shape. Depending on the vowel, its modified shape is placed at the left, right, both left and right, or bottom of the consonant. These modified shapes are called modified characters. A consonant or a vowel following a consonant sometimes takes a compound orthographic shape, which we call as compound character. Compound characters can be combinations of two consonants as well as a consonant and a vowel. Compounding of three or four characters also exists in Bangla. There are about 280 compound characters in Bangla [15]. In this paper we consider the recognition of Bangla basic characters. To get an idea of Bangla basic characters and their variability in handwriting, a set of handwritten Bangla basic characters are shown in Figure 2. Main difficulty of Bangla character recognition is shape similarity, stroke size and the order variation of different strokes. By stroke we mean the set of points obtained between a pen down and pen up. From the statistical analysis on our dataset we found that the minimum (maximum) number of stroke used to write a Bangla character is 1 (4). The average number of stroke per character is 2.2. We have also seen that the characters E, O and are mostly written by single stroke whereas the character is written by almost all writer by 4 strokes. It also found that the characters,, t, n, K, h, etc. are always written by 2 strokes. Online recognition of these characters imposes several problems like stroke-number, stroke connection and shape variations. Most characters are composed of multiple strokes. Another difficult problem involves in the stroke-order variations. Writing of different stroke sequences of a character is not similar for all persons. During handwriting, some people give upper stroke of a character before giving a lower stroke of that character. Whereas to write the same character, some people give lower stroke before giving the upper stroke of that character. These stoke order variations complicates the development procedure of an online recognition system.

3 actual trace of the characters are generally large and varies greatly due to high variation in writing speed, a fixed lesser number of points, regularly spaced in time are selected for further processing. The digitizer output is represented in the format of p i R 2 X{0, 1} i = 1..M, where p i is the pen position having x-coordinate (x i ) and y-coordinate (y i ) and M is the total number of sample points. Let (p i ) and (p j ) be two consecutive pen points. We retain both of these two consecutive pen points (p i ) and (p j ) if the following condition is satisfied: x 2 + y 2 > m 2 (1) where x = x i x j and y = y i y j. The parameter m is empirically chosen. We have set m equal to zero; in Equation 1 to removes all consecutive repeated points. Fig. 2. Examples of handwritten Bangla basic characters. To illustrate this stoke order variation in Bangla script, Figure 3 shows a Bangla character that contains four different strokes. The left-most column shows the first stroke and this stroke is same for all the three samples of three different writers. Strokeorder varies from the second column onwards and the final (complete) character is shown in the rightmost column. From the 2nd column of Figure 3, it can be noted that the three samples have different shape. This is because of stroke order variation of the writers. For upper sample of the second column of Figure 3, the writer has given the upper stroke as second stroke. For middle sample of the second column the writer has given the lower stroke as second stroke. For lower sample, the writer has given the middle stroke as second stroke. Similar situation also occurs in 3rd column for Figure 3. Matra is a very common feature in Bangla and its length varies from writer to writer. It is seen that the presence or absence of Matra is the main difference between two characters. For example the character t is a basic character which has Matra but 3 is a Bangla numeral and it does not have Matra. For online data collection, the sampling rate of the signal is considered fixed for all the samples of all the classes of character. Thus the number of points M in the series of co-ordinates for a particular sample is not fixed and depends on the time taken to write the sample on the pad. As the number of points in Fig. 3. Example of different stroke order for a character having four strokes. Analyzing a total of 15000 Bangla characters we found that, for writing Bangla characters, the number of sample points (M) varies from 14 (for the character t) to 176 (for the character X) points. The average number of sample points in a Bangla character is 72. We also computed the average number of sample points in each character class. We noted that the character class ( ) has the maximum number of sample points and its average value is 113. The character class c has the minimum number (46) of sample points. 3. Feature extraction Any online feature is very much sensitive to writing stroke sequence and size variation. Also, in Bangla Matra creates a lot of problem in online recognition. To overcome it, we detect and remove the Matra present in the characters. As Matra of Bangla script

4 is a digital straight line lies on the upper part of a character. A stroke is called Matra if : (a) The ratio of distance of 1st and last point of the stroke to sum of individual distances between constituent points of the stroke is less than 1.5. (b) The ratio of the stroke height to the character height [max(y i ) min(y i ) i] is less than 0.35 and all the points of the stroke lie in upper side (we consider top 40% height of the character as upper side) of the character. These thresholds are decided based on statistical analysis of our dataset. If more than one stroke satisfies the above criteria then higher value of condition 1 is selected. The features calculated based on Matra are (1) The ratio of average value of x coordinate of the selected stroke to the length of the character, (2) The ratio of average value of y coordinate of the selected stroke to the width of the character, (3) Ratio of the length (L = l i i = 1..M where l i = (x 2 + y 2 ), x = x i x i+1 and y = y i y i+1 ) of the stroke to the length of the character, (4) Ratio of the area of the stroke to the character and (5) Ratio of aspect ratio of the stroke to that of character. A total of 5 features as discussed above are calculated based on Matra. After feature detection from Matra, it is then removed from the character and the rest of the points of the characters are first normalized. The normalization is done in two stages. First the points are re-sampled to a fixed number points (N) and then they are converted from equal time sample to equal distant points. For example see Figure 4. We have studied several local features, which include a normalized representation of the co-ordinates, a representation of the tangent slope angle, a normalized curvature, the ratio of tangents, etc. The processed character is transformed into a sequence t = [t 1,.., t N ] of feature vectors t i = (t i1, t i2, t i3 ) T. Here (1) t i1 = (x i µ x )/σ y and t i2 = (y i µ y )/σ y are the pen co-ordinates normalized by the sample mean µ = 1 N N i=1 p i and standard deviation, σ y = 1 N 1 N (µ y y i ) 2 of the character s sample points, i=1 and (2) t i3 = arg((x i+1 x i 1 ) + j (y i+1 y i 1 )), with j 2 = 1 and arg the phase of the complex number above, is an approximation of the tangent slope angle at point i. Thus finally, a feature vector sequence is defined as t = [t 1,.., t N ], each vector of it as t i = (t i1, t i2, t i3 ) T is obtained. Here the number of points (N) in which the character is normalized is 50. So a total of 155 (50 X 3 [3 for each point] + 5 [5 features based on Matra]) features are used in our experiment. Fig. 4. Feature extraction from a sample of character is shown. (a) Original image, (b) its normalized point used as feature (mapped into 50 points), (c) the normalized character. 4. The Classifier Based on these 155 dimensional features, recognition of characters in quadratic classifier [17] is carried out by using the following discriminant function: g(x) = (N + N 0 n 1) ln[1 + 1 [ X M 2 N 0 σ2 k λi λi + N {ΦT o i=1 N σ2 i (X M)} 2 ]] k + ln(λi + N o N σ2 ). (2) i=1 where X is the feature vector of an input character; M is a mean vector of samples; φ T i is the i th eigen vector of the sample covariance matrix; λ i is the i th eigen value of the sample covariance matrix; n is the feature size; σ 2 is the initial estimation of a variance; N is the number of learning samples; and N o is a confidence constant for σ, and N o is considered as 3N/7 for 155 dimensional feature. We do not use all eigen values and their respective eigen vectors for the classification. We sort the eigen values in descending order and take first 70 eigen values and their respective eigen vectors for classification. Rejection in the system is done if for a character the difference of 1st and 2nd value of g(x) is smaller than a threshold.

5 5. Results and Discussion The experimental evaluation of the above techniques was carried out using isolated Bangla character and numerals (digits). The data was collected from people of different background both by using mouse as well as Wacom tablet. The mouse is used as an input medium to facilitate the user with the flexibility of using this common input medium. The data obtained by mouse is in general poorer than Wacom tablet. Our another aim to use this mouse as one of the input devices is to test our system on these poorer data. If we get encouraging results from the data capture by mouse then there is no need to use any additional hardware like Wacom tablet for data capture. A total of 15,000 characters (2500 digits and rest are characters) are collected for the experiment. Out of them 66.7% of the characters (digits) are used for the training of the classifier for the present work and rest is used for the testing purpose. The recognition accuracy obtained from our classifier is shown in Table 1. Table 1. Recognition results on Bangla character and numeral (here rejection is not considered). Data Recognition rate Error rate Character 91.13% 8.87% Numeral 98.42% 1.58% From the experiments we noted that the overall recognition accuracy of the proposed scheme was 91.13% for Bangla characters and 98.42% accuracy for Bangla numerals when rejection is not considered. Accuracy of 96.23% and 99.58% is obtained for Bangla characters and numerals respectively, if we consider first two top choices of the recognition results. The detail recognition results are given in Table 2. Table 2. Recognition results based on different choices from top. Top Accuracy choices Character Numeral 1 91.13% 98.42% 2 96.23% 99.58% 3 97.68% 99.72% 4 98.26% 99.72% 5 98.39% 99.86% We also analyzed the error versus rejection rate of the classifier and the results are presented in Table 3. Table 3 depicts that for Bangla numeral, 1.57% error occurs at a rejection of 0.13% and only 0.85% error occurs when rejection was 1.10%. Table 3 also depicts that for Bangla character 6.56% error occurs at a rejection of 3.79% and only 2.64% error occurs when rejection was 17.27%. Table 3. Rejection versus error rate obtained for the Characters and Numerals. Characters Numerals Rejection Error Rejection Error 3.79% 6.56% 0.13% 1.57% 7.44% 5.12% 0.61% 1.10% 12.28% 3.69% 0.85% 0.96% 17.27% 2.64% 1.10% 0.85% Recognition accuracy of about 96% (99.6%) is obtained in character (numeral) recognition, if first two choices are considered. These different choices in recognition accuracy will be very helpful in designing a complete recognition system on word/sentence level where we will be able to take the help of dictionary or we can predict alternate results. From the experiment we noted most of the errors occurred because of similar shape characters. Maximum error occurred between m and s and it is noted that about 1.067% cases they mis-recognized one as the other. Main difference between the above two characters is that there is a small loop in left bottom side of one of the characters. Some times during handwriting people do not give this loop and hence mis-recognition occurs. The next erroneous character pair in Bangla is c, and Z. They mis-recognized in 0.93% cases. The main difference between the above characters is also the presence of loop in the lower half of the character and sometimes people do not give this loop during handwriting. The five most erroneous character pairs obtained from our experiment in Bangla are shown in Figure 5. Here erroneous character pairs are shown in the first row. In the second row the percentage of error and how many times they are mis-recognized are given. For example character m is mis-recognized as s in 19 cases, and the characters s is mis-recognized as m in 11 cases. So they mis-recognized between them in 30 (19+11) cases and their percentage of error is 1.067%.

6 Fig. 5. Some examples of misclassified Bangla characters. upper part. We tested the proposed system on 15000 samples and obtained encouraging results. Not much work has been done towards the online recognition of Indian scripts in general and Bangla in particular. So this work will be helpful for the research towards online recognition of other Indian scripts. We compared our work with the work of Garain et. al. [10]. For numeral recognition they have used a dataset of 400 numerals for their work. They captured their data by Wacom tablet and did not used mouse for data collection. For feature extraction they have used angular variation in 8 directional code and Euclidian distance of each sample point. Nearest neighbor classifier has been used by them for recognition. A comparative result on numeral recognition is shown in Table 4. Garain et. al. obtained 97.43% accuracy on training data of size 400 and they did not report any test result. We obtained 99.86% accuracy on training data and 98.42% on test data. Table 4. Comparison result on numeral recognition. Method Data Recognition Train Test Garain et. al. [10]. 400 97.43% Not reported Proposed 2500 99.86% 98.42 % Comparative results of character recognition is shown in Table 5. Garain et. al. [10] have used a dataset of 2440 character collected using Wacom tablet for their work and obtained 96.34% accuracy on training data. We obtained 98.73% accuracy on training data and 91.13% on test data. Table 5. Comparison result on character recognition. Method Data Recognition Train Test Garain et. al. [10]. 2440 96.34% Not reported Proposed 12500 98.73% 91.13 % 6. Conclusion This paper presents a quadratic classifier based system for the recognition of online Bangla handwriting. A modified and robust feature extraction technique is proposed here which can be used for the scripts with Matra/Shirorekha at the References [1] R. Plamondon and S. N. Srihari, IEEE PAMI 22, 63 (2000). [2] C. C. T. C. Y. Suen and T. Wakahara, IEEE PAMI 12, 179 (1990). [3] E. J. Bellagarda, J. R. Bellagarda, D. Nahamoo and N. S. Nathan, A probabilistic framework for online handwriting recognition, in Proc. of 3rd IWFHR, 1993. [4] I. Guyon, M. Schenkel and J. Denker, Overview and synthesis of on-line cursive handwriting recognition techniques, in Handbook of Character Recognition and Document Image Analysis, 1997. [5] CEDAR, Penman: Handwritten Text Recognition Project Description. [6] C. Bahlmann and H. Burkhardt, IEEE PAMI 26, 1 (2004). [7] S. D. Connell and A. K. Jain, PR 34, 1 (2001). [8] S. Jaeger, C. L. Liu and M. Nakagawa, IJDAR 6, 75 (2003). [9] C. L. Liu, S. Jaeger and M. Nakagawa, IEEE PAMI 26, 198 (2004). [10] U. Garain, B. B. Chaudhuri and T. Pal, Online handwritten indian script recognition: A human motor function based framework, in Proc. of 16th ICPR, 2002. [11] S. D. Connell, R. M. K. Sinha and A. K. Jain, Recognition of unconstrained online devnagari characters, in Proc. of 15th ICPR, 2000. [12] N. Joshi, G. Sita, A. G. Ramakrishnan, V. Deepu and S. Madhvanath, Machine recognition of online handwritten devanagari characters, in Proc. of 8th ICDAR, 2005. [13] N. Joshi, G. Sita, A. G. Ramakrishnan and S. Madhvanath, Elastic matching algorithms for online tamil character recognition, in Proc. ICONIP, 2004. [14] B. B. Chaudhuri and U. Pal, PR 31, 531 (1998). [15] B. B. Chaudhuri and U. Pal, An ocr system to read two indian language scripts: Bangla and Devnagari, in Proc. of 4th ICDAR, 1997. [16] V. Bansal and R. M. K. Sinha, On how to describe shapes of Devanagari characters and use them for recognition, in Proc. 5th ICDAR, 1999. [17] T. Wakabayashi, S. Tsuruoka, F. Kimura and Y. Miyake, Systems and Computers in Japan 26, 35 (1995).