Recognition of writer-independent off-line handwritten Arabic (Indian) numerals using hidden Markov models

Size: px
Start display at page:

Download "Recognition of writer-independent off-line handwritten Arabic (Indian) numerals using hidden Markov models"

Transcription

1 Signal Processing 88 (2008) Recognition of writer-independent off-line handwritten Arabic (Indian) numerals using hidden Markov models Sabri Mahmoud King Fahd University of Petroleum and Minerals, P.O. Box 1378, Dhahran 31261, Saudi Arabia Received 11 July 2007; received in revised form 30 September 2007; accepted 1 October 2007 Available online 9 October 2007 Abstract This paper describes a technique for the recognition of optical off-line handwritten Arabic (Indian) numerals using hidden Markov models (HMM). The success of HMM in speech recognition encouraged researchers to apply it to text recognition. In this work we did not follow the general trend of using sliding windows in the direction of the writing line to generate features. Instead we generated features based on the digit as a unit. Angle-, distance-, horizontal-, and verticalspan features are extracted from Arabic (Indian) numerals and used in training and testing the HMM. These features proved to be simple and effective. In addition to the HMM the nearest neighbor classifier is used. The results of both classifiers are then compared. Several experiments were conducted for estimating the suitable number of states for the HMM. The best results were achieved with an HMM model with 10 states. In addition, we experimented with different number of features. The best results were achieved with 120 feature vector representing a digit. A database of 44 writers, each writer wrote 48 samples of each digit resulting in a database of 21,120 samples. The data were size normalized to enable the technique to be size invariant. In extracting the features the center of gravity of the digit is used to make the technique translation invariant. The randomization technique was used to generate Arabic (Indian) numbers for training and testing the HMM classifier. The randomization was done on the number of digits per number and on the digit sequence. About 2171 Arabic (Indian) numbers were generated, totaling 21,120 digits numbers (totaling 16,657 digits) were used in training the HMM and 471 numbers (totaling 4463 digits) are used in testing the HMM. The samples of the first 24 writers were used in training the nearest neighbor classifier and the remaining 20 writers samples were used in testing. The achieved average recognition rates are 97.99% and 94.35% using the HMM and the nearest neighbor classifiers, respectively. The classification errors were analyzed and it was clear that some errors may be attributed to bad data, some to deformation and unbalanced proportion of digit segments, different writing styles of some digits, errors between digit pairs were specified and analyzed, and genuine errors. It was clear that the real misclassification of genuine data, in the case of HMM was nearly 1%. This proves the effectiveness of the presented technique to writerindependent off-line Arabic (Indian) handwritten digit recognition. The technique is writer independent as separate writers data were used in training of the classifiers and other writers data were used in the testing phase. r 2007 Elsevier B.V. All rights reserved. Keywords: Arabic (Indian) numeral recognition; OCR; HMM; Handwritten digit recognition; Independent writer recognition; Normalization address: smasaad@kfupm.edu.sa /$ - see front matter r 2007 Elsevier B.V. All rights reserved. doi: /j.sigpro

2 S. Mahmoud / Signal Processing 88 (2008) Introduction Machine simulation of human reading (i.e. optical character recognition) has been the subject of extensive research for more than 5 decades. The convenience of paper, its widespread use for communication and archiving, and the amount of information already on paper, press for quick and accurate methods to automatically reading that information and convert it into electronic form [1]. The potential application areas of automatic reading machines are numerous. One of the earliest and most successful applications is sorting check in banks, as the volume of checks that circulates daily has proven to be too enormous for manual entry [2,3]. Handwritten digit recognition is a vital component in many applications; office automation, check verification, and a large variety of banking, business, postal address reading, sorting and reading handwritten and printed postal codes and data entry applications are few examples. The recognition of handwritten text (characters and numeral digits) is a more difficult task due to the different handwriting styles of the writers that is subject to inter- and intra-writer variations. Arabic handwriting, unlike Latin, has many handwriting styles like Naskh, Kofi, and others. In several instances, writers mix between these writing styles. This makes the recognition problem more difficult hence requiring more sophisticated and very advanced feature extraction and recognition techniques. Arabic text recognition (ATR) has not been researched as thoroughly as Latin, Japanese, or Chinese. The lag of research on ATR compared with other languages (e.g. Latin or Chinese) may be attributed, in part, to lack of adequate support in terms of human resources, journals, books, etc.; lack of general supporting utilities like Arabic text databases, dictionaries, programming tools; and the special characteristics of Arabic language. The calligraphic nature of the Arabic set is distinguished from other languages in several ways. For example, Arabic text is written from right to left, with the Arabic having 28 basic characters, of which 16 have from one to three dots. Those dots differentiate between the otherwise similar characters. Within a word, some characters connect to the preceding and/or following characters and some do not connect. The shape of an Arabic character depends on its position in the word; a character might have up to four different shapes depending on it being isolated, connected from the right (beginning form), connected from the left (ending form), or connected from both sides (middle form). Characters in a word may overlap vertically (even without touching). Arabic characters do not have fixed size (height and width). On the other hand Arabic (Indian) numerals are not cursive. Fig. (1a) shows the Arabic (Indian) numerals. Indian numerals are used in Arabic writing while Arabic numerals are used in Latin languages. Hence, when the term Arabic numerals is used it refers to the Indian numerals that are used in Arabic. Although Arabic text is written right to left, Arabic (Indian) numbers are written left to right with most significant digit being the left-most one and the least significant digit is the right-most one. However, the way the digits are stored in memory is in the reverse order (viz. most significant digit is stored first and so on), is contrary to the way the number is displayed and seen in the scanned image. For example, Fig. (1b) shows the Arabic (Indian) number Digit 9 is written first then digits 8, 7, and 6 are written last. Digit 9 is the most significant digit and digit 6 is the least. Scanning an Arabic (Indian) number by scanner and saving the image will give 6 as the right-most digit, then 7, 8, then 9 as the left-most digits. The scanned image will have the digits of an Arabic (Indian) number in the reverse order of truth value in the text file. Hence, care must be taken in case of automation of the scanned images truth values generation. Various methods have been proposed and high recognition rates are reported for the recognition of English handwritten digits [4 8]. In recent years many researchers addressed the recognition of Arabic text including Arabic (Indian) numerals [9 17]. Surveys on Arabic optical text recognition may be cited in [1,18,19]. Bazzi et al. [20,21] presented a system for bilingual text recognition (English/Arabic). In addition, several researchers reported the recognition of Persian (Arabic) handwritten digits. However, the reported recognition rates need more improvements to be practical [22 26]. Al-Omari [9] presented a recognition system for Indian numeral digits using average template-matching Fig. 1. (a) Arabic (Indian) numerals, (b) Arabic (Indian) number.

3 846 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) approaches. Freehand sketches of online numeric digits placed on an image template were processed to extract a key feature vector representing significant boundary point distances from the digit center of gravity (COG). A model for each numeric digit is formed by processing 30 handwritten digit samples. Classification was made using the Euclidean distance between the feature vector of the test samples and the models. In another work Al-Omari and Al-Jarrah [10] presented a recognition system for online handwritten Indian numerals one to nine. The system skeletonizes the digits and then geometrical features of the skeleton of the digits are extracted. Probabilistic neural networks (PNNs) are used for classification. The developed system is translation, rotation, and scaling invariant. The authors claim that the system may be extended to address Arabic characters [10]. [11] Presented an algorithm based on structural techniques for extracting local features from the geometric and topological properties of online Arabic characters using fuzzy logic. Salah et al. [12] developed a serial model for visual digit classification based on the primitive selective attention mechanism. The technique is based on parallel scanning of a down sampled image to find interesting locations through a saliency map, and by extracting key features at those locations at high resolution. Shahrezea et al. [22] used the shadow coding method for recognition of Persian handwritten digits. In this method, a segment mask is overlaid on the digit image and the features are calculated by projecting the image pixels into these segments. In [23] the Persian digit images are represented by line segments that are used to model and recognize the digits. Additional features and classifier are needed for discriminating the digit pairs 0 5, 7 8, 4 6. Said et al. [24] fed the pixels of the normalized digit image as is into a neural network for classification, where the number of hidden units for the neural network classifier is determined dynamically. Sadri et al. [25] used a feature vector of length 16 that is estimated from the derivative of the horizontal and vertical profiles of the image. [26] Used the normalized image profile calculated at multiple orientations as the main feature for the recognition of Persian handwritten digits. The crossing counts and projection histogram calculated at multiple orientations are used as complementary features. The authors indicated that most of the system errors occurred in discriminating the digits 2, 3, 4 and 0, 5. Hence, discriminating these digits requires the use of additional features and may require the use of additional classifiers. It is worth mentioning that there is no generally accepted database for Arabic text/numeral recognition that is freely available for researchers. Hence different researchers of Arabic text/numeral recognition use different data and hence the recognition rates of the different techniques may not be comparable. In order to help in tackling this problem of Arabic (Indian) numerals data, the author will make his data freely available to interested researchers. In this paper, we present a simple, effective, and scalable technique for the recognition of writerindependent offline handwritten Indian numerals (0,1,y,9) used in Arabic writing. The presented technique was implemented using HMMs and the nearest neighbor classifier. The results of the two classifiers were analyzed and compared. Although we are addressing Arabic (Indian) numeral recognition in this paper, we aim at extending the work for addressing ATR in the future using the same technique. Using this technique for text recognition requires segmenting Arabic text or estimating possible segmentation points and using this information in the feature extraction and in the recognition engine. An Arabic (Indian) number may consist of an arbirary number of digits. The recognition system performs classification on each digit independently, preserving its relative position with respect to other digits in order to obtain the actual value of the number after recognition. The subsequent stages of the developed recognition system have enough flexibility to treat variations, line thickness, writing size, and translation of the handwritten string. The left to right position order of each digit is preserved to account for the digit weight after individual digit classification. This paper is organized as follows. Feature extraction is addressed in Section 2 where four types of features are used. Section 3 addresses data preparation and normalization. Hidden Markov models (HMM) are addressed in Section 4, Training, recognition, and experimental results are addressed in Section 5, and finally the conclusions are presented in Section Feature extraction To use HMMs several researchers computed the feature vectors as a function of an independent

4 S. Mahmoud / Signal Processing 88 (2008) variable. Normally this independent variable is time in the case of speech recognition. This simulates the use of HMM in speech recognition where sliding frames/windows are used. The same technique is used in off-line text recognition where the independent variable is in the text line direction [20,21]. This enables the use of HMM engine of speech recognition in text recognition. In this paper we are using different technique to extract the features of an Arabic (Indian) numeral using the numeral as a whole (and not using a sliding window). However, the same HMM classifier is used without modification. We believe that using sliding windows limits the type of features that may be extracted for a numeral. In our technique, many types of features used for off-line text recognition, using other classifiers, may be used with the HMM classifier. In this work the following sections present the features that are extracted for each Arabic (Indian) numeral Angular span features To enable the presented technique to be translation invariant we are using the digit COG to estimate the angle span features. The COG of the digit image (x c, y c ) is estimated using Eq. (1). This is used as the center of the numeral image. ðx c ; y c Þ¼ P m P n j¼1 i¼1 P m j¼1 P n i¼1 P ii½i; jš m P n i¼1 I½i; jš ; j¼1 ji½i; jš P n!, i¼1 I½i; jš P m j¼1 (1) where I is a binary image of dimension m n, x c and y c are the x- and y-coordinates of the digit COG. The image of the Arabic (Indian) numeral is sliced using angular lines with angles of a degrees between consecutive lines passing through the COG (x c, y c ). Fig. 2 shows the slicing of Arabic (Indian) numerals 4 and 9. The number of black pixels in each slice is computed using the lines Eq. (2) at the different angles. y ¼ mx þ b, (2) where m is the slope of the line, b is the y-intercept, m ¼ tan(y),and y is the line inclination angle. Since each line passes through the COG, by putting (x c, y c ) in Eq. (2), the y-intercept is estimated. Slice 1 is formed by the two lines having slope ¼ 0 and tan(y1), where y 1 is the angle of the first intercepting line (y 1 equal to a, y 2 equal 2a, and so on). The two lines are calculated by substituting Fig. 2. The angle slicing of Arabic (Indian) digits 4 and 9. the values of m and b in Eq. (2) for the two lines. The x-coordinate of each black pixel is substituted in the equations of the two lines to find the corresponding y-coordinates. If for a black pixel y falls between the two lines of a slice then this pixel is considered within slice 1 and slice 1 counter is incremented by 1. This procedure is repeated for other slices. Hence a digit image will have 360/a slices (features). These features are normalized by dividing the number of black pixels in each slice by the total number of black pixels of the Arabic (Indian) digit. Several values of a were used in our experimentation of the technique. The results are presented in the following sections Distance span features The COG of the Arabic (Indian) digit is used in the extraction of the distance span features. Distance d between COG (x c, y c ) and digit image origin (0, 0), which is the index of the top most left most pixel, is calculated using the Euclidean qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi distance formula, d ¼ ðx c 0Þ 2 þþðy c 0Þ 2 ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi x 2 c þ y2 c. Several concentric circles having center at (x c, y c ) are used. Radius of these circles can be found out by using equation r i ¼ d/c 2 i 1, where C is the number of used concentric circles, i ¼ 1,y,C, and d as above. The distance span feature values are calculated by summing up the black pixels between two consecutive concentric circles. The first feature value will be the number of black pixels within concentric circle having radius r1 while the second feature value will be the number of pixels outside circle having radius r1 and the inside circle having radius r2 and so on. To find these

5 848 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) feature values, the number of pixels Pi (where i ¼ 1,y,C) within the C circles are calculated. Consider the general form of the circle equation, r 2 ¼ (x x c ) 2 +(y y c ), where (x c, y c ) is the COG of the image and r is the radius of circle. The values of the black pixels are substituted in the equation and the counters belonging to the circles with radius larger than the estimated radius is incremented. The first feature is equal to P 1, the second feature is P 2 P 1, and so on. The outer leftover portion of the image is treated as the last feature, hence resulting in C+1 features. The extra feature comes from estimating the remaining black pixels that do not fall in any of the concentric circle. These features are then normalized by dividing them by total number of pixels of the Arabic (Indian) numeral. Fig. 3 illustrates digits 4 and 9 with the concentric circles Horizontal- and vertical-span features The whole image is divided into a number of equal horizontal and vertical bars. The number of bars in each dimension is taken, in our case, as 20 in the horizontal and 20 in the vertical directions. Hence, we are normalizing the number of features of the horizontal span (height of the digit) and the vertical span (the width of the digit) as some numerals are small like the zero while other numerals are large like digits 4 and 9. For each numeral we calculate the number of black pixels in the horizontal and vertical bars. To normalize these features we divide them by the total number of black pixels in the image. In this work, we used the above four types of features. Experimentally, as will be discussed in Section 5, we found that slicing the character at 51, using 7 slicing circles, taking 20 horizontal and 20 vertical segments gave the best recognition rates. This makes a total of 120 features. These features represent the digit as a whole. It is not possible to use HMM with one observation vector for a digit. HMM requires several observations for each digit so that transition from one state to another is possible in the training and testing phases of HMM. While other types of features extracted using sliding windows produce large number of observations, our technique uses less number of features. This reduced number of features was adequate in obtaining high recognition rates. Hence, we represented each Arabic (Indian) numeral to the HMM as a set of 12 observations, each consisting of 10 features. It is noted that other techniques using sliding windows produce much more features (more than 20-folds) than this technique for a character on the average. Fig. 4 shows the horizontal and vertical slicing of Arabic (Indian) digits 4 and Data preparation The data were collected from writers using semitransparent paper over a tabular grid. The data were collected from 44 writers; each writer wrote 48 Fig. 3. The concentric circles used for calculating the distance span features of Arabic (Indian) digits 4 and 9. Fig. 4. The horizontal and vertical slices of digits 4 and 9.

6 S. Mahmoud / Signal Processing 88 (2008) samples of each digit (0 9), a total of 480 digits per writer. The database consists of 21,120 samples. The written pages are then scanned using a scanner with a resolution of 300 pixels per inch. Fig. 5 shows the data collected from one writer. The scanned document images are transformed into binary images (viz. black and white). The black pixels represent the text lines and are given a value of one, while the white pixels represent the background and are given a value of zero. For each scanned page the horizontal histogram is computed. The resulting histogram has black and white regions. The black regions represent the text lines and white regions represent the spaces between the text lines. The locations of the black regions give the numeral lines limits. Using the numeral lines limits the numeral lines are extracted. For each line the vertical histogram is computed. The histogram will have Fig. 5. Data collected from one writer. Each digit is written 48 times.

7 850 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) black and white regions. The Black regions represent the digits and white regions represent the spaces between the digits. The locations of the black regions are used to specify the location of each digit in the line. These digits were extracted keeping each digit in a separate file with the number of rows and number of columns in the first line of the file. The remaining lines correspond to the character pixel rows where a black pixel is saved as a 1 and a white pixel as a 0. It is worth mentioning that the author will make these data freely available for interested researchers Normalization The dimensions of each numeral, in terms of height and width, are different. In order to make the technique size invariant the data were normalized. Arabic (Indian) characters differ in height and width. Arabic writing has writing line height. The individual characters are normally segmented at this height. We used this height to represent the height of all characters in the line. Arabic text is cursive and hence characters are connected normally at the writing line. Maintaining the aspect ratio of height to width is important so that connecting the characters at the right position is maintained. Although Arabic (Indian) numerals are not cursive (they are isolated) we decided to follow the same normalizing technique used for Arabic characters. This enables us to integrate our numeral recognition technique with other ATR systems. We normalized Fig. 6. Digits 0, 4, and 9 in original (first row) and normalized forms (second row). all the digits to a height of 60 pixels while maintaining the aspect ratio for each numeral. Hence, the width of the Arabic (Indian) numerals is different for the different digits. Since the samples of each digit are written on one line, we normalized the samples on the digit height. The writing line height in this case is dependent on the digit height. When numerals are used with text the normalization will be based on the text writing height. Fig. 6 shows samples of Arabic (Indian) digits before and after normalization. 4. Hidden Markov model (HMM) Several research papers are published using HMM for text recognition [15,20,21,27 29]. In order to use HMMs several researchers computed the feature vectors as a function of an independent variable. This simulates the use of HMM in speech recognition where sliding frames/windows are used. The same technique is used in off-line text recognition where the independent variable is in the direction of the line length [20,21]. In this paper we have used different techniques to extract the features of an Arabic (Indian) numeral using the numeral as a whole and not a sliding window, that calculates the features based on partial parts of the character. However, we are using the same HMM classifier without modification. In this paper we used a left to right HMM for our Arabic (Indian) handwritten numeral recognition Fig. 7 shows the case of a 5-state HMM. This is in line with several research works using HMM [20,21]. This model allows relatively large variations in horizontal position of the Arabic (Indian) numeral. The sequence of state transition in the training and testing of the model is related to each digit feature observations. In this work we experimented with using different number of states and selected the best performing one. Although each digit model could have different number of states we decided to use the same number of states for all digits as was done in [20,21]. Each Arabic (Indian) numeral is represented by a 120-dimensional feature vector (viz. 72 angle-, 8 Fig. 7. A 5-state hidden Markov model (HMM).

8 S. Mahmoud / Signal Processing 88 (2008) distance-, 20 horizontal-, and 20 vertical-span features). Each numeral requires a number of observations to train and test the HMM. So, we divided our feature vector to 12 separate sub vectors of 10 features each. Hence, each digit is represented by 12 observations of 10 features each. 5. Training and recognition In this paper we used two classifiers (viz. HMM and the nearest neighbor classifier). Each of these classifiers requires training and testing phases. The training of these classifiers is different. The way data are presented to each classifier in the training phase is different. We will address this issue separately for each classifier. We experimented with each classifier and analyzed the results of each classifier separately and then compared the results of the two classifiers. In general, in the training phase the features of training data are computed and saved as models for the trained classes. In the recognition phase an unknown character features are extracted and compared with the features of the models. The unknown character is assigned to the class whose features are the closest (or the most probable) to the new character. The implementation of this work was done using C language and MATLAB. The HTK tools [30] are used in the experimentation of HMM Hidden Markov model classifier (HMMC) In order to apply this classifier, we applied data randomization in the first phase and classification in the second phase Data randomization Since Arabic (Indian) digits are saved in separate files, in our database, it was necessary to represent Arabic (Indian) numbers as natural as possible. HMM computes the probabilities of each numeral and the probability of a digit appearing before and after other digits. HMM uses these probabilities in the training and recognition phases. In Arabic text numerals appear in equal probability and the digits have the same probability of appearing between two digits. Hence, it was necessary to randomize the presented data in the training of the HMM. The randomization is done in the length of the Arabic (Indian) number and on the appearance of the digits in a number. So the digits of an Arabic (Indian) number are combined in a random way and the Arabic (Indian) number length was randomized having arbitrary length from 5 to 15. We made a utility tool to achieve this. This tool uses two random generators, one to decide the length of the number (i.e. the number of digits in a particular line) and the other random generator to decide which digit (from 0 to 9) will be the next digit in a particular number. We used this approach for all the writers in a sequence. We selected the first writer; once all the samples for this writer are used, another writer is selected until all writers are exhausted. All these decisions were also recorded in the master label file to be used during the training and testing of the HMM. Overall we constructed 2171 numbers for use in training and testing of the HMM. Out of these 2171 numbers we used 1700 numbers for training (consisting of 16,657 digits) and the remaining 471 numbers for testing (consisting of 4463 digits). Thus in our case we have, in general, separate writers for training and testing Classification Using this classifier we experimented with different number of features and different number of states. We used a slicing angle a of 51 (resulting in 72 features), 7 concentric circles (resulting in 8 features), and 20 horizontal and 20 vertical features; a total of 120 features are used. This feature vector was split into 12 observations of 10 features each. In order to find the best number of states to use in the recognition and classification stages several runs are conducted on the data with different number of states (viz. 3, 6, 8, 10, and 12). We trained the HMM with the observations of 1700 Arabic (Indian) numbers of varying length totaling 16,657 digits and tested the HMM with the observations of new 471 Arabic (Indian) numbers of varying length totaling 4463 digits. Fig. 8 shows a summary of the results of the tested characters (the recognition rate and accuracy) using 3, 6,8,10, and 12 states. It is clear from the figure that the recognition rate increases with the increase of the number states until reaching state 10 (98.28) then dropped on using 12 states (97.32) counting the silences. The accuracy is lower than the recognition rate because it takes account of the insertion errors that the latter ignores. The recognition rate ignoring silences for the 10 state case is 97.99%. Other experiments were run using a different number of features. In these experiments we used a span angle a of 61 (resulting in 60 features), 9 concentric span circles (resulting in 10 features), and

9 852 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) Table 2 The confusion matrix for the 10 states using 12 observations of 10 features each %c %e Fig. 8. The correct recognition rate and accuracy at states 3, 6, 8, 10,and 12 with 128 code book. Table 1 The recognition rates and accuracy of different number of features using HMMs with 6, 10, and 12 states Used features States 72 Angle-, 8 distance-, 20 horizontal- and verticalspan features Recognition rate Accuracy 60 Angle-, 10 distance-, 20 horizontal- and verticalspan features Recognition rate 20 horizontal- and 20 vertical-span features. The total numbers of features used were 110. This feature vector was split into 11 observations of 10 features each. These experiments were run using an HMM with 6, 10, and 12 states. Table 1 shows the results of these experiments compared with the results of using 120 features as detailed in the previous paragraph. The confusion matrix for the 120 feature case is shown in Table 2. Here %c is percentage of recognition rate and %e is the error percentage. The average recognition rate is 97.99%, ignoring silences (98.28% including silences as produced by HMM) The nearest neighbor classifier (NNC) Accuracy We conducted several experiments using a different number of samples for training/modeling and testing. In the training phase the feature vectors (V) of the training data are extracted (viz. 72 angle-, 8 distance-, 20 horizontal- and 20 vertical-span features). The features of each digit are averaged and the averaged features are used as the models of the Arabic (Indian) numerals. Since this classifier does not use the probabilities of the occurrences of each digit and does not use the conditional probabilities of one digit coming before or after other digits, the data are simpler to present to the classier in the training phase as will be discussed below. In the testing phase the feature vector (V) for the unknown character is computed and then compared to the feature vectors of the model classes. The classification decision is based on the nearest neighbor classification method. The nearest distance is computed using a simple formula given by E i ¼ Xk j¼1 jm ij V i j, (3) where E i is the distance between the input digit and model i (i.e sum of the absolute differences between the features of the input digit and those of model i), k is the total number of parameters in the feature vector(i.e. 120), M ij is the jth feature of model i, and V j is feature j of the input digit feature vector. The distance (E i ) between the new digit and all models feature vectors are found. The argument of the minimum value found (i.e. min(e i )) yields the recognized model i. This model is considered as the class that matches most closely the obtained features vector of the unknown digit. Hence, the class of the digit is found.

10 S. Mahmoud / Signal Processing 88 (2008) In our training and testing using the NNC, we used the same number of features that were used in HMM with highest recognition rate for comparison purposes. We experimented with different number of training and testing data. After several experimentations we got the best recognition rates with using 24 writers for training (a total of 11,520 samples) and tested the system with the remaining writers (i.e. 20 writers, a total of 9600 samples). The confusion matrix is given in Table 3 along with the recognition rate for each digit. The average recognition rate is 94.35%. Fig. 9 shows the recognition rates of the tested writers 25 to 44. It is clear that the recognition rates of some writers are over 98% while other writers little over 87%. This variation is normal with different writers as each writer may have different writing styles that is distant from the averaged feature vectors of the models. This variation will be analyzed in more details later with the analysis of erroneous samples. Fig. 10 shows the recognition rates of the different digits using HMM and the NNC. It is clear from the figure that HMM outperforms the NNC, as expected. On the average HMM recognition rate is nearly 4% more than the NNC. In some cases the difference is 10% as is the case with digit 5. We analyzed the samples of the database that were misclassified. The reasons for the misclassification errors may be attributed to the following main categories: (1) Errors due to bad or corrupted data. Fig. 11 shows samples of bad or corrupted data. The number of this type of data is 55 in the testing data that accounts for 0.57% of the average Table 3 The confusion matrix of the least nearest neighbor classifier and the recognition rates of each digit % Recognition rate % Error Fig. 9. The recognition rates of the tested writers (25 44).

11 854 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) Fig. 10. Comparison of the recognition rates of the HMM and the nearest neighbor classifiers. Fig. 11. Samples of badly written or corrupted data. error rate (10% of the errors). This applies to both classifiers (viz. HMMC and NNC). (2) Errors due to deformed samples or samples with un-proportional segments in relation to other segments in length and orientation. Fig. 12 shows samples with this type of errors. There are 74 samples that are classified under this type of error. These samples account for 0.77% of the error rate (or 13.6% of the errors) of the NNC. This percentage is much less for the HMMC as HMM tolerates this type of errors. (3) Errors due to samples written with different style than the training style. For example, digit three may be written with three upward segments or two upward segments ( ). This type of error can be addressed by allowing a digit to have more than one model. Each style of a digit, if it is appreciably different from the basic model, is represented by another model. There were 16 errors of this type for one writer alone. Fig. 13 shows samples of the writer style. This type of error was present in both classifiers.

12 S. Mahmoud / Signal Processing 88 (2008) Fig. 14. Samples of digit 5 recognized as digit 0. Fig. 12. Samples of error data due deformed or un-proportional segments. Fig. 15. Zeros recognized as one. Fig. 13. Samples of digit 3 written in different style than the style of the training samples. (4) Errors related to digit pairs. Some digits are close in shape to other digits. Hence, if not written specifically they look similar to other digits. For example, if digit 5 is written in small size and the inside hole is small it is normally confused with zero. This case accounts for 94 errors (9.8% of digit 5 errors) of NNC. Fig. 14 shows samples of this category. Another example is the digit zero. It is normally a dot. Sometimes it is written like a small line or spreading pixels. When the digit is normalized it looks very similar to one. In such cases it is confused with digit 1. However, when digits are included with Arabic text then this problem is expected to disappear as the normalization will be based on the line height. Hence, digit zero will be very small compared with a one and hence will not be confused with it. Fig. 15 shows examples of this category. Both classifiers suffer from this type of error although HMMC is at much lesser extent. There are other digit pairs that are sometimes confused (viz. digit 7 with digit 8, and 2 with 4 ). (5) Genuine errors that are misclassified with no visible reason and that can be attributed to insufficient classification capability of the used features and classifiers. It is not expected to get 100% recognition rate for writer-independent handwritten offline digits from any classifier. It is to be noted here that a human may make a 1% misclassification error on classifying the data in the database if the context is not present. 6. Conclusions This paper presented a system for independent writer off-line handwritten Arabic (Indian) numeral recognition based on estimating simple and effective features. In this work we used HMM and the NNC. We analyzed the performance of the HMM using different number of features and different number of states. We selected the number of features and states giving the highest recognition rate. These same features were used with the NNC. The technique is scale and translation invariant. The experimental results indicate the effectiveness of the proposed technique in the automatic recognition of off-line Arabic (Indian) handwritten numerals. A database of 21,120 digits was used in training and testing the classifiers. In the HMM, 1700 Arabic (Indian) numbers of varying lengths (totaling 16,657 digits) were used in training the HMM and 471 numbers of varying length totaling 4463 digits were used in testing the

13 856 ARTICLE IN PRESS S. Mahmoud / Signal Processing 88 (2008) HMM. Angular-, distance-, horizontal-, and vertical-span features were used. Several experiments were conducted to achieve the best recognition rate by using different number of states in the model and by modifying the number of used features. An average recognition rate of was achieved using 120 features presented as 12 observations of 10 features per digit. An HMM of 10 states was used. Randomization of presenting the observations was necessary in the training of the HMM. Randomization was applied in the length of Arabic (Indian) numbers and in the used digits in each number. The same database and features were used for training and testing the NNC. Samples of the first 24 writers (totaling 11,520 digits) were used in the training and the last 20 writers (25 44 totaling 9600 samples) were used in the testing phase. The average recognition rate achieved was 94.35%. The researchers are currently exploring the use of more statistical and syntactical features. The same technique will be applied to Arabic text recognition. The author is investigating the use of super vector machines (SVM) and neural networks for Arabic text/numeral recognition. In addition, the use of multiple classifiers will be explored. Acknowledgments First I would like to thank the referees for their constructive criticism and stimulating remarks. The modification of the original manuscript to address those remarks improved the revised manuscript considerably. In addition, I would like to thank King Fahd University of Petroleum and Minerals for supporting this research work and providing the computing facilities. References [1] Badr Al-Badr, Sabri A. Mahmoud, Survey and bibliography of Arabic optical text recognition, J. Signal Process. 41 (1) (January 1995) [2] J. Mantas, An overview of character recognition methodologies, Pattern Recogn. 19 (6) (1986) [3] V.K. Govindan, A.P. Shivaprasad, Character recognition a review, Pattern Recogn. 23 (7) (1990) [4] C.L. Liu, K. Nakashima, H. Sako, H. Fujisawa, Handwritten digit recognition: benchmarking of state-of the-art techniques, Pattern Recogn. 36 (2003) [5] M. Shi, Y. Fujisawa, T. Wakabayashi, F. Kimura, Handwritten numeral recognition using gradient and curvature of gray scale image, Pattern Recogn. 35 (2002) [6] L.N. Teow, K.F. Loe, Robust vision-based features and classification schemes for off-line handwritten digit recognition, Pattern Recogn. 35 (2002) [7] K. Cheung, D. Yeung, R.T. Chin, A Bayesian framework for deformable pattern recognition with application to handwritten character recognition, IEEE Trans. Pattern Anal. Mach. Intell. 29 (12) (1998) [8] I.J. Tsang, I.R. Tsang, D.V. Dyck, Handwritten character recognition based on moment features derived from image partition, Int. Conf. Image Process. 2 (1998) [9] F. Al-Omari, Hand-written Indian numeral recognition systems using template matching approaches, Proc. ACS/ IEEE Int. Conf. Comput. Syst. Appl. (2001) [10] F.A. Al-Omari, O. Al-Jarrah, Handwritten Indian numerals recognition system using probabilistic neural networks, Adv. Eng. Inform. 18 (2004) [11] F. Bousalma, Structural and fuzzy techniques in the recognition of online Arabic characters, Int. J. Pattern Recogn Artif. Intell. 13 (7) (1999) [12] A. Salah, E. Albaydin, L. Akarun, A selective attentionbased method for visual pattern recognition with application to handwritten digit recognition and face recognition, IEEE Trans. Pattern Anal. Mach. Intell. 24 (3) (2002) [13] A. Hamid, R. Haraty, A neuro-heuristic approach for segmenting handwritten Arabic text, Proc. ACS/IEEE Int. Conf. Comput. Syst. Appl. 110 (3) (2001). [14] S. Saloum, Arabic hand-written text recognition, Proc. ACS/ IEEE Int. Conf. Comput. Syst. Appl. (2001) [15] S. Almaadeed, C. Higgens, D. Elliman, Recognition of off-line handwritten Arabic words using hidden Markov model approach, ICPR 2002, Quebec City, August 2002, pp [16] S. Almaadeed, C. Higgins, D.G. Elliman, Off-line recognition of handwritten Arabic words using multiple hidden Markov models, Knowledge Based Syst. 17 (2004) [17] S. Touj, N.B. Amara, H. Amiri, Arabic handwritten words recognition based on a planar hidden Markov model, Int. Arab J. Inf. Technol. 2 (4) (2005) [18] M. Khorsheed, Off-line Arabic character recognition a review, Pattern Anal. Appl. 5 (2002) [19] L.M. Lorigo, V. Govindaraju, Offline Arabic handwriting recognition: a survey, EEE Trans. Pattern Anal. Mach. Intell. 28 (5) (May 2006) [20] I. Bazzi, C. LaPre, J. Makhoul, R. Schwartz, Omnifont and unlimited vocabulary OCR for English and Arabic, in: Proceedings of the International Conference on Document Analysis and Recognition, vol. 2, Ulm, Germany, 1997, pp [21] I. Bazzi, R. Schwartz, J. Makhoul, An Omifont openvacabulary OCR system for English and Arabic, IEEE Trans. PAMI 21 (6) (1999) [22] M.H.S. Shahrezea, K. Faez, A. Khotanzad, Recognition of handwritten Persian/Arabic numerals by shadow coding and an edited probabilistic neural network, Proc. Int. Conf. Image Process. 3 (1995) [23] H.M.M. Hosseini, A. Bouzerdoum, A combined method for Persian and Arabic handwritten digit recognition. in: Proceedings of the Australian New Zealand Conference, on Intelligent Information Systems. 1996, pp [24] F.N. Said, R.A. Yacoub, C.Y. Suen, Recognition of English and Arabic numerals using a dynamic number of hidden neurons. in: Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999, pp

14 S. Mahmoud / Signal Processing 88 (2008) [25] J. Sadri, C.Y. Suen, T.D. Bui, Application of support vector machines for recognition of handwritten Arabic/Persian digits. in: Proceedings of Second Iranian Conference on Machine Vision and Image Processing, vol. 1, 2003, pp [26] H. Soltanzadeh, M. Rahmati, Recognition of Persian handwritten digits using image profiles of multiple orientations, Pattern Recogn. Lett. 25 (2004) [27] M. Mohamed, P. Gader, Handwritten word recognition using segmentation-free hidden Markov modeling and segmentation-based dynamic programming techniques, IEEE Trans. Pattern Anal. Mach. Intell. 18 (5) (May 1996) [28] J. Hu, S.G. Lim, Michael K. Brown, Writer independent online handwriting recognition using an HMM approach, Pattern Recogn. 33 (2000) [29] A.H. Hassin, X. Tang, J. Liu, W. Zhao, Printed Arabic character recognition using HMM, J. Comput. Sci. Technol. 19 (4) (July 2004) [30] HTK Speech Recognition Toolkit, /htk.eng.cam.ac.uk/s.

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features

A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features A Decision Tree Based Method to Classify Persian Handwritten Numerals by Extracting Some Simple Geometrical Features Hamidreza Alvari, Seyed Mehdi Hazrati Fard, and Bahar Salehi Abstract Automatic recognition

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

FEATURE EXTRACTION TECHNIQUE FOR HANDWRITTEN INDIAN NUMBERS CLASSIFICATION

FEATURE EXTRACTION TECHNIQUE FOR HANDWRITTEN INDIAN NUMBERS CLASSIFICATION FEATURE EXTRACTION TECHNIQUE FOR HANDWRITTEN INDIAN NUMBERS CLASSIFICATION 1 SALAMEH A. MJLAE, 2 SALIM A. ALKHAWALDEH, 3 SALAH M. AL-SALEH 1, 3 Department of Computer Science, Zarqa University Collage,

More information

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System Mono-font Cursive Arabic Text Recognition Using Speech Recognition System M.S. Khorsheed Computer & Electronics Research Institute, King AbdulAziz City for Science and Technology (KACST) PO Box 6086, Riyadh

More information

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,

More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

PRINTED ARABIC CHARACTERS CLASSIFICATION USING A STATISTICAL APPROACH

PRINTED ARABIC CHARACTERS CLASSIFICATION USING A STATISTICAL APPROACH PRINTED ARABIC CHARACTERS CLASSIFICATION USING A STATISTICAL APPROACH Ihab Zaqout Dept. of Information Technology Faculty of Engineering & Information Technology Al-Azhar University Gaza ABSTRACT In this

More information

Handwritten Arabic Digits Recognition Using Bézier Curves

Handwritten Arabic Digits Recognition Using Bézier Curves www.ijcsi.org 57 Handwritten Arabic Digits Recognition Using Bézier Curves Aissa Kerkour El Miad and Azzeddine Mazroui University Mohammed First, Faculty of Sciences, Oujda, Morocco Abstract In this paper

More information

One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition

One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition Nafiz Arica Dept. of Computer Engineering, Middle East Technical University, Ankara,Turkey nafiz@ceng.metu.edu.

More information

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM

ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM ABJAD: AN OFF-LINE ARABIC HANDWRITTEN RECOGNITION SYSTEM RAMZI AHMED HARATY and HICHAM EL-ZABADANI Lebanese American University P.O. Box 13-5053 Chouran Beirut, Lebanon 1102 2801 Phone: 961 1 867621 ext.

More information

A Fast Recognition System for Isolated Printed Characters Using Center of Gravity and Principal Axis

A Fast Recognition System for Isolated Printed Characters Using Center of Gravity and Principal Axis Applied Mathematics, 2013, 4, 1313-1319 http://dx.doi.org/10.4236/am.2013.49177 Published Online September 2013 (http://www.scirp.org/journal/am) A Fast Recognition System for Isolated Printed Characters

More information

Recognition of Persian Handwritten Numbers using LBP-HOG Descriptor

Recognition of Persian Handwritten Numbers using LBP-HOG Descriptor Bulletin of Environment, Pharmacology and Life Sciences Bull. Env.Pharmacol. Life Sci., Vol 4 [Spl issue 1] 2015: 284-290 2014 Academy for Environment and Life Sciences, India Online ISSN 2277-1808 Journal

More information

OCR For Handwritten Marathi Script

OCR For Handwritten Marathi Script International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,

More information

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Stefan Müller, Gerhard Rigoll, Andreas Kosmala and Denis Mazurenok Department of Computer Science, Faculty of

More information

A Segmentation Free Approach to Arabic and Urdu OCR

A Segmentation Free Approach to Arabic and Urdu OCR A Segmentation Free Approach to Arabic and Urdu OCR Nazly Sabbour 1 and Faisal Shafait 2 1 Department of Computer Science, German University in Cairo (GUC), Cairo, Egypt; 2 German Research Center for Artificial

More information

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 27-37 TJPRC Pvt. Ltd., HANDWRITTEN GURMUKHI

More information

Online Bangla Handwriting Recognition System

Online Bangla Handwriting Recognition System 1 Online Bangla Handwriting Recognition System K. Roy Dept. of Comp. Sc. West Bengal University of Technology, BF 142, Saltlake, Kolkata-64, India N. Sharma, T. Pal and U. Pal Computer Vision and Pattern

More information

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques 1 Lohitha B.J, 2 Y.C Kiran 1 M.Tech. Student Dept. of ISE, Dayananda Sagar College

More information

An Efficient Character Segmentation Based on VNP Algorithm

An Efficient Character Segmentation Based on VNP Algorithm Research Journal of Applied Sciences, Engineering and Technology 4(24): 5438-5442, 2012 ISSN: 2040-7467 Maxwell Scientific organization, 2012 Submitted: March 18, 2012 Accepted: April 14, 2012 Published:

More information

LECTURE 6 TEXT PROCESSING

LECTURE 6 TEXT PROCESSING SCIENTIFIC DATA COMPUTING 1 MTAT.08.042 LECTURE 6 TEXT PROCESSING Prepared by: Amnir Hadachi Institute of Computer Science, University of Tartu amnir.hadachi@ut.ee OUTLINE Aims Character Typology OCR systems

More information

Human Performance on the USPS Database

Human Performance on the USPS Database Human Performance on the USPS Database Ibrahim Chaaban Michael R. Scheessele Abstract We found that the human error rate in recognition of individual handwritten digits is 2.37%. This differs somewhat

More information

Linear Discriminant Analysis in Ottoman Alphabet Character Recognition

Linear Discriminant Analysis in Ottoman Alphabet Character Recognition Linear Discriminant Analysis in Ottoman Alphabet Character Recognition ZEYNEB KURT, H. IREM TURKMEN, M. ELIF KARSLIGIL Department of Computer Engineering, Yildiz Technical University, 34349 Besiktas /

More information

Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations

Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations Signature Recognition by Pixel Variance Analysis Using Multiple Morphological Dilations H B Kekre 1, Department of Computer Engineering, V A Bharadi 2, Department of Electronics and Telecommunication**

More information

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models Gleidson Pegoretti da Silva, Masaki Nakagawa Department of Computer and Information Sciences Tokyo University

More information

Recognition of Printed Arabic Words with Fuzzy ARTMAP Neural Network

Recognition of Printed Arabic Words with Fuzzy ARTMAP Neural Network Recognition of Printed Arabic Words with Fuzzy ARTMAP Neural Network Adnan Amin' and Nabeel Murshed2 'School of Computer Science and Engineering University of New South Wales, Sydney-Australia amin@cse.unsw.edu.au

More information

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION Binod Kumar Prasad * * Bengal College of Engineering and Technology, Durgapur, W.B., India. Rajdeep Kundu 2 2 Bengal College

More information

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation 009 10th International Conference on Document Analysis and Recognition HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation Yaregal Assabie and Josef Bigun School of Information Science,

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting

Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting 2013 12th International Conference on Document Analysis and Recognition Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting Yan-Fei Lv 1, Lin-Lin

More information

Non-uniform Slant Correction using Generalized Projections

Non-uniform Slant Correction using Generalized Projections I J C T A, 9(17) 2016, pp. 8489-8497 International Science Press Non-uniform Slant Correction using Generalized Projections A. M. Hafiz * and G. M. Bhat * ABSTRACT Slant Correction is an important component

More information

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS 8.1 Introduction The recognition systems developed so far were for simple characters comprising of consonants and vowels. But there is one

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P.

WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P. University of Groningen WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P. Published in: EPRINTS-BOOK-TITLE IMPORTANT NOTE: You are advised to consult the publisher's

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015 Offline Handwritten Signature Verification using Neural Network Pallavi V. Hatkar Department of Electronics Engineering, TKIET Warana, India Prof.B.T.Salokhe Department of Electronics Engineering, TKIET

More information

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in

More information

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION

MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION MOMENT AND DENSITY BASED HADWRITTEN MARATHI NUMERAL RECOGNITION S. M. Mali Department of Computer Science, MAEER S Arts, Commerce and Science College, Pune Shankarmali007@gmail.com Abstract In this paper,

More information

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques

Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar

More information

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN

Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Enhancing the Character Segmentation Accuracy of Bangla OCR using BPNN Shamim Ahmed 1, Mohammod Abul Kashem 2 1 M.S. Student, Department of Computer Science and Engineering, Dhaka University of Engineering

More information

Recognition of Unconstrained Malayalam Handwritten Numeral

Recognition of Unconstrained Malayalam Handwritten Numeral Recognition of Unconstrained Malayalam Handwritten Numeral U. Pal, S. Kundu, Y. Ali, H. Islam and N. Tripathy C VPR Unit, Indian Statistical Institute, Kolkata-108, India Email: umapada@isical.ac.in Abstract

More information

Classification of Printed Chinese Characters by Using Neural Network

Classification of Printed Chinese Characters by Using Neural Network Classification of Printed Chinese Characters by Using Neural Network ATTAULLAH KHAWAJA Ph.D. Student, Department of Electronics engineering, Beijing Institute of Technology, 100081 Beijing, P.R.CHINA ABDUL

More information

Multi prototype fuzzy pattern matching for handwritten character recognition

Multi prototype fuzzy pattern matching for handwritten character recognition Multi prototype fuzzy pattern matching for handwritten character recognition MILIND E. RANE, DHABE P. S AND J. B. PATIL Dept. of Electronics and Computer, R.C. Patel Institute of Technology, Shirpur, Dist.

More information

Handwritten Devanagari Character Recognition Model Using Neural Network

Handwritten Devanagari Character Recognition Model Using Neural Network Handwritten Devanagari Character Recognition Model Using Neural Network Gaurav Jaiswal M.Sc. (Computer Science) Department of Computer Science Banaras Hindu University, Varanasi. India gauravjais88@gmail.com

More information

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks

Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks Simulation of Zhang Suen Algorithm using Feed- Forward Neural Networks Ritika Luthra Research Scholar Chandigarh University Gulshan Goyal Associate Professor Chandigarh University ABSTRACT Image Skeletonization

More information

Offline Signature verification and recognition using ART 1

Offline Signature verification and recognition using ART 1 Offline Signature verification and recognition using ART 1 R. Sukanya K.Malathy M.E Infant Jesus College of Engineering And Technology Abstract: The main objective of this project is signature verification

More information

International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March ISSN

International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March ISSN International Journal of Scientific & Engineering Research, Volume 8, Issue 3, March-2017 1850 Optical Character Recognition for Running C Code Upendra Mishra 1, Shiva Panwar 2, Deeksha Upadhyay 2, Kamal

More information

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity

Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity Journal of Computer Science 5 (5): 363-368, 2009 ISSN 1549-3636 2009 Science Publications Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity Atallah Mahmoud Al-Shatnawi

More information

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 3, Issue 7 (September 2012), PP. 43-49 A Simplistic Way of Feature Extraction Directed

More information

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS

DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS DEVANAGARI SCRIPT SEPARATION AND RECOGNITION USING MORPHOLOGICAL OPERATIONS AND OPTIMIZED FEATURE EXTRACTION METHODS Sushilkumar N. Holambe Dr. Ulhas B. Shinde Shrikant D. Mali Persuing PhD at Principal

More information

A Document Image Analysis System on Parallel Processors

A Document Image Analysis System on Parallel Processors A Document Image Analysis System on Parallel Processors Shamik Sural, CMC Ltd. 28 Camac Street, Calcutta 700 016, India. P.K.Das, Dept. of CSE. Jadavpur University, Calcutta 700 032, India. Abstract This

More information

6. Applications - Text recognition in videos - Semantic video analysis

6. Applications - Text recognition in videos - Semantic video analysis 6. Applications - Text recognition in videos - Semantic video analysis Stephan Kopf 1 Motivation Goal: Segmentation and classification of characters Only few significant features are visible in these simple

More information

II. WORKING OF PROJECT

II. WORKING OF PROJECT Handwritten character Recognition and detection using histogram technique Tanmay Bahadure, Pranay Wekhande, Manish Gaur, Shubham Raikwar, Yogendra Gupta ABSTRACT : Cursive handwriting recognition is a

More information

Writer Recognizer for Offline Text Based on SIFT

Writer Recognizer for Offline Text Based on SIFT Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 5, May 2015, pg.1057

More information

Layout Segmentation of Scanned Newspaper Documents

Layout Segmentation of Scanned Newspaper Documents , pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms

More information

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier

Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Structural Feature Extraction to recognize some of the Offline Isolated Handwritten Gujarati Characters using Decision Tree Classifier Hetal R. Thaker Atmiya Institute of Technology & science, Kalawad

More information

Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition

Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition Dynamic Stroke Information Analysis for Video-Based Handwritten Chinese Character Recognition Feng Lin and Xiaoou Tang Department of Information Engineering The Chinese University of Hong Kong Shatin,

More information

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script

A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,

More information

Word-wise Script Identification from Video Frames

Word-wise Script Identification from Video Frames Word-wise Script Identification from Video Frames Author Sharma, Nabin, Chanda, Sukalpa, Pal, Umapada, Blumenstein, Michael Published 2013 Conference Title Proceedings 12th International Conference on

More information

A System towards Indian Postal Automation

A System towards Indian Postal Automation A System towards Indian Postal Automation K. Roy C.V.P.R Unit, I.S.I, Kolkata-108; India, Kaushik_mrg@hotmail.com S. Vajda LORIA Research Center, B.P. 239 54506, Nancy, France Szilard.Vajda@loria.fr U.

More information

Application of Geometry Rectification to Deformed Characters Recognition Liqun Wang1, a * and Honghui Fan2

Application of Geometry Rectification to Deformed Characters Recognition Liqun Wang1, a * and Honghui Fan2 6th International Conference on Electronic, Mechanical, Information and Management (EMIM 2016) Application of Geometry Rectification to Deformed Characters Liqun Wang1, a * and Honghui Fan2 1 School of

More information

Mobile Application with Optical Character Recognition Using Neural Network

Mobile Application with Optical Character Recognition Using Neural Network Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 1, January 2015,

More information

A two-stage approach for segmentation of handwritten Bangla word images

A two-stage approach for segmentation of handwritten Bangla word images A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department,

More information

Invarianceness for Character Recognition Using Geo-Discretization Features

Invarianceness for Character Recognition Using Geo-Discretization Features Computer and Information Science; Vol. 9, No. 2; 2016 ISSN 1913-8989 E-ISSN 1913-8997 Published by Canadian Center of Science and Education Invarianceness for Character Recognition Using Geo-Discretization

More information

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features Md. Abul Hasnat Center for Research on Bangla Language Processing (CRBLP) Center for Research on Bangla Language Processing

More information

Handwritten Character Recognition with Feedback Neural Network

Handwritten Character Recognition with Feedback Neural Network Apash Roy et al / International Journal of Computer Science & Engineering Technology (IJCSET) Handwritten Character Recognition with Feedback Neural Network Apash Roy* 1, N R Manna* *Department of Computer

More information

Spotting Words in Latin, Devanagari and Arabic Scripts

Spotting Words in Latin, Devanagari and Arabic Scripts Spotting Words in Latin, Devanagari and Arabic Scripts Sargur N. Srihari, Harish Srinivasan, Chen Huang and Shravya Shetty {srihari,hs32,chuang5,sshetty}@cedar.buffalo.edu Center of Excellence for Document

More information

Scene Text Detection Using Machine Learning Classifiers

Scene Text Detection Using Machine Learning Classifiers 601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department

More information

Khmer OCR for Limon R1 Size 22 Report

Khmer OCR for Limon R1 Size 22 Report PAN Localization Project Project No: Ref. No: PANL10n/KH/Report/phase2/002 Khmer OCR for Limon R1 Size 22 Report 09 July, 2009 Prepared by: Mr. ING LENG IENG Cambodia Country Component PAN Localization

More information

Texture recognition of medical images with the ICM method

Texture recognition of medical images with the ICM method Nuclear Instruments and Methods in Physics Research A 525 (2004) 387 391 Texture recognition of medical images with the ICM method Jason M. Kinser*, Guisong Wang George Mason University, Institute for

More information

Part-Based Skew Estimation for Mathematical Expressions

Part-Based Skew Estimation for Mathematical Expressions Soma Shiraishi, Yaokai Feng, and Seiichi Uchida shiraishi@human.ait.kyushu-u.ac.jp {fengyk,uchida}@ait.kyushu-u.ac.jp Abstract We propose a novel method for the skew estimation on text images containing

More information

An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text

An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text Sheikh Faisal Rashid 1, Faisal Shafait 2, and Thomas M. Breuel 1 1 Technical University of Kaiserslautern, Kaiserslautern,

More information

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE K. Kaviya Selvi 1 and R. S. Sabeenian 2 1 Department of Electronics and Communication Engineering, Communication Systems, Sona College

More information

CLASSIFICATION OF BOUNDARY AND REGION SHAPES USING HU-MOMENT INVARIANTS

CLASSIFICATION OF BOUNDARY AND REGION SHAPES USING HU-MOMENT INVARIANTS CLASSIFICATION OF BOUNDARY AND REGION SHAPES USING HU-MOMENT INVARIANTS B.Vanajakshi Department of Electronics & Communications Engg. Assoc.prof. Sri Viveka Institute of Technology Vijayawada, India E-mail:

More information

Word-wise Hand-written Script Separation for Indian Postal automation

Word-wise Hand-written Script Separation for Indian Postal automation Word-wise Hand-written Script Separation for Indian Postal automation K. Roy U. Pal Dept. of Comp. Sc. & Engg. West Bengal University of Technology, Sector 1, Saltlake City, Kolkata-64, India Abstract

More information

Locating 1-D Bar Codes in DCT-Domain

Locating 1-D Bar Codes in DCT-Domain Edith Cowan University Research Online ECU Publications Pre. 2011 2006 Locating 1-D Bar Codes in DCT-Domain Alexander Tropf Edith Cowan University Douglas Chai Edith Cowan University 10.1109/ICASSP.2006.1660449

More information

Finger Print Enhancement Using Minutiae Based Algorithm

Finger Print Enhancement Using Minutiae Based Algorithm Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 8, August 2014,

More information

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 17 (2014), pp. 1839-1845 International Research Publications House http://www. irphouse.com Recognition of

More information

I. INTRODUCTION. Figure-1 Basic block of text analysis

I. INTRODUCTION. Figure-1 Basic block of text analysis ISSN: 2349-7637 (Online) (RHIMRJ) Research Paper Available online at: www.rhimrj.com Detection and Localization of Texts from Natural Scene Images: A Hybrid Approach Priyanka Muchhadiya Post Graduate Fellow,

More information

A Review on Handwritten Character Recognition

A Review on Handwritten Character Recognition IJCST Vo l. 8, Is s u e 1, Ja n - Ma r c h 2017 ISSN : 0976-8491 (Online) ISSN : 2229-4333 (Print) A Review on Handwritten Character Recognition 1 Anisha Sharma, 2 Soumil Khare, 3 Sachin Chavan 1,2,3 Dept.

More information

A Model-based Line Detection Algorithm in Documents

A Model-based Line Detection Algorithm in Documents A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,

More information

NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL

NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL E.Kavallieratou N.Fakotakis G.Kokkinakis Wire Communication Laboratory, University of Patras, 26500 Patras, ergina@wcl.ee.upatras.gr

More information

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of

More information

HCR Using K-Means Clustering Algorithm

HCR Using K-Means Clustering Algorithm HCR Using K-Means Clustering Algorithm Meha Mathur 1, Anil Saroliya 2 Amity School of Engineering & Technology Amity University Rajasthan, India Abstract: Hindi is a national language of India, there are

More information

A New Algorithm for Detecting Text Line in Handwritten Documents

A New Algorithm for Detecting Text Line in Handwritten Documents A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer

More information

Image Normalization and Preprocessing for Gujarati Character Recognition

Image Normalization and Preprocessing for Gujarati Character Recognition 334 Image Normalization and Preprocessing for Gujarati Character Recognition Jayashree Rajesh Prasad Department of Computer Engineering, Sinhgad College of Engineering, University of Pune, Pune, Mahaashtra

More information

SCALE INVARIANT TEMPLATE MATCHING

SCALE INVARIANT TEMPLATE MATCHING Volume 118 No. 5 2018, 499-505 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu SCALE INVARIANT TEMPLATE MATCHING Badrinaathan.J Srm university Chennai,India

More information

On-line handwriting recognition using Chain Code representation

On-line handwriting recognition using Chain Code representation On-line handwriting recognition using Chain Code representation Final project by Michal Shemesh shemeshm at cs dot bgu dot ac dot il Introduction Background When one preparing a first draft, concentrating

More information

2009 International Conference on Emerging Technologies

2009 International Conference on Emerging Technologies 2009 International Conference on Emerging Technologies A Self Organizing Map Based Urdu Nasakh Character Recognition Syed Afaq Hussain *, Safdar Zaman ** and Muhammad Ayub ** afaq.husain@mail.au.edu.pk,

More information

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts

A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts 25 A Brief Study of Feature Extraction and Classification Methods Used for Character Recognition of Brahmi Northern Indian Scripts Rohit Sachdeva, Asstt. Prof., Computer Science Department, Multani Mal

More information

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852

IJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 1.852 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY INTELLEGENT APPROACH FOR OFFLINE SIGNATURE VERIFICATION USING CHAINCODE AND ENERGY FEATURE EXTRACTION ON MULTICORE PROCESSOR Raju

More information

Optical Character Recognition

Optical Character Recognition Optical Character Recognition Jagruti Chandarana 1, Mayank Kapadia 2 1 Department of Electronics and Communication Engineering, UKA TARSADIA University 2 Assistant Professor, Department of Electronics

More information

Time Stamp Detection and Recognition in Video Frames

Time Stamp Detection and Recognition in Video Frames Time Stamp Detection and Recognition in Video Frames Nongluk Covavisaruch and Chetsada Saengpanit Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand E-mail: nongluk.c@chula.ac.th

More information

Character Recognition Using Matlab s Neural Network Toolbox

Character Recognition Using Matlab s Neural Network Toolbox Character Recognition Using Matlab s Neural Network Toolbox Kauleshwar Prasad, Devvrat C. Nigam, Ashmika Lakhotiya and Dheeren Umre B.I.T Durg, India Kauleshwarprasad2gmail.com, devnigam24@gmail.com,ashmika22@gmail.com,

More information

Toward Part-based Document Image Decoding

Toward Part-based Document Image Decoding 2012 10th IAPR International Workshop on Document Analysis Systems Toward Part-based Document Image Decoding Wang Song, Seiichi Uchida Kyushu University, Fukuoka, Japan wangsong@human.ait.kyushu-u.ac.jp,

More information

Gradient-Angular-Features for Word-Wise Video Script Identification

Gradient-Angular-Features for Word-Wise Video Script Identification Gradient-Angular-Features for Word-Wise Video Script Identification Author Shivakumara, Palaiahnakote, Sharma, Nabin, Pal, Umapada, Blumenstein, Michael, Tan, Chew Lim Published 2014 Conference Title Pattern

More information

Hidden Loop Recovery for Handwriting Recognition

Hidden Loop Recovery for Handwriting Recognition Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of

More information

Radial Basis Function Neural Network Classifier

Radial Basis Function Neural Network Classifier Recognition of Unconstrained Handwritten Numerals by a Radial Basis Function Neural Network Classifier Hwang, Young-Sup and Bang, Sung-Yang Department of Computer Science & Engineering Pohang University

More information

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,

More information

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network International Journal of Computer Science & Communication Vol. 1, No. 1, January-June 2010, pp. 91-95 Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network Raghuraj

More information