Stochastic Segment Modeling for Offline Handwriting Recognition

Size: px
Start display at page:

Download "Stochastic Segment Modeling for Offline Handwriting Recognition"

Transcription

1 th nternational Conference on Document Analysis and Recognition tochastic egment Modeling for Offline Handwriting Recognition Prem Natarajan, Krishna ubramanian, Anurag Bhardwaj, Rohit Prasad BBN Technologies, 10 Moulton t, Cambridge, MA {prem, Abstract n this paper, we present a novel approach for incorporating structural information into the hidden Markov Modeling (HMM) framework for offline handwriting recognition. Traditionally, structural features have been used in recognition approaches that rely on accurate segmentation of words into smaller units (sub-words or characters). However, such segmentation based approaches do not perform well on real-world handwritten images, because breaks and merges in glyphs typically create new connected components that are not observed in the training data. To mitigate the problem of having to derive accurate segmentation from connected components, we present a novel framework where the HMM based recognition system trained on shorter-span features is used to generate the 2-D character images (the tochastic egments ), and then another classifier that uses structural features extracted from the stochastic character segments generates a new set of scores. Finally, the scores from the HMM system and from structural matching are used in combination to generate a hypothesis that is better than the results from either the HMM or from structural matching alone. We demonstrate the efficacy of our approach by reporting experimental results on a large corpus of handwritten Arabic documents. 1. ntroduction Offline handwriting recognition (OHR) continues to be a challenging research problem due to a variety of reasons. Most recognition approaches that require accurate segmentation of the text into smaller units do not perform well on handwritten text. There are two primary causes for poor performance of segmentationbased approaches on real-world handwritten text. First, segmenting handwritten text for connected scripts such as Arabic is very difficult. econd, most real-world images are prone to degradations that result in breaks and merges in glyphs. This phenomenon creates new connected components that are not observed in training Anurag Bhardwaj contributed to this paper while he was a summer intern at BBN Technologies. data, and therefore the character classifier is unable to accurately recognize the glyphs. OHR systems based on hidden Markov models [1],[2],[3],[4] have been shown to outperform segmentation-based approaches. The advantage with HMM based systems is that they are segmentation-free, that is no pre-segmentation of word/line images into smaller units such as sub-words or characters is required. However, there are well-known limitations with HMM based approaches [5]. These limitations are due to two reasons: (a) the assumption of conditional independence of the observations given the state sequence, and (b) the restrictions on feature extraction imposed by frame-based observations. The limitations noted in [5] are also relevant to OHR systems as they use pixel-level features from narrow slices of the text. pecifically, the narrow windows provide very little contextual information making the conditional independence assumption in these systems unrealistic. n [6], we report on novel work on incorporating structural or longer-span shape features such as Gradient, tructure, and Concavity (GC) into the HMM framework. n that work, we use wider windows for extracting structural features than the standard shorter-span features. Following feature extraction, for each frame, the structural features are concatenated to the shorter-span feature vector for performing model training and recognition. nspired by the work in [5], we present a novel framework for combining structural matching and HMM-based recognition, which has more discriminative power than simply combining the structural and short-span features at each frame. n our framework, an HMM-based system trained on shorterspan features is used to generate the 2-D character images or stochastic segments, and then another classifier that uses structural features extracted from the stochastic character segments generates a new set of scores. The scores from the HMM system and the classifier operating on the structural features are combined to generate the best hypothesis /09 $ EEE DO /CDAR

2 2. tochastic egment Modeling n this section, we formulate the framework for stochastic segment modeling. We begin with the baseline HMM system, which is described as follows. Given a line image,, we extract features, X, using fixed size frames. These frames are typically narrow and capture short span features. The goal of the recognition task is to find a sequence of characters, that maximizes C X), the probability of a sequence of characters C given the feature vector sequence X. Using Bayes' rule, C X) may be written as: P ( C X ) X C) C) / X ) (1) n the equation above, X C) is the character glyph model which can be modeled by a hidden Markov models, and C) is the language model. We now introduce the segment model. A similar model used in speech recognition is the duration model [5]. Let be a segmentation of the image into its constituent characters C. Using Bayesian inference, we can write C ) as: C ) ) X ) C, X ) X ) C ) X ) C ) W X ) C ) C X ) W X ) C ) C X ) W C) (2) n Equation 2, C ) is the score from a VM, NN or other classifier. C X) is the posterior from a HMM. This score includes the character glyph score and also the grammar/language model score. W C) is a character width model. This can be estimated separately from training data as a distribution of normalized width. n deriving equation 2, we assert that C,X) C ); i.e., once is available to the classifier, X does not provide any more information. We also assert that X) W,C X); i.e., has two parts, the character hypothesis C and width W. Finally, we assert that W X) W C); i.e., once C is available, X provides no additional information. The introduction of C ) into equation 2 ensures a tight, interactive coupling between the long- and shortspan features. n order for Equation 2 to be computationally tractable, we restrict the set of possible segmentations,, to small perturbations around the maximum likelihood segmentation produced by the baseline HMM system. n our current paper, we use a simple VM to model C ) and show that even this simple model improves overall performance. 3. Baseline HMM-based OHR system Our HMM based recognizer is the BBN Byblos OCR system [4]. The feature extraction in BBN Byblos system involves horizontally segmenting the line image into frames and then computing a feature vector for each frame. The features used in the current baseline configuration include: Percentile of intensities, Angle, Correlation, and Energy, which we refer to as the PACE features [4]. The training module estimates multi-state, left-to-right HMMs for each character using EM algorithm for maximum likelihood training. The recognition module uses an efficient 2-pass n-best decoder [8]. The decoder performs a two-pass search using both the trained glyph models and the language models (typically a character or word n-gram model trained on a text corpus). For the forward pass of the decoder, a bigram language model is used to generate an initial lattice of hypotheses. The backward pass of the decoder uses an approximate trigram language model to generate a 1-best hypothesis as well as an n-best list or a word lattice. n this paper, we use glyph models that are trained to recognize characters. Ligatures are considered as independent characters and are modeled as such. 4. Design for tochastic egment Modeling The stochastic segment modeling framework involves the following key steps for performing recognition: 1. tochastic egment Generation: First, we generate a set of recognition hypotheses using the HMM system trained on short-span features. Then, for each hypothesis, we extract stochastic segments (2-D character images) using the character segmentation provided by the HMM. 2. egmental Classifier/corer: We extract structural features that represent shape characteristics of the character. Then, we compute a score for each character in the hypothesis using a classifier trained on the stochastic segments from the training data. n this paper, we use support vector machines (VM) as the segmental classifier. 3. core Combination from HMM and egmental Model: We use the score from the HMM and VM for each hypothesis to generate the best hypothesis. 972

3 Figure 1: Example showing the stochastic segmentation of characters in a line image. n the following we describe each of the above steps. n addition, we present the N-best rescoring procedure that we used to evaluate the efficacy of M tochastic egment Generation HMM-based OHR system produces the segmentation into characters as a byproduct of the training and recognition. Therefore, extraction of stochastic segments simply involves using the pixel position from the HMM segments to extract 2-D character images. n Figure 1, we show the segmentation of a line into constituent characters as determined by the HMM system. Figure 2. llustration of the rescoring procedure in which the VM scores are combined with the Glyph egmental Modeling using VMs We use VMs for modeling the stochastic segments using the Gradient-tructure-Concavity (GC) features that have been shown to be effective for offline handwriting recognition [6],[7]. GC features are symbolic, multi-resolution features that combine three different attributes of the shape of a character the gradient representing the local orientation of strokes; structural features that extend the gradient to longer distances and provide information about stroke trajectories; and concavity that captures stroke relationships at long distances. We used the character labels and the GC features extracted from stochastic segments to train a multiclass VM with the radial basis function (RBF) Kernel. The multi-class VM is trained such that, given a GC feature set extracted from a stochastic segment, it outputs a vector of probabilities, each of which is the probability of the given feature set belonging to a particular character. normalized GC feature belonging to the class of its associated character. For generating the composite score for each hypothesis from the segmental model, we compute the geometric mean of the VM scores from each character and use the logarithm of this score as the final VM score. Finally, we use the score from the HMM, the score from the VM, and other scores such as language model probabilities, etc. to compute an overall composite score that is used to re-rank the nbest hypothesis list. 5. Experimental Results n this section, we describe two experiments that were performed using a M built using an VM classifier and GC features. The first experiment was designed to evaluate: (a) the quality of the character segmentation from the HMM, and (b) the classification accuracy of VMs using GC features. The second experiment uses the VM/GC as a segmental scorer for rescoring n-best list together with scores generated from the HMM N-best Rescoring with M Framework A schematic diagram of the N-best rescoring procedure employed for evaluating the efficacy of our proposed stochastic segment modeling framework is shown in Figure 2. Given a test line image, we first extract PACE features from it. These features are then used by the baseline HMM system to generate a set of n-best hypotheses and character segmentation for each hypothesis in the n-best list. Using the character segmentation we extract 2-D character images. The 2D images of characters are then used to extract GC features. The segmental model, in this case the multiclass VM, is then used to generate a probability of the 5.1. Corpus Description We used two sets of corpora in our experiments one corpus is from the Applied Media Analytics (AMA), which we refer to as the AMA corpus and the second one is from the Linguistic Data Consortium (LDC), which we refer to as the LDC corpus. The AMA corpus that we use in our experiments consists of Arabic handwritten documents provided by a diverse body of writers. The collection is based on a set of 200 documents with a variety of formats and layout styles. 973

4 et #mages #Writers Train Dev Test 48 4 Table 1: LDC data used for rescoring experiments The final collection contains a TFF scanned image of each page, an XML file for each document which contains writer and page metadata, the bounding box for each word in the document in pixel coordinates, and a set of offsets representing PAWs (Parts of Arabic Word). We used a subset of the images, scanned at 300dpi for our experiments. This data set is used in our PAW classification experiments as described in ection 4.2. Our LDC corpus consisted of scanned image data of handwritten Arabic text from newswire articles, weblog posts and newsgroup posts, and the corresponding ground truth annotations including tokenized Arabic transcriptions and their English translations. t consists of 1250 images scanned at 300 dpi written by 14 different authors for training, development and testing purposes. n order to ensure a fair test set with no writer or document content in training, 229 images were held-out of the training and development sets. 125 images from this set were randomly chosen as the development set. A total of 48 images by 4 different authors constitute the test set. The details of the split are shown in the Table 1. This dataset is used in our rescoring experiments described in ection Classification using VMs n this section, we describe two experiments. The first experiment uses the VM classifier trained with GC features extracted from manually annotated PAWs in the AMA data set. The second experiment is designed to evaluate character classification accuracy using stochastic segments and also uses images from the AMA data set. n the first experiment, we used manually annotated Part-of-Arabic-Word (PAW) images and the corresponding PAW labels to train a VM classifier. The PAW images and labels were randomly chosen from the from the AMA corpus. We used the entire PAW image to extract features. A total of 6498 training samples from 34 PAW classes were used to train the classifier. The VM training setup is as described in ection 4.2 with the exception that we use PAW images instead of stochastic segments to extract features. The test set consists of 848 PAW images from the same set of 34 PAW classes. From the vector of probability scores produced by the VM for each class label, we chose the class with the highest probability as the classification label for the PAW image. The classification accuracy for this experiment was 82.8%. n the second experiment, we extract stochastic segments from word images from the AMA dataset and use the extracted character segments for training M, as described in ection 4.2. A total of 13,261 character Types of Units # classes Accuracy PAWs % tochastic 74.7% segments 40 Table 2. egment classification accuracy VM classifier. training samples from 40 character classes were used for training. This VM was then used to classify 3315 test samples and resulted in an overall accuracy of 74.7%. As shown in Table 2, the results on the PAW classification task were better than the results for the stochastic segment classification task. There could be several reasons for this degradation to have occurred. The PAWs are bigger than character segments on an average. As a result, PAWs contain more structural information than character segments. Another factor which could have resulted in the poorer performance for the character segment classification is that, since the character boundaries are stochastically determined, they may not have been very accurate or consistent, which could have resulted in features that do not have sharp distributions. We visually analyzed the extracted character images from stochastic segmentation and found them to be pretty accurate and consistent across many instances. n Figure 3, we show multiple instances of the same character appearing in different words written by different authors and see that the character segmentation is pretty consistent across these multiple instances. Hence we do not believe that the approximate stochastic segmentation is a major Figure 3: Each row shows different instances of the same character that has been stochastically segmented from the 974

5 contributor to the drop in accuracy for stochastic segments. Another factor which could have caused the drop in performance is the difference in the number of classes used in training between the two experiments. These results support our view that larger segments provide more structural information, which, when captured by a powerful feature extraction algorithm, can provide more discriminative features. These results also indicate that the GC features are capable of extracting long-span shape information N-best Rescoring with VM for mproving End-to-End Recognition Accuracy This experiment uses the VM using GC features extracted using stochastic segments to rescore an n-best list of hypotheses as described in ection 4.3. A detailed description for this experiment is provided in ection 3. The LDC corpus used for this experiment is described in ection 5.1. The amount of data used for training, development and validation is shown in Table 1. All the training data from the LDC corpus was used for training the baseline HMM system. The VM was trained using 900 randomly chosen stochastic character images for each character class. The results for this experiment, along with the results for the baseline experiment are shown in Table 3. The only difference between the two experiments is the addition of the VM scores for rescoring. The two experiments are otherwise identical. From Table 3, we see that the addition of the VM scores for rescoring improves overall system performance by 2.3% absolute. cores used for Rescoring WER (%) Glyph + LM 55.1 Glyph + LM + M 52.8 Table 3: Results from using the M for N-best rescoring. 5. Conclusions and Future work n this paper, we have described a powerful framework to integrate long-range shape information with shortspan features. Our framework is built on top of an existing HMM system and is segmentation-free, i.e. all needed segmentations are stochastically inferred. We also presented initial experimental results in which we used the stochastic segment models for N-best rescoring. The use of stochastic segment modeling resulted for rescoring gave a 2.3% gain over the baseline HMM system. Future work will involve use of lattices for stochastic segment model based rescoring as well as considering multiple segmentations instead of the 1-best segmentation into characters. 6. Acknowledgments We would like to thank Dr. Venu Govindaraju, Dr. rirangaraj etlur, and Dr. Zhixin hi at the Center for Unified Biometrics and ensors at UNY Buffalo for providing the GC feature extraction library for this research. We would also like to thank Dr. Huping Li t Applied Media Analytics, nc. and Dr. David Doermann from University of Maryland for providing us the AMA corpus. 7. References [1] C. Mokbel, H. Abi Akl, H. Greige. Automatic speech recognition of arabic digits over Telefone network.,proc. nt'l Conf. Research Trends in cience and Technology, [2] V. Märgner, M. Pechwitz, and H. ElAbed. CDAR 2005 Arabic Handwriting Recognition Competition. Proc. nt'l Conf. Document Analysis and Recognition, pp , [3] V. Märgner, M. Pechwitz, and H. ElAbed. Arabic Handwriting Recognition Competition. Proc. nt'l Conf. Document Analysis and Recognition, pp , [4] P. Natarajan,. aleem, R. Prasad, E. MacRostie, K. ubramanian, Multi-lingual Offline Handwriting Recognition Using Hidden Markov Models: A cript- ndependent Approach, pringer Book Chapter on Arabic and Chinese Handwriting Recognition, N: , Vol. 4768, pp , March [5] M. Ostendorf, V. V. Digalakis and O. A. Kimball. From HMM s to egment Models: A Unified View of tochastic Modeling for peech Recognition. EEE Transactions on peech and Audio Processing, 4(5): , 1996 [6]. aleem, K. ubramanian, M. Kamali, R. Prasad, P. Natarajan, mprovements in BBN s HMM-based Offline Arabic Handwritten Recognition ystem, submitted to CDAR [7] J. T. Favata and G. rikantan. A multiple feature/resolution approach to handprinted digit and character recognition. nternationaljournal of maging ystems Technology, 7: , [8]. Austin, R. chwartz and P. Placeway, The forwardbackward search algorithm, EEE nt. Conf. Acoustics, peech, ignal Processing, Toronto, Canada, Vol. V, 1991, pp

(12) U nited States Patent (to) Patent No.: US 8,644,611 B2 Natarajan et al. (45) Date of Patent: Feb. 4,2014

(12) U nited States Patent (to) Patent No.: US 8,644,611 B2 Natarajan et al. (45) Date of Patent: Feb. 4,2014 US008644611B2 (12) U nited States Patent (to) Patent No.: Natarajan et al. (4) Date of Patent: Feb. 4,14 (4) SEGMENTAL RESCORING IN TEXT RECOGNITION (7) Inventors: Premkumar Natarajan, Sudbury, MA (US);

More information

Gabor Features for Offline Arabic Handwriting Recognition

Gabor Features for Offline Arabic Handwriting Recognition Gabor Features for Offline Arabic Handwriting Recognition Jin Chen Lehigh University Bethlehem, PA 18015 jic207@cse.lehigh.edu Anurag Bhardwaj University of Buffalo Amherst, NY 14260 ab94@buffalo.edu Huaigu

More information

Handwritten Text Recognition

Handwritten Text Recognition Handwritten Text Recognition M.J. Castro-Bleda, S. España-Boquera, F. Zamora-Martínez Universidad Politécnica de Valencia Spain Avignon, 9 December 2010 Text recognition () Avignon Avignon, 9 December

More information

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition Théodore Bluche, Hermann Ney, Christopher Kermorvant SLSP 14, Grenoble October

More information

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy

A Simplistic Way of Feature Extraction Directed towards a Better Recognition Accuracy International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 3, Issue 7 (September 2012), PP. 43-49 A Simplistic Way of Feature Extraction Directed

More information

Handwritten Text Recognition

Handwritten Text Recognition Handwritten Text Recognition M.J. Castro-Bleda, Joan Pasto Universidad Politécnica de Valencia Spain Zaragoza, March 2012 Text recognition () TRABHCI Zaragoza, March 2012 1 / 1 The problem: Handwriting

More information

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Stefan Müller, Gerhard Rigoll, Andreas Kosmala and Denis Mazurenok Department of Computer Science, Faculty of

More information

Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models

Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models Open-Vocabulary Recognition of Machine-Printed Arabic Text Using Hidden Markov Models Irfan Ahmad 1,2,*, Sabri A. Mahmoud 1, and Gernot A. Fink 2 1 Information and Computer Science Department, KFUPM, Dhahran

More information

Spotting Words in Latin, Devanagari and Arabic Scripts

Spotting Words in Latin, Devanagari and Arabic Scripts Spotting Words in Latin, Devanagari and Arabic Scripts Sargur N. Srihari, Harish Srinivasan, Chen Huang and Shravya Shetty {srihari,hs32,chuang5,sshetty}@cedar.buffalo.edu Center of Excellence for Document

More information

Handwritten Word Recognition using Conditional Random Fields

Handwritten Word Recognition using Conditional Random Fields Handwritten Word Recognition using Conditional Random Fields Shravya Shetty Harish Srinivasan Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) Department of Computer Science

More information

Linguistic Resources for Arabic Handwriting Recognition

Linguistic Resources for Arabic Handwriting Recognition Linguistic Resources for Arabic Handwriting Recognition Stephanie M. Strassel Linguistic Data Consortium 3600 Market Street, Suite 810 Philadelphia, PA 19104 USA strassel@ldc.upenn.edu Abstract MADCAT

More information

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System Mono-font Cursive Arabic Text Recognition Using Speech Recognition System M.S. Khorsheed Computer & Electronics Research Institute, King AbdulAziz City for Science and Technology (KACST) PO Box 6086, Riyadh

More information

The Impact of Ruling Lines on Writer Identification

The Impact of Ruling Lines on Writer Identification The Impact of Ruling Lines on Writer Identification Jin Chen Lehigh University Bethlehem, PA 18015, USA jic207@cse.lehigh.edu Daniel Lopresti Lehigh University Bethlehem, PA 18015, USA lopresti@cse.lehigh.edu

More information

Initial Results in Offline Arabic Handwriting Recognition Using Large-Scale Geometric Features. Ilya Zavorin, Eugene Borovikov, Mark Turner

Initial Results in Offline Arabic Handwriting Recognition Using Large-Scale Geometric Features. Ilya Zavorin, Eugene Borovikov, Mark Turner Initial Results in Offline Arabic Handwriting Recognition Using Large-Scale Geometric Features Ilya Zavorin, Eugene Borovikov, Mark Turner System Overview Based on large-scale features: robust to handwriting

More information

On the use of Lexeme Features for writer verification

On the use of Lexeme Features for writer verification On the use of Lexeme Features for writer verification Anurag Bhardwaj, Abhishek Singh, Harish Srinivasan and Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) University

More information

Recognition of online captured, handwritten Tamil words on Android

Recognition of online captured, handwritten Tamil words on Android Recognition of online captured, handwritten Tamil words on Android A G Ramakrishnan and Bhargava Urala K Medical Intelligence and Language Engineering (MILE) Laboratory, Dept. of Electrical Engineering,

More information

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient

Isolated Curved Gurmukhi Character Recognition Using Projection of Gradient International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 6 (2017), pp. 1387-1396 Research India Publications http://www.ripublication.com Isolated Curved Gurmukhi Character

More information

Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting

Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting 2013 12th International Conference on Document Analysis and Recognition Learning-Based Candidate Segmentation Scoring for Real-Time Recognition of Online Overlaid Chinese Handwriting Yan-Fei Lv 1, Lin-Lin

More information

Linguistic Resources for Handwriting Recognition and Translation Evaluation

Linguistic Resources for Handwriting Recognition and Translation Evaluation Linguistic Resources for Handwriting Recognition and Translation Evaluation Zhiyi Song*, Safa Ismael*, Steven Grimes*, David Doermann, Stephanie Strassel* *Linguistic Data Consortium, University of Pennsylvania,

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information

Alternatives for Page Skew Compensation in Writer Identification

Alternatives for Page Skew Compensation in Writer Identification Alternatives for Page Skew Compensation in Writer Identification Jin Chen and Daniel Lopresti Department of Computer Science & Engineering Lehigh University Bethlehem, PA 18015, USA Email: {jic207, lopresti}@cse.lehigh.edu

More information

Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition

Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition Semantic Word Embedding Neural Network Language Models for Automatic Speech Recognition Kartik Audhkhasi, Abhinav Sethy Bhuvana Ramabhadran Watson Multimodal Group IBM T. J. Watson Research Center Motivation

More information

An Arabic Optical Character Recognition System Using Restricted Boltzmann Machines

An Arabic Optical Character Recognition System Using Restricted Boltzmann Machines An Arabic Optical Character Recognition System Using Restricted Boltzmann Machines Abdullah M. Rashwan, Mohamed S. Kamel, and Fakhri Karray University of Waterloo Abstract. Most of the state-of-the-art

More information

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation 009 10th International Conference on Document Analysis and Recognition HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation Yaregal Assabie and Josef Bigun School of Information Science,

More information

Non-uniform Slant Correction using Generalized Projections

Non-uniform Slant Correction using Generalized Projections I J C T A, 9(17) 2016, pp. 8489-8497 International Science Press Non-uniform Slant Correction using Generalized Projections A. M. Hafiz * and G. M. Bhat * ABSTRACT Slant Correction is an important component

More information

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition

A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition A Feature based on Encoding the Relative Position of a Point in the Character for Online Handwritten Character Recognition Dinesh Mandalapu, Sridhar Murali Krishna HP Laboratories India HPL-2007-109 July

More information

Hidden Loop Recovery for Handwriting Recognition

Hidden Loop Recovery for Handwriting Recognition Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of

More information

Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition

Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition Comparing Natural and Synthetic Training Data for Off-line Cursive Handwriting Recognition Tamás Varga and Horst Bunke Institut für Informatik und angewandte Mathematik, Universität Bern Neubrückstrasse

More information

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques

Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques Segmentation of Kannada Handwritten Characters and Recognition Using Twelve Directional Feature Extraction Techniques 1 Lohitha B.J, 2 Y.C Kiran 1 M.Tech. Student Dept. of ISE, Dayananda Sagar College

More information

Local Segmentation of Touching Characters using Contour based Shape Decomposition

Local Segmentation of Touching Characters using Contour based Shape Decomposition 2012 10th IAPR International Workshop on Document Analysis Systems Local Segmentation of Touching Characters using Contour based Shape Decomposition Le Kang, David Doermann Institute for Advanced Computer

More information

A Hidden Markov Model for Alphabet Soup Word Recognition

A Hidden Markov Model for Alphabet Soup Word Recognition A Hidden Markov Model for Alphabet Soup Word Recognition Shaolei Feng 1 Nicholas R. Howe 2 R. Manmatha 1 1 University of Massachusetts, Amherst 2 Smith College Motivation: Inaccessible Treasures Historical

More information

Scanned Documents. LBSC 796/INFM 718R Douglas W. Oard Week 8, March 30, 2011

Scanned Documents. LBSC 796/INFM 718R Douglas W. Oard Week 8, March 30, 2011 Scanned Documents LBSC 796/INFM 718R Douglas W. Oard Week 8, March 30, 2011 Expanding the Search Space Scanned Docs Identity: Harriet Later, I learned that John had not heard High Payoff Investments

More information

Bag-of-Features Representations for Offline Handwriting Recognition Applied to Arabic Script

Bag-of-Features Representations for Offline Handwriting Recognition Applied to Arabic Script 2012 International Conference on Frontiers in Handwriting Recognition Bag-of-Features Representations for Offline Handwriting Recognition Applied to Arabic Script Leonard Rothacker, Szilárd Vajda, Gernot

More information

Content-based Information Retrieval from Handwritten Documents

Content-based Information Retrieval from Handwritten Documents Content-based Information Retrieval from Handwritten Documents Sargur Srihari, Chen Huang and Harish Srinivasan Center of Excellence for Document Analysis and Recognition (CEDAR) University at Buffalo,

More information

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015

NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: VOLUME 2, ISSUE 1 JAN-2015 Offline Handwritten Signature Verification using Neural Network Pallavi V. Hatkar Department of Electronics Engineering, TKIET Warana, India Prof.B.T.Salokhe Department of Electronics Engineering, TKIET

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients

Three-Dimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients ThreeDimensional Object Detection and Layout Prediction using Clouds of Oriented Gradients Authors: Zhile Ren, Erik B. Sudderth Presented by: Shannon Kao, Max Wang October 19, 2016 Introduction Given an

More information

A Segmentation Free Approach to Arabic and Urdu OCR

A Segmentation Free Approach to Arabic and Urdu OCR A Segmentation Free Approach to Arabic and Urdu OCR Nazly Sabbour 1 and Faisal Shafait 2 1 Department of Computer Science, German University in Cairo (GUC), Cairo, Egypt; 2 German Research Center for Artificial

More information

One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition

One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition One Dim~nsional Representation Of Two Dimensional Information For HMM Based Handwritten Recognition Nafiz Arica Dept. of Computer Engineering, Middle East Technical University, Ankara,Turkey nafiz@ceng.metu.edu.

More information

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (IJECIERD) ISSN 2249-684X Vol.2, Issue 3 Sep 2012 27-37 TJPRC Pvt. Ltd., HANDWRITTEN GURMUKHI

More information

A New Algorithm for Detecting Text Line in Handwritten Documents

A New Algorithm for Detecting Text Line in Handwritten Documents A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer

More information

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation.

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation. Equation to LaTeX Abhinav Rastogi, Sevy Harris {arastogi,sharris5}@stanford.edu I. Introduction Copying equations from a pdf file to a LaTeX document can be time consuming because there is no easy way

More information

CS 231A Computer Vision (Winter 2014) Problem Set 3

CS 231A Computer Vision (Winter 2014) Problem Set 3 CS 231A Computer Vision (Winter 2014) Problem Set 3 Due: Feb. 18 th, 2015 (11:59pm) 1 Single Object Recognition Via SIFT (45 points) In his 2004 SIFT paper, David Lowe demonstrates impressive object recognition

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

Time series, HMMs, Kalman Filters

Time series, HMMs, Kalman Filters Classic HMM tutorial see class website: *L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proc. of the IEEE, Vol.77, No.2, pp.257--286, 1989. Time series,

More information

A Model-based Line Detection Algorithm in Documents

A Model-based Line Detection Algorithm in Documents A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,

More information

Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition

Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition 2012 International Conference on Frontiers in Handwriting Recognition Comparison of Bernoulli and Gaussian HMMs using a vertical repositioning technique for off-line handwriting recognition Patrick Doetsch,

More information

A Model Selection Criterion for Classification: Application to HMM Topology Optimization

A Model Selection Criterion for Classification: Application to HMM Topology Optimization A Model Selection Criterion for Classification Application to HMM Topology Optimization Alain Biem IBM T. J. Watson Research Center P.O Box 218, Yorktown Heights, NY 10549, USA biem@us.ibm.com Abstract

More information

Keyword Spotting in Document Images through Word Shape Coding

Keyword Spotting in Document Images through Word Shape Coding 2009 10th International Conference on Document Analysis and Recognition Keyword Spotting in Document Images through Word Shape Coding Shuyong Bai, Linlin Li and Chew Lim Tan School of Computing, National

More information

A New COLD Feature based Handwriting Analysis for Enthnicity/Nationality Identification

A New COLD Feature based Handwriting Analysis for Enthnicity/Nationality Identification A New COLD Feature based Handwriting Analysis for Enthnicity/Nationality Identification 1 Sauradip Nag, 2 Palaiahnakote Shivakumara, 3 Wu Yirui, 4 Umapada Pal, and 5 Tong Lu 1 Kalyani Government Engineering

More information

Integrating Visual and Textual Cues for Query-by-String Word Spotting

Integrating Visual and Textual Cues for Query-by-String Word Spotting Integrating Visual and Textual Cues for D. Aldavert, M. Rusiñol, R. Toledo and J. Lladós Computer Vision Center, Dept. Ciències de la Computació Edifici O, Univ. Autònoma de Barcelona, Bellaterra(Barcelona),

More information

Document Image Restoration Using Binary Morphological Filters. Jisheng Liang, Robert M. Haralick. Seattle, Washington Ihsin T.

Document Image Restoration Using Binary Morphological Filters. Jisheng Liang, Robert M. Haralick. Seattle, Washington Ihsin T. Document Image Restoration Using Binary Morphological Filters Jisheng Liang, Robert M. Haralick University of Washington, Department of Electrical Engineering Seattle, Washington 98195 Ihsin T. Phillips

More information

Convolution Neural Networks for Chinese Handwriting Recognition

Convolution Neural Networks for Chinese Handwriting Recognition Convolution Neural Networks for Chinese Handwriting Recognition Xu Chen Stanford University 450 Serra Mall, Stanford, CA 94305 xchen91@stanford.edu Abstract Convolutional neural networks have been proven

More information

Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models

Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models Lattice Rescoring for Speech Recognition Using Large Scale Distributed Language Models ABSTRACT Euisok Chung Hyung-Bae Jeon Jeon-Gue Park and Yun-Keun Lee Speech Processing Research Team, ETRI, 138 Gajeongno,

More information

Invarianceness for Character Recognition Using Geo-Discretization Features

Invarianceness for Character Recognition Using Geo-Discretization Features Computer and Information Science; Vol. 9, No. 2; 2016 ISSN 1913-8989 E-ISSN 1913-8997 Published by Canadian Center of Science and Education Invarianceness for Character Recognition Using Geo-Discretization

More information

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents

Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Natural Language Inspired Approach for Handwritten Text Line Detection in Legacy Documents Vicente Bosch vbosch@iti.upv.es Alejandro Hector Toselli ahector@iti.upv.es Enrique Vidal evidal@iti.upv.es Pattern

More information

2. Basic Task of Pattern Classification

2. Basic Task of Pattern Classification 2. Basic Task of Pattern Classification Definition of the Task Informal Definition: Telling things apart 3 Definition: http://www.webopedia.com/term/p/pattern_recognition.html pattern recognition Last

More information

Confidence Measures: how much we can trust our speech recognizers

Confidence Measures: how much we can trust our speech recognizers Confidence Measures: how much we can trust our speech recognizers Prof. Hui Jiang Department of Computer Science York University, Toronto, Ontario, Canada Email: hj@cs.yorku.ca Outline Speech recognition

More information

Identifying Layout Classes for Mathematical Symbols Using Layout Context

Identifying Layout Classes for Mathematical Symbols Using Layout Context Rochester Institute of Technology RIT Scholar Works Articles 2009 Identifying Layout Classes for Mathematical Symbols Using Layout Context Ling Ouyang Rochester Institute of Technology Richard Zanibbi

More information

LECTURE 6 TEXT PROCESSING

LECTURE 6 TEXT PROCESSING SCIENTIFIC DATA COMPUTING 1 MTAT.08.042 LECTURE 6 TEXT PROCESSING Prepared by: Amnir Hadachi Institute of Computer Science, University of Tartu amnir.hadachi@ut.ee OUTLINE Aims Character Typology OCR systems

More information

Segmenting Handwritten Math Symbols Using AdaBoost and Multi-Scale Shape Context Features

Segmenting Handwritten Math Symbols Using AdaBoost and Multi-Scale Shape Context Features Segmenting Handwritten Math Symbols Using AdaBoost and Multi-Scale Shape Context Features Lei Hu Department of Computer Science Rochester Institute of Technology, USA lei.hu@rit.edu Richard Zanibbi Department

More information

Detecting Dense Foreground Stripes in Arabic Handwriting for Accurate Baseline Positioning

Detecting Dense Foreground Stripes in Arabic Handwriting for Accurate Baseline Positioning Detecting Dense Foreground Stripes in Arabic Handwriting for Accurate Baseline Positioning Felix Stahlberg Qatar Computing Research Institute, HBKU Doha, Qatar Email: fstahlberg@qf.org.qa Stephan Vogel

More information

Date Field Extraction from Handwritten Documents Using HMMs

Date Field Extraction from Handwritten Documents Using HMMs Date Field Extraction from Handwritten Documents Using HMMs Ranju Mandal, Partha Pratim Roy, Umapada Pal and Michael Blumenstein School of Information and Communication Technology, Griffith University,

More information

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques Sea Chen Department of Biomedical Engineering Advisors: Dr. Charles A. Bouman and Dr. Mark J. Lowe S. Chen Final Exam October

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

Available online at ScienceDirect. Procedia Computer Science 45 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 45 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 45 (2015 ) 205 214 International Conference on Advanced Computing Technologies and Applications (ICACTA- 2015) Automatic

More information

Discriminatory Power of Handwritten Words for Writer Recognition

Discriminatory Power of Handwritten Words for Writer Recognition Discriminatory Power of Handwritten Words for Writer Recognition Catalin I. Tomai, Bin Zhang and Sargur N. Srihari Department of Computer Science and Engineering, SUNY Buffalo Center of Excellence for

More information

An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text

An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text An evaluation of HMM-based Techniques for the Recognition of Screen Rendered Text Sheikh Faisal Rashid 1, Faisal Shafait 2, and Thomas M. Breuel 1 1 Technical University of Kaiserslautern, Kaiserslautern,

More information

Building Classifiers using Bayesian Networks

Building Classifiers using Bayesian Networks Building Classifiers using Bayesian Networks Nir Friedman and Moises Goldszmidt 1997 Presented by Brian Collins and Lukas Seitlinger Paper Summary The Naive Bayes classifier has reasonable performance

More information

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS 8.1 Introduction The recognition systems developed so far were for simple characters comprising of consonants and vowels. But there is one

More information

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of

More information

Accelerometer Gesture Recognition

Accelerometer Gesture Recognition Accelerometer Gesture Recognition Michael Xie xie@cs.stanford.edu David Pan napdivad@stanford.edu December 12, 2014 Abstract Our goal is to make gesture-based input for smartphones and smartwatches accurate

More information

Groundtruth Image Generation from Electronic Text (Demonstration)

Groundtruth Image Generation from Electronic Text (Demonstration) Groundtruth Image Generation from Electronic Text (Demonstration) David Doermann and Gang Zi Laboratory for Language and Media Processing, University of Maryland, College Park MD 21043, USA {doermann,

More information

WITH the increasing use of digital image capturing

WITH the increasing use of digital image capturing 800 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 3, MARCH 2011 A Hybrid Approach to Detect and Localize Texts in Natural Scene Images Yi-Feng Pan, Xinwen Hou, and Cheng-Lin Liu, Senior Member, IEEE

More information

Better Evaluation for Grammatical Error Correction

Better Evaluation for Grammatical Error Correction Better Evaluation for Grammatical Error Correction Daniel Dahlmeier 1 and Hwee Tou Ng 1,2 1 NUS Graduate School for Integrative Sciences and Engineering 2 Department of Computer Science, National University

More information

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network Utkarsh Dwivedi 1, Pranjal Rajput 2, Manish Kumar Sharma 3 1UG Scholar, Dept. of CSE, GCET, Greater Noida,

More information

Information Fusion Dr. B. K. Panigrahi

Information Fusion Dr. B. K. Panigrahi Information Fusion By Dr. B. K. Panigrahi Asst. Professor Department of Electrical Engineering IIT Delhi, New Delhi-110016 01/12/2007 1 Introduction Classification OUTLINE K-fold cross Validation Feature

More information

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models Gleidson Pegoretti da Silva, Masaki Nakagawa Department of Computer and Information Sciences Tokyo University

More information

NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL

NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL NEW ALGORITHMS FOR SKEWING CORRECTION AND SLANT REMOVAL ON WORD-LEVEL E.Kavallieratou N.Fakotakis G.Kokkinakis Wire Communication Laboratory, University of Patras, 26500 Patras, ergina@wcl.ee.upatras.gr

More information

A Novel Approach to On-Line Handwriting Recognition Based on Bidirectional Long Short-Term Memory Networks

A Novel Approach to On-Line Handwriting Recognition Based on Bidirectional Long Short-Term Memory Networks A Novel Approach to n-line Handwriting Recognition Based on Bidirectional Long Short-Term Memory Networks Marcus Liwicki 1 Alex Graves 2 Horst Bunke 1 Jürgen Schmidhuber 2,3 1 nst. of Computer Science

More information

Novel Sub-Character HMM Models for Arabic Text Recognition

Novel Sub-Character HMM Models for Arabic Text Recognition Published version: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6628700 Novel Sub-Character Models for Arabic Text Recognition Irfan Ahmad Information and Computer Science King Fahd University

More information

On-line handwriting recognition using Chain Code representation

On-line handwriting recognition using Chain Code representation On-line handwriting recognition using Chain Code representation Final project by Michal Shemesh shemeshm at cs dot bgu dot ac dot il Introduction Background When one preparing a first draft, concentrating

More information

WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P.

WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P. University of Groningen WORD LEVEL DISCRIMINATIVE TRAINING FOR HANDWRITTEN WORD RECOGNITION Chen, W.; Gader, P. Published in: EPRINTS-BOOK-TITLE IMPORTANT NOTE: You are advised to consult the publisher's

More information

Learning to Segment Document Images

Learning to Segment Document Images Learning to Segment Document Images K.S. Sesh Kumar, Anoop Namboodiri, and C.V. Jawahar Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India Abstract.

More information

Sum-Product Networks. STAT946 Deep Learning Guest Lecture by Pascal Poupart University of Waterloo October 15, 2015

Sum-Product Networks. STAT946 Deep Learning Guest Lecture by Pascal Poupart University of Waterloo October 15, 2015 Sum-Product Networks STAT946 Deep Learning Guest Lecture by Pascal Poupart University of Waterloo October 15, 2015 Introduction Outline What is a Sum-Product Network? Inference Applications In more depth

More information

Indian Multi-Script Full Pin-code String Recognition for Postal Automation

Indian Multi-Script Full Pin-code String Recognition for Postal Automation 2009 10th International Conference on Document Analysis and Recognition Indian Multi-Script Full Pin-code String Recognition for Postal Automation U. Pal 1, R. K. Roy 1, K. Roy 2 and F. Kimura 3 1 Computer

More information

HCR Using K-Means Clustering Algorithm

HCR Using K-Means Clustering Algorithm HCR Using K-Means Clustering Algorithm Meha Mathur 1, Anil Saroliya 2 Amity School of Engineering & Technology Amity University Rajasthan, India Abstract: Hindi is a national language of India, there are

More information

Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages

Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages Spoken Document Retrieval (SDR) for Broadcast News in Indian Languages Chirag Shah Dept. of CSE IIT Madras Chennai - 600036 Tamilnadu, India. chirag@speech.iitm.ernet.in A. Nayeemulla Khan Dept. of CSE

More information

IDIAP. Martigny - Valais - Suisse IDIAP

IDIAP. Martigny - Valais - Suisse IDIAP R E S E A R C H R E P O R T IDIAP Martigny - Valais - Suisse Off-Line Cursive Script Recognition Based on Continuous Density HMM Alessandro Vinciarelli a IDIAP RR 99-25 Juergen Luettin a IDIAP December

More information

A two-stage approach for segmentation of handwritten Bangla word images

A two-stage approach for segmentation of handwritten Bangla word images A two-stage approach for segmentation of handwritten Bangla word images Ram Sarkar, Nibaran Das, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri #, Dipak Kumar Basu Computer Science & Engineering Department,

More information

CRFs for Image Classification

CRFs for Image Classification CRFs for Image Classification Devi Parikh and Dhruv Batra Carnegie Mellon University Pittsburgh, PA 15213 {dparikh,dbatra}@ece.cmu.edu Abstract We use Conditional Random Fields (CRFs) to classify regions

More information

Writer Recognizer for Offline Text Based on SIFT

Writer Recognizer for Offline Text Based on SIFT Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 5, May 2015, pg.1057

More information

Introduction to HTK Toolkit

Introduction to HTK Toolkit Introduction to HTK Toolkit Berlin Chen 2003 Reference: - The HTK Book, Version 3.2 Outline An Overview of HTK HTK Processing Stages Data Preparation Tools Training Tools Testing Tools Analysis Tools Homework:

More information

Implicit segmentation of Kannada characters in offline handwriting recognition using hidden Markov models

Implicit segmentation of Kannada characters in offline handwriting recognition using hidden Markov models 1 Implicit segmentation of Kannada characters in offline handwriting recognition using hidden Markov models Manasij Venkatesh, Vikas Majjagi, and Deepu Vijayasenan arxiv:1410.4341v1 [cs.lg] 16 Oct 2014

More information

Search Engines. Information Retrieval in Practice

Search Engines. Information Retrieval in Practice Search Engines Information Retrieval in Practice All slides Addison Wesley, 2008 Beyond Bag of Words Bag of Words a document is considered to be an unordered collection of words with no relationships Extending

More information

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the

More information

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION

SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION SEVERAL METHODS OF FEATURE EXTRACTION TO HELP IN OPTICAL CHARACTER RECOGNITION Binod Kumar Prasad * * Bengal College of Engineering and Technology, Durgapur, W.B., India. Rajdeep Kundu 2 2 Bengal College

More information

Inkball Models as Features for Handwriting Recognition

Inkball Models as Features for Handwriting Recognition Inkball Models as Features for Handwriting Recognition Nicholas R. Howe Smith College Northampton, Massachusetts, USA Email: nhowe@smith.edu Andreas Fischer University of Fribourg 1700 Fribourg, Switzerland

More information

Hidden Markov Models. Gabriela Tavares and Juri Minxha Mentor: Taehwan Kim CS159 04/25/2017

Hidden Markov Models. Gabriela Tavares and Juri Minxha Mentor: Taehwan Kim CS159 04/25/2017 Hidden Markov Models Gabriela Tavares and Juri Minxha Mentor: Taehwan Kim CS159 04/25/2017 1 Outline 1. 2. 3. 4. Brief review of HMMs Hidden Markov Support Vector Machines Large Margin Hidden Markov Models

More information

Handwritten English Alphabet Recognition Using Bigram Cost Chengshu (Eric) Li Fall 2015, CS229, Stanford University

Handwritten English Alphabet Recognition Using Bigram Cost Chengshu (Eric) Li Fall 2015, CS229, Stanford University Handwritten English Alphabet Recognition Using Bigram Cost Chengshu (Eric) Li chengshu@stanford.edu Fall 2015, CS229, Stanford University Abstract: This paper describes a new approach to handwritten English

More information