Automatic Article Extraction in Old Newspapers Digitized Collections

Size: px
Start display at page:

Download "Automatic Article Extraction in Old Newspapers Digitized Collections"

Transcription

1 Automatic Article Extraction in Old Newspapers Digitized Collections David Hebert Pierrick Tranouez Thomas Palfray Thierry Paquet Stephane Nicolas ABSTRACT We present a complete method for article segmentation in old newspapers, which deals with complex layouts analysis of degraded documents. The designed workflow can process large amounts of documents and generates digital objects in METS/ALTO format in order to facilitate the indexing and the browsing of information in digital libraries. The analysis of the document image is performed by a two stages scheme. Pixels are labelled in a first stage with a Conditional Random Field model in order to intent to label the areas of interest with a low logical level. Then this first logical representation of the document content is analysed in a second stage to get a higher logical representation including article segmentation and reading order. This top-level structural analysis relies on the generation of an article separation grid applied recursively on the document image, allowing analysing any type of Manhattan page layout, even for complex structures with multiple columns and overlapping entities. This method which benefits from both a local analysis using a probabilistic model trained using machine learning procedures, and a more global structural analysis using recursive rules, is evaluated on a dataset of daily local press document images covering several time periods and different page layouts, to prove its effectiveness. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Copyright 20XX ACM X-XXXXX-XX-X/XX/XX...$ Keywords page layout analysis, information extraction from document images, logical structure, articles extraction in newspapers, document image labelling, conditional random field, structural analysis 1. INTRODUCTION During the last twenty years, the archives and national libraries worldwide have launched digitization programs of their historical collections in order to preserve them, and together providing remote access for a wider range of potential users thanks to the internet. Old newspapers archives are emblematic of this trend. However, digitization programs of such collections require indexing facilities of the document images obtained after scanning. Indeed, considering the large amount of documents these collections contain, and the large amount of articles each document can contain, smart indexing and retrieval is required. This is only conceivable through textual requires of users. This is why textual transcriptions of the digitized collections are necessary as well as deep understanding of the document structure so as to extract each article of the collection. Standard OCR technologies cannot fully automate the whole process of automatic transcription generation and page layout understanding of old newspaper, and a specific digitization process dedicated to complex page layout analysis, character recognition (OCR), logical structure detection and reading order detection. Physical structure extraction is the process that extract the physical structure Of the document such as columns and lines of text, prior to the OCR process. Logical structure analysis is the process that gives access to the information units of the document and its organization. It gives access to descriptors (logical tags also known as meta data) such as Title, Sub-Title, Chapter, Article, Paragraph, Captions, etc...

2 Generally, these two processes of document structure extraction (known as physical and logical layout analysis) operate separately and sequentially, one after the other. This is justified by the fact that most of the time physical segmentation of document images can be performed without the need for any additional knowledge, whereas logical layout extraction is generally performed thanks to the use of a document model (e.g. a style sheet) that express the relations between physical and logical entities of the two representations of the document. However, it is now well known that difficult segmentation tasks must incorporate a recognition stage so as to improve their performance. Therefore, we have developed a new methodology dedicated to logical labelling of old newspapers images. This method is intended to extract metadata in the images of the digitized, thanks to the joint use of a method of classification of sequence of pixels based on Conditional Random Field modelling, associated with a set of rules defining the concept of article within a newspaper. Based on physical descriptors, the pixel labelling perform directly a logical analysis that give us a first low logical level of segmentation. Then, based on the detected logical entities, the set of rules is able to bring a higher logical level to detect articles in a newspaper. In the first part of this paper we overview the related works in the literature. Then, the second part describes our method. The third part is dedicated to the evaluation of the approach which was tested on old newspaper issues from the Journal de Rouen, a regional French newspaper. Finally we conclude by a discussion about the potential of the method and future work. 2. RELATED WORK Since 2001 the ICDAR Conference is organizing a document page segmentation competition [2] in which some of the proposed algorithms may have goals similar to the system we propose in this paper. Nevertheless, the document dataset used for this competition contains only modern documents, therefore the proposed methods may be inefficient for old newspapers. Among the the contribution we can mention the work described in [1] which is based on pixel labelling stage which may be adaptable to old documents. In [3] an approach based on the detection of the maximal empty rectangles to delimit columns and text blocks is described. This method is integrated in the OCROPUS opensource OCR. Although interesting, this method does not cope with the difficulties inherent to old documents (skewing, deformations,...). A more interesting method taking into account these difficulties is described in [11]. The authors propose to use a multiscale approach to extract text blocks in old newspapers. This efficient method is limited to detect text blocks. No logical structure, neither reading order, is provided by this method. The approach we propose in this paper tries to bring a solution to this problem. 3. PROPOSED APPROACH The method we present in this paper has been implemented as a complete system dedicated to process a large amount of old newspaper document images. It automatically analyzes the logical organization of pages so as to extract articles. This extraction process has to make both physical and logical layout analysis. As opposed to the traditional approaches which separate physical and logical analysis, we have designed a methodology that performs logical analysis both at pixel level and at text blocs level. First, we proceed to a logical labelling of the image at pixel level which gives a low level representation of the logical structure to be extracted. This image processing stage embeds some knowledge about the physical organisation of the logical information to be extracted. It is based on a machine learning approach (Conditional Random Fields). The labelled image obtained is further analyzed during a second stage that recursively builds the logical entities of the document by referring to a generic layout model. This reconstruction stage can be viewed as a specific bidimensional parser that provides the parsed tree of the document image. Finally, the system provides XML METS/ALTO output files. These files contain both the logical structure describing the reading order of the articles, their physical layout composed of the detected text lines and their associated character recognized by an OCR. The following two paragraphs provide an in depth description of the two main steps of the methodology. 3.1 Logical labelling at pixel level The proposed method for article extraction in newspaper document images relies on a first segmentation stage using Conditional Random Fields model (CRF) with multiscale quantization feature functions. This approach has been presented in details in [8] and we recall its most important steps only. Conditional Random Fields (CRF) introduced in 2001 by Lafferty et al. [9] have opened a new way for sequence and image analysis. In its original formulation, a CRF is a stochastic model of a process that account for the dependencies between a sequence of discrete observations (originally a word sequence) and a sequence of labels that can be associated to these observations (originally Part Of Speech tags). The application of CRF to image labelling requires some adaptation so as to deal with numerical values instead of discrete observations. This adaptation can be viewed as a preprocessing step dedicated to providing the CRF discrete observations extracted from raw numerical pixel values of the image. In the field of computer vision [7, 10] use the outputs of a neural network or a SVM to feed a CRF. CRF have also been applied to document structure extraction. [12] uses a 2D-CRF based approach on top of a first neural Network classification stage. Another example can be found in [4] were the CRF model is introduced at a second stage after a pixel classification stage. Training these systems is particularly difficult be cause they require training two sub systems : a local classifier, and then a CRF that is fed with the output of the first classifier. In the proposed system, we use a conditional random field with multi-scale quantization feature functions [8]. Such an approach requires training one CRF only. Physical descriptors of the image made of Run Length features are quantized using several quantization functions and fed to the CRF. Let us define a linear quantization function with quantifier q that quantizes the continuous observation o as follows: Q(o, q) : 0 X o x = round( o q ) Assuming o is ranging within the interval [o min, o max] then the quantization function can take only Nd discrete values, with Nd = (o max o min)/q. To avoid a choice of the q value, we use multiple quantization functions. Let q 1, q 2,, q N be a set of quantizers, each

3 (a) Figure 1: The CRF logical labelling at pixel level for the image (a) defining a quantization function Q i(o) = Q(o, q i), then by choosing a dyadic law of quantifiers as follows q i = 2 q i 1 = q 1 2 i 1, we build a multi-scale quantization scheme with the ability to keep most of the original information contained in the continuous features without any assumption about the distribution of these features. With these multiscale quantizations, the CRF model can be written has in equation (1). p(y X) = 1 t=t Z(X) t=1 exp ( k=k k=1 λ k f k (y t 1, y t, Q 1(X),..., Q N (X), t) In the old newspaper article extraction workflow, we use a CRF with multiscale quantized feature functions on a set of physical descriptors made of vertical and horizontal run length, as described in [8]. We choose a first quantization step q1 = 2 with the dyadic law q i = 2 q i 1, giving on our images 10 quantization scales. One strength of a CRF model is its ability to deal with contextual information in the observation space. Concretely, a decision at the position t is not only made by taking only the observation at this position into account but also neighboring ones in a defined vicinity. The logical labelling use a sequential CRF along the horizontal direction with a context of two previous and two next observations around the current pixel to label. This system provides a fine segmentation of the image at pixel level, where each pixel is associated to a logical label specifying the logical function of the entity this pixel belongs to. The logical functions are the following ones : Vertical separator Horizontal separator Titles (composed of title characters and title interwords ) Text lines (composed of characters, inter-characters and inter-words ) Noise Background An example of the labelling result is given by the figure 1. This labelling stage provides a precise description of the ) (1) logical structure of the document at pixel level, but it does not provide the logical structure of the document. This is the goal of the second labelling stage that we present in the following paragraph. 3.2 Logical structure extraction Logical structure extraction from document images aims at producing the parsed tree of the document, where each node of the tree account for a particular entity of its logical organization e.g. Title, sub-title, paragraph, figure, caption, etcâă e In addition the order in which these entities are organized, namely the logical order, is associated. Logical structure extraction is the process that takes as input a raw level representation of the document at pixel level and provides a high level representation. Therefore, logical structure extraction can be viewed as a particular bidimensional parser. As already mentioned, this stage generally takes place after a first physical segmentation stage of the document. In the method that we propose here, we exploit both the physical properties of the image entities as well as their labelling that is produced by the CRF labelling stage. The labels are Title, Text, H separator and V separator. The approach is composed of two main steps : 1- the detection of labelled atomic entities in the image 2- the reconstruction process of higher level entities thanks to a generic document model. We now detail these two steps Labelled atomic entities detection The atomic entities on which the reconstruction process takes place are text lines, titles, horizontal and vertical separators. These entities are detected in the image thanks to the CRF labelling stage. First titles and text lines are detected by applying the following label merging rules : text line = characters + inter-characters + inter-words title = title characters + title inter-words Despite the very good performance of the CRF labelling stage, there are some pixel labelling inconsistencies that require to be detected and corrected before the reconstruction process can take place. Most of the inconsistencies occur when some entities with different labels are connected together (for example text entities and title entities being connected together). Such erroneous cases are corrected by labelling the entities involved with the most occurring label (background pixels are not considered). The text lines are obtained by extracting the connected components, which are labelled text in the resulting image. Despite of the robustness of the extraction process, some text lines may be connected because of some important deformation in the image due to some degradations or digitization artefacts. Possible connected test lines are detected by computing the average surface of the text lines in the whole document image. Then, text entities with a surface much higher than the mean surface, are considered erroneous. These situations are then corrected by a specific algorithm, which allows separating them. The results provided by this detection and labelling process are shown on figure 2. Structural entities such as text lines, titles, vertical and horizontal separators can then be extracted on the image Article extraction using a layout model The atomic entities of interest detected in the image match one of the following label: title, text, horizontal separator,

4 Figure 2: Labelled entities detection (a) Figure 3: (a): A section is the set of blocks of the same color. : The unambiguous reading order inside sections and vertical separator. Page layout of a newspaper follows some precise layout rules, namely a layout model. A page layout model is made of a precise description of the allowed spatial physical organization of articles as well as a description of the organization of an article. The model that was implemented in this study copes with complex multi-sections and multi-columns page layouts. A page is a section. A section can be composed of many sections that are organized sequentially, one below the other, or hierarchically one inside the other. They are separated with a large horizontal separator that span over the whole width of the section. Each section can contain multiple columns, with a variable number of columns between sections. Columns are separated with vertical separators that span over the whole height of the section. A section contains a sequence of articles (figure 3(a)). An article begins with a title entity followed by at least one text entity, and ends with an horizontal separator entity or another title entity. The detection of the spatial organization of pages in sections appears the key issue that further enables the detection of articles. Indeed, articles are organized sequentially within each section and are separated by titles and / or horizontal separators. Ar- Figure 4: text line The grid of cells containing at least on ticles extraction is therefore implemented following the two main steps. First, the physical grid constituted by vertical and horizontal separators is detected and text blocs are assigned to their surrounding cell. Section delimiter are also detected at the end of this process. Second, articles are detected easily as they are made of the text blocs delimited between successive titles and horizontal separators and following the reading order of the section. Detection of the separator grid and text blocks. The first step of our text blocks detection consists in extending all the entities that are logical separators. These are entities labelled Âń separator Âż and those that are labelled Âń title Âż. Extension of a horizontal (resp. vertical) separator consists in extending its width (resp. height) until touching the vertical (resp. horizontal) left and right (resp. top and bottom) separators. we apply the following steps sequentially: Create the vertical and horizontal separator mask Connect the neighbouring vertical separators Extend vertical separators as long as they do not cross a horizontal separator or a title Connect the neighbouring horizontal separators Extend horizontal separators and the titles as long as they do not cross a vertical separator At the end of this process, the grid covering the entire image is obtained and it will serve to extract the articles. For that purpose each grid cell is associated with the text lines it surrounds. The title entities are also associated to their surrounding grid cell. Cells containing no text line are removed. Finally we obtain a list of text blocks made of the remaining grid cells (figure 4). Reading order detection. Section detection is based on the detection of the horizontal section delimiters. These are the horizontal delimiters that span over multiple columns. Text blocs are grouped within each section they belong to, and they are organized sequentially following a top-down, left right reading order. By definition, a section follows this unambiguous reading order (see figure 3). Within each section, following the

5 (a) (c) (d) Figure 5: The final article segmentation Table 1: Results of the whole process of logical segmentation into articles #articles #detected #correct %correct %over-seg reading order, articles are made of the successive text blocs delimited between title entities and horizontal separators (figure 5) RESULTS Quantitative evaluation This method was tested on a dataset containing 42 document images issued from a French regional newspaper called Journal de Rouen. The results have been checked manually by visual inspection. We determined the article detection rate and the article over-segmentation rate. These results are given in table 1 below. The analysis of the errors produced shows that a great amount of them are due to labelling errors produced by the CRF segmentation stage. For example, 14 text lines have been labelled erroneously as title. This leads to the detection of 14 extra articles when applying the editorial,rules, which lead to detect 28 articles instead 14. Theses errors produce over-segmentation cases, most of the time. 4.2 Mass evaluation of the method The proposed method has been used extensively during the digitization process of the old newspaper Journal de Rouen documents made of 4 pages in average have been precessed by the workflow described in this paper. Theses documents are newspapers published between 1767 and The layout is not constant and evolves during this large period of time. The simplest one is given by the running example of this paper (figure 1(a)). Some examples of more complex layouts are shown on figures 6 and 7. Results can not be quantified but seem to follow the 85% found for the quantitative evaluation. Some regions of documents are more difficult to analyse such as documents with tables or document without any vertical separator between columns. These errors induced by missing separators can be avoided by evolving the separator definition. At this time, a separator is a physical entity but considering large white spaces as Figure 6: Some examples of layouts available in the collection and their associated article segmentation separators can solve these issues. 5. CONCLUSION AND FUTURE WORK We presented in this paper a logical segmentation method based on the analysis of low-level labelling results produced by a CRF model, using a set of rules defined by a generic layout model. The proposed method is able to segment the textual content of old newspapers with complex Manhattan structure (multi columns), using a little set of simple rules. We obtain with this method an article segmentation rate of 85.84% on a test dataset containing 42 images of Journal de Rouen, one of the oldest French regional newspapers. These first results are promising, and allow us to identify two main improvement issues. The first one consists in improving our CRF model because we noted that the majority of the errors come from this stage. In a second step we will improve further both the CRF model and the layout rules in order to be able to take into account some important other entities of the document structure, such as figures, pictures, captions and tables. 6. REFERENCES [1] C. An, D. Yin, and H. S. Baird. Document segmentation using pixel-accurate ground truth. In ICPR, pages IEEE, [2] A. Antonacopoulos, S. Pletschacher, D. Bridson, and C. Papadopoulos. Icdar 2009 page segmentation competition. In ICDAR, pages IEEE Computer Society, 2009.

6 (a) (c) (d) Figure 7: Some other examples of layouts available in the collection and their associated article segmentation [3] T. M. Breuel. Two geometric algorithms for layout analysis. In Proceedings of the 5th International Workshop on Document Analysis Systems V, DAS 02, pages , London, UK, UK, Springer-Verlag. [4] S. Chaudhury, M. Jindal, and S. Dutta Roy. Model-guided segmentation and layout labelling of document images using a hierarchical conditional random field. In Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence, PReMI 09, pages , Berlin, Heidelberg, Springer-Verlag. [5] T.-M.-T. Do and T. Artie res. Conditional Random Fields for Online Handwriting Recognition. In G. Lorette, editor, Tenth International Workshop on Frontiers in Handwriting Recognition, La Baule (France), Oct Universite de Rennes 1, Suvisoft. [6] S. Feng, R. Manmatha, and A. Mccallum. Exploring the use of conditional random field models and hmms for historical handwritten document recognition. In the Proceedings of the 2nd IEEE International Conference on Document Image Analysis for Libraries (DIAL, pages 30 37, Washington, DC, USA, [7] X. He, R. S. Zemel, and M. A. Carreira-perpinan. Multiscale conditional random fields for image labeling. In In CVPR, pages , [8] D. Hebert, T. Paquet, and S. Nicolas. Continuous crf with multi-scale quantization feature functions application to structure extraction in old newspaper. In International Conference on Document Analysis and Recognition (ICDAR), pages IEEE, March [9] J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, ICML 01, pages , San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. [10] C.-H. Lee, S. Wang, A. Murtha, M. R. G. Brown, and R. Greiner. Segmenting brain tumors using pseudo-conditional random fields. In D. N. Metaxas, L. Axel, G. Fichtinger, and G. SzA l kely, editors, MICCAI (1), volume 5241 of Lecture Notes in Computer Science, pages Springer, [11] A. Lemaitre, J. Camillerapp, and B. Couasnon. Approche perceptive pour la reconnaissance de filets bruites - application a la structuration de pages de journaux. In Actes du Xeme Colloque International Francophone sur l Ecrit et le Document, CIFED 08, pages 61 66, France, A. T. et Thierry Paquet (ed.). [12] S. Nicolas, J. Dardenne, T. Paquet, and L. Heutte. Document image segmentation using a 2d conditional random field model. In ICDAR, pages IEEE Computer Society, 2007.

The PAGE (Page Analysis and Ground-truth Elements) Format Framework

The PAGE (Page Analysis and Ground-truth Elements) Format Framework 2010,IEEE. Reprinted, with permission, frompletschacher, S and Antonacopoulos, A, The PAGE (Page Analysis and Ground-truth Elements) Format Framework, Proceedings of the 20th International Conference on

More information

Using a Neighbourhood Graph Based on Voronoï Tessellation with DMOS, a Generic Method for Structured Document Recognition

Using a Neighbourhood Graph Based on Voronoï Tessellation with DMOS, a Generic Method for Structured Document Recognition Using a Neighbourhood Graph Based on Voronoï Tessellation with DMOS, a Generic Method for Structured Document Recognition Aurélie Lemaitre 1,BertrandCoüasnon 1, and Ivan Leplumey 2 1 IRISA/INRIA, Campus

More information

Document Image Segmentation using Discriminative Learning over Connected Components

Document Image Segmentation using Discriminative Learning over Connected Components Document Image Segmentation using Discriminative Learning over Connected Components Syed Saqib Bukhari Technical University of bukhari@informatik.unikl.de Mayce Ibrahim Ali Al Azawi Technical University

More information

Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases

Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases Baptiste Poirriez, Aurélie Lemaitre, Bertrand Coüasnon To cite this version: Baptiste Poirriez,

More information

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014)

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014) I J E E E C International Journal of Electrical, Electronics ISSN No. (Online): 2277-2626 Computer Engineering 3(2): 85-90(2014) Robust Approach to Recognize Localize Text from Natural Scene Images Khushbu

More information

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction

N.Priya. Keywords Compass mask, Threshold, Morphological Operators, Statistical Measures, Text extraction Volume, Issue 8, August ISSN: 77 8X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Combined Edge-Based Text

More information

Historical Document Layout Analysis Competition

Historical Document Layout Analysis Competition 2011 International Conference on Document Analysis and Recognition Historical Document Layout Analysis Competition A. Antonacopoulos, C. Clausner, C. Papadopoulos and S. Pletschacher Pattern Recognition

More information

Learning to Segment Document Images

Learning to Segment Document Images Learning to Segment Document Images K.S. Sesh Kumar, Anoop Namboodiri, and C.V. Jawahar Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India Abstract.

More information

Ground Truth for Layout Analysis Performance Evaluation

Ground Truth for Layout Analysis Performance Evaluation Ground Truth for Layout Analysis Performance Evaluation A. Antonacopoulos 1, D. Karatzas 2, and D. Bridson 1 1 Pattern Recognition and Image Analysis (PRImA) Research Lab, School of Computing, Science

More information

On Segmentation of Documents in Complex Scripts

On Segmentation of Documents in Complex Scripts On Segmentation of Documents in Complex Scripts K. S. Sesh Kumar, Sukesh Kumar and C. V. Jawahar Centre for Visual Information Technology International Institute of Information Technology, Hyderabad, India

More information

Decoding the Text Encoding

Decoding the Text Encoding Decoding the Text Encoding Hamid Izadinia and Fereshteh Sadeghi Computer Science and Engineering University of Washington {izadinia,fsadeghi}@cs.washington.edu ABSTRACT Word clouds and text visualization

More information

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera

Recognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 17 (2014), pp. 1839-1845 International Research Publications House http://www. irphouse.com Recognition of

More information

Weighted Convolutional Neural Network. Ensemble.

Weighted Convolutional Neural Network. Ensemble. Weighted Convolutional Neural Network Ensemble Xavier Frazão and Luís A. Alexandre Dept. of Informatics, Univ. Beira Interior and Instituto de Telecomunicações Covilhã, Portugal xavierfrazao@gmail.com

More information

A typed and handwritten text block segmentation system for heterogeneous and complex documents

A typed and handwritten text block segmentation system for heterogeneous and complex documents A typed and handwritten text block segmentation system for heterogeneous and complex documents P. Barlas, S. Adam, C. Chatelain, T.Paquet Laboratoire LITIS - EA 4108, Universite de Rouen, FRANCE 76800

More information

Metrics for Complete Evaluation of OCR Performance

Metrics for Complete Evaluation of OCR Performance Int'l Conf. IP, Comp. Vision, and Pattern Recognition IPCV'18 23 Metrics for Complete Evaluation of OCR Performance R. Karpinski, D. Lohani, and A. Belaïd Université de Lorraine - LORIA, Vandoeuvre-Lès-Nancy,

More information

Text Recognition in Videos using a Recurrent Connectionist Approach

Text Recognition in Videos using a Recurrent Connectionist Approach Author manuscript, published in "ICANN - 22th International Conference on Artificial Neural Networks, Lausanne : Switzerland (2012)" DOI : 10.1007/978-3-642-33266-1_22 Text Recognition in Videos using

More information

CS 223B Computer Vision Problem Set 3

CS 223B Computer Vision Problem Set 3 CS 223B Computer Vision Problem Set 3 Due: Feb. 22 nd, 2011 1 Probabilistic Recursion for Tracking In this problem you will derive a method for tracking a point of interest through a sequence of images.

More information

Explicit fuzzy modeling of shapes and positioning for handwritten Chinese character recognition

Explicit fuzzy modeling of shapes and positioning for handwritten Chinese character recognition 2009 0th International Conference on Document Analysis and Recognition Explicit fuzzy modeling of and positioning for handwritten Chinese character recognition Adrien Delaye - Eric Anquetil - Sébastien

More information

Extending Page Segmentation Algorithms for Mixed-Layout Document Processing

Extending Page Segmentation Algorithms for Mixed-Layout Document Processing ScholarWorks Computer Science Faculty Publications and Presentations Department of Computer Science 9-18-2011 Extending Page Segmentation Algorithms for Mixed-Layout Document Processing Amy Winder Tim

More information

Recognition of Tables and Forms

Recognition of Tables and Forms Recognition of Tables and Forms Bertrand Coüasnon, Aurélie Lemaitre To cite this version: Bertrand Coüasnon, Aurélie Lemaitre. Recognition of Tables and Forms. Handbook of Document Image Processing and

More information

String distance for automatic image classification

String distance for automatic image classification String distance for automatic image classification Nguyen Hong Thinh*, Le Vu Ha*, Barat Cecile** and Ducottet Christophe** *University of Engineering and Technology, Vietnam National University of HaNoi,

More information

Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods *

Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods * 2011 International Conference on Document Analysis and Recognition Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods * C. Clausner, S. Pletschacher and A. Antonacopoulos

More information

Short Survey on Static Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of

More information

An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques

An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques An Objective Evaluation Methodology for Handwritten Image Document Binarization Techniques K. Ntirogiannis, B. Gatos and I. Pratikakis Computational Intelligence Laboratory, Institute of Informatics and

More information

Images Reconstruction using an iterative SOM based algorithm.

Images Reconstruction using an iterative SOM based algorithm. Images Reconstruction using an iterative SOM based algorithm. M.Jouini 1, S.Thiria 2 and M.Crépon 3 * 1- LOCEAN, MMSA team, CNAM University, Paris, France 2- LOCEAN, MMSA team, UVSQ University Paris, France

More information

ANALYTIC WORD RECOGNITION WITHOUT SEGMENTATION BASED ON MARKOV RANDOM FIELDS

ANALYTIC WORD RECOGNITION WITHOUT SEGMENTATION BASED ON MARKOV RANDOM FIELDS ANALYTIC WORD RECOGNITION WITHOUT SEGMENTATION BASED ON MARKOV RANDOM FIELDS CHRISTOPHE CHOISY AND ABDEL BELAID LORIA/CNRS Campus scientifique, BP 239, 54506 Vandoeuvre-les-Nancy cedex, France Christophe.Choisy@loria.fr,

More information

Handwritten Word Recognition using Conditional Random Fields

Handwritten Word Recognition using Conditional Random Fields Handwritten Word Recognition using Conditional Random Fields Shravya Shetty Harish Srinivasan Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) Department of Computer Science

More information

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane

More information

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models Gleidson Pegoretti da Silva, Masaki Nakagawa Department of Computer and Information Sciences Tokyo University

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes

Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes 2009 10th International Conference on Document Analysis and Recognition Fine Classification of Unconstrained Handwritten Persian/Arabic Numerals by Removing Confusion amongst Similar Classes Alireza Alaei

More information

Automatic visual recognition for metro surveillance

Automatic visual recognition for metro surveillance Automatic visual recognition for metro surveillance F. Cupillard, M. Thonnat, F. Brémond Orion Research Group, INRIA, Sophia Antipolis, France Abstract We propose in this paper an approach for recognizing

More information

Keywords Connected Components, Text-Line Extraction, Trained Dataset.

Keywords Connected Components, Text-Line Extraction, Trained Dataset. Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Language Independent

More information

A Comparison of Error Metrics for Learning Model Parameters in Bayesian Knowledge Tracing

A Comparison of Error Metrics for Learning Model Parameters in Bayesian Knowledge Tracing A Comparison of Error Metrics for Learning Model Parameters in Bayesian Knowledge Tracing Asif Dhanani Seung Yeon Lee Phitchaya Phothilimthana Zachary Pardos Electrical Engineering and Computer Sciences

More information

Symbol Detection Using Region Adjacency Graphs and Integer Linear Programming

Symbol Detection Using Region Adjacency Graphs and Integer Linear Programming 2009 10th International Conference on Document Analysis and Recognition Symbol Detection Using Region Adjacency Graphs and Integer Linear Programming Pierre Le Bodic LRI UMR 8623 Using Université Paris-Sud

More information

Text Detection in Indoor/Outdoor Scene Images

Text Detection in Indoor/Outdoor Scene Images Text Detection in Indoor/Outdoor Scene Images B. Gatos, I. Pratikakis, K. Kepene and S.J. Perantonis Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center

More information

A New Algorithm for Detecting Text Line in Handwritten Documents

A New Algorithm for Detecting Text Line in Handwritten Documents A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System

Mono-font Cursive Arabic Text Recognition Using Speech Recognition System Mono-font Cursive Arabic Text Recognition Using Speech Recognition System M.S. Khorsheed Computer & Electronics Research Institute, King AbdulAziz City for Science and Technology (KACST) PO Box 6086, Riyadh

More information

Automatic Detection of Change in Address Blocks for Reply Forms Processing

Automatic Detection of Change in Address Blocks for Reply Forms Processing Automatic Detection of Change in Address Blocks for Reply Forms Processing K R Karthick, S Marshall and A J Gray Abstract In this paper, an automatic method to detect the presence of on-line erasures/scribbles/corrections/over-writing

More information

TEI, METS and ALTO, why we need all of them. Günter Mühlberger University of Innsbruck Digitisation and Digital Preservation

TEI, METS and ALTO, why we need all of them. Günter Mühlberger University of Innsbruck Digitisation and Digital Preservation TEI, METS and ALTO, why we need all of them Günter Mühlberger University of Innsbruck Digitisation and Digital Preservation Agenda Introduction Problem statement Proposed solution Starting point Mass digitisation

More information

Using Maximum Entropy for Automatic Image Annotation

Using Maximum Entropy for Automatic Image Annotation Using Maximum Entropy for Automatic Image Annotation Jiwoon Jeon and R. Manmatha Center for Intelligent Information Retrieval Computer Science Department University of Massachusetts Amherst Amherst, MA-01003.

More information

Designing a Semantic Ground Truth for Mathematical Formulas

Designing a Semantic Ground Truth for Mathematical Formulas Designing a Semantic Ground Truth for Mathematical Formulas Alan Sexton 1, Volker Sorge 1, and Masakazu Suzuki 2 1 School of Computer Science, University of Birmingham, UK, A.P.Sexton V.Sorge@cs.bham.ac.uk,

More information

International Journal of Advance Research in Engineering, Science & Technology

International Journal of Advance Research in Engineering, Science & Technology Impact Factor (SJIF): 4.542 International Journal of Advance Research in Engineering, Science & Technology e-issn: 2393-9877, p-issn: 2394-2444 Volume 4, Issue 4, April-2017 A Simple Effective Algorithm

More information

Stacked Denoising Autoencoders for Face Pose Normalization

Stacked Denoising Autoencoders for Face Pose Normalization Stacked Denoising Autoencoders for Face Pose Normalization Yoonseop Kang 1, Kang-Tae Lee 2,JihyunEun 2, Sung Eun Park 2 and Seungjin Choi 1 1 Department of Computer Science and Engineering Pohang University

More information

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script

Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features

More information

CRFs for Image Classification

CRFs for Image Classification CRFs for Image Classification Devi Parikh and Dhruv Batra Carnegie Mellon University Pittsburgh, PA 15213 {dparikh,dbatra}@ece.cmu.edu Abstract We use Conditional Random Fields (CRFs) to classify regions

More information

Handwritten text segmentation using blurred image

Handwritten text segmentation using blurred image Handwritten text segmentation using blurred image Aurélie Lemaitre, Jean Camillerapp, Bertrand Coüasnon To cite this version: Aurélie Lemaitre, Jean Camillerapp, Bertrand Coüasnon. Handwritten text segmentation

More information

Human Motion Detection and Tracking for Video Surveillance

Human Motion Detection and Tracking for Video Surveillance Human Motion Detection and Tracking for Video Surveillance Prithviraj Banerjee and Somnath Sengupta Department of Electronics and Electrical Communication Engineering Indian Institute of Technology, Kharagpur,

More information

The Functional Extension Parser (FEP) A Document Understanding Platform

The Functional Extension Parser (FEP) A Document Understanding Platform The Functional Extension Parser (FEP) A Document Understanding Platform Günter Mühlberger University of Innsbruck Department for German Language and Literature Studies Introduction A book is more than

More information

Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation

Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation Po Yang, Apostolos Antonacopoulos, Christian Clausner and Stefan Pletschacher Pattern Recognition

More information

Query Difficulty Prediction for Contextual Image Retrieval

Query Difficulty Prediction for Contextual Image Retrieval Query Difficulty Prediction for Contextual Image Retrieval Xing Xing 1, Yi Zhang 1, and Mei Han 2 1 School of Engineering, UC Santa Cruz, Santa Cruz, CA 95064 2 Google Inc., Mountain View, CA 94043 Abstract.

More information

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 231A Computer Vision (Fall 2012) Problem Set 3 CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest

More information

Date Field Extraction from Handwritten Documents Using HMMs

Date Field Extraction from Handwritten Documents Using HMMs Date Field Extraction from Handwritten Documents Using HMMs Ranju Mandal, Partha Pratim Roy, Umapada Pal and Michael Blumenstein School of Information and Communication Technology, Griffith University,

More information

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation

A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in

More information

Fully Automatic Methodology for Human Action Recognition Incorporating Dynamic Information

Fully Automatic Methodology for Human Action Recognition Incorporating Dynamic Information Fully Automatic Methodology for Human Action Recognition Incorporating Dynamic Information Ana González, Marcos Ortega Hortas, and Manuel G. Penedo University of A Coruña, VARPA group, A Coruña 15071,

More information

1. Introduction 16 / 1 SEGMENTATION AND CLASSIFICATION OF DOCUMENT IMAGES. 2. Background. A Antonacopoulos and R T Ritchings

1. Introduction 16 / 1 SEGMENTATION AND CLASSIFICATION OF DOCUMENT IMAGES. 2. Background. A Antonacopoulos and R T Ritchings SEGMENTATION AND CLASSIFICATION OF DOCUMENT IMAGES A Antonacopoulos and R T Ritchings 1. Introduction There is a significant and growing need to convert documents from printed paper to an electronic fonn.

More information

Digital Archives: Extending the 5S model through NESTOR

Digital Archives: Extending the 5S model through NESTOR Digital Archives: Extending the 5S model through NESTOR Nicola Ferro and Gianmaria Silvello Department of Information Engineering, University of Padua, Italy {ferro, silvello}@dei.unipd.it Abstract. Archives

More information

Structured Completion Predictors Applied to Image Segmentation

Structured Completion Predictors Applied to Image Segmentation Structured Completion Predictors Applied to Image Segmentation Dmitriy Brezhnev, Raphael-Joel Lim, Anirudh Venkatesh December 16, 2011 Abstract Multi-image segmentation makes use of global and local features

More information

Generating Annotations for How-to Videos Using Crowdsourcing

Generating Annotations for How-to Videos Using Crowdsourcing Generating Annotations for How-to Videos Using Crowdsourcing Phu Nguyen MIT CSAIL 32 Vassar St. Cambridge, MA 02139 phun@mit.edu Abstract How-to videos can be valuable teaching tools for users, but searching

More information

The Processing of Form Documents

The Processing of Form Documents The Processing of Form Documents David S. Doermann and Azriel Rosenfeld Document Processing Group, Center for Automation Research University of Maryland, College Park 20742 email: doermann@cfar.umd.edu,

More information

Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation

Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation Discrete Dynamics in Nature and Society Volume 2008, Article ID 384346, 8 pages doi:10.1155/2008/384346 Research Article Image Segmentation Using Gray-Scale Morphology and Marker-Controlled Watershed Transformation

More information

Hidden Loop Recovery for Handwriting Recognition

Hidden Loop Recovery for Handwriting Recognition Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of

More information

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Video Inter-frame Forgery Identification Based on Optical Flow Consistency Sensors & Transducers 24 by IFSA Publishing, S. L. http://www.sensorsportal.com Video Inter-frame Forgery Identification Based on Optical Flow Consistency Qi Wang, Zhaohong Li, Zhenzhen Zhang, Qinglong

More information

Segmentation and labeling of documents using Conditional Random Fields

Segmentation and labeling of documents using Conditional Random Fields Segmentation and labeling of documents using Conditional Random Fields Shravya Shetty, Harish Srinivasan, Matthew Beal and Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR)

More information

Using temporal seeding to constrain the disparity search range in stereo matching

Using temporal seeding to constrain the disparity search range in stereo matching Using temporal seeding to constrain the disparity search range in stereo matching Thulani Ndhlovu Mobile Intelligent Autonomous Systems CSIR South Africa Email: tndhlovu@csir.co.za Fred Nicolls Department

More information

Hybrid Page Layout Analysis via Tab-Stop Detection

Hybrid Page Layout Analysis via Tab-Stop Detection 2009 10th International Conference on Document Analysis and Recognition Hybrid Page Layout Analysis via Tab-Stop Detection Ray Smith Google Inc. 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA.

More information

FIGURE DETECTION AND PART LABEL EXTRACTION FROM PATENT DRAWING IMAGES. Jaco Cronje

FIGURE DETECTION AND PART LABEL EXTRACTION FROM PATENT DRAWING IMAGES. Jaco Cronje FIGURE DETECTION AND PART LABEL EXTRACTION FROM PATENT DRAWING IMAGES Jaco Cronje Council for Scientific and Industrial Research, Pretoria, South Africa Email: jcronje@csir.co.za ABSTRACT The US Patent

More information

Identifying Layout Classes for Mathematical Symbols Using Layout Context

Identifying Layout Classes for Mathematical Symbols Using Layout Context Rochester Institute of Technology RIT Scholar Works Articles 2009 Identifying Layout Classes for Mathematical Symbols Using Layout Context Ling Ouyang Rochester Institute of Technology Richard Zanibbi

More information

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Partha Sarathi Giri Department of Electronics and Communication, M.E.M.S, Balasore, Odisha Abstract Text data

More information

Going digital Challenge & solutions in a newspaper archiving project. Andrey Lomov ATAPY Software Russia

Going digital Challenge & solutions in a newspaper archiving project. Andrey Lomov ATAPY Software Russia Going digital Challenge & solutions in a newspaper archiving project Andrey Lomov ATAPY Software Russia Problem Description Poor recognition results caused by low image quality: noise, white holes in characters,

More information

Recognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram

Recognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram Author manuscript, published in "International Conference on Computer Analysis of Images and Patterns - CAIP'2009 5702 (2009) 205-212" DOI : 10.1007/978-3-642-03767-2 Recognition-based Segmentation of

More information

A DIGITAL APPROACH TO HANDWRITTEN DOCUMENTS. B.I.T. - Bureau Ingénieur Tomasi

A DIGITAL APPROACH TO HANDWRITTEN DOCUMENTS. B.I.T. - Bureau Ingénieur Tomasi A DIGITAL APPROACH TO HANDWRITTEN DOCUMENTS B.I.T. - Bureau Ingénieur Tomasi Introduction Handwritten documents can for the most part not be read by computers today. Our technology such as it has been

More information

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network

Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network 139 Handwritten Gurumukhi Character Recognition by using Recurrent Neural Network Harmit Kaur 1, Simpel Rani 2 1 M. Tech. Research Scholar (Department of Computer Science & Engineering), Yadavindra College

More information

Integrating Visual and Textual Cues for Query-by-String Word Spotting

Integrating Visual and Textual Cues for Query-by-String Word Spotting Integrating Visual and Textual Cues for D. Aldavert, M. Rusiñol, R. Toledo and J. Lladós Computer Vision Center, Dept. Ciències de la Computació Edifici O, Univ. Autònoma de Barcelona, Bellaterra(Barcelona),

More information

Invarianceness for Character Recognition Using Geo-Discretization Features

Invarianceness for Character Recognition Using Geo-Discretization Features Computer and Information Science; Vol. 9, No. 2; 2016 ISSN 1913-8989 E-ISSN 1913-8997 Published by Canadian Center of Science and Education Invarianceness for Character Recognition Using Geo-Discretization

More information

Staff Line Detection by Skewed Projection

Staff Line Detection by Skewed Projection Staff Line Detection by Skewed Projection Diego Nehab May 11, 2003 Abstract Most optical music recognition systems start image analysis by the detection of staff lines. This work explores simple techniques

More information

DIFFERENTIAL IMAGE COMPRESSION BASED ON ADAPTIVE PREDICTION

DIFFERENTIAL IMAGE COMPRESSION BASED ON ADAPTIVE PREDICTION DIFFERENTIAL IMAGE COMPRESSION BASED ON ADAPTIVE PREDICTION M.V. Gashnikov Samara National Research University, Samara, Russia Abstract. The paper describes the adaptive prediction algorithm for differential

More information

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri Università degli Studi Roma TRE, Roma, Italy 2 nd Workshop on Light Fields for Computer

More information

A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points

A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points Tomohiro Nakai, Koichi Kise, Masakazu Iwamura Graduate School of Engineering, Osaka

More information

Available online at ScienceDirect. Procedia Computer Science 96 (2016 )

Available online at   ScienceDirect. Procedia Computer Science 96 (2016 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 96 (2016 ) 1409 1417 20th International Conference on Knowledge Based and Intelligent Information and Engineering Systems,

More information

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification

A System to Automatically Index Genealogical Microfilm Titleboards Introduction Preprocessing Method Identification A System to Automatically Index Genealogical Microfilm Titleboards Samuel James Pinson, Mark Pinson and William Barrett Department of Computer Science Brigham Young University Introduction Millions of

More information

arxiv: v1 [cs.cv] 9 Aug 2017

arxiv: v1 [cs.cv] 9 Aug 2017 Anveshak - A Groundtruth Generation Tool for Foreground Regions of Document Images Soumyadeep Dey, Jayanta Mukherjee, Shamik Sural, and Amit Vijay Nandedkar arxiv:1708.02831v1 [cs.cv] 9 Aug 2017 Department

More information

Implementation and Advanced Results on the Non-Interrupted Skeletonization Algorithm

Implementation and Advanced Results on the Non-Interrupted Skeletonization Algorithm Implementation and Advanced Results on the n-interrupted Skeletonization Algorithm Khalid Saeed, Mariusz Rybnik, Marek Tabedzki Computer Engineering Department Faculty of Computer Science Bialystok University

More information

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION

Chapter 7 UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION UNSUPERVISED LEARNING TECHNIQUES FOR MAMMOGRAM CLASSIFICATION Supervised and unsupervised learning are the two prominent machine learning algorithms used in pattern recognition and classification. In this

More information

Recent Progress on RAIL: Automating Clustering and Comparison of Different Road Classification Techniques on High Resolution Remotely Sensed Imagery

Recent Progress on RAIL: Automating Clustering and Comparison of Different Road Classification Techniques on High Resolution Remotely Sensed Imagery Recent Progress on RAIL: Automating Clustering and Comparison of Different Road Classification Techniques on High Resolution Remotely Sensed Imagery Annie Chen ANNIEC@CSE.UNSW.EDU.AU Gary Donovan GARYD@CSE.UNSW.EDU.AU

More information

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation

HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation 009 10th International Conference on Document Analysis and Recognition HMM-Based Handwritten Amharic Word Recognition with Feature Concatenation Yaregal Assabie and Josef Bigun School of Information Science,

More information

Use of Shape Deformation to Seamlessly Stitch Historical Document Images

Use of Shape Deformation to Seamlessly Stitch Historical Document Images Use of Shape Deformation to Seamlessly Stitch Historical Document Images Wei Liu Wei Fan Li Chen Jun Sun Satoshi Naoi In China, efforts are being made to preserve historical documents in the form of digital

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization

Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Processing, and Visualization Revealing the Modern History of Japanese Philosophy Using Digitization, Natural Language Katsuya Masuda *, Makoto Tanji **, and Hideki Mima *** Abstract This study proposes a framework to access to the

More information

CPPP/UFMS at ImageCLEF 2014: Robot Vision Task

CPPP/UFMS at ImageCLEF 2014: Robot Vision Task CPPP/UFMS at ImageCLEF 2014: Robot Vision Task Rodrigo de Carvalho Gomes, Lucas Correia Ribas, Amaury Antônio de Castro Junior, Wesley Nunes Gonçalves Federal University of Mato Grosso do Sul - Ponta Porã

More information

Panoramic Image Stitching

Panoramic Image Stitching Mcgill University Panoramic Image Stitching by Kai Wang Pengbo Li A report submitted in fulfillment for the COMP 558 Final project in the Faculty of Computer Science April 2013 Mcgill University Abstract

More information

Keyword Spotting in Document Images through Word Shape Coding

Keyword Spotting in Document Images through Word Shape Coding 2009 10th International Conference on Document Analysis and Recognition Keyword Spotting in Document Images through Word Shape Coding Shuyong Bai, Linlin Li and Chew Lim Tan School of Computing, National

More information

MULTI-TEMPORAL SAR CHANGE DETECTION AND MONITORING

MULTI-TEMPORAL SAR CHANGE DETECTION AND MONITORING MULTI-TEMPORAL SAR CHANGE DETECTION AND MONITORING S. Hachicha, F. Chaabane Carthage University, Sup Com, COSIM laboratory, Route de Raoued, 3.5 Km, Elghazala Tunisia. ferdaous.chaabene@supcom.rnu.tn KEY

More information

A Segmentation Free Approach to Arabic and Urdu OCR

A Segmentation Free Approach to Arabic and Urdu OCR A Segmentation Free Approach to Arabic and Urdu OCR Nazly Sabbour 1 and Faisal Shafait 2 1 Department of Computer Science, German University in Cairo (GUC), Cairo, Egypt; 2 German Research Center for Artificial

More information

Film Line scratch Detection using Neural Network and Morphological Filter

Film Line scratch Detection using Neural Network and Morphological Filter Film Line scratch Detection using Neural Network and Morphological Filter Kyung-tai Kim and Eun Yi Kim Dept. of advanced technology fusion, Konkuk Univ. Korea {kkt34, eykim}@konkuk.ac.kr Abstract This

More information

Key-Words: - A.M.A operation, Contextual exploration, Filimage system, scientific formulas extraction, semantic filtering.

Key-Words: - A.M.A operation, Contextual exploration, Filimage system, scientific formulas extraction, semantic filtering. Scientific Formulas Extraction Oriented Towards Web Summarizing File Cards SHAHNAZ BEHNAMI LaLICC (Language, Logic, Informatics, Cognition, Communication), CNRS, UMR 8139 Paris-Sorbonne University 96,

More information

Layout Segmentation of Scanned Newspaper Documents

Layout Segmentation of Scanned Newspaper Documents , pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms

More information

Using a Painting Metaphor to Rate Large Numbers

Using a Painting Metaphor to Rate Large Numbers 1 Using a Painting Metaphor to Rate Large Numbers of Objects Patrick Baudisch Integrated Publication and Information Systems Institute (IPSI) German National Research Center for Information Technology

More information