A Labeling Approach for Mixed Document Blocks. A. Bela d and O. T. Akindele. Crin-Cnrs/Inria-Lorraine, B timent LORIA, Campus Scientique, B.P.
|
|
- Eustace Willis
- 6 years ago
- Views:
Transcription
1 A Labeling Approach for Mixed Document Blocks A. Bela d and O. T. Akindele Crin-Cnrs/Inria-Lorraine, B timent LORIA, Campus Scientique, B.P. 39, Vand uvre-l s-nancy Cedex. France. Abstract A block image labeling method is presented. It does not assume that the blocks to be treated are already segmented nor that they contain homogeneous data. It is based on connected component analysis to label the blocks' contents as small letter text, medium letter text, large letter text, graphics or photographs, giving the percentage of each of these components with respect to the surface area it occupies. It uses a recursive algorithm that allows one to improve on the result of segmentation. The performance of the method is given. 1 Introduction Block classication or labeling is an important and useful step in the document image recognition process. In this step, document image blocks extracted during the segmentation process, are classied into different categories such as: text, graphics, photographs, etc. depending on their contents. The labels given to blocks help in determining the type of treatment to be applied to each block during the analysis and understanding stage. There are two major approaches to block classication. In the rst approach, it is always assumed that blocks contain homogeneous data. This is the case of blocks found in composite documents such as scientic journals, newspapers, etc. Moreover, the segmentation methods employed use global spatial properties of regions to determine their frontiers, without taking into account their contents. Each block is classi- ed into the closest medium satisfying certain properties. These properties correspond mostly to statistical and textural features extracted from the block image. Among the methods in this approach, we can cite [] which uses a feature space partitioning technique to label newspaper image blocks, using regularity, abundance and width of spaces to classify a block as either a small letter, medium letter, large letter, graphics or photograph block. And also [3] that uses block size, block mean black pixel run length, density and eccentricity to classify blocks extracted with rlsa into text, graphics, halftone, horizontal line or vertical line blocks, exploiting the fact that text lines have approximately a constant and small height. In the second approach, it is assumed that a block contains a mixture of text and non-text (generally, text and graphics) such as in technical documents, tables, forms, etc. In this case, the methods employed separate text strings from non-text in the block. Some of these methods use connected component analysis to perform the text separation as in [1] where a Hough transform based algorithm is applied to group collinear connected components of similar size into logical text strings. Others are based on neighborhood line density which is suggestive to the extraction of graphics. In this paper, we describe a new labeling method that is able to locate and identify each type of data in a mixture of media in the same block. The method gives more detailed information than the previous methods and it can be used to improve on the results of the segmentation. It precisely gives the locations of each medium in the block as well as its percentage with respect to the surface area it occupies. Principle This method classies a block by giving the proportion of each of the following categories: small text, medium text, large text, graphics and photograph. It is based on connected components (cc's) analysis by studying for each set of cc's, the classes of spaces between them, as well as their sizes and regularity. The analysis is done in three steps. In the rst step, cc's are merged into sets of approximately aligned cc's. For example, a text line can be partitioned into three sets of cc's, the rst for accents and apostrophes, the second for letters and the third for punctuation. In this manner, two successive text lines are never merged, and large connected components are easily isolated. The cc's in each set
2 are analyzed individually if they are few, or globally otherwise. In the global analysis, the width of the cc's as well as the space between them are studied. If there are more than three types of spaces, the analysis is recursively applied to the two sets of cc's around the largest space (this allows the separation of two columns, for example). If there is a cc whose width is much more larger than those of the rest, it is separated and analyzed apart. If there is only one class of spaces and the regularity of the spaces is very strong, the cc set is taken as graphics, otherwise, it is considered as text. In the individual cc analysis, certain characteristics, such as density, height/width ratio, the percentage of horizontal black segments whose lengths are equal to the cc's width, etc., are extracted to determine the type of the cc. In the second step, the sets obtained in the previous step are globally analyzed with respect to their neighbors in order to either correct the errors of the previous classication or to merge similar sets into bigger ones. The last step is concerned with the calculation of the percentage of each category in the block. 3 Dierent Steps The document is deskewed if its skew-angle is greater than a certain degree harmful to horizontal alignment. After the extraction of cc's and the elimination of those considered as noise (i.e. those whose number of black pixels or surface area is less than an a priori xed threshold) we then proceed to merge them into bigger entities. The connected components are represented by the coordinates of the top left and the bottom right corners of their circumscribing rectangles, say [(x 1 ; y 1 ) (x ; y )]. They are extracted in ascending order of y. For equivalent y, they are obtained in ascending order of their x Fusion of Connected Components into Sets Two cc's are merged into the same set/line when they are approximately aligned, i.e. if the y- coordinates of their top left corners are not too far from each other, and likewise for the y-coordinates of their right bottom corners. The closeness of these coordinates is determined with the following rule: jy1? y 0 1j max[ (y?y1) max[ (y?y1) ; (y0?y0 1 ) ] ; (y0?y0 1 ) ] & jy? yj 0 It is to be noted that a line can be formed by cc's whose abscissas are far apart. With this method, it is possible to extract several line portions from a text line, and separate line portions that might likely be connected (above or below) to another line of text or graphics. 3. Fusion of Sets into Lines The line portions so formed are then merged into larger sets to obtain real text lines and to discard those that are not horizontally aligned. This is to avoid the merging of either the line of an underlined text with the text or two successive text lines. The fusion is performed if the circumscribing rectangles are very close in either the horizontal or vertical direction, or have a none empty intersection, or even overlap. This fusion of lines improves the results of the previous fusion (fusion of cc's). 3.3 Line Classication The classication of the formed lines is based on some coecients extracted from the constituting cc's (such as size, density and the percentage of the black segments whose width is approximately equal to that of the cc), as well as homogeneity of the spaces separating them. It is performed in two manners depending on the number of cc's in the lines. When there is only one cc, it is passed through a series of lters to determine its type. Otherwise, the line is either cut into smaller sets with respect to the homogeneity of spaces and sizes of its cc's or classied globally. The classication algorithm is given below Case of many cc's /* LHavg : Average Height of cc's in the Line, LWavg : Average Width of cc's in the Line, M IHslt : Minimum Height of small letter Text M AHslt : Maximum Height of small letter Text M IHmlt : Minimum Height of medium letter Text M AHmlt : Maximum Height of medium letter Text M IHllt : Minimum Height of large letter Text M AHllt : Maximum Height of large letter Text */ if LHavg < M IHslt /* very small average height of cc's */ then line_type = graphics else calculate N Bsc /* number of space_classes */ if N Bsc 3 then /* non regular spaces between cc's */ cut the line into two at the largest space; recall the classication on each sub_line else /* regular & more or less regular spaces */ if largest cc 4 LWavg then /* a cc dierent from the others */ cut the line around the largest cc (on the right and on the left);
3 largest cc recall the classication on the sub_lines and the else /* cc's of regular sizes and spaces */ if N Bsc = 1 & LHavg M IHslt then classify each cc individually; line_type = type of the majority else text : small if M IHslt LHavg M AHslt : medium if M IHmlt LHavg M AHmlt : large if M IHllt LHavg M AHllt 3.3. Case of a single cc In this case, the cc is passed through a series of lters, on the basis of attributes extracted from it, until its type is obtained. In all, there are sixteen lters which are applied in order. There are many thresholds used in these lters, but these are determined before hand during a learning stage on many kinds of documents, thus assuring their stability. The lters are given below. F1 if density < minimum density of photograph then graphics F if No. of segments (whose width 6= that of cc) < a certain threshold then if vertically extended black block (1, I) then text else graphics F3 if low density and extended block then graphics F4 if eccentricity is between that of text and photograph and high density then if the height is important than photograph else text F5 if exentricity > High threshold of that of photograph then graphics F6 if exentricity < low threshold of that of photograph then graphics F7 if height < that of text then graphics F8 if height < that of photograph and density > that of photograph then graphics F9 if height > that of photograph and density > that of photograph then graphics F10 if average number of segments per line > number of segments in a text letter then if density > that of graphics then photograph else graphics F11 if No. of segments per line - average of No. of segments per line is important then if density > that of graphics then photograph else graphics F1 if No. of segment length classes that of a graphics line and average of No. of segments per line is equal to that of graphics line then graphics F13 if No. of segment length classes > that of a letter then if density > that of graphics then photograph else graphics F14 if length of segments is very irregular then if density > that of graphics then photograph else graphics F15 if low eccentricity and density that of letter then photograph F16 if many lines with irregular segment lengths then if density > that of graphics then photograph else graphics F17 else text 3.4 Error Detection and Particular Cases It is possible to have some imperfections in the classication of the lines. Therefore, we try to detect and correct any error. This is done in two phases. Firstly, incoherences at the level of cc's are located and resolved. Secondly, the incoherences at the line level or particular cases are located and resolved Overlapping Connected Components Often, photographs and graphics are usually fragmented when passed through a scanner. Some of their fragments are usually confused with text. In order to reconstitute these kinds of patterns, we proceed to locate and study cc's that overlap with them. The correction algorithm is given below. foreach c of type photograph (P ) or graphics (G) do done foreach c 0 6= c : do done area_of(c 0 ) < area_of(c) and area_of(c \ c 0 ) > area_of(c 0 )/ if type_of(c) = P and type_of(c 0 ) 6= G then type_of(c 0 ) := P if type_of(c) = G and type_of(c 0 ) = P then type_of(c 0 ) := G 3.4. Particular Cases In this phase, we compare each line with its neighboring lines to determine if we have a particular case. A particular case can be: the accents, the apostrophes, dots on i, j, broken characters, or part of graphics mis-labeled as photograph. It is also necessary to make uniform text lines where letters, individually recognized, can have dierent sizes. Text Line with dierent sized letters When a text line contains a mixture of small, medium and large letters, line is given the label of its components that occupy the largest surface area. Misclassied Medium and large letter text A medium or large text line can be cut horizontally or vertically, or may contain some points, apostrophes or punctuation. In the case of the horizontal cut, we examine two lines that are horizontal neighbors, while
4 in the other case, we examine two lines that are vertical neighbors. In the rst case, if there is line to the left or to the right of a medium or large text line, we merge it with the text line if its height is much less than that of the text line and does not contain any component whose label is large text. This case corresponds to large characters either cut on top or containing accents or dots. In the second, if there is line to the top or to the bottom of a medium or large text line, we merge it with the text line if its height is much less than that of the text line and does not contain any component whose label is large text or medium text. This case corresponds to large characters either cut on top or bottom. Table 1 shows the results of the classication. We can observe that 6% of Medium Text blocks are classi- ed as Graphics blocks. This is due to underlined text where the letters touch the line, and also to erect and isolated letters such as l,i. The 1% of Large Text confused with Photographs is a result of isolated and dense very large letters. The 5% of Photographs labeled as either Large Text or Graphics resulted from cuts in photographs during scanning, or the fact that some photographs contain white streams. Some graphics blocks were labeled as Medium Text blocks (7%), this is due to the fact that graphics are not usually well connected and may contain small forms that are mistaken for letters (for example, graphics representing chemical structures). Examples of the results obtained are given in gure 1. Graphics Classied as photographs If a line contains a mixture of graphics and photographs and the surface area of photographs is less than a certain threshold (0% of the surface area of the line), we change the label of the line components to graphics. This is due to the fact that certain graphics may contain dense components that can be confused with photographs. 3.5 Calculation of Percentages We have chosen to use the percentage of the surface area of each type of components. However, we like to give to text a percentage close to that we would have given visually. (Human eyes often surround a text zone with an invisible rectangle and consider the interline spaces as integral part of the text). Therefore, when a text line is obtained, we do not consider the total of the surface areas of its cc's, but the area of its circumscribing rectangle. Furthermore, when a rectangle circumscribing a cc is enclosed in a rectangle circumscribing another cc, the area of the former is subtracted from that of the latter (for example, a surrounded title). 4 Experiments and Results The method has been tested on about 10 blocks for each class, chosen from scientic journals like IEEE, IBM, ACM, etc. and technical reports. When the page images are not segmented before hand, the method can be used as a means of separating dierent media in the page. We observed that Small Letter Text blocks are very rare. This reinforces the idea that the notion of Small Letter Text is very subjective. In fact, each font has a particular size for small letters. Therefore, in a multifont document, Small Letter blocks are always confused with Medium Letter blocks. MT LT Gr Ph MT 94% 0% 6% 0% LT 0% 99% 0% 1% Gr 7% 0% 93% 0% Ph 0% 5% 5% 90% Table 1: Classication Results where MT stands for Medium Text, LT for Large Text, Gr for Graphics and Ph for Photographs. Even though the method gives satisfactory results, it has its own limits. Joined letters are usually labeled as graphics due to their eccentricities. Also fragmented photographs are labeled either as text or graphics. This is as a result of lack of contextual rules to assemble fragments of the same medium. 5 Conclusion The method we present gives satisfactory results on all tested document images. It is general in that it can locate and identify any medium in a document. It can also be used to separate text from non-text in technical documents. The algorithm employed tolerates a reasonable orientation of the document images. The manner in which the results are given is very useful in document analysis and treatment because it permits one to focus on a type of medium and determine the type of treatments to be applied. References [1] L. A. Fletcher and R. Kasturi, A Robust Algorithm for Text String Separation from Mixed Text/Graphics Image. PAMI,, 10(6): , 1988
5 [] D. Wang and S. N. Srihari, Classication of Newspaper Image Blocks Using Texture Analysis. CVGIP,, 47: 37-35, 1989 [3] K. Y. Wong, R. G. Casey and F. M. Wahl, Document Analysis System. IBM Journal of Research and Development, 6(6): , 198. Petit texte :.9 % (b) Texte moyen : 3. % Grand Texte : 1. % Photographie : 43.7 % Graphique : 0.0 % (c) (d) (e) Figure 1: Labeling Results for a composite document. (a) Original Image, (b) Photograph part (43.7%), (c) Small Text (35%) and (d) Large Text part (1.%).
UW Document Image Databases. Document Analysis Module. Ground-Truthed Information DAFS. Generated Information DAFS. Performance Evaluation
Performance evaluation of document layout analysis algorithms on the UW data set Jisheng Liang, Ihsin T. Phillips y, and Robert M. Haralick Department of Electrical Engineering, University of Washington,
More informationGoing digital Challenge & solutions in a newspaper archiving project. Andrey Lomov ATAPY Software Russia
Going digital Challenge & solutions in a newspaper archiving project Andrey Lomov ATAPY Software Russia Problem Description Poor recognition results caused by low image quality: noise, white holes in characters,
More informationIsolated Handwritten Words Segmentation Techniques in Gurmukhi Script
Isolated Handwritten Words Segmentation Techniques in Gurmukhi Script Galaxy Bansal Dharamveer Sharma ABSTRACT Segmentation of handwritten words is a challenging task primarily because of structural features
More informationMulti-scale Techniques for Document Page Segmentation
Multi-scale Techniques for Document Page Segmentation Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition (CEDAR), State University of New York at Buffalo, Amherst
More informationAn Accurate and Efficient System for Segmenting Machine-Printed Text. Yi Lu, Beverly Haist, Laurel Harmon, John Trenkle and Robert Vogt
An Accurate and Efficient System for Segmenting Machine-Printed Text Yi Lu, Beverly Haist, Laurel Harmon, John Trenkle and Robert Vogt Environmental Research Institute of Michigan P. O. Box 134001 Ann
More informationCharacter Recognition
Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches
More informationLayout Segmentation of Scanned Newspaper Documents
, pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms
More informationA Document Image Analysis System on Parallel Processors
A Document Image Analysis System on Parallel Processors Shamik Sural, CMC Ltd. 28 Camac Street, Calcutta 700 016, India. P.K.Das, Dept. of CSE. Jadavpur University, Calcutta 700 032, India. Abstract This
More informationSkew Detection for Complex Document Images Using Fuzzy Runlength
Skew Detection for Complex Document Images Using Fuzzy Runlength Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition(CEDAR) State University of New York at Buffalo,
More informationCOMBINED WARNING EDITING GUIDANCE DOCUMENT. European Commission Health and Consumer Protection Directorate-General
COMBINED WARNING EDITING GUIDANCE DOCUMENT European Commission Health and Consumer Protection Directorate-General CONTENTS > INTRODUCTION 2 he longest line reaches the e > TYPOGRAPHY 3 FONT > THE COMBINED
More informationBUILDING DETECTION AND STRUCTURE LINE EXTRACTION FROM AIRBORNE LIDAR DATA
BUILDING DETECTION AND STRUCTURE LINE EXTRACTION FROM AIRBORNE LIDAR DATA C. K. Wang a,, P.H. Hsu a, * a Dept. of Geomatics, National Cheng Kung University, No.1, University Road, Tainan 701, Taiwan. China-
More informationThe Processing of Form Documents
The Processing of Form Documents David S. Doermann and Azriel Rosenfeld Document Processing Group, Center for Automation Research University of Maryland, College Park 20742 email: doermann@cfar.umd.edu,
More informationUse of Shape Deformation to Seamlessly Stitch Historical Document Images
Use of Shape Deformation to Seamlessly Stitch Historical Document Images Wei Liu Wei Fan Li Chen Jun Sun Satoshi Naoi In China, efforts are being made to preserve historical documents in the form of digital
More informationText Extraction from Gray Scale Document Images Using Edge Information
Text Extraction from Gray Scale Document Images Using Edge Information Q. Yuan, C. L. Tan Dept. of Computer Science, School of computing National University of Singapore 3 Science Drive, Singapore 117543
More informationHybrid Page Layout Analysis via Tab-Stop Detection
2009 10th International Conference on Document Analysis and Recognition Hybrid Page Layout Analysis via Tab-Stop Detection Ray Smith Google Inc. 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA.
More informationSegmentation of Characters of Devanagari Script Documents
WWJMRD 2017; 3(11): 253-257 www.wwjmrd.com International Journal Peer Reviewed Journal Refereed Journal Indexed Journal UGC Approved Journal Impact Factor MJIF: 4.25 e-issn: 2454-6615 Manpreet Kaur Research
More informationRecognition-based Segmentation of Nom Characters from Body Text Regions of Stele Images Using Area Voronoi Diagram
Author manuscript, published in "International Conference on Computer Analysis of Images and Patterns - CAIP'2009 5702 (2009) 205-212" DOI : 10.1007/978-3-642-03767-2 Recognition-based Segmentation of
More information1. Introduction 16 / 1 SEGMENTATION AND CLASSIFICATION OF DOCUMENT IMAGES. 2. Background. A Antonacopoulos and R T Ritchings
SEGMENTATION AND CLASSIFICATION OF DOCUMENT IMAGES A Antonacopoulos and R T Ritchings 1. Introduction There is a significant and growing need to convert documents from printed paper to an electronic fonn.
More informationDATA EMBEDDING IN TEXT FOR A COPIER SYSTEM
DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM Anoop K. Bhattacharjya and Hakan Ancin Epson Palo Alto Laboratory 3145 Porter Drive, Suite 104 Palo Alto, CA 94304 e-mail: {anoop, ancin}@erd.epson.com Abstract
More informationA System towards Indian Postal Automation
A System towards Indian Postal Automation K. Roy C.V.P.R Unit, I.S.I, Kolkata-108; India, Kaushik_mrg@hotmail.com S. Vajda LORIA Research Center, B.P. 239 54506, Nancy, France Szilard.Vajda@loria.fr U.
More informationExtracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang
Extracting Layers and Recognizing Features for Automatic Map Understanding Yao-Yi Chiang 0 Outline Introduction/ Problem Motivation Map Processing Overview Map Decomposition Feature Recognition Discussion
More informationAutomatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques
Automatic Recognition and Verification of Handwritten Legal and Courtesy Amounts in English Language Present on Bank Cheques Ajay K. Talele Department of Electronics Dr..B.A.T.U. Lonere. Sanjay L Nalbalwar
More informationSeparation of Overlapping Text from Graphics
Separation of Overlapping Text from Graphics Ruini Cao, Chew Lim Tan School of Computing, National University of Singapore 3 Science Drive 2, Singapore 117543 Email: {caorn, tancl}@comp.nus.edu.sg Abstract
More informationProject Report for EE7700
Project Report for EE7700 Name: Jing Chen, Shaoming Chen Student ID: 89-507-3494, 89-295-9668 Face Tracking 1. Objective of the study Given a video, this semester project aims at implementing algorithms
More informationRefine boundary at resolution r. r+1 r. Update context information CI(r) based on CI(r-1) Classify at resolution r, based on CI(r), update CI(r)
Context Based Multiscale Classication of Images Jia Li Robert M. Gray EE Department EE Department Stanford Univ., CA 94305 Stanford Univ., CA 94305 jiali@isl.stanford.edu rmgray@stanford.edu Abstract This
More informationStefano Ferilli 1 Floriana Esposito 1 Domenico Redavid 2
A study on the Classification of Layout Components for Newspapers Stefano Ferilli 1 Floriana Esposito 1 Domenico Redavid 2 1 Dipartimento di Informatica Università di Bari name.surname@uniba.it 2 Artificial
More informationRobust line segmentation for handwritten documents
Robust line segmentation for handwritten documents Kamal Kuzhinjedathu, Harish Srinivasan and Sargur Srihari Center of Excellence for Document Analysis and Recognition (CEDAR) University at Buffalo, State
More informationWord extraction using irregular pyramid C. L. Tan a and P. K. Loo b
Header for SPIE use Word extraction using irregular pyramid C. L. Tan a and P. K. Loo b a School of Computing, National University of Singapore, Kent Ridge, Singapore 119260 b Civil Engineering & Building
More informationTime Stamp Detection and Recognition in Video Frames
Time Stamp Detection and Recognition in Video Frames Nongluk Covavisaruch and Chetsada Saengpanit Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand E-mail: nongluk.c@chula.ac.th
More informationExtending Page Segmentation Algorithms for Mixed-Layout Document Processing
ScholarWorks Computer Science Faculty Publications and Presentations Department of Computer Science 9-18-2011 Extending Page Segmentation Algorithms for Mixed-Layout Document Processing Amy Winder Tim
More informationOn Segmentation of Documents in Complex Scripts
On Segmentation of Documents in Complex Scripts K. S. Sesh Kumar, Sukesh Kumar and C. V. Jawahar Centre for Visual Information Technology International Institute of Information Technology, Hyderabad, India
More informationMouse Pointer Tracking with Eyes
Mouse Pointer Tracking with Eyes H. Mhamdi, N. Hamrouni, A. Temimi, and M. Bouhlel Abstract In this article, we expose our research work in Human-machine Interaction. The research consists in manipulating
More informationThe 12 most common newsletter design mistakes
The 12 most common newsletter design mistakes www.targetmarketingnetwork.com By: Roger C. Parker Your newsletter s success depends on its design. An attractive, easy to read newsletter encourages readers
More informationLine Net Global Vectorization: an Algorithm and Its Performance Evaluation
Line Net Global Vectorization: an Algorithm and Its Performance Evaluation Jiqiang Song 1, Feng Su 1, Jibing Chen 1, Chiewlan Tai 2, and Shijie Cai 1 1 Department of Computer Science of Nanjing University,
More informationA Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script
A Survey of Problems of Overlapped Handwritten Characters in Recognition process for Gurmukhi Script Arwinder Kaur 1, Ashok Kumar Bathla 2 1 M. Tech. Student, CE Dept., 2 Assistant Professor, CE Dept.,
More informationRecognition of Gurmukhi Text from Sign Board Images Captured from Mobile Camera
International Journal of Information & Computation Technology. ISSN 0974-2239 Volume 4, Number 17 (2014), pp. 1839-1845 International Research Publications House http://www. irphouse.com Recognition of
More informationOptical Flow-Based Person Tracking by Multiple Cameras
Proc. IEEE Int. Conf. on Multisensor Fusion and Integration in Intelligent Systems, Baden-Baden, Germany, Aug. 2001. Optical Flow-Based Person Tracking by Multiple Cameras Hideki Tsutsui, Jun Miura, and
More informationAdaptive technology for mail-order segmentation. this approach lies mainly in the absence of a rigid a priori model, replaced by a simply and
Adaptive technology for mail-order segmentation 1 A.Belaïd 1, Y. Belaïd 2, Late N. Valverde 3 and S. Kébairi 3 1 LORIA-CNRS, Campus scientifique, B.P. 239, 54506 Vandoeuvre-Lès-Nancy France 2 LORIA-University
More informationPostprint.
http://www.diva-portal.org Postprint This is the accepted version of a paper presented at 14th International Conference of the Biometrics Special Interest Group, BIOSIG, Darmstadt, Germany, 9-11 September,
More informationKhmer OCR for Limon R1 Size 22 Report
PAN Localization Project Project No: Ref. No: PANL10n/KH/Report/phase2/002 Khmer OCR for Limon R1 Size 22 Report 09 July, 2009 Prepared by: Mr. ING LENG IENG Cambodia Country Component PAN Localization
More informationOCR For Handwritten Marathi Script
International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,
More informationAutomatic Detection of Change in Address Blocks for Reply Forms Processing
Automatic Detection of Change in Address Blocks for Reply Forms Processing K R Karthick, S Marshall and A J Gray Abstract In this paper, an automatic method to detect the presence of on-line erasures/scribbles/corrections/over-writing
More informationSegmentation of Bangla Handwritten Text
Thesis Report Segmentation of Bangla Handwritten Text Submitted By: Sabbir Sadik ID:09301027 Md. Numan Sarwar ID: 09201027 CSE Department BRAC University Supervisor: Professor Dr. Mumit Khan Date: 13 th
More informationSkew Detection Technique for Binary Document Images based on Hough Transform
Skew Detection Technique for Binary Document Images based on Hough Transform Manjunath Aradhya V N*, Hemantha Kumar G, and Shivakumara P Abstract Document image processing has become an increasingly important
More informationImage Segmentation Based on Watershed and Edge Detection Techniques
0 The International Arab Journal of Information Technology, Vol., No., April 00 Image Segmentation Based on Watershed and Edge Detection Techniques Nassir Salman Computer Science Department, Zarqa Private
More informationCHAPTER 4: MICROSOFT OFFICE: EXCEL 2010
CHAPTER 4: MICROSOFT OFFICE: EXCEL 2010 Quick Summary A workbook an Excel document that stores data contains one or more pages called a worksheet. A worksheet or spreadsheet is stored in a workbook, and
More informationFACIAL RECOGNITION BASED ON THE LOCAL BINARY PATTERNS MECHANISM
FACIAL RECOGNITION BASED ON THE LOCAL BINARY PATTERNS MECHANISM ABSTRACT Alexandru Blanda 1 This work presents a method of facial recognition, based on Local Binary Models. The idea of using this algorithm
More informationPrototype Selection for Handwritten Connected Digits Classification
2009 0th International Conference on Document Analysis and Recognition Prototype Selection for Handwritten Connected Digits Classification Cristiano de Santana Pereira and George D. C. Cavalcanti 2 Federal
More informationLocating 1-D Bar Codes in DCT-Domain
Edith Cowan University Research Online ECU Publications Pre. 2011 2006 Locating 1-D Bar Codes in DCT-Domain Alexander Tropf Edith Cowan University Douglas Chai Edith Cowan University 10.1109/ICASSP.2006.1660449
More informationEasyDone for AutoCAD
RECSOFT Inc. EasyDone for AutoCAD U S E R S G U I D E This software and accompanying documentation are copyrighted and all rights reserved by RECSOFT Inc. EasyDone and RECSOFT are trademarks of RECSOFT
More informationWavelet Based Page Segmentation Puneet Gupta Neeti Vohra Santanu Chaudhury Shiv Dutt Joshi
Wavelet Based Page Segmentation Puneet Gupta Neeti Vohra Santanu Chaudhury Shiv Dutt Joshi gupta@cfar.umd.edu nvohra@cise.u.edu santanuc@ee.iitd.ernet.in sdjoshi@ee.iitd.ernet.in Department of Electrical
More informationError-Diffusion Robust to Mis-Registration in Multi-Pass Printing
Error-Diffusion Robust to Mis-Registration in Multi-Pass Printing Zhigang Fan, Gaurav Sharma, and Shen-ge Wang Xerox Corporation Webster, New York Abstract Error-diffusion and its variants are commonly
More informationA New Algorithm for Detecting Text Line in Handwritten Documents
A New Algorithm for Detecting Text Line in Handwritten Documents Yi Li 1, Yefeng Zheng 2, David Doermann 1, and Stefan Jaeger 1 1 Laboratory for Language and Media Processing Institute for Advanced Computer
More informationA Graphics Image Processing System
A Graphics Image Processing System Linlin Li and Chew Lim Tan Department of Computer Science, National University of Singapore Kent Ridge, Singapore 117543 {lilinlin,lusj,tancl}@comp.nus.edu.sg Abstract
More informationLocalization, Extraction and Recognition of Text in Telugu Document Images
Localization, Extraction and Recognition of Text in Telugu Document Images Atul Negi Department of CIS University of Hyderabad Hyderabad 500046, India atulcs@uohyd.ernet.in K. Nikhil Shanker Department
More information(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)
Digital Image Processing Prof. P. K. Biswas Department of Electronics and Electrical Communications Engineering Indian Institute of Technology, Kharagpur Module Number 01 Lecture Number 02 Application
More informationHidden Loop Recovery for Handwriting Recognition
Hidden Loop Recovery for Handwriting Recognition David Doermann Institute of Advanced Computer Studies, University of Maryland, College Park, USA E-mail: doermann@cfar.umd.edu Nathan Intrator School of
More informationA Fast Caption Detection Method for Low Quality Video Images
2012 10th IAPR International Workshop on Document Analysis Systems A Fast Caption Detection Method for Low Quality Video Images Tianyi Gui, Jun Sun, Satoshi Naoi Fujitsu Research & Development Center CO.,
More informationArabic Newspaper Page Segmentation
Arabic Newspaper Page Segmentation Karim Hadjar and Rolf Ingold DIUF, University of Fribourg Chemin du Musée 3, 1700 Fribourg, Switzerland {karim.hadjar, rolf.ingold}@unifr.ch Abstract The aim of layout
More informationAn Accurate Method for Skew Determination in Document Images
DICTA00: Digital Image Computing Techniques and Applications, 1 January 00, Melbourne, Australia. An Accurate Method for Skew Determination in Document Images S. Lowther, V. Chandran and S. Sridharan Research
More informationHow to draw and create shapes
Adobe Flash Professional Guide How to draw and create shapes You can add artwork to your Adobe Flash Professional documents in two ways: You can import images or draw original artwork in Flash by using
More informationPage 1. Area-Subdivision Algorithms z-buffer Algorithm List Priority Algorithms BSP (Binary Space Partitioning Tree) Scan-line Algorithms
Visible Surface Determination Visibility Culling Area-Subdivision Algorithms z-buffer Algorithm List Priority Algorithms BSP (Binary Space Partitioning Tree) Scan-line Algorithms Divide-and-conquer strategy:
More informationText identification for document image analysis using a neural network
IMAVIS 1511 Image and Vision Computing 16 (1998) 879 896 Text identification for document image analysis using a neural network C. Strouthopoulos, N. Papamarkos* Electric Circuits Analysis Laboratory,
More informationTextFinder: An Automatic System To Detect And Recognize Text In Images Victor Wu, R. Manmatha, Edward M. Riseman Abstract There are many applications
TextFinder: An Automatic System To Detect And Recognize Text In Images Victor Wu, R. Manmatha, Edward M. Riseman Multimedia Indexing And Retrieval Group Computer Science Department University of Massachusetts,
More informationAdaptive Technology for Mail-Order Form Segmentation
Adaptive Technology for Mail-Order Form Segmentation A.Belaïd 1, Y. Belaïd 2, Late N. Valverde 3 and S. Kébairi 3 1 LORIA-CNRS, Campus scientifique, B.P. 239, 54506 Vandoeuvre-Lès-Nancy France 2 LORIA-University
More informationUsing Game Theory for Image Segmentation
Using Game Theory for Image Segmentation Elizabeth Cassell Sumanth Kolar Alex Yakushev 1 Introduction 21st March 2007 The goal of image segmentation, is to distinguish objects from background. Robust segmentation
More informationBiometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)
Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) References: [1] http://homepages.inf.ed.ac.uk/rbf/hipr2/index.htm [2] http://www.cs.wisc.edu/~dyer/cs540/notes/vision.html
More informationKeyword Spotting in Document Images through Word Shape Coding
2009 10th International Conference on Document Analysis and Recognition Keyword Spotting in Document Images through Word Shape Coding Shuyong Bai, Linlin Li and Chew Lim Tan School of Computing, National
More informationHandwritten text segmentation using blurred image
Handwritten text segmentation using blurred image Aurélie Lemaitre, Jean Camillerapp, Bertrand Coüasnon To cite this version: Aurélie Lemaitre, Jean Camillerapp, Bertrand Coüasnon. Handwritten text segmentation
More informationPixels. Orientation π. θ π/2 φ. x (i) A (i, j) height. (x, y) y(j)
4th International Conf. on Document Analysis and Recognition, pp.142-146, Ulm, Germany, August 18-20, 1997 Skew and Slant Correction for Document Images Using Gradient Direction Changming Sun Λ CSIRO Math.
More informationA System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation
A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation K. Roy, U. Pal and B. B. Chaudhuri CVPR Unit; Indian Statistical Institute, Kolkata-108; India umapada@isical.ac.in
More informationHandwritten Digit Recognition with a. Back-Propagation Network. Y. Le Cun, B. Boser, J. S. Denker, D. Henderson,
Handwritten Digit Recognition with a Back-Propagation Network Y. Le Cun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel AT&T Bell Laboratories, Holmdel, N. J. 07733 ABSTRACT
More informationRecognition of Multi-Oriented, Multi-Sized, and Curved Text
Recognition of Multi-Oriented, Multi-Sized, and Curved Text Yao-Yi Chiang University of Southern California, Information Sciences Institute and Spatial Sciences Institute, 4676 Admiralty Way, Marina del
More informationLOGO USE GUIDELINES BRAND GUIDELINES PUBLISHED ON FEBRUARY 17,
LOGO USE GUIDELINES BRAND GUIDELINES PUBLISHED ON FEBRUARY 17, 2014 1 LOGO USE GUIDELINES LOGO USAGE GUIDELINES 13 LOGO USAGE GUIDELINES The Gardner-Webb logo is the centerpiece of the University's visual
More informationA Model-based Line Detection Algorithm in Documents
A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,
More information2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into
2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into the viewport of the current application window. A pixel
More informationIntegrating Low-Level and Semantic Visual Cues for Improved Image-to-Video Experiences
Integrating Low-Level and Semantic Visual Cues for Improved Image-to-Video Experiences Pedro Pinho, Joel Baltazar, Fernando Pereira Instituto Superior Técnico - Instituto de Telecomunicações IST, Av. Rovisco
More informationPrewitt. Gradient. Image. Op. Merging of Small Regions. Curve Approximation. and
A RULE-BASED SYSTEM FOR REGION SEGMENTATION IMPROVEMENT IN STEREOVISION M. Buvry, E. Zagrouba and C. J. Krey ENSEEIHT - IRIT - UA 1399 CNRS Vision par Calculateur A. Bruel 2 Rue Camichel, 31071 Toulouse
More informationKeywords Connected Components, Text-Line Extraction, Trained Dataset.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Language Independent
More informationADOBE ILLUSTRATOR CS3
ADOBE ILLUSTRATOR CS3 Chapter 2 Creating Text and Gradients Chapter 2 1 Creating type Create and Format Text Create text anywhere Select the Type Tool Click the artboard and start typing or click and drag
More informationBus Detection and recognition for visually impaired people
Bus Detection and recognition for visually impaired people Hangrong Pan, Chucai Yi, and Yingli Tian The City College of New York The Graduate Center The City University of New York MAP4VIP Outline Motivation
More informationJanitor Bot - Detecting Light Switches Jiaqi Guo, Haizi Yu December 10, 2010
1. Introduction Janitor Bot - Detecting Light Switches Jiaqi Guo, Haizi Yu December 10, 2010 The demand for janitorial robots has gone up with the rising affluence and increasingly busy lifestyles of people
More informationOptimized XY-Cut for Determining a Page Reading Order
Optimized XY-Cut for Determining a Page Reading Order Jean-Luc Meunier Xerox Research Centre Europe 6, chemin de Maupertuis F-3840 Meylan jean-luc.meunier@xrce.xerox.com Abstract In this paper, we propose
More informationSolving Word Jumbles
Solving Word Jumbles Debabrata Sengupta, Abhishek Sharma Department of Electrical Engineering, Stanford University { dsgupta, abhisheksharma }@stanford.edu Abstract In this report we propose an algorithm
More informationLecture 3 Form & Space Form Defines Space
Islamic University-Gaza Faculty of Engineering Architecture Department Principles of Architectural and Environmental Design -EARC 2417 Lecture 3 Form & Space Form Defines Space Instructor: Dr. Suheir Ammar
More informationSymbol Detection Using Region Adjacency Graphs and Integer Linear Programming
2009 10th International Conference on Document Analysis and Recognition Symbol Detection Using Region Adjacency Graphs and Integer Linear Programming Pierre Le Bodic LRI UMR 8623 Using Université Paris-Sud
More informationFRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT
International Journal of Information Technology, Modeling and Computing (IJITMC) Vol. 2, No. 1, February 2014 FRAGMENTATION OF HANDWRITTEN TOUCHING CHARACTERS IN DEVANAGARI SCRIPT Shuchi Kapoor 1 and Vivek
More informationProblem definition Image acquisition Image segmentation Connected component analysis. Machine vision systems - 1
Machine vision systems Problem definition Image acquisition Image segmentation Connected component analysis Machine vision systems - 1 Problem definition Design a vision system to see a flat world Page
More informationResearch on QR Code Image Pre-processing Algorithm under Complex Background
Scientific Journal of Information Engineering May 207, Volume 7, Issue, PP.-7 Research on QR Code Image Pre-processing Algorithm under Complex Background Lei Liu, Lin-li Zhou, Huifang Bao. Institute of
More informationFabric Defect Detection Based on Computer Vision
Fabric Defect Detection Based on Computer Vision Jing Sun and Zhiyu Zhou College of Information and Electronics, Zhejiang Sci-Tech University, Hangzhou, China {jings531,zhouzhiyu1993}@163.com Abstract.
More informationEE368 Project: Visual Code Marker Detection
EE368 Project: Visual Code Marker Detection Kahye Song Group Number: 42 Email: kahye@stanford.edu Abstract A visual marker detection algorithm has been implemented and tested with twelve training images.
More informationWhat is Publisher, anyway?
What is Publisher, anyway? Microsoft Publisher designed for users who need to create and personalize publications such as marketing materials, business stationery, signage, newsletters and other items
More informationScene Text Detection Using Machine Learning Classifiers
601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department
More informationA typed and handwritten text block segmentation system for heterogeneous and complex documents
A typed and handwritten text block segmentation system for heterogeneous and complex documents P. Barlas, S. Adam, C. Chatelain, T.Paquet Laboratoire LITIS - EA 4108, Universite de Rouen, FRANCE 76800
More informationDISCRETE DOMAIN REPRESENTATION FOR SHAPE CONCEPTUALIZATION
DISCRETE DOMAIN REPRESENTATION FOR SHAPE CONCEPTUALIZATION Zoltán Rusák, Imre Horváth, György Kuczogi, Joris S.M. Vergeest, Johan Jansson Department of Design Engineering Delft University of Technology
More informationStructural and Syntactic Techniques for Recognition of Ethiopic Characters
Structural and Syntactic Techniques for Recognition of Ethiopic Characters Yaregal Assabie and Josef Bigun School of Information Science, Computer and Electrical Engineering Halmstad University, SE-301
More informationVision. OCR and OCV Application Guide OCR and OCV Application Guide 1/14
Vision OCR and OCV Application Guide 1.00 OCR and OCV Application Guide 1/14 General considerations on OCR Encoded information into text and codes can be automatically extracted through a 2D imager device.
More informationLecture 4 Form & Space Form Defines Space
Islamic University-Gaza Faculty of Engineering Architectural Department Principles of Architectural and Environmental Design -EARC 2417 Lecture 4 Form & Space Form Defines Space Instructor: Dr. Suheir
More informationContent-based Image and Video Retrieval. Image Segmentation
Content-based Image and Video Retrieval Vorlesung, SS 2011 Image Segmentation 2.5.2011 / 9.5.2011 Image Segmentation One of the key problem in computer vision Identification of homogenous region in the
More informationText Area Detection from Video Frames
Text Area Detection from Video Frames 1 Text Area Detection from Video Frames Xiangrong Chen, Hongjiang Zhang Microsoft Research China chxr@yahoo.com, hjzhang@microsoft.com Abstract. Text area detection
More information