DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM

Anoop K. Bhattacharjya and Hakan Ancin
Epson Palo Alto Laboratory, 3145 Porter Drive, Suite 104, Palo Alto, CA
{anoop, ancin}@erd.epson.com

Abstract

In this paper, we present a scheme for embedding data in copies (color or monochrome) of predominantly text pages that may also contain color images or graphics. Embedding data imperceptibly in documents or images is a key ingredient of watermarking and data hiding schemes. It is comparatively easy to hide a signal in natural images, since the human visual system is less sensitive to signals embedded in noisy regions containing high spatial frequencies. In other instances, e.g., simple graphics or monochrome text documents, additional constraints need to be satisfied to embed signals imperceptibly. Data may be embedded imperceptibly in printed text by altering some measurable property of a font, such as the position of a character or the font size. This scheme, however, is not very useful for embedding data in copies of text pages, as that would require accurate text segmentation and possibly optical character recognition, both of which would considerably degrade the error-rate performance of the data-embedding system. Similarly, other schemes that alter pixels on text boundaries perform poorly because of boundary-detection uncertainties introduced by scanner noise, sampling and blurring. The scheme presented in this paper ameliorates the above problems by using a text-region based embedding approach. Since the bulk of documents reproduced today contain black-on-white text, this data-embedding scheme can form a print-level layer in applications such as copy tracking and annotation.

1. Introduction

In this paper, we present a method for embedding or hiding information in copies of predominantly text documents, such that the embedded signal is visually imperceptible. The method is also applicable to originals containing color graphics and images in addition to text.
A number of methods have been proposed for embedding signals in images of natural scenes [1]. Data may be embedded imperceptibly in printed text by altering some measurable property of a font, such as the position of a character or the font size. This scheme, however, is not very useful for embedding data in copies of text pages, as that would require accurate text segmentation and possibly optical character recognition on the document copy, both of which would considerably degrade the error-rate performance of the data-embedding system. Similarly, other schemes that alter pixels on text boundaries perform poorly because of boundary-detection uncertainties introduced by scanner noise, sampling and blurring. Another approach is to embed the hidden data in the halftoning patterns used by the printer to generate a copy, but this approach works best for documents that contain natural images or continuous-tone content. Many printers today employ halftone patterns for printer tracking. However, these systems are inadequate for copy-tracking applications that may require additional annotation, such as a copier serial number or user identification.

Since a large percentage of reproduced documents consist of black-and-white text, there is a need for schemes that can hide data imperceptibly in copies of such pages. In the scheme presented in this paper, we identify small (sub-character sized) regions that consist mainly of pixels meeting the criteria for text-character parts described below, and embed data by modulating the lightness of these regions. Although the method relies on the existence of these regions, it does not rely on the regions actually representing parts of text characters. While the variations in lightness do not affect perceived text quality, they can be picked up easily using a scanner and decoded to retrieve the message.
The robustness of the scheme is improved by using an error-correcting code coupled with a bit-dispersal scheme that spreads the message bits throughout the document. The steps involved in data embedding and retrieval are presented in the following sections.

2. The data embedding and retrieval system

This section presents the steps by which data is embedded into and retrieved from the copy of a text document. The processing requires two scans of the original document. The first is a preview scan, at a lower resolution, that is used to identify the various components of the document and establish a coordinate system based on the paragraphs, lines and words found in the document. The second scan is a full-resolution scan that is used to generate the document copy. The data from this scan is processed with the results of the preview scan to embed or retrieve the message. As part of a copier pipeline, this data may then be sent for printing. The principal steps of the preview processing are shown in Figure 1. Once a site list is obtained from an analysis of the preview, the bits to be embedded are used to modulate the pixel intensities in the scanned image, in regions determined by the site list. Details of the preview-processing and data-embedding steps are provided in the following sections.

Preview processing

Before performing the copy scan, the copier performs a preview scan to determine candidate sites in the text document for embedding data. This scan is typically of a lower resolution than the scan used for making a copy, so that the memory and processing requirements of the preview scan are minimized. In this paper, the preview scan resolution is assumed to be one half or one third of the copy scan resolution. The preview image is first segmented into regions that approximately correspond to text, image and background regions.

Image segmentation

Image segmentation is a two-step process. First, the pixels are classified based on their luminance and color-saturation values. Pixels with low luminance and low saturation are classified as text, those with high luminance and low saturation are classified as background, and the remaining pixels are classified as image pixels [2]. These labels may be further refined using run-length information as described in [3]; however, most documents do not require this level of sophistication for adequate initial segmentation.
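The pixel classification rule above can be illustrated with a minimal sketch. The threshold values (`dark_max`, `bright_min`, `sat_max`) and the function names are assumptions made for illustration; the paper does not give numeric thresholds, and the actual classifier is described in [2].

```python
def classify_pixel(luminance, saturation,
                   dark_max=0.35, bright_min=0.65, sat_max=0.25):
    """Label one pixel; luminance and saturation are assumed to lie in [0, 1].

    The three thresholds are hypothetical values, not taken from the paper.
    """
    if saturation <= sat_max:
        if luminance <= dark_max:
            return "text"        # low luminance, low saturation
        if luminance >= bright_min:
            return "background"  # high luminance, low saturation
    return "image"               # all remaining pixels


def segment(pixels):
    """Classify a 2-D grid of (luminance, saturation) pairs."""
    return [[classify_pixel(lum, sat) for lum, sat in row] for row in pixels]


# Tiny 2x2 preview: dark gray, near white, saturated color, mid gray.
preview = [[(0.05, 0.02), (0.95, 0.03)],
           [(0.50, 0.80), (0.50, 0.05)]]
labels = segment(preview)
```

In this sketch a mid-gray, unsaturated pixel falls through to the image class, which is one possible reading of "the remaining pixels".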
A morphological filter is used to delete very small and very large regions of connected text labels. Pixels corresponding to the deleted text labels are marked as unknown. The binary image comprised of text and non-text pixels is analyzed further to establish a rotation- and translation-invariant reference frame for the document.

Connected components labeling, deskewing and block identification

A connected-components algorithm [4] is used to identify connected regions of text pixels. Text-label components with areas and lengths that are smaller or larger than preset thresholds are deleted, and the corresponding pixels are marked as non-text. Very long components are excluded as potential sites because they are susceptible to greater cumulative registration errors during data extraction. The components that survive this step are used to determine the skew angle of the document and so establish the orientation of the page.

The orientation of the page is established with a Hough-transform technique, in the following steps. First, the components are grouped in a hierarchical structure based on the inter-component distance. This hierarchical structure groups the components into characters, words, lines and paragraphs. The grouping is performed by calculating the distance between the elements of a group at a given level. Individual characters form the lowest level in the hierarchy; these correspond simply to the connected components themselves. Note that with this classification, characters may not correspond to actual text characters, i.e., a text character may be composed of multiple components, or multiple text characters may fuse into a single component. While this misclassification would affect character recognition, it does not affect skew detection or data embedding. The median component height is used as a length scale to group components into word and paragraph elements.
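The labeling and size-filtering step described at the start of this subsection can be sketched as follows. This is a minimal illustration, not the algorithm of [4]: the use of 8-connectivity, the breadth-first search, and the threshold parameters `min_area`, `max_area` and `max_length` are all assumptions.

```python
from collections import deque


def label_components(binary):
    """Return connected regions of non-zero pixels as lists of (row, col).

    8-connectivity is an assumption; the paper does not specify it.
    """
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not binary[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue, comp = deque([(r, c)]), []
            while queue:
                y, x = queue.popleft()
                comp.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(comp)
    return components


def keep_component(comp, min_area, max_area, max_length):
    """Apply hypothetical area and length thresholds to one component."""
    ys = [p[0] for p in comp]
    xs = [p[1] for p in comp]
    length = max(max(ys) - min(ys), max(xs) - min(xs)) + 1
    return min_area <= len(comp) <= max_area and length <= max_length


grid = [[1, 1, 0, 0, 0, 0],
        [1, 1, 0, 0, 1, 0],
        [0, 0, 0, 0, 0, 0],
        [1, 1, 1, 1, 1, 1]]
components = label_components(grid)
kept = [c for c in components if keep_component(c, 2, 10, 4)]
```

Here the isolated pixel is rejected as too small and the bottom row is rejected as too long, leaving only the 2x2 block as a usable component.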
Words are formed as groups of characters that are closer than a preset inter-word distance, determined as a fixed proportion of the median component height. Similarly, a preset inter-line distance is used to group words into lines. Paragraphs are determined by two methods. The first method uses indentation of the first word in a line to find paragraphs. The second method looks for lines separated by more than a preset inter-paragraph distance to mark paragraphs.

Once the page has been described as a collection of words, lines and paragraphs, the centroids of all the components in a given line are used to determine its orientation. This is performed by applying a Hough transform to the family of straight lines defined by the centroid of each component belonging to the same line grouping. Since the page orientation obtained in this manner is symmetric with respect to horizontal and vertical reflections, the retrieval algorithm needs to monitor two scan directions to retrieve an embedded bit stream. This ensures that the embedded message can still be retrieved if the page is rotated by 180 degrees on the scanner bed.

Once the page orientation is known, the page is deskewed, and the bounding boxes of all the components belonging to a character, word, line or paragraph grouping, as described above, are used to define character, word, line and paragraph boxes respectively. The paragraph boxes are used to define multiple coordinate frames for the document, one for each paragraph. With the coordinate/reference frames established, the next step is the identification of sites for embedding the hidden message.

Site selection

Sites for intensity modulation are determined by one of two methods. The first uses a coordinate system associated with each paragraph or line element to embed the data. If a paragraph block is used to establish the local coordinate frame, the pixels in each paragraph block are partitioned into a fine square grid with 3x3 pixels in each grid cell. The sites in which data will be embedded are chosen from among the grid cells. Site selection proceeds as follows. First, the grid cells that contain predominantly text-type pixels are identified. To perform this selection, the 90th percentile of the luminance histogram of all text components is chosen as a threshold. Any grid cell in which more than a preset percentage of pixels fall below this threshold is marked as a candidate site for data embedding. Data is embedded in these sites by modulating the luminance of all pixels belonging to a candidate site's cell.

The second method for site selection uses a local coordinate frame associated with characters that have long strokes. Such strokes are detected using a morphological operator. The height of the stroke provides a scale-independent coordinate system for modulating pixel intensities at locations along the stroke defined by this local coordinate system.

Two or more candidate sites are required for embedding each bit. For example, a bit may be embedded in two sites using the following scheme: if the difference between the average luminance of the pixels belonging to the current site and that of the next site is positive, the bit is a 1; if the difference is negative, the bit is a 0. Similar difference-based schemes may be used for embedding a single bit in three or more sites.
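The grid-based site selection and the two-site difference scheme can be sketched together. The function names, the `min_fraction` and `delta` values, and the choice to move both cells symmetrically about their common mean are assumptions; in the paper the threshold comes from the 90th percentile of the text luminance histogram and the modulation depth is determined experimentally.

```python
def cell_pixels(site, cell=3):
    """Yield (row, col) for every pixel in the cell with origin `site`."""
    top, left = site
    for r in range(top, top + cell):
        for c in range(left, left + cell):
            yield r, c


def candidate_cells(lum, threshold, min_fraction=0.8, cell=3):
    """Return origins of cells in which most pixels are darker than threshold."""
    sites = []
    for top in range(0, len(lum) - cell + 1, cell):
        for left in range(0, len(lum[0]) - cell + 1, cell):
            dark = sum(lum[r][c] < threshold
                       for r, c in cell_pixels((top, left), cell))
            if dark / (cell * cell) > min_fraction:
                sites.append((top, left))
    return sites


def cell_mean(lum, site, cell=3):
    return sum(lum[r][c] for r, c in cell_pixels(site, cell)) / (cell * cell)


def shift_cell(lum, site, amount, cell=3):
    """Add `amount` to every pixel of the cell, clamped to [0, 255]."""
    for r, c in cell_pixels(site, cell):
        lum[r][c] = min(255.0, max(0.0, lum[r][c] + amount))


def embed_bit(lum, site_a, site_b, bit, delta=8.0):
    """Make mean(a) - mean(b) equal +delta for a 1 and -delta for a 0."""
    mean_a, mean_b = cell_mean(lum, site_a), cell_mean(lum, site_b)
    mid = (mean_a + mean_b) / 2
    half = delta / 2 if bit else -delta / 2
    shift_cell(lum, site_a, mid + half - mean_a)
    shift_cell(lum, site_b, mid - half - mean_b)


def decode_bit(lum, site_a, site_b):
    return 1 if cell_mean(lum, site_a) - cell_mean(lum, site_b) > 0 else 0


# Two dark (text-like) cells followed by one bright background cell.
page = [[30.0] * 3 + [34.0] * 3 + [230.0] * 3 for _ in range(3)]
sites = candidate_cells(page, threshold=100)
embed_bit(page, sites[0], sites[1], 1)
decoded = decode_bit(page, sites[0], sites[1])
```

Decoding only compares cell means, so it needs the same site list as embedding but no knowledge of the original pixel values.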
For example, a bit may be embedded in three sites using differences of average grid-cell luminance as follows: if the first difference is positive and the next is negative, the bit is a 1; if the first difference is negative and the next is positive, the bit is a 0. The number of independently controllable sites for the purpose of bit embedding is extracted from the candidate site list based on the number of sites required to embed a bit.

A line or word synchronization scheme is used to minimize cumulative errors due to site-identification errors. In this scheme, message words are always embedded starting at a line or word boundary, and the embedded message is repeated multiple times across the document, depending on the number of available sites. During data extraction, the decoder attempts to decode the embedded data from the start of every line or word boundary. This provides increased robustness against cumulative errors due to random site misclassification. The site list output by the preview-processing module consists of independently controllable sites that also satisfy the line- and paragraph-synchronization constraints. This site list also contains page orientation information so that pixels belonging to each site may be mapped to the higher scan resolution used for copying the document.

Data embedding and retrieval from high-resolution image

The data to be embedded in the document is first coded using an error-correcting code. The resulting bits are then scrambled so that they are dispersed uniformly across the page. This scrambling is achieved by using a dispersed-dither matrix of the kind typically used for halftoning in color printers. The ranks of a dispersed-dither matrix [5] have the property that each successive rank is located at a position in the matrix that is as far away (spatially) as possible from the locations containing all previous ranks.
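One well-known way to build a rank matrix with this kind of spatial dispersion is the recursive Bayer construction, sketched below. Using the Bayer matrix here is an assumption made for illustration; the paper only cites [5] and does not say which dispersed-dither matrix was used.

```python
def dispersed_ranks(order):
    """Return a 2**order by 2**order Bayer-style matrix of ranks.

    Each doubling step places four scaled copies of the previous matrix,
    offset by 0, 2, 3 and 1, so that consecutive ranks land far apart.
    """
    matrix = [[0]]
    for _ in range(order):
        n = len(matrix)
        bigger = [[0] * (2 * n) for _ in range(2 * n)]
        for r in range(n):
            for c in range(n):
                base = 4 * matrix[r][c]
                bigger[r][c] = base              # top-left quadrant
                bigger[r][c + n] = base + 2      # top-right quadrant
                bigger[r + n][c] = base + 3      # bottom-left quadrant
                bigger[r + n][c + n] = base + 1  # bottom-right quadrant
        matrix = bigger
    return matrix


ranks = dispersed_ranks(2)
for row in ranks:
    print(row)
```

With `order=9` the same function yields a 512x512 matrix containing each rank from 0 to 262143 exactly once.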
Since the site list generated in the previous section has a fixed number of sites per line, all the sites can be arranged in a two-dimensional array. This array is tiled periodically by a large (512x512) dither array, and each site is assigned a rank based on the rank of the dither array and the index of the dither-array tile at that location. The rank of each site is used to index into the error-coded bit stream to determine the bit that will be embedded in the pixels belonging to the site.

During the high-resolution copy scan, data may be embedded in or extracted from the document. For data embedding, the pixel luminances are modulated according to the bit-embedding scheme described in the previous section. The degree of luminance modulation depends on the characteristics of the scanner and printer used in the copier, and is determined experimentally. For data retrieval, the average luminance of the pixels in each site is computed, and the data is retrieved according to the embedding scheme and the input site list. Figure 3 shows a portion of text in which data is embedded using the scheme presented in this paper. The sites chosen for pixel modulation are marked, and copy output with and without data embedding is presented to illustrate that the two are virtually indistinguishable.

Errors may creep into the data retrieval process if the grid described under Site selection is not constructed identically during the data embedding and retrieval phases. Typically, there may be small translation or scaling differences between the embedding and retrieval grids. This problem is countered by performing a multiple-grid search on the high-resolution scanned data. A series of site lists is constructed during preview processing by perturbing the segmentation parameters and moving the local coordinate systems by a couple of pixels along the horizontal and vertical directions. Message retrieval is then performed using these multiple-grid site lists.
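The rank-based assignment of coded bits to sites described in this section can be sketched as follows. A 4x4 matrix stands in for the 512x512 dither array, and the formula combining rank and tile index, the wrap-around repetition of the message, and the majority vote used for retrieval are all assumptions; the paper does not spell out these details and relies on an error-correcting code instead of a simple vote.

```python
# Small dispersed-dither rank matrix standing in for the 512x512 array.
RANKS = [[0, 8, 2, 10],
         [12, 4, 14, 6],
         [3, 11, 1, 9],
         [15, 7, 13, 5]]


def site_bit_index(row, col, n_cols, n_bits, ranks=RANKS):
    """Map a site's array position to an index into the coded bit stream."""
    n = len(ranks)
    tiles_per_row = -(-n_cols // n)  # ceiling division
    tile = (row // n) * tiles_per_row + (col // n)
    # Hypothetical combination of in-tile rank and tile index; the modulo
    # repeats the message once every bit has been assigned.
    return (ranks[row % n][col % n] + n * n * tile) % n_bits


def scatter_bits(bits, n_rows, n_cols):
    """Return the bit each site in an n_rows by n_cols array should carry."""
    return [[bits[site_bit_index(r, c, n_cols, len(bits))]
             for c in range(n_cols)] for r in range(n_rows)]


def gather_bits(site_bits, n_bits):
    """Recover the bit stream by majority vote over repeated sites."""
    n_cols = len(site_bits[0])
    ones, total = [0] * n_bits, [0] * n_bits
    for r, row in enumerate(site_bits):
        for c, bit in enumerate(row):
            k = site_bit_index(r, c, n_cols, n_bits)
            ones[k] += bit
            total[k] += 1
    return [1 if 2 * ones[k] > total[k] else 0 for k in range(n_bits)]


coded = [1, 0, 1, 1, 0]
layout = scatter_bits(coded, 8, 8)
layout[0][0] ^= 1  # simulate one misread site
recovered = gather_bits(layout, len(coded))
```

Because neighboring sites carry bits that are far apart in the stream, a localized defect on the page tends to damage isolated bits rather than a contiguous run.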

3. Conclusions

We have presented a robust method for imperceptibly embedding data in text documents. The embedded data can also be retrieved robustly. However, this algorithm does not directly preserve previously embedded information. The only way to achieve that is to first retrieve the embedded bits and then, possibly, append a summary of the retrieved message to the current message to be embedded. This is a weakness that continues to challenge all algorithms for data hiding. A further drawback of this method is that the scanned document may not offer enough sites to embed large messages. In this case, one of a series of messages with varying site requirements may need to be provided for embedding. The number of sites available for data embedding, however, increases with scanning and printing resolution.

4. References

[1] W. Bender, D. Gruhl, N. Morimoto, and A. Lu, "Techniques for data hiding," IBM Systems Journal, Vol. 35, Nos. 3 & 4.
[2] H. Ancin and A. K. Bhattacharjya, "Text enhancement for laser copiers," in Proceedings of IEEE ICIP '99, Kobe, Japan, Oct.
[3] H. Ancin, "Document Segmentation for High Quality Printing," IS&T/SPIE Symposium on Electronic Imaging: Science & Technology, Color Imaging: Device Independent Color, Color Hard Copy, and Graphic Arts II, February.
[4] W. K. Pratt, Digital Image Processing, John Wiley & Sons, Inc., New York, second edition.
[5] R. Ulichney, Digital Halftoning, The MIT Press, Cambridge, Massachusetts.

Figure 1: Preview processing for data embedding/retrieval. Stages: input image, segmentation, connected components labeling, deskew, block identification, site selection, site list.

Figure 2: Embedding data in the high-resolution scanned image. Inputs: site list, bits to be embedded, input image. Stages: identify site-list pixels, modulate pixel values, output image.

Figure 3: (a) Original (scanned) text. (b) Pixels corresponding to sites that will be modulated in luminance to hide information are shown in a different color. The word "the" is magnified to show more detail. (c) Print output containing embedded data. (d) Print output without embedded data.


More information

Review on Image Segmentation Techniques and its Types

Review on Image Segmentation Techniques and its Types 1 Review on Image Segmentation Techniques and its Types Ritu Sharma 1, Rajesh Sharma 2 Research Scholar 1 Assistant Professor 2 CT Group of Institutions, Jalandhar. 1 rits_243@yahoo.in, 2 rajeshsharma1234@gmail.com

More information

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ)

MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ) 5 MRT based Adaptive Transform Coder with Classified Vector Quantization (MATC-CVQ) Contents 5.1 Introduction.128 5.2 Vector Quantization in MRT Domain Using Isometric Transformations and Scaling.130 5.2.1

More information

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University.

3D Computer Vision. Structured Light II. Prof. Didier Stricker. Kaiserlautern University. 3D Computer Vision Structured Light II Prof. Didier Stricker Kaiserlautern University http://ags.cs.uni-kl.de/ DFKI Deutsches Forschungszentrum für Künstliche Intelligenz http://av.dfki.de 1 Introduction

More information
