Applying Preattentive Visual Guidance in Document Image Analysis

Size: px
Start display at page:

Download "Applying Preattentive Visual Guidance in Document Image Analysis"

Transcription

1 Applying Preattentive Visual Guidance in Document Image Analysis Di Wen 1,2, Xiaoqing Ding 1,2 1 Department of Electronic Engineering, Tsinghua University 2 State Key Laboratory of Intelligent Technology and Systems Beijing , P.R.China {wendi, dxq}@ocrserv.ee.tsinghua.edu.cn Abstract. In this paper, we present a novel methodology on document image analysis (DIA) which harnesses the mechanism of preattentive visual guidance in human vision. Summarizing the psychophysical research on preattentive vision, we suggest using two types of computations to simulate this biological process: the visual similarity clustering and visual saliency detection. Based on the computational implementation of these two processes, we develop a biological plausible way to guide the interpretation of document images, which is distinctively different from previous DIA methods. Experimental results prove the effectiveness of these two processes, whose outputs can be further utilized by any task-oriented DIA application. Keywords: Document image analysis, preattentive visual guidance, texture synthesis, dynamic clustering, visual saliency. 1 Introduction Detecting and segmenting semantic contents from document images is a challenging task. In recent years, there have been proposed dozens of matured algorithms in the document image analysis (DIA) domain, oriented at different application scenarios. Some of them are quite successful in automatically converting specific classes of paper-based documents in batches into their electronic counterparts [1, 2, 3, 4]. However, little attention has been paid to the adaptability of DIA methods while they are encountered with constantly switching environments, such as from simple layouts to complex layouts, from upright to geometric distorted images, from clean background to clutter background etc. As a result, current DIA systems are quite specialized for specific class of samples and quite demanding for image quality, which greatly reduce the usability of OCR techniques. In this paper, we propose a new attempt towards a generic DIA approach adapting to various cases. Our original idea comes from the mechanism of preattentive visual guidance in human vision. In the literature, most of the DIA methods process binary images by investigating simple geometric features. For example, the famous RLSA method [1] discriminates text and non-text by the distance between foreground pixels and extracted text contents after a run-length smearing preprocessing. But unfortunately, RLSA method

2 is sensitive to noise, page orientation and font size, making it clearly unsuitable for construct adaptive systems. Another bunch of methods, i.e., the connected component aggregation methods [5, 6] utilize alignment and spacing consistency of text components, performing a bottom-up hierarchical reconstruction of document layouts. These bottom-up methods can be well suited for segmenting arbitrary layouts only if the text line aggregation results are reliable. But this is hard to achieve by using geometric features only. The biggest problem of the bottom-up methods is their local merging scheme without any global directive. Furthermore, analysis on binary connected components is also unreliable in noisy, skewed and irregular documents. Limited by the simple geometric-based features, the bottom-up methods are inadequate to discriminate text and non-text contents robustly. In order to make full use of image appearance in document analysis, some researchers proposed the texture-based methods [7, 8, 9, 10], which model each homogeneous region in document images as a visual texture pattern. By extracting texture statistics as features, dynamic or static classifier techniques are applied to classify the image pixels as text or non-text. Frequently used texture features include Gabor responses [7], morphological masked pixel values [8, 10], wavelet coefficients [9] etc. Although the texture-based methods aim at modeling the visual appearance of different contents in document images, the texture features they used are always derived from mathematical perspectives and hence are not biological plausible. Therefore, it is hard to tell to what extent these texture features are capable to characterize unknown samples. As we know, model generality is very important to implement adaptability. Based on the above observation, we consider a new adaptive DIA solution from the biological plausible perspective. Summarizing the latest progress in psychophysical research on preattentive vision, we propose a novel computational model in this paper to simulate the preattentive visual guidance mechanism in human vision and apply it to document image analysis. The model contains two kinds of computation: visual similarity clustering (VSC) and visual saliency detection (VSD). The former simulates the categorical characteristics of preattentive vision to summarize homogeneous regions. And the latter simulates the visual center-surround characteristics to spot salient regions. These two parts of information can be combined to guide visual search of specific document contents in the following stage. Initial experimental results show that computation results of VSC and VSD are very adaptive and highly complementary, which is helpful for robust and efficient interpretation of various document images. The rest of the paper will be arranged as follows. In section 2, as a background of this paper, we introduce our computational model for preattentive visual guidance. Then the implementation of VSC and VSD computation is illustrated in details in section 3 and section 4 respectively. Experimental results are demonstrated in section 5, followed by conclusion and discussion in section 6. 2 Computational Model for Preattentive Visual Guidance Human vision is highly robust in capturing useful objects in a field of background distracter elements. Such adaptive power is attributed to the visual guidance

3 mechanism in the primary visual cortex. That is, the high-level cognitive functionality is not to be executed at every position in the scene. On the contrary, the visual guidance mechanism serves as a system bottle-neck to direct the interpretation of the whole scene [11]. Until now, most vision researchers agree that much work in visual guidance is contributed by the low-level neurons, which is based on simple visual properties and completed in very short instant [12]. Psychophysical experiments have shown that some visual searching task can be finished accurately within 200~250 milliseconds, which is too short for human eyes to move and pay serial attention. The visual properties affirmed in such experiments are called preattentive [12]. Pre-attentive visual feature Visual Similarity Clustering (VSC) Linear Filtering H 1 H 2 H M Homogeneous regions Selective Attention Attentive Cognition Input image Visual Saliency Detecting (VSD) F 1 F 2 F P Focus of attention S 1 S 2 S N Salient regions Attentive processing Preattentive processing Fig. 1. Our computational model for preattentive visual guidance in DIA In this paper, we notice two types of hints that can be discerned in the preattentive stage of document image analysis: visual similarity and visual saliency. The former was conjectured by Julesz and further proven by psychophysical and computational experiments [13, 14, 15]. And also, theories and initial computational models for the latter has also been proposed in recent years, contributed by Treisman, Ullman, Koch and Itti etc. [11, 16, 17, 18]. Motivated by these two streams of research efforts, we propose in this paper a computational model characterizing the preattentive visual guidance mechanism in document image analysis. As shown in figure 1, the input image is first decomposed by series of preattentive feature channels. Then these separated feature maps will go through two independent processes: visual similarity clustering (VSC) and visual saliency detection (VSD). The VSC process categorizes homogeneous regions by measuring visual similarity. On the other hand, the VSD process points out salient regions different from their neighbors. Both of these two processes compute only preattentive visual properties. In a document image, the VSC results are useful in aggregating homogeneous text contents; while the VSD results provides us with hints to find out conspicuous titles, separating lines, edges and

4 graphics etc. In the following stage named selective attention, oriented to specific object extraction task, a series of focuses of attention (FOA) will be determined by consulting the VSC and VSD results. The final attentive cognition of various document contents is executed in these selective FOAs serially. 3 Visual Similarity Clustering To discuss the subjective visual similarity perception, we must first find out the numerical way to measure it objectively. Before the studies on human vision system, this problem had no generic solution. The early characteristic features for texture were merely proposed by mathematical convenience or task dependant heuristics [19]. Later, physiological research on visual cortex revealed the spatial/frequency representation of image in human vision system. Such discovery inspired researches to use spatial/frequency localized filters as generic texture feature extractors. In 1995, Heeger and Bergen accomplished the first texture synthesis experiment by matching the histograms of image pyramids between the synthesized and target images [14]. Their experiments reveal that image pairs with matching histograms also share similar visual appearance. Later, Zhu etc. offered a stricter mathematical framework to ensure the convergence of histogram matching and further argued that marginal histogram statistics pooled from Gabor filters can serve as generic features to characterize various homogeneous textures [15, 20]. Based on Zhu s work, Liu developed the quantitative measurement of visual similarity between two image patterns, based on the χ 2 -distance of Gabor histogram features [19]. His work was applied in texture classification. In our work, we are interested to use visual similarity measurement in document image segmentation, which is a variant case of texture classification. So the first problem is how to characterize visual patterns in the document images using representative texture features. And second, the similarity-based segmentation should be self-adaptive to various document contents. Since the conceptual texture patterns can not be statically defined among different document images, the best way to accomplish adaptability is through dynamic clustering. Therefore, our major implementation problem in VSC is: first, to select a series of representative filters and histogram bins to extract the Gabor histogram features from document images; and second, to derive a mathematical plausible way to cluster the features so as to obtain adaptive segmentation results. To solve the first problem, we must investigate in what scale that the document contents will illustrate homogeneous visual textures. Generally speaking, font sizes of the perceptible text contents in document images vary from 2 pts to 32 pts (we regard that characters with sizes bigger than 32 pts can not be treated as homogeneous texture). In the very low resolution, text contents demonstrate line pattern and character blob pattern. Along with the increase of resolution, the character stroke pattern will gradually emerge. These three patterns can be well captured by Gabor and Laplacian of Gaussian filters. Therefore, we first prepare a bank of filters containing Gabor filters (annotated as G) and Laplacian of Gaussian filters (annotated as LoG)

5 covering consecutive scales and orientations in frequency domain. The mathematical expressions of these two kinds of filters are as follows: 1 Gabor x y T x y x y 2T 2π exp{ j ( xcosθ + ysin θ)} T 2 2 (,, θ) = exp{ [4( cosθ + sin θ) + ( sinθ + cos θ ) ]} 2 (1) x + y LoG( x, y T ) = C( x + y T )exp( ) (2) 2 T To capture the three typical texture patterns of text contents from 2 pts to 32 pts, we choose parameters T = 2,2,4,6,8,10 and θ =0,45,90,135, which results in a filter bank consisting 24 G filters (only use cosine components) and 6 LoG filters. To further select representative filters from these 30 filters, a visual similarity testing experiment is performed. That is, we choose a sufficient set from, with whose histograms we can fully characterize the visual appearance of the referenced image I obs, which is picked up as typical visual patterns in document images. The filter selection process is presented in [21], in which we followed the Minimax entropy principle and used Markov Chain Monte Carlo sampling [15] to match the histograms between the synthesized and original images. By matching the histograms from more and more filter channels, the synthesized image become more and more similar to the referenced image. The experiment finally selects the following 13 filters to form the representative filter set : 1. G(T, θ), T = 2, 4, and θ= 0 º, 45 º, 90 º, 135 º ; 2. G(T, θ), T = 6, and θ= 0 º, 90 º ; 3. LoG(T), T = 2, 4, Pattern1 Pattern2 Pattern Pattern1 Pattern2 Pattern Pattern1 Pattern2 Pattern3 0 0 Fig. 2. GHF of different texture patterns, with the concatenated histograms of all 13 channels and the magnified histograms in two particular channels (G(4,90º) and LoG(4)).

6 To make the histograms extracted from different images comparable, we normalize the responses of each filtered image to the fixed range [-1, 1] before pooling histograms. With 11 bins of histograms extracted for each filter channel, we construct the 143-dim Gabor histogram feature (GHF) vector for each input image. Figure 3 compares the GHF vectors of different visual patterns found in document images. In our image segmentation experiment, the GHF vector H υ in the site υ is calculated within a window surroundingυ. The window size we choose is 32. For the second problem, we define the distance metric between two GHF vectors as their Euclidian distance, just for the convenience to perform K-means clustering. () i () i 2 υ1 υ = 2 υ 1 υ2 i DH (, H ) ( ( h h ) ) Another clustering parameter, that is, the initial class number K is set to 4 in our experiment. This is empirically determined by observing that there are usually 4 types of contents in a document image: text, photograph, line drawing and white space. And also experiments have shown that setting K>4 will cause the homogeneous class corresponding to text contents to split into smaller classes. Therefore by setting K=4 initially we can conserve the homogeneity of text regions. As for the simple plain documents which probably contain less than 4 distinctive classes of visual patterns, we allow the clustering procedure to drop the empty classes. After setting the distance measurement and initial state, the K-means clustering is ready to run. The clustered results will be dynamic, depending on the specific contents of different document samples. Therefore, we also need to identify which class in the clustered results is correspondent to the text contents. To solve the uncertainty caused by unsupervised clustering, it is necessary to introduce supervised constraints. To our observation in the experiments, texture features in the main body text regions usually demonstrate higher energy in the Gabor filter responses. Therefore, we calculate the following texture energy for each GHF vector as follows: E υ i= 1 () i υ 1 2 (3) = E (4) () i Here, E υ refers to the variance of filter responses in the ith channel, which can be computed through the histogram. Then the texture energy E k for the whole class of GHF vectors can be further estimated by counting the most frequent texture energy. By this means we can sort the K classes in descending order according to their texture energies (i.e., class 1 has the maximal texture energy). We select class 1 and class 2 as the candidate classes for text contents. In our experiments, for most tested document images, text contents occupy one of these two clustered classes. 4 Visual Saliency Detection As compared with the top-down categorical VSC process, the VSD process undertakes a bottom-up investigation on how different a site is from its neighbors.

7 Here we use the computational architecture proposed by Itti [18] to obtain a salient map of the input image. In [18], Itti computed saliency in three independent feature channels: the color channel, the intensity channel and the orientation channel. In our work, since the similarity of Gabor orientation features has already been investigated in the VSC process, we carry out visual saliency detection in only two feature channels: the color and the intensity channels (for gray scale image, only intensity channel is calculated). The input image is first decomposed into one intensity and four color channels. Then multi-scale representation for each channel is constructed, using a Gaussian pyramid. The center-surround difference is calculated between different coarse scales (surrounded values) and fine scales (centered values), resulting in 12 feature maps in the color channel and 6 feature maps in the intensity channel. Finally, the color salient map, the intensity salient map and the overall salient map are calculated respectively by normalizing and combining these feature maps. Computational details can be found in [18]. 5 Experimental results We perform several groups of experiments to test the adaptability of VSC and VSD in providing homogeneous regions and salient regions among various document images. These two kinds of information can be further utilized by any task-oriented module to detect specific contents from document images. For example, one who is interested in extracting text lines can access the homogeneous regions in the VSC results. Another one looking for titles, separating lines, edges and graphics etc. can access the salient regions in the VSD results. In the VSC process, it is obvious that with more semantic homogeneous contents categorized into the same class, the more layout segmentation can benefit from it. On the other hand, the more the salient regions are separated from the homogenous regions, the easier it is to find out the salient objects. Therefore, we pay attention to two criteria in evaluating the performance of our visual preattentive guidance computation: the region homogeneity in VSC and the complementary extent between the VSC and VSD results. The first experiment is for complex newspaper images. Figure 4 shows the clustered results by VSC for 3 newspaper images scanned in 150 dpi. 4-class segmentation results are obtained for each sample, from which we can see that: the main body text contents in each sample occupy a major visual class stably. In the English sample, they belong to class 2 (dark gray); while in the Chinese samples, they belong to class 1 (black). It can be easily explained that when there is strong periodic texture pattern in the image (e.g., halftone or background texture), text contents will not occupy the first class. Otherwise, they will appear in the first class. In figure 5, the pixels clustered as text contents are picked up separately to see the homogeneity of the clustered results particularly. As we see, the majority of text contents are successfully segmented. It should be mentioned that we have not added any spatial continuity constraints in the clustering; while the homogeneity is still satisfying, which indicate that our GHF features can really reflect visual similarity. It is interesting to see that the halftone patterns in the photographs also make them stand out as a unique visual class.

8 (a) (b) (c) Fig. 3. Segmentation results in the first experiment, using K=4. The 4 pixel values: black, dark gray, light gray and white represent 4 clustered classes, sorted by their texture energy from high to low. (a) Segmentation result of a complex Chinese newspaper, with the body text occupying class 1. (b) Segmentation result of an English newspaper, with the body text occupying class 2. (c) Segmentation result of another Chinese newspaper, less complex, with the body text occupying class 1. The second experiment is for simple plain document images. The experiment is repeated in both up-right and skewed samples to test the adaptability of VSC to

9 (a) (b) (c) Fig. 4. Text contents extracted from the segmentation results in figure 4. (a) (b) (c) (d) Fig. 5. Segmentation results for simple plain documents. The algorithm automatically reduced class number K to 2 in order to adapt the simple contents. skewness. Figure 6 shows the clustered results. Notice that they have been reduced to include only 2 classes. The third experiment compares the VSC and VSD results in the same image. Figure 7a shows the salient map calculated for a newspaper image. The gray scale values in it indicate the salient values detected in these pixels. Figure 7b shows a thresholded version of 7a. We can see that the main titles, separating lines, edges and boundaries pop out in the results. As compared with the homogenous regions shown in figure 7c (i.e., the same results in figure 5c), the VSD results are highly complementary to the VSC results and they reflect the discontinuous changes and highlights in the image.

10 (a) (b) (c) Fig. 6. Comparison of the VSD and VSC results. (a) the salient map calculated by VSD; (b) the thresholded version of (a); (c) the homogeneous text contents segmented from the VSC results. 6 Conclusion and Future Work We have demonstrated a computational method to implement preattentive visual guidance in document image analysis. Our ultimate goal is to achieve real adaptability for target segmentation in any type of document samples. The VSC computation is thus proposed to categorize similar contents in the image. And the VSD computation is introduced to simulate the detection of salient regions. Both of these two processes are proposed based on the current discovery from psychophysics experiments. Initial experiments show that the VSC process is able to cluster the image contents into visual homogenous regions, especially for the main body text contents. The clustered results are quite stable in distinctively different document samples. And the VSD results reveal the salient regions in document images, corresponding to major titles, separating lines and edges etc. Both results can be further utilized in a specific DIA task to extract and interpret different types of semantic contents. Being undertaken in a data-driven manner, these two processes both have the inherent potential to implement adaptability and the experimental results also support this fact. Our future work will focus on two problems. The first is to develop more efficient visual similarity clustering algorithm. Since the current normalization method in VSC tends to diminish the deference between histograms, better normalization method and distance metric are needed to improve the numerical characterization of visual similarity. The second is to add more user-driven heuristic after the preattentive stage to extract task-oriented contents from the VSC and VSD results, with which the preattentive visual guidance can really benefit the adaptability of document image analysis. This work is supported by the National Natural Science Foundation of China (project ).

11 References [1] Wong, K.Y., R.G. Casey, and F.M. Wahl: Document Analysis System. IBM Journal Res. Develop(1982). 26(6): [2] Ittner, D.J. and H.S. Baird. Language-free layout analysis. in Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93), Oct. 1993, Tsukuba Science City, Japan, 1993//, 1993, pp [3] Tang, Y.Y., S.-W. Lee, and C.Y. Suen: Automatic document processing: a survey. Pattern Recognition(1996). 29(12): [4] Nagy, G., S. Seth, and M. Viswanathan: A prototype document image analysis system for technical journals. IEEE Computer(1992). 25(7): [5] Drivas, D. and A.Amin. Page Segmentation and Classification Utilizing Bottom-Up Approach. in Proceedings of the third International Conference on Document Analysis and Recognition, Aug , 1995, pp [6] Liang, J., I.T. Phillips, and R.M. Haralick: An optimization methodology for document structure extraction on Latin character documents. IEEE Trans on Pattern Analysis and Machine Intelligence(2001). 23(7): [7] Jain, A.K. and S. Bhattacharjee: Text Segment Using Gabor Filters for Automatic Document Processing. Machine Vision and Applications(1992). 5(3): [8] Jain, A.K. and Y. Zhong: Page Segmentation Using Texture Analysis. Pattern Recognition(1996). 29(5): [9] Li, J. and R.M. Gray: Context-Based Multiscale Classification of Document Images Using Wavelet Coefficient Distributions. IEEE Trans on Image Processing(2000). 9(9): [10] Chen, J.-L.: A simplified approach to the HMM based texture analysis and its application to document segmentation. Pattern Recognition Letters(1997). 18(10): [11] Itti, L. and K. C.: Computional Modeling of Visual Attention. Nature Reviews Neuroscience(2001). 2(3): p [12] Healey, C.G., K.S. Booth, and J.T. Enns: High-speed visual estimation using preattentive processing. ACM Transactions on Computer-Human Interaction (TOCHI)(1996). 3(2): [13] Julesz, B.: Visual pattern discrimination. IRE Transaction of Information Theory(1962)(IT-8): [14] Heeger, D.J. and J.R. Bergen. Pyramid-based texture analysis/synthesis. in Computer Graphics Proceedings. SIGGRAPH 95, Los Angeles, CA, USA, 6-11 Aug. 1995, 1995, pp. p [15] Zhu, S.C., Y.N. Wu, and D. Mumford: Minimax Entropy Principle and Its Application to Texture Modeling. Neural Computation(1997). 9(8): [16] Treisman, A. and G. Gelade: A feature integration theory of attention. Cognitive Psychology(1980). 12(2): [17] Koch C. and U. S.: Shifts in selective visual attention: towards the underlying neural circuitry. Human Neurobiology(1985). 4(4): [18] Itti, L., C. Koch, and E. Niebur: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans on Pattern Analysis and Machine Intelligence(1998). 20(11): [19] Liu, X. and D. Wang: Texture Classification Using Spectral Histogram. IEEE Trans on Image Processing(2003). 12(6): [20] Zhu, S.C., Y.N. Wu, and D.B.Mumford: FRAME : Filters, Random fields And Maximum Entropy -- towards a unified theory for texture modeling. International Journal of Computer Vision(1998). 27(3): [21] Wen, D. and X. Ding: Visual similarity based document layout analysis. To be appeared in Journal of Computer Science and Technology(2006). 21(3).

Layout Segmentation of Scanned Newspaper Documents

Layout Segmentation of Scanned Newspaper Documents , pp-05-10 Layout Segmentation of Scanned Newspaper Documents A.Bandyopadhyay, A. Ganguly and U.Pal CVPR Unit, Indian Statistical Institute 203 B T Road, Kolkata, India. Abstract: Layout segmentation algorithms

More information

CHAPTER 4 SPECTRAL HISTOGRAM: A GENERIC FEATURE FOR IMAGES

CHAPTER 4 SPECTRAL HISTOGRAM: A GENERIC FEATURE FOR IMAGES CHAPTER 4 SPECTRAL HISTOGRAM: A GENERIC FEATURE FOR IMAGES In this chapter, we propose a generic statistic feature for homogeneous texture images, which we call spectral histograms. A similarity measure

More information

Multi-scale Techniques for Document Page Segmentation

Multi-scale Techniques for Document Page Segmentation Multi-scale Techniques for Document Page Segmentation Zhixin Shi and Venu Govindaraju Center of Excellence for Document Analysis and Recognition (CEDAR), State University of New York at Buffalo, Amherst

More information

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images Karthik Ram K.V & Mahantesh K Department of Electronics and Communication Engineering, SJB Institute of Technology, Bangalore,

More information

Texture Image Segmentation using FCM

Texture Image Segmentation using FCM Proceedings of 2012 4th International Conference on Machine Learning and Computing IPCSIT vol. 25 (2012) (2012) IACSIT Press, Singapore Texture Image Segmentation using FCM Kanchan S. Deshmukh + M.G.M

More information

Parametric Texture Model based on Joint Statistics

Parametric Texture Model based on Joint Statistics Parametric Texture Model based on Joint Statistics Gowtham Bellala, Kumar Sricharan, Jayanth Srinivasa Department of Electrical Engineering, University of Michigan, Ann Arbor 1. INTRODUCTION Texture images

More information

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C.

More information

An Improved Image Resizing Approach with Protection of Main Objects

An Improved Image Resizing Approach with Protection of Main Objects An Improved Image Resizing Approach with Protection of Main Objects Chin-Chen Chang National United University, Miaoli 360, Taiwan. *Corresponding Author: Chun-Ju Chen National United University, Miaoli

More information

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques

Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Text Information Extraction And Analysis From Images Using Digital Image Processing Techniques Partha Sarathi Giri Department of Electronics and Communication, M.E.M.S, Balasore, Odisha Abstract Text data

More information

Dynamic visual attention: competitive versus motion priority scheme

Dynamic visual attention: competitive versus motion priority scheme Dynamic visual attention: competitive versus motion priority scheme Bur A. 1, Wurtz P. 2, Müri R.M. 2 and Hügli H. 1 1 Institute of Microtechnology, University of Neuchâtel, Neuchâtel, Switzerland 2 Perception

More information

On Segmentation of Documents in Complex Scripts

On Segmentation of Documents in Complex Scripts On Segmentation of Documents in Complex Scripts K. S. Sesh Kumar, Sukesh Kumar and C. V. Jawahar Centre for Visual Information Technology International Institute of Information Technology, Hyderabad, India

More information

Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig

Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig Vienna University of Technology, Institute of Computer Aided Automation, Pattern Recognition and Image Processing

More information

The Vehicle Logo Location System based on saliency model

The Vehicle Logo Location System based on saliency model ISSN 746-7659, England, UK Journal of Information and Computing Science Vol. 0, No. 3, 205, pp. 73-77 The Vehicle Logo Location System based on saliency model Shangbing Gao,2, Liangliang Wang, Hongyang

More information

Image Classification Using Wavelet Coefficients in Low-pass Bands

Image Classification Using Wavelet Coefficients in Low-pass Bands Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan

More information

Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis

Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis N.Padmapriya, Ovidiu Ghita, and Paul.F.Whelan Vision Systems Laboratory,

More information

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences Published in Computational Methods in Neural Modeling. (In: Lecture Notes in Computer Science) 2686, vol. 1, 702-709, 2003 which should be used for any reference to this work 1 A Model of Dynamic Visual

More information

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS Kirthiga, M.E-Communication system, PREC, Thanjavur R.Kannan,Assistant professor,prec Abstract: Face Recognition is important

More information

Scene Text Detection Using Machine Learning Classifiers

Scene Text Detection Using Machine Learning Classifiers 601 Scene Text Detection Using Machine Learning Classifiers Nafla C.N. 1, Sneha K. 2, Divya K.P. 3 1 (Department of CSE, RCET, Akkikkvu, Thrissur) 2 (Department of CSE, RCET, Akkikkvu, Thrissur) 3 (Department

More information

Graph Matching Iris Image Blocks with Local Binary Pattern

Graph Matching Iris Image Blocks with Local Binary Pattern Graph Matching Iris Image Blocs with Local Binary Pattern Zhenan Sun, Tieniu Tan, and Xianchao Qiu Center for Biometrics and Security Research, National Laboratory of Pattern Recognition, Institute of

More information

TEXT DETECTION AND RECOGNITION IN CAMERA BASED IMAGES

TEXT DETECTION AND RECOGNITION IN CAMERA BASED IMAGES TEXT DETECTION AND RECOGNITION IN CAMERA BASED IMAGES Mr. Vishal A Kanjariya*, Mrs. Bhavika N Patel Lecturer, Computer Engineering Department, B & B Institute of Technology, Anand, Gujarat, India. ABSTRACT:

More information

Text Area Detection from Video Frames

Text Area Detection from Video Frames Text Area Detection from Video Frames 1 Text Area Detection from Video Frames Xiangrong Chen, Hongjiang Zhang Microsoft Research China chxr@yahoo.com, hjzhang@microsoft.com Abstract. Text area detection

More information

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM 1 PHYO THET KHIN, 2 LAI LAI WIN KYI 1,2 Department of Information Technology, Mandalay Technological University The Republic of the Union of Myanmar

More information

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi Journal of Asian Scientific Research, 013, 3(1):68-74 Journal of Asian Scientific Research journal homepage: http://aessweb.com/journal-detail.php?id=5003 FEATURES COMPOSTON FOR PROFCENT AND REAL TME RETREVAL

More information

A Quantitative Approach for Textural Image Segmentation with Median Filter

A Quantitative Approach for Textural Image Segmentation with Median Filter International Journal of Advancements in Research & Technology, Volume 2, Issue 4, April-2013 1 179 A Quantitative Approach for Textural Image Segmentation with Median Filter Dr. D. Pugazhenthi 1, Priya

More information

2 Proposed Methodology

2 Proposed Methodology 3rd International Conference on Multimedia Technology(ICMT 2013) Object Detection in Image with Complex Background Dong Li, Yali Li, Fei He, Shengjin Wang 1 State Key Laboratory of Intelligent Technology

More information

Automatic Detection of Texture Defects using Texture-Periodicity and Gabor Wavelets

Automatic Detection of Texture Defects using Texture-Periodicity and Gabor Wavelets Abstract Automatic Detection of Texture Defects using Texture-Periodicity and Gabor Wavelets V Asha 1, N U Bhajantri, and P Nagabhushan 3 1 New Horizon College of Engineering, Bangalore, Karnataka, India

More information

Latest development in image feature representation and extraction

Latest development in image feature representation and extraction International Journal of Advanced Research and Development ISSN: 2455-4030, Impact Factor: RJIF 5.24 www.advancedjournal.com Volume 2; Issue 1; January 2017; Page No. 05-09 Latest development in image

More information

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes Renwu Gao 1, Faisal Shafait 2, Seiichi Uchida 3, and Yaokai Feng 3 1 Information Sciene and Electrical Engineering, Kyushu

More information

Automatically Algorithm for Physician s Handwritten Segmentation on Prescription

Automatically Algorithm for Physician s Handwritten Segmentation on Prescription Automatically Algorithm for Physician s Handwritten Segmentation on Prescription Narumol Chumuang 1 and Mahasak Ketcham 2 Department of Information Technology, Faculty of Information Technology, King Mongkut's

More information

Visible and Long-Wave Infrared Image Fusion Schemes for Situational. Awareness

Visible and Long-Wave Infrared Image Fusion Schemes for Situational. Awareness Visible and Long-Wave Infrared Image Fusion Schemes for Situational Awareness Multi-Dimensional Digital Signal Processing Literature Survey Nathaniel Walker The University of Texas at Austin nathaniel.walker@baesystems.com

More information

Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model

Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model Yiqun Hu 2, Xing Xie 1, Wei-Ying Ma 1, Liang-Tien Chia 2 and Deepu Rajan 2 1 Microsoft Research Asia 5/F Sigma

More information

A Modified Approach to Biologically Motivated Saliency Mapping

A Modified Approach to Biologically Motivated Saliency Mapping A Modified Approach to Biologically Motivated Saliency Mapping Shane Grant Department of Computer Science University of California, San Diego La Jolla, CA 9093 wgrant@ucsd.edu Kevin A Heins Department

More information

A Hierarchial Model for Visual Perception

A Hierarchial Model for Visual Perception A Hierarchial Model for Visual Perception Bolei Zhou 1 and Liqing Zhang 2 1 MOE-Microsoft Laboratory for Intelligent Computing and Intelligent Systems, and Department of Biomedical Engineering, Shanghai

More information

Contextual priming for artificial visual perception

Contextual priming for artificial visual perception Contextual priming for artificial visual perception Hervé Guillaume 1, Nathalie Denquive 1, Philippe Tarroux 1,2 1 LIMSI-CNRS BP 133 F-91403 Orsay cedex France 2 ENS 45 rue d Ulm F-75230 Paris cedex 05

More information

Segmentation of Images

Segmentation of Images Segmentation of Images SEGMENTATION If an image has been preprocessed appropriately to remove noise and artifacts, segmentation is often the key step in interpreting the image. Image segmentation is a

More information

Visual Learning with Explicit and Implicit Manifolds

Visual Learning with Explicit and Implicit Manifolds Visual Learning with Explicit and Implicit Manifolds --- A Mathematical Model of, Texton, and Primal Sketch Song-Chun Zhu Departments of Statistics and Computer Science University of California, Los Angeles

More information

A Model-based Line Detection Algorithm in Documents

A Model-based Line Detection Algorithm in Documents A Model-based Line Detection Algorithm in Documents Yefeng Zheng, Huiping Li, David Doermann Laboratory for Language and Media Processing Institute for Advanced Computer Studies University of Maryland,

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

Automatic Shadow Removal by Illuminance in HSV Color Space

Automatic Shadow Removal by Illuminance in HSV Color Space Computer Science and Information Technology 3(3): 70-75, 2015 DOI: 10.13189/csit.2015.030303 http://www.hrpub.org Automatic Shadow Removal by Illuminance in HSV Color Space Wenbo Huang 1, KyoungYeon Kim

More information

Automated Classification of Quilt Photographs Into Crazy and Noncrazy

Automated Classification of Quilt Photographs Into Crazy and Noncrazy Automated Classification of Quilt Photographs Into Crazy and Noncrazy Alhaad Gokhale a and Peter Bajcsy b a Dept. of Computer Science and Engineering, Indian institute of Technology, Kharagpur, India;

More information

TEXTURE ANALYSIS USING GABOR FILTERS

TEXTURE ANALYSIS USING GABOR FILTERS TEXTURE ANALYSIS USING GABOR FILTERS Texture Types Definition of Texture Texture types Synthetic Natural Stochastic < Prev Next > Texture Definition Texture: the regular repetition of an element or pattern

More information

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014)

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014) I J E E E C International Journal of Electrical, Electronics ISSN No. (Online): 2277-2626 Computer Engineering 3(2): 85-90(2014) Robust Approach to Recognize Localize Text from Natural Scene Images Khushbu

More information

Texture Segmentation Using Gabor Filters

Texture Segmentation Using Gabor Filters TEXTURE SEGMENTATION USING GABOR FILTERS 1 Texture Segmentation Using Gabor Filters Khaled Hammouda Prof. Ed Jernigan University of Waterloo, Ontario, Canada Abstract Texture segmentation is the process

More information

Applying Catastrophe Theory to Image Segmentation

Applying Catastrophe Theory to Image Segmentation Applying Catastrophe Theory to Image Segmentation Mohamad Raad, Majd Ghareeb, Ali Bazzi Department of computer and communications engineering Lebanese International University Beirut, Lebanon Abstract

More information

Text Enhancement with Asymmetric Filter for Video OCR. Datong Chen, Kim Shearer and Hervé Bourlard

Text Enhancement with Asymmetric Filter for Video OCR. Datong Chen, Kim Shearer and Hervé Bourlard Text Enhancement with Asymmetric Filter for Video OCR Datong Chen, Kim Shearer and Hervé Bourlard Dalle Molle Institute for Perceptual Artificial Intelligence Rue du Simplon 4 1920 Martigny, Switzerland

More information

Segmentation Framework for Multi-Oriented Text Detection and Recognition

Segmentation Framework for Multi-Oriented Text Detection and Recognition Segmentation Framework for Multi-Oriented Text Detection and Recognition Shashi Kant, Sini Shibu Department of Computer Science and Engineering, NRI-IIST, Bhopal Abstract - Here in this paper a new and

More information

OCR For Handwritten Marathi Script

OCR For Handwritten Marathi Script International Journal of Scientific & Engineering Research Volume 3, Issue 8, August-2012 1 OCR For Handwritten Marathi Script Mrs.Vinaya. S. Tapkir 1, Mrs.Sushma.D.Shelke 2 1 Maharashtra Academy Of Engineering,

More information

Color Local Texture Features Based Face Recognition

Color Local Texture Features Based Face Recognition Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India

More information

Bipartite Graph Partitioning and Content-based Image Clustering

Bipartite Graph Partitioning and Content-based Image Clustering Bipartite Graph Partitioning and Content-based Image Clustering Guoping Qiu School of Computer Science The University of Nottingham qiu @ cs.nott.ac.uk Abstract This paper presents a method to model the

More information

Saliency Detection for Videos Using 3D FFT Local Spectra

Saliency Detection for Videos Using 3D FFT Local Spectra Saliency Detection for Videos Using 3D FFT Local Spectra Zhiling Long and Ghassan AlRegib School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA ABSTRACT

More information

Texture Analysis: Representation and Matching

Texture Analysis: Representation and Matching Textures Texture Analysis: Representation and Matching Anil K. Jain and Kalle Karu Department of Computer Science Michigan State University East Lansing, MI 48824, USA jaln@cps.msu.edu, karukall@cps.msu.edu

More information

Image Segmentation Techniques

Image Segmentation Techniques A Study On Image Segmentation Techniques Palwinder Singh 1, Amarbir Singh 2 1,2 Department of Computer Science, GNDU Amritsar Abstract Image segmentation is very important step of image analysis which

More information

Wavelet Applications. Texture analysis&synthesis. Gloria Menegaz 1

Wavelet Applications. Texture analysis&synthesis. Gloria Menegaz 1 Wavelet Applications Texture analysis&synthesis Gloria Menegaz 1 Wavelet based IP Compression and Coding The good approximation properties of wavelets allow to represent reasonably smooth signals with

More information

Quasi-thematic Features Detection & Tracking. Future Rover Long-Distance Autonomous Navigation

Quasi-thematic Features Detection & Tracking. Future Rover Long-Distance Autonomous Navigation Quasi-thematic Feature Detection And Tracking For Future Rover Long-Distance Autonomous Navigation Authors: Affan Shaukat, Conrad Spiteri, Yang Gao, Said Al-Milli, and Abhinav Bajpai Surrey Space Centre,

More information

International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015)

International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015) International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015) Brief Analysis on Typical Image Saliency Detection Methods Wenwen Pan, Xiaofei Sun, Xia Wang, Wei Zhang

More information

Artifacts and Textured Region Detection

Artifacts and Textured Region Detection Artifacts and Textured Region Detection 1 Vishal Bangard ECE 738 - Spring 2003 I. INTRODUCTION A lot of transformations, when applied to images, lead to the development of various artifacts in them. In

More information

DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM

DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM DATA EMBEDDING IN TEXT FOR A COPIER SYSTEM Anoop K. Bhattacharjya and Hakan Ancin Epson Palo Alto Laboratory 3145 Porter Drive, Suite 104 Palo Alto, CA 94304 e-mail: {anoop, ancin}@erd.epson.com Abstract

More information

Analysis and Synthesis of Texture

Analysis and Synthesis of Texture Analysis and Synthesis of Texture CMPE 264: Image Analysis and Computer Vision Spring 02, Hai Tao 31/5/02 Extracting image structure by filter banks Q Represent image textures using the responses of a

More information

Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines

Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines 2011 International Conference on Document Analysis and Recognition Binarization of Color Character Strings in Scene Images Using K-means Clustering and Support Vector Machines Toru Wakahara Kohei Kita

More information

6. Dicretization methods 6.1 The purpose of discretization

6. Dicretization methods 6.1 The purpose of discretization 6. Dicretization methods 6.1 The purpose of discretization Often data are given in the form of continuous values. If their number is huge, model building for such data can be difficult. Moreover, many

More information

A Spectral Histogram Model for Textons and Texture Discrimination

A Spectral Histogram Model for Textons and Texture Discrimination A Spectral Histogram Model for Textons and Texture Discrimination Xiuwen Liu DeLiang Wang Department of Computer Science Department of Computer & Information Science P.O. Box 4530 Center for Cognitive

More information

Color-Based Classification of Natural Rock Images Using Classifier Combinations

Color-Based Classification of Natural Rock Images Using Classifier Combinations Color-Based Classification of Natural Rock Images Using Classifier Combinations Leena Lepistö, Iivari Kunttu, and Ari Visa Tampere University of Technology, Institute of Signal Processing, P.O. Box 553,

More information

Handwritten Script Recognition at Block Level

Handwritten Script Recognition at Block Level Chapter 4 Handwritten Script Recognition at Block Level -------------------------------------------------------------------------------------------------------------------------- Optical character recognition

More information

Region-based Segmentation

Region-based Segmentation Region-based Segmentation Image Segmentation Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation. Applications: Finding tumors, veins, etc.

More information

Texture Classification Using Spectral Histograms

Texture Classification Using Spectral Histograms IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 12, NO. 6, JUNE 2003 661 Texture Classification Using Spectral Histograms Xiuwen Liu, Senior Member, IEEE, and DeLiang Wang, Senior Member, IEEE Abstract Based

More information

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009 Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer

More information

Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task

Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task Neurally Inspired Mechanisms for the Dynamic Visual Attention Map Generation Task Maria T. López 1, Miguel A. Fernández 1, Antonio Fernández-Caballero 1, and Ana E. Delgado 2 1 Departamento de Informática

More information

Texture April 17 th, 2018

Texture April 17 th, 2018 Texture April 17 th, 2018 Yong Jae Lee UC Davis Announcements PS1 out today Due 5/2 nd, 11:59 pm start early! 2 Review: last time Edge detection: Filter for gradient Threshold gradient magnitude, thin

More information

A Hierarchical Face Identification System Based on Facial Components

A Hierarchical Face Identification System Based on Facial Components A Hierarchical Face Identification System Based on Facial Components Mehrtash T. Harandi, Majid Nili Ahmadabadi, and Babak N. Araabi Control and Intelligent Processing Center of Excellence Department of

More information

A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish

A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish Introduction/Motivation: As autonomous vehicles, such as Google s self-driving car, have recently become

More information

I. INTRODUCTION. Figure-1 Basic block of text analysis

I. INTRODUCTION. Figure-1 Basic block of text analysis ISSN: 2349-7637 (Online) (RHIMRJ) Research Paper Available online at: www.rhimrj.com Detection and Localization of Texts from Natural Scene Images: A Hybrid Approach Priyanka Muchhadiya Post Graduate Fellow,

More information

Video annotation based on adaptive annular spatial partition scheme

Video annotation based on adaptive annular spatial partition scheme Video annotation based on adaptive annular spatial partition scheme Guiguang Ding a), Lu Zhang, and Xiaoxu Li Key Laboratory for Information System Security, Ministry of Education, Tsinghua National Laboratory

More information

An Improved Texture Synthesis Algorithm Using Morphological Processing with Image Analogy

An Improved Texture Synthesis Algorithm Using Morphological Processing with Image Analogy An Improved Texture Synthesis Algorithm Using Morphological Processing with Image Analogy Jiang Ni Henry Schneiderman CMU-RI-TR-04-52 October 2004 Robotics Institute Carnegie Mellon University Pittsburgh,

More information

Schedule for Rest of Semester

Schedule for Rest of Semester Schedule for Rest of Semester Date Lecture Topic 11/20 24 Texture 11/27 25 Review of Statistics & Linear Algebra, Eigenvectors 11/29 26 Eigenvector expansions, Pattern Recognition 12/4 27 Cameras & calibration

More information

TEXT SEGMENTATION ON PHOTOREALISTIC IMAGES

TEXT SEGMENTATION ON PHOTOREALISTIC IMAGES TEXT SEGMENTATION ON PHOTOREALISTIC IMAGES Valery Grishkin a, Alexander Ebral b, Nikolai Stepenko c, Jean Sene d Saint Petersburg State University, 7 9 Universitetskaya nab., Saint Petersburg, 199034,

More information

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain

Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Automatic Video Caption Detection and Extraction in the DCT Compressed Domain Chin-Fu Tsao 1, Yu-Hao Chen 1, Jin-Hau Kuo 1, Chia-wei Lin 1, and Ja-Ling Wu 1,2 1 Communication and Multimedia Laboratory,

More information

Dynamic Human Fatigue Detection Using Feature-Level Fusion

Dynamic Human Fatigue Detection Using Feature-Level Fusion Dynamic Human Fatigue Detection Using Feature-Level Fusion Xiao Fan, Bao-Cai Yin, and Yan-Feng Sun Beijing Key Laboratory of Multimedia and Intelligent Software, College of Computer Science and Technology,

More information

Available Online through

Available Online through Available Online through www.ijptonline.com ISSN: 0975-766X CODEN: IJPTFI Research Article ANALYSIS OF CT LIVER IMAGES FOR TUMOUR DIAGNOSIS BASED ON CLUSTERING TECHNIQUE AND TEXTURE FEATURES M.Krithika

More information

A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points

A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points A Method of Annotation Extraction from Paper Documents Using Alignment Based on Local Arrangements of Feature Points Tomohiro Nakai, Koichi Kise, Masakazu Iwamura Graduate School of Engineering, Osaka

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an

More information

Short Survey on Static Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of

More information

Texture Segmentation by Windowed Projection

Texture Segmentation by Windowed Projection Texture Segmentation by Windowed Projection 1, 2 Fan-Chen Tseng, 2 Ching-Chi Hsu, 2 Chiou-Shann Fuh 1 Department of Electronic Engineering National I-Lan Institute of Technology e-mail : fctseng@ccmail.ilantech.edu.tw

More information

An ICA based Approach for Complex Color Scene Text Binarization

An ICA based Approach for Complex Color Scene Text Binarization An ICA based Approach for Complex Color Scene Text Binarization Siddharth Kherada IIIT-Hyderabad, India siddharth.kherada@research.iiit.ac.in Anoop M. Namboodiri IIIT-Hyderabad, India anoop@iiit.ac.in

More information

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S Radha Krishna Rambola, Associate Professor, NMIMS University, India Akash Agrawal, Student at NMIMS University, India ABSTRACT Due to the

More information

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK

AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK AN APPROACH OF SEMIAUTOMATED ROAD EXTRACTION FROM AERIAL IMAGE BASED ON TEMPLATE MATCHING AND NEURAL NETWORK Xiangyun HU, Zuxun ZHANG, Jianqing ZHANG Wuhan Technique University of Surveying and Mapping,

More information

Color based segmentation using clustering techniques

Color based segmentation using clustering techniques Color based segmentation using clustering techniques 1 Deepali Jain, 2 Shivangi Chaudhary 1 Communication Engineering, 1 Galgotias University, Greater Noida, India Abstract - Segmentation of an image defines

More information

IDIAP IDIAP. Martigny ffl Valais ffl Suisse

IDIAP IDIAP. Martigny ffl Valais ffl Suisse R E S E A R C H R E P O R T IDIAP IDIAP Martigny - Valais - Suisse ASYMMETRIC FILTER FOR TEXT RECOGNITION IN VIDEO Datong Chen, Kim Shearer IDIAP Case Postale 592 Martigny Switzerland IDIAP RR 00-37 Nov.

More information

Time Stamp Detection and Recognition in Video Frames

Time Stamp Detection and Recognition in Video Frames Time Stamp Detection and Recognition in Video Frames Nongluk Covavisaruch and Chetsada Saengpanit Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand E-mail: nongluk.c@chula.ac.th

More information

Conspicuous Character Patterns

Conspicuous Character Patterns Conspicuous Character Patterns Seiichi Uchida Kyushu Univ., Japan Ryoji Hattori Masakazu Iwamura Kyushu Univ., Japan Osaka Pref. Univ., Japan Koichi Kise Osaka Pref. Univ., Japan Shinichiro Omachi Tohoku

More information

CS4733 Class Notes, Computer Vision

CS4733 Class Notes, Computer Vision CS4733 Class Notes, Computer Vision Sources for online computer vision tutorials and demos - http://www.dai.ed.ac.uk/hipr and Computer Vision resources online - http://www.dai.ed.ac.uk/cvonline Vision

More information

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 11, November 2014,

More information

Character Recognition

Character Recognition Character Recognition 5.1 INTRODUCTION Recognition is one of the important steps in image processing. There are different methods such as Histogram method, Hough transformation, Neural computing approaches

More information

Algorithm Optimization for the Edge Extraction of Thangka Images

Algorithm Optimization for the Edge Extraction of Thangka Images 017 nd International Conference on Applied Mechanics and Mechatronics Engineering (AMME 017) ISBN: 978-1-60595-51-6 Algorithm Optimization for the Edge Extraction of Thangka Images Xiao-jing LIU 1,*, Jian-bang

More information

GABOR FILTER AND NEURAL NET APPROACH FOR BUILT UP AREA MAPPING IN HIGH RESOLUTION SATELLITE IMAGES

GABOR FILTER AND NEURAL NET APPROACH FOR BUILT UP AREA MAPPING IN HIGH RESOLUTION SATELLITE IMAGES GABOR FILTER AND NEURAL NET APPROACH FOR BUILT UP AREA MAPPING IN HIGH RESOLUTION SATELLITE IMAGES Sangeeta Khare a, *, Savitha D. K. b, S.C. Jain a a DEAL, Raipur Road, Dehradun -48001 India. b CAIR,

More information

Texture Based Image Segmentation and analysis of medical image

Texture Based Image Segmentation and analysis of medical image Texture Based Image Segmentation and analysis of medical image 1. The Image Segmentation Problem Dealing with information extracted from a natural image, a medical scan, satellite data or a frame in a

More information

Automatic Detection of Change in Address Blocks for Reply Forms Processing

Automatic Detection of Change in Address Blocks for Reply Forms Processing Automatic Detection of Change in Address Blocks for Reply Forms Processing K R Karthick, S Marshall and A J Gray Abstract In this paper, an automatic method to detect the presence of on-line erasures/scribbles/corrections/over-writing

More information

Image Segmentation Techniques for Object-Based Coding

Image Segmentation Techniques for Object-Based Coding Image Techniques for Object-Based Coding Junaid Ahmed, Joseph Bosworth, and Scott T. Acton The Oklahoma Imaging Laboratory School of Electrical and Computer Engineering Oklahoma State University {ajunaid,bosworj,sacton}@okstate.edu

More information

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker Image Segmentation Srikumar Ramalingam School of Computing University of Utah Slides borrowed from Ross Whitaker Segmentation Semantic Segmentation Indoor layout estimation What is Segmentation? Partitioning

More information

Classification of Printed Chinese Characters by Using Neural Network

Classification of Printed Chinese Characters by Using Neural Network Classification of Printed Chinese Characters by Using Neural Network ATTAULLAH KHAWAJA Ph.D. Student, Department of Electronics engineering, Beijing Institute of Technology, 100081 Beijing, P.R.CHINA ABDUL

More information

Spam Filtering Using Visual Features

Spam Filtering Using Visual Features Spam Filtering Using Visual Features Sirnam Swetha Computer Science Engineering sirnam.swetha@research.iiit.ac.in Sharvani Chandu Electronics and Communication Engineering sharvani.chandu@students.iiit.ac.in

More information