Statistical Shape Features for Content-Based Image Retrieval

Size: px
Start display at page:

Download "Statistical Shape Features for Content-Based Image Retrieval"

Transcription

1 Journal of Mathematical Imaging and Vision 17: , 2002 c 2002 Kluwer Academic Publishers. Manufactured in The Netherlands. Statistical Shape Features for Content-Based Image Retrieval SAMI BRANDT Laboratory of Computational Engineering, Helsinki University of Technology, P.O. BOX 9203, FIN HUT, Finland Sami.Brandt@hut.fi JORMA LAAKSONEN AND ERKKI OJA Laboratory of Computer and Information Science, Helsinki University of Technology, P.O. BOX 5400, FIN HUT, Finland Jorma.Laaksonen@hut.fi Erkki.Oja@hut.fi Abstract. In this article the use of statistical, low-level shape features in content-based image retrieval is studied. The emphasis is on such techniques which do not demand object segmentation. PicSOM, the image retrieval system used in the experiments, requires that features are represented by constant-sized feature vectors for which the Euclidean distance can be used as a similarity measure. The shape features suggested here are edge histograms and Fourier-transform-based features computed from the image after edge detection in Cartesian or polar coordinate planes. The results show that both local and global shape features are important clues of shapes in an image. Keywords: feature extraction, content-based image retrieval, statistical shape description, relevance feedback 1. Introduction Image content can be described by visual features classified according to the properties they describe. Such classes are, for instance according to MPEG-7 International Standard [17], color, texture, and shape. Color and texture contain important information but, for instance, two images with similar color histograms can represent very different objects. Therefore the use of shape-describing features is essential in an efficient content-based image retrieval system. Although shape description has been intensively researched, there exists no direct answer which kind of feasible shape features should be incorporated into such a system. Content-based image retrieval (CBIR) has been an active research area since the early 1990 s [4, 15]. This work was supported by the Finnish Centre of Excellence Programme ( ) of the Academy of Finland, project New information processing principles, Many image retrieval systems, both commercial and research, have been built. The best known are Query By Image Content (QBIC) [5] and Photobook [18] and its new version FourEyes. Other well-known systems are the search engine family VisualSEEk, MetaSEEk and WebSEEk [1, 23, 24], NETRA [16], and Multimedia Analysis and Retrieval System (MARS) [7]. Used shape features and some other characteristics of some present systems are summarized in Table 1. Unfortunately there does not exist any standard benchmark that could be used to assess the performances of different CBIR systems. However, there are currently plans to establish such a benchmarking procedure [6]. A major problem in automatic feature extraction is segmentation. Even if it were known that there is a single object in the image, it is in general a non-trivial problem to locate it. On the other hand, when there are no specific objects the result of segmentation is probably an irrelevant part of the original image. For the

2 188 Brandt, Laaksonen and Oja Table 1. Summary of the shape features in some content-based image retrieval systems. KLT stands for Karhunen Loeve transform, FEM finite-element method, DFT discrete Fourier transform, and MFD modified Fourier discriptors. Appearance Shape QBIC photobook photobook NETRA MARS Feature type Attributes KLT FEM DFT MFD Feature dimension 22 Not specified Not specified 126 Not specified Similarity measure Euclidean Euclidean Strain energy Euclidean Several Segmentation Manual/Semi-autom. Semi-autom. Semi-autom. Autom. Semi-autom. Multiobj. segm. Yes No No Yes No Multiobj. query No No No No No use of a general database of images, such as the World Wide Web, it might then be reasonable to use some statistical shape features for the whole image instead. However, in the current literature these kinds of shape features are an exception [2]. Indeed, all the above systems having shape features rely on manual segmentation. Only NETRA has a fully automated segmentation algorithm, whose results indicate the extreme difficulty of the problem. Another central issue to be considered in selecting an appropriate shape description technique is whether some invariances are needed. Naturally many machine vision systems require invariance to spatial transformations like translation, rotation, and scaling, or to different illumination conditions such as lighting and contrast. In some cases these are not beneficial, however. In content-based image retrieval, this may occur when the user has a clear impression what the image looks like. For example, looking for a certain view in contrast to a single object the user possibly prefers a similarly rotated, translated, maybe even scaled image compared to the image the user has in mind. Therefore it may be more reasonable to speak of robustness of features rather than invariances. 2. Content-Based Image Retrieval and PicSOM System The shape features described in this paper have been used in the PicSOM content-based image retrieval system [3, 12, 13]. The system uses Self-Organizing Maps [9] in implementing relevance feedback for storing user responses and selecting images. A genuine characteristic of PicSOM is its ability to automatically adapt to the user s perception of similarity of images based on their low-level visual content even though humans perceive image similarity on abstract semantic level. An on-line demonstration of PicSOM and comprehensive documentation of it can be found at Self-Organizing Map The Self-Organizing Map (SOM) [9] is an unsupervised neural algorithm that maps a vector set from a high-dimensional input space to a typically twodimensional discrete output space which is a grid of artificial neurons or units. The SOM algorithm can be seen as a special form of vector quantization in which the topological ordering of the vectors is retained as well as possible. Therefore, adjacent input vectors tend to be mapped in neighboring SOM units and the mutual similarity of vectors is reflected in their proximity on the map grid. A model vector whose dimensionality equals the dimensionality of the input space is stored in each SOM unit. These model vectors can be initialized at random or according to the mean and principal directions of the training data. A central concept of SOM is the best-matching unit (BMU) of each vector in the vector set used in training the SOM. When the map is being trained, the training vectors are used consecutively, one by one as an input vector to the SOM. The BMU of the input vector is found as the map unit which minimizes the distance between the vector and the model vector of the unit. When training the SOM, the model vectors of the BMU and its topological neighbors on the map are modified to match the input vector even better. As a result, the distribution of the model vectors resembles the distribution of the training set and the neighboring model vectors resemble each other. The training of a SOM consists of the following two steps for each of the training vectors x(t). First, the

3 Statistical Shape Features for Content-Based Image Retrieval 189 BMU model vector m c(x) (t) is located: i: x(t) m c(x) (t) x(t) m i (t). (1) The usual distance metric used here is the Euclidean one. After finding the BMU, a subset of the model vectors constituting a neighborhood centered around node c(x) are updated as m i (t + 1) = m i (t) + h(t; c(x), i)(x(t) m i (t)). (2) Here h(t; c(x), i) is the neighborhood function, a decreasing function of the distance between the ith and c(x)th nodes on the map grid. A special form of the SOM, known as the Tree Structured Self-Organizing Map (TS-SOM) [10, 11] has been used in the PicSOM system. It differs from the original SOM in that a TS-SOM consists of a stack of SOM layers. The BMUs are first searched for on the topmost layer and the search is then continued on the next layer in a restricted area centered below the BMU on the above map. This makes the BMU search much faster, otherwise the properties of SOM and TS- SOM are similar. After one TS-SOM layer has been trained, the values of the model vectors on it are frozen and the next layer below it is initialized by interpolating and extrapolating from that layer s model vectors. We have used TS-SOMs where there are three layers: the topmost layer contains 4 4 = 16 units, the middle one = 256 units and the bottommost = 4096 units. An essential property of the PicSOM system is that a separate TS-SOM is trained for each feature type, typically color, texture and shape. The TS-SOMs have been trained by iterating all feature vectors extracted from the images in the database one hundred times. After the SOM training, each map unit on each of the maps is given a visual label by selecting among those images for which that unit is the BMU the one whose feature vecture is closest to the unit s model vector. Figure 1 illustrates a TS-SOM layer surface. It can be seen that visually similar images with respect to their spatial edge frequency and orientation are mapped in nearby SOM units Operation of the PicSOM System In the relevance feedback [20] approach of contentbased image retrieval, the system displays the user sets of images. The first set is selected at random or so that the various types of images included in the whole image database are represented in it as well as possible. When the user has selected relevant ones among the shown images, PicSOM scores the corresponding BMUs on all the feature maps with a positive value inversely proportional to the number of relevant images. In an analogous manner, BMUs of irrelevant images are scored with negative values. The score values are marked as points on the locations of the BMUs on the map surfaces. These surfaces are then low-pass filtered. This process spreads the positive and negative values to the neighboring map units. As a result, map areas that contain many relevant images close together will have larger score values than areas where the relevant images are sparsely distributed, or where the relevant and irrelevant images cancel each other out. When choosing images for the next query round, the PicSOM system selects a list of preset number, say 100, of the best-scoring, yet unseen images from each TS-SOM layer of each feature type. Next, if some images appear in more than one list, the respective score values are summed up to obtain the final scores. Finally, the images with the largest score values are shown to the user who may again select the relevant ones and continue the iteration. Because images similar in the sense of a particular feature are clustered together on the corresponding TS-SOM map, the low-pass filtering of the relevance values leads to automatic adaptation to the user s conception of image similarity and relevance. The mutual weighting of different feature types is also performed simultaneously, as features that map relevant images in tight clusters are automatically given more weight than the others [14]. 3. Statistical Shape Features The experiments study a few kinds of statistical features which do not require segmentation but are computed from the shape properties of the whole image. The question whether the invariance to the affine transformations is beneficial or not is also addressed Histogram of Edge Directions The first experiments on shape features were made by using a histogram of edge directions. The edge histogram is translation invariant and it captures the general shape information in the image. Because the

4 190 Brandt, Laaksonen and Oja Figure 1. The middle level map of the FFT128 TS-SOM where nearest images to the neurons are shown. The map ordering has a clear interpretation, see text. feature relies on local statistic of the image, it is robust to partial occlusion and structure deformations in the image. The major disadvantage is that two perceptually very different images may have similar edge histograms. The edge histogram is computed as follows. At first, the color image is transformed to the HSI space from which the hue channel is neglected. The other two channels are convolved with the eight Sobel masks [25]. Each pixel is given the maximum of the responses and the corresponding 8-quantized direction. The gradient is thresholded next. The threshold values are manually fixed to 15% of the maximum gradient value on the intensity channel and to 35% on the saturation channel. The thresholded intensity and saturation gradient images are combined by the logical OR operation (see Fig. 2). In the OR operation, if the gradient directions differ between the saturation and intensity, the originally stronger gradient direction is chosen. A gray level image I(x, y) is hence transformed as I(x, y) {I e (x, y), I d (x, y)} (3) where I e (x, y) {0, 1} represents the binary edge image, and I d (x, y) {0,...,7} the direction image. Finally the 8-bin edge histograms with bins corresponding to the quantized eight directions, are calculated by counting the edge pixels in each direction. Our experiment showed (see [2] for a full report) that it is better to normalize the histograms by the number of pixels in each image rather than by the number of edge pixels as was done in [8]. For the histogram, we thus

5 Statistical Shape Features for Content-Based Image Retrieval 191 Figure 2. operation. The original image, thresholded magnitudes of its saturation and intensity gradients, and the gradient image combined by the OR have H(i) = 1 #{x {0,...,N 1} NM y {0,...,M 1} I d (x, y) = i} (4) The effect of smoothing proposed in [8] was also studied. The smoothing should make the histograms more robust to rotation. It is performed as H s (i) = i+k l=i k H(l mod 8), (5) 2k + 1 where the parameter k determines the degree of smoothing Co-Occurrence Matrix of Edge Directions The edge histogram can yet be generalized. By taking every neighboring edge pixel pair and enumerating them based on their directions a two-dimensional histogram or co-occurrence matrix is obtained. The resulting histogram, represented by 64 bins, is normalized by the number of pixels in the image and is defined as H co (i, j) = 1 #{x {0,...,N 1} y {0,...,M 1} NM (ˆx, ŷ) U(x, y) I d (x, y) = i I d (ˆx, ŷ) = j} (6) where U(x, y) is the causal neighborhood set of the pixel (x, y). The causal neighborhood is defined as the 8-neighbors of a pixel in the west, northwest, north, and northeast directions. H co (i, j) hence indicates the number of neighboring edge pixel pairs which are positioned in the directions i and j Fourier Features The edge image contains the most relevant shape information and the discrete Fourier transform can be used to describe it. Before forming the edge image, here by using the Sobel masks and neglecting the gradient direction information (see Fig. 2), the image area is normalized with zero padding to a maximum size of such that the aspect ratio is maintained. The normalization is performed to get comparable edge responses regardless of the original sizes of the images. If both dimensions of the original image are larger than 512, the image is decimated with filtering. Otherwise the normalization is made by bicubic interpolation. After edge detection, the Fourier transform is computed for the normalized image using the FFT algorithm. The magnitude image of the Fourier spectrum is first low-pass filtered and thereafter decimated by the factor of 32. The resulting number of dimensions

6 192 Brandt, Laaksonen and Oja in the feature vectors is 128. The final reduction is made after the edge detection and FFT because the resolution of the edge detection procedure would not be sufficient to extract relevant edges from a decimated image Polar and Log-Polar Fourier Features The Fourier features described above are translation invariant but not rotation invariant. Our method, which is named as polar Fourier features, is rotation invariant with respect to the center of the image but not invariant to translation and scale. At first the image is normalized and the edge image is obtained similarly as with the Fourier features. The binary edge image I e (x, y) is then transformed to the polar coordinates by the formula I(ρ,θ) = I 1 (ρ,θ) I 2 (ρ,θ), (7),..., , me- where ρ = 0, ,...,,θ= 0, ans the binary OR operation, and 512 I 1 (ρ,θ) = I e ( Rρ cos 2πθ +c x, Rρ sin 2πθ +c y ) (8) 1 if x, y {0, 1,...,511} such that I e (x, y) = 1, ρ = (x cx ) 512 R 2 + (y c y ) 2, I 2 (ρ,θ) = and θ = π (x c x, y c y ), 0 otherwise, (9) where (c x, c y ) = (255.5, 255.5) are the coordinates of the center of the image, R = cx 2 + c2 y, and gives rounding to the nearest integer. The above procedure prevents the formation of gaps between the edge pixels in the polar coordinate system. For the polar image the Fourier transform and decimation are performed similarly as with the Fourier features and a 128-dimensional feature vector is obtained. The method is invariant to translation in the polar plane that implies the rotation invariance with respect to the center of the image. Even more invariances could be obtained by a slight modification to the feature. Log-polar [22] Fourier features are invariant to affine transformations i.e., to translation, rotation and scaling. These can be obtained by replacing R and ρ in (8) and (9) with ln R and e ρ, respectively. Translation invariance is obtained by setting the centroid (c x, c y ) to the center of mass of the binary edge image. Rotation invariance is obtained by using the magnitude spectrum of the log-polar transform as the rotations affect only the phase of the spectrum. Accordingly, invariance to scale is obtained by taking logarithm of the radius in the polar coordinate plane. All the Fourier-based features presented here are sensitive to occlusion: the direct use of the Fourier transform may lead to very different magnitude spectra for occluded images. In addition, if some parts of the central object of an image are missing, the calculation of the centroid will go wrong and significantly differing log-polar images will result. 4. Evaluation Methods The presented shape features are evaluated with methods from [12]. Let N be the total number of images in the image database D. Let C D be a class of similar images determined by some verbal criteria. In order to measure the clustering of the features we define instantaneous precision P C (n) as the probability that an image I j C has as the nth closest neighbor an image which also belongs to the class C. For each image I j C the distance to every other image is calculated and the images are sorted in the order of ascending distance. For each image I j we thus define a sequence h j (n) as { 1 if the nth closest image belongs to C, h j (n) = 0 otherwise, (10) where n = 1,...,N 1. The instantaneous precision P C (n) for the class C having N C images is then defined as P C (n) = 1 N C j I j C h j (n), n = 1,...,N 1. (11) In the optimal case P C (n) = 1ifn N C 1 and 0 otherwise. On the other hand the worst-case performance

7 Statistical Shape Features for Content-Based Image Retrieval 193 results when P C (n) equals the a priori probability of class C for all n Local Performance Good features should have high instantaneous precision for the very first indices. Therefore a local performance measure of the precision-type is defined as the average of the instantaneous precision for the first 1% of indices, i.e. η local (C) = N n=1 P C (n). (12) N The local performance measure has the ability to describe if the feature space is clustered such that there is a high probability that any image of the class C has an image of the same class near it in the feature space. Note that η local is dependent on the a priori probability of the class Global Performance The performance measure presented above gives a high performance value even if the images of the class C are clustered to many small clusters all over the feature space. This suggests using as an alternative an appropriate weighting function w(n) which would take global clustering into consideration. The instantaneous precision function can be converted to a scalar by using the sum expression η global (C) = N 1 n=1 P C (n) w(n). (13) The weighting function w(n) should be such that it rewards large P C (n) in small indices and punishes large values in large indices. For this reason the weighting function was chosen to be w(n) = (n 1)π cos N 2 NC 2 l=0 cos lπ N 2, n = 1,...,N 1. (14) Then η global approaches one for the optimal instantaneous precision case and zero for the a priori case. In addition, η global is ideally independent of the a priori probability. When the weighting function is chosen this way it measures how the feature vectors of a class tend to form one global cluster in the feature space Performance in the PicSOM System Finally it is tested how the features perform in the Pic- SOM system. Although this experiment reveals the performance of PicSOM itself, it also measures well how the particular shape features are suitable to the very system. The experiment is performed such that a computer program simulates the query process of PicSOM over a series of subsequent query rounds. For each image I j C it is measured how many images must be shown until the image I j is found from the database. The program emulates the user s relevance feedback by selecting all the shown images which belong to the class C. The resulting merit of indexing is thus defined as τ(c) = 1 N shown (I j ), (15) NN C I j C where N shown (I j ) is the rank of image I j in the output of the system. Random picking would theoretically result to a τ-value of 0.5 and, in general, the closer to zero τ is the better the obtained retrieval performance. It must be noted that the τ(c)-measure is an average for the whole class C of images. Also, it can be calculated in a single experiment in which the system ranks all images of the database based on relevance feedback given to the system according to images membership in the class C. From the output of such an experiment the ranks of all images of the class are readily available. 5. Experiments As a test database a set of 4350 miscellaneous bitmap images was used. The images were downloaded from ftp://ftp.sunet.se/pub/pictures. The ground truth classes were manually picked from the database. The classes used in the experiments were aircraft, buildings, and faces, of which the database contains 348, 492, and 361 images, respectively. The formation of the global performance measure η global is shown in Fig. 3. The results are summarized in Table 2 where the best scores in each column have been bolded. In forming the curves, the instantaneous precision values were calculated for the whole ground truth classes, whereas the results in Table 2 were obtained by averaging the figures of merit which were obtained by dividing the classes into ten subsets. The performances of the original color and texture features used in Pic- SOM are also presented to allow comparison between them and the shape features.

8 194 Brandt, Laaksonen and Oja Figure 3. Evolution of weighted, cumulative instantaneous precision. The global merit of indexing η global is defined as the y-value of the endpoint of the curve. In comparing the shape features the results showed that there is no single feature which would perform best for every image class used in the experiments. Of the local features, which are robust to occlusion and to any local disturbance in the image, both the histogram and the co-occurrence matrix of edge directions gave good results. It can also be seen that the smoothing of the edge histogram with k = 1 provided no advantage (EdgeHist vs. EdgeHistSm). From the global Fourier-transformbased features the magnitude spectrum of an edge image was found to work at least as well as the local shape features with the test database. In addition, the results of the experiments on the three different Fourier-based features support the assumption that with a general database the presence of all the three invariances is not beneficial. It is now interesting to see how the best candidate features so far perform in the PicSOM system. In the light of the τ-test, as Table 3 indicates, for the single feature maps FFT128 seems to be the best for all the classes although the difference to the second-best Co-occurrence is not very distinct. It is also interesting to note that the difference between EdgeHist and Cooccurrence seems to be larger than the previous results would suggest. On the other hand, now EdgeHist performs better than Co-occurrence for the aircraft class. This test also verifies that the best shape features presented in this chapter are superior to the used color

9 Statistical Shape Features for Content-Based Image Retrieval 195 Table 2. Performance of different features. Larger values indicate better performance. Feature Aircraft Buildings Faces η local, 0 η local 1 Color ± ± ± Texture ± ± ± EdgeHist 0.04 ± ± ± EdgeHistSm 0.04 ± ± ± Co-occurrence 0.06 ± ± ± FFT ± ± ± PolarFFT ± ± ± LogPolarFFT ± ± ± η global, 0 η global 1 Color 0.2 ± ± ± 0.05 Texture 0.12 ± ± ± 0.07 EdgeHist 0.43 ± ± ± 0.1 EdgeHistSm 0.40 ± ± ± 0.1 Co-occurrence 0.48 ± ± ± 0.1 FFT ± ± ± 0.2 PolarFFT 0.3 ± ± ± 0.1 LogPolarFFT 0.14 ± ± ± 0.09 Table 3. Performance of the features in the PicSOM system: The results of the τ-test when only one feature map is used in the query. The τ-value indicates how large proportion of the database must be shown on the average until a specific image is returned from the query. Feature Aircraft Buildings Faces Average Color Texture EdgeHist Co-occurrence FFT PolarFFT and texture features. The best performance occurs with aircraft images and FFT128 features when a desired image is seen when on the average 20% of the images in the database have been shown. Table 4 shows how PicSOM performs when different combinations of the feature maps are used. It may be seen that the combination of the color and texture maps is advantageous because the τ-values are reduced in all cases except for the aircraft class with the color feature. The τ-values for the combination of the color and texture maps also provide reference values for the original PicSOM system, with which the system with Table 4. Results of the τ-test when multiple feature maps are used. This test shows how the addition of the shape features affects the performance of PicSOM compared to that with only the color and texture features. Features Aircraft Buildings Faces Average Color, Texture Color, Texture, EdgeHist Color, Texture, Co-occurrence Color, Texture, FFT Color, Texture, PolarFFT Color, Texture, Co-occurrence, FFT128 the shape features is about to be compared. It may be seen that the addition of any of the four shape feature types increases the performance. The best results are obtained by incorporating FFT128 alone or together with Co-occurrence into the system. The results are slightly better than those which were obtained by using single feature maps. From the average probabilities of Table 4 it may be seen that the combination of the four features gives the best results indicating that the PicSOM system is capable of adapting to use the best-performing features. Finally, it is instructive to see how the images of the ground truth classes are clustered on the SOMs. In Fig. 4 the sized lowest levels of the TS- SOMs of the best candidate features are displayed together with the original average color feature of Pic- SOM. The maps of average color show that the aircraft and building classes overlap and all the three classes are widely spread on the SOMs. On the Cooccurrence and FFT128 maps, the classes are clustered tighter. FFT128 seems to separate the classes better, which thus supports the earlier results of the experiments. In Fig. 1, we have displayed the images associated with the units of sized level of the FFT128 map. The lower left hand corner consist of images with lots of small, high-frequency details the upper right hand corner represents the opposite with majoring low frequencies and lack of details. Faces and other circular shapes are accumulated in the middle on the top of the map, clear horizontal elongated shapes are notable in images on the lower right hand corner while the corresponding vertical trend is seen in the upper left hand corner. The map is also well consistent with class distributions in the bottom row of Fig. 4.

10 196 Brandt, Laaksonen and Oja Figure 4. Mappings of the ground truth classes with the color and two best shape features on the lowest level of the TS-SOMs. The class distributions have been low-pass filtered to ease the interpretation. 6. Conclusions In this work shape-describing features for general content-based image retrieval were studied. We formed various types of statistical feature vectors from the edges in non-segmented images. The best results were obtained with decimated magnitude spectrum of the edge image. Also local edge-histogram-based features, including the co-occurrence matrix of edge directions, gave good results. This indicates that both local and global information are important cues of the object shapes in an image. The experiments also suggest that with a database of miscellaneous images it might not be reasonable to require the features to be invariant to affine transformations. The results suggest that, in future, more sophisticated shape features of both local and global nature could give better results. From local, histogram-based features this could be achieved by e.g. Gabor or Gaussian derivate type features [19, 21]. From global features wavelet-transform-based features might be rewarding as it is possible to describe coarse overall shape added with details. Moreover, due to the different type of the features, a combination should work even more efficiently as learned by the experiments with the PicSOM content-based image retrieval system. References 1. M. Beigi, A. Benitez, and S.-F. Chang, MetaSEEk: A contentbased meta search engine for images, in Storage and Retrieval for Image and Video Databases, SPIE Proceedings Series, San Jose, CA, S. Brandt, Use of shape features in content-based image retrieval, Master s Thesis, Helsinki University of Technology, Available at picsom/publications.html. 3. S. Brandt, J. Laaksonen, and E. Oja, Statistical shape features in content-based image retrieval, in Proceedings of the 15th International Conference on Pattern Recognition (ICPR), Vol. 2, Barcelona, Spain, Sept. 2000, pp A. Del Bimbo, Visual Information Retrieval, Morgan Kaufmann Publishers: San Mateo, CA, M. Flickner, H. Sawhney, W. Niblack, et al. Query by image and video content: The QBIC system, IEEE Computer, Vol. 28, pp , N.J. Gunther and G. Beretta, A benchmark for image retrieval using distributed systems over the internet: BIRDS-I, Technical Report HPL , HP Labs, Available at HPL html.

11 Statistical Shape Features for Content-Based Image Retrieval T.S. Huang, S. Mehratra, and K. Ramchandran, Multimedia analysis and retrieval system (MARS) project, in Proceedings of the 33rd Annual Clinic on Library Application of Data Processing Digital Image Access and Retrieval, University of Illinois at Urbana-Champaign, March A.K. Jain and A. Vailaya, Image retrieval using color and shape, Pattern Recognition, Vol. 29, No. 8, pp , T. Kohonen, Self-Organizing Maps, 3rd edn. Vol. 30 of Springer Series in Information Sciences, Springer-Verlag: Berlin, P. Koikkalainen, Progress with the tree-structured selforganizing map, in 11th European Conference on Artificial Intelligence, European Committee for Artificial Intelligence (ECCAI), A.G. Cohn (Ed.), John Wiley & Sons: New York, Aug. 1994, pp P. Koikkalainen and E. Oja, Self-organizing hierarchical feature maps, in Proceedings of IJCNN-90, International Joint Conference on Neural Networks, San Diego, CA, 1990, pp M. Koskela, Content-based image retrieval with selforganizing maps, Master s Thesis, Helsinki University of Technology, Available at picsom/publications.html. 13. J.T. Laaksonen, J.M. Koskela, S.P. Laakso, and E. Oja, PicSOM Content-based image retrieval with self-organizing maps, Pattern Recognition Letters, Vol. 21, Nos. 13/14, pp , J. Laaksonen, M. Koskela, S. Laakso, and E. Oja, Selforganizing maps as a relevance feedback technique in contentbased image retrieval, Pattern Analysis & Applications, Vol. 4, Nos. 2/3, pp , M.S. Lew (Ed.), Principles of Visual Information Retrieval, Springer: Berlin, W.Y. Ma and B.S. Manjunath, NETRA: A toolbox for navigating large image databases, in Proceedings of IEEE International Conference on Image Processing, Vol. I, Santa Barbara, California, Oct. 1997, pp Overview of the MPEG-7 standard (version 5.0), March ISO/IEC JTC1/SC29/WG11 N A. Pentland, R.W. Picard, and S. Sclaroff, Photobook: Contentbased manipulation of image databases, International Journal of Computer Vision, Vol. 18, No. 3, pp , S. Ravela and R. Manmatha, On computing global similarity in images, in Proceedings of 4th IEEE Workshop on Applications of Computer Vision (WACV 98), Oct G. Salton and M.J. McGill, Introduction to Modern Information Retrieval, Computer Science Series, McGraw-Hill: New York, B. Schiele, Object recognition using multidimensional receptive field histograms, Ph.D. Thesis, Institut Polytechnique de Grenoble, France, English translation. 22. E.L. Schwartz, Spatial mapping in the primate sensory projection: Analytic structure and relevance to perception, Biological Cybernetics, Vol. 25, No. 4, pp , J.R. Smith and S.-F. Chang, Searching for images and videos on the World Wide Web, Technical Report , Columbia University CTR, Aug J.R. Smith and S.-F. Chang, VisualSEEk: A fully automated content-based image query system, in Proceedings of the ACM Multimedia 96, Nov M. Sonka, V. Hlavac, and R. Boyle, Image Processing, Analysis and Machine Vision, International Thomson Computer Press, Sami Brandt received the degree of Master of Science in Technology from the department of Engineering Physics and Mathematics in Helsinki University of Technology, Finland, in September His master s thesis was made on the use of shape features in content-based image retrieval at the Laboratory of Computer and Information Science. Currently, he is about to finish his doctoral thesis on the geometric branch of computer vision applied to electron tomography at the Graduate School in Electronics, Telecommunication, and Automation (GETA) and conducting research at the Laboratory of Computational Engineering. Additionally, he is a student member of the IEEE and IEEE Computer Society, member of the Pattern Recognition Society of Finland, and thereby a member of the International Association for Pattern Recognition (IAPR). Jorma Laaksonen received his Dr. of Science in Technology degree in 1997 from Helsinki University of Technology, Finland, where he is presently Senior Research Scientist at the Laboratory of Computer and Information Science. He is an author of several journal and conference papers on pattern recognition, statistical classification, and neural networks. His research interests are in content-based image retrieval and recognition of handwriting. Dr. Laaksonen is an IEEE member, a founding member of the SOM and LVQ Programming Teams and the PicSOM Development Group, and a member of the International Association of Pattern Recognition (IAPR) Technical Committee 3: Neural Networks and Machine Learning.

12 198 Brandt, Laaksonen and Oja Erkki Oja is Director of the Neural Networks Research Centre and Professor of Computer Science at the Laboratory of Computer and Information Science, Helsinki University of Technology, Finland. He received his Dr.Sc. degree in He has been research associate at Brown University, Providence, RI, and visiting Professor at Tokyo Institute of Technology. Dr. Oja is the author or coauthor of more than 240 articles and book chapters on pattern recognition, computer vision, and neural computing, as well as three books: Subspace Methods of Pattern Recognition (RSP and J. Wiley, 1983), which has been translated into Chinese and Japanese, Kohonen Maps (Elsevier, 1999), and Independent Component Analysis (J. Wiley, 2001). His research interests are in the study of principal components, independent components, self-organization, statistical pattern recognition, and applying artificial neural networks to computer vision and signal processing. Dr. Oja is member of the editorial boards of several journals and has been in the program committees of several recent conferences including ICANN, IJCNN, and ICONIP. He is member of the Finnish Academy of Sciences, Fellow of the IEEE, Founding Fellow of the International Association of Pattern Recognition (IAPR), and President of the European Neural Network Society (ENNS).

Content-Based Image Retrieval of Web Surface Defects with PicSOM

Content-Based Image Retrieval of Web Surface Defects with PicSOM Content-Based Image Retrieval of Web Surface Defects with PicSOM Rami Rautkorpi and Jukka Iivarinen Helsinki University of Technology Laboratory of Computer and Information Science P.O. Box 54, FIN-25

More information

Self-Organizing Maps of Web Link Information

Self-Organizing Maps of Web Link Information Self-Organizing Maps of Web Link Information Sami Laakso, Jorma Laaksonen, Markus Koskela, and Erkki Oja Laboratory of Computer and Information Science Helsinki University of Technology P.O. Box 5400,

More information

animals apes (1) NEW OR CONTINUED QUERY (2) CALLS (3) RETURNS THE CONTROL FILE (4) (7) RETURNS IMAGES BROWSER (6) REQUESTS IMAGES (5) HTML PAGE

animals apes (1) NEW OR CONTINUED QUERY (2) CALLS (3) RETURNS THE CONTROL FILE (4) (7) RETURNS IMAGES BROWSER (6) REQUESTS IMAGES (5) HTML PAGE published in the Proceedings of SCIA 99, Kangerlussuaq, Greenland, June 1999 available at PicSOM A Framework for Content-Based Image Database Retrieval

More information

By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

By choosing to view this document, you agree to all provisions of the copyright laws protecting it. Jussi Pakkanen and Jukka Iivarinen, A Novel Self Organizing Neural Network for Defect Image Classification. In Proceedings of the International Joint Conference on Neural Networks, pages 2553 2558, Budapest,

More information

Analyzing Low-Level Visual Features Using Content-Based Image Retrieval

Analyzing Low-Level Visual Features Using Content-Based Image Retrieval published in the Proceedings of ICONIP 2000, Taejon, Korea, November 2000 available at nalyzing Low-Level Visual Features Using Content-Based Image

More information

APPLYING TEXTURE AND COLOR FEATURES TO NATURAL IMAGE RETRIEVAL

APPLYING TEXTURE AND COLOR FEATURES TO NATURAL IMAGE RETRIEVAL APPLYING TEXTURE AND COLOR FEATURES TO NATURAL IMAGE RETRIEVAL Mari Partio, Esin Guldogan, Olcay Guldogan, and Moncef Gabbouj Institute of Signal Processing, Tampere University of Technology, P.O.BOX 553,

More information

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION International Journal of Research and Reviews in Applied Sciences And Engineering (IJRRASE) Vol 8. No.1 2016 Pp.58-62 gopalax Journals, Singapore available at : www.ijcns.com ISSN: 2231-0061 CONTENT BASED

More information

Textural Features for Image Database Retrieval

Textural Features for Image Database Retrieval Textural Features for Image Database Retrieval Selim Aksoy and Robert M. Haralick Intelligent Systems Laboratory Department of Electrical Engineering University of Washington Seattle, WA 98195-2500 {aksoy,haralick}@@isl.ee.washington.edu

More information

Texture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map

Texture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map Texture Classification by Combining Local Binary Pattern Features and a Self-Organizing Map Markus Turtinen, Topi Mäenpää, and Matti Pietikäinen Machine Vision Group, P.O.Box 4500, FIN-90014 University

More information

An efficiency comparison of two content-based image retrieval systems, GIFT and PicSOM

An efficiency comparison of two content-based image retrieval systems, GIFT and PicSOM An efficiency comparison of two content-based image retrieval systems, GIFT and Mika Rummukainen, Jorma Laaksonen, and Markus Koskela Laboratory of Computer and Information Science, Helsinki University

More information

Self-Organising Maps as a Relevance Feedback Technique in Content-Based Image Retrieval

Self-Organising Maps as a Relevance Feedback Technique in Content-Based Image Retrieval Pattern Analysis & Applications (2001)4:140 152 2001 Springer-Verlag London Limited Self-Organising Maps as a Relevance Feedback Technique in Content-Based Image Retrieval Jorma Laaksonen, Markus Koskela,

More information

Image Theft Detection with Self-Organising Maps

Image Theft Detection with Self-Organising Maps Image Theft Detection with Self-Organising Maps Philip Prentis, Mats Sjöberg, Markus Koskela, and Jorma Laaksonen Czech Technical University in Prague, Helsinki University of Technology prentphi@fjfi.cvut.cz,

More information

Evaluation of Features for Defect Image Retrieval

Evaluation of Features for Defect Image Retrieval Evaluation of Features for Defect Image Retrieval Jukka Iivarinen, Antti Ilvesmäki, and Jussi Pakkanen Laboratory of Computer and Information Science Helsinki University of Technology P.O. Box 5400, FIN-02015

More information

PicSOM Self-Organizing Image Retrieval With MPEG-7 Content Descriptors

PicSOM Self-Organizing Image Retrieval With MPEG-7 Content Descriptors IEEE TRANSACTIONS ON NEURAL NETWORKS, VOL. 13, NO. 4, JULY 2002 841 PicSOM Self-Organizing Image Retrieval With MPEG-7 Content Descriptors Jorma Laaksonen, Associate Member, IEEE, Markus Koskela, and Erkki

More information

DEFECT IMAGE CLASSIFICATION WITH MPEG-7 DESCRIPTORS. Antti Ilvesmäki and Jukka Iivarinen

DEFECT IMAGE CLASSIFICATION WITH MPEG-7 DESCRIPTORS. Antti Ilvesmäki and Jukka Iivarinen DEFECT IMAGE CLASSIFICATION WITH MPEG-7 DESCRIPTORS Antti Ilvesmäki and Jukka Iivarinen Helsinki University of Technology Laboratory of Computer and Information Science P.O. Box 5400, FIN-02015 HUT, Finland

More information

Time Series Prediction as a Problem of Missing Values: Application to ESTSP2007 and NN3 Competition Benchmarks

Time Series Prediction as a Problem of Missing Values: Application to ESTSP2007 and NN3 Competition Benchmarks Series Prediction as a Problem of Missing Values: Application to ESTSP7 and NN3 Competition Benchmarks Antti Sorjamaa and Amaury Lendasse Abstract In this paper, time series prediction is considered as

More information

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION 6.1 INTRODUCTION Fuzzy logic based computational techniques are becoming increasingly important in the medical image analysis arena. The significant

More information

Image Mining Using Image Feature

Image Mining Using Image Feature Image Mining Using Image Feature Varsha Kundlikar 1, Meghana Nagori 2. Abstract In this paper, features of image used for mining images from database. Basic features of images such as colour of pixel in

More information

Content Based Image Retrieval: Survey and Comparison between RGB and HSV model

Content Based Image Retrieval: Survey and Comparison between RGB and HSV model Content Based Image Retrieval: Survey and Comparison between RGB and HSV model Simardeep Kaur 1 and Dr. Vijay Kumar Banga 2 AMRITSAR COLLEGE OF ENGG & TECHNOLOGY, Amritsar, India Abstract Content based

More information

2. LITERATURE REVIEW

2. LITERATURE REVIEW 2. LITERATURE REVIEW CBIR has come long way before 1990 and very little papers have been published at that time, however the number of papers published since 1997 is increasing. There are many CBIR algorithms

More information

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang NICTA & CSE UNSW COMP9314 Advanced Database S1 2007 jzhang@cse.unsw.edu.au Reference Papers and Resources Papers: Colour spaces-perceptual, historical

More information

A Miniature-Based Image Retrieval System

A Miniature-Based Image Retrieval System A Miniature-Based Image Retrieval System Md. Saiful Islam 1 and Md. Haider Ali 2 Institute of Information Technology 1, Dept. of Computer Science and Engineering 2, University of Dhaka 1, 2, Dhaka-1000,

More information

Identifying Layout Classes for Mathematical Symbols Using Layout Context

Identifying Layout Classes for Mathematical Symbols Using Layout Context Rochester Institute of Technology RIT Scholar Works Articles 2009 Identifying Layout Classes for Mathematical Symbols Using Layout Context Ling Ouyang Rochester Institute of Technology Richard Zanibbi

More information

Analysis of Semantic Information Available in an Image Collection Augmented with Auxiliary Data

Analysis of Semantic Information Available in an Image Collection Augmented with Auxiliary Data Analysis of Semantic Information Available in an Image Collection Augmented with Auxiliary Data Mats Sjöberg, Ville Viitaniemi, Jorma Laaksonen, and Timo Honkela Adaptive Informatics Research Centre, Helsinki

More information

An Introduction to Content Based Image Retrieval

An Introduction to Content Based Image Retrieval CHAPTER -1 An Introduction to Content Based Image Retrieval 1.1 Introduction With the advancement in internet and multimedia technologies, a huge amount of multimedia data in the form of audio, video and

More information

A Topography-Preserving Latent Variable Model with Learning Metrics

A Topography-Preserving Latent Variable Model with Learning Metrics A Topography-Preserving Latent Variable Model with Learning Metrics Samuel Kaski and Janne Sinkkonen Helsinki University of Technology Neural Networks Research Centre P.O. Box 5400, FIN-02015 HUT, Finland

More information

Holistic Correlation of Color Models, Color Features and Distance Metrics on Content-Based Image Retrieval

Holistic Correlation of Color Models, Color Features and Distance Metrics on Content-Based Image Retrieval Holistic Correlation of Color Models, Color Features and Distance Metrics on Content-Based Image Retrieval Swapnil Saurav 1, Prajakta Belsare 2, Siddhartha Sarkar 3 1Researcher, Abhidheya Labs and Knowledge

More information

Color Image Segmentation

Color Image Segmentation Color Image Segmentation Yining Deng, B. S. Manjunath and Hyundoo Shin* Department of Electrical and Computer Engineering University of California, Santa Barbara, CA 93106-9560 *Samsung Electronics Inc.

More information

COMPARISON OF SOME CONTENT-BASED IMAGE RETRIEVAL SYSTEMS WITH ROCK TEXTURE IMAGES

COMPARISON OF SOME CONTENT-BASED IMAGE RETRIEVAL SYSTEMS WITH ROCK TEXTURE IMAGES COMPARISON OF SOME CONTENT-BASED IMAGE RETRIEVAL SYSTEMS WITH ROCK TEXTURE IMAGES Leena Lepistö 1, Iivari Kunttu 1, Jorma Autio 2, and Ari Visa 1 1 Tampere University of Technology, Institute of Signal

More information

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi Journal of Asian Scientific Research, 013, 3(1):68-74 Journal of Asian Scientific Research journal homepage: http://aessweb.com/journal-detail.php?id=5003 FEATURES COMPOSTON FOR PROFCENT AND REAL TME RETREVAL

More information

Contextual priming for artificial visual perception

Contextual priming for artificial visual perception Contextual priming for artificial visual perception Hervé Guillaume 1, Nathalie Denquive 1, Philippe Tarroux 1,2 1 LIMSI-CNRS BP 133 F-91403 Orsay cedex France 2 ENS 45 rue d Ulm F-75230 Paris cedex 05

More information

Semantic Evaluation and Efficiency Comparison of the Edge Pixel Neighboring Histogram in Image retrieval

Semantic Evaluation and Efficiency Comparison of the Edge Pixel Neighboring Histogram in Image retrieval Semantic Evaluation and Efficiency Comparison of the Edge Pixel Neighboring Histogram in Image retrieval Abdolah Chalechale and Alfred Mertins School of Electrical, Computer and Telecommunications Engineering

More information

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS Toomas Kirt Supervisor: Leo Võhandu Tallinn Technical University Toomas.Kirt@mail.ee Abstract: Key words: For the visualisation

More information

Seismic regionalization based on an artificial neural network

Seismic regionalization based on an artificial neural network Seismic regionalization based on an artificial neural network *Jaime García-Pérez 1) and René Riaño 2) 1), 2) Instituto de Ingeniería, UNAM, CU, Coyoacán, México D.F., 014510, Mexico 1) jgap@pumas.ii.unam.mx

More information

Self-Organizing Maps for Analysis of Expandable Polystyrene Batch Process

Self-Organizing Maps for Analysis of Expandable Polystyrene Batch Process International Journal of Computers, Communications & Control Vol. II (2007), No. 2, pp. 143-148 Self-Organizing Maps for Analysis of Expandable Polystyrene Batch Process Mikko Heikkinen, Ville Nurminen,

More information

Content Based Image Retrieval Using Curvelet Transform

Content Based Image Retrieval Using Curvelet Transform Content Based Image Retrieval Using Curvelet Transform Ishrat Jahan Sumana, Md. Monirul Islam, Dengsheng Zhang and Guojun Lu Gippsland School of Information Technology, Monash University Churchill, Victoria

More information

CAP 5415 Computer Vision Fall 2012

CAP 5415 Computer Vision Fall 2012 CAP 5415 Computer Vision Fall 01 Dr. Mubarak Shah Univ. of Central Florida Office 47-F HEC Lecture-5 SIFT: David Lowe, UBC SIFT - Key Point Extraction Stands for scale invariant feature transform Patented

More information

Extraction of Color and Texture Features of an Image

Extraction of Color and Texture Features of an Image International Journal of Engineering Research ISSN: 2348-4039 & Management Technology July-2015 Volume 2, Issue-4 Email: editor@ijermt.org www.ijermt.org Extraction of Color and Texture Features of an

More information

Performance study of Gabor filters and Rotation Invariant Gabor filters

Performance study of Gabor filters and Rotation Invariant Gabor filters Performance study of Gabor filters and Rotation Invariant Gabor filters B. Ng, Guojun Lu, Dengsheng Zhang School of Computing and Information Technology University Churchill, Victoria, 3842, Australia

More information

Fuzzy Hamming Distance in a Content-Based Image Retrieval System

Fuzzy Hamming Distance in a Content-Based Image Retrieval System Fuzzy Hamming Distance in a Content-Based Image Retrieval System Mircea Ionescu Department of ECECS, University of Cincinnati, Cincinnati, OH 51-3, USA ionescmm@ececs.uc.edu Anca Ralescu Department of

More information

5.6 Self-organizing maps (SOM) [Book, Sect. 10.3]

5.6 Self-organizing maps (SOM) [Book, Sect. 10.3] Ch.5 Classification and Clustering 5.6 Self-organizing maps (SOM) [Book, Sect. 10.3] The self-organizing map (SOM) method, introduced by Kohonen (1982, 2001), approximates a dataset in multidimensional

More information

Selecting Models from Videos for Appearance-Based Face Recognition

Selecting Models from Videos for Appearance-Based Face Recognition Selecting Models from Videos for Appearance-Based Face Recognition Abdenour Hadid and Matti Pietikäinen Machine Vision Group Infotech Oulu and Department of Electrical and Information Engineering P.O.

More information

Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach Abstract Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in content-based

More information

FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM

FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM Neha 1, Tanvi Jain 2 1,2 Senior Research Fellow (SRF), SAM-C, Defence R & D Organization, (India) ABSTRACT Content Based Image Retrieval

More information

Texture Segmentation by Windowed Projection

Texture Segmentation by Windowed Projection Texture Segmentation by Windowed Projection 1, 2 Fan-Chen Tseng, 2 Ching-Chi Hsu, 2 Chiou-Shann Fuh 1 Department of Electronic Engineering National I-Lan Institute of Technology e-mail : fctseng@ccmail.ilantech.edu.tw

More information

Advanced visualization techniques for Self-Organizing Maps with graph-based methods

Advanced visualization techniques for Self-Organizing Maps with graph-based methods Advanced visualization techniques for Self-Organizing Maps with graph-based methods Georg Pölzlbauer 1, Andreas Rauber 1, and Michael Dittenbach 2 1 Department of Software Technology Vienna University

More information

Local Features: Detection, Description & Matching

Local Features: Detection, Description & Matching Local Features: Detection, Description & Matching Lecture 08 Computer Vision Material Citations Dr George Stockman Professor Emeritus, Michigan State University Dr David Lowe Professor, University of British

More information

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract

More information

An Accurate Method for Skew Determination in Document Images

An Accurate Method for Skew Determination in Document Images DICTA00: Digital Image Computing Techniques and Applications, 1 January 00, Melbourne, Australia. An Accurate Method for Skew Determination in Document Images S. Lowther, V. Chandran and S. Sridharan Research

More information

Two-step Modified SOM for Parallel Calculation

Two-step Modified SOM for Parallel Calculation Two-step Modified SOM for Parallel Calculation Two-step Modified SOM for Parallel Calculation Petr Gajdoš and Pavel Moravec Petr Gajdoš and Pavel Moravec Department of Computer Science, FEECS, VŠB Technical

More information

Tools for texture/color based search of images

Tools for texture/color based search of images pp 496-507, SPIE Int. Conf. 3106, Human Vision and Electronic Imaging II, Feb. 1997. Tools for texture/color based search of images W. Y. Ma, Yining Deng, and B. S. Manjunath Department of Electrical and

More information

Shape Retrieval with Flat Contour Segments

Shape Retrieval with Flat Contour Segments Shape Retrieval with Flat Contour Segments Dalong Li 1, Steven Simske Intelligent Enterprise Technologies Laboratory HP Laboratories Palo Alto HPL-2002-250 September 9 th, 2002* image database, image retrieval,

More information

Figure 1. Overview of a semantic-based classification-driven image retrieval framework. image comparison; and (3) Adaptive image retrieval captures us

Figure 1. Overview of a semantic-based classification-driven image retrieval framework. image comparison; and (3) Adaptive image retrieval captures us Semantic-based Biomedical Image Indexing and Retrieval Y. Liu a, N. A. Lazar b, and W. E. Rothfus c a The Robotics Institute Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA 15213, USA b Statistics

More information

Water-Filling: A Novel Way for Image Structural Feature Extraction

Water-Filling: A Novel Way for Image Structural Feature Extraction Water-Filling: A Novel Way for Image Structural Feature Extraction Xiang Sean Zhou Yong Rui Thomas S. Huang Beckman Institute for Advanced Science and Technology University of Illinois at Urbana Champaign,

More information

Autoregressive and Random Field Texture Models

Autoregressive and Random Field Texture Models 1 Autoregressive and Random Field Texture Models Wei-Ta Chu 2008/11/6 Random Field 2 Think of a textured image as a 2D array of random numbers. The pixel intensity at each location is a random variable.

More information

Artifacts and Textured Region Detection

Artifacts and Textured Region Detection Artifacts and Textured Region Detection 1 Vishal Bangard ECE 738 - Spring 2003 I. INTRODUCTION A lot of transformations, when applied to images, lead to the development of various artifacts in them. In

More information

HOW USEFUL ARE COLOUR INVARIANTS FOR IMAGE RETRIEVAL?

HOW USEFUL ARE COLOUR INVARIANTS FOR IMAGE RETRIEVAL? HOW USEFUL ARE COLOUR INVARIANTS FOR IMAGE RETRIEVAL? Gerald Schaefer School of Computing and Technology Nottingham Trent University Nottingham, U.K. Gerald.Schaefer@ntu.ac.uk Abstract Keywords: The images

More information

Color-Based Classification of Natural Rock Images Using Classifier Combinations

Color-Based Classification of Natural Rock Images Using Classifier Combinations Color-Based Classification of Natural Rock Images Using Classifier Combinations Leena Lepistö, Iivari Kunttu, and Ari Visa Tampere University of Technology, Institute of Signal Processing, P.O. Box 553,

More information

AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES

AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES 1 RIMA TRI WAHYUNINGRUM, 2 INDAH AGUSTIEN SIRADJUDDIN 1, 2 Department of Informatics Engineering, University of Trunojoyo Madura,

More information

CS 664 Segmentation. Daniel Huttenlocher

CS 664 Segmentation. Daniel Huttenlocher CS 664 Segmentation Daniel Huttenlocher Grouping Perceptual Organization Structural relationships between tokens Parallelism, symmetry, alignment Similarity of token properties Often strong psychophysical

More information

Web Image Retrieval Using Visual Dictionary

Web Image Retrieval Using Visual Dictionary Web Image Retrieval Using Visual Dictionary Umesh K K 1 and Suresha 2 1 Department of Information Science and Engineering, S J College of Engineering, Mysore, India. umeshkatte@gmail.com 2 Department of

More information

Edge and local feature detection - 2. Importance of edge detection in computer vision

Edge and local feature detection - 2. Importance of edge detection in computer vision Edge and local feature detection Gradient based edge detection Edge detection by function fitting Second derivative edge detectors Edge linking and the construction of the chain graph Edge and local feature

More information

Content Based Image Retrieval (CBIR) Using Segmentation Process

Content Based Image Retrieval (CBIR) Using Segmentation Process Content Based Image Retrieval (CBIR) Using Segmentation Process R.Gnanaraja 1, B. Jagadishkumar 2, S.T. Premkumar 3, B. Sunil kumar 4 1, 2, 3, 4 PG Scholar, Department of Computer Science and Engineering,

More information

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques Doaa M. Alebiary Department of computer Science, Faculty of computers and informatics Benha University

More information

Wavelet Based Image Retrieval Method

Wavelet Based Image Retrieval Method Wavelet Based Image Retrieval Method Kohei Arai Graduate School of Science and Engineering Saga University Saga City, Japan Cahya Rahmad Electronic Engineering Department The State Polytechnics of Malang,

More information

SOM+EOF for Finding Missing Values

SOM+EOF for Finding Missing Values SOM+EOF for Finding Missing Values Antti Sorjamaa 1, Paul Merlin 2, Bertrand Maillet 2 and Amaury Lendasse 1 1- Helsinki University of Technology - CIS P.O. Box 5400, 02015 HUT - Finland 2- Variances and

More information

Graph projection techniques for Self-Organizing Maps

Graph projection techniques for Self-Organizing Maps Graph projection techniques for Self-Organizing Maps Georg Pölzlbauer 1, Andreas Rauber 1, Michael Dittenbach 2 1- Vienna University of Technology - Department of Software Technology Favoritenstr. 9 11

More information

Towards the completion of assignment 1

Towards the completion of assignment 1 Towards the completion of assignment 1 What to do for calibration What to do for point matching What to do for tracking What to do for GUI COMPSCI 773 Feature Point Detection Why study feature point detection?

More information

Lecture 7: Most Common Edge Detectors

Lecture 7: Most Common Edge Detectors #1 Lecture 7: Most Common Edge Detectors Saad Bedros sbedros@umn.edu Edge Detection Goal: Identify sudden changes (discontinuities) in an image Intuitively, most semantic and shape information from the

More information

PATTERN RECOGNITION USING NEURAL NETWORKS

PATTERN RECOGNITION USING NEURAL NETWORKS PATTERN RECOGNITION USING NEURAL NETWORKS Santaji Ghorpade 1, Jayshree Ghorpade 2 and Shamla Mantri 3 1 Department of Information Technology Engineering, Pune University, India santaji_11jan@yahoo.co.in,

More information

6. NEURAL NETWORK BASED PATH PLANNING ALGORITHM 6.1 INTRODUCTION

6. NEURAL NETWORK BASED PATH PLANNING ALGORITHM 6.1 INTRODUCTION 6 NEURAL NETWORK BASED PATH PLANNING ALGORITHM 61 INTRODUCTION In previous chapters path planning algorithms such as trigonometry based path planning algorithm and direction based path planning algorithm

More information

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Improving the Efficiency of Fast Using Semantic Similarity Algorithm International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year

More information

Selection of Scale-Invariant Parts for Object Class Recognition

Selection of Scale-Invariant Parts for Object Class Recognition Selection of Scale-Invariant Parts for Object Class Recognition Gy. Dorkó and C. Schmid INRIA Rhône-Alpes, GRAVIR-CNRS 655, av. de l Europe, 3833 Montbonnot, France fdorko,schmidg@inrialpes.fr Abstract

More information

Classification Method for Colored Natural Textures Using Gabor Filtering

Classification Method for Colored Natural Textures Using Gabor Filtering Classiication Method or Colored Natural Textures Using Gabor Filtering Leena Lepistö 1, Iivari Kunttu 1, Jorma Autio 2, and Ari Visa 1, 1 Tampere University o Technology Institute o Signal Processing P.

More information

Lecture 6: Edge Detection

Lecture 6: Edge Detection #1 Lecture 6: Edge Detection Saad J Bedros sbedros@umn.edu Review From Last Lecture Options for Image Representation Introduced the concept of different representation or transformation Fourier Transform

More information

Robust Ring Detection In Phase Correlation Surfaces

Robust Ring Detection In Phase Correlation Surfaces Griffith Research Online https://research-repository.griffith.edu.au Robust Ring Detection In Phase Correlation Surfaces Author Gonzalez, Ruben Published 2013 Conference Title 2013 International Conference

More information

Key properties of local features

Key properties of local features Key properties of local features Locality, robust against occlusions Must be highly distinctive, a good feature should allow for correct object identification with low probability of mismatch Easy to etract

More information

HOUGH TRANSFORM CS 6350 C V

HOUGH TRANSFORM CS 6350 C V HOUGH TRANSFORM CS 6350 C V HOUGH TRANSFORM The problem: Given a set of points in 2-D, find if a sub-set of these points, fall on a LINE. Hough Transform One powerful global method for detecting edges

More information

Image Matching Using Run-Length Feature

Image Matching Using Run-Length Feature Image Matching Using Run-Length Feature Yung-Kuan Chan and Chin-Chen Chang Department of Computer Science and Information Engineering National Chung Cheng University, Chiayi, Taiwan, 621, R.O.C. E-mail:{chan,

More information

Image Enhancement Techniques for Fingerprint Identification

Image Enhancement Techniques for Fingerprint Identification March 2013 1 Image Enhancement Techniques for Fingerprint Identification Pankaj Deshmukh, Siraj Pathan, Riyaz Pathan Abstract The aim of this paper is to propose a new method in fingerprint enhancement

More information

Stability Assessment of Electric Power Systems using Growing Neural Gas and Self-Organizing Maps

Stability Assessment of Electric Power Systems using Growing Neural Gas and Self-Organizing Maps Stability Assessment of Electric Power Systems using Growing Gas and Self-Organizing Maps Christian Rehtanz, Carsten Leder University of Dortmund, 44221 Dortmund, Germany Abstract. Liberalized competitive

More information

Cartographic Selection Using Self-Organizing Maps

Cartographic Selection Using Self-Organizing Maps 1 Cartographic Selection Using Self-Organizing Maps Bin Jiang 1 and Lars Harrie 2 1 Division of Geomatics, Institutionen för Teknik University of Gävle, SE-801 76 Gävle, Sweden e-mail: bin.jiang@hig.se

More information

Fast Fuzzy Clustering of Infrared Images. 2. brfcm

Fast Fuzzy Clustering of Infrared Images. 2. brfcm Fast Fuzzy Clustering of Infrared Images Steven Eschrich, Jingwei Ke, Lawrence O. Hall and Dmitry B. Goldgof Department of Computer Science and Engineering, ENB 118 University of South Florida 4202 E.

More information

Short Survey on Static Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of

More information

Histograms of Oriented Gradients

Histograms of Oriented Gradients Histograms of Oriented Gradients Carlo Tomasi September 18, 2017 A useful question to ask of an image is whether it contains one or more instances of a certain object: a person, a face, a car, and so forth.

More information

Multiresolution Texture Analysis of Surface Reflection Images

Multiresolution Texture Analysis of Surface Reflection Images Multiresolution Texture Analysis of Surface Reflection Images Leena Lepistö, Iivari Kunttu, Jorma Autio, and Ari Visa Tampere University of Technology, Institute of Signal Processing P.O. Box 553, FIN-330

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

A Texture Feature Extraction Technique Using 2D-DFT and Hamming Distance

A Texture Feature Extraction Technique Using 2D-DFT and Hamming Distance A Texture Feature Extraction Technique Using 2D-DFT and Hamming Distance Author Tao, Yu, Muthukkumarasamy, Vallipuram, Verma, Brijesh, Blumenstein, Michael Published 2003 Conference Title Fifth International

More information

A Graph Theoretic Approach to Image Database Retrieval

A Graph Theoretic Approach to Image Database Retrieval A Graph Theoretic Approach to Image Database Retrieval Selim Aksoy and Robert M. Haralick Intelligent Systems Laboratory Department of Electrical Engineering University of Washington, Seattle, WA 98195-2500

More information

Chapter 4 - Image. Digital Libraries and Content Management

Chapter 4 - Image. Digital Libraries and Content Management Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 4 - Image Vector Graphics Raw data: set (!) of lines and polygons

More information

Query-Sensitive Similarity Measure for Content-Based Image Retrieval

Query-Sensitive Similarity Measure for Content-Based Image Retrieval Query-Sensitive Similarity Measure for Content-Based Image Retrieval Zhi-Hua Zhou Hong-Bin Dai National Laboratory for Novel Software Technology Nanjing University, Nanjing 2193, China {zhouzh, daihb}@lamda.nju.edu.cn

More information

Repeating Segment Detection in Songs using Audio Fingerprint Matching

Repeating Segment Detection in Songs using Audio Fingerprint Matching Repeating Segment Detection in Songs using Audio Fingerprint Matching Regunathan Radhakrishnan and Wenyu Jiang Dolby Laboratories Inc, San Francisco, USA E-mail: regu.r@dolby.com Institute for Infocomm

More information

On Adaptive Confidences for Critic-Driven Classifier Combining

On Adaptive Confidences for Critic-Driven Classifier Combining On Adaptive Confidences for Critic-Driven Classifier Combining Matti Aksela and Jorma Laaksonen Neural Networks Research Centre Laboratory of Computer and Information Science P.O.Box 5400, Fin-02015 HUT,

More information

The PicSOM Retrieval System: Description and Evaluations

The PicSOM Retrieval System: Description and Evaluations published in the Proceedings of CIR2000, Brighton, UK, Ma 2000 available at The PicSOM Retrieval Sstem: Description and Evaluations Markus Koskela,

More information

Effect of Grouping in Vector Recognition System Based on SOM

Effect of Grouping in Vector Recognition System Based on SOM Effect of Grouping in Vector Recognition System Based on SOM Masayoshi Ohta Graduate School of Science and Engineering Kansai University Osaka, Japan Email: k287636@kansai-u.ac.jp Yuto Kurosaki Department

More information

ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL

ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL ADAPTIVE TILE CODING METHODS FOR THE GENERALIZATION OF VALUE FUNCTIONS IN THE RL STATE SPACE A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY BHARAT SIGINAM IN

More information

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors Texture The most fundamental question is: How can we measure texture, i.e., how can we quantitatively distinguish between different textures? Of course it is not enough to look at the intensity of individual

More information

FRACTAL DIMENSION BASED TECHNIQUE FOR DATABASE IMAGE RETRIEVAL

FRACTAL DIMENSION BASED TECHNIQUE FOR DATABASE IMAGE RETRIEVAL FRACTAL DIMENSION BASED TECHNIQUE FOR DATABASE IMAGE RETRIEVAL Radu DOBRESCU*, Florin IONESCU** *POLITEHNICA University, Bucharest, Romania, radud@aii.pub.ro **Technische Hochschule Konstanz, fionescu@fh-konstanz.de

More information

Practical Image and Video Processing Using MATLAB

Practical Image and Video Processing Using MATLAB Practical Image and Video Processing Using MATLAB Chapter 18 Feature extraction and representation What will we learn? What is feature extraction and why is it a critical step in most computer vision and

More information

TRANSFORM FEATURES FOR TEXTURE CLASSIFICATION AND DISCRIMINATION IN LARGE IMAGE DATABASES

TRANSFORM FEATURES FOR TEXTURE CLASSIFICATION AND DISCRIMINATION IN LARGE IMAGE DATABASES TRANSFORM FEATURES FOR TEXTURE CLASSIFICATION AND DISCRIMINATION IN LARGE IMAGE DATABASES John R. Smith and Shih-Fu Chang Center for Telecommunications Research and Electrical Engineering Department Columbia

More information