ImageCLEF 2008: visual feature analysis in segmented images
|
|
- Aubrie Peters
- 6 years ago
- Views:
Transcription
1 ImageCLEF 2008: visual feature analysis in segmented images Bálint Daróczy Zsolt Fekete Mátyás Brendel Simon Rácz András Benczúr Dávid Siklósi Attila Pereszlényi Data Mining and Web search Research Group, Informatics Laboratory Computer and Automation Research Institute of the Hungarian Academy of Sciences {daroczyb, zsfekete, mbrendel, sracz, benczur, peresz, sdavid}@ilab.sztaki.hu Abstract. We describe our image processing system used in the Image- CLEF 2008 Photo Retrieval and Visual Concept Detection tasks. Our method consists of image segmentation followed by feature generation over the segments based on color, shape and texture. In the paper we elaborate on the importance of choices in the segmentation procedure with emphasis on edge detection. We also measure the relative importance of the visual features as well as the right choice of the distance function. Finally, given a very large number of parameters in our image processing system, we give a method for parameter optimization by measuring how well the similarity measures separate sample images of the same topic from those of dierent topics. 1 Introduction The ImageCLEF 2008 Photo Retrieval [1] and Visual Concept Detection tasks [2] both targeted towards image processing and visual feature generation over the IAPR TC-12 benchmark collection [3]. While the actual systems we used in the ImageCLEF 2008 campaign are described in the Working Notes [4, 5], in this paper we concentrate on the main lessons we have learned considering the strength of various visual processing elements in image categorization and similarity search. Our image processing system common to both tasks is based on image segmentation and then feature generation for the individual image segments, typically around 100 for each image in the corpus. The segmentation procedure consist of a novel combination of the FelzenszwalbHuttenlocher graph cut method [6] with smoothing over the Gaussian-Laplacian Pyramid [7]. We map all image segments into a roughly 400-dimensional space with features describing the color, shape and texture of the segment. While in the image categorization task we can learn the relative importance of the feature classes, the similarity search procedure used in our content based retrieval system is sensible to the weight. We make an excessive analysis of the feature weights as well as give a novel This work was supported by the EU FP7 project JUMAS Judicial Management by Digital Libraries Semantics and by grants OTKA NK and NKFP-07-A2 TEXTREND.
2 dimensions description 3 Mean HSV (or RGB) 60 RGB histogram, 20 bins each 30 Hue histogram 15 Saturation histogram 15 Value histogram 210 Zig-Zag Fourier amplitude (105) and absolute phase (105) low frequency components 1 Size 1 Aspect ratio 64 Shape: density in 8x8 regions Table 1. Description and number of visual features used to characterize a single image segment. method to learn these weights based solely on the sample images of the photo retrieval topics. We briey describe our text IR system; for more details we refer to [4]. We use the Hungarian Academy of Sciences search engine as our information retrieval system that is based on Okapi BM25 with the proximity of query terms taken into account. We used the original automatic query expansion formula of [8] that, in our implementation, turned out to give minor improvement only. While we show results with and without query expansion, the improvement is minor and hence we omit detailed description and analysis from this report. We also omit details on cluster recall as clusters were typically organized based on the location of the photograph and, in our opinion, image processing could not assist in identifying images of the same cluster. 2 The Image Processing System In our system we segment images by a novel combination of the graph based image segmentation method of Felzenszwalb and Huttenlocher [6] with the Gaussian- Laplacian Pyramid. While the pyramid is used with success in the ImageCLEF campaign for example in combination with the region of interest method [9], we nd other elements of the segmentation procedure of more importance. After segmentation we map each segment into a feature space characterizing its color, shape and texture with description and dimensionality shown in Table 1. These features are used directly for image classication in Section 4. Their use in the content-based retrieval system (CBIR) of Section 3 is via the distance from sample images. Given a pair of a sample and a target image, for each sample segment we compute the distance of the closest segment in the target image. The nal (asymmetric) distance arises by simply averaging over all sample image segments. Next we describe the details of the segmentation (Section 2.1) procedure and in Section 2.2 we give a novel method to learn the weight of the feature groups based solely on the sample images of the photo retrieval queries. The
3 eect of various settings on the image processing quality is analyzed over the Visual Concept Detection (Section 4) and Photo Retrieval (Section 3) tasks of ImageCLEF Segmentation Our segmentation procedure is based on a multilevel Gaussian-Laplacian pyramid [7] that enables a gradual renement of the segments starting out from a coarse segmentation on the top level of the pyramid. Given a coarser segmentation on a higher level, we rst try to replace each segment pixel by pixel with the four lower level pixels if their similarity in the RGB space is within a threshold. If the four pixels of the ner resolution are dissimilar, we remove those pixels from the segment. The remaining segments are kept together as starting segments for the lower level procedure while the remove pixels can join existing segments or form new ones. On the top level of the pyramid we use a modied FelzenszwalbHuttenlocher graph cut method [6] that, on lower levels, simply continues the growth of the segments obtained on the higher level. Our main improvement over the original method is the use of Canny edge detection [10] values to weight the connection between neighboring pixels. The original method only uses distance in the RGB space as weight that we add to the edge detection weight. We also require a similar number of segments in the images that are large enough to be meaningful for retrieval or classication purposes. The original FelzenszwalbHuttenlocher method builds a minimum spanning forest where the addition of a new pixel to the component is constrained by the weight of the connection with the next pixel and the size of the existing component. We test two post-processing rules that reject the smallest segments. The pixels of rejected segments are then redistributed by the same minimum spanning forest method but now without any further restriction on the growth of the existing large segments. The two dierent rules are as follows: Segments of size below a threshold are rejected. All segments are rejected except for the prescribed number of largest ones. 2.2 Learning feature weights for image similarity search Our CBIR ranks images based on the distance of the target image segments with the sample image segments. Unlike image classication where classiers may be capable of learning the relative importance of the features, when considering distances in the feature space, we cannot distinguish between directions relevant or irrelevant with respect to image retrieval. When applying feature weight optimization for the Photo Retrieval task, we face several problems. First, training data consists solely of the three sample images of the topics. Second, relevance to certain Photo Retrieval topics are based on aspects other than image similarity such as the location of the scene. Third, the three sample images of the same topic are sometimes not even similar. Our method for training the image processing weights is based on a test for topic separation. We select those topics manually where the three sample images
4 are similar to one another. For ImageCLEF Photo 2008 the list of the selected topics (some of which are ImageCLEF 2007 only) is as follows: 01, 02, 04, 07, 14, 15, 17, 22, 24, 27, 33, 36, 41, 43, 45, 51, 53, 55, 58, 60. The training data consists of image pairs with an identical number of pairs from the same topic and from dierent topics. Since our distance is asymmetric, we have six pairs for one topic that results in 120 positive pairs. The negative pairs are formed by selecting two random pairs from a dierent topic for each of the 60 sample images. We optimize weights for the AUC value of the two-class classication. Since the task at hand is computationally very inexpensive, we simply performed a brute force parameter search. For larger problems we could choose from logistic regression (if we only train linear weights), simulated annealing or genetic algorithms to name a few. Given the post-campaign evaluation data, we could perform another manual parameter search to nd the best performing weights in terms of the MAP of the retrieval system. As shown in Section 3 we could reach very close to the best settings we found manually, a result that is in fact overtrained due to the use of all evaluation data. 3 The Photo Retrieval Task For the Photo Retrieval task we combine the scores of our text retrieval system (with or without query expansion) with the following visual relevance score. For a target image to be ranked we take each segment of a given topic sample image and nd the closest segment in the target image. We average distances over all these segments. Finally among the three sample images we use the smallest value that corresponds to the closest, most similar one. Since we compute distance instead of similarity, we simply negate the values. When combining the much lower quality visual scores with the text retrieval scores, we use a method that basically optimizes for early precision but reaches very good improvement in MAP as well. Due to the low quality of the visual scores, low ranked images carry little information and act as noise when combining with text retrieval. Hence we replace all except the highest scores by the same largest value among them, i.e. after some position i, for all j > i we let score j = score i. In our experiment we choose i to be the rst value where score i = score i+1. Our results are summarized in Table 2 for a choice of 100 segments with the best segmentation method that uses a 7-level Gaussian-Laplacian pyramid and Canny edge detection. We observe the following behavior. First, l 1 distance outperforms l 2 in all cases. Second, better CBIR scores translate into better combined scores. Third, the test for topic separation (method Section 2.2) nds weights that perform nearly as well as the overtrained best weight setting that we were only able to compute given all relevance assessment data and by far outperforms the all-1.0 weight case. Unfortunately query expansion gives only very minor improvement and we will have to revise this component of our system. Figure 1 shows the performance of our best methods on the dierent topics. Topics are sorted by the MAP of the pure visual result. As it can be seen, the
5 MAP P5 P20 l 1 w l 1 w l 1 TST l 2 TST l 1 best l 2 best txt txt+qe txt+l 1 w txt+l 2 w txt+tst l txt+tst l txt+best l txt+best l txt+qe+tst l txt+qe+best l l 1 l 1 norm is used l 2 l 2 norm is used TST visual feature weights optimized with test for topic separation (Section 2.2) w1.0 all 1.0 weights best weights hand picked based on the evaluation data txt text based information retrieval qe query expansion Table 2. Photo Retrieval performance of dierent methods (left) with explanation on the right. MAP P5 P20 RGB RGB + Canny RGB + pyramid RGB + Canny + pyramid RGB+HSV RGB+HSV + Canny RGB+HSV + pyramid RGB+HSV + Canny + pyramid Table 3. Performance of the variants method evaluated by dierent measures visual result improves text result in most of the topics with the exception of four topics (31, 60, 17 and 15) only. Interestingly, for ve topics (23, 59, 50 and 53) the MAP improvement is higher than the visual MAP itself. The main factors in our CBIR performance consist of l 1 distance in the HSV and DFT feature space as well as our home grown parameter optimization method. Table 3 compares some variations. In general the HSV space is better than RGB but RGB yields additional improvement in combination. The table also justies the use of both the Gaussian-Laplacian pyramid and the Canny edge weight in the FelzenszwalbHuttenlocher segmentation algorithm. Finally in Fig. 2 we compare the relative strength of the features. Five features, size, aspect ratio and the three mean HSV values themselves form a strong similarity space. This fact is due to the large number of segments so that these features act as histograms. Over this feature set DFT gives the largest additional im-
6 Fig. 1. Performance of dierent methods by topic. The di line denotes the improvement of the CBIR over text retrieval with query expansion. Fig. 2. Performance of dierent feature combinations.
7 Glob1 Large Small EER AUC Glob1 Large Small Night 10.00/ / /79.72 Overcast 18.77/ / /79.24 Vegetation 35.47/ / /77.87 Buildings 36.65/ / /73.36 Table 4. Left: Performance of the three basic methods and their combination, evaluated by dierent measures. Right: Examples of global and local types of concepts with performance given in the form of EER/AUC. provement while histograms and shape add very little, though positive, increase in MAP. 4 The Visual Concept Detection Task For the Visual Concept Detection Task we used our image processing system with three main settings: global: features computed for the whole image: mean color, histogram and DFT; medium: 50 segments, features: size, ratio, mean color, histogram, shape and DFT; small: 100 segments, features: size, ratio, mean color, histogram, shape and DFT; Logistic regression was used for classication with the global or segment features as input. For a single image we averaged the segment based predictions, which turned out more accurate than either the minimum or the maximum. We note that we did not use the class hierarchy information. Our main classication results summarized in Table 4 where, in addition to the three above settings for image processing, we give two additional combinations: Logreg: The output of the classiers are combined by logistic regression on the 1/4 random fraction training data as heldout set. For the rest of the training set predictions are generated in a 3-fold crossvalidation. Mixed: For each class the method performing best on the above dened heldout set was selected. As seen in Table 4, left, best overall performance is attained with a high dimensional global feature space, closely followed by the medium resolution segmentation. We nd a clear distinction between concepts that give an overall characterization of the image (day, night, overcast) and those that describe objects in the image (people, vegatation, buildings). The former concepts are best classied in a global while the latter in a segmentwise local feature space (Table 4, right).
8 Conclusion and future work We have demonstrated that image segmentation based retrieval and categorization systems perform well and analyzed the right choice for the segmenter and the visual features. In future work we will conduct a more thorough investigation of possible features and using of more sophisticated methods for computing image distances such as the mixture of Gaussian models. We also plan to strengthen our results by improving our query expansion procedure and using more sophisticated methods for text and image retrieval fusion as well as utilize visual concepts for retrieval. References 1. Arni, T., Clough, P., Sanderson, M., Grubinger, M.: Overview of the ImageCLEFphoto 2008 photographic retrieval task. In Peters, C., Giampiccol, D., Ferro, N., Petras, V., Gonzalo, J., Peñas, A., Deselaers, T., Mandl, T., Jones, G., Kurimo, M., eds.: Evaluating Systems for Multilingual and Multimodal Information Access 9th Workshop of the Cross-Language Evaluation Forum. Lecture Notes in Computer Science, Aarhus, Denmark (September 2008 (printed in 2009)) 2. Deselaers, T., Hanbury, A.: The visual concept detection task in ImageCLEF In Peters, C., Giampiccol, D., Ferro, N., Petras, V., Gonzalo, J., Peñas, A., Deselaers, T., Mandl, T., Jones, G., Kurimo, M., eds.: Evaluating Systems for Multilingual and Multimodal Information Access 9th Workshop of the Cross- Language Evaluation Forum. Lecture Notes in Computer Science, Aarhus, Denmark (September 2008 (printed in 2009)) 3. Grubinger, M., Clough, P., Müller, H., Deselears, T.: The IAPR TC-12 benchmark - a new evaluation resource for visual information systems. In: OntoImage. (2006) Rácz, S., Daróczy, B., Siklósi, D., Pereszlényi, A., Brendel, M., Benczúr, A.: Increasing cluster recall of cross-modal image retrieval. In: Working Notes for the CLEF 2008 Workshop, Aarhus, Denmark (September 2008) 5. Daróczy, B., Fekete, Z., Brendel, M.: imageclef 2008 visual concept detection. In: Working Notes for the CLEF 2008 Workshop, Aarhus, Denmark (September 2008) 6. Felzenszwalb, P.F., Huttenlocher, D.P.: Ecient graph-based image segmentation. International Journal of Computer Vision 59 (2004) 7. Burt, P., Adelson, E.: The Laplacian Pyramid as a Compact Image Code. Communications, IEEE Transactions on [legacy, pre-1988] 31(4) (1983) Xu, J., Croft, W.: Query expansion using local and global document analysis. Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval (1996) Ah-Pine, J., Cifarelli, C., Clinchant, S., Csurka, G., Renders, J.: XRCE's Participation to ImageCLEF In: Working Notes of the 2008 CLEF Workshop. (2008) 10. Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6) (November 1986)
ImageCLEF 2008 Bálint Daróczy
SZTAKI @ ImageCLEF 2008 Bálint Daróczy joint work with András Benczúr, Mátyás Brendel, Zsolt Fekete, Attila Pereszlényi, Simon Rácz, Dávid Siklósi Data Mining and Web Search Group Computer and Automation
More informationCross-modal retrieval by text and image feature biclustering
Cross-modal retrieval by text and image feature biclustering András Benczúr István Bíró Mátyás Brendel Károly Csalogány Bálint Daróczy Dávid Siklósi Data Mining and Web search Research Group, Informatics
More informationThe Visual Concept Detection Task in ImageCLEF 2008
The Visual Concept Detection Task in ImageCLEF 2008 Thomas Deselaers 1 and Allan Hanbury 2,3 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 PRIP, Inst.
More informationImage Query Expansion Using Semantic Selectional Restrictions
Image Query Expansion Using Semantic Selectional Restrictions Osama El Demerdash, Sabine Bergler and Leila Kosseim Concordia University {osama el,bergler,kosseim}@cse.concordia.ca CLaC Laboratory - Department
More informationImageCLEF 2011
SZTAKI @ ImageCLEF 2011 Bálint Daróczy joint work with András Benczúr, Róbert Pethes Data Mining and Web Search Group Computer and Automation Research Institute Hungarian Academy of Sciences Training/test
More informationUsing an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval
Using an Image-Text Parallel Corpus and the Web for Query Expansion in Cross-Language Image Retrieval Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National
More informationBipartite Graph Partitioning and Content-based Image Clustering
Bipartite Graph Partitioning and Content-based Image Clustering Guoping Qiu School of Computer Science The University of Nottingham qiu @ cs.nott.ac.uk Abstract This paper presents a method to model the
More informationCS 664 Segmentation. Daniel Huttenlocher
CS 664 Segmentation Daniel Huttenlocher Grouping Perceptual Organization Structural relationships between tokens Parallelism, symmetry, alignment Similarity of token properties Often strong psychophysical
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationRobust Shape Retrieval Using Maximum Likelihood Theory
Robust Shape Retrieval Using Maximum Likelihood Theory Naif Alajlan 1, Paul Fieguth 2, and Mohamed Kamel 1 1 PAMI Lab, E & CE Dept., UW, Waterloo, ON, N2L 3G1, Canada. naif, mkamel@pami.uwaterloo.ca 2
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW ON CONTENT BASED IMAGE RETRIEVAL BY USING VISUAL SEARCH RANKING MS. PRAGATI
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany deselaers@cs.rwth-aachen.de 2 RWTH Aachen University,
More informationClustering for Text and Image-Based Photo Retrieval at CLEF 2009
Clustering for ext and mage-based Photo Retrieval at CLEF 2009 Qian Zhu and Diana nkpen School of nformation echnology and Engineering University of Ottawa qzhu012@uottawa.ca, diana@site.uottawa.ca Abstract.
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationIPL at ImageCLEF 2010
IPL at ImageCLEF 2010 Alexandros Stougiannis, Anestis Gkanogiannis, and Theodore Kalamboukis Information Processing Laboratory Department of Informatics Athens University of Economics and Business 76 Patission
More informationResPubliQA 2010
SZTAKI @ ResPubliQA 2010 David Mark Nemeskey Computer and Automation Research Institute, Hungarian Academy of Sciences, Budapest, Hungary (SZTAKI) Abstract. This paper summarizes the results of our first
More informationLocal Features: Detection, Description & Matching
Local Features: Detection, Description & Matching Lecture 08 Computer Vision Material Citations Dr George Stockman Professor Emeritus, Michigan State University Dr David Lowe Professor, University of British
More informationBeyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University
Beyond Mere Pixels: How Can Computers Interpret and Compare Digital Images? Nicholas R. Howe Cornell University Why Image Retrieval? World Wide Web: Millions of hosts Billions of images Growth of video
More informationWater-Filling: A Novel Way for Image Structural Feature Extraction
Water-Filling: A Novel Way for Image Structural Feature Extraction Xiang Sean Zhou Yong Rui Thomas S. Huang Beckman Institute for Advanced Science and Technology University of Illinois at Urbana Champaign,
More informationA Graph Theoretic Approach to Image Database Retrieval
A Graph Theoretic Approach to Image Database Retrieval Selim Aksoy and Robert M. Haralick Intelligent Systems Laboratory Department of Electrical Engineering University of Washington, Seattle, WA 98195-2500
More informationImage retrieval based on bag of images
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Image retrieval based on bag of images Jun Zhang University of Wollongong
More informationCIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task
CIS UDEL Working Notes on ImageCLEF 2015: Compound figure detection task Xiaolong Wang, Xiangying Jiang, Abhishek Kolagunda, Hagit Shatkay and Chandra Kambhamettu Department of Computer and Information
More informationOverview of the ImageCLEFphoto 2008 Photographic Retrieval Task
Overview of the ImageCLEFphoto 2008 Photographic Retrieval Task Thomas Arni 1, Paul Clough 1, Mark Sanderson 1 and Michael Grubinger 2 1 Department of Information Studies, University of Sheffield, UK 2
More informationEvaluation of texture features for image segmentation
RIT Scholar Works Articles 9-14-2001 Evaluation of texture features for image segmentation Navid Serrano Jiebo Luo Andreas Savakis Follow this and additional works at: http://scholarworks.rit.edu/article
More informationObject detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation
Object detection using Region Proposals (RCNN) Ernest Cheung COMP790-125 Presentation 1 2 Problem to solve Object detection Input: Image Output: Bounding box of the object 3 Object detection using CNN
More informationPredicting Popular Xbox games based on Search Queries of Users
1 Predicting Popular Xbox games based on Search Queries of Users Chinmoy Mandayam and Saahil Shenoy I. INTRODUCTION This project is based on a completed Kaggle competition. Our goal is to predict which
More informationEE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm
EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm Group 1: Mina A. Makar Stanford University mamakar@stanford.edu Abstract In this report, we investigate the application of the Scale-Invariant
More informationA fully-automatic approach to answer geographic queries: GIRSA-WP at GikiP
A fully-automatic approach to answer geographic queries: at GikiP Johannes Leveling Sven Hartrumpf Intelligent Information and Communication Systems (IICS) University of Hagen (FernUniversität in Hagen)
More information2. (a) Briefly discuss the forms of Data preprocessing with neat diagram. (b) Explain about concept hierarchy generation for categorical data.
Code No: M0502/R05 Set No. 1 1. (a) Explain data mining as a step in the process of knowledge discovery. (b) Differentiate operational database systems and data warehousing. [8+8] 2. (a) Briefly discuss
More informationMultimodal Medical Image Retrieval based on Latent Topic Modeling
Multimodal Medical Image Retrieval based on Latent Topic Modeling Mandikal Vikram 15it217.vikram@nitk.edu.in Suhas BS 15it110.suhas@nitk.edu.in Aditya Anantharaman 15it201.aditya.a@nitk.edu.in Sowmya Kamath
More informationSpeed-up Multi-modal Near Duplicate Image Detection
Open Journal of Applied Sciences, 2013, 3, 16-21 Published Online March 2013 (http://www.scirp.org/journal/ojapps) Speed-up Multi-modal Near Duplicate Image Detection Chunlei Yang 1,2, Jinye Peng 2, Jianping
More informationXRCE s Participation in ImageCLEF 2009
XRCE s Participation in ImageCLEF 2009 Julien Ah-Pine 1, Stephane Clinchant 1,2, Gabriela Csurka 1, Yan Liu 1 1 Xerox Research Centre Europe, 6 chemin de Maupertuis 38240, Meylan France firstname.lastname@xrce.xerox.com
More informationTrans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval
Trans-Media Pseudo-Relevance Feedback Methods in Multimedia Retrieval Stephane Clinchant, Jean-Michel Renders, and Gabriela Csurka Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240 Meylan, France
More informationConsiderations Regarding the Minimum Spanning Tree Pyramid Segmentation Method
Considerations Regarding the Minimum Spanning Tree Pyramid Segmentation Method (Why does it always find the lady?) Adrian Ion, Walter G. Kropatsch, and Yll Haxhimusa Vienna University of Technology, Pattern
More informationIPL at ImageCLEF 2017 Concept Detection Task
IPL at ImageCLEF 2017 Concept Detection Task Leonidas Valavanis and Spyridon Stathopoulos Information Processing Laboratory, Department of Informatics, Athens University of Economics and Business, 76 Patission
More informationRefine boundary at resolution r. r+1 r. Update context information CI(r) based on CI(r-1) Classify at resolution r, based on CI(r), update CI(r)
Context Based Multiscale Classication of Images Jia Li Robert M. Gray EE Department EE Department Stanford Univ., CA 94305 Stanford Univ., CA 94305 jiali@isl.stanford.edu rmgray@stanford.edu Abstract This
More informationA Survey on Multimedia Information Retrieval System
A Survey on Multimedia Information Retrieval System Pooja Shinde 1, Prof. J. V. Shinde 2 1 M.E Student, Kalyani Charitable Trust s Late G.N. Sapkal College of Engineering 2 Professor, Kalyani Charitable
More informationSchedule for Rest of Semester
Schedule for Rest of Semester Date Lecture Topic 11/20 24 Texture 11/27 25 Review of Statistics & Linear Algebra, Eigenvectors 11/29 26 Eigenvector expansions, Pattern Recognition 12/4 27 Cameras & calibration
More informationOverview of the CLEF 2009 Medical Image Annotation Track
Overview of the CLEF 2009 Medical Image Annotation Track Tatiana Tommasi 1, Barbara Caputo 1, Petra Welter 2, Mark Oliver Güld 2, and Thomas M. Deserno 2 1 Idiap Research Institute, Martigny, Switzerland,
More informationA Novel Image Retrieval Method Using Segmentation and Color Moments
A Novel Image Retrieval Method Using Segmentation and Color Moments T.V. Saikrishna 1, Dr.A.Yesubabu 2, Dr.A.Anandarao 3, T.Sudha Rani 4 1 Assoc. Professor, Computer Science Department, QIS College of
More informationSTUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES
25-29 JATIT. All rights reserved. STUDYING THE FEASIBILITY AND IMPORTANCE OF GRAPH-BASED IMAGE SEGMENTATION TECHNIQUES DR.S.V.KASMIR RAJA, 2 A.SHAIK ABDUL KHADIR, 3 DR.S.S.RIAZ AHAMED. Dean (Research),
More informationIMPROVING THE PERFORMANCE OF CONTENT-BASED IMAGE RETRIEVAL SYSTEMS WITH COLOR IMAGE PROCESSING TOOLS
IMPROVING THE PERFORMANCE OF CONTENT-BASED IMAGE RETRIEVAL SYSTEMS WITH COLOR IMAGE PROCESSING TOOLS Fabio Costa Advanced Technology & Strategy (CGISS) Motorola 8000 West Sunrise Blvd. Plantation, FL 33322
More informationPerformance Evaluation
Chapter 4 Performance Evaluation For testing and comparing the effectiveness of retrieval and classification methods, ways of evaluating the performance are required. This chapter discusses several of
More informationMachine Learning based session drop prediction in LTE networks and its SON aspects
Machine Learning based session drop prediction in LTE networks and its SON aspects Bálint Daróczy, András Benczúr Institute for Computer Science and Control (MTA SZTAKI) Hungarian Academy of Sciences Péter
More informationCPSC 425: Computer Vision
1 / 31 CPSC 425: Computer Vision Instructor: Jim Little little@cs.ubc.ca Department of Computer Science University of British Columbia Lecture Notes 2016/2017 Term 2 2 / 31 Menu March 16, 2017 Topics:
More informationAn Evaluation of Information Retrieval Accuracy. with Simulated OCR Output. K. Taghva z, and J. Borsack z. University of Massachusetts, Amherst
An Evaluation of Information Retrieval Accuracy with Simulated OCR Output W.B. Croft y, S.M. Harding y, K. Taghva z, and J. Borsack z y Computer Science Department University of Massachusetts, Amherst
More informationUsing Coherence-based Measures to Predict Query Difficulty
Using Coherence-based Measures to Predict Query Difficulty Jiyin He, Martha Larson, and Maarten de Rijke ISLA, University of Amsterdam {jiyinhe,larson,mdr}@science.uva.nl Abstract. We investigate the potential
More informationRobotics Programming Laboratory
Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car
More informationCAP 5415 Computer Vision Fall 2012
CAP 5415 Computer Vision Fall 01 Dr. Mubarak Shah Univ. of Central Florida Office 47-F HEC Lecture-5 SIFT: David Lowe, UBC SIFT - Key Point Extraction Stands for scale invariant feature transform Patented
More informationSolution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur
Pyramids Gaussian pre-filtering Solution: filter the image, then subsample blur F 0 subsample blur subsample * F 0 H F 1 F 1 * H F 2 { Gaussian pyramid blur F 0 subsample blur subsample * F 0 H F 1 F 1
More informationUniversity of Amsterdam at INEX 2010: Ad hoc and Book Tracks
University of Amsterdam at INEX 2010: Ad hoc and Book Tracks Jaap Kamps 1,2 and Marijn Koolen 1 1 Archives and Information Studies, Faculty of Humanities, University of Amsterdam 2 ISLA, Faculty of Science,
More informationLinear combinations of simple classifiers for the PASCAL challenge
Linear combinations of simple classifiers for the PASCAL challenge Nik A. Melchior and David Lee 16 721 Advanced Perception The Robotics Institute Carnegie Mellon University Email: melchior@cmu.edu, dlee1@andrew.cmu.edu
More informationColor-Based Classification of Natural Rock Images Using Classifier Combinations
Color-Based Classification of Natural Rock Images Using Classifier Combinations Leena Lepistö, Iivari Kunttu, and Ari Visa Tampere University of Technology, Institute of Signal Processing, P.O. Box 553,
More informationAn Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques
An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques Doaa M. Alebiary Department of computer Science, Faculty of computers and informatics Benha University
More informationDocument Expansion for Text-based Image Retrieval at CLEF 2009
Document Expansion for Text-based Image Retrieval at CLEF 2009 Jinming Min, Peter Wilkins, Johannes Leveling, and Gareth Jones Centre for Next Generation Localisation School of Computing, Dublin City University
More informationA Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering
A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering Gurpreet Kaur M-Tech Student, Department of Computer Engineering, Yadawindra College of Engineering, Talwandi Sabo,
More informationTA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999)
TA Section 7 Problem Set 3 SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999) Sam Corbett-Davies TA Section 7 02-13-2014 Distinctive Image Features from Scale-Invariant
More informationSIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014
SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014 SIFT SIFT: Scale Invariant Feature Transform; transform image
More informationUsing the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection
Using the Deformable Part Model with Autoencoded Feature Descriptors for Object Detection Hyunghoon Cho and David Wu December 10, 2010 1 Introduction Given its performance in recent years' PASCAL Visual
More informationSiemens TREC-4 Report: Further Experiments with Database. Merging. Ellen M. Voorhees. Siemens Corporate Research, Inc.
Siemens TREC-4 Report: Further Experiments with Database Merging Ellen M. Voorhees Siemens Corporate Research, Inc. Princeton, NJ ellen@scr.siemens.com Abstract A database merging technique is a strategy
More informationMedical Image Annotation in ImageCLEF 2008
Medical Image Annotation in ImageCLEF 2008 Thomas Deselaers 1 and Thomas M. Deserno 2 1 RWTH Aachen University, Computer Science Department, Aachen, Germany 2 RWTH Aachen University, Dept. of Medical Informatics,
More informationI2R ImageCLEF Photo Annotation 2009 Working Notes
I2R ImageCLEF Photo Annotation 2009 Working Notes Jiquan Ngiam and Hanlin Goh Institute for Infocomm Research, Singapore, 1 Fusionopolis Way, Singapore 138632 {jqngiam, hlgoh}@i2r.a-star.edu.sg Abstract
More informationContent Based Image Retrieval (CBIR) Using Segmentation Process
Content Based Image Retrieval (CBIR) Using Segmentation Process R.Gnanaraja 1, B. Jagadishkumar 2, S.T. Premkumar 3, B. Sunil kumar 4 1, 2, 3, 4 PG Scholar, Department of Computer Science and Engineering,
More informationSemi-supervised Data Representation via Affinity Graph Learning
1 Semi-supervised Data Representation via Affinity Graph Learning Weiya Ren 1 1 College of Information System and Management, National University of Defense Technology, Changsha, Hunan, P.R China, 410073
More informationApplications. Foreground / background segmentation Finding skin-colored regions. Finding the moving objects. Intelligent scissors
Segmentation I Goal Separate image into coherent regions Berkeley segmentation database: http://www.eecs.berkeley.edu/research/projects/cs/vision/grouping/segbench/ Slide by L. Lazebnik Applications Intelligent
More informationUJM at ImageCLEFwiki 2008
UJM at ImageCLEFwiki 2008 Christophe Moulin, Cecile Barat, Mathias Géry, Christophe Ducottet, Christine Largeron To cite this version: Christophe Moulin, Cecile Barat, Mathias Géry, Christophe Ducottet,
More informationAn Introduction to Content Based Image Retrieval
CHAPTER -1 An Introduction to Content Based Image Retrieval 1.1 Introduction With the advancement in internet and multimedia technologies, a huge amount of multimedia data in the form of audio, video and
More informationLearning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009
Learning and Inferring Depth from Monocular Images Jiyan Pan April 1, 2009 Traditional ways of inferring depth Binocular disparity Structure from motion Defocus Given a single monocular image, how to infer
More informationClassification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014
Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014 Sungbin Choi, Jinwook Choi Medical Informatics Laboratory, Seoul National University, Seoul, Republic of
More informationStructural Analysis of Aerial Photographs (HB47 Computer Vision: Assignment)
Structural Analysis of Aerial Photographs (HB47 Computer Vision: Assignment) Xiaodong Lu, Jin Yu, Yajie Li Master in Artificial Intelligence May 2004 Table of Contents 1 Introduction... 1 2 Edge-Preserving
More informationShort Run length Descriptor for Image Retrieval
CHAPTER -6 Short Run length Descriptor for Image Retrieval 6.1 Introduction In the recent years, growth of multimedia information from various sources has increased many folds. This has created the demand
More informationMULTIVARIATE TEXTURE DISCRIMINATION USING A PRINCIPAL GEODESIC CLASSIFIER
MULTIVARIATE TEXTURE DISCRIMINATION USING A PRINCIPAL GEODESIC CLASSIFIER A.Shabbir 1, 2 and G.Verdoolaege 1, 3 1 Department of Applied Physics, Ghent University, B-9000 Ghent, Belgium 2 Max Planck Institute
More informationA Miniature-Based Image Retrieval System
A Miniature-Based Image Retrieval System Md. Saiful Islam 1 and Md. Haider Ali 2 Institute of Information Technology 1, Dept. of Computer Science and Engineering 2, University of Dhaka 1, 2, Dhaka-1000,
More informationNOVEL PCA-BASED COLOR-TO-GRAY IMAGE CONVERSION. Ja-Won Seo and Seong Dae Kim
NOVEL PCA-BASED COLOR-TO-GRAY IMAGE CONVERSION Ja-Won Seo and Seong Dae Kim Korea Advanced Institute of Science and Technology (KAIST) Department of Electrical Engineering 21 Daehak-ro, Yuseong-gu, Daejeon
More informationUnsupervised learning in Vision
Chapter 7 Unsupervised learning in Vision The fields of Computer Vision and Machine Learning complement each other in a very natural way: the aim of the former is to extract useful information from visual
More informationSUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS
SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS Cognitive Robotics Original: David G. Lowe, 004 Summary: Coen van Leeuwen, s1460919 Abstract: This article presents a method to extract
More informationWavelet Based Image Retrieval Method
Wavelet Based Image Retrieval Method Kohei Arai Graduate School of Science and Engineering Saga University Saga City, Japan Cahya Rahmad Electronic Engineering Department The State Polytechnics of Malang,
More informationContent-Based Image Retrieval with LIRe and SURF on a Smartphone-Based Product Image Database
Content-Based Image Retrieval with LIRe and SURF on a Smartphone-Based Product Image Database Kai Chen 1 and Jean Hennebert 2 1 University of Fribourg, DIVA-DIUF, Bd. de Pérolles 90, 1700 Fribourg, Switzerland
More informationMRIM-LIG at ImageCLEF 2009: Photo Retrieval and Photo Annotation tasks
MRIM-LIG at ImageCLEF 2009: Photo Retrieval and Photo Annotation tasks Philippe Mulhem, Jean-Pierre Chevallet, Georges Quénot, Rami Al Batal Grenoble University, LIG Philippe.Mulhem@imag.fr, Jean-Pierre.Chevallet@imag.fr
More informationEfficient Content Based Image Retrieval System with Metadata Processing
IJIRST International Journal for Innovative Research in Science & Technology Volume 1 Issue 10 March 2015 ISSN (online): 2349-6010 Efficient Content Based Image Retrieval System with Metadata Processing
More informationHIGH RESOLUTION REMOTE SENSING IMAGE SEGMENTATION BASED ON GRAPH THEORY AND FRACTAL NET EVOLUTION APPROACH
HIGH RESOLUTION REMOTE SENSING IMAGE SEGMENTATION BASED ON GRAPH THEORY AND FRACTAL NET EVOLUTION APPROACH Yi Yang, Haitao Li, Yanshun Han, Haiyan Gu Key Laboratory of Geo-informatics of State Bureau of
More informationWord Indexing Versus Conceptual Indexing in Medical Image Retrieval
Word Indexing Versus Conceptual Indexing in Medical Image Retrieval (ReDCAD participation at ImageCLEF Medical Image Retrieval 2012) Karim Gasmi, Mouna Torjmen-Khemakhem, and Maher Ben Jemaa Research unit
More informationLOCALIZATION OF FACIAL REGIONS AND FEATURES IN COLOR IMAGES. Karin Sobottka Ioannis Pitas
LOCALIZATION OF FACIAL REGIONS AND FEATURES IN COLOR IMAGES Karin Sobottka Ioannis Pitas Department of Informatics, University of Thessaloniki 540 06, Greece e-mail:fsobottka, pitasg@zeus.csd.auth.gr Index
More informationIPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching
IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching Sheng Gao, Jean-Pierre Chevallet and Joo-Hwee Lim IPAL, Institute for Infocomm Research, A*Star, Singapore
More informationShort Survey on Static Hand Gesture Recognition
Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of
More informationExperiment for Using Web Information to do Query and Document Expansion
Experiment for Using Web Information to do Query and Document Expansion Yih-Chen Chang and Hsin-Hsi Chen * Department of Computer Science and Information Engineering National Taiwan University Taipei,
More informationClassifier C-Net. 2D Projected Images of 3D Objects. 2D Projected Images of 3D Objects. Model I. Model II
Advances in Neural Information Processing Systems 7. (99) The MIT Press, Cambridge, MA. pp.949-96 Unsupervised Classication of 3D Objects from D Views Satoshi Suzuki Hiroshi Ando ATR Human Information
More informationApplications of Neighborhood Sequence in Image Processing and Database Retrieval
Journal of Universal Computer Science, vol. 12, no. 9 (2006), 1240-1253 submitted: 31/12/05, accepted: 12/5/06, appeared: 28/9/06 J.UCS Applications of Neighborhood Sequence in Image Processing and Database
More informationCS 534: Computer Vision Segmentation and Perceptual Grouping
CS 534: Computer Vision Segmentation and Perceptual Grouping Ahmed Elgammal Dept of Computer Science CS 534 Segmentation - 1 Outlines Mid-level vision What is segmentation Perceptual Grouping Segmentation
More informationKey properties of local features
Key properties of local features Locality, robust against occlusions Must be highly distinctive, a good feature should allow for correct object identification with low probability of mismatch Easy to etract
More informationEfficient Acquisition of Human Existence Priors from Motion Trajectories
Efficient Acquisition of Human Existence Priors from Motion Trajectories Hitoshi Habe Hidehito Nakagawa Masatsugu Kidode Graduate School of Information Science, Nara Institute of Science and Technology
More informationIdentification of Paintings in Camera-Phone Images
Identification of Paintings in Camera-Phone Images Gabriel M. Hoffmann, Peter W. Kimball, Stephen P. Russell Department of Aeronautics and Astronautics Stanford University, Stanford, CA 94305 {gabeh,pkimball,sprussell}@stanford.edu
More informationColor Content Based Image Classification
Color Content Based Image Classification Szabolcs Sergyán Budapest Tech sergyan.szabolcs@nik.bmf.hu Abstract: In content based image retrieval systems the most efficient and simple searches are the color
More informationCHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR)
63 CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR) 4.1 INTRODUCTION The Semantic Region Based Image Retrieval (SRBIR) system automatically segments the dominant foreground region and retrieves
More information70 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing
70 IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY 2004 ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing Jianping Fan, Ahmed K. Elmagarmid, Senior Member, IEEE, Xingquan
More informationOperators-Based on Second Derivative double derivative Laplacian operator Laplacian Operator Laplacian Of Gaussian (LOG) Operator LOG
Operators-Based on Second Derivative The principle of edge detection based on double derivative is to detect only those points as edge points which possess local maxima in the gradient values. Laplacian
More informationDESIGN OF A NOVEL IMAGE FUSION ALGORITHM FOR IMPULSE NOISE REMOVAL IN REMOTE SENSING IMAGES BY USING THE QUALITY ASSESSMENT
DESIGN OF A NOVEL IMAGE FUSION ALGORITHM FOR IMPULSE NOISE REMOVAL IN REMOTE SENSING IMAGES BY USING THE QUALITY ASSESSMENT P.PAVANI, M.V.H.BHASKARA MURTHY Department of Electronics and Communication Engineering,Aditya
More informationPredicting Query Performance on the Web
Predicting Query Performance on the Web No Author Given Abstract. Predicting performance of queries has many useful applications like automatic query reformulation and automatic spell correction. However,
More informationsecond_language research_teaching sla vivian_cook language_department idl
Using Implicit Relevance Feedback in a Web Search Assistant Maria Fasli and Udo Kruschwitz Department of Computer Science, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, United Kingdom fmfasli
More informationSemantics-based Image Retrieval by Region Saliency
Semantics-based Image Retrieval by Region Saliency Wei Wang, Yuqing Song and Aidong Zhang Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, NY 14260, USA
More information