Data Modelling and. Multimedia. Databases M. Multimedia. Information Retrieval Part II. Outline

Similar documents
Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

Machine Learning: Algorithms and Applications

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Image Alignment CSC 767

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

Object-Based Techniques for Image Retrieval

LECTURE : MANIFOLD LEARNING

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

A Binarization Algorithm specialized on Document Images and Photos

Active Contours/Snakes

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

COMPLEX WAVELET TRANSFORM-BASED COLOR INDEXING FOR CONTENT-BASED IMAGE RETRIEVAL

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Cluster Analysis of Electrical Behavior

Unsupervised Learning

Detection of an Object by using Principal Component Analysis

Unsupervised Learning and Clustering

S1 Note. Basis functions.

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

1. Introduction. Abstract

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Semantic Image Retrieval Using Region Based Inverted File

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Lecture #15 Lecture Notes

Performance Evaluation of Information Retrieval Systems

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Hybrid Non-Blind Color Image Watermarking

Mathematics 256 a course in differential equations for engineering students

An efficient method to build panoramic image mosaics

Unsupervised Learning and Clustering

Fitting: Deformable contours April 26 th, 2018

An Image Compression Algorithm based on Wavelet Transform and LZW

Photo management applications

Machine Learning 9. week

Module Management Tool in Software Development Organizations

K-means and Hierarchical Clustering

Query Clustering Using a Hybrid Query Similarity Measure

Visual Thesaurus for Color Image Retrieval using Self-Organizing Maps

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

Hierarchical clustering for gene expression data analysis

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Relevance Feedback for Image Retrieval

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Combination of Color and Local Patterns as a Feature Vector for CBIR

A New Approach For the Ranking of Fuzzy Sets With Different Heights

An Image Fusion Approach Based on Segmentation Region

Brushlet Features for Texture Image Retrieval

PRÉSENTATIONS DE PROJETS

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

TN348: Openlab Module - Colocalization

Matching and Retrieval Based on the Vocabulary and Grammar of Color Patterns

Biostatistics 615/815

Video Content Representation using Optimal Extraction of Frames and Scenes

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Polyhedral Compilation Foundations

KIDS Lab at ImageCLEF 2012 Personal Photo Retrieval

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures

Image Interpretation Based On Similarity Measures of Visual Content Descriptors An Insight Mungamuru Nirmala

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

An Improved Image Segmentation Algorithm Based on the Otsu Method

Research and Application of Fingerprint Recognition Based on MATLAB

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

A Deflected Grid-based Algorithm for Clustering Analysis

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Background Removal in Image indexing and Retrieval

A Study on Discrete Wavelet Transform based Texture Feature Extraction for Image Mining

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros.

Hermite Splines in Lie Groups as Products of Geodesics

Feature Extraction and Registration An Overview

Structure from Motion

Novel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition

A Novel Similarity Measure using a Normalized Hausdorff Distance for Trademarks Retrieval Based on Genetic Algorithm

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Local Quaternary Patterns and Feature Local Quaternary Patterns

A Novel Method for Transforming XML Documents to Time Series and Clustering Them Based on Delaunay Triangulation

Multi-view 3D Position Estimation of Sports Players

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

Reducing Frame Rate for Object Tracking

Optimal Workload-based Weighted Wavelet Synopses

MOTION BLUR ESTIMATION AT CORNERS

Searching Large Image Databases using Color Information

Comparison Study of Textural Descriptors for Training Neural Network Classifiers

Laplacian Eigenmap for Image Retrieval

Lecture 5: Multilayer Perceptrons

Grading Image Retrieval Based on DCT and DWT Compressed Domains Using Low-Level Features

Support Vector Machines

Transcription:

ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA Data Modellng and Multmeda Databases M Internatonal Second cycle degree programme (LM) n Dgtal Humantes and Dgtal Knowledge (DHDK) Unversty of Bologna Multmeda Informaton Retreval Part II Home page: http://www-db.ds.unbo.t/courses/dmmmdb/ Electronc verson: 2.02.MultmedaInformatonRetreval-II.pdf Electronc verson: 2.02.MultmedaInformatonRetreval-II-2p.pdf Outlne Descrpton models for MM data retreval Low-level features for MM data content representaton Smlarty measures for MM data content comparson Regon-based mage retreval The Wndsurf system 2

MM data retreval From the prevous lesson we know that features are a smarter way to represent MM data content than ther orgnal format e.g., color and texture for an mage Today we focus on whch are the most sutable models for representng, nterpretng, descrbng and comparng such features E.g., color hstograms for mages by usng the Eucldean dstance as smlarty measure wth the fnal goal to be able to retreve from MM collectons those objects whch are most nterestng for us!! 3 Content-based search Frst approach to search for MM objects reles on standard text-based technques, provded objects come wth a precse textual descrpton of what they represent/descrbe,.e., of ther semantcs However, the annotaton of MM objects s a subjectve, tme consumng, and tedous process (completely manual!!) A more convenent approach, sutable to manage large DBs, s to automatcally extract from MM objects a set of (low-level) relevant numercal features that, at least partally, convey some of the semantcs of the objects Clearly, whch are the best features to extract depend on the specfc medum and on the applcaton at hand (.e., what we are lookng for) Look for cheetahs? Ths s fne; but, how to fnd t? 4

Content-based smlarty search Once we have feature values, we can search objects by usng them Assume a database (DB) wth N MM objects (e.g., mages) and, for each of the N objects, we have extracted the relevant features E.g., we could extract some color nformaton from mages We can now search for objects whose feature values are smlar (n some sense to be defned) to the feature values of our query [SWS+00, LSD+06, LZL+07, DJL+08] In general, ths approach, much alke as t happens n text-retreval, cannot guarantee that all and only relevant results are returned as result of a query Look for cheetahs? Oops! Not really a cheetah ;-) 5 The general scenaro In general, we have a 2-levels scenaro: Needed to support automatc retreval The objects level Ultmately, we want to fnd relevant objects Color Shape Texture The features level We extract features from the objects, and use them for queryng the DB 6

The reference archtecture For the text-based approach, the mage queryng problem can be smply transformed nto a tradtonal nformaton retreval problem as we wll see speakng about MM data annotaton GUI query mage vsualze results feature extracton mage segmentaton (optonal) query processor For content-based nformaton retreval (CBIR) more sophstcated query evaluaton technques are requred mage DB query engne ndex feature DB 7 Varables of the CBIR problem How the set of relevant results s determned depends on whch low-level features are used to characterze the MM data content on the smlarty crteron (dstance functon) used to compare such features on how DB objects are ranked wth respect to the query on whether the user s nterested n the whole MM data query or only n a part of t All these aspects strongly nfluence the query evaluaton process! Smplest case: each MM data object (.e., mage) s characterzed usng global low-level features and the result of a query conssts n the set of DB objects that better match the vsual characterstcs of the target object, accordng to a predefned smlarty crteron, whch s n turn based on such features Ths s also defned Nearest Neghbors (NN) search problem 8

Representng color In a dgtal mage, the color space that encodes the color content of each pxel of the mage s necessarly dscretzed Ths depends on how many bts per pxel (bpp) are used Example: f one represents mages n the RGB space by usng 8 3 = 24 bpp, the number of possble dstnct colors s 2 24 = 16,777,216 Wth 8 bts per channel, we have 256 possble values on each channel Although dscrete, the possble color values are stll too many f one wants to compactly represent the color content of an mage Ths also ams at achevng some robustness n the matchng process (e.g., the two RGB values (123,078,226) and (121,080,230) are almost ndstngushable) In practce, a common approach to represent color s to make use of hstograms 9 Color hstograms A color hstogram h s a D-dmensonal vector, whch s obtaned by quantzng the color space nto D dstnct colors Typcal values of D are 32, 64, 256, 1024, Example: the HSV color space can be quantzed nto D=32 colors: H s dvded nto 8 ntervals, and S nto 4 V = 0 guarantees nvarance to lght ntensty The -th component (also called bn) of h stores the percentage (number) of pxels n the mage whose color s mapped to the -th color Although conceptually smple, color hstograms are wdely used snce they are relatvely nvarant to translaton, rotaton, scale changes and partal occlusons D = 64 10

Further examples Two D=64 color hstograms 11 Comparng color hstograms Snce hstograms are vectors, we can use any Lp-norm to measure the dstance (dssmlarty) of two color hstograms However, dong so we are not takng nto account colors correlaton Dependng on the query and the dataset, we mght therefore obtan low-qualty results Weghted Lp-norms and relevance feedback can partally allevate the problem The problem s that Lp-norms just consder the dfference of correspondng bns,.e., they perform a 1-1 comparson Wth color hstograms, our coordnates are not unrelated ( cross-talk effect) 12

Recall on Lp-norms Gven two D-dmensonal vectors p and q, ther dstance n the reference D-dmensonal space based on Lp-norm s: L p D 1 / p ( p, q ) = ( p q ) = 1 p 1 p < A relevant example s the Eucldean dstance (p=2): L ( D 2 2 p, q ) = ( p q ) = 1 and ts weghed verson: L 1 / 2 ( D 2 2, W p, q, W ) = w ( p q ) = 1 where W(w 1,,w D ) s the vector of weghts that reflect the mportance of each coordnate of the D-dmensonal space 1 / 2 L 2 Weghted L 2 13 Sample queres based on color (1) QueryImage Eucldean dstance 32-D HSV hstograms Weghted Eucldean dstance 14

Sample queres based on color (2) QueryImage Eucldean dstance 32-D HSV hstograms Weghted Eucldean dstance 15 Quadratc dstance Consder two hstograms h and q, both wth D bns Ther quadratc dstance s defned as: L A (h, q;a) D D = = = 1 j= 1 a,j ( h q )( h q ) T ( h q) A ( h q) j j L A where A = {a,j } s called the (color-)smlarty matrx The value of a,j s the smlarty of the -th and the j-th colors (a, = 1) Note that: when A s a dagonal matrx we are back to the weghted Eucldean dstance, when A = I (the dentty matrx) we obtan the L 2 dstance 16

Quadratc dstance vs. Eucldean dstance As a smple example, let D = 3, wth colors red, orange, and blue Consder 3 pure-color mages and the correspondng hstograms: h1=(1,0,0) h2=(0,1,0) h3=(0,0,1) Usng L 2, the dstance between two dfferent mages s always 2 On the other hand, let the color-smlarty matrx be defned as: A 1 0.8 0 0.8 1 0 0 0 1 Now we have L A (h1,h2) = 0.4, whereas L A (h1,h3) = L A (h2,h3) = 2 17 Representng texture (1) Unlke color, texture s not a property of the sngle pxel, rather t s a collectve property of a pxel and ts, sutably defned, neghborhood mosac effect blnds effect Intutvely, texture provdes nformaton about the unformty, granularty and regularty of the mage surface It s usually computed just consderng the gray-scale values of pxels (.e., the V channel n HSV) 18

Representng texture (2) Tamura features correspond to propertes of a texture whch are readly perceved, that s coarseness, contrast and drectonalty (3-D feature vector) Coarseness - coarse vs. fne: t provdes nformaton about the granularty of the pattern Contrast - hgh vs. low contrast: t measures the amount of local changes n brghtness Drectonalty - drectonal vs. non-drectonal: t s a global property of the mage 19 Representng shape Once one has succeeded n extractng an object s contour, the next step s how to represent/encode t A common approach s to navgate the contour, whch leads to an orderng of the pxels n the contour: { (x(t),y(t)) : t = 1,M } A 2nd step s to represent the resultng curve n a parametrc form For nstance, a possblty s to resort to complex values, by settng z(t) = x(t)+ j y(t) Thus, now we have vectors of complex values The problem s that each vector has a dfferent length (.e., M depends on the specfc mage) 20

Representatve ponts The dea s to keep only the D most nterestng ponts Some methods are: Equally-spaced samplng (a) Grd-based samplng (b) Maxmum curvature ponts (c) Fourer-based methods, whch frst compute the DFT of the contour, and then keep only the frst D coeffcents Workng n the frequency doman has several advantages: It can be proved that by properly modfyng Fourer coeffcents one can acheve nvarance to scale, translaton and rotaton Further, by vewng shape as a sgnal, one can adopt dstance measures that have been developed for the comparson of tme seres and that are somewhat nsenstve to sgnals modfcatons (a) (b) (c) 21 Comparng shapes The commonest way to measure the (ds-)smlarty of two shape vectors of equal length D s based on Eucldean dstance (L 2 ) However, wth Eucldean dstance we have to face a basc problem Senstvty to algnment of values Intutvely, we would need a dstance measure that s able to match a pont of tme seres s even wth surroundng ponts of tme seres q Alternatvely, we may vew the tme axs as a stretchable one A dstance lke ths exsts, and s called Dynamc Tme Warpng (DTW) 22

Sample queres based on shape [BCP02] QueryImage R = relevant (same type of fsh) 1100 objects contours 23 Ths s not the whole story of course, many other features models (and correspondent dstance functons) have been defned for MM data Ths was just a way to provde some concrete examples of features and modaltes to comparng them! Note that, besdes generc features, any specfc mage doman/applcaton needs to extract and manage specfc features, whch n general requre much more sophstcated tools than the one we have seen E.g., face/fngerprnts recognton Nonetheless, what s mportant to stress s that the problem of how to search n large mage DB s remans (almost) the same! Let s go now nto the detals of what happens and how thngs can become complex n a real mage retreval system 24

The regon-based mage retreval approach DB populaton tme: Preprocess mages to segment them nto regons Represent regons as vectors of features Query tme: Compare query regons to DB regons Assess smlarty between mages by combnng smlarty between regons DB populaton Segmentaton Queryng Segmentaton GUI Feature Extracton Feature Extracton Features Image Features Vsualze results Query processor Image DB Feature DB Query engne Image DB Feature DB 25 http://www-db.ds.unbo.t/bartoln/publcatons.html Wndsurf case study [ABP99, BCP00, BP00, BC03, Bar09a, BCP+09, BCP10] Wndsurf: Wavelet-Based Indexng of Images Usng Regons Fragmentaton Dscrete Wavelet Transform (DWT): extracts a set of features representng the mage n the color-texture space Clusterng: fragments the mage nto a set of regons usng wavelet coeffcents Smlarty Features: used to compare regons DWT Clusterng Smlarty Features 26

Dscrete Wavelet Transform (DWT) Haar wavelet: smple and quck Each coeffcent s defned by: level DWT (l) frequency sub-band (B) color channels (H, S, V) Image Avg Dff l; B w j = ( l ; B l ; B l ; B ) w0, w1, w { LL, LH, HL HH } B, 2 Avg Dff Avg Avg Avg Dff Dff Avg Dff Dff 27 DWT: practcal example 28

Clusterng (1) K-means algorthm (3rd level and low frequence nfo) Choose k ntal centrods; Assocate each pont to ts nearest centrod; Recompute centrods and repeat prevous step; Stop when soluton does not change. Mahalanobs dstance: δ 3, LL 3, LL 2 3, LL 3, LL T 3; LL 1 3, LL 3, LL ( w, w ) = ( w w ).( C ) ( w w ) j j Correlaton between wavelet coeffcents takes nto account varatons n color,.e. texture j 29 Clusterng (2) Optmal value for k? Mnmzaton of a valdty functon Intra-cluster dstance Clusters sze Inter-cluster dstance Input mage Clusters for k=2 Clusters for k=10 Clusters for k=4 I. Bartoln (Optmal soluton) 30

Smlarty features Regon smlarty wth Bhattacharyya dstance Regons are ellpsods n 37-D feature space (all frequences nfo s used) (3-D centrod + 6-D covarance matrx + 1-D regon sze) Dstance between regons centrods (color nfo) Covarance matrces (texture nfo) 3; B 3; B C R + C R j 3; B 3; B 1 2 C + C d B j R R R µ 1 1 j R 2 3; B 3; B 8 2 2 C 2 R C R j 2 1 1 ( ) B B T R R j = + ( ) B B R, R ln µ µ ( µ ) j 31 Image smlarty Smlarty between mages s a functon of smlartes among matched regons How regons are "matched" can therefore strongly nfluence the result of a query Example: one-to-one match (formulated as Assgnment Problem) 32

Assgnment problem Goal: Fnd the optmal match where unt elements of fxed sze are matched ndvdually mage smlarty regon matchng Implemented wth the Hungaran algorthm, maxmzng a functon that s monotonc n the smlarty scores (e.g. average) r 1 r 2 r 3 r 4 r 5 q 1.52.17.41.16.29 q 2.27.19.81.35.49 q 3 1.0.11.27.24.29 r 1 r 2 r 3 r 4 r 5 q 1.52.17.41.16.29 q 2.27.19.81.35.49 q 3 1.0.11.27.24.29 I. Bartoln (.52+.81+1.0)/3=.77 (.29+.81+1.0)/3=.7 33 Sample query 34

Effectveness comparson example Flowers query Wnsdurf clusters Blobworld [CTB+99] clusters Wndsurf I. Bartoln Blobworld Data Modellng and Multmeda Databases 35 Wndsurf n a specalzed context: handwrtngs To provde an example of the generalty of the Wndsurf framework, here we show a pc of the WrtngSmlartySearch system bult on top of the Wndsurf system Let s nstantate the key ponts of the Wndsurf wthn the new context: Regons correspond to local features (.e., key ponts of SURF) Regon dstance functon s the Eucldean dstance The matchng problem s solved by means of an approxmaton of the 1-1 matchng best bn frst match I. Bartoln Data Modellng and Multmeda Databases 36

WrtngSmlartySearch: an example query Top-k results (k=9) I. Bartoln 37 37 Free exercse 1.D Let s complete our Exercse 1, n ts last part D Startng from the defntons of the low-level features you selected for descrbng the content of unstructured data nvolved n your MM applcatons, provde a concrete representaton/comparson modalty of them wth vsual examples Among features possbltes: global features vs. local features (regon-based approach) E.g., global color dstrbuton for an mage (defnton) vs. color hstograms by usng the weghted Eucldean dstance as smlarty measure (representaton/comparson modalty) In dong the exercse, let s keep n mnd the fnal goal: retreve relevant MM content! 38