Multimedia Information Retrieval

Similar documents
Digital Image Processing

Boundary descriptors. Representation REPRESENTATION & DESCRIPTION. Descriptors. Moore boundary tracking

Practical Image and Video Processing Using MATLAB

Lecture 10: Image Descriptors and Representation

One image is worth 1,000 words

Content Based Image Retrieval

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception

Machine vision. Summary # 6: Shape descriptors

Pop Quiz 1 [10 mins]

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

Lecture 8 Object Descriptors

An Introduction to Content Based Image Retrieval

Multimedia Databases. 2. Summary. 2 Color-based Retrieval. 2.1 Multimedia Data Retrieval. 2.1 Multimedia Data Retrieval 4/14/2016.

Chapter 11 Representation & Description

Content Based Image Retrieval Using Curvelet Transform

Chapter 4 - Image. Digital Libraries and Content Management

Digital Image Processing Chapter 11: Image Description and Representation

VC 11/12 T14 Visual Feature Extraction

CoE4TN4 Image Processing

LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR

The MPEG-7 Description Standard 1

CSI 4107 Image Information Retrieval

Customisation and production of Badges. Getting started with I-Color System Basic Light

Fall 2015 Dr. Michael J. Reale


CS 556: Computer Vision. Lecture 18

Digital Image Processing Fundamentals

Ulrik Söderström 21 Feb Representation and description

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Distributed Algorithms. Image and Video Processing

The Elements of Colour

Building Multi Script OCR for Brahmi Scripts: Selection of Efficient Features

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

Computer Graphics. Bing-Yu Chen National Taiwan University The University of Tokyo

EXAMINATIONS 2016 TRIMESTER 2

CHAPTER 3 COLOR MEASUREMENT USING CHROMATICITY DIAGRAM - SOFTWARE

CS 231A Computer Vision (Winter 2014) Problem Set 3

Digital Image Processing. Introduction

Advances in Metrology for Guide Plate Analysis

A Content Based Image Retrieval System Based on Color Features

Image Processing, Analysis and Machine Vision

Topic 6 Representation and Description

Content Based Video Retrieval

Color, Edge and Texture

Feature description. IE PŁ M. Strzelecki, P. Strumiłło

SHADOW DETECTION AND REMOVAL FROM SATELLITE CAPTURE IMAGES USING SUCCESSIVE THRESHOLDING ALGORITHM

Integrated Querying of Images by Color, Shape, and Texture Content of Salient Objects

MPEG-7 Visual shape descriptors

Game Programming. Bing-Yu Chen National Taiwan University

EXAM SOLUTIONS. Image Processing and Computer Vision Course 2D1421 Monday, 13 th of March 2006,

Object Shape Recognition in Image for Machine Vision Application

CS534 Introduction to Computer Vision Binary Image Analysis. Ahmed Elgammal Dept. of Computer Science Rutgers University

3D graphics, raster and colors CS312 Fall 2010

Image retrieval based on region shape similarity

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

Color. making some recognition problems easy. is 400nm (blue) to 700 nm (red) more; ex. X-rays, infrared, radio waves. n Used heavily in human vision

Chapter 11 Representation & Description

Reading. 2. Color. Emission spectra. The radiant energy spectrum. Watt, Chapter 15.

Lecture 18 Representation and description I. 2. Boundary descriptors

Content based Image Retrieval Using Multichannel Feature Extraction Techniques

Crowd Density Estimation using Image Processing

ENG 7854 / 9804 Industrial Machine Vision. Midterm Exam March 1, 2010.

Evaluation of Texture Features for Content-Based Image Retrieval

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

Welcome Back to Fundamental of Multimedia (MR412) Fall, ZHU Yongxin, Winson

Chapter 8 Getting Started with Draw:

Image representation. 1. Introduction

Extraction of Color and Texture Features of an Image

Visual Perception. Basics

COLOR AND SHAPE BASED IMAGE RETRIEVAL

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Computer Graphics. Bing-Yu Chen National Taiwan University

Short Run length Descriptor for Image Retrieval

Latest development in image feature representation and extraction

Efficient Indexing and Searching Framework for Unstructured Data

The Evolution and Trend of Chain Code Scheme

Anno accademico 2006/2007. Davide Migliore

11. Image Data Analytics. Jacobs University Visualization and Computer Graphics Lab

9 length of contour = no. of horizontal and vertical components + ( 2 no. of diagonal components) diameter of boundary B

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

Image Formation. Ed Angel Professor of Computer Science, Electrical and Computer Engineering, and Media Arts University of New Mexico

Digital Image Processing

Description and search of multimedia data

EECS490: Digital Image Processing. Lecture #17

Multimedia Technology CHAPTER 4. Video and Animation

Object-Based Classification & ecognition. Zutao Ouyang 11/17/2015

Multimedia Information Retrieval

Texture. Texture. 2) Synthesis. Objectives: 1) Discrimination/Analysis

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

Illumination and Shading

Locating 1-D Bar Codes in DCT-Domain

Autoregressive and Random Field Texture Models

Lecture 14 Shape. ch. 9, sec. 1-8, of Machine Vision by Wesley E. Snyder & Hairong Qi. Spring (CMU RI) : BioE 2630 (Pitt)

Chapter 3 Image Registration. Chapter 3 Image Registration

Review of Content based image retrieval

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Searching for the Shape

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Files Used in This Tutorial. Background. Feature Extraction with Example-Based Classification Tutorial

EE 584 MACHINE VISION

Transcription:

Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis

Why content-based? Actually, what is content-based search? Is human thinking content-based? Metadata annotation (text) is good but -

Multimedia Information Retrieval 1. What are multimedia queries? 2. Fingerprinting 3. Metadata & piggy-back retrieval 4. Automated image annotation 5 Visual content-based retrieval I (challenges + features) 6 Visual content-based retrieval II (distances) 7. Evaluation 8. Browsing, search and geography

The semantic gap 1m pixels with a spatial colour distribution faces & vase-like object victory, triumph,... disappointment,...

Polysemy

Scalability

Features and distances o x x x x Feature space

Architecture

Features Visual Colour, texture, shape, edge detection, SIFT/SURF Audio Temporal How to describe the features? For people For computers

Digital Images

Content of an image 145 153 181 165 167 121 173 151 159 149 185 187 201 213 225 173 157 161 253 251 255 141 79 97 245 247 255 93 109 117 245 247 255 97 97 115

Histogram 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 1 2 3 4 1: 2: 3: 4: 5: 6: 7: 8: 5 6 0 32 64 96 128 160 192 224 7 8 31 63 95 127 159 191 223 255

Colour phenomenon of human perception three-dimensional (RGB/CMY/HSB) spectral colour: pure light of one wavelength blue cyan green yellow spectral colours: wavelength (nm) red

Colour

http://blog.xkcd.com/2010/05/03/color-survey-

Colour: subjective http://eagereyes.org/blog/2011/you-only-see-colors-you-can-name

Colour histogram

Exercise a) Sketch a 3D colour histogram for R G B 0 255 0 0 0 255 255 255 0 0 255 0 255 0 255 255 0 0 0 255 255 255 0 255 black red green blue cyan magenta yellow white b) Sketch 2D colour histograms per channel c) Which one is more informative? Why?

Other Colour Spaces HSV, HSL, CIELAB/CIELUV

HSB colour model saturation (0% - 100%) = spectral purity hue (0-360 ) spectral colour brightness (0% - 100%) = energy or luminance chromaticity = hue+saturation

HSB colour model

HSB model disadvantage: hue coordinate is not continuous 0 and 360 degrees have the same meaning but there is a huge difference in terms of numeric distance example: red = (0,100%,50%) = (360,100%,50%) advantage: it is more natural to describe colour changes brighter blue, purer magenta, etc

Texture coarseness [Tamura et al, 1978] contrast directionality

Texture histograms Coarseness contrast Directionality [with Howarth, IEE Vision, Image & Signal Proc 15(6) 2004; Howarth PhD thesis]

Gabor filter Scale Query Orientation [with Howarth, CLEF 2004]

Shape Analysis shape = class of geometric objects invariant under translation scale (changes keeping the aspect ratio) rotations information preserving description (for compression) non-information preserving (for retrieval) boundary based (ignore interior) region based (boundary+interior) Particularly important for drwings, eg, patent drawings

Shape Analysis boundary based perimeter & area corner points circularity chain codes region based (considering interior and holes, ) not covered here

Perimeter and area parameterised curve x(t), y(t) R boundary pixel count vs count pixels in area R

Circularity A=area, P=perimeter T is 1 for a circle T is smaller than 1 for all other shapes circularity is aka compactness R

Convexity ratio of perimeter of convex hull and the original curve 1 for convex shapes, less than 1 otherwise convex hull

Chain codes contour description by a string of directions Freeman chain code f1f2f3... eight possible directions for each pixel translation invariant, not rotation invariant 3 2 1 4 0 5 6 7 = 210077654343 = 100776543432 = 007765434321 choose lowest number for closed contours

Difference chain codes sequence of angles: ai = (fi+1- fi) mod 8 0 0 1 45 2 3 90 135 4 5 6 7 180 225 270 315 rotation invariant = = =... = 070777717777

Sound

Audio Features Spectrogram graph of frequencies/energy/time tempo, pitch, mode See Z Liu, Y Wang and T Chen (1998). Audio feature extraction and analysis for scene segmentation and classification. VLSI Signal Processing 20(1-2), 69-79.

Histograms Feature vectors histograms 145 153 181 165 167 121 173 151 159 149 185 187 201 213 225 173 157 161 253 251 255 141 79 97 245 247 255 93 109 117 245 247 255 97 97 115 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 1 1: 0-31 2: 32-63 3: 64-95 4: 96 127 2 3 5: 6: 7: 8: 4 5 128 160 192 224 6 7 8 159 191 223 255

Central moments Simple statistics Mean Variance (squared standard deviation) 3rd central moment (skewness) where w is image width and h is image height

Moment features

Moment features

Global vs local Global histogram also matches polar bears, marble floors,

Localisation 0.3 0.25 0.2 0.15 0.1 0.05 0 1 2 3 4 5 64% centre 36% border 6 7 8

Tiled Histograms 1 0.9 0.8 0.7 0.4 0.6 0.35 0.7 0.5 0.6 0.4 0.5 0.3 0.25 0.2 0.4 0.3 0.3 0.2 0.1 0.1 0.05 0.2 0.1 0 0.15 0 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 0.6 0.8 0.7 0.4 0.3 0.3 0.2 0.2 1 2 3 4 5 6 7 8 5 6 7 8 0.4 0.3 0.25 0.2 0.15 0.1 0.05 0 0 4 0.35 0.1 0.1 3 0.5 0.4 0.5 2 0.45 0.5 0.6 1 8 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8

Segmentation 0.3 0.25 0.2 0.15 0.1 0.05 0 1 2 3 4 5 6 7 8 foreground background

Points of interest Many PoI, ie, many feature vectors Quantised feature vectors words Bag of word model text retrieval

Implementation speed vs flexibility vs precision Process: 1. best abstracted representation of your media 2. best method for calculating difference/similarity 3. implement efficiently, considering responsiveness and scalability

Content-Based Image Retrieval Which features are best for searching? Depends on the information need: Looking for sunset holiday pictures in your digital shoebox? Use colour histograms. Want to build a wallpaper customer database? Use colour + texture. Want to build a b/w sketch database for technical industrial designs? Use shape descriptors. Not sure which features are best for a query (eg, if you also have abstract features such as Fourier coefficients)? Deploy relevance feedback and let the system learn the relevant features for this query...

Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1. Browsing wallpaper patterns in a home decorator store 2. Finding interesting photos in a personal collection of holiday snaps 3. Managing industrial design pattern templates for a manufacturing company Think about: what types of features you might use what would the query be the user interface