Multimedia Information Retrieval

Size: px
Start display at page:

Download "Multimedia Information Retrieval"

Transcription

1 Multimedia Information Retrieval Norbert Fuhr HS-IR 98

2 Chapter 1 Introduction document structures and attributes media types terminology 1

3 Document structures and attributes IR networks heterogeneity effectivness user friendlyn. content structure head title author chapter section section chapter logical structure layout structure IR in networks J. Doe document author = J. Doe crdate = ladate = external attributes

4 audio Media types t y image text is a linear medium... video x

5 Terminology monomedia object/document: object containing data of a single media multimedia object/document: object containing data of multiple media hypertext document: nonlinear text document (i.e. with links) hypermedia document: nonlinear multimedia document

6 Course structure: 1. introduction 2. views on media 3. multimedia indexing

7 Chapter 2 Views on media views on media objects FERMI multimedia data model 6

8 2.1 Views on media objects here: images physical view pixel matrix logical view perceptive view colour texture brightness symbolic view spatial view: spatial relations (depending on modelling space) structural view set of image objects structural relations between image objects (aggregation)

9

10

11

12 2.2 The FERMI Multimedia Document Model Document structure and IR impact of structure: of multimedia information: heterogeneity of multimedia data on semantic content: logical structure ˆ= discourse structure on corpus classic IR: document = atomic unit MMIR: retrieval of document components

13 2.2.2 Elements of the multimedia data model logical structure hierarchy of structural objects leaves = single-media data implements explicit organization of discourse other data model elements refer to logical structure attributes classical attributes(author,dates,...) index expressions navigational structure links

14 The logical structure logical structure ˆ= hierarchical aggregation of structural objects: LS = (OS; str ; seq ; TYPE ST ; tst ; type st ; TYPE M ; type m ) OS: finite set of document structural objects elements: os i str : aggregative relation between structural objects, defines hierarchical composition seq : defines a linear sequence on OS (corresponds to standard, linear order to access components TYPE ST : set of types of structural objects e.g. for books: TYPE ST = fdocument; Chapter; Section; Sub Section; Paragraph; Figureg types correspond to abstraction levels tst : relation on structural object types defining hierarchy of abstraction levels type st : total function assigning each structural object its structural type in TYPE ST TYPE M : set of media types, TYPE M = ftext;image;graphic;multimediag. type m : total function assigning to each structural object its media type in TYPE M

15 Attributes A = (OS;NAME A ;VALUE A ; name a ;domain a ;value a ;SM) where: OS: the set of structural objects in the document elements: os i NAME A : set of attributes names. VALUE A : set of all possible attribute values (union of all the domain languages of all attributes) name a : partial function associating to structural objects a nonempty set of attribute names domain a : total function defining the domain of any attribute name (i.e. all the expressions of its associated language) value a : partial function assigning to structural objects the value for a related attribute name (definition allows multi-valued attributes)

16 Content Attributes single-media models involve up to five types of views: the physical view the structural view the symbolic view the spatial view (only in image and graphic models) the perceptive view (only in image and graphic models)! standard attribute names (called Content Attributes) for views: physical structural symbolic spatial perceptive

17 Indexing model indexing: assign index expressions to document structural objects retrieval of multimedia documents: retrieve smallest units that fulfill the query! index expressions assigned to parent object have to imply index expressions of its component objects index objects: structural objects that are indexed (assigned a value of attribute symbolic) index model of a document base: OI : set of index objects oi i OI OS I =(OI;TYPE I ; ind ) TYPE I : set of index object types TYPE I TYPE ST ind : relation representing structural dependency between index objects: ind OI OI

18 Example of an indexing structure example of structure and index hierarchy types (index objects of type Chapter or Subsection only) Structure Types Symbolic Types Document Chapter Section Subsection Paragraph

19 parts of the structural and semantic views of a document Structural View Semantic View Document Os1 U1 U2 Chapter Os2 Os3 Osem2 Osem3 Section Subsection Os8 Os4 Os5 Os6 Os7 U4 Os9 U6 Os10 Os11 Os12 Osem8 Osem9 Osem10 Osem11 Osem12 Paragraph Os13 Os14 Os15 Os16 Os17 U3 U5 Os18 Os19 U7

20 Chapter 3 Multimedia Indexing audio images video 19

21 3.1 Audio Sound retrieval E. Wold et al.: Content-based classification, search and retrieval of audio. IEEE Multimedia 3(3), pp Levels of audio retrieval 1. exact match of sound samples 2. inexact match of sounds, irrespective of sample rate, quantization, compresssion, inexact match of acoustic features / perceptual properties of sound 4. content-based match (for speech, musical content) here: inexact match of acoustic features and perceptual properties

22 Acoustic features aspects of sound considered: loudness root-mean-square of audio signal (in decibels) pitch greatest common divisor of peaks in Fourier spectra brightness centroid of short-time Fourier magnitude spectra (higher frequency content of signal) bandwidth magnitude-weighted average of differences between spectral components and the centroid (variation of frequencies, e.g. sine wave vs. white noise) harmonicity deviation of the sound s spectrum from a harmonic spectrum (i.e. harmonic spectra vs. inharmonic spectra vs. noise) variation of aspects over time: 1. compute aspect values at certain time intervals 2. derive features from sequences: average value variance autocorrelation (feature values weighted by amplitude)

23 sound example

24 Property Mean Variance Autocorrelation Loudness Pitch Brightness Bandwidth

25 Indexing and retrieval Indexing of a sound: compute and store feature vector a (mean, variance and autocorrelation for loudness, pitch, brightness, bandwidth and harmonicity) Retrieval: 1. conditions w.r.t. feature values 2. similarity of sounds: weighted Euclidian distance mean: µ= 1 M M a j j=1 covariance R = 1 M M (a j µ)(a j µ) T j=1 distance D = q (a b) T R 1 (a b) M # sounds considered

26 Property-based training and classification training: based on set of training sounds for a property (e.g. scratchiness) compute property-specific mean and covariance importance of feature: mean divided by standard deviation classification compute distances to means of all classes, select class with minimum distance likelihood: D 2 L = exp 2

27 Example: classification of laughter sounds

28 Example: class model for laughter Feature Mean Variance Importance Duration Loudness: Mean Variance Autocorrelation e Brightness: Mean Variance Autocorrelation Bandwidth: Mean Variance e Autocorrelation Pitch: Mean Variance Autocorrelation importance = jmeanj / p variance

29 3.1.2 Speech retrieval 1. speech recognition! uncertain term identification 2. application of text retrieval methods on recognized terms! TREC speech retrieval track Music retrieval McNab etal: The New Zealand Digital Library MELody index. D-Lib Magazine, May melody transcription 2. approximate string matching

30 3.2 Images Introduction Semantic vs. syntactic indexing and retrieval syntactic image features: color texture contour semantic image features: objects (humans, animals, buildings, art works) topics (pollution, demonstration, political visit) most image indexing methods support syntactic features only

31 Aboutness vs. ofness ofness: objects shown in the image aboutness: topic which is illustrated by the image aboutness is very much user-dependent e.g. image showing water pollution

32 3.2.2 QBIC tool for querying image and video databases example images user-constructed sketches and drawings selected color and texture patterns camera and object motion

33 System overview main components: database population: 1. processing of images and videos to extract syntactical features: colors textures shape camera motion object motion 2. storing features in database database querying 1. user composes query graphically 2. generate features from from graphical query 3. search for database objects with similar features

34

35

36 Data model basic elements: still images/scenes contain objects video shots sets of contiguous frames contain motion objects still images: scene: image or video frame object part of a scene videos: 1. break into clips (shots) 2. generate representative frame for each slot, treated as still image 3. generate motion objects from shots

37 querying: on objects images with a red, round object on scenes images with 30 % red and 20 % blue on shots shots panning from left to right on combinations images with 30 % red containing a blue object

38 Feature Calculation color color models: RGB, HSV, YUV, MTM average coordinates in color space k element histogram (typically k = 64;256)

39 texture coarseness: scale of texture contrast: vividness of a pattern (function of variance of grey-level histogram) directionality: peakedness of distribution of gradient directions in image (favoured direction (e.g. grass) vs. isotropic (e.g. sand))

40 shape area # pixels set in binary image circularity perimeter 2 /area major axis orientation 1. compute 2nd order covariance matrix from boundary pixels 2. major axis orientation = direction of largest eigenvector eccentricity = (largest eigenvalue) / (smallest eigenvalue) algebraic moment invariances consider 18 features invariant to affine transformations compute first m central moments as eigenvalues of predefined matrices

41 sketch based on reduced resolution edge map: 1. convert color image to single band luminance 2. compute binary edge image 3. reduce edge image to thin reduced image

42 Sample queries average color queries search for images/objects with similar color computed as weighted Euclidian distance in color space histogram color queries search for images with specified color distribution based on 256-element histogram: Q query histogram D image histogram Z element difference histogram: Z = Q D A symmetric color similarity matrix a(i; j)=1 d(c i ;c j )=d max c k kth color in histogram d(c i ;c j ) MTM color distance d max maximum distance between any two colors similarity: jjrjj = Z T AZ

43

44

45

46 texture queries user selects texture from a sampler compute weighted Euclidian distance in 3D texture space (coarseness, contrast, directionality) object shape user draws shape shape features: area, circularity, eccentricity, majoraxis-direction, object moments, tangent angles around object perimeter compute weighted Euclidian distance, weights are inverse variances of features query by sketch user draws dominant lines and edges 1. reduce user sketch to for each db image, correlate sketch with user sketch, based on edge/no edge comparison 3. compute correlation scores

47

48

49 3.2.3 IRIS semantic indexing of images 1. image analysis color contour texture 2. object recognition (a) basic objects: clouds, snow, water, sky, forest, grass, sand, stone (b) high-level objects: forestscene, skyscene, mountainscene, landscapescene,...

50 Image Analysis Color IRIS subdivides color space into about 20 different colors 1. subdivide image into nonoverlapping tiles 2. compute color histogram for each tile 3. most frequent color =: color of tile 4. join tiles with similar colors and compute circumscribing rectangle 5. compute attributes of color rectangles: position size color color density (# tiles with color / # tiles in rectangle) color evidence

51 original image

52 color-based segmentation:... colour2 HOR=mid,VER=up,SIZ=XL,SHP=Rect,COL=BLUE, UL=0 1,LR=44 11,DEN= colour3 HOR=mid,VER=mid,SIZ=M,SHP=Rect,COL=BLUE, UL=15 10,LR=44 17,DEN= colour4 HOR=left,VER=mid,SIZ=XS,SHP=Quad,COL=BLUE, UL=1 11,LR=1 11,DEN=1 1 colour5 HOR=left,VER=mid,SIZ=XS,SHP=Rect,COL=BLUE, UL=3 11,LR=14 12,DEN=

53 Texture consider local distribution and variation of grey values 1. compute normalized co-occurrence matrix p for 4 directions: 0,90,45, for each of the four directions, compute the following features from C: angular second moment contrast (local variations) correlation (linear relationship between pixel values) variance (deviation from the average) entropy 3. for each of the five parameters, compute the average from the values for the 4 directions (! invariance against rotation)

54 4. feed average values into neural network hidden-layer hidden-layer output-layer input-layer constrast asm variance correlation entropy forest gras sand water stone sky clouds ice 5. NN yields texture for each tile 6. join tiles with identical textures and compute circumscribing rectangles 7. compute attributes of texture rectangles: position size texture texture density (# tiles with texture / # tiles in rectangle)

55 ... texture3 HOR=mid,VER=mid,SIZ=L,SHP=Rect,TEX=ice, UL=2 2,LR=10 3,DEN=11 18 texture4 HOR=left,VER=mid,SIZ=S,SHP=Path,TEX=clouds, UL=0 3,LR=3 3,DEN=4 4 texture5 HOR=left,VER=mid,SIZ=S,SHP=Quad,TEX=stone, UL=4 3,LR=5 4,DEN=3 4 texture6 HOR=mid,VER=mid,SIZ=S,SHP=Rect,TEX=clouds, UL=5 3,LR=8 4,DEN=5 8...

56

57 Contour basedongreylevelimage 1. gradient-based edge detection 2. determination of object contours 3. shape analysis: compute position of centroid size of region bound coordinates of region

58

59 Object Recognition 1. step from syntactical to semantical features: identification of primitive objects 2. derivation of higher-level semantical features identification of primitive objects basis: color, texture and contour features for each feature, consider corresponding region form graph describing topological relationships between feature regions: node = feature edge = topological relationship: overlaps, meets, contains overlaps T contains CL T CT meets CT T CL CL CT T CT

60 formulate graph grammar rules for detecting primitive objects Mountainlake Clouds Texture Segment Sky Lake Mountain Clouds Forest Color Segment Contour Segment Conditions of "Clouds" predicate((valcompeq(*self(2,"colorseg","col"),"blue") valcompeq(*self(2,"colorseg","col"),"white")) && valcompeq(*self(2,"colorseg","ver"),"up")); predicate(nrkind(*self(1,"contourseg"),"contains",*self(1,"colorseg")) && nrkind(*self(1,"contourseg"),"contains",*self(1,"textureseg")));

61 3.2.4 Photobook developed at MIT Media Lab goal: semantic retrieval of images based on semantics-preserving image compression types of descriptions: appearance (faces) shape texture

62 Appearance based on eigenimage representations Training: Building Eigenrepresentations 1. preprocessing of input images: normalize w.r.t. position, scale, orientation 2. computation of eigenvectors of normalized image covariance for training images (faces) subregions of training images (eyes, nose, mouth)

63 mean and first few eigenvectors:

64 Retrieval Γ: new image (region) 1. transform Γ into face space 2. retrieval based on similarity measure

65

66

67 Shape representation based on modelling of physical deformations finite element method! stiffness matrix! eigenvectors Retrieval compute amount of energy needed to align object

68

69

70 Texture representation based on Wold decomposition for regular stochastic processes in 2D ˆ= sum of three orthogonal components: 1. harmonic field 2. generalized-evanescent field 3. purely-indeterministic field retrieval 1. derive parameters of Wold decompositions 2. compute similarity of parameter vectors

71

72

73 3.3 Video QBIC Representation of video data 1. shot detection 2. creation of representative frame 3. identify moving structures/objects Shot detection set of frames grouped into shots because they depict same scene signify single camera operation contain distinct event/action are chosen as single indexable unit

74 representative frame generation representative frames treated as still images in database population in retrieval returned for as answer representing shot representative frame generation methods: random frame from a shot synthesized r-frames mosaicking all frames in a panning shot remove moving objects

75

76 layered representation different layers used for identifying significant objects in the scene algorithm divides a shot into a number of layers, each with its own 2D affine motion parameters region of support in each frame

77

78

79 3.4 Summary: media indexing and matching 1. exact match 2. inexact media match (irrespective of digitization parameters) 3. inexact media feature match 4. content-based match

80 Chapter 4 Conclusions Issues in MMIR syntactic (signal-based) vs. semantic (symbolic) indexing of MM objects dealing with document structure IR models for multimedia documents 79

Introduction. Chapter 1. Multimedia Information Retrieval. SIGIR 98. Norbert Fuhr. images audio. media types: text.

Introduction. Chapter 1. Multimedia Information Retrieval. SIGIR 98. Norbert Fuhr. images audio. media types: text. Multimedia Information Retrieval Norbert Fuhr Tutorial @ SIGIR 98 Chapter 1 Introduction media types: text images audio video 1 terminology: monomedia object/document: object containing data of a single

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Part 9: Representation and Description AASS Learning Systems Lab, Dep. Teknik Room T1209 (Fr, 11-12 o'clock) achim.lilienthal@oru.se Course Book Chapter 11 2011-05-17 Contents

More information

Lecture 8 Object Descriptors

Lecture 8 Object Descriptors Lecture 8 Object Descriptors Azadeh Fakhrzadeh Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University 2 Reading instructions Chapter 11.1 11.4 in G-W Azadeh Fakhrzadeh

More information

Chapter 11 Representation & Description

Chapter 11 Representation & Description Chapter 11 Representation & Description The results of segmentation is a set of regions. Regions have then to be represented and described. Two main ways of representing a region: - external characteristics

More information

Multimedia Database Systems. Retrieval by Content

Multimedia Database Systems. Retrieval by Content Multimedia Database Systems Retrieval by Content MIR Motivation Large volumes of data world-wide are not only based on text: Satellite images (oil spill), deep space images (NASA) Medical images (X-rays,

More information

CoE4TN4 Image Processing

CoE4TN4 Image Processing CoE4TN4 Image Processing Chapter 11 Image Representation & Description Image Representation & Description After an image is segmented into regions, the regions are represented and described in a form suitable

More information

Boundary descriptors. Representation REPRESENTATION & DESCRIPTION. Descriptors. Moore boundary tracking

Boundary descriptors. Representation REPRESENTATION & DESCRIPTION. Descriptors. Moore boundary tracking Representation REPRESENTATION & DESCRIPTION After image segmentation the resulting collection of regions is usually represented and described in a form suitable for higher level processing. Most important

More information

Multimedia Information Retrieval

Multimedia Information Retrieval Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis Why content-based? Actually, what is content-based

More information

Content Based Image Retrieval

Content Based Image Retrieval Content Based Image Retrieval R. Venkatesh Babu Outline What is CBIR Approaches Features for content based image retrieval Global Local Hybrid Similarity measure Trtaditional Image Retrieval Traditional

More information

Region-based Segmentation

Region-based Segmentation Region-based Segmentation Image Segmentation Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation. Applications: Finding tumors, veins, etc.

More information

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang NICTA & CSE UNSW COMP9314 Advanced Database S1 2007 jzhang@cse.unsw.edu.au Reference Papers and Resources Papers: Colour spaces-perceptual, historical

More information

ECE 176 Digital Image Processing Handout #14 Pamela Cosman 4/29/05 TEXTURE ANALYSIS

ECE 176 Digital Image Processing Handout #14 Pamela Cosman 4/29/05 TEXTURE ANALYSIS ECE 176 Digital Image Processing Handout #14 Pamela Cosman 4/29/ TEXTURE ANALYSIS Texture analysis is covered very briefly in Gonzalez and Woods, pages 66 671. This handout is intended to supplement that

More information

An Introduction to Content Based Image Retrieval

An Introduction to Content Based Image Retrieval CHAPTER -1 An Introduction to Content Based Image Retrieval 1.1 Introduction With the advancement in internet and multimedia technologies, a huge amount of multimedia data in the form of audio, video and

More information

Chapter 11 Representation & Description

Chapter 11 Representation & Description Chain Codes Chain codes are used to represent a boundary by a connected sequence of straight-line segments of specified length and direction. The direction of each segment is coded by using a numbering

More information

Wavelet Applications. Texture analysis&synthesis. Gloria Menegaz 1

Wavelet Applications. Texture analysis&synthesis. Gloria Menegaz 1 Wavelet Applications Texture analysis&synthesis Gloria Menegaz 1 Wavelet based IP Compression and Coding The good approximation properties of wavelets allow to represent reasonably smooth signals with

More information

Video search requires efficient annotation of video content To some extent this can be done automatically

Video search requires efficient annotation of video content To some extent this can be done automatically VIDEO ANNOTATION Market Trends Broadband doubling over next 3-5 years Video enabled devices are emerging rapidly Emergence of mass internet audience Mainstream media moving to the Web What do we search

More information

Content-based Image Retrieval (CBIR)

Content-based Image Retrieval (CBIR) Content-based Image Retrieval (CBIR) Content-based Image Retrieval (CBIR) Searching a large database for images that match a query: What kinds of databases? What kinds of queries? What constitutes a match?

More information

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++

Dietrich Paulus Joachim Hornegger. Pattern Recognition of Images and Speech in C++ Dietrich Paulus Joachim Hornegger Pattern Recognition of Images and Speech in C++ To Dorothea, Belinda, and Dominik In the text we use the following names which are protected, trademarks owned by a company

More information

Large-Scale 3D Point Cloud Processing Tutorial 2013

Large-Scale 3D Point Cloud Processing Tutorial 2013 Large-Scale 3D Point Cloud Processing Tutorial 2013 Features The image depicts how our robot Irma3D sees itself in a mirror. The laser looking into itself creates distortions as well as changes in Prof.

More information

A SYNTAX FOR IMAGE UNDERSTANDING

A SYNTAX FOR IMAGE UNDERSTANDING A SYNTAX FOR IMAGE UNDERSTANDING Narendra Ahuja University of Illinois at Urbana-Champaign May 21, 2009 Work Done with. Sinisa Todorovic, Mark Tabb, Himanshu Arora, Varsha. Hedau, Bernard Ghanem, Tim Cheng.

More information

Content-Based Image Retrieval Readings: Chapter 8:

Content-Based Image Retrieval Readings: Chapter 8: Content-Based Image Retrieval Readings: Chapter 8: 8.1-8.4 Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition 1 Content-based Image Retrieval (CBIR)

More information

Chapter 3. Indexing. basic steps: 1. feature identification 2. feature weighting

Chapter 3. Indexing. basic steps: 1. feature identification 2. feature weighting Chapter 3 Indexing basic steps: 1. feature identification 2. feature weighting 1 3.1 Text 3.1.1 Feature identification basic method: feature = single word text: Experiments with Indexing Methods. The analysis

More information

9 length of contour = no. of horizontal and vertical components + ( 2 no. of diagonal components) diameter of boundary B

9 length of contour = no. of horizontal and vertical components + ( 2 no. of diagonal components) diameter of boundary B 8. Boundary Descriptor 8.. Some Simple Descriptors length of contour : simplest descriptor - chain-coded curve 9 length of contour no. of horiontal and vertical components ( no. of diagonal components

More information

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval Lesson 11 Media Retrieval Information Retrieval Image Retrieval Video Retrieval Audio Retrieval Information Retrieval Retrieval = Query + Search Informational Retrieval: Get required information from database/web

More information

Chapter 3 Image Registration. Chapter 3 Image Registration

Chapter 3 Image Registration. Chapter 3 Image Registration Chapter 3 Image Registration Distributed Algorithms for Introduction (1) Definition: Image Registration Input: 2 images of the same scene but taken from different perspectives Goal: Identify transformation

More information

Chapter 4 - Image. Digital Libraries and Content Management

Chapter 4 - Image. Digital Libraries and Content Management Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 4 - Image Vector Graphics Raw data: set (!) of lines and polygons

More information

Lecture 10: Image Descriptors and Representation

Lecture 10: Image Descriptors and Representation I2200: Digital Image processing Lecture 10: Image Descriptors and Representation Prof. YingLi Tian Nov. 15, 2017 Department of Electrical Engineering The City College of New York The City University of

More information

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it?

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it? INF 43 Digital Image Analysis TEXTURE Plan for today Why texture, and what is it? Statistical descriptors First order Second order Gray level co-occurrence matrices Fritz Albregtsen 8.9.21 Higher order

More information

CHAPTER 3 FEATURE EXTRACTION

CHAPTER 3 FEATURE EXTRACTION 37 CHAPTER 3 FEATURE EXTRACTION 3.1. INTRODUCTION This chapter presents feature representation of information of a frame in a video. The feature representation for image objects, the feature representation

More information

A Content Based Image Retrieval System Based on Color Features

A Content Based Image Retrieval System Based on Color Features A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris

More information

Image Processing, Analysis and Machine Vision

Image Processing, Analysis and Machine Vision Image Processing, Analysis and Machine Vision Milan Sonka PhD University of Iowa Iowa City, USA Vaclav Hlavac PhD Czech Technical University Prague, Czech Republic and Roger Boyle DPhil, MBCS, CEng University

More information

Texture. Texture is a description of the spatial arrangement of color or intensities in an image or a selected region of an image.

Texture. Texture is a description of the spatial arrangement of color or intensities in an image or a selected region of an image. Texture Texture is a description of the spatial arrangement of color or intensities in an image or a selected region of an image. Structural approach: a set of texels in some regular or repeated pattern

More information

Practical Image and Video Processing Using MATLAB

Practical Image and Video Processing Using MATLAB Practical Image and Video Processing Using MATLAB Chapter 18 Feature extraction and representation What will we learn? What is feature extraction and why is it a critical step in most computer vision and

More information

Facial Expression Detection Using Implemented (PCA) Algorithm

Facial Expression Detection Using Implemented (PCA) Algorithm Facial Expression Detection Using Implemented (PCA) Algorithm Dileep Gautam (M.Tech Cse) Iftm University Moradabad Up India Abstract: Facial expression plays very important role in the communication with

More information

ECG782: Multidimensional Digital Signal Processing

ECG782: Multidimensional Digital Signal Processing Professor Brendan Morris, SEB 3216, brendan.morris@unlv.edu ECG782: Multidimensional Digital Signal Processing Spring 2014 TTh 14:30-15:45 CBC C313 Lecture 06 Image Structures 13/02/06 http://www.ee.unlv.edu/~b1morris/ecg782/

More information

Spectral Classification

Spectral Classification Spectral Classification Spectral Classification Supervised versus Unsupervised Classification n Unsupervised Classes are determined by the computer. Also referred to as clustering n Supervised Classes

More information

COMPUTER AND ROBOT VISION

COMPUTER AND ROBOT VISION VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington A^ ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California

More information

Schedule for Rest of Semester

Schedule for Rest of Semester Schedule for Rest of Semester Date Lecture Topic 11/20 24 Texture 11/27 25 Review of Statistics & Linear Algebra, Eigenvectors 11/29 26 Eigenvector expansions, Pattern Recognition 12/4 27 Cameras & calibration

More information

Contend Based Multimedia Retrieval

Contend Based Multimedia Retrieval Contend Based Multimedia Retrieval CBIR Query Types Semantic Gap Features Segmentation High dimension IBMS QBIC GIFT, MRML Blobworld CLUE SIMPLIcity CBMR Multimedia Automatic Video Analysis 1 CBIR Contend

More information

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 6, Ver. VI (Nov Dec. 2014), PP 29-33 Analysis of Image and Video Using Color, Texture and Shape Features

More information

Digital Image Processing Chapter 11: Image Description and Representation

Digital Image Processing Chapter 11: Image Description and Representation Digital Image Processing Chapter 11: Image Description and Representation Image Representation and Description? Objective: To represent and describe information embedded in an image in other forms that

More information

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction Stefan Müller, Gerhard Rigoll, Andreas Kosmala and Denis Mazurenok Department of Computer Science, Faculty of

More information

Content-Based Image Retrieval Readings: Chapter 8:

Content-Based Image Retrieval Readings: Chapter 8: Content-Based Image Retrieval Readings: Chapter 8: 8.1-8.4 Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition 1 Content-based Image Retrieval (CBIR)

More information

Content-Based Image Retrieval. Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition

Content-Based Image Retrieval. Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition Content-Based Image Retrieval Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition 1 Content-based Image Retrieval (CBIR) Searching a large database for

More information

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards

Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards Workshop W14 - Audio Gets Smart: Semantic Audio Analysis & Metadata Standards Jürgen Herre for Integrated Circuits (FhG-IIS) Erlangen, Germany Jürgen Herre, hrr@iis.fhg.de Page 1 Overview Extracting meaning

More information

CS 664 Segmentation. Daniel Huttenlocher

CS 664 Segmentation. Daniel Huttenlocher CS 664 Segmentation Daniel Huttenlocher Grouping Perceptual Organization Structural relationships between tokens Parallelism, symmetry, alignment Similarity of token properties Often strong psychophysical

More information

Image and Multidimensional Signal Processing

Image and Multidimensional Signal Processing Image and Multidimensional Signal Processing Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ Representation and Description 2 Representation and

More information

Introduction to digital image classification

Introduction to digital image classification Introduction to digital image classification Dr. Norman Kerle, Wan Bakx MSc a.o. INTERNATIONAL INSTITUTE FOR GEO-INFORMATION SCIENCE AND EARTH OBSERVATION Purpose of lecture Main lecture topics Review

More information

CSI 4107 Image Information Retrieval

CSI 4107 Image Information Retrieval CSI 4107 Image Information Retrieval This slides are inspired by a tutorial on Medical Image Retrieval by Henning Müller and Thomas Deselaers, 2005-2006 1 Outline Introduction Content-based image retrieval

More information

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences UNIVERSITY OF OSLO Faculty of Mathematics and Natural Sciences Exam: INF 4300 / INF 9305 Digital image analysis Date: Thursday December 21, 2017 Exam hours: 09.00-13.00 (4 hours) Number of pages: 8 pages

More information

Multiple-Choice Questionnaire Group C

Multiple-Choice Questionnaire Group C Family name: Vision and Machine-Learning Given name: 1/28/2011 Multiple-Choice naire Group C No documents authorized. There can be several right answers to a question. Marking-scheme: 2 points if all right

More information

CHAPTER 8 Multimedia Information Retrieval

CHAPTER 8 Multimedia Information Retrieval CHAPTER 8 Multimedia Information Retrieval Introduction Text has been the predominant medium for the communication of information. With the availability of better computing capabilities such as availability

More information

CIE L*a*b* color model

CIE L*a*b* color model CIE L*a*b* color model To further strengthen the correlation between the color model and human perception, we apply the following non-linear transformation: with where (X n,y n,z n ) are the tristimulus

More information

Image representation. 1. Introduction

Image representation. 1. Introduction Image representation Introduction Representation schemes Chain codes Polygonal approximations The skeleton of a region Boundary descriptors Some simple descriptors Shape numbers Fourier descriptors Moments

More information

5. Feature Extraction from Images

5. Feature Extraction from Images 5. Feature Extraction from Images Aim of this Chapter: Learn the Basic Feature Extraction Methods for Images Main features: Color Texture Edges Wie funktioniert ein Mustererkennungssystem Test Data x i

More information

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception Color and Shading Color Shapiro and Stockman, Chapter 6 Color is an important factor for for human perception for object and material identification, even time of day. Color perception depends upon both

More information

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology

Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Course Presentation Multimedia Systems Image III (Image Compression, JPEG) Mahdi Amiri April 2011 Sharif University of Technology Image Compression Basics Large amount of data in digital images File size

More information

One image is worth 1,000 words

One image is worth 1,000 words Image Databases Prof. Paolo Ciaccia http://www-db. db.deis.unibo.it/courses/si-ls/ 07_ImageDBs.pdf Sistemi Informativi LS One image is worth 1,000 words Undoubtedly, images are the most wide-spread MM

More information

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker Image Segmentation Srikumar Ramalingam School of Computing University of Utah Slides borrowed from Ross Whitaker Segmentation Semantic Segmentation Indoor layout estimation What is Segmentation? Partitioning

More information

Statistical image models

Statistical image models Chapter 4 Statistical image models 4. Introduction 4.. Visual worlds Figure 4. shows images that belong to different visual worlds. The first world (fig. 4..a) is the world of white noise. It is the world

More information

CS4733 Class Notes, Computer Vision

CS4733 Class Notes, Computer Vision CS4733 Class Notes, Computer Vision Sources for online computer vision tutorials and demos - http://www.dai.ed.ac.uk/hipr and Computer Vision resources online - http://www.dai.ed.ac.uk/cvonline Vision

More information

Multimedia Technology CHAPTER 4. Video and Animation

Multimedia Technology CHAPTER 4. Video and Animation CHAPTER 4 Video and Animation - Both video and animation give us a sense of motion. They exploit some properties of human eye s ability of viewing pictures. - Motion video is the element of multimedia

More information

Texture Segmentation

Texture Segmentation Texture Segmentation Introduction to Signal and Image Processing Prof. Dr. Philippe Cattin MIAC, University of Basel 1 of 48 22.02.2016 09:20 Contents Contents Abstract 2 1 Introduction What is Texture?

More information

- Low-level image processing Image enhancement, restoration, transformation

- Low-level image processing Image enhancement, restoration, transformation () Representation and Description - Low-level image processing enhancement, restoration, transformation Enhancement Enhanced Restoration/ Transformation Restored/ Transformed - Mid-level image processing

More information

Extraction of Color and Texture Features of an Image

Extraction of Color and Texture Features of an Image International Journal of Engineering Research ISSN: 2348-4039 & Management Technology July-2015 Volume 2, Issue-4 Email: editor@ijermt.org www.ijermt.org Extraction of Color and Texture Features of an

More information

ELEC Dr Reji Mathew Electrical Engineering UNSW

ELEC Dr Reji Mathew Electrical Engineering UNSW ELEC 4622 Dr Reji Mathew Electrical Engineering UNSW Review of Motion Modelling and Estimation Introduction to Motion Modelling & Estimation Forward Motion Backward Motion Block Motion Estimation Motion

More information

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class: family of patterns

More information

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37 Extended Contents List Preface... xi About the authors... xvii CHAPTER 1 Introduction 1 1.1 Overview... 1 1.2 Human and Computer Vision... 2 1.3 The Human Vision System... 4 1.3.1 The Eye... 5 1.3.2 The

More information

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT 2.1 BRIEF OUTLINE The classification of digital imagery is to extract useful thematic information which is one

More information

THREE-DIMENSIONA L ELECTRON MICROSCOP Y OF MACROMOLECULAR ASSEMBLIE S. Visualization of Biological Molecules in Their Native Stat e.

THREE-DIMENSIONA L ELECTRON MICROSCOP Y OF MACROMOLECULAR ASSEMBLIE S. Visualization of Biological Molecules in Their Native Stat e. THREE-DIMENSIONA L ELECTRON MICROSCOP Y OF MACROMOLECULAR ASSEMBLIE S Visualization of Biological Molecules in Their Native Stat e Joachim Frank CHAPTER 1 Introduction 1 1 The Electron Microscope and

More information

Analysis of Planar Anisotropy of Fibre Systems by Using 2D Fourier Transform

Analysis of Planar Anisotropy of Fibre Systems by Using 2D Fourier Transform Maroš Tunák, Aleš Linka Technical University in Liberec Faculty of Textile Engineering Department of Textile Materials Studentská 2, 461 17 Liberec 1, Czech Republic E-mail: maros.tunak@tul.cz ales.linka@tul.cz

More information

Digital Image Processing

Digital Image Processing Digital Image Processing Third Edition Rafael C. Gonzalez University of Tennessee Richard E. Woods MedData Interactive PEARSON Prentice Hall Pearson Education International Contents Preface xv Acknowledgments

More information

EE 701 ROBOT VISION. Segmentation

EE 701 ROBOT VISION. Segmentation EE 701 ROBOT VISION Regions and Image Segmentation Histogram-based Segmentation Automatic Thresholding K-means Clustering Spatial Coherence Merging and Splitting Graph Theoretic Segmentation Region Growing

More information

LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR

LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR RGB COLOR HISTOGRAM HSV COLOR MOMENTS hsv_image = rgb2hsv(rgb_image) converts the RGB image to the equivalent HSV image. RGB is an m-by-n-by-3 image array

More information

2D/3D Geometric Transformations and Scene Graphs

2D/3D Geometric Transformations and Scene Graphs 2D/3D Geometric Transformations and Scene Graphs Week 4 Acknowledgement: The course slides are adapted from the slides prepared by Steve Marschner of Cornell University 1 A little quick math background

More information

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques Doaa M. Alebiary Department of computer Science, Faculty of computers and informatics Benha University

More information

CHAPTER 4 TEXTURE FEATURE EXTRACTION

CHAPTER 4 TEXTURE FEATURE EXTRACTION 83 CHAPTER 4 TEXTURE FEATURE EXTRACTION This chapter deals with various feature extraction technique based on spatial, transform, edge and boundary, color, shape and texture features. A brief introduction

More information

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University CS443: Digital Imaging and Multimedia Binary Image Analysis Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines A Simple Machine Vision System Image segmentation by thresholding

More information

Topic 6 Representation and Description

Topic 6 Representation and Description Topic 6 Representation and Description Background Segmentation divides the image into regions Each region should be represented and described in a form suitable for further processing/decision-making Representation

More information

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class: family of patterns

More information

Shape Classification and Cell Movement in 3D Matrix Tutorial (Part I)

Shape Classification and Cell Movement in 3D Matrix Tutorial (Part I) Shape Classification and Cell Movement in 3D Matrix Tutorial (Part I) Fred Park UCI icamp 2011 Outline 1. Motivation and Shape Definition 2. Shape Descriptors 3. Classification 4. Applications: Shape Matching,

More information

Machine learning Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Machine learning Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures Machine learning Pattern recognition Classification/Clustering GW Chapter 12 (some concepts) Textures Patterns and pattern classes Pattern: arrangement of descriptors Descriptors: features Patten class:

More information

Fundamentals of Digital Image Processing

Fundamentals of Digital Image Processing \L\.6 Gw.i Fundamentals of Digital Image Processing A Practical Approach with Examples in Matlab Chris Solomon School of Physical Sciences, University of Kent, Canterbury, UK Toby Breckon School of Engineering,

More information

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy BSB663 Image Processing Pinar Duygulu Slides are adapted from Selim Aksoy Image matching Image matching is a fundamental aspect of many problems in computer vision. Object or scene recognition Solving

More information

CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR)

CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR) 63 CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR) 4.1 INTRODUCTION The Semantic Region Based Image Retrieval (SRBIR) system automatically segments the dominant foreground region and retrieves

More information

The MPEG-7 Description Standard 1

The MPEG-7 Description Standard 1 The MPEG-7 Description Standard 1 Nina Jaunsen Dept of Information and Media Science University of Bergen, Norway September 2004 The increasing use of multimedia in the general society and the need for

More information

Tutorial 5. Jun Xu, Teaching Asistant March 2, COMP4134 Biometrics Authentication

Tutorial 5. Jun Xu, Teaching Asistant March 2, COMP4134 Biometrics Authentication Tutorial 5 Jun Xu, Teaching Asistant nankaimathxujun@gmail.com COMP4134 Biometrics Authentication March 2, 2017 Table of Contents Problems Problem 1: Answer The Questions Problem 2: Indeterminate Region

More information

EE795: Computer Vision and Intelligent Systems

EE795: Computer Vision and Intelligent Systems EE795: Computer Vision and Intelligent Systems Spring 2012 TTh 17:30-18:45 WRI C225 Lecture 04 130131 http://www.ee.unlv.edu/~b1morris/ecg795/ 2 Outline Review Histogram Equalization Image Filtering Linear

More information

Searching Video Collections:Part I

Searching Video Collections:Part I Searching Video Collections:Part I Introduction to Multimedia Information Retrieval Multimedia Representation Visual Features (Still Images and Image Sequences) Color Texture Shape Edges Objects, Motion

More information

Spectral modeling of musical sounds

Spectral modeling of musical sounds Spectral modeling of musical sounds Xavier Serra Audiovisual Institute, Pompeu Fabra University http://www.iua.upf.es xserra@iua.upf.es 1. Introduction Spectral based analysis/synthesis techniques offer

More information

Texture Analysis. Selim Aksoy Department of Computer Engineering Bilkent University

Texture Analysis. Selim Aksoy Department of Computer Engineering Bilkent University Texture Analysis Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Texture An important approach to image description is to quantify its texture content. Texture

More information

IN5520 Digital Image Analysis. Two old exams. Practical information for any written exam Exam 4300/9305, Fritz Albregtsen

IN5520 Digital Image Analysis. Two old exams. Practical information for any written exam Exam 4300/9305, Fritz Albregtsen IN5520 Digital Image Analysis Two old exams Practical information for any written exam Exam 4300/9305, 2016 Exam 4300/9305, 2017 Fritz Albregtsen 27.11.2018 F13 27.11.18 IN 5520 1 Practical information

More information

Processing of binary images

Processing of binary images Binary Image Processing Tuesday, 14/02/2017 ntonis rgyros e-mail: argyros@csd.uoc.gr 1 Today From gray level to binary images Processing of binary images Mathematical morphology 2 Computer Vision, Spring

More information

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING

IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING SECOND EDITION IMAGE ANALYSIS, CLASSIFICATION, and CHANGE DETECTION in REMOTE SENSING ith Algorithms for ENVI/IDL Morton J. Canty с*' Q\ CRC Press Taylor &. Francis Group Boca Raton London New York CRC

More information

2. LITERATURE REVIEW

2. LITERATURE REVIEW 2. LITERATURE REVIEW CBIR has come long way before 1990 and very little papers have been published at that time, however the number of papers published since 1997 is increasing. There are many CBIR algorithms

More information

Local Feature Detectors

Local Feature Detectors Local Feature Detectors Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr Slides adapted from Cordelia Schmid and David Lowe, CVPR 2003 Tutorial, Matthew Brown,

More information

Object Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision

Object Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision Object Recognition Using Pictorial Structures Daniel Huttenlocher Computer Science Department Joint work with Pedro Felzenszwalb, MIT AI Lab In This Talk Object recognition in computer vision Brief definition

More information

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig

Multimedia Databases. Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig Multimedia Databases Wolf-Tilo Balke Younès Ghammad Institut für Informationssysteme Technische Universität Braunschweig http://www.ifis.cs.tu-bs.de Previous Lecture Audio Retrieval - Query by Humming

More information

Video Mosaics for Virtual Environments, R. Szeliski. Review by: Christopher Rasmussen

Video Mosaics for Virtual Environments, R. Szeliski. Review by: Christopher Rasmussen Video Mosaics for Virtual Environments, R. Szeliski Review by: Christopher Rasmussen September 19, 2002 Announcements Homework due by midnight Next homework will be assigned Tuesday, due following Tuesday.

More information

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich.

Perception. Autonomous Mobile Robots. Sensors Vision Uncertainties, Line extraction from laser scans. Autonomous Systems Lab. Zürich. Autonomous Mobile Robots Localization "Position" Global Map Cognition Environment Model Local Map Path Perception Real World Environment Motion Control Perception Sensors Vision Uncertainties, Line extraction

More information