VC 11/12 T14 Visual Feature Extraction

Similar documents
VC 12/13 T16 Video Compression

VC 10/11 T4 Colour and Noise

VC 17/18 TP14 Pattern Recognition

MPEG-7 Visual shape descriptors

VC 10/11 T9 Region-Based Segmentation

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Practical Image and Video Processing Using MATLAB

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors

5. Feature Extraction from Images

VC 16/17 TP5 Single Pixel Manipulation

The MPEG-7 Description Standard 1

Content-based Image Retrieval (CBIR)

Multimedia Information Retrieval

CHAPTER 4 SEMANTIC REGION-BASED IMAGE RETRIEVAL (SRBIR)

Adaptive Learning of an Accurate Skin-Color Model

VC 11/12 T11 Optical Flow

Content-Based Image Retrieval of Web Surface Defects with PicSOM

Content Based Image Retrieval

An Introduction to Content Based Image Retrieval

Autoregressive and Random Field Texture Models

Boundary descriptors. Representation REPRESENTATION & DESCRIPTION. Descriptors. Moore boundary tracking

Latest development in image feature representation and extraction

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception

Region-based Segmentation

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

Content-Based Image Retrieval Readings: Chapter 8:

IT Digital Image ProcessingVII Semester - Question Bank

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

Lecture 8: Multimedia Information Retrieval (I)

Content based Image Retrieval Using Multichannel Feature Extraction Techniques

AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES

Filtering Images. Contents

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

COLOR AND SHAPE BASED IMAGE RETRIEVAL

Part 3: Image Processing

CHAPTER 4 TEXTURE FEATURE EXTRACTION

International Journal of Computer Engineering and Applications, Volume XII, Issue XII, Dec. 18, ISSN

Efficient Content Based Image Retrieval System with Metadata Processing

MPEG-7 Framework. Compression Coding MPEG-1,-2,-4. Management Filtering. Transmission Retrieval Streaming. Acquisition Authoring Editing

Chapter 4. The Classification of Species and Colors of Finished Wooden Parts Using RBFNs

Digital Image Processing

Extraction of Color and Texture Features of an Image

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Dominant colour extraction in DCT domain

EE795: Computer Vision and Intelligent Systems

Machine learning Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

Content-Based Image Retrieval Readings: Chapter 8:

DIGITAL IMAGE ANALYSIS. Image Classification: Object-based Classification

Shape Context Matching For Efficient OCR

JNTUWORLD. 4. Prove that the average value of laplacian of the equation 2 h = ((r2 σ 2 )/σ 4 ))exp( r 2 /2σ 2 ) is zero. [16]

LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

11. Image Data Analytics. Jacobs University Visualization and Computer Graphics Lab

MR IMAGE SEGMENTATION

Distributed Algorithms. Image and Video Processing

A Study on Feature Extraction Techniques in Image Processing

A Survey on Feature Extraction Techniques for Shape based Object Recognition

Lecture 4: Spatial Domain Transformations

Content-Based Image Retrieval. Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition

Topic 4 Image Segmentation

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Babu Madhav Institute of Information Technology Years Integrated M.Sc.(IT)(Semester - 7)

Image Segmentation and Registration

UNIVERSITY OF OSLO. Faculty of Mathematics and Natural Sciences

Digital Image Processing (EI424)

TEXTURE. Plan for today. Segmentation problems. What is segmentation? INF 4300 Digital Image Analysis. Why texture, and what is it?

CSI 4107 Image Information Retrieval

Digital Image Processing Chapter 11: Image Description and Representation

Automatic Colorization of Grayscale Images

One image is worth 1,000 words

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

CHAPTER 3 FEATURE EXTRACTION

Feature extraction. Bi-Histogram Binarization Entropy. What is texture Texture primitives. Filter banks 2D Fourier Transform Wavlet maxima points

COMPARISION OF NORMAL Vs HERNIATED CERVICAL IMAGES USING GRAY LEVEL TEXTURE FEATURES

SuRVoS Workbench. Super-Region Volume Segmentation. Imanol Luengo

Automatic annotation of digital photos

CIRCULAR MOIRÉ PATTERNS IN 3D COMPUTER VISION APPLICATIONS

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Computer vision is the analysis of digital images by a computer for such applications as:

EE 701 ROBOT VISION. Segmentation

Lecture 7: Most Common Edge Detectors

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

ECE 176 Digital Image Processing Handout #14 Pamela Cosman 4/29/05 TEXTURE ANALYSIS

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

2. LITERATURE REVIEW

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

TEXTURE CLASSIFICATION BASED ON GABOR WAVELETS

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

Lecture 6: Edge Detection

Automated Classification of Quilt Photographs Into Crazy and Noncrazy

Extracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang

3. (a) Prove any four properties of 2D Fourier Transform. (b) Determine the kernel coefficients of 2D Hadamard transforms for N=8.

IMAGE SEGMENTATION. Václav Hlaváč

DEFECT IMAGE CLASSIFICATION WITH MPEG-7 DESCRIPTORS. Antti Ilvesmäki and Jukka Iivarinen

Transcription:

VC 11/12 T14 Visual Feature Extraction Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos Miguel Tavares Coimbra

Outline Feature Vectors Colour Texture Shape

Topic: Feature Vectors Feature Vectors Colour Texture Shape

The earth is blue, white and brown Visual Features The earth is round The south pole has a smooth texture

Visual Features Features Measure specific characteristics. Numerical values. May have multiple values. Visual Features Quantify visual characteristics of an image. Popular features. Colour, Texture, Shape

Feature vector Feature F i F i f i Feature F i with N values. i F f f,...,, i1 i2 Feature vector F with M features. F F F... 1 2 f in F M Naming conventions for this module: Elements of a feature vector are called coefficients. Features may have one or more coefficients. Feature vectors may have one or more features.

Low-level visual features Objective Directly reflect specific visual features. Colour Texture Shape Etc.

Features & Decisions Low-Level Features Various Possible Starting point of most Solutions classification tasks! Decision One Solution Middle-Level Features How do I decide? Decision High-Level Features

How to quantify visual features? Many possibilities! We need a standard. MPEG-7 Standard Developed by the Moving Pictures Expert Group. is a standard for describing the multimedia content data that supports some degree of interpretation of the information meaning, which can be passed onto, or accessed by, a device or a computer code [MPEG-7 Overview (version 10), ISO/IEC JTC1/SC29/WG11N6828]

The MPEG-7 standard Provides a rich set of standardized tools to describe multimedia content. Computer annotation. Human annotation. Audiovisual Description Tools Descriptors Descriptor Schemes Target functionality: Feature Vectors Efficient search, filtering and browsing of multimedia content.

MPEG-7 Links MPEG website http://www.chiariglione.org/mpeg MPEG-7 Industry Forum website http://www.mpegif.com MPEG-7 Consortium website http://mpeg7.nist.gov MPEG-7 Overview (version 10) http://www.chiariglione.org/mpeg/standard s/mpeg-7/mpeg-7.htm#e9e3

Topic: Colour Feature Vectors Colour Texture Shape

Gray-Level Histogram Intensity distribution (HSI). We can define the number of histogram bins. Histogram bins = Feature coefficients. F [ f 0,..., f255]

Colour Histogram We typically have three histograms Ex: RGB Colour space - Red Histogram - Green Histogram - Blue Histogram How do we build a feature vector? Concatenate vectors. Multi-dimensional quantization of colour space. Red Green Blue

RGB Histogram Simply concatenate vectors. Not very smart. (why?) F F F R G B [ f [ f [ f R0 G0 B0,...,,...,,..., f f f R255 B255 ] G255 ] ] F F F RGB R G F B

Combined Histogram Quantize multi-dimensional colour space. RGB Each coefficient is a small cube inside the RGB cube. F f,..., f [ 0 N ]

HSI Histogram Quantize HSI space. Define number of bins N. Feature vector F HSI [ f 0,..., f N Typically better for object description ]

MPEG-7 Dominant Colour Clusters colors into a small number of representative colors (salient colors) F = { {c i, p i, v i }, s} c i : Representative colors p i : Their percentages in the region v i : Color variances s : Spatial coherency

MPEG-7 Scalable Colour HSI Histogram Typical quantization: 256 bins. 16 levels in H 4 levels in S 4 levels in I F SC [ f 0,..., f255] Very popular for CBIR (Content-Based Image Retrieval).

[Sikora 2001]

MPEG-7 Colour Layout Clusters the image into 64 (8x8) blocks Derives the average color of each block (or using DCD) Applies (8x8)DCT and encoding Efficient for Sketch-based image retrieval Content Filtering using image indexing

MPEG-7 Colour Structure Scanns the image by an 8x8 struct. element Counts the number of blocks containing each color Generates a color histogram (HMMD/4CSQ operating points) COLOR BIN C0 C1 +1 C2 C3 +1 C4 C5 C6 C7 +1 8 x 8 structuring element

Topic: Texture Feature Vectors Colour Texture Shape

What is texture? Texture gives us information about the spatial arrangement of the colours or intensities in an image. [L. Shapiro]

Two approaches to texture Structural approach Texture is a set of primitive texels in some regular or repeated relationship. Good for regular, man-made textures. Statistical approach Texture is a quantitative measure of the arrangement of intensities in a region. More general and easier to compute.

Statistical approaches Grey level of central pixels Average of grey levels in window Median Standard deviation of grey levels Difference of maximum and minimum grey levels Difference between average grey level in small and large windows Sobel feature Kirsch feature Derivative in x window Derivative in y window Diagonal derivatives Combine features How do I pick one??

MPEG-7 Homogenous Texture Filters the image with a set of orientation and scale sensitive filters. Computes mean and standard deviation of response. 30 channels 6 in angular direction, 5 in radial direction. F HT f DC, f, e d SC 1, e2,..., e30, d1, d2,..., 30 fdc, fsc are the mean intensity and the standard deviation of image texture), where ex and dx are the logarithmically scaled texture energy and texture energy deviation coefficients.

HT Channels

MPEG-7 Local Edge Histogram Image divided into 4x4 sub-regions. Edge histogram computer for each subregion. Five bins: Vertical, horizontal, 45 diagonal, 135 diagonal, and isotropic. 80 total bins. F LEH [ f 0,..., f79]

Topic: Shape Feature Vectors Colour Texture Shape

Definitions Geometric definition Two sets have the same shape if one can be transformed into another by a combination of translations, rotations and uniform scaling operations.

Shape and Segmentation Shape implies a segmentation step. Segmentation has multiple solutions (middlelevel feature). But the shape feature itself has a single solution! How do we describe shapes? Chain-codes Statistical descriptors.

Freeman Chain Code Chains represent the borders of objects. Coding with chain codes. Relative. Assume an initial starting point for each object. How do we build a feature vector? Freeman Chain Code Using a Freeman Chain Code and considering the top-left pixel as the starting point: 70663422

MPEG-7 Region-Based Shape Uses a set of separable ART (angular radial transformation) functions. Classifies shape along various angular and radial directions. Totals 35 coefficients. F RBS [ f,..., f 0 34 ]

ART Basis Functions Applicable to figures (a) (e) Distinguishes (i) from (g) and (h) (j), (k), and (l) are similar

MPEG-7 Contour-Based Finds curvature zero crossing points of the shape s contour (key points) Reduces the number of key points step by step, by applying Gaussian smoothing The position of key points are expressed relative to the length of the contour curve Descriptor

Applicable to (a) Distinguishes differences in (b) Find similarities in (c) - (e) Advantages: Captures the shape very well Robust to the noise, scale, and orientation It is fast and compact

Comparison Blue: Similar shapes by Region-Based Yellow: Similar shapes by Contour-Based

Resources L. Shapiro, Chapters 6 and 7 T. Sikora, MPEG-7 Visual Standard for Content Description An Overview, http://ieeexplore.ieee.org/iel5/76/20050/00 927422.pdf?arnumber=927422