Chapter 4 - Image. Digital Libraries and Content Management

Similar documents
Chapter 2 - Concepts and Definitions

Digital Image Processing

Multimedia Information Retrieval

Extraction of Color and Texture Features of an Image

An Introduction to Content Based Image Retrieval

Content Based Image Retrieval

Chapter 3 Image Registration. Chapter 3 Image Registration

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Color and Shading. Color. Shapiro and Stockman, Chapter 6. Color and Machine Vision. Color and Perception

Lecture 8 Object Descriptors

Ulrik Söderström 21 Feb Representation and description

Content-based Image Retrieval (CBIR)

Image Processing: Final Exam November 10, :30 10:30

Computer Graphics Fundamentals. Jon Macey

Machine vision. Summary # 6: Shape descriptors

Boundary descriptors. Representation REPRESENTATION & DESCRIPTION. Descriptors. Moore boundary tracking

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

Anno accademico 2006/2007. Davide Migliore

Practical Image and Video Processing Using MATLAB

Points Lines Connected points X-Y Scatter. X-Y Matrix Star Plot Histogram Box Plot. Bar Group Bar Stacked H-Bar Grouped H-Bar Stacked

EUROPEAN COMPUTER DRIVING LICENCE / INTERNATIONAL COMPUTER DRIVING LICENCE IMAGE EDITING

LECTURE 4: FEATURE EXTRACTION DR. OUIEM BCHIR

Multimedia Databases. 2. Summary. 2 Color-based Retrieval. 2.1 Multimedia Data Retrieval. 2.1 Multimedia Data Retrieval 4/14/2016.

How to create shapes. Drawing basic shapes. Adobe Photoshop Elements 8 guide

CSI 4107 Image Information Retrieval

Region-based Segmentation

Content-Based Image Retrieval Readings: Chapter 8:

CoE4TN4 Image Processing

4. Basic Mapping Techniques

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Animation & Rendering

Multimedia Technology CHAPTER 4. Video and Animation

Rough Feature Selection for CBIR. Outline

Colour Reading: Chapter 6. Black body radiators

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

[10] Industrial DataMatrix barcodes recognition with a random tilt and rotating the camera

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Content Based Image Retrieval: Survey and Comparison between RGB and HSV model

One image is worth 1,000 words

Figure 1: Workflow of object-based classification

SUPPLEMENTARY INFORMATION

SHAPE, SPACE & MEASURE

Data Representation in Visualisation

Multimedia Database Systems. Retrieval by Content

A Content Based Image Retrieval System Based on Color Features

Histograms of Oriented Gradients

AQA GCSE Maths - Higher Self-Assessment Checklist

Logo & Icon. Fit Together Logo (color) Transome Logo (black and white) Quick Reference Print Specifications

HISTOGRAMS OF ORIENTATIO N GRADIENTS

Read, write compare and order numbers beyond 1000 in numerals and words Read Roman numerals to 100 and understand how they have changed through time

Visualisatie BMT. Rendering. Arjan Kok

CIRCULAR MOIRÉ PATTERNS IN 3D COMPUTER VISION APPLICATIONS

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Middle School Math Course 3

Modify Panel. Flatten Tab

Unit #13 : Integration to Find Areas and Volumes, Volumes of Revolution

L1 - Introduction. Contents. Introduction of CAD/CAM system Components of CAD/CAM systems Basic concepts of graphics programming

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into

My Target Level 1c. My areas for development:

Fuzzy Hamming Distance in a Content-Based Image Retrieval System

11. Image Data Analytics. Jacobs University Visualization and Computer Graphics Lab

TIMSS 2011 Fourth Grade Mathematics Item Descriptions developed during the TIMSS 2011 Benchmarking

6 PETAL Math Pacing Guide Lynchburg City Schools

Number/Computation. addend Any number being added. digit Any one of the ten symbols: 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9

Content-Based Image Retrieval Readings: Chapter 8:

Ripplet: a New Transform for Feature Extraction and Image Representation

Color and Shape Index for Region-Based Image Retrieval

1 to 5 NUMBER Numbers and Operations Place Value, Rounding and Estimation Understanding Products ALGEBRA Expressions and Formulae

Lesson 11. Media Retrieval. Information Retrieval. Image Retrieval. Video Retrieval. Audio Retrieval

Digital Image Fundamentals

Computer Graphics 1. Chapter 2 (May 19th, 2011, 2-4pm): 3D Modeling. LMU München Medieninformatik Andreas Butz Computergraphik 1 SS2011

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

Machine learning Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Paint/Draw Tools. Foreground color. Free-form select. Select. Eraser/Color Eraser. Fill Color. Color Picker. Magnify. Pencil. Brush.

UNIT 15 GRAPHICAL PRESENTATION OF DATA-I

form are graphed in Cartesian coordinates, and are graphed in Cartesian coordinates.

Medical Image Processing using MATLAB

Symmetry Based Semantic Analysis of Engineering Drawings

The base of a solid is the region in the first quadrant bounded above by the line y = 2, below by

PyPlot. The plotting library must be imported, and we will assume in these examples an import statement similar to those for numpy and math as

(Refer Slide Time 00:17) Welcome to the course on Digital Image Processing. (Refer Slide Time 00:22)

Chapter - 2: Geometry and Line Generations

Extracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang

A Symmetry Operator and Its Application to the RoboCup

TABLE OF CONTENTS PRODUCT DESCRIPTION VISUALIZATION OPTIONS MEASUREMENT OPTIONS SINGLE MEASUREMENT / TIME SERIES BEAM STABILITY POINTING STABILITY

Graphical Presentation for Statistical Data (Relevant to AAT Examination Paper 4: Business Economics and Financial Mathematics) Introduction

Anatomy of Tables and Graphs. Matthew Wettergreen, PhD

Pattern recognition. Classification/Clustering GW Chapter 12 (some concepts) Textures

Graphics and Interaction Rendering pipeline & object modelling

CMP Book: Investigation Number Objective: PASS: 1.1 Describe data distributions and display in line and bar graphs

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

CS 231A Computer Vision (Fall 2012) Problem Set 3

Approaches to Visual Mappings

Digital Image Processing. Introduction

Color. making some recognition problems easy. is 400nm (blue) to 700 nm (red) more; ex. X-rays, infrared, radio waves. n Used heavily in human vision

Your Flowchart Secretary: Real-Time Hand-Written Flowchart Converter

7 Fractions. Number Sense and Numeration Measurement Geometry and Spatial Sense Patterning and Algebra Data Management and Probability

Colour computer vision: fundamentals, applications and challenges. Dr. Ignacio Molina-Conde Depto. Tecnología Electrónica Univ.

Birkdale High School - Higher Scheme of Work

Transcription:

Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 4 - Image

Vector Graphics Raw data: set (!) of lines and polygons with coordinates and attributes (line width, color) Registration data: coordinate system (cartesian, polar) definitions for colors, textures Description data: grouping related lines and polygons into geometric, higher order objects: squares, rectangles, projections of 3D-objects start point end point width 0 9 20 0 30 8 10 10 20 10 20 0 30 10 30 30 2 1 2 2 10 0 10 10 1 11 8 10 10 1 (0, 0) (20, 0) (0, 30) (20, 30) 20 10 0 10 2 b) resulting graphics

Vector Graphics (2) Operations: focus on output manipulation by special editors (not part of the repository) "simple" manipulations in the repository without replacing the complete graphics Input: via file or interchange format Output: to a file, on a device (display, plotter) line by line (numberoflines, getalllines) Modification: add/delete line translate, rotate, resize Analysis, aggregation, extraction extract rectangular section (clipping) reconstruction of geometric objects

Search Comparison (for search) using description data (-> text) using raw data similarity of geometric figures (for very simple graphics) Significant extensions required for curves surfaces segments (collection of arbitrary related elements) Subtypes: technical drawing (CAD) bar charts, flow charts, pie charts street maps

(Raster) Image Raw data pixel matrix (pixel = picture element, or "pel") Registration data # of bits/pixel (pixel depth, usually 1, 8, or 24) # of pixels/line (picture width) # of lines (picture height) linearization: by line or column pixel semantics: grayscale, color definition, index into colormap (optional) colormap with specific number of entries, length of entries (e.g., 24 bit) (optional) definition of color space (RGB, IHS, ) and more Descriptive data text, keywords, knowledge representation describing content recognized lines, areas appearing in the image resulting 2-D objects such as circles, elipses, polygons,

Image Operations Input/Output file (SUN Rasterfile, GIF, TIFF, JPEG,...) main memory data structure (matrix) device (scanner, camera / display, printer) Modification: set individual pixel(s) change color map for artificial colors (tomography) for analysis (contrast) bitmap operations overlay with other images Analysis, aggregation, extraction increasing contrast line recognition resize, zoom, crop (window) convert to grayscale

Image Search Significant research efforts over the last 20 years a number of techniques and systems available Alternative approaches: attribute-based text decription (annotation) content-based image retrieval (CBIR) elementary features (low-level) semantic features, object recognition (high-level)

Text-based Image Retrieval Image description unrestricted (free) text Queries keyword or free text with/without boolean operators Search uses conventional IR techniques (see chapter 3) Differences to text search manual annotation required (unless captions are already present in the image file) needs to be efficient, complete, consistent requires domain knowledge, thesaurus Advantages abstractions, concepts can be used ("smile", "happiness") hard to achieve with other techniques Disadvantages visual appearance (e.g., texture) is hard to describe textually query-by-example-image is not supported

CBIR: Color-based Search Main idea given an image, find images with a similar color appearance three primary colors, or color channels (e.g., RGB) each channel is divided into m discrete intervals results in m 3 different color combinations (bins) Color histogram H(M) for image M vector (h 1, h 2,..., h j,..., h n ) with n = number of bins h j = number of pixels for image M that fall into bin j Query is mapped to a histogram as well compute from example image, or "guess" from description Search compute distance between histograms of stored images and query image return images below a given threshold, or first k (sorted ascending by distance)

Color Histograms Distance metrics numerous proposals simplest: L-1 dh (, H) = h h n 1 2 1, l 2, l l= 1 Limitations of basic approach similarity of colors (by human perception) and bins is ignored assumption: all images have N pixels if not, then perform a normalization step maximum distance of two images: 2N there is no bin in which both images have pixels what if images do not have exactly the same colors, but colors perceived to be similar? queries only give an approximation of the desired colors colors may be slightly different due to noise, lighting conditions, etc. problem is made worse by the exact definition of bins

Accounting For Color Similarity Account for color perception in distance computation method by Niblack et. al. (used in QBIC) query histogram X, database image histogram Y normalized: 0 x i, y i 1; Σx i = Σy i = 1 bin-by-bin distance histogram Z = X Y -1 z i 1; Σz i = 0 distance between X and Y A is the symmetric color similarity matrix with a(i, j) = 1 d(c i, c j ) / d max c i and c j are the colors of bin i and j in the histogram, d(c i, c j ) is the distance of colors, d max is the maximum distance of all colors for similar colors, d(c i, c j ) is small, so a(i, j) is close to 1 2 t d ( X, Y) = ( X Y) A( X Y) = a ( x y )( x y ) hist k i presence of similar colors will result in reducing the distance k j ij i i j j Example: bins for red, orange, blue R = [1.0, 0.0, 0.0] O = [0.0, 1.0, 0.0] B = [0.0, 0.0, 1.0] A red,orange,blue = 2 d hist 2 d hist 1.0 0.9 0.0 0.9 1.0 0.0 0.0 0.0 1.0 (R,O) = 0.2 (R, B) = 2.0

Perceptually Weighted Histograms Determine representative colors of a color space For each color, create a color bin equal distribution across color space Histogram computation: for each pixel, determine the ten most similar representative colors determine the color distance assign a weight proportional to the inverse distance and add to the corresponding color bin pixel counts (fractionally) in multiple bins Most important difference to previous approaches: histogram computation already considers color similarity Apparently results in performance advantage

Image Regions Consider position of colors within the image's spatial dimensions Partition the image into regions and compute histograms for each region raster-based (e.g., 4 x 4 regions of fixed size) separation of foreground and background background often dominate regarding image area foreground usually more important for search separate histograms should be computed segmentation does not need to be precise minimum bounding rectangle, can be determined automatically based on horizontal and vertical pixel variations, or can be manually assigned Search may exploit the presence of regions (or foreground/background) in different ways global search, considering region-specific similarity measures focused search, referring to a specific region (e.g., foreground only) weighting regions differently Experiments show significant improvements

Color Histograms Additional Aspects Color distribution so far, we partitioned the color space into bins in an equi-distant manner, without considering which colors are actually used alternative approaches: use finer-grained partitioning for color intervals that receive more pixels Color variability which color spaces are best suited for distance computation? when should pixels in different pictures be considered "equivalent"? color spaces RGB is device-dependent, requires White and three primary colors for reference, is very sensitive regarding lighting conditions, surface reflections, etc. Munsell or Lab spaces achieve a better perceptual uniformity HSV (hue, saturation, value) hue is invariant regarding illumination

Shape-based Search Requires segmentation semi-automatic methods exist Shape representation and similarity every shape should have a unique representation, invariant to translation, rotation, scaling similar shapes should have similar representations to allow for search based on distance measures Query query-by-example, sketch Terminology major axis: straight line between the two most distant points on the boundary of the shape minor axis: straight line perpendicular to the major axis such that a minimum bounding rectangle can be constructed which is parallel to the major and minor axis and width/ height correspond to the lengths of the major/minor axis base rectangle: the minimum bounding rectangle described above excentricity: ratio of major to minor axis ( 1)

Shape Representation Simple shape representation based on the four measures defined above may be used for representation, search is only a "weak" characterization of the shape usually combined with additional characteristics example (QBIC system by IBM): area, circularity, major axis orientation, Region-based shape representation closer to perceived shape by including regions promises better retrieval quality

Shape Regions - Idea Raster definition quadratic cells of equal size Cells raster is just large enough to cover the shape may be completely or partially "filled" by the form, or may be "empty" are labeled with "1" when filled at least 15%, otherwise 0 ordered left to right, top to bottom within the raster: bitstring for the shape compact, easy to compute, invariant to translations Raster size smaller cells result in more accurate shape representation, but require more resources compromise: cell size from 10x10 up to 20x20 pixels

Region Normalization Achieves uniform shape orientation and scale Rotation Scale major axis parallel to x-axis still two possible positions results in two bitstrings not for stored objects (would double storage space requirements) but for queries proportionally scale the shape to a fixed length of the major axis (e.g., 192 pixels) Results in unique shape representation provided that the major axis is unique Bitstring length raster is just large enough for normalized shape: #cells along x-axis is always the same example: 8 cells with cell size 24x24, major axis length 192 pixels number of cells along the y-axis may vary (difference in excentricity), but is always smaller than along the x-axis has to be accounted for by the similarity measure

Similarity Measure Idea: distance = number of unequal cells Excentricity needs to be accounted for equal raster size: bitwise comparison significant difference in #cells along y-axis: shapes are regarded as different threshold depends on cell size and application typical example: 3 small difference for y-axis: fill with zeroes, do bitwise comparison Reflections (horizontal/vertical) should be considered similar two additional bitstrings for each query Multiple possible major axis need to store all possible major axes, determine all possible major axes for query pairwise distance comparison, keep minimum

Indexing and Retrieval For every shape in the database determine major/minor axis, excentricity normalize regarding rotation and scale apply raster, determine bitstring store bitstring, length of minor axis For every query determine bitstring in the same manner but include bitstrings for 180 rotation, reflections search for bitstrings in the database with (almost) equal minor axis length (similar excentricity) compute distance for all matches return sorted ascending by distance

Texture-based Search Texture is hard to characterize Approach by Tamura et.al: six different features coarseness most important feature size of distinguishable picture elements contrast grayscale levels, edge sharpness, periodicity of repeated elements directionality shape and position of elements line-likeness shape of elements regularity variation in placement of elements roughness

Systems Have to support combinations of different kind of search especially for elementary features and text QBIC Virage query image (color, shape, texture) plus keywords partly included in DB2 wwwqbic.almaden.ibm.com Features: color, shape, texture www.virage.com WebSEEK www.ctr.columbia.edu/webseek

Summary Vector graphics media object raw/registration/description data, operations search Raster image media object raw/registration/description data, operations text-based image retrieval Content-based image retrieval (CBIR) mostly based on elementary (low-level) features: color-based histograms, color similarity matrix, perceptually weighted histograms, regions shape-based representation raster, normalization, similarity texture-based