Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Similar documents
Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Multiresolution Image Classification by Hierarchical Modeling with Two-Dimensional Hidden Markov Models

Jia Li Department of Statistics The Pennsylvania State University University Park, PA 16802

Mining Digital Imagery Data for Automatic Linguistic Indexing of Pictures Λ

340 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 3, MARCH Studying Digital Imagery of Ancient Paintings by Mixtures of Stochastic Models

From Pixels to Semantics Mining Digital Imagery Data for Automatic Linguistic Indexing of Pictures

Image classification by a Two Dimensional Hidden Markov Model

Resolution 1. Resolution 2. Resolution 3. Model about concept 1. concept 2. Resolution 1. Resolution 2. Resolution 3. Model about concept 2.

Fabric Image Retrieval Using Combined Feature Set and SVM

Probabilistic Graphical Models Part III: Example Applications

A Quantitative Approach for Textural Image Segmentation with Median Filter

FRACTAL DIMENSION BASED TECHNIQUE FOR DATABASE IMAGE RETRIEVAL

COMS 4771 Clustering. Nakul Verma

Latest development in image feature representation and extraction

Image Similarity Measurements Using Hmok- Simrank

Large-scale Satellite Image Browsing using Automatic Semantic Categorization and Content-based Retrieval

A Study on the Effect of Codebook and CodeVector Size on Image Retrieval Using Vector Quantization

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Texture Image Segmentation using FCM

Differential Compression and Optimal Caching Methods for Content-Based Image Search Systems

Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE. Date: Thursday 16th January 2014 Time: 09:45-11:45. Please answer BOTH Questions

Random projection for non-gaussian mixture models

Clustering Documents in Large Text Corpora

AS STORAGE and bandwidth capacities increase, digital

Rough Feature Selection for CBIR. Outline

10-701/15-781, Fall 2006, Final

A Visualization Tool to Improve the Performance of a Classifier Based on Hidden Markov Models

Video Key-Frame Extraction using Entropy value as Global and Local Feature


Content-Based Image Retrieval of Web Surface Defects with PicSOM

Hidden Markov Models. Slides adapted from Joyce Ho, David Sontag, Geoffrey Hinton, Eric Xing, and Nicholas Ruozzi

INFORMATION RETRIEVAL USING MARKOV MODEL MEDIATORS IN MULTIMEDIA DATABASE SYSTEMS. Mei-Ling Shyu, Shu-Ching Chen, and R. L.

D-Separation. b) the arrows meet head-to-head at the node, and neither the node, nor any of its descendants, are in the set C.

Topic Diversity Method for Image Re-Ranking

Lecture 8: The EM algorithm

Content Based Image Retrieval Using Curvelet Transform

Handwritten Word Recognition using Conditional Random Fields

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

Lecture 3: Conditional Independence - Undirected

Content based Image Retrieval Using Multichannel Feature Extraction Techniques

Texture Modeling using MRF and Parameters Estimation

Text Document Clustering Using DPM with Concept and Feature Analysis

PERFORMANCE EVALUATION OF ONTOLOGY AND FUZZYBASE CBIR

Bi-Level Classification of Color Indexed Image Histograms for Content Based Image Retrieval

Evaluation of texture features for image segmentation

Fingerprint Image Compression

Probabilistic Graphical Models

10601 Machine Learning. Hierarchical clustering. Reading: Bishop: 9-9.2

Hierarchical GEMINI - Hierarchical linear subspace indexing method

Textural Features for Image Database Retrieval

Introduction to Mobile Robotics

Multimodal Information Spaces for Content-based Image Retrieval

Automated Extraction of Event Details from Text Snippets

Fast Decision of Block size, Prediction Mode and Intra Block for H.264 Intra Prediction EE Gaurav Hansda

What is this Song About?: Identification of Keywords in Bollywood Lyrics

Datasets Size: Effect on Clustering Results

Simulating Geological Structures Based on Training Images and Pattern Classifications

Re-Ranking of Web Image Search Using Relevance Preserving Ranking Techniques

ENHANCEMENT OF METICULOUS IMAGE SEARCH BY MARKOVIAN SEMANTIC INDEXING MODEL

Large Scale Chinese News Categorization. Peng Wang. Joint work with H. Zhang, B. Xu, H.W. Hao

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

MR IMAGE SEGMENTATION

Computer vision: models, learning and inference. Chapter 10 Graphical Models

Sum-Product Networks. STAT946 Deep Learning Guest Lecture by Pascal Poupart University of Waterloo October 15, 2015

AN IMAGE segmentation algorithm aims to assign a class

Structured Learning. Jun Zhu

VIDEO OBJECT SEGMENTATION BY EXTENDED RECURSIVE-SHORTEST-SPANNING-TREE METHOD. Ertem Tuncel and Levent Onural

Enhanced Image Retrieval using Distributed Contrast Model

CHAPTER 2 TEXTURE CLASSIFICATION METHODS GRAY LEVEL CO-OCCURRENCE MATRIX AND TEXTURE UNIT

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

Automatic Image Annotation and Retrieval Using Hybrid Approach

A New Configuration of Adaptive Arithmetic Model for Video Coding with 3D SPIHT

Data Mining Chapter 9: Descriptive Modeling Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Mine Boundary Detection Using Markov Random Field Models

Hidden Markov Models in the context of genetic analysis

Predicting 3D People from 2D Pictures

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Keywords Clustering, Goals of clustering, clustering techniques, clustering algorithms.

Image retrieval based on bag of images

Learning Representative Objects from Images Using Quadratic Optimization

Learning Objects and Parts in Images

Forensic Image Recognition using a Novel Image Fingerprinting and Hashing Technique

Computer Vision Group Prof. Daniel Cremers. 4. Probabilistic Graphical Models Directed Models

Research on Remote Sensing Image Template Processing Based on Global Subdivision Theory

ANALYSIS OF SPIHT ALGORITHM FOR SATELLITE IMAGE COMPRESSION

A REVIEW ON IMAGE RETRIEVAL USING HYPERGRAPH

Object of interest discovery in video sequences

Closest Keywords Search on Spatial Databases

CONTENT BASED IMAGE RETRIEVAL SYSTEM USING IMAGE CLASSIFICATION

Wavelet Based Image Retrieval Method

Image Segmentation using Gaussian Mixture Models

Automatic annotation of digital photos

Exact Inference: Elimination and Sum Product (and hidden Markov models)

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

CS Introduction to Data Mining Instructor: Abdullah Mueen

Content Based Image Retrieval: Survey and Comparison between RGB and HSV model

Image Analysis, Classification and Change Detection in Remote Sensing

International Journal of Electrical, Electronics ISSN No. (Online): and Computer Engineering 3(2): 85-90(2014)

Transcription:

Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Outline Objective Approach Experiment Conclusion and Future work

Objective Automatically establish linguistic indexing of pictures for a CBIR system CBIR: Content-based image retrieval is aimed at efficient retrieval of relevant images from large image databases based on automatically derived imagery features.

Objective Categories of CBIR technology High level semantics description: 1). Image knowledge base e.g. MINDSWAP, The Helsinki University Museum; 2). Drawbacks Low level feature-based classification: 1). PicHunter, PicSeek etc. 2).Drawbacks e.g Give me one image of this dog when it was a puppy Link high level semantic description to low level feature information by automatically assigning comprehensive textual description to pictures.

Problems of automatic annotation to images Automatic mapping between low level feature and knowledge e.g. Who s that guy in the picture? What is he doing? How does one model the semantic content? Can computer use what kind of domain knowledge to describe an image and how can computer acquire and store this knowledge? This article solves a part of the first problem.

Approach 2D MHMM (2-dimensional multiresolution Hidden Markov Model) Why does it choose 2D MHMM? 1-D HMM is suitable for block-based image classification. For HMM, the image s category is the hidden state, and its feature vector is observation symbols for the state. Compared 1-D HMM, 2-D HMM solves the problem of overlocalization.

Approach

Approach From the view of computational complexity, it s necessary to increase the size of one block and prevent from including too many objects in 2-D HMM. For this purpose, the authors introduce multiresolution.

Approach Here one block is represented by a feature vector.

Application architecture

Select one category of images to train for one concept A concept corresponds to a particular category of images. (A concept doesn t just correspond to one word. A cluster of words can be considered as a concept.)

Extract features from this category of images Every picture s size is 384 * 256. An image is partitioned into 4 * 4 blocks. For each block, the system extracts a feature vector of six dimension using wavelet transform.

Statistical Modeling Assumptions Where Si,j the state of block (i, j), Ui,j the feature vector of block (i, j), m = Si-1,j n = Si,j-1 l = Si,j; (i, j) is the coordinate of the state. This formula means the Si,j depends on Si,j. (i,j ) < (i,j) means the block (i,j ) is before the block (i,j)

Assumptions 2 Given every state, the feature vectors follow a Gaussian distribution. And the parameters of the Gaussian distribution only depend on this state in its resolution.

Assumptions 3 For the MHMM, denote the set of resolutions by, with r = R the finest resolution. Let the collection of block indices at resolution r be Where w is the maximal X coordinate of the state in the finest resolution, z is the maximal Y coordinate of the state in the finest resolution.

Assumptions 4 This formula means that given the states at the parent resolution, the states at the current resolution are conditionally independent of the other preceding resolutions, so that

Assumptions 5 The feature vector is conditionally independent of information on other blocks once the state of a block of the feature vector is known.

Assumptions 6 According to the above assumption, we can get the joint probability of a particular set of states and the feature vector:

Assumptions 7 Child blocks descended from different parent blocks are conditionally independent. The state transition probabilities depend on the state of their parent block. So compute the transition probabilities in this formula: Where

Statistical Modeling The joint probability of states and feature vectors at all the resolutions is then derived as To summarize, a 2D MHMM captures both the interscale and intra-scale statistical dependence. This model is trained using EM algorithm.

Automatic Linguistic Indexing of Pictures Use the models of every concept to compute the log probabilities of generating, that is Sort the log value to find K top ranked categories.

Automatic Linguistic Indexing of Pictures After getting K candidate concepts, the author doesn t use these concepts to annotate the image. j, k: The word appear j times in k categories. p is the percentage of image categories in the database that are annotated with this word.

Experiment

Experiment 600 categories, 100 images, 40 images per each concept. 4,630 test images outside the training set are used to test classification accuracy.

Accuracy Accuracy means the match percentage of 4,630 images. match means the test image annotated by this system is actually included in this category.

Conclusion My opinion They link concepts and features in order to establish the concept indexing to make keyword queries a little intelligent. Because it doesn t care the second problems, it still isn t intelligent enough for contentbased image retrieval. Proposed a 2D MHMM modeling approach to solve the problem of automatic linguistic indexing of pictures.

Conclusion Advantage of this approach Models for different concepts can be independently trained and retrained. Hence the system has good scalability; Spatial relation among image pixels within and across resolutions is taken into consideration with probabilistic likelihood as a universal measure.

Conclusion Limitation Train the concept dictionary using only 2D images without a sense of object size. 40 training images are insufficient for the computer program to build a reliable model for a complex concept.

Future work Improve the indexing speed of the system by using approximation in the likelihood computation. A rule-based system may be used to process the words annotated automatically to eliminate conflicting semantics.

Reference [1] J. Li, R.M. Gray, and R.A. Olshen, Multiresolution Image Classification by Hierarchical Modeling with Two Dimensional Hidden Markov Models, IEEE Trans. Information Theory, vol. 46, no. 5, pp. 1826-41, Aug. 2000. [2] J. Li, A. Najmi, and R.M. Gray, Image Classification by a Two Dimensional Hidden Markov Model, IEEE Trans. Signal Processing, vol. 48, no. 2, pp. 517-33, Feb. 2000. [3] J.Z. Wang, J. Li, and G. Wiederhold, SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 23, no. 9, pp. 947-963, Sept. 2001.