An Efficient Content Based Image Retrieval using EI Classification and Color Features

Similar documents
Edge Histogram Descriptor, Geometric Moment and Sobel Edge Detector Combined Features Based Object Recognition and Retrieval System

An Introduction to Content Based Image Retrieval

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

CLASSIFICATION OF BOUNDARY AND REGION SHAPES USING HU-MOMENT INVARIANTS

Content Based Image Retrieval Using Color Quantizes, EDBTC and LBP Features

An Efficient Semantic Image Retrieval based on Color and Texture Features and Data Mining Techniques

Content Based Image Retrieval: Survey and Comparison between RGB and HSV model

AN EFFICIENT BATIK IMAGE RETRIEVAL SYSTEM BASED ON COLOR AND TEXTURE FEATURES

Content-based Image Retrieval (CBIR)

Color and Texture Feature For Content Based Image Retrieval

Scene Text Detection Using Machine Learning Classifiers

A Novel Image Retrieval Method Using Segmentation and Color Moments

CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION

A Quantitative Approach for Textural Image Segmentation with Median Filter

International Journal of Advance Research in Engineering, Science & Technology

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

2. LITERATURE REVIEW

AN EFFICIENT APPROACH FOR IMPROVING CANNY EDGE DETECTION ALGORITHM

Holistic Correlation of Color Models, Color Features and Distance Metrics on Content-Based Image Retrieval

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

A Comparative Study on Retrieved Images by Content Based Image Retrieval System based on Binary Tree, Color, Texture and Canny Edge Detection Approach

Image Segmentation Techniques

Shape Descriptor using Polar Plot for Shape Recognition.

A New Algorithm for Shape Detection

Contour-Based Large Scale Image Retrieval

Detecting Printed and Handwritten Partial Copies of Line Drawings Embedded in Complex Backgrounds

Consistent Line Clusters for Building Recognition in CBIR

IRIS SEGMENTATION OF NON-IDEAL IMAGES

RESTORATION OF DEGRADED DOCUMENTS USING IMAGE BINARIZATION TECHNIQUE

FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM

COLOR FEATURE EXTRACTION FOR CBIR

Content-Based Image Retrieval Readings: Chapter 8:

Image Retrieval Based on its Contents Using Features Extraction

The Vehicle Logo Location System based on saliency model

Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.

An Improved CBIR Method Using Color and Texture Properties with Relevance Feedback

Hierarchical GEMINI - Hierarchical linear subspace indexing method

Image Repossession Based on Content Analysis Focused by Color, Texture and Pseudo-Zernike Moments features of an Image

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

Patent Image Retrieval

[2006] IEEE. Reprinted, with permission, from [Wenjing Jia, Huaifeng Zhang, Xiangjian He, and Qiang Wu, A Comparison on Histogram Based Image

Image retrieval based on bag of images

Detection of a Single Hand Shape in the Foreground of Still Images

Content-Based Image Retrieval. Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object Recognition

Image retrieval based on region shape similarity

Image Matching Using Run-Length Feature

Journal of Asian Scientific Research FEATURES COMPOSITION FOR PROFICIENT AND REAL TIME RETRIEVAL IN CBIR SYSTEM. Tohid Sedghi

An Efficient Methodology for Image Rich Information Retrieval

FPGA IMPLEMENTATION FOR REAL TIME SOBEL EDGE DETECTOR BLOCK USING 3-LINE BUFFERS

MORPHOLOGICAL BOUNDARY BASED SHAPE REPRESENTATION SCHEMES ON MOMENT INVARIANTS FOR CLASSIFICATION OF TEXTURES

Color Image Segmentation

Content Based Image Retrieval Using Color and Texture Feature with Distance Matrices

Implementation of Texture Feature Based Medical Image Retrieval Using 2-Level Dwt and Harris Detector

A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering

Get High Precision in Content-Based Image Retrieval using Combination of Color, Texture and Shape Features

Automatic Image Annotation by Classification Using Mpeg-7 Features

An Efficient Single Chord-based Accumulation Technique (SCA) to Detect More Reliable Corners

PERFORMANCE EVALUATION OF ONTOLOGY AND FUZZYBASE CBIR

A METHOD FOR CONTENT-BASED SEARCHING OF 3D MODEL DATABASES

Efficient Content Based Image Retrieval System with Metadata Processing

COLOR AND SHAPE BASED IMAGE RETRIEVAL

Countermeasure for the Protection of Face Recognition Systems Against Mask Attacks

Lane Detection using Fuzzy C-Means Clustering

Water-Filling: A Novel Way for Image Structural Feature Extraction

An ICA based Approach for Complex Color Scene Text Binarization

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Application of Geometry Rectification to Deformed Characters Recognition Liqun Wang1, a * and Honghui Fan2

Moving Object Segmentation Method Based on Motion Information Classification by X-means and Spatial Region Segmentation

Video De-interlacing with Scene Change Detection Based on 3D Wavelet Transform

Raghuraman Gopalan Center for Automation Research University of Maryland, College Park

Content-Based Image Retrieval Readings: Chapter 8:

An Approach for Reduction of Rain Streaks from a Single Image

ScienceDirect. Reducing Semantic Gap in Video Retrieval with Fusion: A survey

Cursive Handwriting Recognition System Using Feature Extraction and Artificial Neural Network

I. INTRODUCTION. Figure-1 Basic block of text analysis

Efficient Indexing and Searching Framework for Unstructured Data

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Supervised texture detection in images

Color-Texture Segmentation of Medical Images Based on Local Contrast Information

An Efficient Approach for Color Pattern Matching Using Image Mining

Specular 3D Object Tracking by View Generative Learning

Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach

Aggregated Color Descriptors for Land Use Classification

Color Content Based Image Classification

Robust Shape Retrieval Using Maximum Likelihood Theory

A Miniature-Based Image Retrieval System

Identifying Layout Classes for Mathematical Symbols Using Layout Context

Lecture 6: Multimedia Information Retrieval Dr. Jian Zhang

Content Based Image Retrieval with Semantic Features using Object Ontology

CHAPTER 8 COMPOUND CHARACTER RECOGNITION USING VARIOUS MODELS

CHAPTER 6 MODIFIED FUZZY TECHNIQUES BASED IMAGE SEGMENTATION

A Robust and Efficient Motion Segmentation Based on Orthogonal Projection Matrix of Shape Space

Multimedia Information Retrieval

Image Classification Using Wavelet Coefficients in Low-pass Bands

Analysis of Image and Video Using Color, Texture and Shape Features for Object Identification

Critique: Efficient Iris Recognition by Characterizing Key Local Variations

Semantics-based Image Retrieval by Region Saliency

Structural Analysis of Aerial Photographs (HB47 Computer Vision: Assignment)

Cell Clustering Using Shape and Cell Context. Descriptor

Transcription:

An Efficient Content Based Image Retrieval using EI Classification and Color Features M. Yasmin 1, M. Sharif* 2, I. Irum 3 and S. Mohsin 4 1,2,3,4 Department of Computer Science COMSATS Institute of Information Technology, Pakistan *muhammadsharifmalik@yahoo.com 2 Department of Computer Science COMSATS Institute of Information Technology, Pakistan Pakistan ABSTRACT An efficient method for image search and retrieval has been proposed in this study. For this purpose images are decomposed in equal squares of minimum 24x16 size and then edge detection is applied to those decomposed parts. Pixels classification is done on the basis of edge pixels and inner pixels. Features are selected from edge pixels for populating the database. Moreover, color differences are used to cluster same color retrieved results. Precision and recall rates have been used as quantification measures. It can be seen from the results that proposed method shows a very good balance of precision and recall in minimum retrieval time, achieved results are comprised of 66%-100% rate for precision and 68%-80% for recall. Keywords: Color Distances, Edge Pixels, Feature Extraction, Image Clustering, Inner Pixels. 1. Introduction Digital images are everywhere. The immense use of digital camera has led to millions of image databases. Digital images are used in countless applications like in education, agriculture, medical sciences, defense, and entertainment and so on. Images features color, texture, shape, spatial layout etc are used as content to search and match the images [1-3]. Therefore, there is an intense need of image searching, matching and retrieval at different steps of applications. A prominent example of this is in agriculture, an application that is classification of healthy and ill seeds which needs image matching with a standard healthy seed. An application of image retrieval named comic image retrieval system; introduced in [4] utilized the image features for image retrieval. Content based image retrieval known as CBIR has been getting great attention of researchers since the last decades because of its usage in a vast number of applications as one is discussed in [5]. An efficient method for image content detection has been presented in [6] that used pixel value histogram and found the correlation coefficient between those values. Corner matching is also a powerful technique in context of image match estimations [7]. A powerful descriptor based on angle edge and histograms is also presented for retrieval of images [8]. Application of Pythagorean theory to image yielded very valuable characteristics for the matching, searching and retrieval of images. Another efficient method for image retrieval is discussed in [9] by combining the multiple features of color, shape and relevance feedback. Shape descriptors are stronger than other features like texture and color in semantically describing the image [10]. However, the challenge is the extraction and representation of shape information accurately [11]. Performance of shape descriptors is categorized as qualitative and quantitative descriptions. Qualitative description involves the degree of accuracy in representing the true shape information and quantitative description involves the amount of data needs to be indexed for searching the image [12]. CBIR has also been extended up to 3D image shape retrieval [13] but still 2D shape retrieval is practical because of its superior quantitative characteristics in many tasks like pattern Journal of Applied Research and Technology 877

recognition, character recognition and image matching etc. [14]. Shape descriptors are broadly categorized into two groups namely contour based descriptors and region based descriptors. Contour based descriptors mainly use the extracted boundary information of image whereas region based descriptors use all information existing in the image region. Both are beneficial and necessary according to nature of task. The challenging task in designing a descriptor is coping with geometric transformations and semantic gap. In [15] color distances are used as color features but both [15, 16] lack the handling of geometric transformations. An effort has been put for reducing the semantic gap in [17]. Color features have been used to propose an improved CBIR system [18]. It has been noted that only the color features are not sufficient for efficient and effective image retrieval. If color features are combined with shape features, it can become a very strong descriptor for image search. In [19] the shape and color features are combined to achieve an effective retrieval system. In this paper image edge features are combined with color features in order to achieve the desired results. The organization structure of rest of the paper is as follows: in Section 2 proposed work has been discussed in detail. Section 3 presents results and discussion and Section 4 concludes the overall method. References are included in Section 5. Figure 1. Generalized Structure of Proposed Method. 2.1 Image Decomposition m n Let I denotes the image with dimensions Image is decomposed into equal squares known as cells C ij such that mni,,, j N. m / i 0 and n / j 0 means i, j are one of multiples of mn, but the minimum size of Cij will be 24 16 The main reason of this decomposition is to control the orientation of image edges. If image has been passed under any orientation like rotation, translation or scaling, it does not affect the descriptor. By collecting local features in this sense, decomposition plays a very important role in the solution going to be proposed. An example of decomposed image is shown in Fig.2. 2. Proposed Work The typical steps involved in CBIR system are feature extraction, populating the database with those features, extracting the same features from query image, comparing the features of query image and database features and on the basis of this comparison the images are finally retrieved. The proposed solution also follows the above mentioned steps. Feature extraction involves image decomposition and EI classification. Clustering is then used for grouping the retrieved results on the basis of color. The structure of proposed method is shown in Fig.1. Figure 2. Example of decomposed image. 2.2 EI (Edge & Inner) Pixels Classification EI classification is done to classify the image pixels into two classes namely edge pixels and inner pixels. The class of edge pixels is used to extract required edge related features. Fig.3 depicts the EI classification. 878 Vol. 12, October 2014

Figure 3. Pixels Classification. For each C ij gradient g ij C ij is computed using canny edge detector. g g g (1) 2 2 ij i j Disk folder structure has been used to store the downloaded images. All images have been stored in JPEG format. Inner Pixels are used for clustering the images before presenting to users. For this color distances are used by selecting inner pixels. The positive distances Red-Green, Green- Blue and Red-Blue are used for this purpose. dis( RG) R G (6) dis( GB) G B (7) dis( RB) R B (8) g j arctan g i (2) Then overall distance is calculated as follows: D dis( RG) dis( GB) dis( RB) (9) Now C ij is a combination of black & white pixels (0, 1), where c ij is 1 that is classified as edge pixel and where c ij is 0 that is classified as inner pixel. In this way the background is separated from foreground pixels. This leads to the simplicity of features extraction process. For the whole image I mn for each C ij, i& j are recorded for edge pixel. To populate the database of search images sum of i& j is entered into the database as feature. S i, j cij 1 (3) 2.3 Image Search, Clustering and Database Let Q mn be the query image and q ij are decomposed parts of query image. Decomposition of query image is done in the same way as done with database images. Same features are computed for query image and compared using Euclidean distance formula, the smaller the difference the similar the images. S (4) i, j qij 1 (5) 2 2 dis S, S S S These features are also stored in database with search images and retrieved results are clustered in a group of same color images on the basis of value of D, where the users have an opportunity of selecting the color of their own choice. 3. Experimental Results and Discussion The COREL database [20, 21] is a huge collection of color images of different categories consisting of 10,000 images. The dataset selected for experimental results of the proposed technique contains 2000 images. Dataset has been divided into 20 categories with 100 images in each category. Experimental images consist of flowers, animals, landscapes, buildings, cars and plants etc. System is CORE i3, 2.8 GHz, 2 GB RAM with Windows 7 operating system. System has been designed in such a manner that it retrieves 80 images against a query image. The tool used is MATLAB 7.0. Widely accepted Precision and Recall rates are used as quantification measures. Precision=relevant retrieved images/all retrieved images Recall=relevant retrieved images/all relevant images in database Fig.4, Fig.5 and Fig.6 show the results of retrieved images against a query image given to Journal of Applied Research and Technology 879

the system. Top 10 to 15 results have been shown here only due to the space limitations. First a query image of parrot is given to search and retrieve the related images. Query Image Retrieved Results Figure 4. Query Image and Top 15 Retrieved Results for Image Parrot, Results are Clustered on Basis of Inner Pixels Colors. A query image of Red fort is given to system to search and retrieve the related images. Query Image 880 Vol. 12, October 2014

Retrieved Results Figure 5. Query Image and Top 10 Retrieved Results for Image Waterfall, Results are Clustered on Basis of Inner Pixels Colors. Query Image Retrieved Results Journal of Applied Research and Technology 881

Figure 6. Query Image and Top 15 Retrieved Results for Image Red fort, Results are Clustered on Basis of Inner Pixels Colors. Implementing a CBIR system has many challenges to be faced. These include Size of images, Angle of orientation, Color descriptors and Selection of unique identifier for deciding the class of object and distance of acquisition instrument from the image which are of great importance. It is a very challenging task to design a robust algorithm for image search. In the proposed solution best possible effort has been made to make the retrieval process efficient. As one can easily see from the results that proposed solution provides a good ratio of relevance and a good balance between precision and recall, the angle of orientation can be one of 30, 45, 90, and 135. It has been set to 90 for all images before populating the database and each time for query image also. It really helps in accurate retrieval process. Allowed dimensions of images are between 640x480 pixels and 1024x840 pixels. In case 60% or above similarity between query image decomposed parts and database images is found, the query image is considered to be a relevant image. Table1 and Table2, Fig.7 and Fig.8 provide the results of Precision and Recall rates and computation time respectively. Query Image Total Retrieved Images Relevant Retrieved Images Irrelevant Retrieved Images Precision Recall Parrot 80 76 4 0.95 0.76 Waterfall 80 68 12 0.85 0.68 Redfort 80 80 0 1.00 0.80 Rose 80 72 8 0.90 0.72 Tiger 80 53 27 0.66 0.27 Sky 80 77 3 0.96 0.77 LandScape 80 68 12 0.85 0.68 Fruit 80 79 1 0.99 0.79 Table 1. Image Retrieval Results for Different Query Images. 90 80 70 60 50 40 30 20 10 0 Total Retrieved Images Relevant Retrieved Images Irrelevant Retrieved Images Precision Recall Figure 7. Relationship between Precion and Recall. 882 Vol. 12, October 2014

There were 100 images for each category of images from parrot to fruit and there were 2000 images in database. The algorithm searches on the basis of defined features and compares the query image after extracting the same features from query image. Proposed solution is designed to retrieve 80 images per search. One can easily observe from the results that it extracts the most relevant images, higher the degree of relevance, greater the balance between precision and recall rates. Precision defines the ratio between relevant retrieved images and total retrieved images; recall is actual relevant retrieved images and total relevant images in database. Table 1 describes these ratios for all categories in database. Many experiments have been performed by querying different images of each category and the results of experiments have proven the consistency of proposed method for each type of query image. Fig.7 shows the results of Table 1 graphically for analysis. Table 2 shows the maximum time for retrieval that is 0.7 sec and practically implementable. For fruit category it is 0.4. Because of simple edge regions, it takes less time for searching and retrieving the images. Experiments have also been performed to record the behavior of retrieval process using color distance only as described in proposed solution. Table 3 and Fig.8 clearly show the difference of precision and recall rates for color and edge features combined and only color features. One can easily see that when color and edge features are combined the retrieval performance is better. Hence when shape features are combined with color features they make a stronger descriptor for image matching, searching and retrieval. Query Image Total Relevant Images Total Retrieved Images Computation Time (sec) Parrot 100 80 0.755 Waterfall 100 80 0.756 Redfort 100 80 0.787 Rose 100 80 0.982 Tiger 100 80 0.734 Sky 100 80 0.798 LandScape 100 80 0.765 Fruit 100 80 0.458 Table 2. Retrieval Time for Different Query Images. 1.2 1 0.8 0.6 0.4 0.2 0 Precision Recall Computation Time (sec) Figure 8. Comparison of Precision and Recall Rates. Journal of Applied Research and Technology 883

Query Image Based on Color+Edge Features Based on Color Features Precision Recall Precision Recall Parrot 0.95 0.76 0.45 0.26 Waterfall 0.85 0.68 0.35 0.18 Redfort 1.00 0.80 0.50 0.20 Rose 0.90 0.72 0.60 0.32 Tiger 0.66 0.27 0.36 0.20 Sky 0.96 0.77 0.56 0.12 LandScape 0.85 0.68 0.45 0.23 Fruit 0.99 0.79 0.79 0.39 Table 3. Comparison of Proposed Method with Retrieval Results of Color Features. 4. Conclusion Proposed solution has been successfully implemented and tested and obtained results are quite satisfactory. System accepts a color image about which no information exists in the system. It then analyzes it and produces similar results. This is a prominent feature of CBIR and of the proposed solution as well. Due to its computation simplicity proposed solution will definitely add an effective elucidation to the research area of CBIR. It has been proved that by combining shape features with color features a strong descriptor for image matching is obtained. References [1] Mehwish R., Muhammad I., Muhammad S. and Mudassar R., Content Based Image Retrieval: Survey, World Applied Sciences Journal, vol. 3, no. 19 (2012), 404-412. [2] Mussarat Y., Muhammad S. and Sajjad M., Use of Low Level Features for Content Based Image Retrieval: Survey, Research Journal of Recent Sciences, vol. 2, no. 11 (2013), 65-75. [3] Mussarat Y., Muhammad S. and Sajjad M., Image Retrieval Techniques Using Shapes of Objects: A Survey, Science International (Lahore), vol. 25, no. 4 (2013), 723-729. [4] Dongkeun K. and Chijung H., Comic Image Retrieval System using Image Features, Information Journal, vol. 16, no. 3 (2013), 1541-1546. [5] N. Vazque et.al, Automatic System for Localization and Recognition of Vehicle Plate Numbers, Journal of Applied Research and Technology, vol. 1, no.1 (2003), 63-77. [6] Kousuke I., Hideo K. and Makoto F., Image Content Detection Method Using Correlation Coefficient between Pixel Value Histograms, Information Journal, vol. 16, no. 3 (2013), 675-680. [7] H. S. Cruz and E. Bribiesca, Polygonal Approximation of Contour Shapes using Corner Detection, Journal of Applied Research and Technology, vol. 7, no. 3 (2009), 275-291 [8] Mussarat Y., Muhammad S., Isma I. and Sajjad M., Powerful Descriptor for Image Retrieval Based on Angle Edge and Histograms, Journal of Applied Research and Technology, vol. 11, no. 5 (2013), 727-732. 884 Vol. 12, October 2014

[9] Mussarat Y., Muhammad S., Sajjad M. and Isma I., Content Based Image Retrieval Using Combined Features of Shape, Color and Relevance Feedback, KSII Transactions on Internet and Information Systems, vol. 7, no. 12 (2013), 3149-3165. [10] Mussarat Y., Sajjad M., Isma I. and Muhammad S., Content Based Image Retrieval by Shape, Color and Relevance Feedback, Life Science Journal, vol. 10, no. 6 (2013), 593-598. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 9 (2003), 1075-1088. [21] James Wang Z., Jia L. and Gio W., SIMPLIcity: Semantics-sensitive Integrated Matching for Picture Libraries, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 23, no. 9 (2001), 947-963. [11] Belongie, S., Malik, J. and Puzicha, J., Shape matching and object recognition using shape contexts, IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 4 (2002), 509 522. [12] Berretti, S., Del Bimbo, A. and Pala, P., Retrieval by shape similarity with perceptual distance and effective indexing, IEEE Trans. Multimedia, vol. 2, no. 4 (2000), 225 239. [13] Tangelder, J. and Veltkamp, R., A survey of content based 3D shape retrieval methods, Multimedia Tools Appl., vol. 39, no. 3 (2008), 441 471. [14] Paquet, E., Rioux, M., Murching, A. and Naveen, T., Description of shape information for 2-D and 3-D objects Signal Process., Image Communication., vol. 16, no. 1 (2000), 103-122. [15] Ju H. and Kai-Kuang M., Fuzzy Color Histogram and Its Use in Color Image Retrieval, IEEE Transactions on Image Processing, vol. 11, no. 8 (2002), 944-952. [16] Wei-Ta C., Wei-Chuan L. and Ming-Syan C., Adaptive Color Feature Extraction Based on Image Color Distributions, IEEE Transactions Image Processing, vol. 19, no. 8 (2010), 2005-2016. [17] Wei J., Guihua E. and Qionghai D., Similarity- Based Online Feature Selection in Content-Based Image Retrieval, IEEE Transactions on Image Processing, vol. 15, no. 3 (2006), 702-712. [18] Asmita D. and Garge P., An Improved Content Based Image Retrieval, ICCCT, (2011), 191-195. [19] Divya R. V. and Dr. Divakar Y., Image Query Based Search Engine Using Image Content Retrieval, International Conference on modeling and Simulation. (2012), 283-286. [20] Jia Li. and James Wang Z., Automatic linguistic indexing of pictures by a statistical modeling approach, Journal of Applied Research and Technology 885