Image Classification Using Wavelet Coefficients in Low-pass Bands

Similar documents
Effects of the Number of Hidden Nodes Used in a. Structured-based Neural Network on the Reliability of. Image Classification

An Improved Algorithm for Learning Long-Term Dependency Problems in Adaptive Processing of Data Structures

Neural Network based textural labeling of images in multimedia applications

Efficient Image Compression of Medical Images Using the Wavelet Transform and Fuzzy c-means Clustering on Regions of Interest.

IMAGE DIGITIZATION BY WAVELET COEFFICIENT WITH HISTOGRAM SHAPING AND SPECIFICATION

TEXT DETECTION AND RECOGNITION IN CAMERA BASED IMAGES

Wavelet Based Image Retrieval Method

A 3-D Virtual SPIHT for Scalable Very Low Bit-Rate Embedded Video Compression

FEATURE EXTRACTION TECHNIQUES FOR IMAGE RETRIEVAL USING HAAR AND GLCM

Wavelet Transform in Face Recognition

Graph Matching Iris Image Blocks with Local Binary Pattern

Texture Analysis of Painted Strokes 1) Martin Lettner, Paul Kammerer, Robert Sablatnig

Image Resolution Improvement By Using DWT & SWT Transform

A NEW ROBUST IMAGE WATERMARKING SCHEME BASED ON DWT WITH SVD

A combined fractal and wavelet image compression approach

TEVI: Text Extraction for Video Indexing

Fingerprint Recognition using Texture Features

A Miniature-Based Image Retrieval System

CORRELATION BASED CAR NUMBER PLATE EXTRACTION SYSTEM

Change Detection in Remotely Sensed Images Based on Image Fusion and Fuzzy Clustering

Stripe Noise Removal from Remote Sensing Images Based on Stationary Wavelet Transform

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Adaptive Quantization for Video Compression in Frequency Domain

Evolved Multi-resolution Transforms for Optimized Image Compression and Reconstruction under Quantization

Multi-focus Image Fusion Using Stationary Wavelet Transform (SWT) with Principal Component Analysis (PCA)

Self-Organizing Maps for cyclic and unbounded graphs

Comparative Evaluation of DWT and DT-CWT for Image Fusion and De-noising

IMAGE PROCESSING USING DISCRETE WAVELET TRANSFORM

Content Based Image Retrieval Using Combined Color & Texture Features

Real-time target tracking using a Pan and Tilt platform

HANDWRITTEN GURMUKHI CHARACTER RECOGNITION USING WAVELET TRANSFORMS

TEXTURE SEGMENTATION USING AREA MORPHOLOGY LOCAL GRANULOMETRIES

Time Series Prediction as a Problem of Missing Values: Application to ESTSP2007 and NN3 Competition Benchmarks

Automatic Texture Segmentation for Texture-based Image Retrieval

A Boosting-Based Framework for Self-Similar and Non-linear Internet Traffic Prediction

Region Based Image Fusion Using SVM

Color-Based Classification of Natural Rock Images Using Classifier Combinations

A Comparative Study of Conventional and Neural Network Classification of Multispectral Data

Latest development in image feature representation and extraction

ADAPTIVE TEXTURE IMAGE RETRIEVAL IN TRANSFORM DOMAIN

ADVANCED IMAGE PROCESSING METHODS FOR ULTRASONIC NDE RESEARCH C. H. Chen, University of Massachusetts Dartmouth, N.

Texture classification using convolutional neural networks

Handwritten Script Recognition at Block Level

A Survey on Feature Extraction Techniques for Palmprint Identification

A Data Classification Algorithm of Internet of Things Based on Neural Network

Object detection using non-redundant local Binary Patterns

Beyond Bags of Features

Biometric Security System Using Palm print

Image Compression Algorithm for Different Wavelet Codes

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

WAVELET USE FOR IMAGE CLASSIFICATION. Andrea Gavlasová, Aleš Procházka, and Martina Mudrová

An Image Coding Approach Using Wavelet-Based Adaptive Contourlet Transform

Satellite Image Processing Using Singular Value Decomposition and Discrete Wavelet Transform

Visible and Long-Wave Infrared Image Fusion Schemes for Situational. Awareness

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Robust color segmentation algorithms in illumination variation conditions

Wavelet Transform (WT) & JPEG-2000

Efficient Content Based Image Retrieval System with Metadata Processing

Fast and Memory Efficient 3D-DWT Based Video Encoding Techniques

7.1 INTRODUCTION Wavelet Transform is a popular multiresolution analysis tool in image processing and

Image Fusion Using Double Density Discrete Wavelet Transform

Dynamic Clustering of Data with Modified K-Means Algorithm

New wavelet based ART network for texture classification

Ensemble of Bayesian Filters for Loop Closure Detection

A New Feature Local Binary Patterns (FLBP) Method

ROBUST WATERMARKING OF REMOTE SENSING IMAGES WITHOUT THE LOSS OF SPATIAL INFORMATION

A Novel Statistical Distortion Model Based on Mixed Laplacian and Uniform Distribution of Mpeg-4 FGS

Embedded Descendent-Only Zerotree Wavelet Coding for Image Compression

Content-based Image Retrieval using Image Partitioning with Color Histogram and Wavelet-based Color Histogram of the Image

Discrete Wavelet Transform in Face Recognition

Application of partial differential equations in image processing. Xiaoke Cui 1, a *

Projection of undirected and non-positional graphs using self organizing maps

Detection, Classification, Evaluation and Compression of Pavement Information

CHAPTER 6 DETECTION OF MASS USING NOVEL SEGMENTATION, GLCM AND NEURAL NETWORKS

An Improved Images Watermarking Scheme Using FABEMD Decomposition and DCT

A Wavelet-based Feature Selection Scheme for Palm-print Recognition

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Interactive Video Retrieval System Integrating Visual Search with Textual Search

Optimizing the Deblocking Algorithm for. H.264 Decoder Implementation

Ranking web pages using machine learning approaches

Query by Fax for Content-Based Image Retrieval

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

DWT-SVD based Multiple Watermarking Techniques

Story Unit Segmentation with Friendly Acoustic Perception *

Research Fellow, Korea Institute of Civil Engineering and Building Technology, Korea (*corresponding author) 2

EDGE DETECTION IN MEDICAL IMAGES USING THE WAVELET TRANSFORM

Comparative Analysis of 2-Level and 4-Level DWT for Watermarking and Tampering Detection

D.A. Karras 1, S.A. Karkanis 2, D. Iakovidis 2, D. E. Maroulis 2 and B.G. Mertzios , Greece,

The Vehicle Logo Location System based on saliency model

Yui-Lam CHAN and Wan-Chi SIU

INCREASING CLASSIFICATION QUALITY BY USING FUZZY LOGIC

DOI: /jos Tel/Fax: by Journal of Software. All rights reserved. , )

Gabor Surface Feature for Face Recognition

CHAPTER 3 DIFFERENT DOMAINS OF WATERMARKING. domain. In spatial domain the watermark bits directly added to the pixels of the cover

Image Processing, Analysis and Machine Vision

Video De-interlacing with Scene Change Detection Based on 3D Wavelet Transform

A Quantized Transform-Domain Motion Estimation Technique for H.264 Secondary SP-frames

AN EFFICIENT VIDEO WATERMARKING USING COLOR HISTOGRAM ANALYSIS AND BITPLANE IMAGE ARRAYS

Visual object classification by sparse convolutional neural networks

Feature Based Watermarking Algorithm by Adopting Arnold Transform

Transcription:

Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August -7, 007 Image Classification Using Wavelet Coefficients in Low-pass Bands Weibao Zou, Member, IEEE, and Yan Li, Member, IEEE Abstract In this paper, a method based on wavelet coefficients in low-pass bands is proposed for the image classification with adaptive processing of data structures to organize a large image database. After an image is decomposed by wavelet, its features can be characterized by the distribution of histograms of wavelet coefficients. The coefficients are respectively projected onto x and y directions. For different images, the distribution of histograms of wavelet coefficients in low-pass bands is substantially different. However, the one in high-pass bands is not as different, which makes the performance of classification not reliable. This paper presents a method for image classification based on wavelet coefficients in low-pass bands only. Images are arranged into a tree structure. The nodes can then be represented by the distribution of histograms of these wavelet coefficients. 940 images derived from seven categories are used for image classification. Based on the wavelet coefficients in low-pass bands, the improvement of classification rate on the training data set is up to %, and the improvement of classification rate on the testing data set reaches 0%. Experimental results show that our proposed approach for image classification is more effective and reliable. T I. INTRODUCTION he various applications of image processing on image content representation has drawn much attention in the past decade. A set of keywords is used to represent image contents in most of the systems. However, keyword-based taxonomy reduces the system complexity while increasing the requirements of users' knowledge. On the other hand, much research has been conducted on representing image contents with visual features. Most of the applications represent images using low level visual features, such as color, texture, shape and spatial layout in a very high dimensional feature space, either globally or locally. However, the most popular distance metrics, for example, Euclidean distance, cannot guarantee that the contents are similar even their visual features are very close in the high dimensional feature space. Feature-based representation of images using wavelet potentially offers an attractive solution to this problem. It involves a number of features extracted from raw images based on wavelet []. The features of images, such as edges of Weibao Zou is with the Centre for Computing & Data Simulation, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Floor 3, Block A, Nanshan Medical Instruments Industry Park, 3rd Gongye Road, Shekou, Shenzhen, China (email: hkzouwb@yahoo.com). Yan Li is with the Department of Mathematics and Computing, The University of Southern Queensland, QLD 4350, Australia (e-mail: liyan@usq.edu.au). an object, can be projected by the wavelet coefficients in low-pass and high-pass sub-bands []. The statistic of wavelet coefficients in low-pass bands is obviously different for different images. However, in high-pass bands, there is no big difference as shown later in this paper. This will make the classification unreliable. This study conducts a systematic investigation into the improvement of image classification using wavelet coefficients in low-pass bands only with structure-based neural network to organize a large image database. The region-based representation of image contents is a potentially attractive solution to the problem of image processing and scene classification. Features and their spatial relationship among them play more important roles in characterizing image contents, because they convey more semantic meanings. Therefore, the image content can be better described by a tree-structure representation [3] [4]. Adaptive processing of data structures using neural networks is of paramount importance for structural pattern recognition and classification [5]. The main motivation of this adaptive processing is that neural networks are able to classify static information or temporal sequences and to perform automatic inferring or learning [6-8]. Sperduti and Starita have proposed supervised neural networks for classification of data structures [9]. This approach is based on using generalized recursive neurons and the Backpropagation Through Structure (BPTS) algorithm [5] [9] to encode data structures recursively that a copy of the same neural network is used to encode every node of the tree structure. Cho et al. improved the BPTS algorithm by optimizing the free learning parameters of the neural network in the node representation by using least-squares-based optimization methods in a layer-by-layer fashion, which not only makes convergence speed fast, but the long-term dependency problem is also overcome [3]. In this study, the improved BPTS algorithm is used. By organizing wavelet coefficients into a tree representation, image contents can be represented more comprehensively at various levels of details, which will be beneficial for image classification []. This section is followed by an introduction of the decomposition of an image by wavelet. The representation of an image and its feature sets for neural network are then described. The data sets for classification are given in Section V and the results of performance are reported and analyzed in Section VI. The conclusions are drawn in the final section. -444-380-X/07/$5.00 007 IEEE

II. DECOMPOSITION OF AN IMAGE WITH WAVELET In order to extract the features of an image, as an application of the wavelet decomposition performed by a computer, a discrete wavelet transform is applied and is summarized in Fig.. The pyramid structure is also shown in Fig. and its block chart shown in Fig.. From Fig., it is noted that the original image (or previously low-pass results at level j ) can be decomposed into a low-pass band ( L j+ ) and a high-pass band ( H ) through filters at level j+ j +. Through a low-pass filter and a high-pass filter, the low-pass band ( ) is again decomposed into a low-pass band ( ) and a high-pass band ( LH ). The high-pass band ( ) is j+ H j + decomposed into a low-pass band ( HL j ) and a high-pass + band ( ) through filters. Sub-bands, and j+ LH j+ HL j+ represent the characteristics of the image in the horizon, j+ Lj+ LL j+ vertical, and diagonal views, respectively. The local details of an image, such as edges of objects, are preserved in high-pass bands. The basic energy of the image is preserved in LL j. + We will refer to the four sub-bands created at each level of the decomposition as LL (Low/Low), (Low/High), j + LH j+ HL (High/Low), and (High/High). j + j + tree structure shown in Fig. 3. It is seen from Fig. 3(b) that, at level one ( L ), the original image (the top image) is decomposed into four bands. The leftmost hand side in the line of L is the low-pass band ( LL ), and the three right ones are the high-pass bands ( LH, and ). At level two HL ( L ), the previous low-pass band is decomposed into four bands again. The leftmost hand side in the line of L is the low-pass band ( LL ), and the other three ones are the high-pass bands ( LH, and ). Obviously, it can be HL found that the basic energy and the edges of the object can be observed in low-pass bands and high-pass bands at each level, respectively. The same happens in Fig. 3(c). (a) A pyramid structural representation; I 3LL I 3HL I HL I 3LH I 3 I LH I I HL I 0HL I LH I L LH I 0 I 0 L Fig.. The pyramid structure of decomposing an image L Decimate columns by L j+ by by LL j+ LH j+ at level j+ (b) A tree representation of a decomposed image containing a boy Original image or previous low-pass results, at level j H Decimate columns by H j+ by by HL j+ j+ L represents the convolution of the input image by the filter Fig.. Block chart of wavelet decomposition of an image III. REPRESENTATIONS OF A DECOMPOSED IMAGE An image decomposed by wavelet can be represented by a L (c) A tree representation of a decomposed image containing grass Fig. 3. Representation of an image decomposed by wavelet

IV. DESCRIPTION OF FEATURE SETS WITH HISTOGRAMS OF WAVELET COEFFICIENTS The feature sets used as attributes for neural networks consist of wavelet coefficients. It is observed that wavelet coefficients corresponding to the edges of an object are greater than a certain value. Therefore, a threshold can be set. The wavelet coefficients which are greater than the threshold are selected into a feature set. These coefficients are projected onto the x-axis and y-axis, respectively. The edges of the object can be described by the distribution of the projections of wavelet coefficients, which are statistic by histograms with 8 bins in the x direction and 8 bins in the y direction. For example, for the high frequency bands, their features are represented by the histograms in x-axis and y-axis after wavelet coefficients are projected. A similar approach is applied to the low frequency bands. Actually, such histograms can effectively represent the outline of objects. However, for the original image, its histograms are the projections of the grey values of pixels. The tree representation of an image described by histograms of projections of wavelet coefficients is illustrated in Fig. 4. Therefore, it naturally leads to an idea that only the histograms of projections of wavelet coefficients in low-pass bands be used for classification could make the results more reliable. V. DATA SETS In order to prove the efficiency of the proposed method, a series of images are taken. There are seven categories of original images, namely, building, beetle, fish, grass, people, warship and vehicle, used in the experiments. In each category of images, there are ten independent images and five of them are shown in Fig. 5. (a) (b) (c) (d) (a) Histogram corresponding to Fig.3 (b) (e) (f) (g) Fig. 5. Images for experiments: (a) Building; (b) Beetle; (c) Fish; (d) Grass; (e) People; (f) Warship; (g) Vehicle. (b) Histogram corresponding to Fig.3 (c) Fig. 4. Histograms of projections of wavelet coefficients arranged in tree structures Figs. 4(a) and (b) are the histograms corresponding to Figs. 3(b) and 3(c), respectively. Features described by wavelet coefficients can greatly improve the image classification []. However, from Figs. 4(a) and 4(b), it is clear that the histograms of projections of wavelet coefficients in high-pass bands are not very different. This will not benefit the performance of the classification. On the contrary, the histograms in low-pass bands are obviously different. There are two parts in performing image classification. One is for training the neural network, and the other is for testing. In order to make findings more reliable, many more images are generated by translating the original images. For example, a window is set up for an original image, and it shifts from left to right, then top to bottom. Of course, the object must be contained in the window. But, the background is changing with the shifting window. Corresponding to each window, a new image is generated. Therefore, from each image, 40 new images are generated from this translation operation. The information of images is tabulated in Table I. Together with the original images, 470 images are used for training the neural network and another 470 images are used 3

for testing. There are two groups of experiments for comparison. Different features are adopted in each group. In the first group, the wavelet coefficients in low-pass and high-pass bands are used together as feature sets. In the second group, the wavelet coefficients in low-pass bands only are used as features. The wavelet coefficients in high-pass bands are set to zeroes. The neural network adopted is based on structure improved by Cho et al. and the details can be referred to [3]. Category of image TABLE I IMAGE DATABASES FOR EXPERIMENTS The number of original image in each category The number of derived images from an independent image The number of images in each category after image translation Building 0 40 400 Beetle 0 40 400 Fish 0 40 400 Grass 0 40 400 People 0 40 400 Warship 0 40 400 Vehicle 0 40 400 Total number 70 80 800 VI. RESULTS AND ANALYSIS A. Results of the Performance The results of the classification are reported in this section. A vector of 6 inputs defined by histograms derived by the proposed method is the attributes of all the nodes in the tree. In this investigation, a single hidden-layer of the neural network is sufficient for the neural classifier, which has 6 input nodes and 7 output nodes. Since the number of hidden nodes affects the result, a series of different numbers of the hidden nodes are tested in the experiments in order to get the best result. The classification rates on training set and testing set are shown in Tables II and III. A graphical presentation of Table II is shown in Fig. 6 and Table III in Fig. 7. TABLE II CLASSIFICATION RESULTS WITH DIFFERENT HIDDEN NODES ON TRAINING SET Number of hidden nodes Classification rate on training set with wavelet coefficients (%) Classification rate on training set with wavelet coefficients in low pass-band only (%) Improve d rate (%) 0 8 9 0.98 3 87 9 5.75 5 88 96 9.09 7 90 94 4.44 9 89 94 5.6 wavelet coefficients is 90%. Obviously, the former is better than the latter and the rate is greatly improved. It is also noticed that when the number of hidden nodes is 5, the classification rate by wavelet coefficients in low-pass band only is the best. TABLE III CLASSIFICATION RESULTS WITH DIFFERENT HIDDEN NODES ON TESTING SET Number of hidden nodes Classification rate on testing set with wavelet coefficients (%) Classification rate on testing set with wavelet coefficients in low-pass band only (%) Improv ed rate (%) 0 74 89 0.7 3 86 87.6 5 85 9 8.4 7 87 90 3.45 9 85 89 4.7 With the trained neural network, the testing is carried out with another set of images not included in our training set. From Table III and Fig. 7, it is seen that the classification rate with wavelet coefficients in low-pass bands only is up to 9%. For 7 hidden nodes, its classification rate is the second best. With all wavelet coefficients, the rate is 87%. For 5 hidden nodes, the latter classification rate is the best. Obviously, based on the proposed method, the neural network is strong enough for classifying many categories of images and the classification rate is greatly improved. Fig. 6 Variation of image classification rate with different hidden nodes on training set B. Analysis of the Results From Table II and Fig. 6, it is observed that the classification rate on training set with wavelet coefficients in low-pass bands only is up to 96% while the rate with all Fig. 7 Variation of image classification rate with different hidden nodes on testing set 4

VII. CONCLUSION A method based on wavelet coefficients in low-pass bands only is proposed for the image classification with adaptive processing of data structures to organize a large image database in this paper. We describe the classification features including features extracted with wavelet, histogram of projections of wavelet coefficients and tree structural image representation. We use 6 bins of histograms as the input attributes for a neural network. There are 470 images used for training the neural network. Based on the proposed method, the classification rate on training set is improved by %. Using such a trained neural network, with another 470 images, the testing is carried out and its classification rate is improved by 0%. From the results, it is found that: () the classification rate can be greatly improved using wavelet coefficients in low-pass bands only; () the effects of the number of hidden nodes on the classification rate is significant but non-linear. When the number of hidden nodes is 5, optimal results can be achieved. There is a trade-off between the classification rate and the computational complexity, of course. The above conclusions are based on the experiments with structured-based neural network. [0] C. Goller and A. Kuchler, Learning Task-dependent distributed representations by Back-propagation Through Structure, Proc. IEEE Int. Conf. Nerual Networks, 996, pp. 347-35. [] S. Cho, Z. Chi, Z. Wang and W. Siu, An Efficient Learning Algorithm for Adaptive Processing of Data Structure, Neural Processing Letters vol. 7, pp. 75-90, 003. [] F. Costa, P. Frasconi, V. Lombardo and G. Soda, Towards Incremental Parsing of Natural Language Using Recursive Neural Networks, Applied Intelligence, vol.. 9, n.-, pp.9-5. ACKNOWLEDGMENT The work described in this paper was partly supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project no. PolyU 59/03E). REFERENCES [] A. Cohen and R. D. Ryan: Wavelets and Multiscale Signal Processing. Chapman & Hall Press, 995. [] W. Zou, K. C. Lo and Z. Chi, Structured-based Neural Network Classification of Images Using Wavelet Coefficients, Proceedings of Third International Symposium on Neural Networks (ISNN), Lecture Notes in Computer Science 397: Advances in Neural Networks-ISNN006, Chengdu, China, May 8-June, 006, part II, pp. 33-336, Springer-Verlag. [3] S. Y. Cho, Z. R. Chi, W. C. Siu and A. C. Tsio, An improved algorithm for learning long-term dependency problems in adaptive processing of data structures, IEEE Trans. On Neural Network, vol. 4, pp 78-793, July, 003. [4] S. Y. Cho and Z. R. Chi, Genetic evolution processing of data structures for image classification, IEEE Trans. On Knowledge and Data Engineering, vol. 7, pp. 6-3, Feb. 005. [5] C. L. Giles and M. Gori, Adaptive Processing of Sequences and Data Structures, Springer, Berlin New York, 998. [6] A. C. Tsoi: Adaptive Processing of Data Structures: An Expository Overview and Comments. Technical report, Faculty of Informatics, University of Wollongong, Australia, 998. [7] A. C. Tsoi, M. Hangenbucnher, Adaptive Processing of Data Structures, Keynote Speech, Proceedings of Third International Conference on Computational Intelligence and Multimedia Applications (ICCIMA '99), 999, - (summary). [8] P. Frasconi, M. Gori and A. Sperduti, A General Framework for Adaptive Processing of Data Structures, IEEE Trans. on Neural Networks vol. 9, pp 768-785, 998. [9] A. Sperduti and A. Starita, Supervised Neural Networks for Classification of Structures, IEEE Trans. on Neural Networks, vol. 8(3), pp. 74-735, 997. 5