Salient Region Detection and Segmentation in Images using Dynamic Mode Decomposition

Similar documents
A Survey on Detecting Image Visual Saliency

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S

Main Subject Detection via Adaptive Feature Selection

Saliency Detection for Videos Using 3D FFT Local Spectra

PERFORMANCE ANALYSIS OF COMPUTING TECHNIQUES FOR IMAGE DISPARITY IN STEREO IMAGE

Video saliency detection by spatio-temporal sampling and sparse matrix decomposition

Image Compression and Resizing Using Improved Seam Carving for Retinal Images

Robust Frequency-tuned Salient Region Detection

Salient Region Detection and Segmentation

IMAGE SALIENCY DETECTION VIA MULTI-SCALE STATISTICAL NON-REDUNDANCY MODELING. Christian Scharfenberger, Aanchal Jain, Alexander Wong, and Paul Fieguth

2.1 Optimized Importance Map

Saliency Detection Using Quaternion Sparse Reconstruction

Image Resizing Based on Gradient Vector Flow Analysis

FACULTY OF ENGINEERING AND INFORMATION TECHNOLOGY DEPARTMENT OF COMPUTER SCIENCE. Project Plan

International Conference on Information Sciences, Machinery, Materials and Energy (ICISMME 2015)

Detection of Salient Region by Local Spatial Support & High Dimensional Color Transform

Object and Action Detection from a Single Example

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences

A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish

A Novel Approach for Saliency Detection based on Multiscale Phase Spectrum

Dynamic visual attention: competitive versus motion priority scheme

An Efficient Salient Feature Extraction by Using Saliency Map Detection with Modified K-Means Clustering Technique

DETECTION OF IMAGE PAIRS USING CO-SALIENCY MODEL

Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model

An Improved Image Resizing Approach with Protection of Main Objects

Evaluation of regions-of-interest based attention algorithms using a probabilistic measure

Asymmetry as a Measure of Visual Saliency

A New Framework for Multiscale Saliency Detection Based on Image Patches

Small Object Segmentation Based on Visual Saliency in Natural Images

ADAPTIVE LOW RANK AND SPARSE DECOMPOSITION OF VIDEO USING COMPRESSIVE SENSING

Partial Calibration and Mirror Shape Recovery for Non-Central Catadioptric Systems

International Journal of Mechatronics, Electrical and Computer Technology

Saliency Detection: A Spectral Residual Approach

A Hierarchial Model for Visual Perception

Modeling Attention to Salient Proto-objects

IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 14, NO. 4, AUGUST

Data-driven Saliency Region Detection Based on Undirected Graph Ranking

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Hierarchical Saliency Detection Supplementary Material

Supplementary Material for submission 2147: Traditional Saliency Reloaded: A Good Old Model in New Shape

Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations

Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach

Image enhancement for face recognition using color segmentation and Edge detection algorithm

Graph Matching Iris Image Blocks with Local Binary Pattern

Motion Detection Algorithm

Salient Region Extraction for 3D-Stereoscopic Images

Fast and Efficient Saliency Detection Using Sparse Sampling and Kernel Density Estimation

Saliency Detection using Region-Based Incremental Center-Surround Distance

Factorization with Missing and Noisy Data

Focusing Attention on Visual Features that Matter

Hybrid Approach for MRI Human Head Scans Classification using HTT based SFTA Texture Feature Extraction Technique

The most cited papers in Computer Vision

Unsupervised learning in Vision

Design & Implementation of Saliency Detection Model in H.264 Standard

Applying Catastrophe Theory to Image Segmentation

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Supplementary Materials for Salient Object Detection: A

A Novel Approach to Saliency Detection Model and Its Applications in Image Compression

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Short Survey on Static Hand Gesture Recognition

Chapter 7. Conclusions and Future Work

Limitations of Matrix Completion via Trace Norm Minimization

Speaker: Ming-Ming Cheng Nankai University 15-Sep-17 Towards Weakly Supervised Image Understanding

AUTOMATIC SALIENT OBJECT DETECTION IN UAV IMAGERY

The Use of Attention and Spatial Information for Rapid Facial Recognition in Video

SALIENCY CLASSIFICATION OBJECT AND REGION DETECTION USING HDCT METHOD

Multi-View Image Coding in 3-D Space Based on 3-D Reconstruction

LOCAL AND GLOBAL DESCRIPTORS FOR PLACE RECOGNITION IN ROBOTICS

PROBLEM FORMULATION AND RESEARCH METHODOLOGY

Removing Shadows from Images

An ICA based Approach for Complex Color Scene Text Binarization

Adaptive Zoom Distance Measuring System of Camera Based on the Ranging of Binocular Vision

How Important is Location in Saliency Detection

Robustness Test of Discrete Cosine Transform Algorithm in Digital Image Watermarking on Android Platform

Image Retargetting on Video Based Detection

Direction-Length Code (DLC) To Represent Binary Objects

Salient Region Detection via High-Dimensional Color Transform

Suspicious Activity Detection of Moving Object in Video Surveillance System

Sketchable Histograms of Oriented Gradients for Object Detection

COMPARATIVE ANALYSIS OF POWER METHOD AND GAUSS-SEIDEL METHOD IN PAGERANK COMPUTATION

A Hierarchical Compositional System for Rapid Object Detection

Sparse Models in Image Understanding And Computer Vision

KEY FRAMES EXTRACTION USING GRAPH MODULARITY CLUSTERING FOR EFFICIENT VIDEO SUMMARIZATION

Effective Moving Object Detection and Retrieval via Integrating Spatial-Temporal Multimedia Information

Independent Component Analysis (ICA) in Real and Complex Fourier Space: An Application to Videos and Natural Scenes

What is Computer Vision?

A Hybrid Face Detection System using combination of Appearance-based and Feature-based methods

Rendering and Modeling of Transparent Objects. Minglun Gong Dept. of CS, Memorial Univ.

Fabric Defect Detection Based on Computer Vision

Face Detection Using Color Based Segmentation and Morphological Processing A Case Study

Learning video saliency from human gaze using candidate selection

Experimentation on the use of Chromaticity Features, Local Binary Pattern and Discrete Cosine Transform in Colour Texture Analysis

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes

TOWARDS THE ESTIMATION OF CONSPICUITY WITH VISUAL PRIORS

Object Extraction Using Image Segmentation and Adaptive Constraint Propagation

2. LITERATURE REVIEW

Similarity Image Retrieval System Using Hierarchical Classification

Face Quality Assessment System in Video Sequences

Perceptual Quality Improvement of Stereoscopic Images

Reconstructing Images of Bar Codes for Construction Site Object Recognition 1

Transcription:

Salient Region Detection and Segmentation in Images using Dynamic Mode Decomposition Sikha O K 1, Sachin Kumar S 2, K P Soman 2 1 Department of Computer Science 2 Centre for Computational Engineering and Networking Amrita Viswa Vidyapeetham, Coimbatore - 641 112 ABSTRACT Visual Saliency is the capability of vision system to select distinctive parts of scene and reduce the amount of visual data that need to be processed. The present paper introduces (1) a novel approach to detect salient regions by considering color and luminance based saliency scores using Dynamic Mode Decomposition (DMD), (2) a new interpretation to use DMD approach in static image processing. This approach integrates two data analysis methods: (1) Fourier Transform, (2) Principle Component Analysis. The key idea of our work is to create a color based saliency map. This is based on the observation that salient part of an image usually have distinct colors compared to the remaining portion of the image. We have exploited the power of different color spaces to model the complex and nonlinear behavior of human visual system to generate a color based saliency map. To further improve the effect of final saliency map, we utilized luminance information exploiting the fact that human eye is more sensitive towards brightness than color. The experimental results shows that our method based on DMD theory is effective in comparison with previous state-of-art saliency estimation approaches. The approach presented in this paper is evaluated using ROC curve, F-measure rate, Precision-Recall rate, AUC score etc. 1. INTRODUCTION Human visual system is capable of filtering visually distinctive image regions, rapidly and effortlessly (pre-attentive stage). The filtered part is then processed in detail to obtain high level inferences (attentive stage) and better understanding. The part of an image that stands out from the neighborhood and attracts human attentions in the first sight is termed to be salient region. Computationally obtaining salient regions has been investigated by cognitive philosophers [2,3] for the last few decades. And was well appreciated by the computer vision community due to its applicability to tackle complex computer vision problems such as object recognition, video compression, automatic image cropping, scene understanding etc. The term saliency was first introduced by Itty et al. in their work on rapid scene analysis in1998 [1]. They come up with a bottom up saliency model capable of capturing spatial discontinuities in scenes. Following Itty s model, a number of methods have been developed in the literature. Zhang et al. proposed an attention model based on fuzzy logic and local contrast analysis [5]. Achanta et al. adopted luminance and color based features to generate saliency maps [6]. In [7], the authors introduced a graph based approach to compute feature based activation map and is then normalized to generate final saliency score. E Rahtu et al. [8] described a statistical framework based saliency detection by utilizing color and motion information. SVD based [9] visual saliency computes saliency map by regularizing singular values of input image. More Recently, Jiwhan et al. used high dimensional features and introduced a trimap based saliency map generation in [10]. In this paper, we have used DMD approach to detect salient regions in static images. To the best of authors knowledge Static image interpretation of the DMD method has not been explored in literature till date. Dynamic Mode Decomposition (DMD) is a mathematical theory developed by Peter Schmid in 2008 to simulate and understand complex nonlinear systems without knowing the underlying governing equations of the model [4]. DMD can be used to process experimental data collected in snapshots through time to analyze dimensionality or state of the model and can be used to predict the future state. The theory was originally introduced in natural science and fluid mechanics for time-series data based learning. Recently it was adopted by computer vision community for video processing applications. By nature, video frames are equally spaced in time. The pixels of each snapshot can be vectorized to apply DMD approach which is well suitable for video processing applications such as moving object detection. In [11], authors introduced a real time DMD based method for robustly separating video frames into sparse (foreground) and low-rank (background) components.

2. BACKGROUND Static Image Interpretation via DMD method Dynamic Mode Decomposition integrates two of the prominent data analysis methods known today: Fourier Transform and Principle Components. Recently, the theory is adopted to video processing applications as a video sequence is a set of frames that are equally spaced in time and is can be vectorized easily. DMD modes whose Fourier frequencies near to zero (known as zero modes) are considered as background (low-rank component) and those who bounded away from the origin are interpreted as foreground (sparse component). The power of DMD method is still unknown to the static image processing area as there is no interpretation of static images suitable for DMD is available. The work presented in this paper tries to connect Dynamic Mode Decomposition to static image processing applications. The initial objective of the work is to come up with a new interpretation for static images such that it offers an appropriate application for DMD. We have used both color and luminance information to represent a static image as a time-dependent data matrix. To model the nonlinear and highly complex behavior of human visual system, we have exploited the power of different color space representations as each color space provide different color similarity measures. Color based representation M N In our method, an RGB image of size ( ) is first converted into YUV, YCbCr and CIELab color spaces. The idea of color space transformation is to have a better model that is capable of separating Luminance (L) from Chrominance(C), since human visual system is more sensitive towards brightness (luminance) than to color information (chrominance) [12]. We used multiple color space representations to provide a series of sequences for DMD algorithm. The Y and L channels from YUV, YCbCr and CIELab are employed to generate salient regions based on luminance and U, V, Cb, Cr, a and b are analyzed based on color information. A color space study has been done prior to the actual experiment to analyze the effect of different color channels on the salient regions. From this, an interesting phenomenon has been observed that salient region of an image has different degree of obviousness in different color channels. A salient region is always prominent in more than one color channel. Figure 1 shows color space study for 'flower pot' and 'ostrich' images. It is obvious that for the 'flower pot' image, the salient region is clear in a, b, V and Cr channels and for ostrich image a, U, Cb color spaces carries more salient information. In figure 1, the image corresponding to the location of bold character denotes prominent salient region image obtained in each color space. Analyzing the color spaces for several images, we came to a conclusion that the combination of b from CIELab color space, V from YUV color space and Cr from YCbCr color space works well for images similar to 'flowerpot' (background and salient part have clear distinction). Similarly, a from CIELab color space, U from YUV color space and Cb from YCbCr color space works well for images similar to the 'ostrich' image (blurred background). Original Image Color Space Channel 1 Color Space Channel 2 Color Space Channel 3 L a b Y U V

Y Cb Cr L a b Y U V Y Cb Cr Figure 1: Color Space Analysis Based on the above observations, we formulated two matrices 1, 2 mn3 mn3 denoted as column vectors. The columns of matrices 1, 2 mn3 mn3 C C which contains color channels C C 1 b V Cr C 2 a U Cb C are permuted and repeated to form an input for DMD method to generate color based saliency. Repetition of columns are suggested as DMD method gives accurate separation of salient part from the background if the number of columns of the underlying data matrix is as large as possible which has been proved experimentally[13]. Luminance based representation In order to incorporate the luminance information into the DMD approach, we have used singular values computed using SVD (Singular Value Decomposition) of luminance components (L, Y).. We adopt the hypothesis introduced by Xiaolong et al. in [9] that intermediate singular values represents salient part of an image while smaller and larger values represents non salient parts. Figure 2 shows the reconstructed SVD images for different set of singular values based on 'L' color channel of CIELab color space. In Figure 2, (a) shows the original image (b) shows image reconstructed using the largest singular value which contains the background information. Figure 2 (d)-(f) shows images reconstructed using respective combinations of singular values shown at the bottom of the figure. It is evident that starting from figure 2(b) each of the reconstructed image contains more information about the salient object. As the number of singular values used for reconstruction increases, salient region become more clear while the background remains relatively static after a set of iterations. Based on the above observation it is noticed that SVD reconstructed images for various intermediate singular values offers an appropriate application for DMD method.

Each of the reconstructed SVD images are reshaped to 1-dimensional column vectors and are united into single data I x, x, x... x are extracted, where each matrix. Given intermediate singular values, the n1vectors 1 2 3 represents vectorized SVD image. The obtained time-dependent matrix is then fed into DMD algorithm for low ranksparse separation. x i (a) (b) (c ) (d) (e ) (f ) Figure 2. SVD reconstructed image for various singular values (a) Original image (b) Largest singular value (c ) Singular values 3-5 (d) Singular values 3-10 (e) Singular values 3-15 (f) Singular values ( 3-20) 3. RESULTS OBTAINED The dataset used for the evaluations are taken from [14] along with few downloaded natural scene images from several web pages. Obtained results are shown in figure 3. ORIGINAL IMAGE COLOR SALIENCY MAP INTENSITY SALIENCY MAP COMBINED SALIENCY MAP

Figure 3. Results Salient objects are segmented out from the obtained saliency map, and the results are evaluated against state-of-art saliency detection algorithms in terms of ROC curve, F-measure rate, Precision-Recall rate, AUC score etc. The details of the evaluations with entire work will be published soon. 4. CONCLUSION We have presented a Dynamic Mode Decomposition based salient region detection. Saliency detection is considered as a problem of matrix separation into low-rank and sparse. The paper also introduces a new interpretation to use DMD approach in static image processing. This approach integrates two data analysis methods: (1) Fourier Transform, (2) Principle Component Analysis. The experimental results shows that our method based on DMD theory is effective in comparison with previous state-of-art saliency estimation approaches. The approach presented in this paper is evaluated using ROC curve, F-measure rate, Precision-Recall rate, AUC score etc.

5. REFERENCES 1. Itti, Laurent, Christof Koch, and Ernst Niebur. "A model of saliency-based visual attention for rapid scene analysis." IEEE Transactions on pattern analysis and machine intelligence 20.11 (1998): 1254-1259. 2. H. Teuber. Physiological psychology. Annual Review of Psychology,6(1):267 296, 1955. 409 3. J. M. Wolfe and T. S. Horowitz. What attributes guide the deployment of visual attention and how do they do it? Nature Reviews Neuroscience, pages 5:1 7, 2004. 409 4. Schmid, Peter J. "Dynamic mode decomposition of numerical and experimental data." Journal of Fluid Mechanics 656 (2010): 5-28. 5. Ma, Yu-Fei, and Hong-Jiang Zhang. "Contrast-based image attention analysis by using fuzzy growing." Proceedings of the eleventh ACM international conference on Multimedia. ACM, 2003. 6. Achanta, Radhakrishna, et al. "Frequency-tuned salient region detection."computer vision and pattern recognition, 2009. cvpr 2009. ieee conference on. IEEE, 2009. 7. Harel, Jonathan, Christof Koch, and Pietro Perona. "Graph-based visual saliency." Advances in neural information processing systems. 2006. 8. Rahtu, Esa, et al. "Segmenting salient objects from images and videos."computer Vision ECCV 2010. Springer Berlin Heidelberg, 2010. 366-379. 9. Ma, Xiaolong, et al. "Saliency detection based on singular value decomposition." Journal of Visual Communication and Image Representation32 (2015): 95-106. 10. Kim, Jiwhan, et al. "Salient Region Detection via High-Dimensional Color Transform and Local Spatial Support." IEEE Transactions on Image Processing 25.1 (2016): 9-23. 11. Cevher, Volkan, et al. "Compressive sensing for background subtraction."european Conference on Computer Vision. Springer Berlin Heidelberg, 2008. 12. Rudz, Steve, et al. "On the evaluation of segmentation methods for wildland fire." International Conference on Advanced Concepts for Intelligent Vision Systems. Springer Berlin Heidelberg, 2009. 13. Grosek, Jacob. Robust real-time image processing through dynamic mode decomposition. Diss. University of Washington, 2013. 14. Martin, David, et al. "A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics." Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on. Vol. 2. IEEE, 2001.