Image Resizing Based on Gradient Vector Flow Analysis

Similar documents
An Improved Image Resizing Approach with Protection of Main Objects

Image Compression and Resizing Using Improved Seam Carving for Retinal Images

2.1 Optimized Importance Map

Image Retargeting for Small Display Devices

Seam-Carving. Michael Rubinstein MIT. and Content-driven Retargeting of Images (and Video) Some slides borrowed from Ariel Shamir and Shai Avidan

Shift-Map Image Editing

Wook Kim. 14 September Korea University Computer Graphics Lab.

International Journal of Mechatronics, Electrical and Computer Technology

Rectangling Panoramic Images via Warping

A Novel Approach to Saliency Detection Model and Its Applications in Image Compression

Fast Non-Linear Video Synopsis

Artificial Mosaics with Irregular Tiles BasedonGradientVectorFlow

Texture Sensitive Image Inpainting after Object Morphing

Similarity criterion for image resizing

Evaluation of regions-of-interest based attention algorithms using a probabilistic measure

Content-Aware Image Resizing

Salient Region Detection and Segmentation in Images using Dynamic Mode Decomposition

Saliency Detection in Aerial Imagery

Salient Region Detection and Segmentation

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Stereo Image Rectification for Simple Panoramic Image Generation

Saliency Detection for Videos Using 3D FFT Local Spectra

GPU Video Retargeting with Parallelized SeamCrop

AUTONOMOUS IMAGE EXTRACTION AND SEGMENTATION OF IMAGE USING UAV S

Lecture #9: Image Resizing and Segmentation

Main Subject Detection via Adaptive Feature Selection

technique: seam carving Image and Video Processing Chapter 9

A MULTI-RESOLUTION APPROACH TO DEPTH FIELD ESTIMATION IN DENSE IMAGE ARRAYS F. Battisti, M. Brizzi, M. Carli, A. Neri

Importance Filtering for Image Retargeting

Fusing Warping, Cropping, and Scaling for Optimal Image Thumbnail Generation

Alignment and Mosaicing of Non-Overlapping Images

Perceptual Grouping from Motion Cues Using Tensor Voting

Image gradients and edges April 11 th, 2017

Image gradients and edges April 10 th, 2018

Improved Seam Carving for Video Retargeting. By Erik Jorgensen, Margaret Murphy, and Aziza Saulebay

SCALE INVARIANT TEMPLATE MATCHING

Salient Region Detection using Weighted Feature Maps based on the Human Visual Attention Model

Human Motion Detection and Tracking for Video Surveillance

Use of Shape Deformation to Seamlessly Stitch Historical Document Images

Hierarchical Representation of 2-D Shapes using Convex Polygons: a Contour-Based Approach

FASA: Fast, Accurate, and Size-Aware Salient Object Detection

arxiv: v1 [cs.cv] 28 Sep 2018

CS 231A Computer Vision (Fall 2012) Problem Set 3

CS 223B Computer Vision Problem Set 3

An Efficient Salient Feature Extraction by Using Saliency Map Detection with Modified K-Means Clustering Technique

Prof. Fanny Ficuciello Robotics for Bioengineering Visual Servoing

Comparative Analysis of Image Compression Using Wavelet and Ridgelet Transform

A Novel Approach to Image Segmentation for Traffic Sign Recognition Jon Jay Hack and Sidd Jagadish

Texture. CS 419 Slides by Ali Farhadi

Image Inpainting by Hyperbolic Selection of Pixels for Two Dimensional Bicubic Interpolations

CHAPTER 3 RETINAL OPTIC DISC SEGMENTATION

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT

A Novel Approach for Saliency Detection based on Multiscale Phase Spectrum

Image gradients and edges

Robust Ring Detection In Phase Correlation Surfaces

Supplementary Materials for Salient Object Detection: A

On the Performances of Computer Vision Algorithms on Mobile Platforms

CSCI 1290: Comp Photo

From Ramp Discontinuities to Segmentation Tree

Detecting motion by means of 2D and 3D information

Detecting Multiple Symmetries with Extended SIFT

Small Object Segmentation Based on Visual Saliency in Natural Images

An Implementation on Histogram of Oriented Gradients for Human Detection

GLOBAL SAMPLING OF IMAGE EDGES. Demetrios P. Gerogiannis, Christophoros Nikou, Aristidis Likas

Enhanced Hemisphere Concept for Color Pixel Classification

How Important is Location in Saliency Detection

Line Segment Based Watershed Segmentation

Focusing Attention on Visual Features that Matter

Colour Segmentation-based Computation of Dense Optical Flow with Application to Video Object Segmentation

Ensemble of Bayesian Filters for Loop Closure Detection

Image Retargetting on Video Based Detection

Implemented by Valsamis Douskos Laboratoty of Photogrammetry, Dept. of Surveying, National Tehnical University of Athens

Snakes reparameterization for noisy images segmentation and targets tracking

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into

Finger Print Enhancement Using Minutiae Based Algorithm

IMAGE RETRIEVAL USING VLAD WITH MULTIPLE FEATURES

WITH the development of mobile devices, image retargeting

Robotics Programming Laboratory

Building Roof Contours Extraction from Aerial Imagery Based On Snakes and Dynamic Programming

An ICA based Approach for Complex Color Scene Text Binarization

Character Recognition

CS 231A Computer Vision (Autumn 2012) Problem Set 1

Optimized Image Resizing Using Seam Carving and Scaling

Artifacts and Textured Region Detection

Robust Frequency-tuned Salient Region Detection

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

PERFORMANCE ANALYSIS OF COMPUTING TECHNIQUES FOR IMAGE DISPARITY IN STEREO IMAGE

IMAGE SALIENCY DETECTION VIA MULTI-SCALE STATISTICAL NON-REDUNDANCY MODELING. Christian Scharfenberger, Aanchal Jain, Alexander Wong, and Paul Fieguth

CHAPTER-4 LOCALIZATION AND CONTOUR DETECTION OF OPTIC DISK

Comparison of Graph Cuts with Belief Propagation for Stereo, using Identical MRF Parameters

A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes

Light Field Occlusion Removal

Tri-modal Human Body Segmentation

Enhanced and Efficient Image Retrieval via Saliency Feature and Visual Attention

Medial Features for Superpixel Segmentation

Sketchable Histograms of Oriented Gradients for Object Detection

Model parametrization strategies for Newton-based acoustic full waveform

Dimensionality Reduction using Relative Attributes

Specular Reflection Separation using Dark Channel Prior

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences

Transcription:

Image Resizing Based on Gradient Vector Flow Analysis Sebastiano Battiato battiato@dmi.unict.it Giovanni Puglisi puglisi@dmi.unict.it Giovanni Maria Farinella gfarinellao@dmi.unict.it Daniele Ravì rav@dmi.unict.it Abstract Content-aware image resizing is an effective technique that allows to take into account the visual content of images during the resizing process. The basic idea beyond these algorithms is the resizing of an image by considering vertical and/or horizontal paths of pixels (i.e., seams) which contain low salient information. In this paper we exploit the Gradient Vector Flow (GVF) of the image to establish the paths to be considered during the resizing. The relevance of each path is derived from a saliency map obtained by considering the magnitude of the GVF associated to the image under consideration. The proposed technique has been tested, both qualitatively and quantitatively, by considering a representative set of images labeled with corresponding salient objects (i.e., ground-truth maps). Experimental results demonstrate that our method preserves crucial salient regions better than other state-of-the-art algorithms. Keywords: Content-Aware Image Resizing, Image Retargeting, Visual Saliency, Gradient Vector Flow 1 Introduction The extensive use of different display devices with different resolution (e.g., on pc, tablet, smartphone, etc.) increases the demand of image resizing techniques which consider the visual content during the scaling process. Standard resizing techniques considering only geometric constraints, such as scaling, can be used only to change the image size (width and height) of a fixed percentage with respect to the original one. Scaling does not take into account the visual importance of pixels during image resizing (i.e., a resizing with respect to only one of the two dimensions introduces artifacts and distortions). Other standard operations, such as cropping, in which outer parts of an image are removed, could produce images with loss of salient information (e.g., objects or part of them). In the last years, several techniques for content-aware image resizing have been proposed [1, 2, 3, 4, 5, 6]. The main aim of a content-aware image resizing is the preservation of relevant visual information into the resized image. Intuitively, the goal is to remove unnoticeable paths of pixels that blend well with their surroundings, and retain the salient pixels. Seam Carving, proposed by Avidan et al. in

[1], is probably the most popular approach to address this problem. Such a technique reduces or expands the image by removing or duplicating connected path of pixels (called seams) having low-energy in the map related to the image to be resized. In [1] different strategies have been compared to compute the energy map (e.g., the entropy energy computed for each pixel taking into account a fixed window, the magnitude of the gradient computed on each pixel, a saliency measure of each pixel computed as in [7], etc.). An interesting and powerful extension of standard resizing operators (i.e., scaling, cropping, etc.) and content-aware based algorithms (i.e., seam carving) can be obtained by their combination, as proposed by Rubinstein et al. in [2]. They propose an algorithm able to search for the optimal sequence of operators to be applied at each step of the resizing to get better results in terms of visual quality of the final reduced image. A drawback of this approach is that the computational complexity increases due to the use of different operators. Among others, patch-based methods have been also proposed for image retargeting or summarization. Cho et al. [3] suggested an algorithm to find an arrangement of patches of the original image that well fit in the resized image, whereas Pritch et al. [4] introduced a method to find the best Shift-Map which defines the pixel displacement useful to produce the output image. Gallea et al. [5] proposed a fast method for image retargeting based on the solution of a linear system. This model aims to find shift values for each line (row/column) preserving the distance among the relevant ones. The linearity of the considered model allows them to elaborate even large images in reasonable computational time. Building on this last technique, in our previous work [6] we have proposed different strategies to be employed for content-aware image resizing on mobile devices. In this paper we propose a novel algorithm for content aware image resizing. The technique exploits the properties of Gradient Vector Flow (GVF) [8] to properly select the seams to be removed, without introducing artifacts in the resized image. Specifically, GVF is used to produce a vector field useful to preserve salient information. The proposed approach has been tested and compared, both qualitatively Figure 1: Input image with its corresponding GVF field overimposed. The seam selected by the proposed resizing approach is shown in red. and quantitatively, with respect to state-of-theart approaches. Experimental results confirm the effectiveness of the proposed approach in terms of preservation of salient regions. The paper is organized as follows: Section 2 presents the proposed image resizing method. In Section 3 the experimental phase and the results are detailed. Finally, conclusions and hints for future works are given in Section 4. 2 PROPOSED METHOD One of the main issues related to the content aware image resizing is the preservation of the salient information contained in the image under analysis. To this aim, our algorithm makes use of the properties of the Gradient Vector Flow (GVF) [8]. GVF is a dense force field [8] useful to solve the classical problems that affect snakes: sensitivity to initialization and poor convergence to boundary concavity. Starting from the gradient of an image, this field is computed through diffusion equations. Formally, GVF is the field F of vectors v = [u, v] that minimizes the following energy function: E = µ(u 2 x + u 2 y + vx 2 + vy) 2 + f 2 v f 2 (1) where the subscripts represent partial derivatives along x and y axes respectively, µ is a regularization parameter, and f is the gradient computed from the intensity of the input image. Due to the above formulation, GVF field values are close to f values where this quantity is large (energy E, to be minimized, is dominated by f 2 v f 2 ), and are slow-varying in ho-

GVF norm Figure 2: An example of seam generation. Among the three possible directions (in red) the one closest to the GV F norm orientation (in blue) is selected. Figure 3: Image Resizing Based on GVF mogeneous regions (the energy E is dominated by the sum of the squares of the partial derivatives of GVF field). An example of GVF field is shown in Fig. 1. GVF is stronger close to the edges of objects within the image. We use this vector field to effectively drive the selection of pixels belonging to a seam. The proposed algorithm works as follows (Fig. 3). Let I be an image with H rows and W columns to be resized with respect to the width. First the GVF and its normalized version GV F norm (i.e., each vector with norm one) are computed from the input image I considering the luminance channel. Several seams {s 1, s 2,..., s K } are then built (i.e., SeamsComputation) starting from the top of the image making use of the directions of the already computed GV F norm. It is worth noting that the directions suggested by GV F norm cannot be always followed. Specifically, considering a generic pixel p of coordinates (i, j) belonging to a seam s k, the next element of s k has to be chosen among (i + 1, j 1), (i + 1, j), (i + 1, j + 1). These pixels can be related to the following unit vectors ( 2/2, 2/2), (0, 1), ( 2/2, 2/2). Among the aforementioned unit vectors associated to a specific direction, the one making the smallest angle with GV F norm (i, j) is hence selected (see Fig. 3). To this aim, a simple dot product between GV F norm (i, j) and the three considered unit vectors is employed. To sum up a generic seam s k is built repeating H 1 times the aforementioned direction selection algorithm starting from a pixel p with coordinates (1, w) at the top of the image (w = 1,..., W at the first iteration of the resizing). The proposed algorithm works similarly for the resizing with respect to the height. A cost function is then associated to each computed seam considering the sum of the GVF magnitude GV F related to the pixels belonging to the seam. Specifically the cost c k of a seam s k is computed as follows (i.e., SeamsCost): c k = GV F (i, j) (2) (i,j) s k For the resizing, the seam with the lower cost c k is hence removed from the image at each iteration (i.e., RemoveSeam). The GVF map is then updated and a new iteration of the seam removal algorithm is performed. It is worth noting that the algorithm described above is repeated for each seam to be removed. Specifically, if N seams have to be removed, the algorithm is repeated N times. 3 EXPERIMENTAL RESULTS The performance of a content-aware image resizing algorithm strongly depends on the adopted energy map which captures the salient regions of an image. As described in previous section, we propose to use GVF to build and select the seams during the resizing. In order to evaluate the results of our approach, we compared it with respect to the classic Seam Carving algorithm proposed by Avidan et al. [1] and with respect to the approach recently proposed by Gallea et al. [5]. The approach in [1] is a local-based approach which takes into account the gradient of the image to select the seams to be removed, whereas the one proposed in [5] is a global-based approach in which an objective function is considered to solve an optimization problem. In [5] the product of the gradient of

the image and the saliency map proposed by Itti et al. [9] is taken into account as energy map during the resizing. To properly assess and compare the different methods, we used 50 images from the image database proposed in [10] 1. The salient object of each image is labeled with 9 bounding boxes marked from nine different users. Hence, for each image a ground-truth map related to the salient object is computed through a voting procedure. Specifically, a pixel in the ground-truth map has a saliency cost equal to n 9 if that pixel belongs to n 9 bounding boxes. So, for each image I of the dataset the computed groundtruth map G I denotes the importance of each pixel of the image. In Fig. 4(a) and Fig. 4(e) are shown respectively an image considered in the experiments and its corresponding ground-truth map (i.e., G I ). Since the aim of content-aware image resizing is to preserve salient regions, in order to evaluate the performances of a specific algorithm A, the following cost function can be taken into account: Cost(I, A) = G I (p) (3) p ψ A where ψ A is the final set of pixels removed by employing the algorithm A during the resizing of the image I, and G I (p) indicates the importance of the removed pixel p. This cost can be used to compare the performances of the different involved algorithms. A lower cost value indicates better performances (i.e., more salient pixels are preserved in the resizing). In Fig. 4 is reported an example of an image resizing at 60% of the width obtained by employing the three considered algorithms. Fig. 4(b), (c), (d) correspond to the maps ψ A (i.e., removed seams) obtained with the three different algorithms, whereas Fig. 4(f), (g), (h) depict the values G I (p) used to compute the cost function in (3) taking into account of one of the three compared algorithms. In Fig. 5 are reported the results obtained by the different algorithms at varying of the percentage of the resizing with respect to width and height. These results have been obtained by averaging the costs computed as in (3) related to the 50 considered images (given a percentage of resizing and a specific 1 Specifically, we selected 50 images from the Image set B. See [10] for more details. (a) (b) (c) (d) (e) (f) (g) (h) Figure 4: (a) original image. (b), (c), (d) show the seams removed employing respectively the proposed approach, Gallea et al. [5] and Avidan et al. [1]. (e) ground-truth saliency map related the image in (a). In (f), (g), (h) are shown the final maps obtained by combining the information in the saliency map shown in (e) and the maps of the removed seams in (b), (c) and (d) respectively. These last maps contain the weights indicating how important each removed pixel is in terms of saliency. The values of these last three maps are used to compute the corresponding saliency costs (i.e., the sum of values for each map) and hence employed to compare the different algorithms. algorithm). The proposed method achieves the best results demonstrating that it more effectively preserves salient areas and hence removes less crucial pixels. In Fig. 6 are shown some visual results obtained by resizing the original images at 60% of the width. A visual comparison reveals that the proposed approach better preserves the main salient regions (i.e., the areas with objects). 4 CONCLUSION AND FUTURE WORKS In this paper we proposed a content-aware image resizing algorithm which exploits information extracted through Gradient Vector Flow to establish the paths to be considered during the resizing of an image. The proposed approach has been compared with respect to state-of-theart algorithms achieving at least comparable visual results, and outperforming existent strategies in terms of preservation of salient regions. Future works will be devoted to improve the selection of the paths extracted through Gradient Vector Flow by coupling this information with different energy maps able to better capture the

Proposed approach Gallea et al. [5] Avidan et al. [1] Proposed approach 0,44 0,40 0,42 0,38 0,40 0,36 0,38 Avidan et al. [1] 0,34 0,36 0,32 0,34 0,30 0,32 0,28 Average cost 0,30 Average cost Gallea et al. [5] 0,28 0,26 0,24 0,22 0,20 0,18 0,26 0,24 0,22 0,20 0,18 0,16 0,16 0,14 0,14 0,12 0,12 0,10 0,10 0,08 0,08 0,06 0,06 0,04 0,04 90% 80% 70% 60% 50% Resizing with respect to height 90% 80% 70% 60% 50% Resizing with respect to width Figure 5: Average cost computed over 50 test images at varying of percentage of resizing. A lower value indicates that more salient pixels are preserved (i.e., better performances). salient regions of an image (e.g., the one proposed by Itti et al. in [9]). Moreover, a blending method will be considered to obtain better visual results for the joining of paths adjacent to the removed seams. 5 ACKNOWLEDGMENTS We would like to thank the authors of [5] for providing us their original implementation. tion on Graphics, 26(3):1 10, 2007. [2] M. Rubinstein, A. Shamir, and S. Avidan. Multi-operator media retargeting. ACM Transaction on Graphics, 28(3):23:1 23:11, 2009. [3] T. Cho, M. Butman, S. Avidan, and W. Freeman. The patch transform and its applications to image editing. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pages 1 8, 2008. [4] Y. Pritch, E. Kav-Venaki, and S. Peleg. Shift - map image editing. In International Conference on Computer Vision (ICCV), pages 151 158, 2009. [5] R. Gallea, E. Ardizzone, and R. Pirrone. Realtime content-aware resizing using reduced linear model. In IEEE International Conference on Image Processing, pages 2813 2816, 2010. [6] S. Battiato, G. M. Farinella, N. Grippaldi, and G. Puglisi. Content-based image resizing on mobile devices. In International Conference on Computer Vision Theory and Applications (VISAPP), pages 1 4, 2012. [7] L. Itti and C. Koch. A comparison of feature combination strategies for saliency-based visual attention systems. Journal of Electronic Imaging, 10:161 169, 1999. [8] C. Xu and L. Prince. Snakes, shapes, and gradient vector flow. IEEE Transactions on Image Processing, 7(3):359 369, 1998. Figure 6: Examples of resizing at 60% of the width. 1st column: original image. 2nd column: proposed approach. 3rd column: Gallea et al. [5]. 4th column: Avidan et al. [1]. References [1] S. Avidan and A. Shamir. Seam carving for content-aware image resizing. ACM Transac- [9] L. Itti, C. Koch, and E. Niebur. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11):1254 1259, 1998. [10] T. Liu, J. Sun, N.-N. Zheng, X. Tang, and H.-Y. Shum. Learning to detect a salient object. In IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), pages 1 8, 2007.