Detecting Salient Contours Using Orientation Energy Distribution. Part I: Thresholding Based on. Response Distribution

Similar documents
RELATIONSHIP BETWEEN SUSPICIOUS COINCIDENCE IN NATURAL IMAGES AND CONTOUR-SALIENCE IN ORIENTED FILTER RESPONSES. A Thesis SUBRAMONIA P.

Schedule for Rest of Semester

A SYNOPTIC ACCOUNT FOR TEXTURE SEGMENTATION: FROM EDGE- TO REGION-BASED MECHANISMS

Artifacts and Textured Region Detection

TEXTURE ANALYSIS USING GABOR FILTERS

CMPSCI 670: Computer Vision! Grouping

MRI Brain Image Segmentation Using an AM-FM Model

Color Image Segmentation

Efficient Visual Coding: From Retina To V2

Image pyramids and their applications Bill Freeman and Fredo Durand Feb. 28, 2006

Texture Similarity Measure. Pavel Vácha. Institute of Information Theory and Automation, AS CR Faculty of Mathematics and Physics, Charles University

Segmentation and Grouping April 19 th, 2018

C. Poultney S. Cho pra (NYU Courant Institute) Y. LeCun

Region-based Segmentation

EE 701 ROBOT VISION. Segmentation

SPARSE CODES FOR NATURAL IMAGES. Davide Scaramuzza Autonomous Systems Lab (EPFL)

Segmentation. Bottom Up Segmentation

Local qualitative shape from stereo. without detailed correspondence. Extended Abstract. Shimon Edelman. Internet:

ICA mixture models for image processing

Solution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur

SuRVoS Workbench. Super-Region Volume Segmentation. Imanol Luengo

Computational Models of V1 cells. Gabor and CORF

Filtering Images. Contents

An ICA based Approach for Complex Color Scene Text Binarization

Robotics Programming Laboratory

Stacked Denoising Autoencoders for Face Pose Normalization

CAP 5415 Computer Vision Fall 2012

A Model of Dynamic Visual Attention for Object Tracking in Natural Image Sequences

FACE RECOGNITION USING INDEPENDENT COMPONENT

Combining Top-down and Bottom-up Segmentation

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

The goals of segmentation

Oriented Filters for Object Recognition: an empirical study

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

TEXTURE ANALYSIS USING GABOR FILTERS FIL

Targil 12 : Image Segmentation. Image segmentation. Why do we need it? Image segmentation

IDENTIFYING GEOMETRICAL OBJECTS USING IMAGE ANALYSIS

CS4442/9542b Artificial Intelligence II prof. Olga Veksler

Edge Detection (with a sidelight introduction to linear, associative operators). Images

Peripheral drift illusion

Edges Are Not Just Steps

Statistical image models

3D Object Recognition: A Model of View-Tuned Neurons

Image Segmentation. Selim Aksoy. Bilkent University

CS 4495 Computer Vision. Segmentation. Aaron Bobick (slides by Tucker Hermans) School of Interactive Computing. Segmentation

A Quantitative Approach for Textural Image Segmentation with Median Filter

Adaptive Wavelet Image Denoising Based on the Entropy of Homogenus Regions

A Hierarchial Model for Visual Perception

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Locating ego-centers in depth for hippocampal place cells

Texture. Announcements. 2) Synthesis. Issues: 1) Discrimination/Analysis

6.801/866. Segmentation and Line Fitting. T. Darrell

Analysis and Synthesis of Texture

Computational Photography Denoising

Edge Detection Techniques in Processing Digital Images: Investigation of Canny Algorithm and Gabor Method

Recovering Intrinsic Images from a Single Image

Computer Vision I. Announcement. Corners. Edges. Numerical Derivatives f(x) Edge and Corner Detection. CSE252A Lecture 11

Image segmentation using an annealed Hopfield neural network

Image Analysis. Edge Detection

Outline. Segmentation & Grouping. Examples of grouping in vision. Grouping in vision. Grouping in vision 2/9/2011. CS 376 Lecture 7 Segmentation 1

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

Edges and Binary Images

Image Analysis - Lecture 5

Survey on Multi-Focus Image Fusion Algorithms

What is an edge? Paint. Depth discontinuity. Material change. Texture boundary

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

Fuzzy Inference System based Edge Detection in Images

Concepts in. Edge Detection

Robust contour extraction and junction detection by a neural model utilizing recurrent long-range interactions

Image Segmentation. Srikumar Ramalingam School of Computing University of Utah. Slides borrowed from Ross Whitaker

Computer Vision 2. SS 18 Dr. Benjamin Guthier Professur für Bildverarbeitung. Computer Vision 2 Dr. Benjamin Guthier

MORPH-II: Feature Vector Documentation

Image Segmentation. Selim Aksoy. Bilkent University

Motion Detection Algorithm

A Remote Sensing Image Segmentation Method Based On Spectral and Texture Information

Segmentation and Grouping

Texture. COS 429 Princeton University

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Texture. Texture is a description of the spatial arrangement of color or intensities in an image or a selected region of an image.

A Simplified Texture Gradient Method for Improved Image Segmentation

Segmentation & Grouping Kristen Grauman UT Austin. Announcements

Main Subject Detection via Adaptive Feature Selection

CS 534: Computer Vision Texture

A Computational Approach To Understanding The Response Properties Of Cells In The Visual System

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Digital Image Processing. Prof. P.K. Biswas. Department of Electronics & Electrical Communication Engineering

Nicolai Petkov Intelligent Systems group Institute for Mathematics and Computing Science

A Modified Approach to Biologically Motivated Saliency Mapping

A New Feature Local Binary Patterns (FLBP) Method

Motion and Optical Flow. Slides from Ce Liu, Steve Seitz, Larry Zitnick, Ali Farhadi

VIDEO DENOISING BASED ON ADAPTIVE TEMPORAL AVERAGING

Segmentation Computer Vision Spring 2018, Lecture 27

Towards direct motion and shape parameter recovery from image sequences. Stephen Benoit. Ph.D. Thesis Presentation September 25, 2003

Color-Texture Segmentation of Medical Images Based on Local Contrast Information

Discovering Visual Hierarchy through Unsupervised Learning Haider Razvi

CHAPTER 9 INPAINTING USING SPARSE REPRESENTATION AND INVERSE DCT

CS 534: Computer Vision Texture

3 October, 2013 MVA ENS Cachan. Lecture 2: Logistic regression & intro to MIL Iasonas Kokkinos

Computer Vision I - Filtering and Feature detection

IMAGE DENOISING TO ESTIMATE THE GRADIENT HISTOGRAM PRESERVATION USING VARIOUS ALGORITHMS

Transcription:

Detecting Salient Contours Using Orientation Energy Distribution The Problem: How Does the Visual System Detect Salient Contours? CPSC 636 Slide12, Spring 212 Yoonsuck Choe Co-work with S. Sarma and H.-C. Lee Based on Lee and Choe (23); Sarma (23); Sarma and Choe (26) 1 Neurons in the visual cortex have Gabor-like receptive fields. Looking at the response properties of these neurons can help us answer the question. The simplest statistical property can be measured by looking at the response histogram. Questioning from a slightly different perspective, how can the particular response property of visual cortical neurons be utilized by later processing? 2 Observation Grayscale intensity distributions are quite different across different images. Part I: Thresholding Based on However, Gabor response distributions are quite similar across different images. Response Distribution 3 4

A Typical Grayscale Image Grayscale Intensity Distribution 1 8 yosemite7 histogram 1 8 bird8 histogram 1 8 rocky1 histogram 6 6 6 4 4 4 2 2 2 5 1 15 2 25 3 gray-level 5 1 15 2 25 3 gray-level 5 1 15 2 25 3 gray-level Grayscale intensity histograms are drastically different across different images. Although not evident from the above, the intensity histogram can be widely different across different images. Thus, a general algorithm for utilizing the intensity distribution cannot be easily derived. 5 6 A Typical Gabor Response (Orientation Energy) High values near contours or edges. The energy distribution is strikingly uniform across images. 7 log probability Gabor Response Distribution 1 1 1 1 1 1.1.1.1.1 1e-5 E distributions 1e-6 1 1 1 1 log(e) The Gabor response (or orientation energy; E) distributions on the other hand are quite similar across differen images (shown in Log-Log plot). The distribution shows a power law property (f(x) = 1/x a ): sharp peak and heavy tail. 8 f e d c b a

Summary So Far What to Make of the Power Law? h(e) g(e) Input * * = Response Probability log Probability LGN V1 L1 L2 L1 L2 E log E ( a) linear scale ( b) log log scale Intensity Distribution Response Distribution Input and response disrtibutions show quite different statistical properties. 9 Comparing the power law distribution with a normal distribution with the same variance can provide us with some information. Assumption: normal distribution can be a suitable standard. The point L2 where h(e) becomes greater than g(e) may be important, i.e., orientation energy is suspiciously high. 1 Exploiting the Power Law in E L2 4 35 3 25 2 15 1 5 human chosen thresholds vs. L2 5 1 15 2 fit human chosen thresholds High orientation energy E indicate a strong edge component in images. Can there be a relationship between the threshold of E above which humans see it as salient and the point L2? Clearly, there is a linear relationship between the two! 11 Further Discoveries: L2 and σ standard deviation of E 14 12 1 8 6 4 2 L2 vs. SD of E 5 1 15 2 25 3 Further, the raw standard deviation σ of the orientation energy L2 distribution is linearly related to L2. Question: Is there an analytical solution to 1/x a = b exp( x 2 /c), where the constants a, b, and c depend on σ? 12 fit

Using σ to Estimate Optimal E Threshold Application: Thresholding E SD of E vs. human chosen thresholds human chosen thresholds 14 fit 12 1 8 6 4 2 2 4 6 8 1 12 standard deviation of E Relating σ back to the human-chosen E threshold gives again a linear relation: T σ = 1.37σ 2176.59. Thus, instead of calculating the histogram, etc., we can simply calculate the raw standard deviation σ to estimate the appropriate E threshold. 13 Original, human-selected, 85-percentile, and T σ. 14 Extraction of Salient Edges Thresholding E: Limitations of Fixed Percentile (a) Original Image (b) Thresholded Edges (c) Magnified (b) Using T σ as a threshold gives good results, comparable to humans preference. 15 Original, human-selected, 85-percentile, and T σ. 16

Thresholding E: Limitations of Global Thresholding Summary of Thresholding Results Fixed percentile thresholding does not give consistent results. The σ-based T σ threshold works well. However, globally applying the same threshold has limitations. This problem can be overcome by applying the same principle derived here to calculate the local thresholds. The proposed method is an efficient way of detecting salient contours. Original, human-selected, 85-percentile, T σ, and T σ local. 17 18 Approach Part II: Quantitative Comparison Generate synthetic image (left) with known salient edges (right) and compare the thresholded version to this ground truth. Add noise and vary number of objects to make it interesting. 19 2

Compared Thresholding Methods Experiment I: Vary Noise Level Global OED: threshold derived from OED from the entire response matrix, using the normal distribution baseline. Local OED: threshold derived from OED from the local surrounding area of a pixel in the response matrix. using the normal distribution baseline. Global 85%: threshold derived from OED from the entire (a) Input (b) Orientation Energy (c) Reference response matrix, using the 85-percentile point as the threshold. Local 85%: threshold derived from OED from the local surrounding area of a pixel in the response matrix, using the 85-percentile point as the threshold. (d) Global OED (e) Global 85% (f ) Local OED (g) Local 85% 21 22 Experiment I: Vary Noise Level Experiment II: Vary Input Count 7.5e+2 7e+2 Sum of Squared Error 6.5e+2 6e+2 5.5e+2 Global OED Global 85 Local OED Experiments Local 85 (a) Input (b) Orientation Energy (c) Reference Global OED Global 85% Local OED Local 85% Global OED X < (p=.18) < (p=.35) < (p=.21) Global 85% X X > (p=.19) > (p=.14) Local OED X X X < (p=.25) Local 85% X X X X 23 (d) Global OED (e) Global 85% (f ) Local OED (g) Local 85% 24

Experiment II: Vary Input Count Summary Sum of Squared Error 1.7e+22 1.6e+22 1.5e+22 1.4e+22 1.3e+22 1.2e+22 1.1e+22 1e+22 9e+21 Thresholding based on orientation energy distribution (OED) did consistently better than fixed-percentile methods in noisy, synthetic images. Differences between glocal and local OED thresholding were not significant. Global OED Global 85 Local OED Experiments Local 85 Global OED Global 85% Local OED Local 85% Global OED X < (p=.17) > (p=.258) < (p=.14) Global 85% X X > (p=.11) > (p=.6) Local OED X X X < (p=.15) Local 85% X X X X 25 26 Error in Thresholding (Both Experiments): The global and local OED-derived method have significantly smaller sum of squared error values than the fixed 85-percentile methods. Relationship Between T σ Thresholding and Suspicious Coincidence What is the relationship between salience defined as Part III: Analysis super-gaussian and the conventional definition of suspiciousness (Barlow 1994, 1989)? P (A, B) > P (A)P (B). 27 28

White-Noise Analysis Gabor Response to White Noise Images 1.1 h(e) g(e) log Probability.1.1.1 If the Gaussian baseline assumption was correct, the E response distribution to white noise images should not be perceived as salient compared to a Gaussian with the same variance. In white-noise images, each pixel is independent, so, given pixel A and pixel B: P (A, B) = P (A)P (B). 1e-5 1e-6 1 1 1 log(e) The orientation energy distribution is very close to a Gaussian, especially near the high E values. Thus, the T σ thresholding will result in no salient contours in white noise images. 29 3 Use of White Noise Response as a Baseline Can we use the white-noise response as a baseline for thresholding E?: Yes! Generate white noise response, and scale it by σ h /σ r where σ h and σ r are the STD in the natural image response and the white noise response. Recalculate the response distribution (if necessary). New Baseline for Salience vs. Humans Threshold_Human (x1) 14 linear fit 12 1 8 6 4 2 5 1 15 2 25 3 35 4 45 5 55 New_L2 (x1) New L 2 vs. Human Chosen Threshold (r =.98) Strong linearity is found between the new L 2 and the human selected threshold. This is much tighter than the Gaussian baseline (r =.91)! 31 32

New Baseline for Salience vs. σ Related Work Sigma (x1) 14 linear fit 12 1 8 6 4 2 5 1 15 2 25 3 35 4 45 5 55 New_L2 (x1) New L 2 vs. σ (r =.91) The same linearity between L 2 and the σ is maintained. Malik et al. (Malik et al. 1999) used peak values of orientation energy to define boundaries of regions of coherent brightness and texture. The non-gaussian nature of orientation energy (or wavelet response) histograms has also been recognized and utilized for some time now, especially in denoising and compression (Simoncelli and Adelson 1996). Other kinds of histograms, e.g., spectral histogram by Liu and Wang (22), or spatial frequency distributions (Field 1987), may be amenable to a similar analysis. 33 34 Discussion The local (or even global) threshold calculation can be easily implemented in a neural network. σ 2 = i,j w ij g(v ij ), Summary Gaussian baseline was found to have a close relationship to the idea of suspicious coincidence by Barlow (1994) where w ij are connection weights serving as normalization constants, g(x) = x 2, and V ij is the V1 response at location i, j. The resulting value can be passed through another activation function f(x) = x. These are all plausible functions that can be implemented in a biological neural network. 35 36

Lesson Learned Conclusion Studying statistical properties of raw natural signal distributions can be useful in determining why the visual system is structured in the current form (e.g., PCA, ICA, etc. predicts the receptive field shape). However, what s more interesting is that the response properties of cortical neurons can have certain invariant properties and this can be exploited. So, we need to go beyond finding out what receptive fields look like and why, and start to explore how cortical neuron response can be utilized by the rest of the brain. Cortical response distribution has a unique invariant property (the power-law). Such properties can be exploited in tasks such as salient contour detection. Gaussian distribution forms a good baseline for determining the threshold. The above may be related to the idea of suspicious coincidence. 37 38 References Barlow, H. (1994). What is the computational goal of the neocortex? In Koch, C., and Davis, J. L., editors, Large Scale Neuronal Theories of the Brain, 1 22. Cambridge, MA: MIT Press. Sarma, S. P. (23). Relationship between suspicious coincidence in natural images and contour-salience in oriented filter responses. Master s thesis, Department of Computer Science, Texas A&M University. Simoncelli, E. P., and Adelson, E. H. (1996). Noise removal via bayesian wavelet coring. In Proceedings of IEEE International Conference on Image Processing, vol. I, 379 382. Barlow, H. B. (1989). Unsupervised learning. Neural Computation, 1:295 311. Field, D. J. (1987). Relations between the statistics of natural images and the response properties of cortical cells. Journal of the Optical Society of America A, 4:2379 2394. Lee, H.-C., and Choe, Y. (23). Detecting salient contours using orientation energy distribution. In Proceedings of the International Joint Conference on Neural Networks, 26 211. IEEE. Liu, X., and Wang, D. (22). A spectral histogram model for texton modeling and texture discrimination. Vision Research, 42:2617 2634. Malik, J., Belongie, S., Shi, J., and Leung, T. K. (1999). Textons, contours and regions: Cue integration in image segmentation. In ICCV(2), 918 925. Sarma, S., and Choe, Y. (26). Salience in orientation-filter response measured as suspicious coincidence in natural images. In Gil, Y., and Mooney, R., editors, Proceedings of the 21st National Conference on Artificial Intelligence(AAAI 26), 193 198. 38-1 38-2