Computer Vision Grouping and Segmentation. Grouping and Segmentation

Similar documents
CS443: Digital Imaging and Multimedia Perceptual Grouping Detecting Lines and Simple Curves

Feature Detectors and Descriptors: Corners, Lines, etc.

CS 534: Computer Vision Segmentation and Perceptual Grouping

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University

Fitting. Choose a parametric object/some objects to represent a set of tokens Most interesting case is when criterion is not local

CS 664 Segmentation. Daniel Huttenlocher

6.801/866. Segmentation and Line Fitting. T. Darrell

CS 534: Computer Vision Segmentation and Perceptual Grouping

Outline. Advanced Digital Image Processing and Others. Importance of Segmentation (Cont.) Importance of Segmentation

Photo by Carl Warner

Model Fitting, RANSAC. Jana Kosecka

CS 664 Image Matching and Robust Fitting. Daniel Huttenlocher

6. Object Identification L AK S H M O U. E D U

Fitting. Fitting. Slides S. Lazebnik Harris Corners Pkwy, Charlotte, NC

Lecture 9: Hough Transform and Thresholding base Segmentation

University of Florida CISE department Gator Engineering. Clustering Part 4

EE795: Computer Vision and Intelligent Systems

Fitting D.A. Forsyth, CS 543

Image Processing. Bilkent University. CS554 Computer Vision Pinar Duygulu

Clustering Part 4 DBSCAN

10/03/11. Model Fitting. Computer Vision CS 143, Brown. James Hays. Slides from Silvio Savarese, Svetlana Lazebnik, and Derek Hoiem

Prof. Kristen Grauman

Image Analysis Lecture Segmentation. Idar Dyrdal

Fitting: The Hough transform

Model Fitting. Introduction to Computer Vision CSE 152 Lecture 11

Estimate Geometric Transformation

Feature Matching and Robust Fitting

EE 701 ROBOT VISION. Segmentation

Uncertainties: Representation and Propagation & Line Extraction from Range data

Binary Image Processing. Introduction to Computer Vision CSE 152 Lecture 5

Fitting a transformation: Feature-based alignment April 30 th, Yong Jae Lee UC Davis

Lecture 15: Segmentation (Edge Based, Hough Transform)

COMPUTER AND ROBOT VISION

Announcements, schedule. Lecture 8: Fitting. Weighted graph representation. Outline. Segmentation by Graph Cuts. Images as graphs

Model Fitting: The Hough transform I

human vision: grouping k-means clustering graph-theoretic clustering Hough transform line fitting RANSAC

Multi-stable Perception. Necker Cube

Vision par ordinateur

Spatial Outlier Detection

Experiments with Edge Detection using One-dimensional Surface Fitting

CHAPTER 1 Introduction 1. CHAPTER 2 Images, Sampling and Frequency Domain Processing 37

INFO0948 Fitting and Shape Matching

Feature Extraction and Image Processing, 2 nd Edition. Contents. Preface

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis

University of Florida CISE department Gator Engineering. Clustering Part 5

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

SECTION 5 IMAGE PROCESSING 2

Normalized cuts and image segmentation

RANSAC and some HOUGH transform

Nonparametric estimation of multiple structures with outliers

EE795: Computer Vision and Intelligent Systems

Prof. Noah Snavely CS Administrivia. A4 due on Friday (please sign up for demo slots)

Lecture 6: Edge Detection

Fitting: The Hough transform

Object Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision

Final Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm

Nonparametric estimation of multiple structures with outliers

Sensor Tasking and Control

Morphological Image Processing

CSE 527: Introduction to Computer Vision

HOUGH TRANSFORM CS 6350 C V

Features. Places where intensities vary is some prescribed way in a small neighborhood How to quantify this variability

Variational Methods II

Instance-level recognition

Fitting: The Hough transform

Fitting. Lecture 8. Cristian Sminchisescu. Slide credits: K. Grauman, S. Seitz, S. Lazebnik, D. Forsyth, J. Ponce

Instance-level recognition

Motion Tracking and Event Understanding in Video Sequences

Introduction to Medical Imaging (5XSA0) Module 5

A NEW HYBRID DIFFERENTIAL FILTER FOR MOTION DETECTION

Experimental Data and Training

Unsupervised Learning and Clustering

Leow Wee Kheng CS4243 Computer Vision and Pattern Recognition. Robust Methods. CS4243 Robust Methods 1

Image Analysis Image Segmentation (Basic Methods)

CS231A Midterm Review. Friday 5/6/2016

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

[ ] Review. Edges and Binary Images. Edge detection. Derivative of Gaussian filter. Image gradient. Tuesday, Sept 16

Lecture 8 Fitting and Matching

Lecture 9 (4.2.07) Image Segmentation. Shahram Ebadollahi 4/4/ DIP ELEN E4830

Image segmentation. Stefano Ferrari. Università degli Studi di Milano Methods for Image Processing. academic year

Homography estimation

Lecture 4: Finding lines: from detection to model fitting

Technology Assignment: Limits at Infinity

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

2D image segmentation based on spatial coherence

Data Mining. Part 2. Data Understanding and Preparation. 2.4 Data Transformation. Spring Instructor: Dr. Masoud Yaghini. Data Transformation

DETECTION AND ROBUST ESTIMATION OF CYLINDER FEATURES IN POINT CLOUDS INTRODUCTION

Detecting and Identifying Moving Objects in Real-Time

Albert M. Vossepoel. Center for Image Processing

Robotics Programming Laboratory

Targil 12 : Image Segmentation. Image segmentation. Why do we need it? Image segmentation

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

Data Clustering Hierarchical Clustering, Density based clustering Grid based clustering

EECS 556 Image Processing W 09. Image enhancement. Smoothing and noise removal Sharpening filters

ECLT 5810 Data Preprocessing. Prof. Wai Lam

Last update: May 4, Vision. CMSC 421: Chapter 24. CMSC 421: Chapter 24 1

Edge Detection Lecture 03 Computer Vision

CS 231A Computer Vision (Winter 2014) Problem Set 3

Transcription:

Computer Vision Grouping and Segmentation Professor Hager http://www.cs.jhu.edu/~hager Grouping and Segmentation G&S appear to be one of the early processes in human vision They are a way of *organizing* image content into semantically related groups In some applications, segmentation is the crucial step (e.g. some types of aerial image interpretation). We ll just touch on a few simple approaches of image grouping and segmentation We ll also present a couple of useful geometric grouping methods. 1

Grouping Grouping is the process of associating similar image features together Grouping Grouping is the process of associating similar image features together The Gestalt School: Proimity:tokens that are nearby tend to be grouped. Similarity:similar tokens tend to be grouped together. Common fate:tokens that have coherent motion tend to be grouped together. Common region:tokens that lie inside the same closed region tend to be grouped together. Parallelism:parallel curves or tokens tend to be grouped together. Closure:tokens or curves that tend to lead to closed curves tend to be grouped together. Symmetry:curves that lead to symmetric groups are grouped together. Continuity:tokens that lead to continuous (as in joining up nicely, rather than in the formal sense): curves tend to be grouped. Familiar Conguration:tokens that, when grouped, lead to a familiar object,tend to be grouped together 2

3

4

The Hough Transform How can we detect (group) etended line structures in images? y m b y = m + b 5

The Hough Transform How can we detect etended line structures in images? b y m b = - m + y Hough Transform Idea Each edge point in an image is a constraint on line parameters: constraint is a line each unique point adds another constraint Algorithm: Initialize a 2-D array of counters to zero. For each edge point (,y), increment any counter which contains a parameter point (b,m) satisfying b = - m + y Threshold counters Group edgels that belong to above threshold counters into contours Optional: Refit lines to get higher precision. 6

Hough Transform Variation Problem: m and b are, in principle, unbounded rewrite as sin(t) + cos(t) y = d For each discrete value of t from 0 to pi, increment counter with nominal d minimal distance from eact value coarse grid leads to grouping of distinct lines in post-fitting stage, re-hough at higher resolution Generalizations Any linear in parameters model: e.g a 2 + b y 2 = 1 can use the same algorithm For f(,a) = 0, choose any cell a c s.t. f(,a c ) < t for some threshold t. Limitations: the curse of dimensionality RANSAC: A More General Method The Hough transform has the nice feature that it provides a method for detecting geometric structure in clutter. The HT suffers from the curse of dimensionality. RANSAC (Random Sample Consensus) is a robust estimation technique that avoids (to some degree) the curse of dimensionality 7

RANSAC: General Idea How to find the line in this? RANSAC: General Idea Basic observation: if we were to choose the right pair of points 8

RANSAC: General Idea Basic observation: if we were to choose the right pair of points then it would be fairly obvious that many other points would agree Algorithm: For some number of tries N pick a pair of points fit a line to these points count the number of other pts that agree Return: line parameters with best agreement RANSAC: General Idea Algorithm: For some number of tries N pick a pair of points fit a line to these points count the number of other pts that agree Return: line parameters with best agreement Questions: 1) How many points 2) What do we mean agree 9

RANSAC # of points is N = log(1-p)/log(1-(1-ε) s ) assumes sampling is independent p is the probability one of the samples is all inliers s is sample size (2 for us) ε is percentage of outliers Consensus: Assume the points (inliners) are contaminated with Gaussian noise Distance to the line is a χ 2 m model where (in our case) m =1 Choose a probability threshold from χ 2 1 giving a distance t= 3.82 σ2 for α =.95 inliner if distance 2 < t 2 outlier if distance 2 >= t 2 Image Segmentation: A Brief Overview Segmentation criteria region group and counting 10

Segmentation: Definitions An affinity measure d(r 1,R 2 ) Æ or a homogeneity measure m(r) note possibly d(r 1,R 2 ) = m(r 1 ) M(R 2 ) An threshold τ (could operate on distance or homogeneity) A region definition (e.g. square tiles) A neighborhood definition 4 neighbors 0 8 neighbors 0 Choose an image criterion c Simple Thresholding Compute a binary image by b(i,j) = 1 if c(i(i,j)) > t; 0 otherwise Perform cleanup operations (image morphology) Perform grouping Compute connected components and/or statistics thereof 11

An Eample: Motion Detecting motion: Detecting motion: Thresholded Motion 50 Candidate areas for motion 12

A Closer Look Color: A Second Eample Color representation DRM [Klinker et al., 1990]: if P is Lambertian, has matte line and highlight line User selects matte piels in R Compute first and second order statistics of cluster T Decompose ellipsoid ( S, R, T) of variance of matte cluster Color similarity γ ( I(, y)) is defined by Mahalanobis distance S 1 R T ( I(, y) T) 13

Homogeneous Color Region: Photometry Homogeneous Region: Photometry Sample PCA-fitted ellipsoid 14

Binary Image Processing After thresholding an image, we want to know something about the regions found... How many objects are in the image? Where are the distinct object components? Cleaning up a binary image? Connected Component Labeling 0 1 Goal: Label contiguous areas of a segmented image with unique labels 0 1 2 4 neighbors vs. 8 neighbors 15

Connected Component Labeling 0 1 0 1 2 Algorithm 1. Image is A. Let A = -A; 2. Start in upper left and work L to R, Top to Bottom, looking for an unprocessed (-1) piel. 3. When one is found, change its label to the net unused integer. Relabel all of that piel s unprocessed neighbors and their neighbors recursively. 4. When there are no more unprocessed neighbors, resume searching at step 2 -- but do so where you left off the last time. Morphological Operators summarized Let S t be the translation of a set of piels S by t. S t = { + t S } The dilation of a binary image A by a mask S is then A S = UA b b S The erosion of a binary image A by a mask S is A S = { + b A, b S} The closing of a binary image A by S is A S = (A S) S The opening of a binary image A by S is A S = (A S) S 16

The Poor Man s Closing Note that median (or more generally any order statistic) filtering is one way of achieving similar effects. On binary images this can be also implemented using the averaging filter Limitations of Thresholding A uniform threshold may not apply across the image It measures the uniformity of regions (in some sense), but doesn t eamine the inter-relationship between regions. Local disturbances can break up nominally consistent regions note hysteresis thresholding is one solution to this!! Later in the course we ll return to talk about methods around this 17