CS443: Digital Imaging and Multimedia Perceptual Grouping Detecting Lines and Simple Curves

Similar documents
CS 534: Computer Vision Segmentation and Perceptual Grouping

CS 534: Computer Vision Segmentation and Perceptual Grouping

Fitting. Choose a parametric object/some objects to represent a set of tokens Most interesting case is when criterion is not local

6.801/866. Segmentation and Line Fitting. T. Darrell

Computer Vision 6 Segmentation by Fitting

Fitting D.A. Forsyth, CS 543

CS 556: Computer Vision. Lecture 18

HOUGH TRANSFORM CS 6350 C V

What is Computer Vision?

Lecture 15: Segmentation (Edge Based, Hough Transform)

Announcements. Edges. Last Lecture. Gradients: Numerical Derivatives f(x) Edge Detection, Lines. Intro Computer Vision. CSE 152 Lecture 10

Image Segmentation. Selim Aksoy. Bilkent University

Image Segmentation. Selim Aksoy. Bilkent University

Template Matching Rigid Motion. Find transformation to align two images. Focus on geometric features

Issues with Curve Detection Grouping (e.g., the Canny hysteresis thresholding procedure) Model tting They can be performed sequentially or simultaneou

HOUGH TRANSFORM. Plan for today. Introduction to HT. An image with linear structures. INF 4300 Digital Image Analysis

Lecture 9: Hough Transform and Thresholding base Segmentation

Types of Edges. Why Edge Detection? Types of Edges. Edge Detection. Gradient. Edge Detection

Fitting. Lecture 8. Cristian Sminchisescu. Slide credits: K. Grauman, S. Seitz, S. Lazebnik, D. Forsyth, J. Ponce

CS334: Digital Imaging and Multimedia Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University

CS534: Introduction to Computer Vision Edges and Contours. Ahmed Elgammal Dept. of Computer Science Rutgers University

Computer Vision Grouping and Segmentation. Grouping and Segmentation

Segmentation and Grouping April 19 th, 2018

Generalized Hough Transform, line fitting

Multimedia Computing: Algorithms, Systems, and Applications: Edge Detection

Computer Vision 5 Segmentation by Clustering

Template Matching Rigid Motion

Finding 2D Shapes and the Hough Transform

Solution: filter the image, then subsample F 1 F 2. subsample blur subsample. blur

CHAPTER 6 PERCEPTUAL ORGANIZATION BASED ON TEMPORAL DYNAMICS

Edge linking. Two types of approaches. This process needs to be able to bridge gaps in detected edges due to the reason mentioned above

Segmentation by Clustering

Introduction. Chapter Overview

Segmentation by Clustering. Segmentation by Clustering Reading: Chapter 14 (skip 14.5) General ideas

CAP 5415 Computer Vision Fall 2012

Outline. Segmentation & Grouping. Examples of grouping in vision. Grouping in vision. Grouping in vision 2/9/2011. CS 376 Lecture 7 Segmentation 1

Segmentation by Clustering Reading: Chapter 14 (skip 14.5)

Edge detection. Convert a 2D image into a set of curves. Extracts salient features of the scene More compact than pixels

CMPSCI 670: Computer Vision! Grouping

OBJECT detection in general has many applications

Grouping and Segmentation

CS443: Digital Imaging and Multimedia Binary Image Analysis. Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University

CS 664 Segmentation. Daniel Huttenlocher

Feature Detectors and Descriptors: Corners, Lines, etc.

Model Fitting: The Hough transform II

Straight Lines and Hough

Circular Hough Transform

Optimal Grouping of Line Segments into Convex Sets 1

Model Fitting: The Hough transform I

human vision: grouping k-means clustering graph-theoretic clustering Hough transform line fitting RANSAC

GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES. D. H. Ballard Pattern Recognition Vol. 13 No

CS534 Introduction to Computer Vision Binary Image Analysis. Ahmed Elgammal Dept. of Computer Science Rutgers University

In this lecture, we are going to talk about image segmentation, essentially defined as methods for grouping pixels together.

E0005E - Industrial Image Analysis

Grouping and Segmentation

Segmentation & Clustering

EE795: Computer Vision and Intelligent Systems

Segmentation and Grouping April 21 st, 2015

Image Processing and Image Analysis VU

An edge is not a line... Edge Detection. Finding lines in an image. Finding lines in an image. How can we detect lines?

CS 2770: Computer Vision. Edges and Segments. Prof. Adriana Kovashka University of Pittsburgh February 21, 2017

Symmetry Based Semantic Analysis of Engineering Drawings

Elaborazione delle Immagini Informazione Multimediale. Raffaella Lanzarotti

CS 534: Computer Vision Segmentation II Graph Cuts and Image Segmentation

Good Continuation of Boundary Contour

Fitting: Voting and the Hough Transform April 23 rd, Yong Jae Lee UC Davis

Feature Matching and Robust Fitting

Review for the Final

From Szymon Rusinkiewicz, Princeton

Other Linear Filters CS 211A

The goals of segmentation

Fitting: The Hough transform

Segmentation and low-level grouping.

Why study Computer Vision?

WHAT YOU SHOULD LEARN

SEGMENTATION USING CLUSTERING METHODS

Thanks to Chris Bregler. COS 429: Computer Vision

Final Exam Assigned: 11/21/02 Due: 12/05/02 at 2:30pm

Lecture 16 Segmentation and Scene understanding

Chapter 11 Arc Extraction and Segmentation

DETECTION AND ROBUST ESTIMATION OF CYLINDER FEATURES IN POINT CLOUDS INTRODUCTION

Segmentation and Grouping

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

2D rendering takes a photo of the 2D scene with a virtual camera that selects an axis aligned rectangle from the scene. The photograph is placed into

Fitting: The Hough transform

Lecture 10: Semantic Segmentation and Clustering

Example 1: Regions. Image Segmentation. Example 3: Lines and Circular Arcs. Example 2: Straight Lines. Region Segmentation: Segmentation Criteria

Let be a function. We say, is a plane curve given by the. Let a curve be given by function where is differentiable with continuous.

Example 2: Straight Lines. Image Segmentation. Example 3: Lines and Circular Arcs. Example 1: Regions

Lecture 8: Fitting. Tuesday, Sept 25

CPSC 425: Computer Vision

ATIP A Tool for 3D Navigation inside a Single Image with Automatic Camera Calibration

Edge Detection. EE/CSE 576 Linda Shapiro

Extracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang

Edge Detection. Announcements. Edge detection. Origin of Edges. Mailing list: you should have received messages

2D image segmentation based on spatial coherence

CS 4495 Computer Vision A. Bobick. Motion and Optic Flow. Stereo Matching

Lecture 16: Computer Vision

ECG782: Multidimensional Digital Signal Processing

Fitting: The Hough transform

Transcription:

CS443: Digital Imaging and Multimedia Perceptual Grouping Detecting Lines and Simple Curves Spring 2008 Ahmed Elgammal Dept. of Computer Science Rutgers University Outlines Perceptual Grouping and Segmentation Detecting Lines with Hough Transform Detecting Circles and Ellipses. Sources: Burger and Burge Digital Image Processing Chapter 9 Fosyth and Ponce Computer Vision a Modern approach 1

Mid-level vision Vision as an inference problem: Some observation/measurements (images) A model Objective: what caused this measurement? What distinguishes vision from other inference problems? A lot of data. We don t know which of these data may be useful to solve the inference problem and which may not. Which pixels are useful and which are not? Which edges are useful and which are not? Why do these tokens belong together? It is difficult to tell whether a pixel (token) lies on a surface by simply looking at the pixel 2

Segmentation Can we achieve a compact and suggestive representation of the interesting image data that emphasizes the properties that make it interesting Segmentation Grouping Perceptual organization Fitting What is interesting and what is not depends on the application General ideas tokens whatever we need to group (pixels, points, surface elements, etc., etc.) top down segmentation tokens belong together because they lie on the same object bottom up segmentation tokens belong together because they are locally coherent Grouping (or clustering) collect together tokens that belong together Fitting associate a model with tokens issues which model? which token goes to which element? how many elements in the model? 3

Segmentation Different problems same problem: segmentation Summarizing a video: segment a video into shots, find coherent segments in the video, find key frames Finding machine parts: finding lines, circles, Finding people: find body segments, find human motion patterns Finding buildings from aerial imagery: find polygonal regions, line segments Searching a collection of images: find coherent color, texture regions, shape Segmentation Segmentation is a big topic We will look into: Segmentation by clustering: Forming image segments: How to decompose the image into superpixels image regions that are coherent in color and texture Shape of the region is not that important while segmenting Segmentation by model fitting: Fitting lines and curves to edge points: Which points belong to which line, how many lines? What about more complicated models, e.g. fitting a deformable contour! 4

Segmentation as Clustering Objective: Which components of a data set naturally belong together Partitioning Decomposition: Starting from a large data set how to partition it into pieces given some notion of association between data items Decompose an image into regions that have coherent color and texture Decompose a video sequence into shots Grouping Collect sets of data item that make sense together given our notion of association Collect together edge segments that seems to belong to a line Question: what is our notion of association? One view of segmentation is that it determines which component of the image form the figure and which form the ground. What is the figure and the background in this image? 5

Grouping and Gestalt Gestalt: German for form, whole, group Laws of Organization in Perceptual Forms (Gestalt school of psychology) Max Wertheimer 1912-1923 there are contexts in which what is happening in the whole cannot be deduced from the characteristics of the separate pieces, but conversely; what happens to a part of the whole is, in clearcut cases, determined by the laws of the inner structure of its whole Muller-Layer effect: This effect arises from some property of the relationships that form the whole rather than from the properties of each separate segment. Grouping and Gestalt Can we write down a series of rules by which image elements would be associated together and interpreted as a group? What are the factors that makes a set of elements to be grouped Human vision uses these factors in some way 6

Not Grouped Proximity: Tokens that are nearby tend to be grouped together. Similarity: Similar tokens tend to be grouped together. Similarity Common Fate: Tokens that have coherent motion tend to be grouped together Common Region: Tokens that lie inside the same closed region tend to be grouped together Parallelism: Parallel curves or tokens tend to be grouped together Symmetry: Curves that lead to symmetric groups are grouped together Continuity: Tokens that lead to continuous curves tend to be grouped Closure: Tokens or curves that tend to lead to closed curves tend to be grouped together. 7

Familiar configuration: tokens that, when grouped, lead to a familiar object tend to be grouped 8

Occlusion appears to be a very important cue in grouping Illusory contours: tokens are grouped together because they provide a cue to the presence of an occluding object 9

Detecting lines and simple curves Finding salient structures How to group collections of edge points (or corners) belonging to the same line or curve? Search for globally apparent structures that inherently express certain common shape features Many man-made objects exhibits simple geometric forms: lines, circles, ellipses. 10

Hough Transform Goal: find which tokens belongs to which objects Hough transform: localizing any shape that can be defined parametrically within a distribution of points (Paul Hough) Example: lines, circles, ellipses. 11

12

13 All lines passing through an image point (x i,y i ) are characterized by a line in the parameter space with equation:

Accumulator array: a discrete representation of the parameter space as a 2D array Given an image point, we increment all the points on it s corresponding line in the parameter space A line in the image will be the intersection of multiple lines in the parameter space. A better line representation The Hessian Normal Form (HNF) of a line gives a better representation. Why? HNF: Parameter space is defined by r and theta 14

tokens votes 15

tokens votes 16

Mechanics of the Hough transform Construct an array representing θ, r For each point, render the curve (θ, r) into this array, adding one at each cell Difficulties how big should the cells be? (too big, and we cannot distinguish between quite different lines; too small, and noise causes lines to be missed) How to find peaks? Threshold Non-maxima suppression How many lines? count the peaks in the Hough array Who belongs to which line? tag the votes Hardly ever satisfactory in practice, because problems with noise and cell size defeat it 17

18

Circles and ellipses Better Idea: If we know the radius, the locations of all possible centers lie on a circle 19

For a given point (u,v), at each plane ρ i along the ρ axis, a circle centered at u,v with radius ρ i is rendered Detecting Ellipses? An ellipse has five dimensional parameter space! There are better ways! Generalized Hough transform 20

21 Circles and ellipses