Recognizing Deformable Shapes. Salvador Ruiz Correa (CSE/EE576 Computer Vision I)

Similar documents
Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. Thesis, Electrical Engineering

Recognizing Deformable Shapes. Salvador Ruiz Correa UW Ph.D. Graduate Researcher at Children s Hospital

Recognizing Deformable Shapes. Salvador Ruiz Correa Ph.D. UW EE

High-Level Computer Vision

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 28, NO. 1, JANUARY Symbolic Signatures for Deformable Shapes

Discriminating deformable shape classes

A New Paradigm for Recognizing 3-D Object Shapes from Range Data

Compu&ng Correspondences in Geometric Datasets. 4.2 Symmetry & Symmetriza/on

A Learning Approach to 3D Object Representation for Classification

A New Paradigm for Recognizing 3-D Object Shapes from Range Data

3D Models and Matching

The SIFT (Scale Invariant Feature

3D Models and Matching

Correspondence. CS 468 Geometry Processing Algorithms. Maks Ovsjanikov

Patch-based Object Recognition. Basic Idea

Categorization by Learning and Combining Object Parts

Visual Object Recognition

3D object recognition used by team robotto

Component-based Face Recognition with 3D Morphable Models

Shape Matching. Brandon Smith and Shengnan Wang Computer Vision CS766 Fall 2007

Part based models for recognition. Kristen Grauman

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

Local features: detection and description. Local invariant features

Announcements. Recognition (Part 3) Model-Based Vision. A Rough Recognition Spectrum. Pose consistency. Recognition by Hypothesize and Test

Recognition (Part 4) Introduction to Computer Vision CSE 152 Lecture 17

Component-based Face Recognition with 3D Morphable Models

Category-level localization

Structured light 3D reconstruction

ARTICLE IN PRESS. Pattern Recognition

Recognizing Objects by Matching Oriented Points

Bridging the Gap Between Local and Global Approaches for 3D Object Recognition. Isma Hadji G. N. DeSouza

The correspondence problem. A classic problem. A classic problem. Deformation-Drive Shape Correspondence. Fundamental to geometry processing

Part-based and local feature models for generic object recognition

Using Spin-Images for Efficient Object Recognition in Cluttered 3-D Scenes

An Evaluation of Volumetric Interest Points

Loop detection and extended target tracking using laser data

Determinant of homography-matrix-based multiple-object recognition

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

A New Shape Matching Measure for Nonlinear Distorted Object Recognition

Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes

Local features: detection and description May 12 th, 2015

Local invariant features

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

CS 223B Computer Vision Problem Set 3

Overview of Computer Vision. CS308 Data Structures

3D Shape Analysis for Quantification, Classification and Retrieval

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Multiple Kernel Learning for Emotion Recognition in the Wild

Local Feature Detectors

School of Computing University of Utah

Breaking it Down: The World as Legos Benjamin Savage, Eric Chu

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

A Salient-Point Signature for 3D Object Retrieval

Detecting Multiple Symmetries with Extended SIFT

Shape Matching and Object Recognition using Low Distortion Correspondences

Efficient Symbolic Signatures for Classifying Craniosynostosis Skull Deformities

CS 468 Data-driven Shape Analysis. Shape Descriptors

Image Processing (IP)

Registration of Dynamic Range Images

Previously. Part-based and local feature models for generic object recognition. Bag-of-words model 4/20/2011

A Non-Rigid Feature Extraction Method for Shape Recognition

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component

CS 231A Computer Vision (Fall 2012) Problem Set 3

Motion Estimation and Optical Flow Tracking

The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method

TA Section: Problem Set 4

Visuelle Perzeption für Mensch- Maschine Schnittstellen

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Lecture 7: Image Morphing. Idea #2: Align, then cross-disolve. Dog Averaging. Averaging vectors. Idea #1: Cross-Dissolving / Cross-fading

Person Detection in Images using HoG + Gentleboost. Rahul Rajan June 1st July 15th CMU Q Robotics Lab

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

3D Perception. CS 4495 Computer Vision K. Hawkins. CS 4495 Computer Vision. 3D Perception. Kelsey Hawkins Robotics

Face Cyclographs for Recognition

Automatic Image Alignment (feature-based)

Road-Sign Detection and Recognition Based on Support Vector Machines. Maldonado-Bascon et al. et al. Presented by Dara Nyknahad ECG 789

Geometric Registration for Deformable Shapes 3.3 Advanced Global Matching

BSB663 Image Processing Pinar Duygulu. Slides are adapted from Selim Aksoy

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012

CHAPTER 1 INTRODUCTION

Object Recognition. The Chair Room

Object recognition (part 2)

3D Photography: Active Ranging, Structured Light, ICP

Generic Face Alignment Using an Improved Active Shape Model

CS 231A Computer Vision (Fall 2011) Problem Set 4

Geometric Modeling Assignment 5: Shape Deformation

Deep condolence to Professor Mark Everingham

Face detection and recognition. Detection Recognition Sally

Projected Texture for Hand Geometry based Authentication

Translation Symmetry Detection: A Repetitive Pattern Analysis Approach

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Stable and Multiscale Topological Signatures

Multiple-Choice Questionnaire Group C

3-D MRI Brain Scan Classification Using A Point Series Based Representation

Eppur si muove ( And yet it moves )

Shape Matching for 3D Retrieval and Recognition. Agenda. 3D collections. Ivan Sipiran and Benjamin Bustos

CSE 252B: Computer Vision II

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Computer Vision: Making machines see

Detecting Object Instances Without Discriminative Features

Parallel Computation of Spherical Parameterizations for Mesh Analysis. Th. Athanasiadis and I. Fudos University of Ioannina, Greece

Transcription:

Recognizing Deformable Shapes Salvador Ruiz Correa (CSE/EE576 Computer Vision I)

Input 3-D Object Goal We are interested in developing algorithms for recognizing and classifying deformable object shapes from range data. 3-D D Laser Scanner Range data (Cloud of 3-D D points) Post- processing 3-D D Output Surface Mesh This is a difficult problem that is relevant in several application fields.

Applications Computer Vision: - Scene analysis - Industrial Inspection - Robotics Medical Diagnosis: - Classification and - Detection of craniofacial deformations.

Basic Idea Generalize existing numeric surface representations for matching 3-D 3 D objects to the problem of identifying shape classes.

Main Contribution An algorithmic framework based on symbolic shape descriptors that are robust to deformations as opposed to numeric descriptors that are often tied to specific shapes.

What Kind Of Deformations? Normal Neurocranium Normal Mandibles Abnormal 3-D Faces Abnormal Toy animals Shape classes: significant amount of intra-class variability

Deformed Infants Skulls Normal Sagittal Synostosis Bicoronal Synostosis Metopic Coronal Sagittal Fused Sutures Occurs when sutures of the cranium fuse prematurely (synostosis).

More Craniofacial Deformations Unicoronal Synostosis Metopic Synostosis Bicoronal Synostosis Sagittal Synostosis Facial Asymmetry

Alignment-verification Objects Database Find correspondences using numeric signature information. Estimate candidate transformations. 3-D Range Scene Recognized Models Models Verification process selects the transformation that produces the best alignment.

Alignment-Verification Limitations The approach does not extend well to the problem of identifying classes of similar shapes. In general: Numeric shape representations are not robust to deformations. There are not exact correspondences between model and scene. Objects in a shape class do not align.

Component-Based Methodology 1 Numeric Signatures define 2 Overcomes the limitations of the alignment-verification approach Components 4 Describe spatial configuration 3 Symbolic Signatures + Architecture of Classifiers Recognition And Classification Of Deformable Shapes

Recognition Problem (1) We are given a set of surface meshes {C 1,C 2,,C n } which are random samples of two shape classes C C 1 C 2 C k C n

Recognition Problem (2) The problem is to use the given meshes and labels to construct an algorithm that determines whether shape class members are present in a single view range scene.

Classification Problem (1) We are given a set of surface meshes {C 1,C 2,,C n } which are random samples of two shape classes C +1 and C -1, where each surface mesh is labeled either by +1 or -1. +1 Normal Skulls C +1 Abnormal Skulls C -1 +1 +1 +1-1 -1-1 -1 C 1 C 2 C k C k+1 C k+2 C n

Classification Problem (2) The problem is to use the given meshes and labels to construct an algorithm that predicts the label of a new surface mesh C new. Is this skull normal (+1) or abnormal (-1)? C new

Classification Problem (3) We also consider the case of missing information: Shape class of normal heads (+1) Shape class of abnormal heads (-1) 3-D Range Scene Single View Clutter and Occlusion Are these heads normal or abnormal?

Assumptions All shapes are represented as oriented surface meshes of fixed resolution. The vertices of the meshes in the training set are in full correspondence. Finding full correspondences : hard problem yes but it is approachable ( use morphable models technique: Blantz and Vetter, SIGGRAPH 99; C. R. Shelton, IJCV, 2000; Allen et al., SIGGRAPH 2003).

Four Key Elements To Our Approach 1 Numeric Signatures 2 Components 4 3 Symbolic Signatures + Architecture of Classifiers Recognition And Classification Of Deformable Shapes

Numeric Signatures 1 Numeric Signatures Encode Local Surface Geometry of an Object 2 Components 4 3 Symbolic Signatures + Architecture of Classifiers

The Spin Image Signature P is the selected vertex. X is a contributing point of the mesh. n β X P α tangent plane at P α is the perpendicular distance from X to P s surface normal. β is the signed perpendicular distance from X to P s tangent plane.

Spin Image Construction A spin image is constructed - about a specified oriented point o of the object surface - with respect to a set of contributing points C, which is controlled by maximum distance and angle from o. It is stored as an array of accumulators S(α,β) computed via: For each point c in C(o) 1. compute α and β for c. 2. increment S (α,β) o

Numeric Signatures: Spin Images P 3-D faces Spin images for point P Rich set of surface shape descriptors. Their spatial scale can be modified to include local and non-local surface features. Representation is robust to scene clutter and occlusions.

Components 1 Numeric Signatures define 2 3 Components Symbolic Signatures Equivalent Numeric Signatures: Encode Local Geometry of a Shape Class + 4 Architecture of Classifiers

Select Seed Points How To Extract Shape Class Components? Training Set Compute Numeric Signatures Region Growing Algorithm Component Detector Grown components around seeds

Component Extraction Example Selected 8 seed points by hand Labeled Surface Mesh Region Growing Grow one region at the time (get one detector per component) Detected components on a training sample

How To Combine Component Information? 2 2 1 2 1 2 1 2 2 1 Extracted components on test samples 4 3 2 5 6 7 8 Note: Numeric signatures are invariant to mirror symmetry; our approach preserves such an invariance.

Symbolic Signatures 1 Numeric Signatures 2 Components 4 3 Symbolic Signatures Encode Geometrical Relationships Among Components + Architecture of Classifiers

Symbolic Signature Critical Point P Labeled Surface Mesh Encode Geometric Configuration Symbolic Signature at P 4 5 3 8 6 7 Matrix storing component labels

Symbolic Signature Construction Critical Point P Normal tangent plane P a b Coordinate system defined up to a rotation Project labels to tangent plane at P b P 4 5 3 8 6 7 a

Symbolic Signatures Are Robust To Deformations P 4 3 5 8 6 7 4 3 4 3 4 3 4 3 5 5 5 5 8 8 8 8 6 7 6 7 6 7 6 7 Relative position of components is stable across deformations: experimental evidence

Architecture of Classifiers 1 2 Numeric Signatures Learns Components And Their Geometric Relationships Components 4 3 Symbolic Signatures + Architecture of Classifiers

Proposed Architecture (Classification Example) Verify spatial configuration of the components Input Identify Components Labeled Mesh Identify Symbolic Signatures Class Label -1 (Abnormal) Surface Mesh Two classification stages

Surface Mesh At Classification Time (1) Labeled Surface Mesh Multi-way classifier Bank of Component Detectors Assigns Component Labels Identify Components

At Classification Time (2) 1 Labeled Surface Mesh Symbolic pattern for components 1,2,4 +1 4 6 2 5 8 Bank of Symbolic Signatures Detectors Two detectors Assigns Symbolic Labels Symbolic pattern for components 5,6,8-1

Finding Critical Points On Test +1 Samples Critical Point Margin associated with the component detector classifiers 1 0 Confidence Level -1

Architecture Implementation ALL our classifiers are (off-the the-shelf) ν- Support Vector Machines (ν-svms( SVMs) (Schölkopf et al., 2000 and 2001). Component (and symbolic signature) detectors are one-class classifiers. Component label assignment: performed with a multi-way classifier that uses pairwise classification scheme. Gaussian kernel.

Experimental Validation Recognition Tasks: 4 (T1 - T4) Classification Tasks: 3 (T5 T7) No. Experiments: 5470 Rotary Table Setup Laser Recognition Classification

Shape Classes

Enlarging Training Sets Using Virtual Samples Morphs Displacement Vectors Original (14) Twist (5deg) + Taper -Push + Spherify (10%) Push +Twist (10 deg) +Scale (1.2) Originals Morphs Global Morphing Operators Physical Modeling Electrical Engineering University of Washington

Task 1: Recognizing Single Objects (1) No. Shape classes: 9. Training set size: 400 meshes. Testing set size: 200 meshes. No. Experiments: 1960. No. Component detectors:3. No. Symbolic signature detectors: 1. Numeric signature size: 40x40. Symbolic signature size: 20x20. No clutter and occlusion.

Task 1: Recognizing Single Objects (2) Snowman: 93%. Rabbit: 92%. Dog: 89%. Cat: 85.5%. Cow: 92%. Bear: 94%. Horse: 92.7%. Human head: 97.7%. Human face: 76%. Recognition rates (true positives) (No clutter, no occlusion, complete models)

Tasks 2-3: 2 Recognition In Complex Scenes (1) No. Shape classes: 3. Training set size: 400 meshes. Testing set size: 200 meshes. No. Experiments: 1200. No. Component detectors:3. No. Symbolic signature detectors: 1. Numeric signature size: 40x40. Symbolic signature size: 20x20. T2 low clutter and occlusion.

Task 2-3: 2 Recognition in Complex Scenes (2) Shape True False True False Class Positives Positives Positives Positives Snowmen 91% 31% 87.5% 28% Rabbit 90.2% 27.6% 84.3% 24% Dog 89.6% 34.6% 88.12% 22.1% Task 2 Task 3

Task 2-3: 2 Recognition in Complex Scenes (3)

Task 4: Recognizing Human Heads (1) No. Shape classes: 1. Training set size: 400 meshes. Testing set size: 250 meshes. No. Experiments: 710. No. Component detectors:8. No. Symbolic signature detectors: 2. Numeric signature size: 70x70. Symbolic signature size: 12x12.

Task 4: Recognizing Human Heads (2) Recognition Rate (44,0.9) % Clutter < 40 % Occlusion < 40 (40,0.88) % Occlusion % Clutter FP rate: ~1%,

Task 4: Recognizing Human Heads (3)

Task 5: Classifying Normal vs. Abnormal Human Heads (1) No. Shape classes: 6. Training set size: 400 meshes. Testing set size: 200 meshes. No. Experiments: 1200. No. Component detectors:3. No. Symbolic signature detectors: 1. Numeric signature size: 50x50. Symbolic signature size: 12x12.

Task 5: Classifying Normal vs. Abnormal Human Heads (1) Five Cases Shape Classes Normal vs. Abnormal 1 Normal vs. Abnormal 2 Classification Accuracy % 98 100 Abnormal 1 vs. 3 98 Abnormal 1 vs. 4 97 Normal Abnormal 1 2 3 4 5 Abnormal 1 vs. 5 92 Full models 65%-35% 50%-50% 25%-75% (convex combinations of Normal and Abnormal 1)

Task 6: Classifying Normal vs. Abnormal Human Heads In Complex Scenes(1) No. Shape classes: 2. Training set size: 400 meshes. Testing set size: 200 meshes. No. Experiments: 1200. No. Component detectors:3. No. Symbolic signature detectors: 1. Numeric signature size: 100x100. Symbolic signature size: 12x12.

Task 6: Classifying Normal vs. Abnormal Human Heads In Complex Scenes(1) Shape Classes Normal vs. Abnormal 1 Classification Accuracy % 88 Range scenes single view Clutter < 15% and occlusion < 50%

Task 7: Classifying Normal vs. Abnormal Neurocranium (1) No. Shape classes: 2. Training set size: 400 meshes. Testing set size: 200 meshes. No. Experiments: 2200. No. Component detectors:3. No. Symbolic signature detectors: 1. Numeric signature size: 50x50. Symbolic signature size: 15x15.

Task 7: Classifying Normal vs. Abnormal Neurocranium (2) 100 Experiments Shape Classes Normal vs. Abnormal Classificatio n Accuracy % 89 No clutter and occlusion Normal Abnormal (sagittal synostosis ) Superimposed models

Main Contributions (1) A novel symbolic signature representation of deformable shapes that is robust to intra-class variability and missing information, as opposed to a numeric representation which is often tied to a specific shape. A novel kernel function for quantifying symbolic signature similarities.

Main Contributions (2) A region growing algorithm for learning shape class components. A novel architecture of classifiers for abstracting the geometry of a shape class. A validation of our methodology in a set of large scale recognition and classification experiments aimed at applications in scene analysis and medical diagnosis.

Main Contributions (3) Our approach: - Is general can be applied to a variety of shape classes. - Is robust to clutter and occlusion - It Works in practice - Is a step forward in 3-D 3 D object recognition research.