Announcements. Recognition I. Optical Flow: Where do pixels move to? dy dt. I + y. I = x. di dt. dx dt. = t

Similar documents
Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map?

Announcements. Recognition. Recognition. Recognition. Recognition. Homework 3 is due May 18, 11:59 PM Reading: Computer Vision I CSE 152 Lecture 14

Applications Video Surveillance (On-line or off-line)

Announcements. Stereo Vision Wrapup & Intro Recognition

Midterm Wed. Local features: detection and description. Today. Last time. Local features: main components. Goal: interest operator repeatability

Dimension Reduction CS534

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

Recognition. Computer Vision I. CSE252A Lecture 20

Computer Vision Lecture 20

Recognition (Part 4) Introduction to Computer Vision CSE 152 Lecture 17

Announcements. Recognition (Part 3) Model-Based Vision. A Rough Recognition Spectrum. Pose consistency. Recognition by Hypothesize and Test

Robust Face Recognition via Sparse Representation Authors: John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar Sastry, and Yi Ma

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Optical flow. Cordelia Schmid

Lecture 19: Motion. Effect of window size 11/20/2007. Sources of error in correspondences. Review Problem set 3. Tuesday, Nov 20

Image-Based Face Recognition using Global Features

Outline. Advanced Digital Image Processing and Others. Importance of Segmentation (Cont.) Importance of Segmentation

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Face Detection and Recognition in an Image Sequence using Eigenedginess

Active Appearance Models

Structure from Motion. Introduction to Computer Vision CSE 152 Lecture 10

Recognition, SVD, and PCA

Optical flow. Cordelia Schmid

Learning to Recognize Faces in Realistic Conditions

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Announcements. Motion. Structure-from-Motion (SFM) Motion. Discrete Motion: Some Counting

Visual motion. Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys

Visual Learning and Recognition of 3D Objects from Appearance

What do we mean by recognition?

Finally: Motion and tracking. Motion 4/20/2011. CS 376 Lecture 24 Motion 1. Video. Uses of motion. Motion parallax. Motion field

Linear Discriminant Analysis in Ottoman Alphabet Character Recognition

Stereo Wrap + Motion. Computer Vision I. CSE252A Lecture 17

3D Models and Matching

Face Recognition using Tensor Analysis. Prahlad R. Enuganti

Robust Pose Estimation using the SwissRanger SR-3000 Camera

Nonparametric Multiscale Multimodal Models for Detection/Recognition John Fisher & John Reif Eagle Eye Software (919)

Recognition of Non-symmetric Faces Using Principal Component Analysis

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

CS4670: Computer Vision

CREATING 3D WRL OBJECT BY USING 2D DATA

Face detection and recognition. Detection Recognition Sally

Scale Invariant Feature Transform (SIFT) CS 763 Ajit Rajwade

Face Recognition for Different Facial Expressions Using Principal Component analysis

Motion and Tracking. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

What is Computer Vision? Introduction. We all make mistakes. Why is this hard? What was happening. What do you see? Intro Computer Vision

Announcements. Introduction. Why is this hard? What is Computer Vision? We all make mistakes. What do you see? Class Web Page is up:

Local features: detection and description May 12 th, 2015

Last Lecture. Edge Detection. Filtering Pyramid

Computer Vision Lecture 18

Multibody Motion Estimation and Segmentation from Multiple Central Panoramic Views

EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation

Last Time. Correct Transparent Shadow. Does Ray Tracing Simulate Physics? Does Ray Tracing Simulate Physics? Refraction and the Lifeguard Problem

MAN-522: COMPUTER VISION SET-2 Projections and Camera Calibration

Recognition problems. Face Recognition and Detection. Readings. What is recognition?

ECG782: Multidimensional Digital Signal Processing

The Isometric Self-Organizing Map for 3D Hand Pose Estimation

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Warping, Morphing and Mosaics

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS

Lecture 14 Shape. ch. 9, sec. 1-8, of Machine Vision by Wesley E. Snyder & Hairong Qi. Spring (CMU RI) : BioE 2630 (Pitt)

calibrated coordinates Linear transformation pixel coordinates

Announcements. Computer Vision I. Motion Field Equation. Revisiting the small motion assumption. Visual Tracking. CSE252A Lecture 19.

Unsupervised Learning

Image Processing. Image Features

Motion estimation. Lihi Zelnik-Manor

An Integrated Face Recognition Algorithm Based on Wavelet Subspace

Illumination Models and Shading

Local features: detection and description. Local invariant features

Image warping. image filtering: change range of image. image warping: change domain of image g(x) = f(h(x)) h(y)=0.5y+0.5. h([x,y])=[x,y/2] f h

Capturing, Modeling, Rendering 3D Structures

Facial Expression Detection Using Implemented (PCA) Algorithm

Object and Action Detection from a Single Example

How is project #1 going?

A Dynamic Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space

CS5670: Computer Vision

Static Gesture Recognition with Restricted Boltzmann Machines

Announcements. Equation of Perspective Projection. Image Formation and Cameras

Tracking system. Danica Kragic. Object Recognition & Model Based Tracking

Face Recognition Using SIFT- PCA Feature Extraction and SVM Classifier

Two-View Geometry (Course 23, Lecture D)

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Feature Detectors and Descriptors: Corners, Lines, etc.

Dimension Reduction of Image Manifolds

Final Project Face Detection and Recognition

Multi-Pose Face Recognition And Tracking System

FACE RECOGNITION BASED ON GENDER USING A MODIFIED METHOD OF 2D-LINEAR DISCRIMINANT ANALYSIS

Feature Based Registration - Image Alignment

Mobile Face Recognization

An Effective Approach in Face Recognition using Image Processing Concepts

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

Multiple-Choice Questionnaire Group C

An Automatic Face Identification System Using Flexible Appearance Models

EECS 556 Image Processing W 09

Face Recognition using Tensor Analysis. Prahlad R. Enuganti

Face recognition based on improved BP neural network

Object. Radiance. Viewpoint v

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

On Modeling Variations for Face Authentication

All good things must...

Transcription:

Announcements I Introduction to Computer Vision CSE 152 Lecture 18 Assignment 4: Due Toda Assignment 5: Posted toda Read: Trucco & Verri, Chapter 10 on recognition Final Eam: Wed, 6/9/04, 11:30-2:30, WLH 2207 (here) Optical Flow: Where do piels move to? Virtual Cinematograph: Making 'The Matri' Sequels George Borshukov VFX Technolog Supervisor, ESC Entertainment Frida, June 4, 2004 1:00 p.m. to 2:30 p.m. [Pizza lunch will precede the event from noon to 1 p.m.] Main Auditorium, San Diego Supercomputer Center The presentation will cover the ke technologies that had to be developed and deploed to create the snthetic human sequences in the Matri sequels including Universal Capture - image-based facial animation, realistic human face rendering, and use of measured BRDF in film production. It will also feature a breakdown of The Superpunch shot (pictured above) from "The Matri Revolutions" (the bullet time punch that Neo delivers to Agent Smith during the film's last face-off). This difficult, important, epensive, and challenging shot was entirel computer generated and showcased the technological developments of 3.5+ ears at their best b showing a full-frame close-up of a known human actor. Mathematical formulation [Note change of notation: image coordinates now (,), not (u,v)] I (,,t) brightness at image point (,) at time t Consider scene (or camera) to be moving, so (t), (t) Brightness constanc assumption: d d I ( + δt, + δt, t + δt) I(,, t) Optical flow constraint equation : di d + d di + t Measurements I, I, I t t Flow vector d u, d v

Two was to get flow de( u, v) du de( u, v) dv 2I 2I ( I u + I v + I ) ( I u + I v + I ) t t Ω 1. Think globall, and regularize over image 2. Look over window and assume constant motion in the window Given a database of objects and an image determine what, if an of the objects are present in the image. Challenges Within-class variabilit Different objects within the class have different shapes or different material characteristics Deformable Articulated Compositional Pose variabilit: 2-D Image transformation (translation, rotation, scale) 3-D Pose Variabilit (perspective, orthographic projection) Lighting Direction (multiple sources & tpe) Color Shadows Occlusion partial Clutter in background -> false positives Object Issues: How general is the problem? 2D vs. 3D range of viewing conditions available contet segmentation cues What sort of data is best suited to the problem? Whole images Local 2D features (color, teture, 3D (range) data What information do we have in the database? Collection of images? 3-D models? Learned representation? Learned classifiers? How man objects are involved? small: brute force search large:??

A Rough Spectrum Appearance-Based (Eigenface, Fisherface) Shape Contets Geometric Invariants Image Abstractions/ Volumetric Primitives Appearance-Based Vision: A Pattern Classification Viewpoint Local Features + Spatial Relations Aspect Graphs Increasing Generalit 3-D Model-Based Function 1. Feature Space + Nearest Neighbor 2. Dimensionalit Reduction 3. Baesian Classification 4. Appearance Manifolds Image (window) Sketch of a Pattern Architecture Feature Etraction Feature Vector Classification Object Identit Eample: Face Detection Scan window over image. Classif window as either: Face Non-face Window Classifier Face Non-face The Problem of Given an image I and a database of k objects and a representation R j for object j in the database, recognition can be epressed as: Image as a Feature Vector 2 i arg min c( R j [1, L, k ] j, I) 1 3 where c(r j,,i) is a function which gives the compatibilit or consistenc of representation R j with the image. Consider an n-piel image (window) to be a point in an n-dimensional space, R n. Each piel value is a coordinate of.

Simplest Scheme R j is an image. c(r j, I) is Euclidean distance. R 1 I 1 2 3 Nearest Neighbor Classifier R 2 Comments Sometimes called Template Matching Variations on distance function (e.g. L 1, robust distances) Multiple templates per class- perhaps man training images per class. Epensive to compute k distances, especiall when each image is big (N dimensional). Ma not generalize well to unseen eamples of class. Some solutions: Dimensionalit reduction Baesian classification Eigenfaces: Linear Projection Eigenfaces: Principal Component Analsis (PCA) An n-piel image R n can be projected to a low-dimensional feature space R m b W where W is an m b n matri. is performed using nearest neighbor in R m. How do we choose a good W? Some details: Use Singular value decomposition, trick described in appendi of tet to compute basis when n<<d Mean First Principal Component Direction of Maimum Variance Eigenfaces Modeling 1. Given a collection of n labeled training images, 2. Compute mean image and covariance matri. 3. Compute k Eigenvectors (note that these are images) of covariance matri corresponding to k largest Eigenvalues. (Or perform using SVD!!) 4. Project the training images to the k-dimensional Eigenspace. 1. Given a test image, project to Eigenspace. 2. Perform classification to the projected training images.

Eigenfaces: Training Images Eigenfaces [ Turk, Pentland 91 Mean Image Basis Images Basis Images for Variable Lighting Projection, and reconstruction An n-piel image R n can be projected to a low-dimensional feature space R m b W From R m, the reconstruction of the point is W T The error of the reconstruction is: -W T W Reconstruction using Eigenfaces Given image on left, project to Eigenspace, then reconstruct an image (right). Face detection using distance to face space Scan a window ω across the image, and classif the window as face/not face as follows: Project window to subspace, and reconstruct as described earlier. Compute distance between ω and reconstruction. Local minima of distance over all image locations less than some treshold are taken as locations of faces. Repeat at different scales. Possibl normalize windows intensit so that ω 1.