Face Detection and Alignment. Prof. Xin Yang HUST

Similar documents
arxiv: v1 [cs.cv] 16 Nov 2015

Hand Gesture Extraction by Active Shape Models

Lecture 8 Object Descriptors

Supplementary Material Estimating Correspondences of Deformable Objects In-the-wild

Visual Representations for Machine Learning

Face Alignment Across Large Poses: A 3D Solution

Image Processing. Image Features

Locating Facial Landmarks Using Probabilistic Random Forest

An Adaptive Eigenshape Model

CS 4495 Computer Vision Motion and Optic Flow

Data preprocessing Functional Programming and Intelligent Algorithms

Subdivision Curves and Surfaces

EE795: Computer Vision and Intelligent Systems

Unsupervised Learning

Dimension Reduction CS534

Linear Methods for Regression and Shrinkage Methods

Dense Image-based Motion Estimation Algorithms & Optical Flow

CS143 Introduction to Computer Vision Homework assignment 1.

Computer Vision I. Announcements. Fourier Tansform. Efficient Implementation. Edge and Corner Detection. CSE252A Lecture 13.

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

A Method of Automated Landmark Generation for Automated 3D PDM Construction

Image Coding with Active Appearance Models

Feature Tracking and Optical Flow

Feature Detection and Tracking with Constrained Local Models

2. Data Preprocessing

Classification of Hyperspectral Breast Images for Cancer Detection. Sander Parawira December 4, 2009

Active Shape Models - 'Smart Snakes'

Clustering and Visualisation of Data

The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method

Autonomous Navigation for Flying Robots

Linear Discriminant Analysis for 3D Face Recognition System

Extended Supervised Descent Method for Robust Face Alignment

Computer Vision I. Announcement. Corners. Edges. Numerical Derivatives f(x) Edge and Corner Detection. CSE252A Lecture 11

Edge Detection. Computer Vision Shiv Ram Dubey, IIIT Sri City

COMBINED METHOD TO VISUALISE AND REDUCE DIMENSIONALITY OF THE FINANCIAL DATA SETS

Announcements. Edges. Last Lecture. Gradients: Numerical Derivatives f(x) Edge Detection, Lines. Intro Computer Vision. CSE 152 Lecture 10

Network Traffic Measurements and Analysis

Geometric Modeling in Graphics

Face Recognition Using Active Appearance Models

Principal Component Image Interpretation A Logical and Statistical Approach

Generic Face Alignment Using an Improved Active Shape Model

Matching. Compare region of image to region of image. Today, simplest kind of matching. Intensities similar.

Robust Face Recognition via Sparse Representation Authors: John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar Sastry, and Yi Ma

Evaluating Error Functions for Robust Active Appearance Models

Graphical Models for Computer Vision

Geometric Registration for Deformable Shapes 3.3 Advanced Global Matching

OVer the last few years, cascaded-regression (CR) based

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Edge and corner detection

CS 521 Data Mining Techniques Instructor: Abdullah Mueen

CS 229 Midterm Review

Active Appearance Models

Automatic Modelling Image Represented Objects Using a Statistic Based Approach

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

Sparse Shape Registration for Occluded Facial Feature Localization

CS-465 Computer Vision

Robust Facial Landmark Detection under Significant Head Poses and Occlusion

The Automation of the Feature Selection Process. Ronen Meiri & Jacob Zahavi

Local Linear Embedding. Katelyn Stringer ASTR 689 December 1, 2015

Facial Expression Detection Using Implemented (PCA) Algorithm

Mapping of Hierarchical Activation in the Visual Cortex Suman Chakravartula, Denise Jones, Guillaume Leseur CS229 Final Project Report. Autumn 2008.

Learning and Inferring Depth from Monocular Images. Jiyan Pan April 1, 2009

Automatic Countenance Recognition Using DCT-PCA Technique of Facial Patches with High Detection Rate

Object and Action Detection from a Single Example

What is Computer Vision?

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

Gaussian and Mean Curvature Planar points: Zero Gaussian curvature and zero mean curvature Tangent plane intersects surface at infinity points Gauss C

ECG782: Multidimensional Digital Signal Processing

Lecture 15 Active Shape Models

Computer Vision Group Prof. Daniel Cremers. 8. Boosting and Bagging

Introduction to ANSYS DesignXplorer

Human pose estimation using Active Shape Models

Lecture 16: Computer Vision

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Compression. The Encoder and PCA

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

Digital Image Processing Chapter 11: Image Description and Representation

Mask R-CNN. By Kaiming He, Georgia Gkioxari, Piotr Dollar and Ross Girshick Presented By Aditya Sanghi

Face detection and recognition. Detection Recognition Sally

CSE152 Introduction to Computer Vision Assignment 3 (SP15) Instructor: Ben Ochoa Maximum Points : 85 Deadline : 11:59 p.m., Friday, 29-May-2015

One Millisecond Face Alignment with an Ensemble of Regression Trees

Facial Expression Recognition

3D Pose Estimation of Cactus Leaves using an Active Shape Model

SuRVoS Workbench. Super-Region Volume Segmentation. Imanol Luengo

Object Detection with Discriminatively Trained Part Based Models

Image Segmentation and Registration

PCA and KPCA algorithms for Face Recognition A Survey

Biology Project 1

CS 565 Computer Vision. Nazar Khan PUCIT Lectures 15 and 16: Optic Flow

Learning from Data: Adaptive Basis Functions

Flexible 3D Models from Uncalibrated Cameras

Image processing and features

Multiple Model Estimation : The EM Algorithm & Applications

On-line, Incremental Learning of a Robust Active Shape Model

VC 11/12 T11 Optical Flow

Face Alignment Under Various Poses and Expressions

Coarse-to-Fine Statistical Shape Model by Bayesian Inference

SIFT: SCALE INVARIANT FEATURE TRANSFORM SURF: SPEEDED UP ROBUST FEATURES BASHAR ALSADIK EOS DEPT. TOPMAP M13 3D GEOINFORMATION FROM IMAGES 2014

Transcription:

Face Detection and Alignment Prof. Xin Yang HUST

Applications Virtual Makeup

Applications Virtual Makeup

Applications Video Editing

Active Shape Models Suppose we have a statistical shape model Trained from sets of examples How do we use it to interpret new images? Use an Active Shape Model Iterative method of matching model to image

Building Models Require labelled training images landmarks represent correspondences

Building Shape Models Given aligned shapes, { } x i Apply PCA x x Pb P First t eigenvectors of covar. matrix b Shape model parameters

PCA: General From k original variables: x 1,x 2,...,x k : Produce k new variables: y 1,y 2,...,y k : y 1 = a 11 x 1 + a 12 x 2 +... + a 1k x k y 2 = a 21 x 1 + a 22 x 2 +... + a 2k x k... y k = a k1 x 1 + a k2 x 2 +... + a kk x k such that: y k 's are uncorrelated (orthogonal) y 1 explains as much as possible of original variance in data set y 2 explains as much as possible of remaining variance etc.

5 4 2nd Principal Component, y 2 1st Principal Component, y 1 3 2 4.0 4.5 5.0 5.5 6.0

PCA Scores 5 x i2 4 y i,1 y i,2 3 2 4.0 4.5 5.0 x 5.5 6.0 i1

PCA Eigenvalues 5 λ 1 λ 2 4 3 2 4.0 4.5 5.0 5.5 6.0

PCA: Another Explanation From k original variables: x 1,x 2,...,x k : Produce k new variables: y 1,y 2,...,y k : y 1 = a 11 x 1 + a 12 x 2 +... + a 1k x k y 2 = a 21 x 1 + a 22 x 2 +... + a 2k x k... y k = a k1 x 1 + a k2 x 2 +... + a kk x k y k 's are Principal Components such that: y k 's are uncorrelated (orthogonal) y 1 explains as much as possible of original variance in data set y 2 explains as much as possible of remaining variance etc.

Principal Components Analysis on: Covariance Matrix: Variables must be in same units Emphasizes variables with most variance

PCA: General {a 11,a 12,...,a 1k } is 1st Eigenvector of correlation/covariance matrix, and coefficients of first principal component {a 21,a 22,...,a 2k } is 2nd Eigenvector of correlation/covariance matrix, and coefficients of 2nd principal component {a k1,a k2,...,a kk } is kth Eigenvector of correlation/covariance matrix, and coefficients of kth principal component

Building Shape Models Given aligned shapes, { } x i Apply PCA x x Pb P First t eigenvectors of covar. matrix b Shape model parameters

Hand Shape Model Varying b Varying b Varying b 1 2 3

BUILDING THE MODELS All Faces Modes of Variation First mode Second mode Third mode

BUILDING THE MODELS Neutral - Appearance Variation Changing the first three modes of variation simultaneously

Active Shape Models Match shape model to new image Require: Statistical shape model Model of image structure at each point Model Point Model of Profile

Placing model in image The model points are defined in a model coordinate frame Must apply global transformation,t, to place in image x x Pb T ( x; X, Y, s, ) c c X T ( x Pb ) Model Frame Image

ASM Search Overview Local optimisation Initialise near target Search along profiles for best match,x Update parameters to match to X. ( X i, Yi )

Local Structure Models Need to search for local match for each point Model Strongest edge Statistical model of profile

Computing Normal to Boundary Tangent Normal ), ( ), ( x y y x t t n n ), ( y t x t ), ( 1 1 i X i Y ), ( 1 1 i X i Y 2 2 ), ( ), ( y x y x y x d d d d t t 1 1 1 1 i i y i i x Y Y d X X d (Unit vector)

Sampling along profiles Model boundary Profile normal to boundary Model point ( X, Y ) Interpolate at these points ( X, Y ) i( s n, s n i... 2, 1,0,1,2,... n x n y ) s n Tak e step s of length s n alo ng ( n x, n y ) s n n y s n n x

Noise reduction In noisy images, average orthogonal to profile Improves signal-to-noise along profile g i1 g i2 g i3 U se g i 0.25g i1 0.5 g i 2 0. 25 g i 3 Sampled profile is g (..., g -2, g -1, g 0, g 1, g 2,...)

Searching for strong edges g ( x) dg ( x) dx x dg( x) dx 0.5( g ( x 1) g ( x 1)) Select point along profile at strongest edge

Profile Models Sometimes true point not on strongest edge Strongest edge True position Model local structure to help locate the point

Statistical Profile Models Estimate p.d.f. for sample on profile Normalise to allow for global lighting variations From training set learn g p(g)

Profile Models For each point in model For each training image Sample values along profile Normalise Build statistical model eg Gaussian PDF using eigen-model approach

Searching Along Profiles During search we look along a normal for the best match for each profile g ( x) p ( g ( x )) g( x) x Form vector from samples about x

Search algorithm Search along profile Update global transformation, T, and parameters, b, to minimise X T ( x Pb ) 2 ( X i, Yi )

Updating parameters Find pose and model parameters to minimise Either Put into general optimiser Use two stage iterative approach 2 ),,, ; ( ),,,, ( s Y X T s Y X f c c c c Pb x X b

Updating Parameters 2 ),,, ; ( ),,,, ( s Y X T s Y X f c c c c Pb x X b 2 ) ( minimise w hich ),,, ( find and F ix Pb x X b T s Y X c c 2 ) ( minimises w hich find and ),,, ( F ix Pb x X b T s Y X c c ) ) ( ( 1 x X P b T T Repeat until convergence: Analytic solution exists (see notes)

Update step Hard constraints M inimise X T ( x Pb ) 2 sub ject to p( b ) p t e.g. b 3 i λ i

Multi-Resolution Search Train models at each level of pyramid Gaussian pyramid with step size 2 Use same points but different local models Start search at coarse resolution Refine at finer resolution

Gaussian Pyramids To generate image at level L Smooth image at level L-1 with gaussian filter (eg (1 5 8 5 1)/20) Sub-sample every other pixel Each level half the size of the one below

Multi-Resolution Search Start at coarse resolution For each resolution Search along profiles for best matches Update parameters to fit matches (Apply constraints to parameters) Until converge at this resolution

ASM Fitting Process

ASM Fitting Process

Face Tracking Framework: Active Face Detection Shape Model Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape Landmarks of Active Shape Model

Algorithm Face Detection Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Algorithm Face Detection Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Algorithm Face Detection Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Algorithm Face Detection Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Algorithm Face Detection Start Shape on the ROI Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Algorithm Face Detection Start Shape on the ROI replace Feature Extraction & Search Shape Realignment on the ROI Output the Shape

Experiment Data: 1000 frames, 800x600 (hy1.avi) Intel(R) Core(TM) i3-2350 CPU @ 2.30 GHz LDB-ASM: 15.503s (64fps) HAT-ASM: 65.123s (15fps) Mobile Platform (Qualcomm 8974) LDB-ASM: 00:01:22m (12fps) HAT-ASM: 00:08:13m (2fps)

LDB-ASM HAT-ASM

LDB-ASM HAT-ASM STD-X STD-X STD-Y STD-Y

Active Shape Models Advantages Fast, simple, accurate Efficient to extend to 3D Disadvantages Only sparse use of image information Treat local models as independent

Cascade Shape Regression Framework Stage t = 0 t = 3 t = 5 Cascaded pose regression, Dollar et. al., CVPR 2010

Face Alignment based on LBF-Regression Tree Induced Local Binary Features learned from data global optimization much stronger than previous regression trees efficient training / testing Best accuracy on challenging benchmarks 3,000 FPS on desktop, or 300 FPS on mobile first face tracking method on mobile

Face Alignment based on LBF-Regression Shape-index Feature

Face Alignment based on LBF-Regression A simple form sum of a large number of regression trees Novel two step learning 1. Local learning of tree structure learn an easier task and better features 2. Global optimization of tree output enforce dependence between points and reduce local estimation errors

Local Learning of Tree Structure Target: one point Random forest learn standard random forests for each local point standard regression tree using pixel difference features only use pixels in the local patch around the point regularization of feature selection

From Local to Global Target: one point Random forest Fix tree structures and optimize tree leave s output

Global Optimization of Tree Output Regression Target Feature Mapping Function

Global Optimization of Tree Output

Tree Induced Binary Features Each leave is a binary indicator function 1 if the image sample arrives at the leaf 0 otherwise Trees -> high dimension sparse binary features Learning global linear regression

Experiments Benchmark #landmarks #training images #testing images LFPW 29 717 249 Helen 194 2000 330 300-W 68 3149 689 Two variants of our method Accurate: LBF 1200 trees with depth 7 Fast: LBF fast 300 trees with depth 5

LFPW (29 landmarks) Method Error FPS [1] 3.99 ESR [2] 3.47 220 RCPR [3] 3.50 - SDM [4] 3.49 160 EGM [5] 3.98 LBF 3.35 460 LBF fast 3.35 4200 Helen (194 landmarks) Method Error FPS STASM [6] 11.1 - CompASM [7] 9.10 - ESR [2] 5.70 70 PCPR [3] 6.50 - SDM [4] 5.85 21 LBF 5.41 200 LBF fast 5.80 1500 300-W (68 landmarks) Method Fullset Common Subset Challenging Subset FPS ESR [2] 7.58 5.28 17.00 120 SDM [4] 7.52 5.60 15.40 70 LBF 6.32 4.95 11.98 320 LBF fast 7.37 5.38 15.50 3100 LBF is much more accurate and a few times faster LBF fast is slightly more accurate and dozens of times faster

Face Alignment at 3000 FPS

Summary State-of-the-art face alignment Best accuracy on challenging benchmarks Dozens of times faster than previous methods faster than real time face tracking on mobile