Face Alignment Across Large Poses: A 3D Solution

Similar documents
Supplemental Material for Face Alignment Across Large Poses: A 3D Solution

3D Face Modelling Under Unconstrained Pose & Illumination

Face Recognition Markus Storer, 2007

3D Active Appearance Model for Aligning Faces in 2D Images

Multi-View AAM Fitting and Camera Calibration

Face Alignment across Large Pose via MT-CNN based 3D Shape Reconstruction

Generic Face Alignment Using an Improved Active Shape Model

Single view-based 3D face reconstruction robust to self-occlusion

Active Appearance Models

Deep Face Feature for Face Alignment and Reconstruction

Shape Augmented Regression for 3D Face Alignment

arxiv: v1 [cs.cv] 16 Nov 2015

3D Morphable Model Parameter Estimation

Landmark Weighting for 3DMM Shape Fitting

3DFaceNet: Real-time Dense Face Reconstruction via Synthesizing Photo-realistic Face Images

In Between 3D Active Appearance Models and 3D Morphable Models

Face Detection and Alignment. Prof. Xin Yang HUST

Facial Feature Detection

Joint Head Pose Estimation and Face Alignment Framework Using Global and Local CNN Features

Gesture Recognition: Hand Pose Estimation. Adrian Spurr Ubiquitous Computing Seminar FS

Active Wavelet Networks for Face Alignment

DA Progress report 2 Multi-view facial expression. classification Nikolas Hesse

Supplementary Material for Synthesizing Normalized Faces from Facial Identity Features

Pose-Invariant Face Alignment via CNN-Based Dense 3D Model Fitting

HeadFusion: 360 Head Pose tracking combining 3D Morphable Model and 3D Reconstruction

Analyzing the Relationship Between Head Pose and Gaze to Model Driver Visual Attention

On the Dimensionality of Deformable Face Models

AAM Based Facial Feature Tracking with Kinect

Face Recognition At-a-Distance Based on Sparse-Stereo Reconstruction

Increasing the Density of Active Appearance Models

arxiv: v2 [cs.cv] 8 Sep 2017

Face Recognition A Deep Learning Approach

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

Landmark Detection and 3D Face Reconstruction using Modern C++

arxiv: v1 [cs.cv] 29 Sep 2016

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component

2D vs. 3D Deformable Face Models: Representational Power, Construction, and Real-Time Fitting

/10/$ IEEE 4048

3D Face Modeling. Lacey Best- Rowden, Joseph Roth Feb. 18, MSU

Fitting a Single Active Appearance Model Simultaneously to Multiple Images

Enhanced Active Shape Models with Global Texture Constraints for Image Analysis

4422 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 27, NO. 9, SEPTEMBER 2018

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Face Alignment Under Various Poses and Expressions

3D-MAM: 3D Morphable Appearance Model for Efficient Fine Head Pose Estimation from Still Images

Occluded Facial Expression Tracking

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz Supplemental Material

Pix2Face: Direct 3D Face Model Estimation

Automatic Construction of Active Appearance Models as an Image Coding Problem

Self-supervised CNN for Unconstrained 3D Facial Performance Capture from an RGB-D Camera

3D Shape Representation and Analysis of the Human Body and Ontology for Anthropometric Landmarks

Rapid 3D Face Modeling using a Frontal Face and a Profile Face for Accurate 2D Pose Synthesis

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. Charles R. Qi* Hao Su* Kaichun Mo Leonidas J. Guibas

Learning a generic 3D face model from 2D image databases using incremental structure from motion

Describable Visual Attributes for Face Verification and Image Search

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing

FITTING AND TRACKING 3D/4D FACIAL DATA USING A TEMPORAL DEFORMABLE SHAPE MODEL. Shaun Canavan, Xing Zhang, and Lijun Yin

Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding Feature

Facial Expression Recognition Using Non-negative Matrix Factorization

Integrating Range and Texture Information for 3D Face Recognition

Pose Independent Face Recognition by Localizing Local Binary Patterns via Deformation Components

Deep Learning for Virtual Shopping. Dr. Jürgen Sturm Group Leader RGB-D

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

[Nirgude* et al., 4(1): January, 2017] ISSN Impact Factor 2.675

Inception Network Overview. David White CS793

CHAPTER 5 GLOBAL AND LOCAL FEATURES FOR FACE RECOGNITION

Cascaded Regression with Sparsified Feature Covariance Matrix for Facial Landmark Detection

Integral Channel Features with Random Forest for 3D Facial Landmark Detection

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Multiple Kernel Learning for Emotion Recognition in the Wild

OVer the last few years, cascaded-regression (CR) based

Face Alignment Robust to Occlusion

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

On 3D face reconstruction via cascaded regression in shape space

Face Recognition Across Poses Using A Single 3D Reference Model

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Dense 3D Face Alignment from 2D Videos in Real-Time

Face Image Synthesis and Interpretation Using 3D Illumination-Based AAM Models

Linear Discriminant Analysis for 3D Face Recognition System

Adaptive Binary Quantization for Fast Nearest Neighbor Search

Generic 3D Face Pose Estimation using Facial Shapes

Tweaked residual convolutional network for face alignment

Real-time Scalable 6DOF Pose Estimation for Textureless Objects

Mobile Face Recognization

Recovering 3D Facial Shape via Coupled 2D/3D Space Learning

FACE RECOGNITION USING INDEPENDENT COMPONENT

Conditional Random Fields as Recurrent Neural Networks

Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing Supplementary Material

3D Statistical Shape Model Building using Consistent Parameterization

Learning GMRF Structures for Spatial Priors

Model-based 3D Shape Recovery from Single Images of Unknown Pose and Illumination using a Small Number of Feature Points

Causal and compositional generative models in online perception

Learning-based Localization

Landmark Localisation in 3D Face Data

The First 3D Face Alignment in the Wild (3DFAW) Challenge

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Convolutional Experts Constrained Local Model for 3D Facial Landmark Detection

Face Alignment with Part-Based Modeling

DeepIM: Deep Iterative Matching for 6D Pose Estimation - Supplementary Material

Statistics of Natural Image Categories

Transcription:

Face Alignment Across Large Poses: A 3D Solution

Outline Face Alignment Related Works 3D Morphable Model Projected Normalized Coordinate Code Network Structure 3D Image Rotation Performance on Datasets Extensions

What is Face Alignment? Why do we need it? Face alignment can be formulated as a problem of searching over a face image for the pre-defined facial points (also called face shape), which typically starts from a coarse initial shape, and proceeds by refining the shape estimate step by step until convergence. Can be applied to face detection, face recognition, expression detection and so on. http://blog.csdn.net/zhangjunhit/article/details/78435972

More about face alignment Modeling: each facial point can be detected according to visual patterns. However, when faces deviate from the frontal view, some landmarks become invisible. In medium-pose scenario, this can be tackled by changing semantic positions. Fitting: traditional linear/nonlinear cascaded regression is not enough. Data labeling: manual data labeling is not practical

Related works Generic face alignment: locate a sparse set of fiducial landmarks in 2D space Active Appearance Model: x = μ + P*b From mean model μ and some (b) linear combination of principal components P Fit one image to another image region, by altering b according to ΔI Constrained Local Model Large pose alignment 3D Alignment of Face in a Single Image (L. Gu, 2006) Construct a compact 3D shape prior on the sparse 3D point set, and apply it to constrain the 2D facial points in different views. Only based on sparse patch and facial points

3D Morphable Model In original Morphable 3D model (3DMM): S is shape vector (X1, Y1, Z1, X2,, Yn, Zn) T is color values of n vertices (R1, G1, B1, R2,, Gn, Bn) A Morphable Model For The Synthesis Of 3D Faces, V. Blanz, 1999

3D Morphable Model A Morphable Model For The Synthesis Of 3D Faces, V. Blanz, 1999

3D Morphable Model V(p) is the model construction and projection function, leading to the 2D positions of model vertices, where f is the scale factor, Pr is the orthographic projection matrix. The collection of all the model parameters is p = [f, pitch, yaw, roll, t_2d, ɑ_id, ɑ_exp], which is 234-dimensional.

Projected Normalized Coordinate Code

Cascaded CNN Network Cost function:

Cost function analysis CNN first concentrates on the parameters with larger wi, such as scale, rotation and translation. As the trained parameters p closer to ground truth, CNN will optimize less important parameters.

What is face profiling and why do we need it? All the discriminative methods rely on training data, especially for CNN Few released face alignment database contains large-pose samples Face profiling: generate profile view of faces from medium-pose images. By doing face profiling, the authors are able to generate as enough training data as possible

3D Image Meshing Fit a 3DMM through the Multi-Features Framework (MFF) The MFF algorithm can always fit with the groundtruth landmarks Few failed samples can be easily adjusted manually On external regions, follow the 3D meshing algorithm to estimate depth information of anchor points

Generating images by 3D rotation When the depth information is estimated, the face image can be rotated in 3D space to generate the appearances in larger poses. The authors enlarge the yaw of the depth image at the step of 5 degree until 90 degree. Through face profiling, we not only obtain the face appearances in large poses and but also augment the dataset to a large scale.

Initialization Regeneration How to avoid overfitting? Model the perturbation of a training sample with a set of similar face posture samples Create validation set from members which share similar face postures. Update: By doing so, guide derivative in a meaningful way.

Landmark refinement HOG is adopted to refine the location of landmarks after 3DDFA.

Experiments: w/o regeneration

Experiments: different cost functions

Experiments: comparison

Conclusion Solve dense face alignment across large poses New face profiling algorithm generates dataset with large-pose faces Simple yet effective cost function Fit dense 3D morphable model with cascaded CNN

Extensions Since this method is still based on 3D morphable model, so the parameters has to based on PCA results from training data. Doing PCA over large dataset can be challenging. They still use HOG features to do post-processing for landmarks, which is not efficient. Potential improvement is to stacking more CNN layers to extract better feature map and then generate structure parameters or facial point positions. Instead of feeding new p into the next phase of network, maybe we can take advantage of the rotation parameters, thus iteratively improving the fitting.