Supplemental Material for Face Alignment Across Large Poses: A 3D Solution

Similar documents
Face Alignment Across Large Poses: A 3D Solution

Supplementary Material for Synthesizing Normalized Faces from Facial Identity Features

Landmark Weighting for 3DMM Shape Fitting

Face Recognition Markus Storer, 2007

MoFA: Model-based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction

Deep Face Feature for Face Alignment and Reconstruction

Robust FEC-CNN: A High Accuracy Facial Landmark Detection System

Locating Facial Landmarks Using Probabilistic Random Forest

Supplementary Materials for A Lighting Robust Fitting Approach of 3D Morphable Model For Face Reconstruction

arxiv: v2 [cs.cv] 8 Sep 2017

A Fully End-to-End Cascaded CNN for Facial Landmark Detection

Face Alignment across Large Pose via MT-CNN based 3D Shape Reconstruction

Pose-Invariant Face Alignment with a Single CNN

arxiv: v1 [cs.cv] 11 Jun 2015

Intensity-Depth Face Alignment Using Cascade Shape Regression

Faster Than Real-time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250 Hz Supplemental Material

The Nottingham eprints service makes this work by researchers of the University of Nottingham available open access under the following conditions.

Pix2Face: Direct 3D Face Model Estimation

arxiv: v1 [cs.cv] 29 Sep 2016

Shape Augmented Regression for 3D Face Alignment

Facial Landmark Tracking by Tree-based Deformable Part Model Based Detector

3D Active Appearance Model for Aligning Faces in 2D Images

Motion capturing via computer vision and machine learning

3D Face Modelling Under Unconstrained Pose & Illumination

arxiv: v1 [cs.cv] 16 Nov 2015

arxiv: v1 [cs.cv] 24 Sep 2018

Human Face Shape Analysis under Spherical Harmonics Illumination Considering Self Occlusion

Evaluation of Dense 3D Reconstruction from 2D Face Images in the Wild

On 3D face reconstruction via cascaded regression in shape space

OVer the last few years, cascaded-regression (CR) based

Face Alignment Under Various Poses and Expressions

Combining Local and Global Features for 3D Face Tracking

How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)

Towards Large-Pose Face Frontalization in the Wild

arxiv:submit/ [cs.cv] 7 Sep 2017

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

arxiv: v3 [cs.cv] 26 Aug 2018 Abstract

Pose-Invariant Face Alignment via CNN-Based Dense 3D Model Fitting

Disentangling Features in 3D Face Shapes for Joint Face Reconstruction and Recognition

arxiv: v1 [cs.cv] 21 Mar 2018

Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network

arxiv: v1 [cs.cv] 17 Aug 2017

3D Morphable Model Parameter Estimation

arxiv: v5 [cs.cv] 26 Jul 2017

Unconstrained Face Alignment without Face Detection

Joint Head Pose Estimation and Face Alignment Framework Using Global and Local CNN Features

Learn to Combine Multiple Hypotheses for Accurate Face Alignment

In Between 3D Active Appearance Models and 3D Morphable Models

Tweaked residual convolutional network for face alignment

Simultaneous Facial Landmark Detection, Pose and Deformation Estimation under Facial Occlusion

Reflective Regression of 2D-3D Face Shape Across Large Pose

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Cascade Multi-view Hourglass Model for Robust 3D Face Alignment

Deep Alignment Network: A convolutional neural network for robust face alignment

Dataset Augmentation for Pose and Lighting Invariant Face Recognition

Holistically Constrained Local Model: Going Beyond Frontal Poses for Facial Landmark Detection

arxiv: v2 [cs.cv] 4 Mar 2018

Sparse Shape Registration for Occluded Facial Feature Localization

Deep Evolutionary 3D Diffusion Heat Maps for Large-pose Face Alignment

Rapid 3D Face Modeling using a Frontal Face and a Profile Face for Accurate 2D Pose Synthesis

Real-time Multi-view Facial Landmark Detector Learned by the Structured Output SVM

arxiv: v1 [cs.cv] 2 Sep 2017

Enhanced Active Shape Models with Global Texture Constraints for Image Analysis

Single view-based 3D face reconstruction robust to self-occlusion

Generic Face Alignment Using an Improved Active Shape Model

Landmark Detection and 3D Face Reconstruction using Modern C++

Dynamic Attention-controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-set Sample Weighting

Sieving Regression Forest Votes for Facial Feature Detection in the Wild

arxiv: v2 [cs.cv] 10 Aug 2017

Analyzing the Relationship Between Head Pose and Gaze to Model Driver Visual Attention

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component

Face Recognition Using a Unified 3D Morphable Model

Convolutional Experts Constrained Local Model for 3D Facial Landmark Detection

Learning Detailed Face Reconstruction from a Single Image

Face View Synthesis Across Large Angles

Face Detection by Aggregating Visible Components

Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge

Coarse-to-Fine Statistical Shape Model by Bayesian Inference

The First 3D Face Alignment in the Wild (3DFAW) Challenge

Face Detection and Tracking Control with Omni Car

SUPPLEMENTARY MATERIAL FOR: ADAPTIVE CASCADED REGRESSION

Face Detection, Bounding Box Aggregation and Pose Estimation for Robust Facial Landmark Localisation in the Wild

Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

Fitting a Single Active Appearance Model Simultaneously to Multiple Images

A Deep Regression Architecture with Two-Stage Re-initialization for High Performance Facial Landmark Detection

Cross-pose Facial Expression Recognition

MULTI-POSE FACE HALLUCINATION VIA NEIGHBOR EMBEDDING FOR FACIAL COMPONENTS. Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo

Face Detection, Bounding Box Aggregation and Pose Estimation for Robust Facial Landmark Localisation in the Wild

Face Detection Algorithm Based on a Cascade of Ensembles of Decision Trees

Facial shape tracking via spatio-temporal cascade shape regression

TEXTURE OVERLAY ONTO NON-RIGID SURFACE USING COMMODITY DEPTH CAMERA

High-Fidelity Pose and Expression Normalization for Face Recognition in the Wild

Facial Feature Points Tracking Based on AAM with Optical Flow Constrained Initialization

Occlusion Robust Multi-Camera Face Tracking

Collaborative Facial Landmark Localization for Transferring Annotations Across Datasets

De-mark GAN: Removing Dense Watermark With Generative Adversarial Network

Face Recognition Based on Pose-Variant Image Synthesis and Multi-level Multi-feature Fusion

arxiv: v5 [cs.cv] 13 Apr 2018

Generic Active Appearance Models Revisited

Transcription:

Supplemental Material for Face Alignment Across Large Poses: A 3D Solution Xiangyu Zhu 1 Zhen Lei 1 Xiaoming Liu 2 Hailin Shi 1 Stan Z. Li 1 1 Institute of Automation, Chinese Academy of Sciences 2 Department of Computer Science and Engineering, Michigan State University {xiangyu.zhu,zlei,hailin.shi,szli}@nlpr.ia.ac.cn liuxm@msu.edu 1. Results Demo In this section, we demonstrate some alignment results of 3DDFA in AFLW in Fig. 1. Since the landmark visibility can be easily computed from the fitted dense 3D model [1], we also demonstrate the landmark visibility. Figure 1. The results of 3DDFA in AFLW. For each pair, the left one renders the fitted 3D shape with the mean texture, which is made transparent to demonstrate the fitting accuracy. The right one shows the landmarks overlayed on the 3D face model. The blue/red ones indicate visible/invisible landmarks. 1

Figure 2. The 300W-3D database. For each sample, the left one is the original image, the right one is the fitted 3DMM. 2. Database 2.1. 300W-3D This database contains the images in 300W [4] and their ground truth 3D faces. It is constructed by fitting the 3D Morphable Model with the Multi-Features Framework [3]. Different from the original algorithm, the 3D landmarks are adjusted by landmark marching [5] and the 68-landmarks constraint is adopted throughout the fitting process. Each sample is checked, few failed samples are adjusted manually. Fig. 2 demonstrates some samples in the database. 2.2. 300W-LP The 300W across Large Poses (300W-LP) database contains the synthesized face images from the face profiling algorithm described in Section 4. Some samples are demonstrated in Fig. 4. Besides, Fig. 3(a) and Fig. 3(b) demonstrate the yaw angle distribution before and after face profiling respectively. The distribution is reversed since face profiling only enlarges the yaw angle without shrinking it. Considering that the fidelity of a synthesized sample is negatively related with the yaw, we augment each training sample with the times of d(5 0.8 yaw/5 )e. Fig. 3(c) shows the yaw distribution after augmentation. (a) (b) (c) Figure 3. The yaw angle distribution of (a) 300W, (b) 300W-LP, (c) After augmentation. 2.3. AFLW2000-3D The AFLW2000-3D database contains the first 2000 samples in AFLW [2] with their ground truth 3D faces. This database is more challenging to construct than 300W-3D because the AFLW ignores the occluded landmarks (both occluded and self-

Figure 4. The 300W-LP database. For each sample, the first is the original image, followed by synthesized larger-pose faces, each with increased 10 degree yaw. Figure 5. The AFLW2000-3D database. For each sample, the left one is the original image, the right one is the fitted 3DMM. occluded) and the landmarks do not contain expression information (without lip landmarks). As a result we need a specifical method to deal with the unstandardised landmarks. Firstly, since AFLW provides the ground truth pose information, we

train a pose-dependent SDM (a SDM model for each yaw interval) with 68-landmark makeup on 300W-LP, with which the AFLW2000 is coarsely aligned for initialization. Secondly, we run Multi-Features Framework (MFF) 3DMM fitting method initialized by the 68 landmarks and constrained by the provided 21 visible landmarks. Finally, for the failed results, we label the necessary landmarks (always the upper and lower lip landmarks) and re-run the MFF. Fig. 5 demonstrates some samples in AFLW2000-3D. 3. Derivative of Vertex Distance Cost (VDC) The Vertex Distance Cost is defined in Section 3.4.2: E vdc = V (p 0 + p) V (p g ) 2 (1) where p = [f, pit, yaw, rol, t 2d, α id, α exp ] T contains all the parameters including the scale f, rotation angles pit, yaw, rol (which are short for pitch,yaw,roll), translation t 2d, shape α id and expression α exp. V (p) is the model construction and projection process defined as: V (p) = f Pr R (S + A id α id + A exp α exp ) + t 2d (2) [ ] 1 0 0 Pr = R = R 0 1 0 pit R yaw R rol (3) 1 0 0 cos(yaw) 0 sin(yaw) cos(rol) sin(rol) 0 R pit = 0 cos(pit) sin(pit) R yaw = 0 1 0 R rol = sin(rol) cos(rol) 0 (4) 0 sin(pit) cos(pit) sin(yaw) 0 cos(yaw) 0 0 1 where S is the mean 3D shape, A id and A exp are the shape and expression PCA models respectively and Pr is the weak perspective projection matrix. The derivative of VDC is [ p = f pit E vdc p = (V (p0 + p) V (p g )) T p yaw rol t 2dx p=p0 + p t 2dy α id ] (:) α exp (5) (6) R pit = f = Pr R (S + A idα id + A exp α exp ) (7) pit = f Pr R pit R yaw R rol (S + A id α id + A exp α exp ) (8) yaw = f Pr R pit R yaw R rol (S + A id α id + A exp α exp ) (9) rol = f Pr R pit R yaw R rol (S + A id α id + A exp α exp ) (10) [ ] [ ] 1 1 1 0 0 0 = = (11) t 2dx 0 0 0 t 2dy 1 1 1 = f Pr R A id = f Pr R A exp α id α exp (12) 0 0 0 0 sin(pit) cos(pit) R yaw = 0 cos(pit) sin(pit) where (:) is the concatenation operator which is the same as Matlab. sin(yaw) 0 cos(yaw) sin(rol) cos(rol) 0 0 0 0 R rol = cos(rol) sin(rol) 0, cos(yaw) 0 sin(yaw) 0 0 0 (13)

References [1] T. Hassner, S. Harel, E. Paz, and R. Enbar. Effective face frontalization in unconstrained images. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015. 1 [2] M. Köstinger, P. Wohlhart, P. M. Roth, and H. Bischof. Annotated facial landmarks in the wild: A large-scale, real-world database for facial landmark localization. In Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on, pages 2144 2151. IEEE, 2011. 2 [3] S. Romdhani and T. Vetter. Estimating 3D shape and texture using pixel intensity, edges, specular highlights, texture constraints and a prior. In Computer Vision and Pattern Recognition (CVPR), 2005 IEEE Conference on, volume 2, pages 986 993. IEEE, 2005. 2 [4] C. Sagonas, G. Tzimiropoulos, S. Zafeiriou, and M. Pantic. 300 faces in-the-wild challenge: The first facial landmark localization challenge. In Computer Vision Workshops (ICCVW), 2013 IEEE International Conference on, pages 397 403. IEEE, 2013. 2 [5] X. Zhu, Z. Lei, J. Yan, D. Yi, and S. Z. Li. High-fidelity pose and expression normalization for face recognition in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 787 796, 2015. 2