Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two component

Similar documents
Face Tracking. Synonyms. Definition. Main Body Text. Amit K. Roy-Chowdhury and Yilei Xu. Facial Motion Estimation

3D Face Modelling Under Unconstrained Pose & Illumination

Image Morphing. Application: Movie Special Effects. Application: Registration /Alignment. Image Cross-Dissolve

Pose Normalization for Robust Face Recognition Based on Statistical Affine Transformation

3D FACE RECONSTRUCTION BASED ON EPIPOLAR GEOMETRY

Synthesizing Realistic Facial Expressions from Photographs

Occlusion Robust Multi-Camera Face Tracking

Registration of Dynamic Range Images

Face Relighting with Radiance Environment Maps

Morphable Displacement Field Based Image Matching for Face Recognition across Pose

Nonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.

Supplementary Material for Synthesizing Normalized Faces from Facial Identity Features

In Between 3D Active Appearance Models and 3D Morphable Models

The Future of Natural User Interaction

Flexible Calibration of a Portable Structured Light System through Surface Plane

Component-based Face Recognition with 3D Morphable Models

Construction of Frontal Face from Side-view Images using Face Mosaicing

FACIAL ANIMATION FROM SEVERAL IMAGES

[Nirgude* et al., 4(1): January, 2017] ISSN Impact Factor 2.675

arxiv: v1 [cs.cv] 28 Sep 2018

Efficient Model-based Linear Head Motion Recovery from Movies

Rapid 3D Face Modeling using a Frontal Face and a Profile Face for Accurate 2D Pose Synthesis

Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model

Lecture 7: Image Morphing. Idea #2: Align, then cross-disolve. Dog Averaging. Averaging vectors. Idea #1: Cross-Dissolving / Cross-fading

Depth estimation of face images

Component-based Face Recognition with 3D Morphable Models

The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method

AAM Based Facial Feature Tracking with Kinect

arxiv: v1 [cs.cv] 28 Sep 2018

A Morphable Model for the Synthesis of 3D Faces

Rectification and Distortion Correction

A FAST 3D FACE RECONSTRUCTION METHOD FROM A SINGLE IMAGE USING ADJUSTABLE MODEL. Tao Wu, Fei Zhou, Qingmin Liao*

Accurate 3D Face and Body Modeling from a Single Fixed Kinect

Lecture 14: Basic Multi-View Geometry

Light Field Appearance Manifolds

Illumination invariant face recognition and impostor rejection using different MINACE filter algorithms

Face Re-Lighting from a Single Image under Harsh Lighting Conditions

Landmark Detection on 3D Face Scans by Facial Model Registration

Object. Radiance. Viewpoint v

MERGING POINT CLOUDS FROM MULTIPLE KINECTS. Nishant Rai 13th July, 2016 CARIS Lab University of British Columbia

A Survey of Light Source Detection Methods

Face Alignment Under Various Poses and Expressions

Face Alignment Across Large Poses: A 3D Solution

Displacement estimation

Lucas-Kanade Scale Invariant Feature Transform for Uncontrolled Viewpoint Face Recognition

TEXTURE OVERLAY ONTO NON-RIGID SURFACE USING COMMODITY DEPTH CAMERA

Face View Synthesis Across Large Angles

Facial Animation System Design based on Image Processing DU Xueyan1, a

Face Relighting with Radiance Environment Maps

Full-Motion Recovery from Multiple Video Cameras Applied to Face Tracking and Recognition

Shweta Gandhi, Dr.D.M.Yadav JSPM S Bhivarabai sawant Institute of technology & research Electronics and telecom.dept, Wagholi, Pune

VIDEO FACE BEAUTIFICATION

REAL-TIME FACE SWAPPING IN VIDEO SEQUENCES: MAGIC MIRROR

Face Cyclographs for Recognition

Registration of Expressions Data using a 3D Morphable Model

Recognizing Deformable Shapes. Salvador Ruiz Correa (CSE/EE576 Computer Vision I)

Introduction to Computer Vision

Image Coding with Active Appearance Models

Recovering Non-Rigid 3D Shape from Image Streams

A Novel Image Super-resolution Reconstruction Algorithm based on Modified Sparse Representation

Hand-Eye Calibration from Image Derivatives

Learning Articulated Skeletons From Motion

3D Active Appearance Model for Aligning Faces in 2D Images

Projector Calibration for Pattern Projection Systems

A Multiresolutional Approach for Facial Motion Retargetting Using Subdivision Wavelets

Robust Facial Feature Detection for Registration

Head Pose Estimation by using Morphological Property of Disparity Map

Integrating Range and Texture Information for 3D Face Recognition

FAST REGISTRATION OF TERRESTRIAL LIDAR POINT CLOUD AND SEQUENCE IMAGES

FACE ANALYSIS AND SYNTHESIS FOR INTERACTIVE ENTERTAINMENT

Fast Nonrigid Mesh Registration with a Data-Driven Deformation Prior

Model Based Perspective Inversion

Advanced Computer Graphics

Single view-based 3D face reconstruction robust to self-occlusion

Non-rigid stereo factorization

Learning a generic 3D face model from 2D image databases using incremental structure from motion

TEXTURE OVERLAY ONTO NON-RIGID SURFACE USING COMMODITY DEPTH CAMERA

Personalized Facial Animation Based on 3D Model Fitting from Two Orthogonal Face Images

Parametric Manifold of an Object under Different Viewing Directions

Multi-View AAM Fitting and Camera Calibration

NON-RIGID 3D SHAPE RECOVERY USING STEREO FACTORIZATION

Face Recognition based on a 3D Morphable Model

Algorithm research of 3D point cloud registration based on iterative closest point 1

Landmark Weighting for 3DMM Shape Fitting

Adaptive Skin Color Classifier for Face Outline Models

Tracking facial features using low resolution and low fps cameras under variable light conditions

Sample Based Texture extraction for Model based coding

Passive driver gaze tracking with active appearance models

Articulated Structure from Motion through Ellipsoid Fitting

calibrated coordinates Linear transformation pixel coordinates

Advances in Face Recognition Research

Handbook of Face Recognition

c 2014 Gregory Paul Meyer

Dynamic 2D/3D Registration for the Kinect

Computer Vision: Lecture 3

Announcements. Recognition (Part 3) Model-Based Vision. A Rough Recognition Spectrum. Pose consistency. Recognition by Hypothesize and Test

CHAPTER 3 DISPARITY AND DEPTH MAP COMPUTATION

Speech Driven Synthesis of Talking Head Sequences

Partial Calibration and Mirror Shape Recovery for Non-Central Catadioptric Systems

Image-based Motion-driven Facial Texture

Transcription:

A Fully Automatic System To Model Faces From a Single Image Zicheng Liu Microsoft Research August 2003 Technical Report MSR-TR-2003-55 Microsoft Research Microsoft Corporation One Microsoft Way Redmond, WA 98052 http://www.research.microsoft.com

Abstract We present a system which automatically generates a 3D face model from a single frontal image of a face. Our system consists of two components. The first component is the feature feature detection, and the second component is the model fitting. We use an existing feature detection software for the first component. After we detect the face features, we fit a 3D face model by using a linear space of face geometries and assuming an orthogonal projection. Even though the depths of the resulting face models are usually not accurate, the models look recognizable as long as the view angle is not too far from the front. Our system has the advantage that it is fully automatic, robust, and fast. It can be used in a variety of applications for which the accuracy of depths are not critical such as games, avatars, face recognition on close-to-front-view images, etc. 1 Introduction There has been a lot of work on face modeling from images. One technique which has been used in many commercial system is to use two orthogonal views [1,6, 3]: one frontal view and one side view. Such systems require the users to manually specify the face features on the two images. Some of the commercial systems have tried to use some feature findings to reduce the amount of manual work. Another type of face modeling system creates face models from a video sequence [4, 8]. Since it has images of multiple views, it can potentially compute the correct depth and can generate a texture image for the entire face. However, it requires the user to have a video camera. In addition, it usually requires some amount of user input to make it robust. In the situations where the user only has a single image, our system is more useful. Blanz and Vetter [2] developed a system to create face models from a single image. Their system uses both a geometry database and an image database. It can only model the faces whose skin colors are covered by their database. Their system is computationally more expensive. 2 Linear class of face geometries We the same representation for the face models as in [8]. A face is represented as a linear combination of a neutral face and some number of face metrics. A metric is vector that linearly deforms a face in certain way, such astomake the head wider, make the nose bigger, etc. Let us denote the face geometry by avector S = (v T 1 ;:::;vt n ) T where v i = (X i ;Y i ;Z i ) T (i = 1;:::;n) are the vertices, and a metric by avector M =(ffiv1;:::;ffiv n ) T, where ffiv i =(ffix i ;ffiy i ;ffiz i ) T. Given a neutral face S 0 = (v 0 1 T ;:::;v 0 nt ) T, and a set of m metrics M j = mx (ffiv1 j T ;:::;ffivnt j ) T, the linear space of face geometries spanned by these metrics is S = S 0 + c j M j subject to c j 2 [l j ;u j ] (1) j=1 1

where c j 's are the metric coefficients and l j and u j are the valid range of c j. The neutral face and all the metrics are designed by an artist, and it is done once. The neutral face (see Figure 1) contains 194 vertices and 360 triangles. There are 65 metrics. Figure 1: Neutral face. 3 Face modeling from a single view 3.1 Face feature alignment Given an image of a face, to find the feature points on the face, we first use an existing face detector software [7] to detect the face. We then use the face alignment software by Yan et al [9] to find the face features. Figure 2 shows an input image and the alignment result. Figure 2: Left: Input image. Right: The result from image alignment. 3.2 Model fitting We assume that the projection is orthogonal, and there is no out of the plane rotations for the face. Without loss of generality, let us denote v i =(X i ;Y i ;Z i ) T,(i = 1;:::;f)tobethefeaturepoints. Denote μv i =(X i ;Y i ) to be the projection of v i 2

on the XY plane. For each feature point v i, denote m i to be its corresponding coordinate on the input image. Let R denote the 2x2 rotation matrix, t be the 2D translation vector, and s be the scale. We then have the following equation: From equation 1, we have Therefore sr μv i + t = m i (2) mx μv i = μv 0 i + c j ffiμv j i (3) j=1 mx sr(μv 0 i + c j ffiμv j i )+t = m i (4) j=1 We solve this equation iteratively. We first estimate the s, R, and t by assuming c i to be zero vector. This is done by using the technique as described in [5]. Then we fixs, R, andt, and equation 4 becomes a linear system which can be solved by using a linear least square procedure. We can then re-estimate s, R, and t by using the new estimates of c i 's, and so on. In our experiments, we find that one or two iterations are usually sufficient. 4 Results Figure 3 shows the different views of the reconstructed 3D model based on the input image in Figure 2. We can see the frontal view (the image in the middle) looks very good as expected. There are quite large rotations for the images on the left and right. These two images still look quite recognizable. Once the models are constructed, we can immediately animate the face including generating different expressions and using text-to-speech to create lipsynced animation. We use the same mechanism as in [4, 8] to create facial animations. Figure 4 shows the three facial expressions generated by our system. The images we use are 640x480. The total computation time for each image is about 7 seconds on a 1.7GHz PC. The main computation cost is the face alignment program. 5 Conclusion We have presented a system to construct 3D face models for a single front image. The system is fully automatic. It is fast compared to the other face modeling systems. Furthermore, it is very robust. It can be used to construct personalized face models for games, online chat, etc. It can also be used as a tool to generate database of faces with various poses which are needed by the face recognition systems. 3

Figure 3: The different views of the 3D model generated from the input image in Figure2. Figure 4: Generating different expressions for the constructed face model. References [1] T. Akimoto, Y. Suenaga, and R. S. Wallace. Automatic 3d facial models. IEEE Computer Graphics and Applications, 13(5):16 22, September 1993. [2] V. Blanz and T. Vetter. A morphable model for the synthesis of 3d faces. In Computer Graphics, Annual Conference Series, pages 187 194. Siggraph, August 1999. [3] B. Dariush, S. B. Kang, and K. Waters. Spatiotemporal analysis of face profiles: Detection, segmentation, and registration. In Proc. of the 3rd International Conference on Automatic Face and Gesture Recognition, pages 248 253. IEEE, April 1998. [4] P. Fua and C. Miccio. From regular images to animated heads: A least squares approach. In Eurographics of Computer Vision, pages 188 202, 1996. [5] B. K. Horn. Closed-form Solution of Absolute Orientation using Unit Quaternions. Journal of the Optical Society A, 4(4):629 642, Apr. 1987. [6] H. H.S.Ip and L. Yin. Constructing a 3d individualized head model from two orthogonal views. The Visual Computer, (12):254 266, 1996. [7] S. Z. Li and L. Gu. Real-time multi-view face detection, tracking, pose estimation, alignment, and recognition. In IEEE Conf. on Computer Visioin and Pattern Recognition Demo Summary, 2001. 4

[8] Z. Liu, Z. Zhang, C. Jacobs, and M. Cohen. Rapid modeling of animated faces from video. Journal of Visualization and Computer Animation, 12(4):227 240, Sep. 2001. [9] S. Yan and et al. Ranking prior local confidence model for statistical shape localization. In submitted, 2003. 5