Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling

Similar documents
Enhanced Active Shape Models with Global Texture Constraints for Image Analysis

Generic Face Alignment Using an Improved Active Shape Model

Face Alignment Under Various Poses and Expressions

Feature Detection and Tracking with Constrained Local Models

Disguised Face Identification Based Gabor Feature and SVM Classifier

Intensity-Depth Face Alignment Using Cascade Shape Regression

Hierarchical Ensemble of Gabor Fisher Classifier for Face Recognition

Pose Normalization for Robust Face Recognition Based on Statistical Affine Transformation

Human pose estimation using Active Shape Models

Sparse Shape Registration for Occluded Facial Feature Localization

DISTANCE MAPS: A ROBUST ILLUMINATION PREPROCESSING FOR ACTIVE APPEARANCE MODELS

Active Appearance Models

Real-time facial feature point extraction

Eye Detection by Haar wavelets and cascaded Support Vector Machine

AAM Based Facial Feature Tracking with Kinect

Active Wavelet Networks for Face Alignment

Texture Features in Facial Image Analysis

Automatic Rapid Segmentation of Human Lung from 2D Chest X-Ray Images

Facial Feature Detection

Subject-Oriented Image Classification based on Face Detection and Recognition

Robust Face Alignment Based on Hierarchical Classifier Network

Face Alignment Using Statistical Models and Wavelet Features

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Analysis Using Curve Edge Maps

Mouse Pointer Tracking with Eyes

A Multi-Stage Approach to Facial Feature Detection

Face detection and recognition. Detection Recognition Sally

Fully Automatic Multi-organ Segmentation based on Multi-boost Learning and Statistical Shape Model Search

A Robust Image Zero-Watermarking Algorithm Based on DWT and PCA

Face Detection Using Look-Up Table Based Gentle AdaBoost

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Illumination Invariant Face Alignment Using Multi-band Active Appearance Model

Detection of a Single Hand Shape in the Foreground of Still Images

Graph Matching Iris Image Blocks with Local Binary Pattern

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Face Recognition Using Long Haar-like Filters

Feature-Aging for Age-Invariant Face Recognition

Facial Feature Points Tracking Based on AAM with Optical Flow Constrained Initialization

Facial Landmarks Detection Based on Correlation Filters

Face Tracking in Video

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

FACE DETECTION BY HAAR CASCADE CLASSIFIER WITH SIMPLE AND COMPLEX BACKGROUNDS IMAGES USING OPENCV IMPLEMENTATION

Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge

Binary Principal Component Analysis

NOWADAYS, there are many human jobs that can. Face Recognition Performance in Facing Pose Variation

Face Alignment Using Active Shape Model And Support Vector Machine

Hand Posture Recognition Using Adaboost with SIFT for Human Robot Interaction

On-line, Incremental Learning of a Robust Active Shape Model

Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information

Learning to Fuse 3D+2D Based Face Recognition at Both Feature and Decision Levels

Dynamically Adaptive Tracking of Gestures and Facial Expressions

Hand Gesture Extraction by Active Shape Models

Decorrelated Local Binary Pattern for Robust Face Recognition

Face Analysis using Curve Edge Maps

Illumination-Robust Face Recognition based on Gabor Feature Face Intrinsic Identity PCA Model

Criminal Identification System Using Face Detection and Recognition

Face Detection and Alignment. Prof. Xin Yang HUST

The most cited papers in Computer Vision

Human Upper Body Pose Estimation in Static Images

Face recognition based on improved BP neural network

Localized Principal Component Analysis Learning for Face Feature Extraction and Recognition

Cross-pose Facial Expression Recognition

Facial Feature Extraction Based On FPD and GLCM Algorithms

3D Active Appearance Model for Aligning Faces in 2D Images

A Hybrid Face Detection System using combination of Appearance-based and Feature-based methods

An Adaptive Eigenshape Model

Determination of 3-D Image Viewpoint Using Modified Nearest Feature Line Method in Its Eigenspace Domain

Context-based additive logistic model for facial keypoint localization

Classification of Face Images for Gender, Age, Facial Expression, and Identity 1

A Bayesian Model for Extracting Facial Features

Face Alignment with Part-Based Modeling

A Cascade of Feed-Forward Classifiers for Fast Pedestrian Detection

Face and Nose Detection in Digital Images using Local Binary Patterns

Multi-Attribute Robust Facial Feature Localization

Face Tracking Implementation with Pose Estimation Algorithm in Augmented Reality Technology

Face View Synthesis Across Large Angles

Learning the Three Factors of a Non-overlapping Multi-camera Network Topology

Progress Report of Final Year Project

K-Means Based Matching Algorithm for Multi-Resolution Feature Descriptors

Pixel-Pair Features Selection for Vehicle Tracking

Robust 3D Face Tracking on Unknown Users with Dynamical Active Models

Video Google faces. Josef Sivic, Mark Everingham, Andrew Zisserman. Visual Geometry Group University of Oxford

Statistical Models of Face Images - Improving Specificity

Robust FEC-CNN: A High Accuracy Facial Landmark Detection System

Gender Classification Technique Based on Facial Features using Neural Network

Head Frontal-View Identification Using Extended LLE

Detection of a Hand Holding a Cellular Phone Using Multiple Image Features

Combining Gabor Features: Summing vs.voting in Human Face Recognition *

Reliable Face Recognition for Intelligent CCTV

Gabor Volume based Local Binary Pattern for Face Representation and Recognition

A Comparative Evaluation of Active Appearance Model Algorithms

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

Robust Lip Contour Extraction using Separability of Multi-Dimensional Distributions

Dynamic Human Fatigue Detection Using Feature-Level Fusion

Improving Recognition through Object Sub-categorization

HUMAN S FACIAL PARTS EXTRACTION TO RECOGNIZE FACIAL EXPRESSION

Learning the Deep Features for Eye Detection in Uncontrolled Conditions

Coarse-to-Fine Statistical Shape Model by Bayesian Inference

Transcription:

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling Zhiheng Niu 1, Shiguang Shan 2, Xilin Chen 1,2, Bingpeng Ma 2,3, and Wen Gao 1,2,3 1 Department of Computer Science and Engineering, Harbin Institute of echnology, Harbin, 150001, China zhniu@jdl.ac.cn 2 Institute of Computing echnology, Chinese Academy of Sciences, Beijing, 100080, China {sgshan,xlchen,bpma,wgao}@jdl.ac.cn 3 Graduate School of the Chinese Academy of Sciences, Beijing, 100039, China Abstract. Active Shape Model (ASM) has been recognized as one of the typical methods for image understanding. Simply speaking, it iterates two steps: profile-based landmarks local searching, and statistics-based global shape modeling. We argue that the simple 1D profile matching may not localize landmarks accurately enough, and the unreliable localized landmarks will mislead the following shape matching. Considering these two problems, we propose to enhance ASM from two aspects: (1) in the landmarks local searching step, we introduce more efficient AdaBoost method to localize some salient landmarks instead of the relatively simple profile matching as in the traditional ASMs; (2) in the global shape modeling step, the confidences of the landmark localization are exploited to constrain the shape modeling and reconstruction procedure by not using those unreliably located landmarks to eliminate their negative effects. We experimentally show that the proposed strategies can impressively improve the accuracy of the traditional ASMs. 1 Introduction In most pattern recognition and computer vision tasks, the localization and alignment of target object from an image is a task of great importance. o deal with the problem, many methods have been proposed in recent years including active contour models (snake) [1], deformable template [2], elastic bunch graph matching [3], Gabor wavelet networks [4], Active Shape Models (ASMs) [5] and Active Appearance Models (AAMs) [6] etc. Among all these methods, ASMs and AAMs, both based on statistical models, has been recognized to be efficient and effective for image interpretation. Simply speaking, ASMs iterate two steps: profile-based landmarks local searching, and statistics-based global shape modeling. In the first step, local texture on the direction perpendicular to the contour, so-called profile, is exploited to model the local texture of each landmark and search for the landmarks locally. he global shape models are then applied in the second step to correct the local search result according to the statistical shape model. he profile-based searching in ASMs provides an efficient way to localize landmarks, which has made ASMs computationally fast. However, due to the variations in pose, lighting, and expressions, local texture of the same landmark may vary dramati- S.Z. Li et al. (Eds.): IWBRS 2005, LNCS 3781, pp. 9 14, 2005. Springer-Verlag Berlin Heidelberg 2005

10 Zhiheng Niu et al. cally. So, the simple 1D profile model may not localize landmarks accurately enough for further process. herefore, we argue that some elaborate local texture matching and searching method should improve the final performance of the ASMs, especially for those most significant landmarks. In this paper, AdaBoost based on Haar wavelets which has achieved great success in face detection [7] is introduced to localize some salient landmarks. Furthermore, in the shape matching step of ASMs, all the landmarks are equivalently treated. However, one can intuitively imagine that some landmarks inherently cannot be located accurately, e.g., some contour points, while others may inherently be easy to localize, e.g., the eyes. It is easy to understand that the imprecisely located landmarks will mislead the following shape matching process. Aiming at this problem, this paper proposes to exploit the confidence of the local matching and reconstruct the shape model according to those reliably localized landmarks with high confidence. hus, the possibly incorrect landmarks are not involved in the shape matching process. 2 Enhanced ASMs (EASMs) he system overview of the proposed Enhanced ASMs is illustrated in Figure 1, in which two main contributions can be highlighted: one is the AdaBoost-based salient landmarks location, the other is the shape fitting based on only reliable landmarks. In the following two sections, we will take facial landmarks localization for example to describe these two contributions in detail. Readers can easily generalize it to the analysis of other objects. Object Detected erminated Searching by AdaBoost Detector N Y Cconverged? Reconstructed Shape Searched Feature Points Low Response Unreliable Feature Points High Response Reliable Feature Points Evaluating the Unknown Expected Feature Points Shape Model Reconstruct the Shape Fig. 1. System overview 2.1 AdaBoost-Based Salient Landmarks Localization he Adaboost we used for salient landmarks localization is quite similar to that for face detection proposed by Viola and Jones [7]. he main difference lies in the adoption of more constraints when preparing the training data, especially the negative (non-landmark) examples, which are all derived from the neighborhood region of the specific facial landmark. his is reasonable considering that we need only to search these landmarks in the face region detected by a previous face detector, such as Viola and Jones AdaBoost method. For the same reason, in AdaBoost searching, the candidate search region for each landmark is also restricted elaborately. Besides, all the AdaBoost-based landmark detectors can share the same integral image, which can further speed up the procedure.

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization 11 For face case, we define totally 44 landmarks, 13 contour points and 31 inner points respectively. Since the contour points are intuitively more appropriate to localize by using profile-based local search, only the 31 inner points are localized using the abovementioned AdaBoost-based method. Note that, in the iteration of the ASM searching, each AdaBoost-based landmark detector naturally provides a detection confidence indicating the accuracy of the detection, which will in turn determine whether the corresponding landmark can be used for sequential shape fitting. Cristinacce and Cootes [8], [9] has also used boosted classifiers similarly, with a Shape Constrained manner, and achieve good experimental result. 2.2 Confidences-Constraint Shape Reconstruction In traditional ASMs, once the landmarks are locally located in the local search step, a statistical model, pre-trained from a training set based on PCA, is acted on all the candidate landmarks to find a model best matching with the current candidate shape. his procedure can be formulated as follows: x = x +Pb (1) where x = [ x, y, x, y,, x, y ] is the shape vector concatenate the coordinates 0 0 1 1 n 1 n 1 of all the landmarks, x is the mean shape vector, P is the matrix containing the principle components of the shape vector, and b is the shape parameters of x when projected to the shape subspace. In this paper, we argue that candidate landmarks with very low detection confidence may decrease the accuracy of the sequential model fitting. So, we propose to utilize only those landmarks with high enough confidence for model fitting. Firstly, in the current iteration, all the candidate landmarks are categorized into two sets, reliable candidates set (RCS) and unreliable candidates set (UCS), by the following criteria: Li RCS,if Ci > θi (2) Li UCS,if Ci θi where Li denotes the i th landmark, C i is the detection confidence of L i, and θi is the threshold for L i. And we assume the number of landmarks in RCS and UCS are r and u respectively. hen, we try to perform PCA by using only the r reliable landmarks, considering the u unreliable landmarks as missing data. hus, in shape vector x, there are m =2r elements are known and k =2u elements missing. We have the transform matrix P that will transform the covariance matrix of x to 1/2 diagonal matrix Λ which contains its eigenvalues, and Q=PΛ that will transform it to identity matrix. herefore the coordinates in variance-normalized space is denoted as z=q (x-x). here exists a row transform matrix R, where RR=RR =I, that will transform x to x=r(x-x) whose first m elements are known and the last k elements are unknown. he objective function we try to minimize is the variance-normalized distance from shape x to the mean shape x, it comes from the fact that the shorter the distance is the shapes are more like to be.

12 Zhiheng Niu et al. zz= Q(x-x) Q(x-x)=x RQQRx (3) αm It is actually a least square problem. We partition them as x=r(x-x) = αk Cmm Cmk and C=R QQ R=, Where C C km C ij is i j matrix and α i is i-dimensional kk column vector. α is the only independent variable of objective function. Finally we k represent the unknown elements (unreliable points) by the know elements (reliable points) and the previous established shape model. 1 α ˆ k =-Ckk Ckmα m (4) he predicted shape vector is: αm xˆ = Rxˆ + x = R + ˆ x (5) αk Radically our reconstructing of shape by a part of landmarks, reliable points, is a kind of PCA with missing data. We develop this simple but efficient method to solve the problem instead of complicated EM algorithm and Probabilistic Principal Component Analysis [10], [11]. 3 Experiments and Analysis We test the algorithm on two data sets. he first is the face database built by our lab. he images are 240*320, and with 103 manually labeled landmarks. We compare the method introduced in this paper and the conventional ASM on this data set. he second is a publicly available image set known as the BioID database with 20 manually labeled landmarks, used by D. Cristinacce and. Cootes [9]. We have a comparison with their approach. Our algorithm relies on a global face detector to detect a rectangle of upright front face which is part of the algorithm s input. We use the face detector proposed by Viola and Jones [7]. 3.1 ests on Our Own Database We test 400 images contained a frontal face on the first data set. We measure the positional error of each feature and express it as a proportion of the inter-ocular distance. he x-axis is the No. of the facial feature points, and the y-axis is the error distance. he mean error of our method is 0.047, while the conventional ASM is 0.084. Our method achieves a good accuracy. We tests three different methods: the EASM with AdaBoost-based Searching and Confidence-Constraint Shape (ASM A+C ), ASM and ASM with AdaBoost-based Searching (ASM A ). Figure 2 shows our method is significantly accuracy and demonstrates that using reliable points to reconstruct shape make the algorithm more robust. his is because the mechanism of reconstructing shape decreases the effect of the improperly searched points to the final shape.

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization 13 Fig. 2. Average positional error of the 44 feature points comparing with ASM and ASM A 3.2 ests on BioID Face Database he BioID images consist of 1521 images of frontal faces with a set of 20 manually labeled feature points. he images are taken in uncontrolled conditions using a web camera within an office environment. We test our method under the same esting Criteria described by D. Cristinacce and. Cootes [9]. he comparison of EASM and SOS is shown in able 1, where m e17 is the mean error distance of the 17 points [9]. Note that the proportion of successful searches of our EASM should be relatively higher if our face detection failures are the same 74 as in SOS, considering that most of the detection failures are low quality images much harder to analyze. able 1. Comparison of EASM and SOS [9] using the same 17 feature points Methods Global face Proportion of successful searches detection failures m e17 <0.10 m e17 <0.15 EASM 51 84% 97% SOS[9] 74 85% 96% Because of the eyes localization is much important in face recognition and some other applications we pay much concentration in eyes searching and localization. Our method achieves high accuracy in eyes localization with 95% of the tested images on BioID data set whose errors are less than 0.1 and in this condition the conventional ASM achieves only 22% (shown in right side of Figure 3). Fig. 3. Some examples of searching results of ASM (up) and EASM (down), and the right side shows average positional error of the eyes, comparing EASM and ASM, on BioID 4 Conclusions and Future Work In this paper, we present our Enhanced Active Shape Models (EASMs) by using the more accurate AdaBoost-based landmark detector instead of the relatively simple

14 Zhiheng Niu et al. profile matching, and exploiting the confidences of the landmark localization to constrain the shape modeling and reconstruction in order to avoid the negative effects of those unreliably located landmarks. We empirically show that these two improvements can impressively improve the accuracy of the ASMs. Especially, the technique that only the reliable points are used to reconstruct the shape has many advantages: firstly we have decreased the effect of the improperly searched points to the final shape; secondly we can use only part of the landmark points to set the initial shape instead of specific feature points (e.g. eyes), if these feature points are sheltered, it does not necessarily change the final result; thirdly we can easily change the searching strategy and keep the shape model unchanged just by setting the unused feature points invalid. Currently, the landmarks are directly divided two distinct groups according to their localization confidence, and only the so-called reliable landmarks are involved in the shape modeling. However, considering that the confidences are real continuous values, in the future, we will try confidences-weighted shape modeling methods to make better use of the information. How to further expand the proposed methods to AAMs will also be considered in the future. Acknowledgments his research is partially sponsored by Natural Science Foundation of China under contract No.60332010, "100 alents Program" of CAS, ShangHai Municipal Sciences and echnology Committee (No.03DZ15013), and ISVISION echnologies Co., Ltd. References 1. M. Kass, A. Witkin, and D. erzopoulos. Snakes: Active contour models. International Journal of Computer Vision, pp. 321 331, 1988. 2. A.L. Yuille. Deformable templates for face detection, J. Cogn. neurosci. 3, pp.59 70, 1991. 3. L. Wiskott, J. M. Fellous, N. Kruger, C. Malsburg, Face Recogniton by Elastic Bunch Graph Matching, IEEE PAMI, Vol.19, No. 7, pp. 775 779, 1997. 4. V. Kruger and G. Sommer. Gabor wavelet networks for object representation. echnical Report 2002, Institute of Computer Science, University of Kiel, 2000. 5..F. Cootes, C. J. aylor, D.H. Cooper, and J. Graham. Active shape models - their training and application. CVIU, 61(1), pp. 38 59, January 1995. 6..F. Cootes, G. J. Edwards, and C. J. aylor. Active appearance models. In 5 th ECCV, volume 2, pp.484 498. Springer, Berlin, 1998. 7. P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001, volume 1, pp.511 518, Kauai, Hawaii, 2001. 8. D. Cristinacce and. F. Cootes. Facial feature detection using AdaBoost with shape constraints. In 14 th BMVC, pp. 231 240, 2003. 9. D. Cristinacce and. F. Cootes. A Comparison of Shape Constrained Facial Feature Detectors. Proceedings of the Sixth IEEE FGR, Seoul, Korea, 2004. 10. Haifeng Chen. Principal Component Analysis with Missing Data and Outliers. echnical Report 2002, Electrical and Computer Engineering Department Rutgers University, Piscataway, NJ, 08854. 11. M.E. ipping and C. M. Bishop. Probabilistic principal component analysis. Journal of the Royal Statistical Society, Series B, 61, Part 3, pp. 611 622, 1999.