An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012

Similar documents
Tom-vs-Pete Classifiers and Identity- Preserving Alignment for Face Verification. Thomas Berg Peter N. Belhumeur Columbia University

CS 4495 Computer Vision A. Bobick. CS 4495 Computer Vision. Features 2 SIFT descriptor. Aaron Bobick School of Interactive Computing

Learning to Recognize Faces in Realistic Conditions

DA Progress report 2 Multi-view facial expression. classification Nikolas Hesse

Feature descriptors. Alain Pagani Prof. Didier Stricker. Computer Vision: Object and People Tracking

Describable Visual Attributes for Face Verification and Image Search

Partial Face Recognition

SCALE INVARIANT FEATURE TRANSFORM (SIFT)

Window based detectors

2D Image Processing Feature Descriptors

arxiv: v1 [cs.cv] 4 Aug 2011

Face Recognition A Deep Learning Approach

CS4495/6495 Introduction to Computer Vision. 8C-L1 Classification: Discriminative models

Introduction to SLAM Part II. Paul Robertson

Hybrid Deep Learning for Face Verification

Scale Invariant Feature Transform

Building a Panorama. Matching features. Matching with Features. How do we build a panorama? Computational Photography, 6.882

PEOPLE naturally recognize others from their face appearance.

The SIFT (Scale Invariant Feature

Local Features Tutorial: Nov. 8, 04

Face detection and recognition. Detection Recognition Sally

Patch-based Object Recognition. Basic Idea

Experiments of Image Retrieval Using Weak Attributes

LOCAL APPEARANCE BASED FACE RECOGNITION USING DISCRETE COSINE TRANSFORM

Scale Invariant Feature Transform

School of Computing University of Utah

Vignette: Reimagining the Analog Photo Album

FACIAL POINT DETECTION BASED ON A CONVOLUTIONAL NEURAL NETWORK WITH OPTIMAL MINI-BATCH PROCEDURE. Chubu University 1200, Matsumoto-cho, Kasugai, AICHI

Motion Estimation and Optical Flow Tracking

Gender Classification

Scale Invariant Feature Transform by David Lowe

EECS150 - Digital Design Lecture 14 FIFO 2 and SIFT. Recap and Outline

Feature Based Registration - Image Alignment

Multi-view Facial Expression Recognition Analysis with Generic Sparse Coding Feature

Features Points. Andrea Torsello DAIS Università Ca Foscari via Torino 155, Mestre (VE)

Classification and Detection in Images. D.A. Forsyth

Feature Detection. Raul Queiroz Feitosa. 3/30/2017 Feature Detection 1

Object Recognition with Invariant Features

TIED FACTOR ANALYSIS FOR FACE RECOGNITION ACROSS LARGE POSE DIFFERENCES

Multiple Kernel Learning for Emotion Recognition in the Wild

Local Gradient Order Pattern for Face Representation and Recognition

CS 223B Computer Vision Problem Set 3

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong

Image Features: Local Descriptors. Sanja Fidler CSC420: Intro to Image Understanding 1/ 58

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Deep Convolutional Neural Network using Triplet of Faces, Deep Ensemble, and Scorelevel Fusion for Face Recognition

Implementing the Scale Invariant Feature Transform(SIFT) Method

Part-based Face Recognition Using Near Infrared Images

arxiv: v1 [cs.cv] 16 Nov 2015

Applications Video Surveillance (On-line or off-line)

SUMMARY: DISTINCTIVE IMAGE FEATURES FROM SCALE- INVARIANT KEYPOINTS

Part-based Face Recognition Using Near Infrared Images

Improving Face Recognition by Exploring Local Features with Visual Attention

Face2Face Comparing faces with applications Patrick Pérez. Inria, Rennes 2 Oct. 2014

Designing Applications that See Lecture 7: Object Recognition

Local invariant features

State of The Art In 3D Face Recognition

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Dynamic Local Ternary Pattern for Face Recognition and Verification

Lucas-Kanade Scale Invariant Feature Transform for Uncontrolled Viewpoint Face Recognition

Local Features: Detection, Description & Matching

Cross-pose Facial Expression Recognition

Illumination invariant face recognition and impostor rejection using different MINACE filter algorithms

ROBUST PARTIAL FACE RECOGNITION USING INSTANCE-TO-CLASS DISTANCE

A GENERIC FACE REPRESENTATION APPROACH FOR LOCAL APPEARANCE BASED FACE VERIFICATION

Probabilistic Elastic Matching for Pose Variant Face Verification

Real-time Object Detection CS 229 Course Project

Using Geometric Blur for Point Correspondence

arxiv: v1 [cs.cv] 12 Mar 2014

Computer Vision for HCI. Topics of This Lecture

Feature Descriptors. CS 510 Lecture #21 April 29 th, 2013

FACE-GRAB: Face Recognition with General Region Assigned to Binary Operator

Automated Canvas Analysis for Painting Conservation. By Brendan Tobin

Sparse Variation Dictionary Learning for Face Recognition with A Single Training Sample Per Person

Tri-modal Human Body Segmentation

Local Image Features

Discriminative classifiers for image recognition

Recognizing Micro-Expressions & Spontaneous Expressions

Hybrid Deep Learning for Face Verification. Yi Sun, Xiaogang Wang, Member, IEEE, and Xiaoou Tang, Fellow, IEEE

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

EE368 Project Report CD Cover Recognition Using Modified SIFT Algorithm

TEXTURE CLASSIFICATION METHODS: A REVIEW

Lecture 10 Detectors and descriptors

NOWADAYS, there are many human jobs that can. Face Recognition Performance in Facing Pose Variation

On Modeling Variations for Face Authentication

CEE598 - Visual Sensing for Civil Infrastructure Eng. & Mgmt.

Image Processing Pipeline for Facial Expression Recognition under Variable Lighting

Object Category Detection: Sliding Windows

Computer Vision. Recap: Smoothing with a Gaussian. Recap: Effect of σ on derivatives. Computer Science Tripos Part II. Dr Christopher Town

Object Detection Design challenges

TA Section 7 Problem Set 3. SIFT (Lowe 2004) Shape Context (Belongie et al. 2002) Voxel Coloring (Seitz and Dyer 1999)

Face Detection Algorithm Based on a Cascade of Ensembles of Decision Trees

ROBUST LDP BASED FACE DESCRIPTOR

Recognition of Animal Skin Texture Attributes in the Wild. Amey Dharwadker (aap2174) Kai Zhang (kz2213)

Outline 7/2/201011/6/

On RGB-D Face Recognition using Kinect

SURF. Lecture6: SURF and HOG. Integral Image. Feature Evaluation with Integral Image

Eppur si muove ( And yet it moves )

Gender Recognition using Complexity-Aware Local Features

Transcription:

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012, 19.01.2012 INSTITUTE FOR ANTHROPOMATICS, FACIAL IMAGE PROCESSING AND ANALYSIS YIG University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association www.kit.edu

Outline Introduction Motivation Related works Basic ideas Approach scheme Identity data set Face components features Settings estimation Approach Appearance-prediction Likelihood-prediction Switching mechanism Results 2 19.01.2012

Introduction Basic ideas Approach Results 3 19.01.2012

Motivation Different pose Different illumination Different expression Simply different 4 19.01.2012

Motivation: the same / different settings Jeff Hawkins s On intelligence brain study Two types of face matching 1) Similar settings Direct matching (just measure component distances and compare them) Yes No 2) Different settings Distances not informative direct matching inefficient Life full of faces our memory == big face image gallery Use memory as bridge between two images Associate-predict matching 5 19.01.2012

Motivation: different settings Different settings 1) associate in memory database similar faces 2) predict from memory similar faces under searched settings 3) direct matching? Yes 6 19.01.2012

Related works Attribute and Simile Classifiers by Kumar et. al [ICCV 2009] 7 19.01.2012

Introduction Basic ideas Approach Results 8 19.01.2012

Memory People s memory == Machine s gallery 9 19.01.2012

Goal Main goal of our approach: to deal with intra-personal variation Basic idea: By different settings Find in the gallery suitable bridge between two compared images Two steps 10 19.01.2012

Association step First step: Associate face B with the most alike group from memory 11 19.01.2012

Prediction step Second step: Find the image with searched settings That will be our predict Similar settings: -frontal -illumination -neutral expression Details about settings estimation in further slides 12 19.01.2012

Big picture 13 19.01.2012

Identity data set 7 poses -60% -40% -20% 0% 20% 40% 60%) no flash left flash 4 illuminations right flash left-right flash 200 ids (persons) from Multi-PIE (CMU Face Database) For each person: 7 poses, 4 illuminations, 1 expression 14 19.01.2012

Feature extraction Four landmarks automatically detected Alignments for 12 components 15 19.01.2012

Descriptors LBP extract intensity for each pixel and its neighboring invariant to rotation and grayscale (intensity) changes SIFT Differences of Gaussians (DoG) - invariant to rotation and image scale 1) DoG scale-space extrema regions 2) gradients keypoints description LE extract local microstructures (e.g., edges, lines, spots, flat areas) invariant to grayscale changes Gabor robustness against varying brightness, varying contrast certain amount of robustness against translation, distortion, rotation, and scaling 16 19.01.2012

Setting estimation Pose Templates Illumination Templates Input face...... - Measure distances between extracted feature vectors (Input, Template) -Take the settings of the nearest template 17 19.01.2012 Template = Average-face across all left-oriented images in gallery

Introduction Basic ideas Approach Results 18 19.01.2012

Associate-Predict Model Associate the component - Measure distances between extracted feature vectors (A, gallery images) -Take the nearest id (person) 19 19.01.2012

Appearance prediction Prediction possibility a) Appearance-prediction Settings: - right-oriented, - right flash Choose inside of associated id the image with settings of B: - frontal - left-right flash Calculate distance d A = f A f B Settings: - frontal, - left-right-flash 20 19.01.2012

Appearance prediction A A B ----------> d A = f A f B B B A ----------> d B = f B f A Final distance Simple average of both With weights α A, α B d p 1 2 ( d A d B ) d p A 1 B *( A * d A B * d B ) A e f A 1 f A' B e f B 1 f B' 21 19.01.2012

Appearance prediction Results Fusion of 12 predicted components (A 1, A 2,, A 12,) = appearance-prediction result component distances ( d p, d,..., ) 1 p d 2 p 12 linear SVM Fused decision: the same / not-thesame 22 19.01.2012

Likelihood prediction Prediction possibility b) Likelihood-prediction 23 19.01.2012

Likelihood prediction 20 ids = negative samples (20/200 = 10%) Select K number of positive ids (nearest neighbors) By associate-step instead of 1 nearest neighbor, we select K nearest neighbors (K the most similar ids) Positive sample set = K * (# images per person) +1 input-image! Or subset of this number 24 19.01.2012

Likelihood prediction Rest: negative samples We separate positive/negative with LDA: 25 19.01.2012

Likelihood prediction For each new sample B LDA tells us: P(B belongs to the positive sample set) =? d i 0,8 Likelihood distance high (Not-the-same) 26 19.01.2012

Likelihood prediction Build A-Classifier + feed new sample B Likelihood distance Build B-Classifier + feed new sample A Likelihood distance d A d B Average: d p 1 2 ( d A d B ) With weights: d p 1 *( A * d A B * d B ) A B d p < Threshold the positive sample component distances ( d p, d,..., ) 1 p d 2 p 12 linear SVM Fused decision: the same / not-the-same 27 19.01.2012

Switching mechanism Pair A,B is Comparable if Pose Illumination P A P B < 3 and L A L B < 3 Not comparable else and and P A P B = 6 L A L B = 3 28 19.01.2012

Switching mechanism Final matching distance: d sw = Yes d a, if comparable direct matching d p, if not comparable associate-predict model Switching reduces risk of inaccurate association/prediction 29 19.01.2012

Introduction Basic ideas Approach Results 30 19.01.2012

Experimental results Training set Multi-PIE: 200 persons (from CMU, over all 337 persons, >750,000 images) Test sets Multi-PIE: 49 persons mutually exclusive to training set 10 folds cross-validation Each fold has 300 intra-personal pairs, 300 extra-personal pairs LFW (Labeled Faces in the Wild, over all 5749 people, >13.000 images) Restricted protocol (fixed number of intra-personal and extra-personal pairs provided for training) 10 folds cross-validation Each fold has 300 intra-personal pairs, 300 extra-personal pairs Unrestricted protocol (random number of training pairs can be generated based the given faces labels) 31 19.01.2012

Experimental results Holistic vs. Component on Multi-PIE Direct matching always worse than other by appearance: component ~3% better by likelihhood: component ~4% better 32 19.01.2012

Experimental results Effect of positive sample number for likelyhood-prediction on Multi- PIE benchmark (LBP feature) 1 sample vs. 71 samples improvement of ~10% Each id has 28 different images For K associated ids max. 28*K +1 images For K=3 from 59 to 78 positive images 33 19.01.2012

Experimental results Improvement of Switching on Multi-PIE dataset improvement of ~2,5% on LFW dataset improvement of ~5% 34 19.01.2012

Experimental results Result on Multi-Pie benchmark Fusion (the best experimental model): 94% It means: 60% of errors were eliminated Direct LE descriptor: 84.2% 35 19.01.2012

Experimental results Result on LFW benchmark Again clear improvement Likelihood a little bit better than appearance Fusion = appearance & likelihoood fused by linear SVM 36 19.01.2012

Experimental results (LFW benchmark) Restricted protocol Unrestricted protocol - 90.57% the best experimental model 37 19.01.2012

Final remarks Advantages of Associate-Predict model Using universal identities as bridge between two images Effective use of gallery with flexible switch model Achievements Good handling of intra-personal variation (pose, illumination) Best result under restricted protocol on LFW Improvement ideas More prior knowledge better results 38 19.01.2012

The End Thanks for your attention! Questions? 39 19.01.2012

References Q. Yin, X. Tang, and J. Sun. An associate-predict model for face recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011. Z. Cao, Q. Yin, J. Sun, and X. Tang. Face recognition with Learningbased Descriptor. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010. N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar. Attribute and Simile Classifiers for Face Verification. International Conference on Computer Vision (ICCV), 2009. 40 19.01.2012

Learning-based descriptor (LE) learning-based descriptor pipeline 41 19.01.2012