Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks

Similar documents
Traffic Sign Localization and Classification Methods: An Overview

Skin and Face Detection

Face and Nose Detection in Digital Images using Local Binary Patterns

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Object Detection Design challenges

Study of Viola-Jones Real Time Face Detector

Image Processing. Image Features

Detecting and Reading Text in Natural Scenes

Object Detection. Part1. Presenter: Dae-Yong


Describable Visual Attributes for Face Verification and Image Search

Window based detectors

Recap Image Classification with Bags of Local Features

COMBINING NEURAL NETWORKS FOR SKIN DETECTION

LBP Based Facial Expression Recognition Using k-nn Classifier

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Rich feature hierarchies for accurate object detection and semantic segmentation

Classification of Protein Crystallization Imagery

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Face/Flesh Detection and Face Recognition

Dimensionality Reduction, including by Feature Selection.

A HYBRID APPROACH BASED ON PCA AND LBP FOR FACIAL EXPRESSION ANALYSIS

Beyond Correlation Filters: Learning Continuous Convolution Operators for Visual Tracking

Face Quality Assessment System in Video Sequences

Large-scale Video Classification with Convolutional Neural Networks

Face recognition algorithms: performance evaluation

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong

Decorrelated Local Binary Pattern for Robust Face Recognition

Semi-Supervised PCA-based Face Recognition Using Self-Training

Learning to Recognize Faces in Realistic Conditions

THE performance of automatic face recognition systems

Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach

Detection of a Single Hand Shape in the Foreground of Still Images

Robust Face Recognition via Sparse Representation Authors: John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar Sastry, and Yi Ma

Partial Face Recognition

Face detection and recognition. Detection Recognition Sally

Finger Print Enhancement Using Minutiae Based Algorithm

Distance-Based Descriptors and Their Application in the Task of Object Detection

An ICA based Approach for Complex Color Scene Text Binarization

Face Detection and Alignment. Prof. Xin Yang HUST

Object detection using Region Proposals (RCNN) Ernest Cheung COMP Presentation

Progress Report of Final Year Project

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Texture. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors. Frequency Descriptors

LOCAL FEATURE EXTRACTION METHODS FOR FACIAL EXPRESSION RECOGNITION

Face Detection and Recognition in an Image Sequence using Eigenedginess

Object Category Detection: Sliding Windows

Visual features detection based on deep neural network in autonomous driving tasks

Using Machine Learning for Classification of Cancer Cells

Digital Image Processing

Graph Matching Iris Image Blocks with Local Binary Pattern

Convolution Neural Networks for Chinese Handwriting Recognition

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

CHAPTER 6 RESULTS AND DISCUSSIONS

Informative Census Transform for Ver Resolution Image Representation. Author(s)Jeong, Sungmoon; Lee, Hosun; Chong,

Learning Visual Semantics: Models, Massive Computation, and Innovative Applications

arxiv: v1 [cs.cv] 26 Mar 2013

Fast, Accurate Detection of 100,000 Object Classes on a Single Machine

Vignette: Reimagining the Analog Photo Album

[2008] IEEE. Reprinted, with permission, from [Yan Chen, Qiang Wu, Xiangjian He, Wenjing Jia,Tom Hintz, A Modified Mahalanobis Distance for Human

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Criminal Identification System Using Face Detection and Recognition

Multiple-Person Tracking by Detection

Previously. Window-based models for generic object detection 4/11/2011

Three Embedded Methods

Available online at ScienceDirect. Procedia Computer Science 54 (2015 )

Computationally Efficient Serial Combination of Rotation-invariant and Rotation Compensating Iris Recognition Algorithms

INTELLIGENT transportation systems have a significant

Mobile Human Detection Systems based on Sliding Windows Approach-A Review

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Object Category Detection: Sliding Windows

FACE DETECTION AND RECOGNITION OF DRAWN CHARACTERS HERMAN CHAU

NOWADAYS, there are many human jobs that can. Face Recognition Performance in Facing Pose Variation

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

Object detection using non-redundant local Binary Patterns

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

Lecture 4 Face Detection and Classification. Lin ZHANG, PhD School of Software Engineering Tongji University Spring 2018

Morphable Displacement Field Based Image Matching for Face Recognition across Pose

A Deep Learning Framework for Authorship Classification of Paintings

An Exploration of Computer Vision Techniques for Bird Species Classification

Recent Researches in Automatic Control, Systems Science and Communications

Schedule for Rest of Semester

An Adaptive Threshold LBP Algorithm for Face Recognition

Automatic Fatigue Detection System

A Real-Time License Plate Localization Method Based on Vertical Edge Analysis

Histograms of Oriented Gradients for Human Detection p. 1/1

Learning to Detect Faces. A Large-Scale Application of Machine Learning

FACE DETECTION BY HAAR CASCADE CLASSIFIER WITH SIMPLE AND COMPLEX BACKGROUNDS IMAGES USING OPENCV IMPLEMENTATION

DA Progress report 2 Multi-view facial expression. classification Nikolas Hesse

Structured Models in. Dan Huttenlocher. June 2010

A Real Time Facial Expression Classification System Using Local Binary Patterns

Encoder-Decoder Networks for Semantic Segmentation. Sachin Mehta

Deformable Part Models

A Laplacian Based Novel Approach to Efficient Text Localization in Grayscale Images

Pedestrian Detection and Tracking in Images and Videos

Hierarchical Ensemble of Gabor Fisher Classifier for Face Recognition

PHOG:Photometric and geometric functions for textured shape retrieval. Presentation by Eivind Kvissel

Three-Dimensional Face Recognition: A Fishersurface Approach

Transcription:

Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks Vignesh Sankar, K. V. S. N. L. Manasa Priya, Sumohana Channappayya Indian Institute of Technology Hyderabad

Literature Review Berrani et al - Statistical approaches Wong et al - Standard face Chen et al. - Multiple feature fusion and learning to rank

FACE RECOGNITION SYSTEM

Face Detection and Localization Viola Jones Haar feature based cascade classifier. Localization was done and resized to 64x64 pixels.

Face Feature Extraction Feature vectors are formed from the high quality face images given by subset selection. Two Feature Extraction techniques, Local binary patterns (LBP) Histogram of Oriented Gradients (HOG) LBP makes use of both shape and texture information. HOG makes use of the distribution of intensity gradient or edge directions to describe a image. These two form a reasonable subset to prove that the developed FQA algorithm works across the different FR algorithm

Face Matching Mutual Subspace Method (MSM) is used for image set matching. For each video sequence in the gallery set, feature vectors are calculated and compared with feature vectors of probe sequence using MSM. The two image sets are considered similar if the canonical angle between two image sets is within the threshold. This threshold is calculated from minimum error rate.

Face Subset Selection Different Face Recognition (FR) algorithms have different advantages. Fixed definition for determining the quality of face image doesn t take the full advantage of the given FR algorithm.

Face Subset Selection Goal is to select the faces images that perform best in the second module and are considered as high quality face images. Quality of the face image is defined with respect to the FR algorithm. Convolutional Neural Networks (CNN) is used to model the performance of the FR algorithm.

Why CNN?? CNN accepts the entire 2D image as input so there is no need of using explicit feature extraction. This is useful in our case where the definition of quality of face image is not fixed. CNN learns its parameters and defines the quality of the face image depending on the FR algorithm.

EXPERIMENTAL SETUP FOR FQA

EXPERIMENTAL SETUP FOR FQA ChokePoint dataset which is ideally suited for face recognition/ verification in the surveillance scenario is used. The dataset consists of 25 subjects (19 male and 6 female) with 64,204 face images We divide the images into training and testing sets. Set 1 contains image sequences of 13 subjects for training the CNN Set 2 contains the rest of the images sequences to evaluate the performance of the FR algorithm. PCA whitening is done to reduce the redundancy and correlation in the input face image.

Training Face image with the corresponding quality score (MSM score) is used to train the network. S = 1 N N n= 1 f ( I, W ) n Q n 2 2 ' W = arg min S where I n be the pre processed input face image, Q n be the quality/msm score, W is the weight matrix of the network, f(i n,w) be the predicted score of the input face image W

Testing Pre processed face images in the probe sequence is given as the input to the trained CNN. Quality scores are predicted for each face image in the sequence and sorted to get Top N high quality images.

Experimental Setup OF FR Testing set is divided into G1 and G2 where each dataset plays the role of development and evaluation sets respectively. By considering one group as development set (labeled set), we calculated matched and mismatched scores. Threshold is where the sum of False Acceptance Rate (FAR) and False Rejection Rate (FRR) is minimum i.e., Minimum Error Rate. Applying this threshold on the scores of pairs of evaluation sets, recognition rate (R G2 ) is calculated as follows R G 2 [( 1 FAR) + ( FRR) ] = 0.5 1 Then roles of G1 and G2 are changed to calculate R G1 Final Recognition rate is the average of RG1 and RG2. avg ( R R ) R = + 0.5 G1 G2

Results Video-based face verification performance on the ChokePoint dateset, using LBP and MSM Video-based face verification performance on the ChokePoint dateset, using HOG and MSM Subset Selection Method N = 4 N = 8 N = 16 Sequential 0.6114 0.6174 0.6278 Random 0.6825 0.6910 0.7040 Probabilistic Based 0.6995 0.7181 0.7252 Rank Based 0.7328 0.7511 0.7645 Proposed Method 0.7226 0.7564 0.7786 Subset Selection Method N = 4 N = 8 N = 16 Sequential 0.6419 0.6504 0.6669 Random 0.7329 0.7552 0.7706 Probabilistic Based 0.7603 0.7753 0.7876 Rank Based 0.7843 0.7870 0.7857 Proposed Method 0.7589 0.7775 0.7917

Conclusion From the results, we can infer that high verification performance is achieved by the proposed method which in turn implies that it is able to select the best subset of faces from the sequence of faces. From the initial results, we strongly believe that the proposed algorithm is promising and has attractive features. As part of future work, we plan to improve the performance of the algorithm by fine-tuning the parameters.