DeepFace: Closing the Gap to Human-Level Performance in Face Verification

Similar documents
Face Recognition A Deep Learning Approach

Deep Learning for Vision

Deep Face Recognition. Nathan Sun

FaceNet. Florian Schroff, Dmitry Kalenichenko, James Philbin Google Inc. Presentation by Ignacio Aranguren and Rahul Rana

DeepFace: Closing the Gap to Human-Level Performance in Face Verification

The Analysis of Faces in Brains and Machines

Robust Face Recognition Based on Convolutional Neural Network

Clustering Lightened Deep Representation for Large Scale Face Identification

A Patch Strategy for Deep Face Recognition

Deep Learning and Its Applications

An Exploration of Computer Vision Techniques for Bird Species Classification

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong

Keras: Handwritten Digit Recognition using MNIST Dataset

Efficient Deep Learning Optimization Methods

Index. Springer Nature Switzerland AG 2019 B. Moons et al., Embedded Deep Learning,

CS 179 Lecture 16. Logistic Regression & Parallel SGD

Supplementary Material: Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

SHIV SHAKTI International Journal in Multidisciplinary and Academic Research (SSIJMAR) Vol. 7, No. 2, April 2018 (ISSN )

arxiv: v3 [cs.cv] 24 Apr 2017

Deep Learning. Volker Tresp Summer 2015

Deep Learning. Visualizing and Understanding Convolutional Networks. Christopher Funk. Pennsylvania State University.

Deep Convolutional Neural Network using Triplet of Faces, Deep Ensemble, and Scorelevel Fusion for Face Recognition

End-to-End Localization and Ranking for Relative Attributes

Neural Network Optimization and Tuning / Spring 2018 / Recitation 3

Deep Learning for Computer Vision II

Face Recognition by Deep Learning - The Imbalance Problem

Bidirectional Recurrent Convolutional Networks for Video Super-Resolution

DEEP LEARNING IN PYTHON. The need for optimization

CENG 783. Special topics in. Deep Learning. AlchemyAPI. Week 11. Sinan Kalkan

Parallel Stochastic Gradient Descent

Facial Expression Classification with Random Filters Feature Extraction

DECISION TREES & RANDOM FORESTS X CONVOLUTIONAL NEURAL NETWORKS

A Deep Learning Approach to Vehicle Speed Estimation

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

Implementing Deep Learning for Video Analytics on Tegra X1.

Report: Privacy-Preserving Classification on Deep Neural Network

Deep Learning with Tensorflow AlexNet

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition

CS230: Lecture 3 Various Deep Learning Topics

REGRESSION ANALYSIS : LINEAR BY MAUAJAMA FIRDAUS & TULIKA SAHA

arxiv: v1 [cs.cv] 7 Dec 2015

A performance comparison of Deep Learning frameworks on KNL

Deep neural networks II

Perceptron: This is convolution!

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012

Machine Learning. MGS Lecture 3: Deep Learning

Training Deep Neural Networks (in parallel)

Face2Face Comparing faces with applications Patrick Pérez. Inria, Rennes 2 Oct. 2014

Hybrid Deep Learning for Face Verification. Yi Sun, Xiaogang Wang, Member, IEEE, and Xiaoou Tang, Fellow, IEEE

MoonRiver: Deep Neural Network in C++

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

DL Tutorial. Xudong Cao

arxiv: v3 [cs.cv] 18 Jan 2017

arxiv: v4 [cs.cv] 14 Nov 2016

Facial Key Points Detection using Deep Convolutional Neural Network - NaimishNet

CS489/698: Intro to ML

Parallel Deep Network Training

FACE verification in unconstrained settings is a challenging

Deep Neural Networks Optimization

Deep Learning. Volker Tresp Summer 2017

Final Report: Classification of Plankton Classes By Tae Ho Kim and Saaid Haseeb Arshad

Study of Residual Networks for Image Recognition

Machine Learning With Python. Bin Chen Nov. 7, 2017 Research Computing Center

Deep Learning Cook Book

Lecture 2 Notes. Outline. Neural Networks. The Big Idea. Architecture. Instructors: Parth Shah, Riju Pahwa

Unsupervised Learning of Spatiotemporally Coherent Metrics

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

This Talk. 1) Node embeddings. Map nodes to low-dimensional embeddings. 2) Graph neural networks. Deep learning architectures for graphstructured

Dynamic Routing Between Capsules

Gradient Descent Optimization Algorithms for Deep Learning Batch gradient descent Stochastic gradient descent Mini-batch gradient descent

Deep Learning. Vladimir Golkov Technical University of Munich Computer Vision Group

A new approach for supervised power disaggregation by using a deep recurrent LSTM network

arxiv: v1 [cs.cv] 31 Mar 2017

Improving Face Recognition by Exploring Local Features with Visual Attention

Deep Learning for Embedded Security Evaluation

Deep Fisher Faces. 1 Introduction. Harald Hanselmann

A Comparison of Sequence-Trained Deep Neural Networks and Recurrent Neural Networks Optical Modeling For Handwriting Recognition

ART 알고리즘특강자료 ( 응용 01)

COMP9444 Neural Networks and Deep Learning 7. Image Processing. COMP9444 c Alan Blair, 2017

arxiv: v1 [cs.cv] 15 Nov 2018

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

arxiv: v1 [cs.cv] 3 Mar 2018

CPSC 340: Machine Learning and Data Mining. Deep Learning Fall 2016

Vignette: Reimagining the Analog Photo Album

Conflict Graphs for Parallel Stochastic Gradient Descent

Parallel Stochastic Gradient Descent: The case for native GPU-side GPI

Matrix Computations and " Neural Networks in Spark

Neural Bag-of-Features Learning

CPSC 340: Machine Learning and Data Mining. Multi-Dimensional Scaling Fall 2017

Linear Regression Optimization

Learning to Recognize Faces in Realistic Conditions

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

FACIAL POINT DETECTION BASED ON A CONVOLUTIONAL NEURAL NETWORK WITH OPTIMAL MINI-BATCH PROCEDURE. Chubu University 1200, Matsumoto-cho, Kasugai, AICHI

Neural networks. About. Linear function approximation. Spyros Samothrakis Research Fellow, IADS University of Essex.

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. By Joa õ Carreira and Andrew Zisserman Presenter: Zhisheng Huang 03/02/2018

BAYESIAN GLOBAL OPTIMIZATION

Deep Indian Delicacy: Classification of Indian Food Images using Convolutional Neural Networks

arxiv: v1 [cs.cv] 15 Jun 2018

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

Transcription:

DeepFace: Closing the Gap to Human-Level Performance in Face Verification Report on the paper Artem Komarichev February 7, 2016

Outline New alignment technique New DNN architecture New large dataset with labeled face images Experiments and Results on benchmarks dataset LFW and YTF

Introduction The goal of face verification is to define either two face images belong to the same person or not. There exist other face recognition tasks such as: face identification, face clustering and face detection. In their paper they are considering problem in unconstrained environment.

Benchmark datasets 1. LFW (Labeled Faces in the Wild) consists of 13,323 web photos of famous people. 5,749 unique identities. 6,000 face pairs. 2. YTF (Youtube Face Database) contains 3,425 youtube videos of 1,595 unique identities. They splitted all this videos on 5,000 video pairs.

New collected dataset Their DeepFace model was trained on a new huge dataset extracted from Facebook photos, so called Social Face Classification (SFC) dataset. SFC dataset contains 4.4M labeled face images belonging to 4030 unique identities. 5% of this dataset was used for testing purposes.

Face alignment Alignment is still considered to be difficult problem to solve especially in the unconstrained environment. In the recent few years 3D modeling was used extensively. Frontalization procedure are illustrated below:

Face alignment (example from paper)

DeepFace Architecture The big picture of DeepFace model is represented below:

DeepFace Architecture (DNN from paper)

DeepFace Architecture As loss function was chosen cross-entropy loss: L = N log(p i ) (1) i where p i is a prediction that this image belongs to class i, and N is a number of classes. To update parameters was used standard BackPropagation algorithm and Stochastic Gradient Descent (SGD). The size of mini-batch was equal to 128. Non-linear activation function: rectifier linear units - max(0, x).

DeepFace Architecture The number of parameters was close to 120M, where 95% was from LC and FC layers. Sparse architecture: 75% of gradients were equal to zero in the last five layers. Sparsity was encorouged in the last time (dropout, maxout). Training process took three days to run model on the whole SFC dataset for 15 epochs. Learning rate was decreased during the training procedure up to 0.0001 starting from 0.01.

Verification Metrics Unsupervised: inner product of feature vectors Siammese network: d(f 1, f 2 ) = N i w i f (i) 1 f (i) 2, where N is a number of learned features, f 1 and f 2 - feature vectors for first face image and for second one, and w i - trainable weights. Chi squared distance: χ 2 (f 1, f 2 ) = N i (f w (i) (i) 1 f 2 )2 i (f (i) 1 f (i) 2 )

Experiments and Results Models Datasets LFW Youtube Faces DB FaceNet 99.63%±0.15 95.12%±0.39 DeepFace 97.35%±0.25 91.4%±1.1 Parkhi s approach 98.65% 97.3% DeepID2...... DeepID2+...... DeepID3............... Table: State-of-the-art in face verification.

Experiments and Results (size of training dataset) They provided several experiments with the size of the training dataset, different number of unique identities. When whole 4K identities was used they got the highest accuracy. It is not suprising fact, because the more diverse our training dataset, the more better features we can learn. The origianl SFC was reduced up to 10% with the same 4K people showed twice less accuracy than 50% dataset.

Experiments and Results (different architectures) Comparing different reduced architectures: without second conovolutional layer without first two locally connected layers without three these layers with the original one, they showed that the deep of the architecture is a critical issue. Deeper DNN can learn better feature representations. Therefore, deeper architectures shows higher final accuracy than shallower ones does.

Conclusion Main contributions: They achieved the new state-of-the-art on the LFW and YTF datasets by the time when the paper was published They proposed a new alignment techniques based on 3D modeling of the face images in the unconstrained settings They came up with a new DNN architecture which was trained on their own huge dataset