Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition

Size: px
Start display at page:

Download "Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition"

Transcription

1 Emotion Recognition In The Wild Challenge and Workshop (EmotiW 2013) Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Mengyi Liu, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen Institute of Computing Technology, Chinese Academy of Sciences

2 Outline Problem Related work Our Method Experiments Conclusion 2

3 Outline Problem Related work Our Method Experiments Conclusion 3

4 Emotion recognition in the wild Challenges Large data variations head pose, illumination, partial occlusion, etc. Lack of labeled data Manual annotation is hard as spontaneous expression is ambiguous in the real world. 4

5 Outline Problem Related work Our Method Experiments Conclusion 5

6 Video-based emotion recognition Acoustic information based Time domain and frequency domain e.g. pitch, intensity, pitch contour, Low Short-time Energy Ratio (LSTER), maximum bandwidth, Vision information based Spatial space and temporal space e.g. Optical flow, 3D descriptor (LBP-TOP, HOG 3D), tracking based (AAM, CLM), probabilistic graph model (HMM, CRF), 6

7 Outline Problem Related work Our Method Experiments Conclusion 7

8 Key issue Our method How to model the emotion video clip? Motivation Alleviate the effect of mis-alignment of facial images Encode the data variations among video frames Basic idea Inspired by recent progress of image set-based face recognition [1] Treat the video clip as an image set, i.e., a collection of frames Linear subspace for video (image set) modeling 8 [1] R. Wang, H. Guo, L. S. Davis, and Q. Dai. Covariance discriminative learning: A natural and efficient approach to image set classification. CVPR, 2012.

9 Our method An overview Preprocessing Feature Designing Classification Original aligned face images Purified face images Mid-level image features Video/Image set features One-to-Rest PLS classification Video Filtering out non-face in PCA subspace Subspace learning on Grassmannian manifold Video-Audio Fusion Audio Original audio data Clip-wise audio features extracted using opensmile toolkit*[2] One-to-Rest PLS classification 9 [2] F. Eyben, M. Wollmer, and B. Schuller. Opensmile: the munich versatile and fast open-source audio feature extractor. ACM MM, 2010.

10 Preprocessing Our method Original face alignment using MoPS [3] (provided by organizer) Purification of face images Original aligned face images set: X = x 1, x 2,, x n, x i R D. PCA projection learned on X by preserving low energy: W. Mean reconstruction error of each image: MeanErr t = 1 D x t W T 2 Wx t Non-face/Badly-aligned face images tend to have large MeanErr t. 10 [3] X. Zhu, and D. Ramanan. Face detection, pose estimation, and landmark localization in the wild. CVPR, 2012.

11 The Number of Samples Preprocessing Our method The distribution of MeanErr t on training set in EmotiW Threshold The Mean Reconstruction Error * Threshold is for filtering out non-face in PCA space. 11

12 Our method Preprocessing An example of 100 samples with largest mean reconstruction error. Most are non-face images or mis-alignment results. 12

13 Our method An overview Preprocessing Feature Designing Classification Original aligned face images Purified face images Mid-level image features Video/Image set features One-to-Rest PLS classification Video Filtering out non-face in PCA subspace Subspace learning on Grassmannian manifold Video-Audio Fusion Audio Original audio data Clip-wise audio features extracted using opensmile toolkit*[2] One-to-Rest PLS classification 13

14 Feature designing Image feature [4] Our method Convolution Filters 6x6x100 Max-Pooling 3x3 Face Image 32x32 Filter Maps 27x27x100 Mid-level Feature 9x9x100 [4] M. Liu, S. Li, S. Shan, X. Chen. AU-aware Deep Networks for Facial Expression Recognition. FG,

15 Our method Feature designing Video feature Each video clip is a set of images, denoted as S i R f n i, where f is the dimension of image feature, and n i is the number of frames. The video S i can be represented as a linear subspace P i, s.t. S i S i T = P i Λ i P i T Thus all the video clips can be modeled as a collection of subspaces, which are also the points on Grassmannian manifold. 15

16 Feature designing Video feature Our method An illustration of subspaces on Grassmannian manifold Video Clip 1 Video Clip 2 Similarity M 16

17 Our method Feature designing Video feature The similarity between two points P i and P j on manifold M can be measured by a linear combination of Grassmannian kernels. Projection kernel[5]: k proj ij = P T 2 i P j F. Canonical correlation kernel[6]: k CC ij = max ap span P i max bq span P j a T p b q. Linear combination: k com ij = k proj ij + αk CC ij. The kernels of each point (i.e., each video) to all training points serve as its final feature representation for classification. [5] J. Hamm, D. Lee. Grassmann discriminant analysis: a unifying view on subspace-based learning. ICML, [6] M. Harandi, C. Sanderson, S. Shirazi, B.C. Lovell. Graph embedding discriminant analysis on Grassmannian manifolds for improved image set matching. CVPR,

18 Our method An overview Preprocessing Feature Designing Classification Original aligned face images Purified face images Mid-level image features Video/Image set features One-to-Rest PLS classification Video Filtering out non-face in PCA subspace Subspace learning on Grassmannian manifold Video-Audio Fusion Audio Original audio data Clip-wise audio features extracted using opensmile toolkit*[2] One-to-Rest PLS classification 18

19 Our method Classification Partial Least Squares (PLS) for classification [1] Maximize the covariance between observations and class labels Space X Space Y Each sample is a feature vector Each sample is a 0/1 class label Linear Projection Latent Space T X = T * P Y = T * B * C = X * B_pls T is the common latent representation [1] R. Wang, H. Guo, L. S. Davis, and Q. Dai. Covariance discriminative learning: A natural and efficient approach to image set classification. CVPR,

20 Classification 20 One-to-Rest PLS Our method Suppose there are c categories and N training samples, we train c One-to-Rest PLS classifiers to predict each class simultaneously. Effectively to handle the hard classes, e.g. Sad vs. Disgust Binarize Original training label vector Y R N 1 Binary training label matrix Y R N c Separate One-to-Rest training label vectors, y 1, y 2,, y c R N 1

21 Our method Classification One-to-Rest PLS Training and test process Training data X + One-to-Rest training label vectors y 1, y 2,, y c Test sample One-to-Rest PLS(1) One-to-Rest PLS(2) One-to-Rest PLS(3) One-to-Rest PLS(c-1) One-to-Rest PLS(c) Test result: Fit R c 1 21

22 Our method Classification Video-Audio fusion for final test output For a given test video, using the c PLS classifiers for video and audio respectively, we obtain two prediction vectors Fit video, Fit audio R c 1. We conduct a linear fusion at decision level using weighted parameter λ Fit fusion = (1 λ) Fit video +λfit audio. The category corresponding to the maximum value in Fit fusion is determined to be the recognition result. 22

23 Outline Problem Related work Our Method Experiments Conclusion 23

24 Recognition Accuracy Experiments Discussion of Parameters The fusion weights of Grassmannian kernels Train-Val Val-Train k ij com = k ij proj + αk ij CC = 2 6, 2 5 = The Value of Grassmannian Kernels Combination Weight (Alpha) 24

25 Recognition Accuracy (Video) Experiments Discussion of Parameters The dimension of One-to-Rest PLS (video) Train-Train Train-Val Val-Val Val-Train = 10 = The Dimensions of PLS 25

26 Recognition Accuracy (Audio) Experiments Discussion of Parameters The dimension of One-to-Rest PLS (audio) Train-Train Train-Val Val-Val Val-Train = 5 = The Dimensions of PLS 26

27 Recognition Accuracy Experiments Discussion of Parameters The fusion weights of video and audio modalities Train-Val Val-Train Fit fusion = (1 λ) Fit video +λfit audio = = The Value of Video-Audio Fusion Weight (Lambda) 27

28 Experiments Results comparison Performance Comparison Audio only One-to-Rest PLS Grassmannian Discriminant Analysis [6] Video only Grassmannian Kernels + One-to-Rest PLS Original data Feature-level fusion Multi-class LR One-to-Rest PLS Audio + Video Decisionlevel fusion One-to-Rest PLS Purified data Decisionlevel fusion One-to-Rest PLS Ours Baseline Val % 30.81% 32.07% 22.48% 24.24% 34.34% 35.86% Test* % % 33.01% 34.61% Val 19.95% 27.27% 22.22% Test 22.44% 22.75% 27.56% [6] M. Harandi, C. Sanderson, S. Shirazi, B.C. Lovell. Graph embedding discriminant analysis on Grassmannian manifolds for improved image set matching. CVPR,

29 Outline Problem Related work Our Method Experiments Conclusion 29

30 Conclusion Key points of the current method PCA-based data purifying to filter out mis-alignment faces Linear subspace modeling of video data variations Multiple video features fusion by Grassmannian kernels combination Multi-modality fusion at decision level of video and audio Issues to further address 30 Exploration of video temporal dynamics information More sophisticated video modeling More effective fusion at feature level

31 Thank you. Question?

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition

Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Mengyi Liu, Ruiping Wang, Zhiwu Huang, Shiguang Shan, Xilin Chen Key Lab of Intelligence Information Processing Institute

More information

Multiple Kernel Learning for Emotion Recognition in the Wild

Multiple Kernel Learning for Emotion Recognition in the Wild Multiple Kernel Learning for Emotion Recognition in the Wild Karan Sikka, Karmen Dykstra, Suchitra Sathyanarayana, Gwen Littlewort and Marian S. Bartlett Machine Perception Laboratory UCSD EmotiW Challenge,

More information

Facial Expression Analysis

Facial Expression Analysis Facial Expression Analysis Jeff Cohn Fernando De la Torre Human Sensing Laboratory Tutorial Looking @ People June 2012 Facial Expression Analysis F. De la Torre/J. Cohn Looking @ People (CVPR-12) 1 Outline

More information

DA Progress report 2 Multi-view facial expression. classification Nikolas Hesse

DA Progress report 2 Multi-view facial expression. classification Nikolas Hesse DA Progress report 2 Multi-view facial expression classification 16.12.2010 Nikolas Hesse Motivation Facial expressions (FE) play an important role in interpersonal communication FE recognition can help

More information

MULTI-POSE FACE HALLUCINATION VIA NEIGHBOR EMBEDDING FOR FACIAL COMPONENTS. Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo

MULTI-POSE FACE HALLUCINATION VIA NEIGHBOR EMBEDDING FOR FACIAL COMPONENTS. Yanghao Li, Jiaying Liu, Wenhan Yang, Zongming Guo MULTI-POSE FACE HALLUCINATION VIA NEIGHBOR EMBEDDING FOR FACIAL COMPONENTS Yanghao Li, Jiaying Liu, Wenhan Yang, Zongg Guo Institute of Computer Science and Technology, Peking University, Beijing, P.R.China,

More information

Cross-pose Facial Expression Recognition

Cross-pose Facial Expression Recognition Cross-pose Facial Expression Recognition Abstract In real world facial expression recognition (FER) applications, it is not practical for a user to enroll his/her facial expressions under different pose

More information

A Real Time Facial Expression Classification System Using Local Binary Patterns

A Real Time Facial Expression Classification System Using Local Binary Patterns A Real Time Facial Expression Classification System Using Local Binary Patterns S L Happy, Anjith George, and Aurobinda Routray Department of Electrical Engineering, IIT Kharagpur, India Abstract Facial

More information

Facial Expression Recognition with Emotion-Based Feature Fusion

Facial Expression Recognition with Emotion-Based Feature Fusion Facial Expression Recognition with Emotion-Based Feature Fusion Cigdem Turan 1, Kin-Man Lam 1, Xiangjian He 2 1 The Hong Kong Polytechnic University, Hong Kong, SAR, 2 University of Technology Sydney,

More information

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong

Deep Learning for Face Recognition. Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Deep Learning for Face Recognition Xiaogang Wang Department of Electronic Engineering, The Chinese University of Hong Kong Deep Learning Results on LFW Method Accuracy (%) # points # training images Huang

More information

Learning to Recognize Faces in Realistic Conditions

Learning to Recognize Faces in Realistic Conditions 000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050

More information

Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis

Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis Mengyi Liu 1,2, Shaoxin Li 1,2, Shiguang Shan 1, Ruiping Wang 1, Xilin Chen 1,3 1 Key Laboratory of Intelligent Information

More information

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur

IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS Kirthiga, M.E-Communication system, PREC, Thanjavur R.Kannan,Assistant professor,prec Abstract: Face Recognition is important

More information

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015

on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015 on learned visual embedding patrick pérez Allegro Workshop Inria Rhônes-Alpes 22 July 2015 Vector visual representation Fixed-size image representation High-dim (100 100,000) Generic, unsupervised: BoW,

More information

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon

Deep Learning For Video Classification. Presented by Natalie Carlebach & Gil Sharon Deep Learning For Video Classification Presented by Natalie Carlebach & Gil Sharon Overview Of Presentation Motivation Challenges of video classification Common datasets 4 different methods presented in

More information

Facial Expression Analysis

Facial Expression Analysis Facial Expression Analysis Faces are special Face perception may be the most developed visual perceptual skill in humans. Infants prefer to look at faces from shortly after birth (Morton and Johnson 1991).

More information

CS231N Section. Video Understanding 6/1/2018

CS231N Section. Video Understanding 6/1/2018 CS231N Section Video Understanding 6/1/2018 Outline Background / Motivation / History Video Datasets Models Pre-deep learning CNN + RNN 3D convolution Two-stream What we ve seen in class so far... Image

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

Face2Face Comparing faces with applications Patrick Pérez. Inria, Rennes 2 Oct. 2014

Face2Face Comparing faces with applications Patrick Pérez. Inria, Rennes 2 Oct. 2014 Face2Face Comparing faces with applications Patrick Pérez Inria, Rennes 2 Oct. 2014 Outline Metric learning for face comparison Expandable parts model and occlusions Face sets comparison Identity-based

More information

Enhanced Active Shape Models with Global Texture Constraints for Image Analysis

Enhanced Active Shape Models with Global Texture Constraints for Image Analysis Enhanced Active Shape Models with Global Texture Constraints for Image Analysis Shiguang Shan, Wen Gao, Wei Wang, Debin Zhao, Baocai Yin Institute of Computing Technology, Chinese Academy of Sciences,

More information

Appearance Manifold of Facial Expression

Appearance Manifold of Facial Expression Appearance Manifold of Facial Expression Caifeng Shan, Shaogang Gong and Peter W. McOwan Department of Computer Science Queen Mary, University of London, London E1 4NS, UK {cfshan, sgg, pmco}@dcs.qmul.ac.uk

More information

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

Recognition: Face Recognition. Linda Shapiro EE/CSE 576 Recognition: Face Recognition Linda Shapiro EE/CSE 576 1 Face recognition: once you ve detected and cropped a face, try to recognize it Detection Recognition Sally 2 Face recognition: overview Typical

More information

Image Set Classification Based on Synthetic Examples and Reverse Training

Image Set Classification Based on Synthetic Examples and Reverse Training Image Set Classification Based on Synthetic Examples and Reverse Training Qingjun Liang 1, Lin Zhang 1(&), Hongyu Li 1, and Jianwei Lu 1,2 1 School of Software Engineering, Tongji University, Shanghai,

More information

Supplementary Material for: Video Prediction with Appearance and Motion Conditions

Supplementary Material for: Video Prediction with Appearance and Motion Conditions Supplementary Material for Video Prediction with Appearance and Motion Conditions Yunseok Jang 1 2 Gunhee Kim 2 Yale Song 3 A. Architecture Details (Section 3.2) We provide architecture details of our

More information

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17 Understanding Faces Detection, Recognition, and 12/5/17 Transformation of Faces Lucas by Chuck Close Chuck Close, self portrait Some slides from Amin Sadeghi, Lana Lazebnik, Silvio Savarese, Fei-Fei Li

More information

Improving Face Recognition by Exploring Local Features with Visual Attention

Improving Face Recognition by Exploring Local Features with Visual Attention Improving Face Recognition by Exploring Local Features with Visual Attention Yichun Shi and Anil K. Jain Michigan State University Difficulties of Face Recognition Large variations in unconstrained face

More information

arxiv: v1 [cs.cv] 16 Nov 2015

arxiv: v1 [cs.cv] 16 Nov 2015 Coarse-to-fine Face Alignment with Multi-Scale Local Patch Regression Zhiao Huang hza@megvii.com Erjin Zhou zej@megvii.com Zhimin Cao czm@megvii.com arxiv:1511.04901v1 [cs.cv] 16 Nov 2015 Abstract Facial

More information

Deep Convolutional Neural Network using Triplet of Faces, Deep Ensemble, and Scorelevel Fusion for Face Recognition

Deep Convolutional Neural Network using Triplet of Faces, Deep Ensemble, and Scorelevel Fusion for Face Recognition IEEE 2017 Conference on Computer Vision and Pattern Recognition Deep Convolutional Neural Network using Triplet of Faces, Deep Ensemble, and Scorelevel Fusion for Face Recognition Bong-Nam Kang*, Yonghyun

More information

Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge

Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge Partial Face Matching between Near Infrared and Visual Images in MBGC Portal Challenge Dong Yi, Shengcai Liao, Zhen Lei, Jitao Sang, and Stan Z. Li Center for Biometrics and Security Research, Institute

More information

Sparsity Preserving Canonical Correlation Analysis

Sparsity Preserving Canonical Correlation Analysis Sparsity Preserving Canonical Correlation Analysis Chen Zu and Daoqiang Zhang Department of Computer Science and Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China {zuchen,dqzhang}@nuaa.edu.cn

More information

Recognizing people. Deva Ramanan

Recognizing people. Deva Ramanan Recognizing people Deva Ramanan The goal Why focus on people? How many person-pixels are in a video? 35% 34% Movies TV 40% YouTube Let s start our discussion with a loaded question: why is visual recognition

More information

An efficient face recognition algorithm based on multi-kernel regularization learning

An efficient face recognition algorithm based on multi-kernel regularization learning Acta Technica 61, No. 4A/2016, 75 84 c 2017 Institute of Thermomechanics CAS, v.v.i. An efficient face recognition algorithm based on multi-kernel regularization learning Bi Rongrong 1 Abstract. A novel

More information

Heat Kernel Based Local Binary Pattern for Face Representation

Heat Kernel Based Local Binary Pattern for Face Representation JOURNAL OF LATEX CLASS FILES 1 Heat Kernel Based Local Binary Pattern for Face Representation Xi Li, Weiming Hu, Zhongfei Zhang, Hanzi Wang Abstract Face classification has recently become a very hot research

More information

Feature-Aging for Age-Invariant Face Recognition

Feature-Aging for Age-Invariant Face Recognition Feature-Aging for Age-Invariant Face Recognition Huiling Zhou, Kwok-Wai Wong, and Kin-Man Lam, Centre for Signal Processing, Department of Electronic and Information Engineering, The Hong Kong Polytechnic

More information

Dynamic Facial Expression Recognition Using A Bayesian Temporal Manifold Model

Dynamic Facial Expression Recognition Using A Bayesian Temporal Manifold Model Dynamic Facial Expression Recognition Using A Bayesian Temporal Manifold Model Caifeng Shan, Shaogang Gong, and Peter W. McOwan Department of Computer Science Queen Mary University of London Mile End Road,

More information

HUMAN S FACIAL PARTS EXTRACTION TO RECOGNIZE FACIAL EXPRESSION

HUMAN S FACIAL PARTS EXTRACTION TO RECOGNIZE FACIAL EXPRESSION HUMAN S FACIAL PARTS EXTRACTION TO RECOGNIZE FACIAL EXPRESSION Dipankar Das Department of Information and Communication Engineering, University of Rajshahi, Rajshahi-6205, Bangladesh ABSTRACT Real-time

More information

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling

Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling [DOI: 10.2197/ipsjtcva.7.99] Express Paper Cost-alleviative Learning for Deep Convolutional Neural Network-based Facial Part Labeling Takayoshi Yamashita 1,a) Takaya Nakamura 1 Hiroshi Fukui 1,b) Yuji

More information

Learning based face hallucination techniques: A survey

Learning based face hallucination techniques: A survey Vol. 3 (2014-15) pp. 37-45. : A survey Premitha Premnath K Department of Computer Science & Engineering Vidya Academy of Science & Technology Thrissur - 680501, Kerala, India (email: premithakpnath@gmail.com)

More information

Boosting Coded Dynamic Features for Facial Action Units and Facial Expression Recognition

Boosting Coded Dynamic Features for Facial Action Units and Facial Expression Recognition Boosting Coded Dynamic Features for Facial Action Units and Facial Expression Recognition Peng Yang Qingshan Liu,2 Dimitris N. Metaxas Computer Science Department, Rutgers University Frelinghuysen Road,

More information

WHO MISSED THE CLASS? - UNIFYING MULTI-FACE DETECTION, TRACKING AND RECOGNITION IN VIDEOS. Yunxiang Mao, Haohan Li, Zhaozheng Yin

WHO MISSED THE CLASS? - UNIFYING MULTI-FACE DETECTION, TRACKING AND RECOGNITION IN VIDEOS. Yunxiang Mao, Haohan Li, Zhaozheng Yin WHO MISSED THE CLASS? - UNIFYING MULTI-FACE DETECTION, TRACKING AND RECOGNITION IN VIDEOS Yunxiang Mao, Haohan Li, Zhaozheng Yin Department of Computer Science Missouri University of Science and Technology,

More information

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Exploratory data analysis tasks Examine the data, in search of structures

More information

Facial Expression Classification with Random Filters Feature Extraction

Facial Expression Classification with Random Filters Feature Extraction Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle

More information

Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features. Wei-Ta Chu

Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features. Wei-Ta Chu 1 Video Aesthetic Quality Assessment by Temporal Integration of Photo- and Motion-Based Features Wei-Ta Chu H.-H. Yeh, C.-Y. Yang, M.-S. Lee, and C.-S. Chen, Video Aesthetic Quality Assessment by Temporal

More information

Convolutional Neural Network for Facial Expression Recognition

Convolutional Neural Network for Facial Expression Recognition Convolutional Neural Network for Facial Expression Recognition Liyuan Zheng Department of Electrical Engineering University of Washington liyuanz8@uw.edu Shifeng Zhu Department of Electrical Engineering

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

An Algorithm based on SURF and LBP approach for Facial Expression Recognition

An Algorithm based on SURF and LBP approach for Facial Expression Recognition ISSN: 2454-2377, An Algorithm based on SURF and LBP approach for Facial Expression Recognition Neha Sahu 1*, Chhavi Sharma 2, Hitesh Yadav 3 1 Assistant Professor, CSE/IT, The North Cap University, Gurgaon,

More information

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS

LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS LEARNING TO GENERATE CHAIRS WITH CONVOLUTIONAL NEURAL NETWORKS Alexey Dosovitskiy, Jost Tobias Springenberg and Thomas Brox University of Freiburg Presented by: Shreyansh Daftry Visual Learning and Recognition

More information

Action recognition in videos

Action recognition in videos Action recognition in videos Cordelia Schmid INRIA Grenoble Joint work with V. Ferrari, A. Gaidon, Z. Harchaoui, A. Klaeser, A. Prest, H. Wang Action recognition - goal Short actions, i.e. drinking, sit

More information

How to Generate Keys from Facial Images and Keep your Privacy at the Same Time

How to Generate Keys from Facial Images and Keep your Privacy at the Same Time How to Generate Keys from Facial Images and Keep your Privacy at the Same Time Orr Dunkelman, University of Haifa Joint work with Mahmood Sharif and Margarita Osadchy Overview Motivation Background: The

More information

IJCAI Dept. of Information Engineering

IJCAI Dept. of Information Engineering IJCAI 2007 Wei Liu,Xiaoou Tang, and JianzhuangLiu Dept. of Information Engineering TheChinese University of Hong Kong Outline What is sketch-based facial photo hallucination Related Works Our Approach

More information

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012

An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012 An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012, 19.01.2012 INSTITUTE FOR ANTHROPOMATICS, FACIAL IMAGE PROCESSING AND ANALYSIS YIG University of the State of Baden-Wuerttemberg

More information

Structured Models in. Dan Huttenlocher. June 2010

Structured Models in. Dan Huttenlocher. June 2010 Structured Models in Computer Vision i Dan Huttenlocher June 2010 Structured Models Problems where output variables are mutually dependent or constrained E.g., spatial or temporal relations Such dependencies

More information

Outline 7/2/201011/6/

Outline 7/2/201011/6/ Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern

More information

Color Local Texture Features Based Face Recognition

Color Local Texture Features Based Face Recognition Color Local Texture Features Based Face Recognition Priyanka V. Bankar Department of Electronics and Communication Engineering SKN Sinhgad College of Engineering, Korti, Pandharpur, Maharashtra, India

More information

RSRN: Rich Side-output Residual Network for Medial Axis Detection

RSRN: Rich Side-output Residual Network for Medial Axis Detection RSRN: Rich Side-output Residual Network for Medial Axis Detection Chang Liu, Wei Ke, Jianbin Jiao, and Qixiang Ye University of Chinese Academy of Sciences, Beijing, China {liuchang615, kewei11}@mails.ucas.ac.cn,

More information

Deep Learning for Virtual Shopping. Dr. Jürgen Sturm Group Leader RGB-D

Deep Learning for Virtual Shopping. Dr. Jürgen Sturm Group Leader RGB-D Deep Learning for Virtual Shopping Dr. Jürgen Sturm Group Leader RGB-D metaio GmbH Augmented Reality with the Metaio SDK: IKEA Catalogue App Metaio: Augmented Reality Metaio SDK for ios, Android and Windows

More information

Face Detection and Recognition in an Image Sequence using Eigenedginess

Face Detection and Recognition in an Image Sequence using Eigenedginess Face Detection and Recognition in an Image Sequence using Eigenedginess B S Venkatesh, S Palanivel and B Yegnanarayana Department of Computer Science and Engineering. Indian Institute of Technology, Madras

More information

Image Processing Pipeline for Facial Expression Recognition under Variable Lighting

Image Processing Pipeline for Facial Expression Recognition under Variable Lighting Image Processing Pipeline for Facial Expression Recognition under Variable Lighting Ralph Ma, Amr Mohamed ralphma@stanford.edu, amr1@stanford.edu Abstract Much research has been done in the field of automated

More information

Locating Facial Landmarks Using Probabilistic Random Forest

Locating Facial Landmarks Using Probabilistic Random Forest 2324 IEEE SIGNAL PROCESSING LETTERS, VOL. 22, NO. 12, DECEMBER 2015 Locating Facial Landmarks Using Probabilistic Random Forest Changwei Luo, Zengfu Wang, Shaobiao Wang, Juyong Zhang, and Jun Yu Abstract

More information

Intensity-Depth Face Alignment Using Cascade Shape Regression

Intensity-Depth Face Alignment Using Cascade Shape Regression Intensity-Depth Face Alignment Using Cascade Shape Regression Yang Cao 1 and Bao-Liang Lu 1,2 1 Center for Brain-like Computing and Machine Intelligence Department of Computer Science and Engineering Shanghai

More information

Deep Fusion: An Attention Guided Factorized Bilinear Pooling for Audio-video Emotion Recognition

Deep Fusion: An Attention Guided Factorized Bilinear Pooling for Audio-video Emotion Recognition Deep Fusion: An Attention Guided Factorized Bilinear Pooling for Audio-video Emotion Recognition Yuanyuan Zhang, Zi-Rui Wang, Jun Du National Engineering Laboratory for Speech and Language Information

More information

Generic Face Alignment Using an Improved Active Shape Model

Generic Face Alignment Using an Improved Active Shape Model Generic Face Alignment Using an Improved Active Shape Model Liting Wang, Xiaoqing Ding, Chi Fang Electronic Engineering Department, Tsinghua University, Beijing, China {wanglt, dxq, fangchi} @ocrserv.ee.tsinghua.edu.cn

More information

Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis

Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis Mengyi Liu 1,2, Shaoxin Li 1,2, Shiguang Shan 1(B), Ruiping Wang 1, and Xilin Chen 1,3 1 Key Laboratory of Intelligent

More information

Structured Face Hallucination

Structured Face Hallucination 2013 IEEE Conference on Computer Vision and Pattern Recognition Structured Face Hallucination Chih-Yuan Yang Sifei Liu Ming-Hsuan Yang Electrical Engineering and Computer Science University of California

More information

Human pose estimation using Active Shape Models

Human pose estimation using Active Shape Models Human pose estimation using Active Shape Models Changhyuk Jang and Keechul Jung Abstract Human pose estimation can be executed using Active Shape Models. The existing techniques for applying to human-body

More information

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach

Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Vandit Gajjar gajjar.vandit.381@ldce.ac.in Ayesha Gurnani gurnani.ayesha.52@ldce.ac.in Yash Khandhediya khandhediya.yash.364@ldce.ac.in

More information

Misalignment-Robust Face Recognition

Misalignment-Robust Face Recognition Misalignment-Robust Face Recognition Huan Wang 1 Shuicheng Yan 2 Thomas Huang 3 Jianzhuang Liu 1 Xiaoou Tang 1,4 1 IE, Chinese University 2 ECE, National University 3 ECE, University of Illinois 4 Microsoft

More information

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling

Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling Enhance ASMs Based on AdaBoost-Based Salient Landmarks Localization and Confidence-Constraint Shape Modeling Zhiheng Niu 1, Shiguang Shan 2, Xilin Chen 1,2, Bingpeng Ma 2,3, and Wen Gao 1,2,3 1 Department

More information

Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling

Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling Reconstructive Sparse Code Transfer for Contour Detection and Semantic Labeling Michael Maire 1,2 Stella X. Yu 3 Pietro Perona 2 1 TTI Chicago 2 California Institute of Technology 3 University of California

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks

Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks Face Image Quality Assessment for Face Selection in Surveillance Video using Convolutional Neural Networks Vignesh Sankar, K. V. S. N. L. Manasa Priya, Sumohana Channappayya Indian Institute of Technology

More information

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601

Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network. Nathan Sun CIS601 Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601 Introduction Face ID is complicated by alterations to an individual s appearance Beard,

More information

SLIDING WINDOW BASED MICRO-EXPRESSION SPOTTING: A BENCHMARK

SLIDING WINDOW BASED MICRO-EXPRESSION SPOTTING: A BENCHMARK SLIDING WINDOW BASED MICRO-EXPRESSION SPOTTING: A BENCHMARK Thuong-Khanh Tran, Xiaopeng Hong, and Guoying Zhao Center for Machine Vision and Signal Analysis, University of Oulu, Finland, fkhanh.tran, xiaopeng.hong,

More information

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014

Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Is 2D Information Enough For Viewpoint Estimation? Amir Ghodrati, Marco Pedersoli, Tinne Tuytelaars BMVC 2014 Problem Definition Viewpoint estimation: Given an image, predicting viewpoint for object of

More information

An Adaptive Threshold LBP Algorithm for Face Recognition

An Adaptive Threshold LBP Algorithm for Face Recognition An Adaptive Threshold LBP Algorithm for Face Recognition Xiaoping Jiang 1, Chuyu Guo 1,*, Hua Zhang 1, and Chenghua Li 1 1 College of Electronics and Information Engineering, Hubei Key Laboratory of Intelligent

More information

CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS

CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 38 CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 3.1 PRINCIPAL COMPONENT ANALYSIS (PCA) 3.1.1 Introduction In the previous chapter, a brief literature review on conventional

More information

Facial Expression Recognition with PCA and LBP Features Extracting from Active Facial Patches

Facial Expression Recognition with PCA and LBP Features Extracting from Active Facial Patches Facial Expression Recognition with PCA and LBP Features Extracting from Active Facial Patches Yanpeng Liu a, Yuwen Cao a, Yibin Li a, Ming Liu, Rui Song a Yafang Wang, Zhigang Xu, Xin Ma a Abstract Facial

More information

Tracking. Hao Guan( 管皓 ) School of Computer Science Fudan University

Tracking. Hao Guan( 管皓 ) School of Computer Science Fudan University Tracking Hao Guan( 管皓 ) School of Computer Science Fudan University 2014-09-29 Multimedia Video Audio Use your eyes Video Tracking Use your ears Audio Tracking Tracking Video Tracking Definition Given

More information

Motion Estimation and Optical Flow Tracking

Motion Estimation and Optical Flow Tracking Image Matching Image Retrieval Object Recognition Motion Estimation and Optical Flow Tracking Example: Mosiacing (Panorama) M. Brown and D. G. Lowe. Recognising Panoramas. ICCV 2003 Example 3D Reconstruction

More information

Dynamic facial expression recognition using a behavioural model

Dynamic facial expression recognition using a behavioural model Dynamic facial expression recognition using a behavioural model Thomas Robin Michel Bierlaire Javier Cruz STRC 2009 10th september The context Recent interest for emotion recognition in transportation

More information

arxiv: v3 [cs.cv] 1 Apr 2015

arxiv: v3 [cs.cv] 1 Apr 2015 Representation Learning with Deep Extreme Learning Machines for Efficient Image Set Classification Muhammad Uzair 1, Faisal Shafait 1, Bernard Ghanem 2 and Ajmal Mian 1 1 Computer Science & Software Engineering,

More information

Large-scale Video Classification with Convolutional Neural Networks

Large-scale Video Classification with Convolutional Neural Networks Large-scale Video Classification with Convolutional Neural Networks Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei Note: Slide content mostly from : Bay Area

More information

arxiv: v1 [cs.cv] 6 Jul 2016

arxiv: v1 [cs.cv] 6 Jul 2016 arxiv:607.079v [cs.cv] 6 Jul 206 Deep CORAL: Correlation Alignment for Deep Domain Adaptation Baochen Sun and Kate Saenko University of Massachusetts Lowell, Boston University Abstract. Deep neural networks

More information

Leveraging Textural Features for Recognizing Actions in Low Quality Videos

Leveraging Textural Features for Recognizing Actions in Low Quality Videos Leveraging Textural Features for Recognizing Actions in Low Quality Videos Saimunur Rahman, John See, Chiung Ching Ho Centre of Visual Computing, Faculty of Computing and Informatics Multimedia University,

More information

Lecture 7: Semantic Segmentation

Lecture 7: Semantic Segmentation Semantic Segmentation CSED703R: Deep Learning for Visual Recognition (207F) Segmenting images based on its semantic notion Lecture 7: Semantic Segmentation Bohyung Han Computer Vision Lab. bhhanpostech.ac.kr

More information

Action Recognition in Video by Sparse Representation on Covariance Manifolds of Silhouette Tunnels

Action Recognition in Video by Sparse Representation on Covariance Manifolds of Silhouette Tunnels Action Recognition in Video by Sparse Representation on Covariance Manifolds of Silhouette Tunnels Kai Guo, Prakash Ishwar, and Janusz Konrad Department of Electrical & Computer Engineering Motivation

More information

Illumination Normalization in Face Recognition Using DCT and Supporting Vector Machine (SVM)

Illumination Normalization in Face Recognition Using DCT and Supporting Vector Machine (SVM) Illumination Normalization in Face Recognition Using DCT and Supporting Vector Machine (SVM) 1 Yun-Wen Wang ( 王詠文 ), 2 Wen-Yu Wang ( 王文昱 ), 2 Chiou-Shann Fuh ( 傅楸善 ) 1 Graduate Institute of Electronics

More information

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science.

Colorado School of Mines. Computer Vision. Professor William Hoff Dept of Electrical Engineering &Computer Science. Professor William Hoff Dept of Electrical Engineering &Computer Science http://inside.mines.edu/~whoff/ 1 Statistical Models for Shape and Appearance Note some material for these slides came from Algorithms

More information

3D Human Motion Analysis and Manifolds

3D Human Motion Analysis and Manifolds D E P A R T M E N T O F C O M P U T E R S C I E N C E U N I V E R S I T Y O F C O P E N H A G E N 3D Human Motion Analysis and Manifolds Kim Steenstrup Pedersen DIKU Image group and E-Science center Motivation

More information

END-TO-END CHINESE TEXT RECOGNITION

END-TO-END CHINESE TEXT RECOGNITION END-TO-END CHINESE TEXT RECOGNITION Jie Hu 1, Tszhang Guo 1, Ji Cao 2, Changshui Zhang 1 1 Department of Automation, Tsinghua University 2 Beijing SinoVoice Technology November 15, 2017 Presentation at

More information

COMPRESSED FACE HALLUCINATION. Electrical Engineering and Computer Science University of California, Merced, CA 95344, USA

COMPRESSED FACE HALLUCINATION. Electrical Engineering and Computer Science University of California, Merced, CA 95344, USA COMPRESSED FACE HALLUCNATON Sifei Liu Ming-Hsuan Yang Electrical Engineering and Computer Science University of California, Merced, CA 95344, USA ABSTRACT n this paper, we propose an algorithm to hallucinate

More information

LOCAL APPEARANCE BASED FACE RECOGNITION USING DISCRETE COSINE TRANSFORM

LOCAL APPEARANCE BASED FACE RECOGNITION USING DISCRETE COSINE TRANSFORM LOCAL APPEARANCE BASED FACE RECOGNITION USING DISCRETE COSINE TRANSFORM Hazim Kemal Ekenel, Rainer Stiefelhagen Interactive Systems Labs, University of Karlsruhe Am Fasanengarten 5, 76131, Karlsruhe, Germany

More information

Human detection using local shape and nonredundant

Human detection using local shape and nonredundant University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2010 Human detection using local shape and nonredundant binary patterns

More information

Learning Expressionlets on Spatio-Temporal Manifold for Dynamic Facial Expression Recognition

Learning Expressionlets on Spatio-Temporal Manifold for Dynamic Facial Expression Recognition 2014 IEEE Conference on Computer Vision and Pattern Recognition Learning Expressionlets on Spatio-Temporal Manifold for Dynamic Facial Expression Recognition Mengyi Liu 1,2, Shiguang Shan 1, Ruiping Wang

More information

Thermal to Visible Face Recognition

Thermal to Visible Face Recognition Thermal to Visible Face Recognition Jonghyun Choi, Shuowen Hu, S. Susan Young and Larry S. Davis University of Maryland, College Park, MD U.S. Army Research Laboratory, Adelphi, MD ABSTRACT In low light

More information

COMBINING SPEEDED-UP ROBUST FEATURES WITH PRINCIPAL COMPONENT ANALYSIS IN FACE RECOGNITION SYSTEM

COMBINING SPEEDED-UP ROBUST FEATURES WITH PRINCIPAL COMPONENT ANALYSIS IN FACE RECOGNITION SYSTEM International Journal of Innovative Computing, Information and Control ICIC International c 2012 ISSN 1349-4198 Volume 8, Number 12, December 2012 pp. 8545 8556 COMBINING SPEEDED-UP ROBUST FEATURES WITH

More information

International Journal of Modern Engineering and Research Technology

International Journal of Modern Engineering and Research Technology Volume 4, Issue 3, July 2017 ISSN: 2348-8565 (Online) International Journal of Modern Engineering and Research Technology Website: http://www.ijmert.org Email: editor.ijmert@gmail.com A Novel Approach

More information

Simultaneous Feature and Sample Reduction for Image-Set Classification

Simultaneous Feature and Sample Reduction for Image-Set Classification Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) Simultaneous Feature and Sample Reduction for Image-Set Classification Man Zhang, Ran He*, Dong Cao, Zhenan Sun, Tieniu

More information

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

COMP 551 Applied Machine Learning Lecture 16: Deep Learning COMP 551 Applied Machine Learning Lecture 16: Deep Learning Instructor: Ryan Lowe (ryan.lowe@cs.mcgill.ca) Slides mostly by: Class web page: www.cs.mcgill.ca/~hvanho2/comp551 Unless otherwise noted, all

More information

Face Alignment Under Various Poses and Expressions

Face Alignment Under Various Poses and Expressions Face Alignment Under Various Poses and Expressions Shengjun Xin and Haizhou Ai Computer Science and Technology Department, Tsinghua University, Beijing 100084, China ahz@mail.tsinghua.edu.cn Abstract.

More information