Subspace Indexing on Grassmann Manifold for Large Scale Visual Recognition
|
|
- Francis Atkins
- 5 years ago
- Views:
Transcription
1 Subspace Indexing on Grassmann Manifold for Large Scale Visual Recognition Zhu Li Univ of Missouri, Kansas City p. 1
2 Outline Short Intro Research Motivation and Background Current Research Work Subspace Indexing on Grassmann Manifold for Large Visual Recognition Problem Summary p. 2
3 About Myself Zhu Li Associate Professor Director, MC2 Lab Short Bio: PhD, ECE, Northwestern University, Research Experiences: Research Interests: Visual Computing Media Computing & Communication Lab Object Re-ID, Image Tagging, Point Cloud/Light Field Compression Visual Communication Video compression, QoE, new media transport Networking and Mobile Edge Computing algorithm & computing image understanding visual communication mobile edge computing & communication p. 3
4 Research Motivation 2016 p. 4
5 Media Computing & Communication in 2016 Devices Networks mmwave 5G 4k/8k UHD Video Free Viewpoint TV FD-MIMO D2D Immersive Visual Communication SDN/MEC Navigation & Object Re-Identification p. 5
6 Mobile Visual Search : Object Re-ID Mobile Query-by-Capture Object Recognition Pipeline KD: Keypoint Detection (Box Filtering ) Recall FS: Feature Selection (Affine transformed 2 way-matching) GD/LD: global and local descriptor generation (AKULA, Collision Optimized Hash) GD KD FS LD Descriptor Extraction Global Descriptor bitstream Local Descriptors Descriptor Encoding Precision p. 6
7 Next Gen Immersive Visual Communication Platform HMD Tracking (6 DoF) Track movement: dx, dy, dz Track angular movement RF+Visual based tracking Stereo Camera LED Gesture Tracking (27 DoF) 27 DoF, very complex Infra structured light + stereo camera HMD Eye Gaze Tracking (2 DoF) Some how easier, only 2DoF Multi-infra source illumination Purkenje image based tracking p. 7
8 Point Cloud Capture & Compression Free Viewpoint TV from 3d Point Cloud Capture VR applications: photorealistic objects/human insertion in games Free View Point: replay technology Compression Challenges Plenoptic signal extremely large volume Communication & storage need compression at x100 ratio MPEG working group on PCC Octree decomposition of the space, arithmetic coding of DFS octree nodes Our work: plane projection approximation, Graph Signal Processing Dynamic point cloud compression p. 8
9 Compact Knowledge Distilling for Embedded Devices Automatic Image Tagging: Mapping image pixel values to tags Device based recognition: Distilling the knowledge into a compact form Compact CNN model for knowledge distilling BIGDATA training set: >10k images per tag training set augmentation:» affine transforms, simulated lighting changes Pruning filters from the CNN structure p. 9
10 Robust Content Identification & De-Duplication Rate Agnostic Robust Video Fingerprinting Robust and scalable video fingerprint for duplicate and near duplicate search and mining applications (Funded by Microsoft Research grant, HK RGC GRF grant, Samsung DMC) Playback Verification: Differential feature for playback verification (ads enforcement, broadcast monitoring), 200 bps, very lightweight signature. Multi-Source Synchronization: synch multi-camera video and audio sources, content based approach Content Identification: content naming service for ICN, cache manifest and de-duplication. p. 10
11 Subspace Indexing on Grassmann Manifold for Large Scale Visual Recognition p. 11
12 Appearance Modeling Finding a good f() Find a good f() Such that after projecting the appearance onto the subspace, the data points belong to different classes are easily separable This kind of good function is usually non-linear, hard to obtain from training data. p. 12
13 A Good Linear Model: f(x) = Ax LDA (R. A. Fisher 1936/P. Belhumeu 1997) : - Maximizing inter-class scatter over intra-class scatter - Intra and Inter scatter - Solve by a generalized Eigen problem:.. a1 a2 a3 a4 a5 LDA/Fisherface Basis p. 13
14 State of Art in Linear Models: Graph Embedding Projection LPP (X-. F. He and P. Niyogi, circa 2004): - Minimizing weighted distance (a graph) after projection -Solve by: - Embed a graph (as specified by wj,k) with pruned edges, via a projection A. - The training data is reflected by the graph p. 14
15 Common Graph Embedding Interpretation Both LPP and LDA are graph embedding linear projections: LDA:» preserve the affinity matrix that has zero affinity for data points pairs that are not belonging to the same class LPP:» Have more flexibility in modeling affinity wjk. LPP Affinity Graph LDA Affinity Graph p. 15
16 Non-Linear Models Non-Linear Solutions and challenges: Kernel method: e.g K-PCA, K-LDA, K-LPP, SVM» Evaluate inner product <xj,xk> with a kernel function k(xj, xk), which if satisfy the conditions in Mercer s Theorem, implicitly maps data via a non-linear function.» Typically involves a QP problem with a Hessian of size n x n, when n is large, not solvable. LLE /Graph Laplacian:» An algorithm that maps input data {xk} to {yk} that tries to preserve an embedded graph structure among data points.» The mapping is data dependent and has difficulty handling new data outside the training set, e.g., a new query point How to compromise? Piece-wise Linear Approximation p. 16
17 Piece-wise Linear : Query Driven Query-Driven Piece-wise Linear Model No pre-determined structure on the training data Local neighborhood data patch identified from query point q, Local model built with local data, A(X, q) Local Graph Model: X A(X,q) q + Y=A(X,q)X Local data: N(X, q) + p. 17
18 DoF Criteria for Data Driven Local Model What is a good local data size N(X, q)? The DoF of a linear model: DoF(A) = w x h x d Example: DoF(A) = 20x24x5=2400 Discriminant Information to be captured: LDA: a graph with edges pruned for intra-class points LPP; pruned graph Information function n non-zeros as number of edges/relationship among data points n What is a good compromise of data complexity and model power? p. 18
19 Discriminant Power Co-efficient (DPC) DPC is computed as ratio of DoF and Information function of the local model overfitting Model validation accuracy as functions of DPC p. 19
20 Head Pose Recognition Performance Recognition rate is improved: DoF: w=18, h=18, d=16, 32 Local Data Size: K=30 And the cost in computation is rather modest Matlab code, online local model A(X,q) learning and NN classification: p. 20
21 Face Recognition Performance Local model combination in face recognition Query point drives 3 local models, A1(X, q), A2(X, q), A3(X, q) Local model classification error estimation, Combining the results weighted voting Multiple face models with different area and scale: (a) Upper face model (18 16). (b) Lower face model (14 18). (c) Full face model (21 28). ORL data set test: leave 2 out: p. 21
22 Query Driven Piece-Wise Linear Summary Solid gains over the state of art global linear models Allows for piece wise local linear approximation of the global non-linearity Reduced data point size n makes the kernel solution more computationally tractable. Computational/Storage Complexity Data driven: need to compute a run-time model that adds computational complexity Storage Penalty: need extra storage to store all training data, because the local NN data patch is generated at run time, as function of the query point. Indexing challenge: for very large scale problem, not practical to store all training data, and effective indexing needed to support efficient access of query induced local training data. Can we have a more compact way of distilling this intelligence? p. 22
23 Closer Examination of the DoF Stiefel manifold All possible p othonormal basis functions in d-dimensional space, Apxd, spans Stiefel Manifold, S(p, d) in Rdxp, d > p. It reequires othonormal relationship among basis vectors, so the DoF is not pd What is missing? a1 a2 p. 23
24 Model DoF on Grassmannian Manifolds Grassmann manifold G(p, d) identifies p-dimensional subspaces in d-dimensional space, it consists of p othonormal bases in d-diemnsional space (Stiefel Manifold) and an equivalence constraint on the rotation of the basis functions: G(p, d) is the quotient space of S(p, d)/o(p), i.e., A1=A2, The DoF of subspaces on Grassmann manifold a1 a2 p. 24
25 Visualizing Subspace Models DoF Consider a typical appearance modeling Image size 12x10 pixel, appearance space dimension d=120, model dimension p=8. 3D visualization of all S(8, 120) and their covariance eigenvalues Grassmann Manifolds are quotient space S(8, 120)/O(8) p. 25
26 Principal Angles The principal angles between two subspaces: For A1, and A2 in G(p, d), their principal angles are defined as Where, {uk} and {vk} are called principal dimensions for span(a1) and span(a2). p. 26
27 Principal Angles Computing The principle angles between two subspaces: Rotating A1, and A2 in G(p, d), such that they are maximally aligned solving by SVD: Where, U=[u1, u2,, up], and V=[v1, v2,, vp] are the principle angles. The diagonal of S, [s1, s2,..., sp] are the cosine of principal angles, and principal angles are computed as, p. 27
28 Subspace Metrics on Grassmann Manifold Subspace distance metrics: Projection Distance Def: Computing: Binet-Cauchy Distance Def: Computing: p. 28
29 Subspace Distance on Grassmannian Manifold Subspace distances Arc Distance Def: Also known as geodesic distance. It traverse the Grassmannian surface, and two subspace collapse into one, when all principal angles becomes zero. * A2 A1* p. 29
30 Combination of two subspaces on Grassmann Manifold What if we need merge two subspaces? Motivation:» say if subspace A1 is best for data set S1, and subspace A2 is best for data set S2, can we find a subspace A3 that is good for both? When two subspaces are sufficiently close on Grassmannian manifold, we can approximate this by, A3=[t1, t2,.] tk = n1 n2 uk + vk n1 + n2 n1 + n2 Where n1,2 are the size of data set S1,2 A1* + + * A2 The new sets of basis may not be orthogonal. Can be corrected by Gram-Schmidt orthogonalization. p. 30
31 Building Local Models from BIGDATA Training Set The Plan: Partition the (unlabeled) large training data set into local data patches Compute local models for each data patch with labels, and then optimize a subset for the recognition Data Space Partition Partition the training data set by kd-tree, for the kd-tree height of h, we have 2 h local data patch as leaf node For each leaf node data patch k, build a local LDA/LPP model Ak: p. 31
32 Subspace Indexing Subspace Clustering by Grassmann Metric: It is a VQ like process. Start with a data partition kd-tree, their leaf nodes and associated subspaces {A k}, k=1..2h Repeat» Find Ai and Aj, if darc(ai, Aj) is the smallest among all, and the associated data patch are adjacent in the data space.» Delete Ai and Aj, replace with merged new subspace, and update associated data patch leaf nodes set.» Compute the empirical identification accuracy for the merged subspace» Add parent pointer to the merged new subspace for Ai and Aj.» Stop if only 1 subspace left. Benefit:» avoid forced merging of subspace models at data patches that are very different, though adjacent. p. 32
33 MHT Based Identification MHT operation Organize the leaf nodes models into a new hierarchy, with new models and associated accuracy (error rate) estimation When a probe point comes, first identify its leaf nodes from the data partition tree. Then traverse the MHT from leaf nodes up, until it hits the root, which is the global model, and choose the best model along the path for identification p. 33
34 Simulation The data set MSRA Multimedia data set, 65k images with class and relevance labels: p. 34
35 Simulation Data selection and features Selected 12 classes with 11k images and use the original combined 889d features from color, shape and texture Performance compared with PCA, LDA and LPP modeling p. 35
36 Simulation Face data set Mixed data set of 242 individuals, and 4840 face images Performance compared with PCA, LDA and LPP modeling p. 36
37 Summary Contributions Address the DoF issues of the BIGDATA recognition problem. Piece-wise linear approach is effective in modeling non-linearity in visual manifolds for a variety of recognition problem. Subspace indexing on Grassmann manifold offers a systemic approach in optimizing the linear models for the localized problem. Solid performance gains over the state of art global linear models and their kernelized non-linear models. p. 37
38 Publications & Future Work Future work Grassmann Hashing Penalize projection selection with Grassmannian metric, offers performance gains over LSH and spectral hashing. Grassmann Local Descriptor Aggregation (MPEG CDVS) significantly improved the indexing efficiency when work with Fisher Vector. Publications: Z. Li, A. Nagar, G. Srivastava, and K. Park, CDVS: GRASF - Grassmann Subspace Feature for Key Point Aggregation, ISO/IEC SC29/WG11/MPEG2014/m32536, San Jose, USA, X. Wang, Z. Li, and D. Tao, "Subspace Indexing on Grassmann Manifold for Image Search", IEEE Trans. on Image Processing, vol. 20(9), X. Wang, Z. Li, L. Zhang, and J. Yuan, "Grassmann Hashing for Approx Nearest Neighbour Search in High Dimensional Space", Proc. of IEEE Int'l Conf on Multimedia & Expo (ICME), Barcelona, Spain, H. Xu, J. Wang, Z. Li, G. Zeng, S. Li, Complementary Hashing for Approximate Nearest Neighbor Search, IEEE Int'l Conference on Computer Vision (ICCV), Barcelona, Spain, Yun Fu, Z. Li, J. Yuan, Ying Wu, and Thomas S. Huang, "Locality vs. Globality: Query-Driven Localized Linear Models for Facial Image Computing," IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), vol. 18(12), pp , December, p. 38
39 Acknowledgment Grants: the work is partially supported by; a Hong Kong RGC Grant, and Microsoft Research Asia faculty grant. My Student:» Xinchao Wang, valedictorian of Dept of COMP, HK Polytechnic University, class 2010, completed PhD at EPFL (Pascal Fua), now at UIUC IFP. p. 39
40 Q&A Thanks! p. 40
Image Analysis & Retrieval. CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W Lec 16
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 16 Subspace/Transform Optimization Zhu Li Dept of CSEE, UMKC Office: FH560E, Email:
More informationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Recognition
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Recognition Zhu Li Media Networking Lab FutureWei (Huawei) Technology, USA Bridgewater, NJ 1 Outline Bio Research motivations and projects
More informationImage Analysis & Retrieval. CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W Lec 18.
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 18 Image Hashing Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph:
More informationAbout Me. Bio: Research Interests:
About Me Bio: Sr. Staff Researcher, Core Networks R&D, Huawei Tech USA, 2010.10~ to date Asst Prof, HK Polytechnic Univ, CTO, Mudi Tech, 2008.04~2010.09 Senior, Senior Staff, and then Principal Staff Researcher,
More informationImage Analysis & Retrieval
Outline CS/EE 5590 / ENG 401 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Image Analysis & Retrieval Background Objective of the class Prerequisite Lecture Plan Course Project
More informationAKULA Adaptive Cluster Aggregation for Visual Search
AKULA Adaptive Cluster Aggregation for Visual Search Abhishek Nagar *, Zhu Li *, Gaurav Srivastava *, and Kyungmo Park + * Samsung Research America 1301 E. Lookout Dr. Richardson, TX 75082, USA {a.nagar,
More informationDimension Reduction CS534
Dimension Reduction CS534 Why dimension reduction? High dimensionality large number of features E.g., documents represented by thousands of words, millions of bigrams Images represented by thousands of
More informationImage Analysis & Retrieval Lec-01: Introduction
CS/EE 5590 / ENG 401 Special Topics Spring 2017, M/W 5:30-6:45pm@Haag 309 Image Analysis & Retrieval Lec-01: Introduction Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph: x 2346. http://l.web.umkc.edu/lizhu
More informationIEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 9, SEPTEMBER
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 9, SEPTEMBER 2011 2627 Subspaces Indexing Model on Grassmann Manifold for Image Search Xinchao Wang, Zhu Li, Senior Member, IEEE, and Dacheng Tao, Member,
More informationImage Analysis & Retrieval
CS/EE 5590 / ENG 401 Special Topics, Spring 2018 Image Analysis & Retrieval Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph: x 2346. http://l.web.umkc.edu/lizhu Z. Li: Image Analysis
More informationFace Recognition using Laplacianfaces
Journal homepage: www.mjret.in ISSN:2348-6953 Kunal kawale Face Recognition using Laplacianfaces Chinmay Gadgil Mohanish Khunte Ajinkya Bhuruk Prof. Ranjana M.Kedar Abstract Security of a system is an
More informationRecognition: Face Recognition. Linda Shapiro EE/CSE 576
Recognition: Face Recognition Linda Shapiro EE/CSE 576 1 Face recognition: once you ve detected and cropped a face, try to recognize it Detection Recognition Sally 2 Face recognition: overview Typical
More informationImage Analysis & Retrieval. CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W Lec 13
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 13 Dimension Reduction: SVD and PCA Zhu Li Dept of CSEE, UMKC Office: FH560E, Email:
More informationThe Analysis of Parameters t and k of LPP on Several Famous Face Databases
The Analysis of Parameters t and k of LPP on Several Famous Face Databases Sujing Wang, Na Zhang, Mingfang Sun, and Chunguang Zhou College of Computer Science and Technology, Jilin University, Changchun
More informationLearning to Recognize Faces in Realistic Conditions
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
More informationImage Analysis & Retrieval Lec 10 - Classification II
CS/EE 5590 / ENG 401 Special Topics, Spring 2018 Image Analysis & Retrieval Lec 10 - Classification II Zhu Li Dept of CSEE, UMKC http://l.web.umkc.edu/lizhu Office Hour: Tue/Thr 2:30-4pm@FH560E, Contact:
More informationLocal Features Tutorial: Nov. 8, 04
Local Features Tutorial: Nov. 8, 04 Local Features Tutorial References: Matlab SIFT tutorial (from course webpage) Lowe, David G. Distinctive Image Features from Scale Invariant Features, International
More informationEmotion Classification
Emotion Classification Shai Savir 038052395 Gil Sadeh 026511469 1. Abstract Automated facial expression recognition has received increased attention over the past two decades. Facial expressions convey
More informationOutline 7/2/201011/6/
Outline Pattern recognition in computer vision Background on the development of SIFT SIFT algorithm and some of its variations Computational considerations (SURF) Potential improvement Summary 01 2 Pattern
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationSCALE INVARIANT FEATURE TRANSFORM (SIFT)
1 SCALE INVARIANT FEATURE TRANSFORM (SIFT) OUTLINE SIFT Background SIFT Extraction Application in Content Based Image Search Conclusion 2 SIFT BACKGROUND Scale-invariant feature transform SIFT: to detect
More informationGeneralized trace ratio optimization and applications
Generalized trace ratio optimization and applications Mohammed Bellalij, Saïd Hanafi, Rita Macedo and Raca Todosijevic University of Valenciennes, France PGMO Days, 2-4 October 2013 ENSTA ParisTech PGMO
More informationFrom Pixels to Information Recent Advances in Visual Search
From Pixels to Information Recent Advances in Visual Search Bernd Girod Stanford University bgirod@stanford.edu Augmented Reality 3 Augmented Reality 2014 2012 2015 4 Future: Smart Contact Lenses Sight:
More informationLarge-scale visual recognition The bag-of-words representation
Large-scale visual recognition The bag-of-words representation Florent Perronnin, XRCE Hervé Jégou, INRIA CVPR tutorial June 16, 2012 Outline Bag-of-words Large or small vocabularies? Extensions for instance-level
More informationLarge-Scale Face Manifold Learning
Large-Scale Face Manifold Learning Sanjiv Kumar Google Research New York, NY * Joint work with A. Talwalkar, H. Rowley and M. Mohri 1 Face Manifold Learning 50 x 50 pixel faces R 2500 50 x 50 pixel random
More informationUnsupervised Learning
Unsupervised Learning Learning without Class Labels (or correct outputs) Density Estimation Learn P(X) given training data for X Clustering Partition data into clusters Dimensionality Reduction Discover
More informationLearning based face hallucination techniques: A survey
Vol. 3 (2014-15) pp. 37-45. : A survey Premitha Premnath K Department of Computer Science & Engineering Vidya Academy of Science & Technology Thrissur - 680501, Kerala, India (email: premithakpnath@gmail.com)
More informationImage-Based Face Recognition using Global Features
Image-Based Face Recognition using Global Features Xiaoyin xu Research Centre for Integrated Microsystems Electrical and Computer Engineering University of Windsor Supervisors: Dr. Ahmadi May 13, 2005
More informationA Study on Different Challenges in Facial Recognition Methods
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.521
More informationScale Invariant Feature Transform
Scale Invariant Feature Transform Why do we care about matching features? Camera calibration Stereo Tracking/SFM Image moiaicing Object/activity Recognition Objection representation and recognition Image
More informationApplications Video Surveillance (On-line or off-line)
Face Face Recognition: Dimensionality Reduction Biometrics CSE 190-a Lecture 12 CSE190a Fall 06 CSE190a Fall 06 Face Recognition Face is the most common biometric used by humans Applications range from
More informationRobotics Programming Laboratory
Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car
More informationLearning a Manifold as an Atlas Supplementary Material
Learning a Manifold as an Atlas Supplementary Material Nikolaos Pitelis Chris Russell School of EECS, Queen Mary, University of London [nikolaos.pitelis,chrisr,lourdes]@eecs.qmul.ac.uk Lourdes Agapito
More informationA Survey on Feature Extraction Techniques for Palmprint Identification
International Journal Of Computational Engineering Research (ijceronline.com) Vol. 03 Issue. 12 A Survey on Feature Extraction Techniques for Palmprint Identification Sincy John 1, Kumudha Raimond 2 1
More informationTechnical Report. Title: Manifold learning and Random Projections for multi-view object recognition
Technical Report Title: Manifold learning and Random Projections for multi-view object recognition Authors: Grigorios Tsagkatakis 1 and Andreas Savakis 2 1 Center for Imaging Science, Rochester Institute
More informationScale Invariant Feature Transform
Why do we care about matching features? Scale Invariant Feature Transform Camera calibration Stereo Tracking/SFM Image moiaicing Object/activity Recognition Objection representation and recognition Automatic
More informationHUMAN-CENTERED COMPUTING (HCC), a recently
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 18, NO. 12, DECEMBER 2008 1741 Locality Versus Globality: Query-Driven Localized Linear Models for Facial Image Computing Yun Fu, Member,
More informationSchool of Computer and Communication, Lanzhou University of Technology, Gansu, Lanzhou,730050,P.R. China
Send Orders for Reprints to reprints@benthamscienceae The Open Automation and Control Systems Journal, 2015, 7, 253-258 253 Open Access An Adaptive Neighborhood Choosing of the Local Sensitive Discriminant
More informationLec 08 Feature Aggregation II: Fisher Vector, Super Vector and AKULA
Image Analysis & Retrieval CS/EE 5590 Special Topics (Class Ids: 44873, 44874) Fall 2016, M/W 4-5:15pm@Bloch 0012 Lec 08 Feature Aggregation II: Fisher Vector, Super Vector and AKULA Zhu Li Dept of CSEE,
More informationCS 229 Midterm Review
CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask
More informationLocality Preserving Projections (LPP) Abstract
Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL
More informationFACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS
FACE RECOGNITION FROM A SINGLE SAMPLE USING RLOG FILTER AND MANIFOLD ANALYSIS Jaya Susan Edith. S 1 and A.Usha Ruby 2 1 Department of Computer Science and Engineering,CSI College of Engineering, 2 Research
More informationThe Curse of Dimensionality
The Curse of Dimensionality ACAS 2002 p1/66 Curse of Dimensionality The basic idea of the curse of dimensionality is that high dimensional data is difficult to work with for several reasons: Adding more
More informationParallel Architecture & Programing Models for Face Recognition
Parallel Architecture & Programing Models for Face Recognition Submitted by Sagar Kukreja Computer Engineering Department Rochester Institute of Technology Agenda Introduction to face recognition Feature
More informationSIFT Descriptor Extraction on the GPU for Large-Scale Video Analysis. Hannes Fassold, Jakub Rosner
SIFT Descriptor Extraction on the GPU for Large-Scale Video Analysis Hannes Fassold, Jakub Rosner 2014-03-26 2 Overview GPU-activities @ AVM research group SIFT descriptor extraction Algorithm GPU implementation
More informationLinear Discriminant Analysis for 3D Face Recognition System
Linear Discriminant Analysis for 3D Face Recognition System 3.1 Introduction Face recognition and verification have been at the top of the research agenda of the computer vision community in recent times.
More informationRendering and Modeling of Transparent Objects. Minglun Gong Dept. of CS, Memorial Univ.
Rendering and Modeling of Transparent Objects Minglun Gong Dept. of CS, Memorial Univ. Capture transparent object appearance Using frequency based environmental matting Reduce number of input images needed
More informationHeat Kernel Based Local Binary Pattern for Face Representation
JOURNAL OF LATEX CLASS FILES 1 Heat Kernel Based Local Binary Pattern for Face Representation Xi Li, Weiming Hu, Zhongfei Zhang, Hanzi Wang Abstract Face classification has recently become a very hot research
More informationCSE 6242 A / CX 4242 DVA. March 6, Dimension Reduction. Guest Lecturer: Jaegul Choo
CSE 6242 A / CX 4242 DVA March 6, 2014 Dimension Reduction Guest Lecturer: Jaegul Choo Data is Too Big To Analyze! Limited memory size! Data may not be fitted to the memory of your machine! Slow computation!
More informationGlobally and Locally Consistent Unsupervised Projection
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Globally and Locally Consistent Unsupervised Projection Hua Wang, Feiping Nie, Heng Huang Department of Electrical Engineering
More informationA REVIEW ON IMAGE RETRIEVAL USING HYPERGRAPH
A REVIEW ON IMAGE RETRIEVAL USING HYPERGRAPH Sandhya V. Kawale Prof. Dr. S. M. Kamalapur M.E. Student Associate Professor Deparment of Computer Engineering, Deparment of Computer Engineering, K. K. Wagh
More informationAnnouncements. Recognition I. Gradient Space (p,q) What is the reflectance map?
Announcements I HW 3 due 12 noon, tomorrow. HW 4 to be posted soon recognition Lecture plan recognition for next two lectures, then video and motion. Introduction to Computer Vision CSE 152 Lecture 17
More informationVisuelle Perzeption für Mensch- Maschine Schnittstellen
Visuelle Perzeption für Mensch- Maschine Schnittstellen Vorlesung, WS 2009 Prof. Dr. Rainer Stiefelhagen Dr. Edgar Seemann Institut für Anthropomatik Universität Karlsruhe (TH) http://cvhci.ira.uka.de
More informationA Fast Personal Palm print Authentication based on 3D-Multi Wavelet Transformation
A Fast Personal Palm print Authentication based on 3D-Multi Wavelet Transformation * A. H. M. Al-Helali, * W. A. Mahmmoud, and * H. A. Ali * Al- Isra Private University Email: adnan_hadi@yahoo.com Abstract:
More informationNon-linear dimension reduction
Sta306b May 23, 2011 Dimension Reduction: 1 Non-linear dimension reduction ISOMAP: Tenenbaum, de Silva & Langford (2000) Local linear embedding: Roweis & Saul (2000) Local MDS: Chen (2006) all three methods
More informationContent-based Image and Video Retrieval. Image Segmentation
Content-based Image and Video Retrieval Vorlesung, SS 2011 Image Segmentation 2.5.2011 / 9.5.2011 Image Segmentation One of the key problem in computer vision Identification of homogenous region in the
More informationManifold Constrained Deep Neural Networks for ASR
1 Manifold Constrained Deep Neural Networks for ASR Department of Electrical and Computer Engineering, McGill University Richard Rose and Vikrant Tomar Motivation Speech features can be characterized as
More informationCHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS
38 CHAPTER 3 PRINCIPAL COMPONENT ANALYSIS AND FISHER LINEAR DISCRIMINANT ANALYSIS 3.1 PRINCIPAL COMPONENT ANALYSIS (PCA) 3.1.1 Introduction In the previous chapter, a brief literature review on conventional
More informationFacial Expression Recognition using Principal Component Analysis with Singular Value Decomposition
ISSN: 2321-7782 (Online) Volume 1, Issue 6, November 2013 International Journal of Advance Research in Computer Science and Management Studies Research Paper Available online at: www.ijarcsms.com Facial
More informationECE 484 Digital Image Processing Lec 17 - Part II Review & Final Projects Topics
ECE 484 Digital Image Processing Lec 17 - Part II Review & Final Projects opics Zhu Li Dept of CSEE, UMKC Office: FH560E, Email: lizhu@umkc.edu, Ph: x 2346. http://l.web.umkc.edu/lizhu slides created with
More informationFacial Expression Recognition Using Non-negative Matrix Factorization
Facial Expression Recognition Using Non-negative Matrix Factorization Symeon Nikitidis, Anastasios Tefas and Ioannis Pitas Artificial Intelligence & Information Analysis Lab Department of Informatics Aristotle,
More informationAn Integrated Face Recognition Algorithm Based on Wavelet Subspace
, pp.20-25 http://dx.doi.org/0.4257/astl.204.48.20 An Integrated Face Recognition Algorithm Based on Wavelet Subspace Wenhui Li, Ning Ma, Zhiyan Wang College of computer science and technology, Jilin University,
More informationGeometric Registration for Deformable Shapes 3.3 Advanced Global Matching
Geometric Registration for Deformable Shapes 3.3 Advanced Global Matching Correlated Correspondences [ASP*04] A Complete Registration System [HAW*08] In this session Advanced Global Matching Some practical
More informationLocal features and image matching. Prof. Xin Yang HUST
Local features and image matching Prof. Xin Yang HUST Last time RANSAC for robust geometric transformation estimation Translation, Affine, Homography Image warping Given a 2D transformation T and a source
More informationPartial Least Squares Regression on Grassmannian Manifold for Emotion Recognition
Emotion Recognition In The Wild Challenge and Workshop (EmotiW 2013) Partial Least Squares Regression on Grassmannian Manifold for Emotion Recognition Mengyi Liu, Ruiping Wang, Zhiwu Huang, Shiguang Shan,
More informationIMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS. Kirthiga, M.E-Communication system, PREC, Thanjavur
IMPROVED FACE RECOGNITION USING ICP TECHNIQUES INCAMERA SURVEILLANCE SYSTEMS Kirthiga, M.E-Communication system, PREC, Thanjavur R.Kannan,Assistant professor,prec Abstract: Face Recognition is important
More informationRecognition, SVD, and PCA
Recognition, SVD, and PCA Recognition Suppose you want to find a face in an image One possibility: look for something that looks sort of like a face (oval, dark band near top, dark band near bottom) Another
More informationLarge scale object/scene recognition
Large scale object/scene recognition Image dataset: > 1 million images query Image search system ranked image list Each image described by approximately 2000 descriptors 2 10 9 descriptors to index! Database
More informationFACE RECOGNITION USING SUPPORT VECTOR MACHINES
FACE RECOGNITION USING SUPPORT VECTOR MACHINES Ashwin Swaminathan ashwins@umd.edu ENEE633: Statistical and Neural Pattern Recognition Instructor : Prof. Rama Chellappa Project 2, Part (b) 1. INTRODUCTION
More informationLocality Preserving Projections (LPP) Abstract
Locality Preserving Projections (LPP) Xiaofei He Partha Niyogi Computer Science Department Computer Science Department The University of Chicago The University of Chicago Chicago, IL 60615 Chicago, IL
More informationDistance-driven Fusion of Gait and Face for Human Identification in Video
X. Geng, L. Wang, M. Li, Q. Wu, K. Smith-Miles, Distance-Driven Fusion of Gait and Face for Human Identification in Video, Proceedings of Image and Vision Computing New Zealand 2007, pp. 19 24, Hamilton,
More informationCSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo
CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo CSE 6242 A / CS 4803 DVA Feb 12, 2013 Dimension Reduction Guest Lecturer: Jaegul Choo Data is Too Big To Do Something..
More informationA New Multi Fractal Dimension Method for Face Recognition with Fewer Features under Expression Variations
A New Multi Fractal Dimension Method for Face Recognition with Fewer Features under Expression Variations Maksud Ahamad Assistant Professor, Computer Science & Engineering Department, Ideal Institute of
More informationSVM-KNN : Discriminative Nearest Neighbor Classification for Visual Category Recognition
SVM-KNN : Discriminative Nearest Neighbor Classification for Visual Category Recognition Hao Zhang, Alexander Berg, Michael Maire Jitendra Malik EECS, UC Berkeley Presented by Adam Bickett Objective Visual
More informationRobust Face Recognition Using Enhanced Local Binary Pattern
Bulletin of Electrical Engineering and Informatics Vol. 7, No. 1, March 2018, pp. 96~101 ISSN: 2302-9285, DOI: 10.11591/eei.v7i1.761 96 Robust Face Recognition Using Enhanced Local Binary Pattern Srinivasa
More informationFace Recognition Using Wavelet Based Kernel Locally Discriminating Projection
Face Recognition Using Wavelet Based Kernel Locally Discriminating Projection Venkatrama Phani Kumar S 1, KVK Kishore 2 and K Hemantha Kumar 3 Abstract Locality Preserving Projection(LPP) aims to preserve
More informationA Taxonomy of Semi-Supervised Learning Algorithms
A Taxonomy of Semi-Supervised Learning Algorithms Olivier Chapelle Max Planck Institute for Biological Cybernetics December 2005 Outline 1 Introduction 2 Generative models 3 Low density separation 4 Graph
More informationRobust Face Recognition via Sparse Representation Authors: John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar Sastry, and Yi Ma
Robust Face Recognition via Sparse Representation Authors: John Wright, Allen Y. Yang, Arvind Ganesh, S. Shankar Sastry, and Yi Ma Presented by Hu Han Jan. 30 2014 For CSE 902 by Prof. Anil K. Jain: Selected
More informationClassification. Vladimir Curic. Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University
Classification Vladimir Curic Centre for Image Analysis Swedish University of Agricultural Sciences Uppsala University Outline An overview on classification Basics of classification How to choose appropriate
More informationTraining-Free, Generic Object Detection Using Locally Adaptive Regression Kernels
Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIENCE, VOL.32, NO.9, SEPTEMBER 2010 Hae Jong Seo, Student Member,
More informationAdaptive Binary Quantization for Fast Nearest Neighbor Search
IBM Research Adaptive Binary Quantization for Fast Nearest Neighbor Search Zhujin Li 1, Xianglong Liu 1*, Junjie Wu 1, and Hao Su 2 1 Beihang University, Beijing, China 2 Stanford University, Stanford,
More informationLinear Discriminant Analysis in Ottoman Alphabet Character Recognition
Linear Discriminant Analysis in Ottoman Alphabet Character Recognition ZEYNEB KURT, H. IREM TURKMEN, M. ELIF KARSLIGIL Department of Computer Engineering, Yildiz Technical University, 34349 Besiktas /
More informationFADA: An Efficient Dimension Reduction Scheme for Image Classification
Best Paper Candidate in Retrieval rack, Pacific-rim Conference on Multimedia, December 11-14, 7, Hong Kong. FADA: An Efficient Dimension Reduction Scheme for Image Classification Yijuan Lu 1, Jingsheng
More informationFace Recognition for Mobile Devices
Face Recognition for Mobile Devices Aditya Pabbaraju (adisrinu@umich.edu), Srujankumar Puchakayala (psrujan@umich.edu) INTRODUCTION Face recognition is an application used for identifying a person from
More informationObject and Class Recognition I:
Object and Class Recognition I: Object Recognition Lectures 10 Sources ICCV 2005 short courses Li Fei-Fei (UIUC), Rob Fergus (Oxford-MIT), Antonio Torralba (MIT) http://people.csail.mit.edu/torralba/iccv2005
More informationIEEE International Conference on Data Mining Proceedings. Copyright IEEE, Computer Society.
Title Cross-domain web image annotation Author(s Si, S; Tao, D; Chan, KP Citation The IEEE International Conference on Data Mining Workshops (ICDMW 2009, Miami, FL., 6 December 2009. In Proceedings of
More information3D Models and Matching
3D Models and Matching representations for 3D object models particular matching techniques alignment-based systems appearance-based systems GC model of a screwdriver 1 3D Models Many different representations
More informationFace Recognition Based on LDA and Improved Pairwise-Constrained Multiple Metric Learning Method
Journal of Information Hiding and Multimedia Signal Processing c 2016 ISSN 2073-4212 Ubiquitous International Volume 7, Number 5, September 2016 Face Recognition ased on LDA and Improved Pairwise-Constrained
More informationObject and Action Detection from a Single Example
Object and Action Detection from a Single Example Peyman Milanfar* EE Department University of California, Santa Cruz *Joint work with Hae Jong Seo AFOSR Program Review, June 4-5, 29 Take a look at this:
More informationDeep Learning for Computer Vision
Deep Learning for Computer Vision Spring 2018 http://vllab.ee.ntu.edu.tw/dlcv.html (primary) https://ceiba.ntu.edu.tw/1062dlcv (grade, etc.) FB: DLCV Spring 2018 Yu Chiang Frank Wang 王鈺強, Associate Professor
More informationLecture 4 Face Detection and Classification. Lin ZHANG, PhD School of Software Engineering Tongji University Spring 2018
Lecture 4 Face Detection and Classification Lin ZHANG, PhD School of Software Engineering Tongji University Spring 2018 Any faces contained in the image? Who are they? Outline Overview Face detection Introduction
More informationHETEROGENEOUS FACE RECOGNITION VIA GRASSMANNIAN BASED NEAREST SUBSPACE SEARCH
HETEROGENEOUS FACE RECOGNITION VIA GRASSMANNIAN BASED NEAREST SUBSPACE SEARCH Yuan Tian 1, Cheng Yan 1, Xiao Bai 1, Jun Zhou 2 1 School of Computer Science and Engineering, Beihang University, Beijing,
More informationA REVIEW ON SEARCH BASED FACE ANNOTATION USING WEAKLY LABELED FACIAL IMAGES
A REVIEW ON SEARCH BASED FACE ANNOTATION USING WEAKLY LABELED FACIAL IMAGES Dhokchaule Sagar Patare Swati Makahre Priyanka Prof.Borude Krishna ABSTRACT This paper investigates framework of face annotation
More informationThree things everyone should know to improve object retrieval. Relja Arandjelović and Andrew Zisserman (CVPR 2012)
Three things everyone should know to improve object retrieval Relja Arandjelović and Andrew Zisserman (CVPR 2012) University of Oxford 2 nd April 2012 Large scale object retrieval Find all instances of
More informationFast Indexing and Search. Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation
Fast Indexing and Search Lida Huang, Ph.D. Senior Member of Consulting Staff Magma Design Automation Motivation Object categorization? http://www.cs.utexas.edu/~grauman/slides/jain_et_al_cvpr2008.ppt Motivation
More informationCogniSight, image recognition engine
CogniSight, image recognition engine Making sense of video and images Generating insights, meta data and decision Applications 2 Inspect, Sort Identify, Track Detect, Count Search, Tag Match, Compare Find,
More informationSu et al. Shape Descriptors - III
Su et al. Shape Descriptors - III Siddhartha Chaudhuri http://www.cse.iitb.ac.in/~cs749 Funkhouser; Feng, Liu, Gong Recap Global A shape descriptor is a set of numbers that describes a shape in a way that
More informationOn Classification: An Empirical Study of Existing Algorithms Based on Two Kaggle Competitions
On Classification: An Empirical Study of Existing Algorithms Based on Two Kaggle Competitions CAMCOS Report Day December 9th, 2015 San Jose State University Project Theme: Classification The Kaggle Competition
More informationAn efficient face recognition algorithm based on multi-kernel regularization learning
Acta Technica 61, No. 4A/2016, 75 84 c 2017 Institute of Thermomechanics CAS, v.v.i. An efficient face recognition algorithm based on multi-kernel regularization learning Bi Rongrong 1 Abstract. A novel
More informationInstance-level recognition
Instance-level recognition 1) Local invariant features 2) Matching and recognition with local features 3) Efficient visual search 4) Very large scale indexing Matching of descriptors Matching and 3D reconstruction
More information