Adaptive Gesture Recognition System Integrating Multiple Inputs
|
|
- Lucas Ferguson
- 5 years ago
- Views:
Transcription
1 Adaptive Gesture Recognition System Integrating Multiple Inputs Master Thesis - Colloquium Tobias Staron University of Hamburg Faculty of Mathematics, Informatics and Natural Sciences Technical Aspects of Multimodal Systems May 19, 2015 Tobias Staron 1
2 Table of contents Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 2
3 Introduction University of Hamburg Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 3
4 Introduction - Gesture Recognition in General Overview Introduction Gesture Recognition in General Motivation Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 4
5 Introduction - Gesture Recognition in General Gesture Recognition in General several applications (more natural interaction with robots, way of communication, sign language,...) Gesture recognition is the process by which the gestures made by the user are recognized by the receiver. (Mitra & Acharya, 2007 [3]) static vs. dynamic gestures Tobias Staron 5
6 Introduction - Motivation Overview Introduction Gesture Recognition in General Motivation Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 6
7 Introduction - Motivation Previous Work TAMS - Master Project Intelligent Robotics ( ) vision-based system (Microsoft Kinect) for recognizing static gestures depth images and Support Vector Machines (SVMs) project paper (Paetzel & Staron, 2014 [4]) Tobias Staron 7
8 Introduction - Motivation Problems in Gesture Recognition recognition results in general context-depended applications changed circumstances, e.g. new users / users with different figures, changed environments, changed camera properties (position, calibration,...), light changes,... exploiting features of Robotics (a robot might have more than one sensor; possible interaction between user and robot) Tobias Staron 8
9 Introduction - Motivation Hypotheses use of multiple inputs improved recognition results (& context-independent systems) use of multiple inputs robustness possible interaction between user and robot ability of the system to adapt to changed circumstances possible interaction between user and robot omitting of preliminary training development of an adaptive gesture recognition system that makes use of multiple inputs Tobias Staron 9
10 Set-Up University of Hamburg Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 10
11 Set-Up - Inputs University of Hamburg Overview Introduction Set-Up Inputs Data Evaluation Criteria k-nearest Neighbor (k-nn) Classifiers Multiple Inputs & Adaptivity Conclusion Tobias Staron 11
12 Set-Up - Inputs University of Hamburg Depth Images gray value images information about distances to the camera preprocessing (noise reduction, foreground separation, histogram equalization, grid) (Biswas & Basu, 2011 [2]) gray value binning in grid cells 520 features Tobias Staron 12
13 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image RGB image of an exemplary gesture. Tobias Staron 13
14 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image The corresponding depth image prior to preprocessing. Tobias Staron 13
15 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image The depth image but with reduced noise. Tobias Staron 13
16 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image Only the foreground of the depth image. Tobias Staron 13
17 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image The foreground of the depth image after histogram equalization. Tobias Staron 13
18 Set-Up - Inputs University of Hamburg Exemplary Preprocessing of a Depth Image The equalized foreground of the depth image with a grid put on it. Tobias Staron 13
19 Set-Up - Inputs University of Hamburg Skeletal Information OpenNI tracker position and orientation of several joints of the human skeleton a coordinate frame for each joint transformations into target frame 8 joints 56 features Tobias Staron 14
20 Set-Up - Data University of Hamburg Overview Introduction Set-Up Inputs Data Evaluation Criteria k-nearest Neighbor (k-nn) Classifiers Multiple Inputs & Adaptivity Conclusion Tobias Staron 15
21 Set-Up - Data University of Hamburg Collecting Training and Test Data 12 gestures 10 test users different groups (users with similar/differing figures) different poses and positions (to the left or right) but no different distances to the camera different environments camera calibration and illumination remained unchanged Tobias Staron 16
22 Set-Up - Data University of Hamburg Different Environments Tobias Staron 17
23 Set-Up - Evaluation Criteria Overview Introduction Set-Up Inputs Data Evaluation Criteria k-nearest Neighbor (k-nn) Classifiers Multiple Inputs & Adaptivity Conclusion Tobias Staron 18
24 Set-Up - Evaluation Criteria Evaluation Criteria precision: proportion of test instances classified correctly recall: proportion of instances that should have been classified as a certain gesture that have actually got the respective label F 1 -score = (2 precision recall)/(precision + recall) average classification and (initial) training time nr. of training instances Tobias Staron 19
25 Set-Up - k-nearest Neighbor (k-nn) Classifiers Overview Introduction Set-Up Inputs Data Evaluation Criteria k-nearest Neighbor (k-nn) Classifiers Multiple Inputs & Adaptivity Conclusion Tobias Staron 20
26 Set-Up - k-nearest Neighbor (k-nn) Classifiers k-nearest Neighbor (k-nn) Classifier supervised learning method arbitrary number of dimensions no explicit training (computations during classification) label that occurred most among the k-nearest neighbors of a query instances is chosen distance measure (e.g. Euclidean distance) Tobias Staron 21
27 Three classes, represented by blue squares, magenta diamonds and yellow Tobias Staron pentagons respectively; the red square is the query instance. 22 University of Hamburg Set-Up - k-nearest Neighbor (k-nn) Classifiers Exemplary Dataset in the 2-Dimensional Space
28 Set-Up - k-nearest Neighbor (k-nn) Classifiers Weighted k-nn Classifier if a training example matches the query instance, its label will be chosen Generalization the nearer one of its k-nearest neighbors lies by the query instance, the higher the probability that its label is the result Tobias Staron 23
29 Set-Up - k-nearest Neighbor (k-nn) Classifiers Training of Classifiers classifiers for each kind of input, for each group of users and for each environment the same amount of training (and test) data for each classifier Tobias Staron 24
30 Multiple Inputs & Adaptivity Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 25
31 Multiple Inputs & Adaptivity - Combining Multiple Inputs Overview Introduction Set-Up Multiple Inputs & Adaptivity Combining Multiple Inputs Adaptivity Integrated System Conclusion Tobias Staron 26
32 Multiple Inputs & Adaptivity - Combining Multiple Inputs Sensor Fusion low-level sensor fusion: fusion at signal level, one classifier high-level sensor fusion: fusion at a more symbolic level, one classifier per input, classification results are fused low-level sensor fusion does not allow for variations regarding the chosen inputs (e.g. adding or removing of sensors) without omitting previous data / retraining everything high-level sensor fusion was chosen Tobias Staron 27
33 Multiple Inputs & Adaptivity - Combining Multiple Inputs Hypotheses Verification inspired by Aldoma et al. (Aldoma et al., 2013 [1]) high-level sensor fusion approach one classifier per kind of input each classifier can generate an unspecified number of hypotheses each hypothesis is weighted hypothesis with the highest weight is chosen as recognition result Tobias Staron 28
34 Multiple Inputs & Adaptivity - Combining Multiple Inputs Weighting Cues each hypothesis is weighted by an unspecified number of weighting cues neighboring cue (in case of k-nn classifiers): all labels occurring among k-nearest neighbors as hypotheses; weights depend on nr. of examples with respective labels / on their distance to the query instance meta-features: e.g. reliability of classifiers summation of weights of a hypothesis Tobias Staron 29
35 Multiple Inputs & Adaptivity - Combining Multiple Inputs Evaluation (1) the same data were used as for testing the classifiers with depth respectively skeletal information individually k-nn classifier: best performance for the neighboring cue weighted k-nn classifier outperformed the standard one improved robustness Tobias Staron 30
36 Comparison of the individual inputs and their combination via Tobias Staron neighboring cues for the k-nn classifier, trained and tested on data from 31 University of Hamburg Multiple Inputs & Adaptivity - Combining Multiple Inputs Evaluation (1) F 1 -score depth images skeletal information neighboring cue k
37 Comparison of the individual inputs and their combination via Tobias Staron neighboring cues for the weighted k-nn classifier, trained and tested on 31 University of Hamburg Multiple Inputs & Adaptivity - Combining Multiple Inputs Evaluation (1) F 1 -score depth images skeletal information neighboring cue k
38 Multiple Inputs & Adaptivity - Combining Multiple Inputs Evaluation (1) - depth images skeletal data combined inputs F 1 -score Table: Comparison of the individual inputs and their combination via neighboring cues for the weighted 5-NN classifier, trained on data from users with similar figures and tested on data from the same users, but in a different environment. Tobias Staron 31
39 Multiple Inputs & Adaptivity - Adaptivity Overview Introduction Set-Up Multiple Inputs & Adaptivity Combining Multiple Inputs Adaptivity Integrated System Conclusion Tobias Staron 32
40 Multiple Inputs & Adaptivity - Adaptivity Online Learning goal: recognition of gestures under changed circumstances classifiers try to recognize query instances and are told the correct label afterwards to update their model no online version for SVMs (they need to be retrained every time new training are added) k-nn classifiers different points when to learn showed no apparent effects Tobias Staron 33
41 Multiple Inputs & Adaptivity - Adaptivity Evaluation (2) 5-NN classifier trained on depth images from users with similar figures and tested on depth images from the same users, but in a different environment online learning after each misclassification training data and the test data of iteration 1 the same as for previous tests similar tests in the remaining iterations, but with newly sampled test data Tobias Staron 34
42 Multiple Inputs & Adaptivity - Adaptivity Evaluation (2) F 1 -score nr. of training data nr. of retraining F 1 -score # iteration Tobias Staron 35
43 Multiple Inputs & Adaptivity - Adaptivity Evaluation (2) average classification time nr. of training data nr. of retraining time (sec) # iteration Tobias Staron 35
44 Multiple Inputs & Adaptivity - Integrated System Overview Introduction Set-Up Multiple Inputs & Adaptivity Combining Multiple Inputs Adaptivity Integrated System Conclusion Tobias Staron 36
45 Multiple Inputs & Adaptivity - Integrated System Multiple Inputs Hypotheses Verification additional weighting cues are enabled by online learning: the experience of a classifier (the number of examples it has been trained with) Tobias Staron 37
46 Multiple Inputs & Adaptivity - Integrated System Adaptivity online Learning what examples to learn previously: all misclassified ones alternative: misclassified examples as soon as the fusion result is wrong, too Tobias Staron 38
47 Multiple Inputs & Adaptivity - Integrated System Evaluation (3) 5-NN classifier trained on data from users with similar figures and tested on data from the same users, but in a different environment depth images and skeletal data combined via neighboring cue online learning after each misclassification training data and the test data of iteration 1 the same as for previous tests similar tests in the remaining iterations, but with newly sampled test data Tobias Staron 39
48 Multiple Inputs & Adaptivity - Integrated System Evaluation (3) F 1 -score learning after false classifier prediction learning after false classifier and overall prediction iteration Tobias Staron 40
49 Multiple Inputs & Adaptivity - Integrated System Evaluation (3) time (sec) learning after false classifier prediction learning after false classifier and overall prediction iteration Tobias Staron 40
50 Multiple Inputs & Adaptivity - Integrated System Evaluation (4) - Final Test weighted 5-NN classifier depth images and skeletal data combined via neighboring cue online learning after each misclassification when fusion result false, too no preliminary training test data from users with similar figures (first ten and last ten iterations), data from users with varying figures (iteration 11-20), original users, but in a different environment (iteration 21-30) and the users with the varying figures in that environment (iteration 31-40) Tobias Staron 41
51 Multiple Inputs & Adaptivity - Integrated System Evaluation (4) - Final Test F 1 -score # F score nr. of training data - depth images nr. of training data - skeletal information iteration Tobias Staron 42
52 Conclusion University of Hamburg Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Tobias Staron 43
53 Conclusion - Summary Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Summary References Tobias Staron 44
54 Conclusion - Summary Hypotheses use of multiple inputs lead to improved recognition results as well as a more robust system system is able to adapt to changed circumstances due to online learning preliminary training can be omitted because of online learning adaptive gesture recognition system that makes use of multiple inputs Tobias Staron 45
55 Conclusion - References Overview Introduction Set-Up Multiple Inputs & Adaptivity Conclusion Summary References Tobias Staron 46
56 Conclusion - References References I 1. Aitor Aldoma, Federico Tombari, Johann Prankl, Andreas Richtsfeld, Luigi Di Stefano, and Markus Vincze. Multimodal Cue Integration through Hypotheses Verification for RGB-D Object Recognition and 6DOF Pose Estimation. In Robotics and Automation (ICRA), 2013 IEEE International Conference on, pages IEEE, May K. K. Biswas and Saurav Kumar Basu. Gesture Recognition using Microsoft Kinect R. In Automation, Robotics and Applications (ICARA), th International Conference on., pages December Tobias Staron 47
57 Conclusion - References References II 3. Sushmita Mitra and Tinku Acharya. Gesture Recognition: A Survey. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on, 37(3): , Maike Paetzel and Tobias Staron. Gesture Recognition. Project Paper, Technical Aspects of Multimodal Systems, Department of Informatics, MIN-Faculty, University of Hamburg, March Tobias Staron 48
58 Conclusion - University of Hamburg The End Thanks for Your Attention! Tobias Staron 49
Object Classification in Domestic Environments
Object Classification in Domestic Environments Markus Vincze Aitor Aldoma, Markus Bader, Peter Einramhof, David Fischinger, Andreas Huber, Lara Lammer, Thomas Mörwald, Sven Olufs, Ekaterina Potapova, Johann
More information3D object recognition used by team robotto
3D object recognition used by team robotto Workshop Juliane Hoebel February 1, 2016 Faculty of Computer Science, Otto-von-Guericke University Magdeburg Content 1. Introduction 2. Depth sensor 3. 3D object
More informationTri-modal Human Body Segmentation
Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4
More informationSimultaneous Localization and Mapping
Sebastian Lembcke SLAM 1 / 29 MIN Faculty Department of Informatics Simultaneous Localization and Mapping Visual Loop-Closure Detection University of Hamburg Faculty of Mathematics, Informatics and Natural
More informationThe Kinect Sensor. Luís Carriço FCUL 2014/15
Advanced Interaction Techniques The Kinect Sensor Luís Carriço FCUL 2014/15 Sources: MS Kinect for Xbox 360 John C. Tang. Using Kinect to explore NUI, Ms Research, From Stanford CS247 Shotton et al. Real-Time
More informationA Framework for Efficient Fingerprint Identification using a Minutiae Tree
A Framework for Efficient Fingerprint Identification using a Minutiae Tree Praveer Mansukhani February 22, 2008 Problem Statement Developing a real-time scalable minutiae-based indexing system using a
More informationComputer and Machine Vision
Computer and Machine Vision Lecture Week 10 Part-2 Skeletal Models and Face Detection March 21, 2014 Sam Siewert Outline of Week 10 Lab #4 Overview Lab #5 and #6 Extended Lab Overview SIFT and SURF High
More informationSeminar Heidelberg University
Seminar Heidelberg University Mobile Human Detection Systems Pedestrian Detection by Stereo Vision on Mobile Robots Philip Mayer Matrikelnummer: 3300646 Motivation Fig.1: Pedestrians Within Bounding Box
More informationCHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION
CHAPTER 4 DETECTION OF DISEASES IN PLANT LEAF USING IMAGE SEGMENTATION 4.1. Introduction Indian economy is highly dependent of agricultural productivity. Therefore, in field of agriculture, detection of
More informationHuman Motion Detection in Manufacturing Process
Proceedings of the 2 nd World Congress on Electrical Engineering and Computer Systems and Science (EECSS'16) Budapest, Hungary August 16 17, 2016 Paper No. MVML 110 DOI: 10.11159/mvml16.110 Human Motion
More informationMULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION
MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of
More informationFOREGROUND DETECTION ON DEPTH MAPS USING SKELETAL REPRESENTATION OF OBJECT SILHOUETTES
FOREGROUND DETECTION ON DEPTH MAPS USING SKELETAL REPRESENTATION OF OBJECT SILHOUETTES D. Beloborodov a, L. Mestetskiy a a Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University,
More informationComponent-based Face Recognition with 3D Morphable Models
Component-based Face Recognition with 3D Morphable Models B. Weyrauch J. Huang benjamin.weyrauch@vitronic.com jenniferhuang@alum.mit.edu Center for Biological and Center for Biological and Computational
More informationShort Survey on Static Hand Gesture Recognition
Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of
More informationDetection and Fine 3D Pose Estimation of Texture-less Objects in RGB-D Images
Detection and Pose Estimation of Texture-less Objects in RGB-D Images Tomáš Hodaň1, Xenophon Zabulis2, Manolis Lourakis2, Šťěpán Obdržálek1, Jiří Matas1 1 Center for Machine Perception, CTU in Prague,
More informationMultimodal Gesture Recognition using Multi-stream Recurrent Neural Network
Multimodal Gesture Recognition using Multi-stream Recurrent Neural Network Noriki Nishida and Hideki Nakayama Machine Perception Group Graduate School of Information Science and Technology The University
More informationShape Context Matching For Efficient OCR
Matching For Efficient OCR May 14, 2012 Matching For Efficient OCR Table of contents 1 Motivation Background 2 What is a? Matching s Simliarity Measure 3 Matching s via Pyramid Matching Matching For Efficient
More informationObject Recognition and Pose Estimation by means of Cross-Correlation of Mixture of Projected Gaussian
Object Recognition and Pose Estimation by means of Cross-Correlation of Mixture of Projected Gaussian Mauro Antonello 1, Sukhan Lee 2, Naguib Ahmed 2, and Emanuele Menegatti 1 1 University of Padova, department
More informationTemporal Integration of Feature Correspondences For Enhanced Recognition in Cluttered And Dynamic Environments
Temporal Integration of Feature Correspondences For Enhanced Recognition in Cluttered And Dynamic Environments Thomas Fäulhammer, Aitor Aldoma, Michael Zillich and Markus Vincze Abstract We propose a method
More informationText classification II CE-324: Modern Information Retrieval Sharif University of Technology
Text classification II CE-324: Modern Information Retrieval Sharif University of Technology M. Soleymani Fall 2015 Some slides have been adapted from: Profs. Manning, Nayak & Raghavan (CS-276, Stanford)
More informationDeep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks
Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks Si Chen The George Washington University sichen@gwmail.gwu.edu Meera Hahn Emory University mhahn7@emory.edu Mentor: Afshin
More informationCAP 6412 Advanced Computer Vision
CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/cap6412.html Boqing Gong April 21st, 2016 Today Administrivia Free parameters in an approach, model, or algorithm? Egocentric videos by Aisha
More informationClassifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped
More informationAutomatic Colorization of Grayscale Images
Automatic Colorization of Grayscale Images Austin Sousa Rasoul Kabirzadeh Patrick Blaes Department of Electrical Engineering, Stanford University 1 Introduction ere exists a wealth of photographic images,
More informationCHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1.1 Introduction Pattern recognition is a set of mathematical, statistical and heuristic techniques used in executing `man-like' tasks on computers. Pattern recognition plays an
More informationArm-hand Action Recognition Based on 3D Skeleton Joints Ling RUI 1, Shi-wei MA 1,a, *, Jia-rui WEN 1 and Li-na LIU 1,2
1 International Conference on Control and Automation (ICCA 1) ISBN: 97-1-9-39- Arm-hand Action Recognition Based on 3D Skeleton Joints Ling RUI 1, Shi-wei MA 1,a, *, Jia-rui WEN 1 and Li-na LIU 1, 1 School
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REVIEW ON CONTENT BASED IMAGE RETRIEVAL BY USING VISUAL SEARCH RANKING MS. PRAGATI
More informationMultimodal Cue Integration through Hypotheses Verification for RGB-D Object Recognition and 6DOF Pose Estimation
Multimodal Cue Integration through Hypotheses Verification for RGB-D Object Recognition and 6DOF Pose Estimation A. Aldoma 1 and F. Tombari 2 and J. Prankl 1 and A. Richtsfeld 1 and L. Di Stefano 2 and
More informationLBP Based Facial Expression Recognition Using k-nn Classifier
ISSN 2395-1621 LBP Based Facial Expression Recognition Using k-nn Classifier #1 Chethan Singh. A, #2 Gowtham. N, #3 John Freddy. M, #4 Kashinath. N, #5 Mrs. Vijayalakshmi. G.V 1 chethan.singh1994@gmail.com
More informationModel-based segmentation and recognition from range data
Model-based segmentation and recognition from range data Jan Boehm Institute for Photogrammetry Universität Stuttgart Germany Keywords: range image, segmentation, object recognition, CAD ABSTRACT This
More informationBackground subtraction in people detection framework for RGB-D cameras
Background subtraction in people detection framework for RGB-D cameras Anh-Tuan Nghiem, Francois Bremond INRIA-Sophia Antipolis 2004 Route des Lucioles, 06902 Valbonne, France nghiemtuan@gmail.com, Francois.Bremond@inria.fr
More informationCOSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor
COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality
More informationLoop detection and extended target tracking using laser data
Licentiate seminar 1(39) Loop detection and extended target tracking using laser data Karl Granström Division of Automatic Control Department of Electrical Engineering Linköping University Linköping, Sweden
More informationAn Algorithm based on SURF and LBP approach for Facial Expression Recognition
ISSN: 2454-2377, An Algorithm based on SURF and LBP approach for Facial Expression Recognition Neha Sahu 1*, Chhavi Sharma 2, Hitesh Yadav 3 1 Assistant Professor, CSE/IT, The North Cap University, Gurgaon,
More informationEnsemble Learning for Object Recognition and Pose Estimation
Ensemble Learning for Object Recognition and Pose Estimation German Aerospace Center (DLR) May 10, 2013 Outline 1. Introduction 2. Overview of Available Methods 3. Multi-cue Integration Complementing Modalities
More informationDeep Incremental Scene Understanding. Federico Tombari & Christian Rupprecht Technical University of Munich, Germany
Deep Incremental Scene Understanding Federico Tombari & Christian Rupprecht Technical University of Munich, Germany C. Couprie et al. "Toward Real-time Indoor Semantic Segmentation Using Depth Information"
More informationCS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning
CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning Justin Chen Stanford University justinkchen@stanford.edu Abstract This paper focuses on experimenting with
More informationTRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK
TRANSPARENT OBJECT DETECTION USING REGIONS WITH CONVOLUTIONAL NEURAL NETWORK 1 Po-Jen Lai ( 賴柏任 ), 2 Chiou-Shann Fuh ( 傅楸善 ) 1 Dept. of Electrical Engineering, National Taiwan University, Taiwan 2 Dept.
More informationAn Efficient Approach for Color Pattern Matching Using Image Mining
An Efficient Approach for Color Pattern Matching Using Image Mining * Manjot Kaur Navjot Kaur Master of Technology in Computer Science & Engineering, Sri Guru Granth Sahib World University, Fatehgarh Sahib,
More informationLast week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints
Last week Multi-Frame Structure from Motion: Multi-View Stereo Unknown camera viewpoints Last week PCA Today Recognition Today Recognition Recognition problems What is it? Object detection Who is it? Recognizing
More information6. Multimodal Biometrics
6. Multimodal Biometrics Multimodal biometrics is based on combination of more than one type of biometric modalities or traits. The most compelling reason to combine different modalities is to improve
More informationA Keypoint Descriptor Inspired by Retinal Computation
A Keypoint Descriptor Inspired by Retinal Computation Bongsoo Suh, Sungjoon Choi, Han Lee Stanford University {bssuh,sungjoonchoi,hanlee}@stanford.edu Abstract. The main goal of our project is to implement
More informationHuman Body Recognition and Tracking: How the Kinect Works. Kinect RGB-D Camera. What the Kinect Does. How Kinect Works: Overview
Human Body Recognition and Tracking: How the Kinect Works Kinect RGB-D Camera Microsoft Kinect (Nov. 2010) Color video camera + laser-projected IR dot pattern + IR camera $120 (April 2012) Kinect 1.5 due
More informationBeyond Bags of Features
: for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015
More informationDistribution-free Predictive Approaches
Distribution-free Predictive Approaches The methods discussed in the previous sections are essentially model-based. Model-free approaches such as tree-based classification also exist and are popular for
More informationSensor Fusion-Based Parking Assist System
Sensor Fusion-Based Parking Assist System 2014-01-0327 Jaeseob Choi, Eugene Chang, Daejoong Yoon, and Seongsook Ryu Hyundai & Kia Corp. Hogi Jung and Jaekyu Suhr Hanyang Univ. Published 04/01/2014 CITATION:
More informationThe Comparative Study of Machine Learning Algorithms in Text Data Classification*
The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification
More informationFitting (LMedS, RANSAC)
Fitting (LMedS, RANSAC) Thursday, 23/03/2017 Antonis Argyros e-mail: argyros@csd.uoc.gr LMedS and RANSAC What if we have very many outliers? 2 1 Least Median of Squares ri : Residuals Least Squares n 2
More informationCSE 573: Artificial Intelligence Autumn 2010
CSE 573: Artificial Intelligence Autumn 2010 Lecture 16: Machine Learning Topics 12/7/2010 Luke Zettlemoyer Most slides over the course adapted from Dan Klein. 1 Announcements Syllabus revised Machine
More informationDetecting motion by means of 2D and 3D information
Detecting motion by means of 2D and 3D information Federico Tombari Stefano Mattoccia Luigi Di Stefano Fabio Tonelli Department of Electronics Computer Science and Systems (DEIS) Viale Risorgimento 2,
More informationHuman Detection, Tracking and Activity Recognition from Video
Human Detection, Tracking and Activity Recognition from Video Mihir Patankar University of California San Diego Abstract - Human detection, tracking and activity recognition is an important area of research
More informationEfficient Grasping from RGBD Images: Learning Using a New Rectangle Representation. Yun Jiang, Stephen Moseson, Ashutosh Saxena Cornell University
Efficient Grasping from RGBD Images: Learning Using a New Rectangle Representation Yun Jiang, Stephen Moseson, Ashutosh Saxena Cornell University Problem Goal: Figure out a way to pick up the object. Approach
More informationKernels and Clustering
Kernels and Clustering Robert Platt Northeastern University All slides in this file are adapted from CS188 UC Berkeley Case-Based Learning Non-Separable Data Case-Based Reasoning Classification from similarity
More informationAn ICA based Approach for Complex Color Scene Text Binarization
An ICA based Approach for Complex Color Scene Text Binarization Siddharth Kherada IIIT-Hyderabad, India siddharth.kherada@research.iiit.ac.in Anoop M. Namboodiri IIIT-Hyderabad, India anoop@iiit.ac.in
More informationNearest Neighbor Classifiers
Nearest Neighbor Classifiers CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington 1 The Nearest Neighbor Classifier Let X be the space
More informationGesture Recognition: Hand Pose Estimation. Adrian Spurr Ubiquitous Computing Seminar FS
Gesture Recognition: Hand Pose Estimation Adrian Spurr Ubiquitous Computing Seminar FS2014 27.05.2014 1 What is hand pose estimation? Input Computer-usable form 2 Augmented Reality Gaming Robot Control
More information10-701/15-781, Fall 2006, Final
-7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly
More informationClustering & Classification (chapter 15)
Clustering & Classification (chapter 5) Kai Goebel Bill Cheetham RPI/GE Global Research goebel@cs.rpi.edu cheetham@cs.rpi.edu Outline k-means Fuzzy c-means Mountain Clustering knn Fuzzy knn Hierarchical
More informationExtracting Layers and Recognizing Features for Automatic Map Understanding. Yao-Yi Chiang
Extracting Layers and Recognizing Features for Automatic Map Understanding Yao-Yi Chiang 0 Outline Introduction/ Problem Motivation Map Processing Overview Map Decomposition Feature Recognition Discussion
More informationSelective Search for Object Recognition
Selective Search for Object Recognition Uijlings et al. Schuyler Smith Overview Introduction Object Recognition Selective Search Similarity Metrics Results Object Recognition Kitten Goal: Problem: Where
More informationReal Time Person Detection and Tracking by Mobile Robots using RGB-D Images
Real Time Person Detection and Tracking by Mobile Robots using RGB-D Images Duc My Vo, Lixing Jiang and Andreas Zell Abstract Detecting and tracking humans are key problems for human-robot interaction.
More informationImproving Multi-view Object Recognition by Detecting Changes in Point Clouds
Improving Multi-view Object Recognition by Detecting Changes in Point Clouds Martin Velas 1, Thomas Fäulhammer 2, Michal Spanel 1, Michael Zillich 2, Markus Vincze 2 Abstract This paper proposes the use
More informationRobot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning
Robot Learning 1 General Pipeline 1. Data acquisition (e.g., from 3D sensors) 2. Feature extraction and representation construction 3. Robot learning: e.g., classification (recognition) or clustering (knowledge
More informationCS231N Section. Video Understanding 6/1/2018
CS231N Section Video Understanding 6/1/2018 Outline Background / Motivation / History Video Datasets Models Pre-deep learning CNN + RNN 3D convolution Two-stream What we ve seen in class so far... Image
More informationGenerating Object Candidates from RGB-D Images and Point Clouds
Generating Object Candidates from RGB-D Images and Point Clouds Helge Wrede 11.05.2017 1 / 36 Outline Introduction Methods Overview The Data RGB-D Images Point Clouds Microsoft Kinect Generating Object
More informationCS6375: Machine Learning Gautam Kunapuli. Mid-Term Review
Gautam Kunapuli Machine Learning Data is identically and independently distributed Goal is to learn a function that maps to Data is generated using an unknown function Learn a hypothesis that minimizes
More informationAnalysis of Local Appearance-based Face Recognition on FRGC 2.0 Database
Analysis of Local Appearance-based Face Recognition on FRGC 2.0 Database HAZIM KEMAL EKENEL (ISL),, Germany e- mail: ekenel@ira.uka.de web: http://isl.ira.uka.de/~ekenel/ http://isl.ira.uka.de/face_recognition/
More informationLarge-Scale Traffic Sign Recognition based on Local Features and Color Segmentation
Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,
More informationBeyond Bags of features Spatial information & Shape models
Beyond Bags of features Spatial information & Shape models Jana Kosecka Many slides adapted from S. Lazebnik, FeiFei Li, Rob Fergus, and Antonio Torralba Detection, recognition (so far )! Bags of features
More informationFeature Extractors. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. The Perceptron Update Rule.
CS 188: Artificial Intelligence Fall 2007 Lecture 26: Kernels 11/29/2007 Dan Klein UC Berkeley Feature Extractors A feature extractor maps inputs to feature vectors Dear Sir. First, I must solicit your
More informationPROBLEM 4
PROBLEM 2 PROBLEM 4 PROBLEM 5 PROBLEM 6 PROBLEM 7 PROBLEM 8 PROBLEM 9 PROBLEM 10 PROBLEM 11 PROBLEM 12 PROBLEM 13 PROBLEM 14 PROBLEM 16 PROBLEM 17 PROBLEM 22 PROBLEM 23 PROBLEM 24 PROBLEM 25
More informationStatic Gesture Recognition with Restricted Boltzmann Machines
Static Gesture Recognition with Restricted Boltzmann Machines Peter O Donovan Department of Computer Science, University of Toronto 6 Kings College Rd, M5S 3G4, Canada odonovan@dgp.toronto.edu Abstract
More informationA Human Activity Recognition System Using Skeleton Data from RGBD Sensors Enea Cippitelli, Samuele Gasparrini, Ennio Gambi and Susanna Spinsante
A Human Activity Recognition System Using Skeleton Data from RGBD Sensors Enea Cippitelli, Samuele Gasparrini, Ennio Gambi and Susanna Spinsante -Presented By: Dhanesh Pradhan Motivation Activity Recognition
More informationHuman Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg
Human Detection A state-of-the-art survey Mohammad Dorgham University of Hamburg Presentation outline Motivation Applications Overview of approaches (categorized) Approaches details References Motivation
More informationConstruction Progress Management and Interior Work Analysis Using Kinect 3D Image Sensors
33 rd International Symposium on Automation and Robotics in Construction (ISARC 2016) Construction Progress Management and Interior Work Analysis Using Kinect 3D Image Sensors Kosei Ishida 1 1 School of
More informationVerification: is that a lamp? What do we mean by recognition? Recognition. Recognition
Recognition Recognition The Margaret Thatcher Illusion, by Peter Thompson The Margaret Thatcher Illusion, by Peter Thompson Readings C. Bishop, Neural Networks for Pattern Recognition, Oxford University
More informationWhat do we mean by recognition?
Announcements Recognition Project 3 due today Project 4 out today (help session + photos end-of-class) The Margaret Thatcher Illusion, by Peter Thompson Readings Szeliski, Chapter 14 1 Recognition What
More informationApparel Classifier and Recommender using Deep Learning
Apparel Classifier and Recommender using Deep Learning Live Demo at: http://saurabhg.me/projects/tag-that-apparel Saurabh Gupta sag043@ucsd.edu Siddhartha Agarwal siagarwa@ucsd.edu Apoorve Dave a1dave@ucsd.edu
More informationA method for depth-based hand tracing
A method for depth-based hand tracing Khoa Ha University of Maryland, College Park khoaha@umd.edu Abstract An algorithm for natural human-computer interaction via in-air drawing is detailed. We discuss
More informationObject Recognition Using Pictorial Structures. Daniel Huttenlocher Computer Science Department. In This Talk. Object recognition in computer vision
Object Recognition Using Pictorial Structures Daniel Huttenlocher Computer Science Department Joint work with Pedro Felzenszwalb, MIT AI Lab In This Talk Object recognition in computer vision Brief definition
More informationPEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE
PEOPLE IN SEATS COUNTING VIA SEAT DETECTION FOR MEETING SURVEILLANCE Hongyu Liang, Jinchen Wu, and Kaiqi Huang National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Science
More informationCS 343: Artificial Intelligence
CS 343: Artificial Intelligence Kernels and Clustering Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.
More information08 An Introduction to Dense Continuous Robotic Mapping
NAVARCH/EECS 568, ROB 530 - Winter 2018 08 An Introduction to Dense Continuous Robotic Mapping Maani Ghaffari March 14, 2018 Previously: Occupancy Grid Maps Pose SLAM graph and its associated dense occupancy
More informationAn Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012
An Associate-Predict Model for Face Recognition FIPA Seminar WS 2011/2012, 19.01.2012 INSTITUTE FOR ANTHROPOMATICS, FACIAL IMAGE PROCESSING AND ANALYSIS YIG University of the State of Baden-Wuerttemberg
More informationCS4670: Computer Vision
CS4670: Computer Vision Noah Snavely Lecture 6: Feature matching and alignment Szeliski: Chapter 6.1 Reading Last time: Corners and blobs Scale-space blob detector: Example Feature descriptors We know
More informationA Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering
A Review: Content Base Image Mining Technique for Image Retrieval Using Hybrid Clustering Gurpreet Kaur M-Tech Student, Department of Computer Engineering, Yadawindra College of Engineering, Talwandi Sabo,
More informationVideo shot segmentation using late fusion technique
Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,
More informationKeywords Binary Linked Object, Binary silhouette, Fingertip Detection, Hand Gesture Recognition, k-nn algorithm.
Volume 7, Issue 5, May 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Hand Gestures Recognition
More informationK-Nearest Neighbor Classification Approach for Face and Fingerprint at Feature Level Fusion
K-Nearest Neighbor Classification Approach for Face and Fingerprint at Feature Level Fusion Dhriti PEC University of Technology Chandigarh India Manvjeet Kaur PEC University of Technology Chandigarh India
More informationCS 664 Segmentation. Daniel Huttenlocher
CS 664 Segmentation Daniel Huttenlocher Grouping Perceptual Organization Structural relationships between tokens Parallelism, symmetry, alignment Similarity of token properties Often strong psychophysical
More informationSkin and Face Detection
Skin and Face Detection Linda Shapiro EE/CSE 576 1 What s Coming 1. Review of Bakic flesh detector 2. Fleck and Forsyth flesh detector 3. Details of Rowley face detector 4. Review of the basic AdaBoost
More informationCS 231A Computer Vision (Fall 2012) Problem Set 3
CS 231A Computer Vision (Fall 2012) Problem Set 3 Due: Nov. 13 th, 2012 (2:15pm) 1 Probabilistic Recursion for Tracking (20 points) In this problem you will derive a method for tracking a point of interest
More informationAUTOMATIC 3D HUMAN ACTION RECOGNITION Ajmal Mian Associate Professor Computer Science & Software Engineering
AUTOMATIC 3D HUMAN ACTION RECOGNITION Ajmal Mian Associate Professor Computer Science & Software Engineering www.csse.uwa.edu.au/~ajmal/ Overview Aim of automatic human action recognition Applications
More informationk-nearest Neighbor (knn) Sept Youn-Hee Han
k-nearest Neighbor (knn) Sept. 2015 Youn-Hee Han http://link.koreatech.ac.kr ²Eager Learners Eager vs. Lazy Learning when given a set of training data, it will construct a generalization model before receiving
More informationCRF Based Point Cloud Segmentation Jonathan Nation
CRF Based Point Cloud Segmentation Jonathan Nation jsnation@stanford.edu 1. INTRODUCTION The goal of the project is to use the recently proposed fully connected conditional random field (CRF) model to
More informationCS6716 Pattern Recognition
CS6716 Pattern Recognition Prototype Methods Aaron Bobick School of Interactive Computing Administrivia Problem 2b was extended to March 25. Done? PS3 will be out this real soon (tonight) due April 10.
More informationData Mining. 3.5 Lazy Learners (Instance-Based Learners) Fall Instructor: Dr. Masoud Yaghini. Lazy Learners
Data Mining 3.5 (Instance-Based Learners) Fall 2008 Instructor: Dr. Masoud Yaghini Outline Introduction k-nearest-neighbor Classifiers References Introduction Introduction Lazy vs. eager learning Eager
More informationTask analysis based on observing hands and objects by vision
Task analysis based on observing hands and objects by vision Yoshihiro SATO Keni Bernardin Hiroshi KIMURA Katsushi IKEUCHI Univ. of Electro-Communications Univ. of Karlsruhe Univ. of Tokyo Abstract In
More informationCase-Based Reasoning. CS 188: Artificial Intelligence Fall Nearest-Neighbor Classification. Parametric / Non-parametric.
CS 188: Artificial Intelligence Fall 2008 Lecture 25: Kernels and Clustering 12/2/2008 Dan Klein UC Berkeley Case-Based Reasoning Similarity for classification Case-based reasoning Predict an instance
More informationCS 188: Artificial Intelligence Fall 2008
CS 188: Artificial Intelligence Fall 2008 Lecture 25: Kernels and Clustering 12/2/2008 Dan Klein UC Berkeley 1 1 Case-Based Reasoning Similarity for classification Case-based reasoning Predict an instance
More information