A STUDY ON CLASSIFIERS ACCURACY FOR HAND POSE RECOGNITION

Size: px
Start display at page:

Download "A STUDY ON CLASSIFIERS ACCURACY FOR HAND POSE RECOGNITION"

Transcription

1 BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LIX (LXIII), Fasc. 2, 2013 SecŃia AUTOMATICĂ şi CALCULATOARE A STUDY ON CLASSIFIERS ACCURACY FOR HAND POSE RECOGNITION BY CONSTANTINA RALUCA MIHALACHE and BOGDAN APOSTOL * Gheorghe Asachi Technical University of Iaşi, Faculty of Automatic Control and Computer Engineering Received: May 17, 2013 Accepted for publication: June 21, 2013 Abstract. This paper presents a comparative study between accuracy rates obtained by using different classification architectures for hand pose estimation in RGB-D data. The segmentation of a hand pose is optimized by using depth data in correlation with the grey scale image obtained from a Kinect sensor. We define an observation model composed of feature vectors obtained by calculating the histograms of oriented gradients on colour and depth data and also fingertip positions. A contour tracking algorithm is applied to track the contour of the hand and find the fingertip positions. The most relevant features from the observation model are selected and are served as input to all the classifiers. For this work we have considered Linear, Random Forest (RF), Support Vector Machine (SVM) and Decision Trees (DT) classifiers for posture classification. Experimental results show a 84.18% recognition accuracy is achieved by using the RF classifier, a 79.29% recognition accuracy is achieved using the DT classifier and a 78.27% recognition accuracy for SVM classifier. The multinomial regression is also used for classification purpose but shows a poor 44.26% recognition accuracy. Key words: hand pose recognition, RGB-D, Kinect, histogram of oriented gradients (HOG) features, decision tree, random forest, support vector machines (SVM), linear Mathematics Subject Classification: 65D18, 68T05. * Corresponding authors; bogdan.apostol@tuiasi.ro

2 70 Constantina Raluca Mihalache and Bogdan Apostol 1. Introduction The last few years witnessed an increasing interest in the field of Human Computer Interaction (HCI). Between the wide varieties of hand posture recognition techniques, marker less visual based recognition has brought to us non-restrictive systems for HCI (Rautaray & Agrawal, 2012). With the release of the Kinect sensor in 2010 the capturing of scene colour and depth data has become affordable. The device opens whole new opportunities for applications in the domain of computer vision and pattern recognition. These techniques have applications in sign language recognition, gestural communication and device automatic controlling. Classification of hand gestures is the problem of taking a given hand observation model for an instance whose category is unknown and finding a category that the model is closest to. Based on features extracted from the current model and the data learned from the training set a prediction is done for the current instance. The classification process is the last operation that a recognition system performs. Before classification we need to make an initial segmentation to extract the region of interest, correlate depth and colour data, extract hand features from the selected region and decide which features are the most relevant to the recognition process. In this paper we compare the classification accuracy of four different classifiers applied for the same dataset. The dataset contains features extracted for four different hand poses and a wide variety of camera angles. a b c d Fig. 1 Four hand poses classes contained by the training dataset: a open hand, b peace sign, c ok pose, d like sign. The considered hand gestures are: open hand pose, peace pose, like pose and ok pose, as it is shown in Fig. 1. Decision Trees, Random Forests, Support Vector Machines and Multinomial Regression machine learning approaches are used for comparison purposes. The reminder of this paper is as follows: Section II explains the details of calculating and selecting the most important feature vectors obtained through fingertip recognition and calculating the histogram of gradients (HOG) on both colour and depth data; Section III reviews the four hand pose classification

3 Bul. Inst. Polit. Iaşi, t. LIX (LXIII), f. 2, techniques that we have chosen for our study; Section IV describes the setup for each classifier and compares the accuracy rates obtained through experiments run on the same feature dataset and Section V concludes with a summary and discussion. 2. Feature Extraction from RGB-D Data The input of the classification algorithm is represented by a set of the most important features extracted from the raw video data. The RGB-D format of the stream input captured by a Microsoft Windows Kinect sensor combines visual (RGB colours) and geometric information (depth) in a synchronized format that provides us with the possibility of extracting features from both. Segmentation is an important first step in the process of feature extraction, as it eliminates the background and keeps just the tracked object. In this paper we do a segmentation based only on the depth data and create a validity mask that can be applied on both depth and colour data to extract the hand region. From the segmented data we extract fingertip positions and features that characterize the local appearance of the hand. We track the contour of the hand in the valid depth data by classifying the pixels correspondent to the validity mask in either interior pixels or contour pixels. Contour tracking of objects in binary images is a well-known subject and solutions such as in (Ren et al., 2002; Chang et al., 2004; Yan & Min, 2011) scan the images pixel by pixel in different directions to find the pixels in the contours and have different starting points. In this paper we start bottom up and scan each line of the image for a valid contour point. A valid contour point is a point that has at least another point that is not in the validity mask and thus is an exterior point. After finding the first contour point we use a 3 3 neighbourhood to look for the next pixel of the contour. Once this is found we add it to an ordered list and keep the search direction of the last valid contour point. A k curvature algorithm similar to the one in (Ryan, 2012) is used to find the curves along the tracked contour of the hand. For each point P i that is a valid contour point the algorithm chooses to points P i k and P i+k and calculates the angle ω between them. We consider that P i is a valid curve point if ω is smaller than an empirically chosen value. In this paper we have used a k value of 20 and a ω value of 0.87 radians. The curvature points found are not all fingertip positions and these points can also correspond to valleys in the contour of the hand. We decide if a point is a fingertip by calculating the bisect between the P i P i k and P i P i+k, and choosing the ones for which the bisect points to the interior of the hand, see Fig. 2. Methods like Harris-Corner detection, Scale Invariant Feature Transform (SIFT), and Histograms of oriented gradients (HOG) successfully extract features from visual data by aggregating gradients. These methods split the grayscale images into equal regions and apply masks to calculate gradients orientations.

4 72 Constantina Raluca Mihalache and Bogdan Apostol Fig. 2 Choosing fingertips by calculating bisect orientation of curvature points. We calculate two sets of features corresponding to both visual and geometric data. From the colour image we obtain a grayscale image on which we compute HOG feature vectors. In order to obtain a well-defined grayscale image on which to apply HOG feature extraction we have to apply a transformation on the depth data. A Cumulative Distribution Function (CDF) is used to make a normal distribution of the depth values and represent them in a grayscale image. For the tests in this paper we split the grayscale images obtained from both colour and depth source into 6 6 pixel cells, organized in blocks composed of 3 3 cells and quantize all the gradient directions in the image into 9 valid directions (Mihalache & Apostol, 2013). We reduce the dimensionality of the HOG features extracted from RGB-D data and remove redundant features by using a Kernel Principal Component Analysis (KPCA) method. This method performs better than linear PCA as it ignores noise from input features and removes noise from test features by projecting the data onto the manifold (Cheng et al., 2009). For the experiments conducted in this paper we used a Gaussian Radial Basis Function (RBF) with a value for σ of Similar to (Oikonomidis et al., 2011) we define the observation model that feeds the rest of the algorithm composed of two arrays containing the selection of the most important HOG features from the segmented image of the hand and the corresponding valid depth map, and an array of fingertip positions. 3. Hand Pose Recognition Classifiers From last few years, the task of hand pose recognition are extensively studied and rapid progress seems in this area, including the machine learning approaches such as Decision Trees, Random Forests, Support Vector Machines, and Liniar Multinomial Regression.

5 Bul. Inst. Polit. Iaşi, t. LIX (LXIII), f. 2, In this section we will detail the proposed method of handpose estimation using above mentioned classifiers and clearly present the training and testing steps. The training phase consists of three steps: (1) get RGB images and their associated depth information for different m hand poses from Kinect sensor, (2) reducing obtained HOG features of both colour and depth images using KPCA, (3) building input patterns and train the classifier. Training patterns are presented according our observation model where we define the set of p training patterns over m hand poses as I train = {O ij }, j= 1, m, i= 1, p. As we apply a supervised learning method for each used classifier we split this training set into m classes corresponding to the number of considered hand poses (Fig. 1). A training pattern that is considered an input in the trained classifier is a vector that results by applying KPCA on the HOG. We choose the set of l rgb (l rgb < n rgb ) and the set of l d (l d < n d ) eigenvectors which have the l rgb and l d largest eigenvalues (l rgb and n rgb, l d and n d are the number l of the largest eigenvalues and thedimension of the covariance matrix for the colour and depth HOG features, respective). Proposed pattern representation includes the first l rgb and l d principal components of HOG data applied on i th colour and depth image, respective. The last element in our training pattern is represented by the positions of identified fingers. The trained model is tested on q testing patterns (I test = {O ij }, j= 1, m, i= 1, q, q >> p) over the same m hand poses. Testing is made offline and for each received image from Kinect sensor an observation model is created in order to estimate its hand position Decision Trees Decision trees traditional algorithm is based on the recursive partitioning greedy algorithm which builds a decision tree in a top-down manner. The algorithm starts with the original set X as the root node, iterates through each unused attribute of the set X and computes the information gain IG, where IG(Y X) = H(Y) H(Y X). The information gain is obtained by deducting conditional entropy H(Y X) of the given attribute with the total entropy H(X): m n p( yi, x j ) H( Y X) = p( yi, x j ) log i= 1 j= 1 p( x j ) (1) n H X = p x p x (2) ( ) ( ) log ( ( )) i= 1 i i

6 74 Constantina Raluca Mihalache and Bogdan Apostol The method used for attribute selection is minimizing the value of entropy and maximizing the information gain. The process of decision tree generation by repeatedly splitting on attributes is equivalent to partitioning the initial training set into smaller training set, until the entropy of each of these subsets is zero. At any stage of this process, splitting on any attribute has the property that the average entropy of the resulting subsets will be less than that of the previous training set. Gain ratio (GR) heuristics (uncertainty coefficient) can also be used for choosing best feature and is calculated by dividing its information gain IG with its information value IV. n IV Y X = p( y, x ) log p( y, x ) (3) ( ) j ( j ) j= 1 ( Y X) GR ( X) ( X) = IG Y IV Y (4) 3.2. Random Forests Random forests are an ensemble learning method for classification that operate by building a collection of decision trees at training time and outputting the class that is the mode of the classes output by individual trees. Let n pairs of random variables (X, Y), (X 1, Z 1 ),, (X n, Z n ), where X d is a feature vector that takes its values in R and Y (the label) is a binary {0; 1}- valued random variable. We define a random forest (Biau et al., 2008), with m trees a classifier consisting of a set of randomized base tree classifiers g n (x, Z 1 ),, g n (x, Z m ) which are identically distributed random vectors, independent conditionally on X, Y and D n, where D n is a training dataset and is the collection (X 1, Z 1 ),, (X n, Z n ). The randomizing variable shows how the successive cuts are performed when building the tree such as selection of the node and the coordinate to split, as well as the position of the split. The random forest classifier takes a majority vote among the random tree classifiers. If m is large, the random forest classifier is well approximated by the averaged classifier. A random tree classifier g n (x, Z) is constructed as follows. All nodes of the tree are associated with rectangular cells such that at each step of the construction of the tree, the collection of cells associated with the leaves of the forms a partition of [0,1] d. The root of the random tree is [0,1] d itself. At each step of the construction of the tree, a leaf is chosen uniformly at random. The split variable J is then selected uniformly at random from the d candidates x (1),, x (d). Finally, the selected cell is split along the randomly chosen variable at a random location, chosen according to a uniform random variable on the length of the chosen side of the selected cell. The procedure is repeated k times, where k 1 is a deterministic parameter, fixed before hand by the user, and

7 Bul. Inst. Polit. Iaşi, t. LIX (LXIII), f. 2, possibly depending on n. The randomized classifier g n (x, Z) takes a majority vote among all Y i for which the corresponding feature vector X i falls in the same cell of the random partition as x Support Vector Machines Support vector machines are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification analysis (Cheng et al., 2009). n Given a set of training patterns (x 1, y 1 ),, (x n, y n ) in R R, according to undefined probability distribution P(x, y), and an error function V(y, f(x)) where f(x) is the predicted output instead of ideal output y for the input x our problem consists in finding the function f that minimizes the error: n V( y, f ( x) ) P( x, y) dxd y (5) 1 Having non-linear separable training patterns we map them using a Ф(x) kernel function into a higher dimension space in order to make them linearly separable. The goal of SVM is to find a hyper plane w x b = 0, which separates the two different samples accurately by maximizing the bilateral blank area w maximum. Thus, our problem resumes to, min Φ ( w) = w = w w where yi ( w xi b) 1, i= 1, n Multinomial Regression The generalized linear modeling technique of multinomial logistic regression can be used to model unordered categorical response variables. Multinomial Regression is useful for situations in which is desired to classify subjects based on values of a set of predictor variables. This type of regression is similar to logistic regression, but it is more general because the dependent variable is not restricted to two categories. Parameter estimation is performed through an iterative maximum-likelihood algorithm. Let the response variable Y have r categories, and X 1,, X k be explanatory variables, y i = (y i1,, y ir ) be the response values in i th subgroup having multinomial distribution M n = (n i, ρ i1,, ρ ir ), β j = (β 0j, β 1j,, β kj )' be regression coefficients for the j th response category with respect to the j* th (reference) one, and x i = (x i1,, x ik )' be actual values of explanatory variables for the i th subgroup, then general multinomial regression model is:

8 76 Constantina Raluca Mihalache and Bogdan Apostol p log p ij ij * = xi β j, * j j (6) and log-likelihood function is: l( β ) = log + y log( p ), β = β, j= 1,..., r, j j n n r y i! r i 1 y! = i 1 j 1 j 1 ij = = = ij ij j * ( ) (7) 4. Experimental Results This section will present some experimental hand pose estimation results that are obtained by applying the classifiers described in the previous sections. We compare the accuracy rates of four different classification algorithms which are: Decision Trees, Random Forests, Support Vector Machines and Linear (Multinomial Regression). The performance of the classification algorithms is affected by the quality of data source. Features that are redundant or are irrelevant to hand pose estimation are already taken out through a previous KPCA filtration. The same definite set of filtered features is fed to each of the classification algorithms. In this paper we used features obtained from 1000 RGB-D images for building the training set and an additional set of 300 RGB-D images for the testing data. Images contain four different valid postures of the hand (open hand, peace sign, ok pose, like sign) and also images that show no hand object. Experimental results were obtained by using the Rattle library from R (Zhao, 2012). The Rattle package provides a graphical user interface for data mining that uses the R language. The optimal parameters were chosen empirically and are: a) 20 min splits, 10 min buckets, 30 max depth and a complexity of 0.01 for Decision Trees classifier; b) 400 trees, a number of 10 variables tried at each split for Random Forests classifier; c) Gaussian Radial Basis kernel function, a value of 0.06 for sigma and a cost=1 for Support Vector Machine classifier; d) 1000 max iterations for Linear classifier; We can see the performance of these four algorithms by looking at the confusion matrixes shown in Tables 1 4 for the tests conducted for each of the classifiers on the test dataset.

9 Bul. Inst. Polit. Iaşi, t. LIX (LXIII), f. 2, Actual Actual Actual Actual Table 1 Confusion Matrix for Decision Trees Classifier Predicted Open Peace Like OK pose pose pose pose Accuracy Open pose % Peace pose % Like pose % OK pose % Average 79.29% Table 2 Confusion Matrix for Random Forest Classifier Predicted Open Peace Like OK pose pose pose pose Accuracy Open pose % Peace pose % Like pose % OK pose % Average 84.18% Table 3 Confusion Matrix for Support Vector Machines Classifier Predicted Open Peace Like OK pose pose pose pose Accuracy Open pose % Peace pose % Like pose % OK pose % Average 78.27% Table 4 Confusion Matrix for Linear (Multinomial Regression) Classifier Predicted Open Peace Like OK pose pose pose pose Accuracy Open pose % Peace pose % Like pose % OK pose % Average 44.26%

10 78 Constantina Raluca Mihalache and Bogdan Apostol We can see in Fig. 3 a comparative representation of accuracy detection rates for all classifiers and each hand pose. One can easily see that all classifiers obtain 100% accuracy rates for open hand pose. Both peace hand pose and ok pose are best detected by SVM and RF classifiers, while like pose is best detected by DT and RF classifiers. Fig. 3 Accuracy detection rates for each hand pose. Fig. 4 shows the average accuracy rates of the compared classifiers. As figures shows the Random Forest classifier obtains the best average recognition rate. Fig. 4 Comparison of classifiers average accuracy rates.

11 Bul. Inst. Polit. Iaşi, t. LIX (LXIII), f. 2, Conclusions In this paper we presented a comparative study on four well known classifiers for pattern recognition. We applied these algorithms on the same dataset with application in the domain of hand gesture recognition. As we previously detailed we used four hand gesture classes and compared the accuracy rates for each pose recognition and the overall accuracy rates. In the process of creating the dataset we used RGB-D data from a Kinect sensor and extracted features from both hand shape and aspect. By applying KPCA algorithm we chose the most relevant features from the multitude of extracted HOG features from both colour and depth images. Experimental results show a 84.18% recognition accuracy is achieved by using the RF classifier, a 79.29% recognition accuracy is achieved using the DT classifier and a 78.27% recognition accuracy for SVM classifier. The multinomial regression is also used for classification purpose but shows a poor 44.26% recognition accuracy. It is shown that random forest classifier outperforms the other three algorithms. REFERENCES Biau G., Devroye L., Lugosi G., Consistency of Random Forests and Other Averaging Classifier. The Journal of Machine Learning Research, 9, , 6/1/2008. Chang F., Chen C., Lu C., A Linear-Time Component-Labelling Algorithm Using Contour Tracing Technique. Computer Vision and Image Understanding, , Cheng P., Li W., Ogunbona P., Kernel PCA of HOG Features for Posture Detection. International Conference: Image and Vision Computing New Zealand, , Mihalache C.R., Apostol B., Hand Pose Estimation Using HOG Features From RGB-D Data. In System Theory, Control and Computing (ICSTCC), th International Conference on, Oikonomidis I., Kyriazis N., Argyros A., Efficient Model-Based 3D Tracking of Hand Articulations Using Kinect. Proceedings of the British Machine Vision Conference, 1-11, Rautaray S.S., Agrawal A., Vision Based Hand Gesture Recognition for Human Computer Interaction: a Survey. Artificial Intelligence Review, 1 54, Ren M., Yang J., Sun H., Tracing Boundary Contours in a Binary Image. Image and Vision Computing , Ryan D.J., Finger and Gesture Recognition with Microsoft Kinect. Master s Thesis in Computer Science (TN-IDE), Yan L., Min Z., A New Contour Tracing Automaton in Binary Image IEEE International Conference on Computer Scienceand Automation Engineering (CSAE), , 2011.

12 80 Constantina Raluca Mihalache and Bogdan Apostol Zhao Y., R and Data Mining: Examples and Case Studies. Published by Elsevier in December STUDIU PRIVIND PRECIZIA CLASIFICATORILOR PENTRU RECUNOAŞTEREA POSTURII MÂINII (Rezumat) În ultimii ani interacńiunea om-calculator (HCI) a devenit un domeniu de interes tot mai studiat. Captarea secvenńelor video precum şi corelarea acestora cu informańiile de adâncime (RGB-D) au devenit accesibile odată cu lansarea în 2010 a senzorului Kinect. Această lucrare prezintă un studiu comparativ între preciziile obńinute la testare prin aplicarea mai multor tehnici de clasificare folosind date ce reprezintă caracterisiticile extrase din informańii de tip RGB-D. Se defineşte un model de observare format din vectori de caracteristici obńinuńi prin calculul histogramelor orientărilor gradienńilor (HOG) atât pe datele de culoare cât şi pe datele de adâncime. Tot în acest model se introduc şi informańii despre numărul şi pozińia degetelor. Sunt selectate cele mai importante caracteristici ale modelului de observare şi utilizate ca date de intrare pentru fiecare dintre clasificatorii considerańi. În această lucrare s-au considerat următorii clasificatori pentru recunoaşterea posturii mâinii: Linear, Random Forests (RF), Support Vector Machines (SVM) şi Decision Trees (DT). Rezultatele experimentale arată o precizie de recunoaştere de 84,18% prin utilizarea clasificatorului RF, o precizie de recunoaştere de 79,29% pentru clasificatorul DT şi o precizie de recunoaştere 78,27% pentru clasificatorul SVM. Se utilizează, de asemenea, un clasificator liniar care obńine o precizie de recunoaştere de doar 44,26%.

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor

COSC160: Detection and Classification. Jeremy Bolton, PhD Assistant Teaching Professor COSC160: Detection and Classification Jeremy Bolton, PhD Assistant Teaching Professor Outline I. Problem I. Strategies II. Features for training III. Using spatial information? IV. Reducing dimensionality

More information

Tri-modal Human Body Segmentation

Tri-modal Human Body Segmentation Tri-modal Human Body Segmentation Master of Science Thesis Cristina Palmero Cantariño Advisor: Sergio Escalera Guerrero February 6, 2014 Outline 1 Introduction 2 Tri-modal dataset 3 Proposed baseline 4

More information

Classification of objects from Video Data (Group 30)

Classification of objects from Video Data (Group 30) Classification of objects from Video Data (Group 30) Sheallika Singh 12665 Vibhuti Mahajan 12792 Aahitagni Mukherjee 12001 M Arvind 12385 1 Motivation Video surveillance has been employed for a long time

More information

VARIATION OF INTERNAL FORCES USING ARTIFICIAL NEURONAL NETWORK

VARIATION OF INTERNAL FORCES USING ARTIFICIAL NEURONAL NETWORK BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Volumul 63 (67), Numărul 1, 2017 Secţia CONSTRUCŢII. ARHITECTURĂ VARIATION OF INTERNAL FORCES USING

More information

Robotics Programming Laboratory

Robotics Programming Laboratory Chair of Software Engineering Robotics Programming Laboratory Bertrand Meyer Jiwon Shin Lecture 8: Robot Perception Perception http://pascallin.ecs.soton.ac.uk/challenges/voc/databases.html#caltech car

More information

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers A. Salhi, B. Minaoui, M. Fakir, H. Chakib, H. Grimech Faculty of science and Technology Sultan Moulay Slimane

More information

Image Processing. Image Features

Image Processing. Image Features Image Processing Image Features Preliminaries 2 What are Image Features? Anything. What they are used for? Some statements about image fragments (patches) recognition Search for similar patches matching

More information

Region-based Segmentation

Region-based Segmentation Region-based Segmentation Image Segmentation Group similar components (such as, pixels in an image, image frames in a video) to obtain a compact representation. Applications: Finding tumors, veins, etc.

More information

Posture detection by kernel PCA-based manifold learning

Posture detection by kernel PCA-based manifold learning University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2010 Posture detection by kernel PCA-based manifold learning Peng

More information

Short Survey on Static Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition Short Survey on Static Hand Gesture Recognition Huu-Hung Huynh University of Science and Technology The University of Danang, Vietnam Duc-Hoang Vo University of Science and Technology The University of

More information

APPLICATIONS OF MICROSOFT EXCEL - SOLVER FOR HORIZONTAL AND LEVELLING NETWORKS ADJUSTMENT

APPLICATIONS OF MICROSOFT EXCEL - SOLVER FOR HORIZONTAL AND LEVELLING NETWORKS ADJUSTMENT BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Volumul 63 (67), Numărul 1-2, 2017 Secţia HIDROTEHNICĂ APPLICATIONS OF MICROSOFT EXCEL - SOLVER FOR

More information

ImageCLEF 2011

ImageCLEF 2011 SZTAKI @ ImageCLEF 2011 Bálint Daróczy joint work with András Benczúr, Róbert Pethes Data Mining and Web Search Group Computer and Automation Research Institute Hungarian Academy of Sciences Training/test

More information

Object Classification Problem

Object Classification Problem HIERARCHICAL OBJECT CATEGORIZATION" Gregory Griffin and Pietro Perona. Learning and Using Taxonomies For Fast Visual Categorization. CVPR 2008 Marcin Marszalek and Cordelia Schmid. Constructing Category

More information

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints

Last week. Multi-Frame Structure from Motion: Multi-View Stereo. Unknown camera viewpoints Last week Multi-Frame Structure from Motion: Multi-View Stereo Unknown camera viewpoints Last week PCA Today Recognition Today Recognition Recognition problems What is it? Object detection Who is it? Recognizing

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning

Robot Learning. There are generally three types of robot learning: Learning from data. Learning by demonstration. Reinforcement learning Robot Learning 1 General Pipeline 1. Data acquisition (e.g., from 3D sensors) 2. Feature extraction and representation construction 3. Robot learning: e.g., classification (recognition) or clustering (knowledge

More information

BOOLEAN FUNCTION DECOMPOSITION BASED ON FPGA BASIC CELL STRUCTURE

BOOLEAN FUNCTION DECOMPOSITION BASED ON FPGA BASIC CELL STRUCTURE BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LXI (LXV), Fasc. 1, 2015 SecŃia AUTOMATICĂ şi CALCULATOARE BOOLEAN FUNCTION DECOMPOSITION BASED

More information

The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method

The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method The Novel Approach for 3D Face Recognition Using Simple Preprocessing Method Parvin Aminnejad 1, Ahmad Ayatollahi 2, Siamak Aminnejad 3, Reihaneh Asghari Abstract In this work, we presented a novel approach

More information

Investigating the Effect of Different Kernel Functions on the Performance of SVM for Recognizing Arabic Characters

Investigating the Effect of Different Kernel Functions on the Performance of SVM for Recognizing Arabic Characters Investigating the Effect of Different Functions on the Performance of SVM for Recognizing Arabic Characters Sayed Fadel 1,2, Said Ghoniemy 1,2, Mohamed Abdallah 1,3, Hussein Abu Sorra 1, Amira Ashour 1,4,

More information

Random Forest A. Fornaser

Random Forest A. Fornaser Random Forest A. Fornaser alberto.fornaser@unitn.it Sources Lecture 15: decision trees, information theory and random forests, Dr. Richard E. Turner Trees and Random Forests, Adele Cutler, Utah State University

More information

Enhancing Forestry Object Detection using Multiple Features

Enhancing Forestry Object Detection using Multiple Features Enhancing Forestry Object Detection using Multiple Features A THESIS Submitted in partial fulfillment of requirements for Master Degree of Computing Science By: Ahmad Ostovar ahos0003@student.umu.se Supervisors:

More information

10-701/15-781, Fall 2006, Final

10-701/15-781, Fall 2006, Final -7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly

More information

Robust PDF Table Locator

Robust PDF Table Locator Robust PDF Table Locator December 17, 2016 1 Introduction Data scientists rely on an abundance of tabular data stored in easy-to-machine-read formats like.csv files. Unfortunately, most government records

More information

CS 229 Midterm Review

CS 229 Midterm Review CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask

More information

Facial Expression Classification with Random Filters Feature Extraction

Facial Expression Classification with Random Filters Feature Extraction Facial Expression Classification with Random Filters Feature Extraction Mengye Ren Facial Monkey mren@cs.toronto.edu Zhi Hao Luo It s Me lzh@cs.toronto.edu I. ABSTRACT In our work, we attempted to tackle

More information

CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning

CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning Justin Chen Stanford University justinkchen@stanford.edu Abstract This paper focuses on experimenting with

More information

Tracking system. Danica Kragic. Object Recognition & Model Based Tracking

Tracking system. Danica Kragic. Object Recognition & Model Based Tracking Tracking system Object Recognition & Model Based Tracking Motivation Manipulating objects in domestic environments Localization / Navigation Object Recognition Servoing Tracking Grasping Pose estimation

More information

Data mining with Support Vector Machine

Data mining with Support Vector Machine Data mining with Support Vector Machine Ms. Arti Patle IES, IPS Academy Indore (M.P.) artipatle@gmail.com Mr. Deepak Singh Chouhan IES, IPS Academy Indore (M.P.) deepak.schouhan@yahoo.com Abstract: Machine

More information

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN 2016 International Conference on Artificial Intelligence: Techniques and Applications (AITA 2016) ISBN: 978-1-60595-389-2 Face Recognition Using Vector Quantization Histogram and Support Vector Machine

More information

Face detection and recognition. Detection Recognition Sally

Face detection and recognition. Detection Recognition Sally Face detection and recognition Detection Recognition Sally Face detection & recognition Viola & Jones detector Available in open CV Face recognition Eigenfaces for face recognition Metric learning identification

More information

Hand Gesture Extraction by Active Shape Models

Hand Gesture Extraction by Active Shape Models Hand Gesture Extraction by Active Shape Models Nianjun Liu, Brian C. Lovell School of Information Technology and Electrical Engineering The University of Queensland, Brisbane 4072, Australia National ICT

More information

Information theory methods for feature selection

Information theory methods for feature selection Information theory methods for feature selection Zuzana Reitermanová Department of Computer Science Faculty of Mathematics and Physics Charles University in Prague, Czech Republic Diplomový a doktorandský

More information

Exploring Curve Fitting for Fingers in Egocentric Images

Exploring Curve Fitting for Fingers in Egocentric Images Exploring Curve Fitting for Fingers in Egocentric Images Akanksha Saran Robotics Institute, Carnegie Mellon University 16-811: Math Fundamentals for Robotics Final Project Report Email: asaran@andrew.cmu.edu

More information

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong)

Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) Biometrics Technology: Image Processing & Pattern Recognition (by Dr. Dickson Tong) References: [1] http://homepages.inf.ed.ac.uk/rbf/hipr2/index.htm [2] http://www.cs.wisc.edu/~dyer/cs540/notes/vision.html

More information

Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information

Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information Mustafa Berkay Yilmaz, Hakan Erdogan, Mustafa Unel Sabanci University, Faculty of Engineering and Natural

More information

Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map?

Announcements. Recognition I. Gradient Space (p,q) What is the reflectance map? Announcements I HW 3 due 12 noon, tomorrow. HW 4 to be posted soon recognition Lecture plan recognition for next two lectures, then video and motion. Introduction to Computer Vision CSE 152 Lecture 17

More information

AUTONOMOUS ROBOT NAVIGATION BASED ON FUZZY LOGIC AND REINFORCEMENT LEARNING

AUTONOMOUS ROBOT NAVIGATION BASED ON FUZZY LOGIC AND REINFORCEMENT LEARNING BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi, Tomul LVI (LX), Fasc. 4, 2010 Secţia CONSTRUCŢII DE MAŞINI AUTONOMOUS ROBOT NAVIGATION BASED ON FUZZY

More information

Hand gesture recognition with Leap Motion and Kinect devices

Hand gesture recognition with Leap Motion and Kinect devices Hand gesture recognition with Leap Motion and devices Giulio Marin, Fabio Dominio and Pietro Zanuttigh Department of Information Engineering University of Padova, Italy Abstract The recent introduction

More information

Fast or furious? - User analysis of SF Express Inc

Fast or furious? - User analysis of SF Express Inc CS 229 PROJECT, DEC. 2017 1 Fast or furious? - User analysis of SF Express Inc Gege Wen@gegewen, Yiyuan Zhang@yiyuan12, Kezhen Zhao@zkz I. MOTIVATION The motivation of this project is to predict the likelihood

More information

Transfer Forest Based on Covariate Shift

Transfer Forest Based on Covariate Shift Transfer Forest Based on Covariate Shift Masamitsu Tsuchiya SECURE, INC. tsuchiya@secureinc.co.jp Yuji Yamauchi, Takayoshi Yamashita, Hironobu Fujiyoshi Chubu University yuu@vision.cs.chubu.ac.jp, {yamashita,

More information

HISTOGRAMS OF ORIENTATIO N GRADIENTS

HISTOGRAMS OF ORIENTATIO N GRADIENTS HISTOGRAMS OF ORIENTATIO N GRADIENTS Histograms of Orientation Gradients Objective: object recognition Basic idea Local shape information often well described by the distribution of intensity gradients

More information

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia

Computer Science Faculty, Bandar Lampung University, Bandar Lampung, Indonesia Application Object Detection Using Histogram of Oriented Gradient For Artificial Intelegence System Module of Nao Robot (Control System Laboratory (LSKK) Bandung Institute of Technology) A K Saputra 1.,

More information

Methods for Intelligent Systems

Methods for Intelligent Systems Methods for Intelligent Systems Lecture Notes on Clustering (II) Davide Eynard eynard@elet.polimi.it Department of Electronics and Information Politecnico di Milano Davide Eynard - Lecture Notes on Clustering

More information

Pattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition

Pattern Recognition. Kjell Elenius. Speech, Music and Hearing KTH. March 29, 2007 Speech recognition Pattern Recognition Kjell Elenius Speech, Music and Hearing KTH March 29, 2007 Speech recognition 2007 1 Ch 4. Pattern Recognition 1(3) Bayes Decision Theory Minimum-Error-Rate Decision Rules Discriminant

More information

Discriminative classifiers for image recognition

Discriminative classifiers for image recognition Discriminative classifiers for image recognition May 26 th, 2015 Yong Jae Lee UC Davis Outline Last time: window-based generic object detection basic pipeline face detection with boosting as case study

More information

CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points]

CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points] CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, 2015. 11:59pm, PDF to Canvas [100 points] Instructions. Please write up your responses to the following problems clearly and concisely.

More information

Credit card Fraud Detection using Predictive Modeling: a Review

Credit card Fraud Detection using Predictive Modeling: a Review February 207 IJIRT Volume 3 Issue 9 ISSN: 2396002 Credit card Fraud Detection using Predictive Modeling: a Review Varre.Perantalu, K. BhargavKiran 2 PG Scholar, CSE, Vishnu Institute of Technology, Bhimavaram,

More information

Kernel PCA of HOG features for posture detection

Kernel PCA of HOG features for posture detection University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Kernel PCA of HOG features for posture detection Peng Cheng University

More information

MIT 801. Machine Learning I. [Presented by Anna Bosman] 16 February 2018

MIT 801. Machine Learning I. [Presented by Anna Bosman] 16 February 2018 MIT 801 [Presented by Anna Bosman] 16 February 2018 Machine Learning What is machine learning? Artificial Intelligence? Yes as we know it. What is intelligence? The ability to acquire and apply knowledge

More information

3D Perception. CS 4495 Computer Vision K. Hawkins. CS 4495 Computer Vision. 3D Perception. Kelsey Hawkins Robotics

3D Perception. CS 4495 Computer Vision K. Hawkins. CS 4495 Computer Vision. 3D Perception. Kelsey Hawkins Robotics CS 4495 Computer Vision Kelsey Hawkins Robotics Motivation What do animals, people, and robots want to do with vision? Detect and recognize objects/landmarks Find location of objects with respect to themselves

More information

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao

Classifying Images with Visual/Textual Cues. By Steven Kappes and Yan Cao Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao Motivation Image search Building large sets of classified images Robotics Background Object recognition is unsolved Deformable shaped

More information

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others

Introduction to object recognition. Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Introduction to object recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and others Overview Basic recognition tasks A statistical learning approach Traditional or shallow recognition

More information

Task analysis based on observing hands and objects by vision

Task analysis based on observing hands and objects by vision Task analysis based on observing hands and objects by vision Yoshihiro SATO Keni Bernardin Hiroshi KIMURA Katsushi IKEUCHI Univ. of Electro-Communications Univ. of Karlsruhe Univ. of Tokyo Abstract In

More information

ABOUT MANUFACTURING PROCESSES CAPABILITY ANALYSIS

ABOUT MANUFACTURING PROCESSES CAPABILITY ANALYSIS BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LIX (LXIII), Fasc. 4, 013 Secţia CONSTRUCŢII DE MAŞINI ABOUT MANUFACTURING PROCESSES CAPABILITY

More information

CSE 158. Web Mining and Recommender Systems. Midterm recap

CSE 158. Web Mining and Recommender Systems. Midterm recap CSE 158 Web Mining and Recommender Systems Midterm recap Midterm on Wednesday! 5:10 pm 6:10 pm Closed book but I ll provide a similar level of basic info as in the last page of previous midterms CSE 158

More information

2. Data Preprocessing

2. Data Preprocessing 2. Data Preprocessing Contents of this Chapter 2.1 Introduction 2.2 Data cleaning 2.3 Data integration 2.4 Data transformation 2.5 Data reduction Reference: [Han and Kamber 2006, Chapter 2] SFU, CMPT 459

More information

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition

Verification: is that a lamp? What do we mean by recognition? Recognition. Recognition Recognition Recognition The Margaret Thatcher Illusion, by Peter Thompson The Margaret Thatcher Illusion, by Peter Thompson Readings C. Bishop, Neural Networks for Pattern Recognition, Oxford University

More information

Business Club. Decision Trees

Business Club. Decision Trees Business Club Decision Trees Business Club Analytics Team December 2017 Index 1. Motivation- A Case Study 2. The Trees a. What is a decision tree b. Representation 3. Regression v/s Classification 4. Building

More information

CHECKING THE HOMOGENEITY OF CONCRETE USING ARTIFICIAL NEURAL NETWORK

CHECKING THE HOMOGENEITY OF CONCRETE USING ARTIFICIAL NEURAL NETWORK BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LXI (LXV), Fasc., 05 Secţia CONSTRUCŢII. ARHITECTURĂ CHECKING THE HOMOGENEITY OF CONCRETE USING

More information

What do we mean by recognition?

What do we mean by recognition? Announcements Recognition Project 3 due today Project 4 out today (help session + photos end-of-class) The Margaret Thatcher Illusion, by Peter Thompson Readings Szeliski, Chapter 14 1 Recognition What

More information

Decision Trees Dr. G. Bharadwaja Kumar VIT Chennai

Decision Trees Dr. G. Bharadwaja Kumar VIT Chennai Decision Trees Decision Tree Decision Trees (DTs) are a nonparametric supervised learning method used for classification and regression. The goal is to create a model that predicts the value of a target

More information

[2008] IEEE. Reprinted, with permission, from [Yan Chen, Qiang Wu, Xiangjian He, Wenjing Jia,Tom Hintz, A Modified Mahalanobis Distance for Human

[2008] IEEE. Reprinted, with permission, from [Yan Chen, Qiang Wu, Xiangjian He, Wenjing Jia,Tom Hintz, A Modified Mahalanobis Distance for Human [8] IEEE. Reprinted, with permission, from [Yan Chen, Qiang Wu, Xiangian He, Wening Jia,Tom Hintz, A Modified Mahalanobis Distance for Human Detection in Out-door Environments, U-Media 8: 8 The First IEEE

More information

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation

Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation Large-Scale Traffic Sign Recognition based on Local Features and Color Segmentation M. Blauth, E. Kraft, F. Hirschenberger, M. Böhm Fraunhofer Institute for Industrial Mathematics, Fraunhofer-Platz 1,

More information

A Novel Extreme Point Selection Algorithm in SIFT

A Novel Extreme Point Selection Algorithm in SIFT A Novel Extreme Point Selection Algorithm in SIFT Ding Zuchun School of Electronic and Communication, South China University of Technolog Guangzhou, China zucding@gmail.com Abstract. This paper proposes

More information

MAC LEVEL BASED QUALITY OF SERVICE MANAGEMENT IN IEEE NETWORKS

MAC LEVEL BASED QUALITY OF SERVICE MANAGEMENT IN IEEE NETWORKS BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LVII (LXI), Fasc. 4, 2011 SecŃia ELECTROTEHNICĂ. ENERGETICĂ. ELECTRONICĂ MAC LEVEL BASED QUALITY

More information

Part-based and local feature models for generic object recognition

Part-based and local feature models for generic object recognition Part-based and local feature models for generic object recognition May 28 th, 2015 Yong Jae Lee UC Davis Announcements PS2 grades up on SmartSite PS2 stats: Mean: 80.15 Standard Dev: 22.77 Vote on piazza

More information

EE 701 ROBOT VISION. Segmentation

EE 701 ROBOT VISION. Segmentation EE 701 ROBOT VISION Regions and Image Segmentation Histogram-based Segmentation Automatic Thresholding K-means Clustering Spatial Coherence Merging and Splitting Graph Theoretic Segmentation Region Growing

More information

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION

MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION MULTI ORIENTATION PERFORMANCE OF FEATURE EXTRACTION FOR HUMAN HEAD RECOGNITION Panca Mudjirahardjo, Rahmadwati, Nanang Sulistiyanto and R. Arief Setyawan Department of Electrical Engineering, Faculty of

More information

Louis Fourrier Fabien Gaie Thomas Rolf

Louis Fourrier Fabien Gaie Thomas Rolf CS 229 Stay Alert! The Ford Challenge Louis Fourrier Fabien Gaie Thomas Rolf Louis Fourrier Fabien Gaie Thomas Rolf 1. Problem description a. Goal Our final project is a recent Kaggle competition submitted

More information

Model-based Visual Tracking:

Model-based Visual Tracking: Technische Universität München Model-based Visual Tracking: the OpenTL framework Giorgio Panin Technische Universität München Institut für Informatik Lehrstuhl für Echtzeitsysteme und Robotik (Prof. Alois

More information

CS229: Action Recognition in Tennis

CS229: Action Recognition in Tennis CS229: Action Recognition in Tennis Aman Sikka Stanford University Stanford, CA 94305 Rajbir Kataria Stanford University Stanford, CA 94305 asikka@stanford.edu rkataria@stanford.edu 1. Motivation As active

More information

Integral Channel Features with Random Forest for 3D Facial Landmark Detection

Integral Channel Features with Random Forest for 3D Facial Landmark Detection MSc Artificial Intelligence Track: Computer Vision Master Thesis Integral Channel Features with Random Forest for 3D Facial Landmark Detection by Arif Qodari 10711996 February 2016 42 EC Supervisor/Examiner:

More information

Patch-based Object Recognition. Basic Idea

Patch-based Object Recognition. Basic Idea Patch-based Object Recognition 1! Basic Idea Determine interest points in image Determine local image properties around interest points Use local image properties for object classification Example: Interest

More information

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe

Face detection and recognition. Many slides adapted from K. Grauman and D. Lowe Face detection and recognition Many slides adapted from K. Grauman and D. Lowe Face detection and recognition Detection Recognition Sally History Early face recognition systems: based on features and distances

More information

MODELING THE FORCE-ELONGATION CURVE OF SINGLE YARNS

MODELING THE FORCE-ELONGATION CURVE OF SINGLE YARNS BULETINUL INSTITUTULUI POLITEHNIC DIN IAŞI Publicat de Universitatea Tehnică Gheorghe Asachi din Iaşi Tomul LVI (LX), Fasc. 1, 2010 SecŃia TEXTILE. PIELĂRIE MODELING THE FORCE-ELONGATION CURVE OF SINGLE

More information

Hand part classification using single depth images

Hand part classification using single depth images Hand part classification using single depth images Myoung-Kyu Sohn, Dong-Ju Kim and Hyunduk Kim Department of IT Convergence, Daegu Gyeongbuk Institute of Science & Technology (DGIST), Daegu, South Korea

More information

Estimating normal vectors and curvatures by centroid weights

Estimating normal vectors and curvatures by centroid weights Computer Aided Geometric Design 21 (2004) 447 458 www.elsevier.com/locate/cagd Estimating normal vectors and curvatures by centroid weights Sheng-Gwo Chen, Jyh-Yang Wu Department of Mathematics, National

More information

SUPPLEMENTARY FILE S1: 3D AIRWAY TUBE RECONSTRUCTION AND CELL-BASED MECHANICAL MODEL. RELATED TO FIGURE 1, FIGURE 7, AND STAR METHODS.

SUPPLEMENTARY FILE S1: 3D AIRWAY TUBE RECONSTRUCTION AND CELL-BASED MECHANICAL MODEL. RELATED TO FIGURE 1, FIGURE 7, AND STAR METHODS. SUPPLEMENTARY FILE S1: 3D AIRWAY TUBE RECONSTRUCTION AND CELL-BASED MECHANICAL MODEL. RELATED TO FIGURE 1, FIGURE 7, AND STAR METHODS. 1. 3D AIRWAY TUBE RECONSTRUCTION. RELATED TO FIGURE 1 AND STAR METHODS

More information

The SIFT (Scale Invariant Feature

The SIFT (Scale Invariant Feature The SIFT (Scale Invariant Feature Transform) Detector and Descriptor developed by David Lowe University of British Columbia Initial paper ICCV 1999 Newer journal paper IJCV 2004 Review: Matt Brown s Canonical

More information

3D Models and Matching

3D Models and Matching 3D Models and Matching representations for 3D object models particular matching techniques alignment-based systems appearance-based systems GC model of a screwdriver 1 3D Models Many different representations

More information

3D object recognition used by team robotto

3D object recognition used by team robotto 3D object recognition used by team robotto Workshop Juliane Hoebel February 1, 2016 Faculty of Computer Science, Otto-von-Guericke University Magdeburg Content 1. Introduction 2. Depth sensor 3. 3D object

More information

A method for depth-based hand tracing

A method for depth-based hand tracing A method for depth-based hand tracing Khoa Ha University of Maryland, College Park khoaha@umd.edu Abstract An algorithm for natural human-computer interaction via in-air drawing is detailed. We discuss

More information

Beyond Bags of Features

Beyond Bags of Features : for Recognizing Natural Scene Categories Matching and Modeling Seminar Instructed by Prof. Haim J. Wolfson School of Computer Science Tel Aviv University December 9 th, 2015

More information

Network Traffic Measurements and Analysis

Network Traffic Measurements and Analysis DEIB - Politecnico di Milano Fall, 2017 Sources Hastie, Tibshirani, Friedman: The Elements of Statistical Learning James, Witten, Hastie, Tibshirani: An Introduction to Statistical Learning Andrew Ng:

More information

CS 231A CA Session: Problem Set 4 Review. Kevin Chen May 13, 2016

CS 231A CA Session: Problem Set 4 Review. Kevin Chen May 13, 2016 CS 231A CA Session: Problem Set 4 Review Kevin Chen May 13, 2016 PS4 Outline Problem 1: Viewpoint estimation Problem 2: Segmentation Meanshift segmentation Normalized cut Problem 1: Viewpoint Estimation

More information

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation.

Equation to LaTeX. Abhinav Rastogi, Sevy Harris. I. Introduction. Segmentation. Equation to LaTeX Abhinav Rastogi, Sevy Harris {arastogi,sharris5}@stanford.edu I. Introduction Copying equations from a pdf file to a LaTeX document can be time consuming because there is no easy way

More information

Classification/Regression Trees and Random Forests

Classification/Regression Trees and Random Forests Classification/Regression Trees and Random Forests Fabio G. Cozman - fgcozman@usp.br November 6, 2018 Classification tree Consider binary class variable Y and features X 1,..., X n. Decide Ŷ after a series

More information

3. Data Preprocessing. 3.1 Introduction

3. Data Preprocessing. 3.1 Introduction 3. Data Preprocessing Contents of this Chapter 3.1 Introduction 3.2 Data cleaning 3.3 Data integration 3.4 Data transformation 3.5 Data reduction SFU, CMPT 740, 03-3, Martin Ester 84 3.1 Introduction Motivation

More information

Facial expression recognition using shape and texture information

Facial expression recognition using shape and texture information 1 Facial expression recognition using shape and texture information I. Kotsia 1 and I. Pitas 1 Aristotle University of Thessaloniki pitas@aiia.csd.auth.gr Department of Informatics Box 451 54124 Thessaloniki,

More information

Face recognition based on improved BP neural network

Face recognition based on improved BP neural network Face recognition based on improved BP neural network Gaili Yue, Lei Lu a, College of Electrical and Control Engineering, Xi an University of Science and Technology, Xi an 710043, China Abstract. In order

More information

8. Tree-based approaches

8. Tree-based approaches Foundations of Machine Learning École Centrale Paris Fall 2015 8. Tree-based approaches Chloé-Agathe Azencott Centre for Computational Biology, Mines ParisTech chloe agathe.azencott@mines paristech.fr

More information

Subpixel Corner Detection Using Spatial Moment 1)

Subpixel Corner Detection Using Spatial Moment 1) Vol.31, No.5 ACTA AUTOMATICA SINICA September, 25 Subpixel Corner Detection Using Spatial Moment 1) WANG She-Yang SONG Shen-Min QIANG Wen-Yi CHEN Xing-Lin (Department of Control Engineering, Harbin Institute

More information

Cellular Tree Classifiers. Gérard Biau & Luc Devroye

Cellular Tree Classifiers. Gérard Biau & Luc Devroye Cellular Tree Classifiers Gérard Biau & Luc Devroye Paris, December 2013 Outline 1 Context 2 Cellular tree classifiers 3 A mathematical model 4 Are there consistent cellular tree classifiers? 5 A non-randomized

More information

Network Traffic Measurements and Analysis

Network Traffic Measurements and Analysis DEIB - Politecnico di Milano Fall, 2017 Introduction Often, we have only a set of features x = x 1, x 2,, x n, but no associated response y. Therefore we are not interested in prediction nor classification,

More information

ECE 285 Class Project Report

ECE 285 Class Project Report ECE 285 Class Project Report Based on Source localization in an ocean waveguide using supervised machine learning Yiwen Gong ( yig122@eng.ucsd.edu), Yu Chai( yuc385@eng.ucsd.edu ), Yifeng Bu( ybu@eng.ucsd.edu

More information

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIENCE, VOL.32, NO.9, SEPTEMBER 2010 Hae Jong Seo, Student Member,

More information

MULTIVARIATE TEXTURE DISCRIMINATION USING A PRINCIPAL GEODESIC CLASSIFIER

MULTIVARIATE TEXTURE DISCRIMINATION USING A PRINCIPAL GEODESIC CLASSIFIER MULTIVARIATE TEXTURE DISCRIMINATION USING A PRINCIPAL GEODESIC CLASSIFIER A.Shabbir 1, 2 and G.Verdoolaege 1, 3 1 Department of Applied Physics, Ghent University, B-9000 Ghent, Belgium 2 Max Planck Institute

More information

The Curse of Dimensionality

The Curse of Dimensionality The Curse of Dimensionality ACAS 2002 p1/66 Curse of Dimensionality The basic idea of the curse of dimensionality is that high dimensional data is difficult to work with for several reasons: Adding more

More information

Fast Edge Detection Using Structured Forests

Fast Edge Detection Using Structured Forests Fast Edge Detection Using Structured Forests Piotr Dollár, C. Lawrence Zitnick [1] Zhihao Li (zhihaol@andrew.cmu.edu) Computer Science Department Carnegie Mellon University Table of contents 1. Introduction

More information

Face Detection and Recognition in an Image Sequence using Eigenedginess

Face Detection and Recognition in an Image Sequence using Eigenedginess Face Detection and Recognition in an Image Sequence using Eigenedginess B S Venkatesh, S Palanivel and B Yegnanarayana Department of Computer Science and Engineering. Indian Institute of Technology, Madras

More information