Feature Subset Selection Utilizing BioMechanical Characteristics for Hand Gesture Recognition

Similar documents
USC Real-time Pattern Isolation and Recognition Over Immersive Sensor Data Streams

Feature Selection. CE-725: Statistical Pattern Recognition Sharif University of Technology Spring Soleymani

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery

Image Classification Using Wavelet Coefficients in Low-pass Bands

A Model of Machine Learning Based on User Preference of Attributes

An Empirical Study on feature selection for Data Classification

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Structure of Association Rule Classifiers: a Review

An Efficient Approach for Color Pattern Matching Using Image Mining

DI TRANSFORM. The regressive analyses. identify relationships

Improving the Efficiency of Fast Using Semantic Similarity Algorithm

Feature Selection Technique to Improve Performance Prediction in a Wafer Fabrication Process

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1394

A Two-stage Scheme for Dynamic Hand Gesture Recognition

Short Survey on Static Hand Gesture Recognition

CLASSIFICATION FOR SCALING METHODS IN DATA MINING

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

Best First and Greedy Search Based CFS and Naïve Bayes Algorithms for Hepatitis Diagnosis

WEIGHTED K NEAREST NEIGHBOR CLASSIFICATION ON FEATURE PROJECTIONS 1

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Lorentzian Distance Classifier for Multiple Features

Feature Selection Using Modified-MCA Based Scoring Metric for Classification

University of Florida CISE department Gator Engineering. Data Preprocessing. Dr. Sanjay Ranka

CS231A Course Project Final Report Sign Language Recognition with Unsupervised Feature Learning

FEATURE EVALUATION FOR EMG-BASED LOAD CLASSIFICATION

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

Dynamic Human Shape Description and Characterization

Essentials for Modern Data Analysis Systems

ANALYSIS OF DENSE AND SPARSE PATTERNS TO IMPROVE MINING EFFICIENCY

Data Preprocessing. Data Preprocessing

An Effective Performance of Feature Selection with Classification of Data Mining Using SVM Algorithm

BENCHMARKING ATTRIBUTE SELECTION TECHNIQUES FOR MICROARRAY DATA

Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset

Enhancing K-means Clustering Algorithm with Improved Initial Center

Recommendation System for Location-based Social Network CS224W Project Report

Leveraging Set Relations in Exact Set Similarity Join

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Supervised Variable Clustering for Classification of NIR Spectra

Action Recognition & Categories via Spatial-Temporal Features

Effect of Principle Component Analysis and Support Vector Machine in Software Fault Prediction

Feature Selection Using Principal Feature Analysis

Correlation Based Feature Selection with Irrelevant Feature Removal

Trust4All: a Trustworthy Middleware Platform for Component Software

REMOVAL OF REDUNDANT AND IRRELEVANT DATA FROM TRAINING DATASETS USING SPEEDY FEATURE SELECTION METHOD

Multivariate Time Series Classification Using Inter-leaved Shapelets

Artificial Neural Network-Based Prediction of Human Posture

Handling Missing Values via Decomposition of the Conditioned Set

Image Mining: frameworks and techniques

Express Letters. A Simple and Efficient Search Algorithm for Block-Matching Motion Estimation. Jianhua Lu and Ming L. Liou

A Conflict-Based Confidence Measure for Associative Classification

Non-rigid body Object Tracking using Fuzzy Neural System based on Multiple ROIs and Adaptive Motion Frame Method

EFFICIENT ADAPTIVE PREPROCESSING WITH DIMENSIONALITY REDUCTION FOR STREAMING DATA

Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features

Efficient Tuning of SVM Hyperparameters Using Radius/Margin Bound and Iterative Algorithms

Feature Subset Selection Problem using Wrapper Approach in Supervised Learning

Task analysis based on observing hands and objects by vision

Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection

PATTERN RECOGNITION USING NEURAL NETWORKS

Object Detection in a Fixed Frame

1. Introduction. 2. Motivation and Problem Definition. Volume 8 Issue 2, February Susmita Mohapatra

International Journal of Computer Engineering and Applications, Volume XII, Special Issue, March 18, ISSN

Rank Measures for Ordering

Value Added Association Rules

Information-Theoretic Feature Selection Algorithms for Text Classification

Engineering Drawings Recognition Using a Case-based Approach

A Combined Method for On-Line Signature Verification

Searching Image Databases Containing Trademarks

Seeing and Reading Red: Hue and Color-word Correlation in Images and Attendant Text on the WWW

Measuring and Evaluating Dissimilarity in Data and Pattern Spaces

Fabric Image Retrieval Using Combined Feature Set and SVM

Using a genetic algorithm for editing k-nearest neighbor classifiers

The Encoding Complexity of Network Coding

Estimating Missing Attribute Values Using Dynamically-Ordered Attribute Trees

Comparison of Optimization Methods for L1-regularized Logistic Regression

Recent Progress on RAIL: Automating Clustering and Comparison of Different Road Classification Techniques on High Resolution Remotely Sensed Imagery

Study on Classifiers using Genetic Algorithm and Class based Rules Generation

Face Recognition Using Vector Quantization Histogram and Support Vector Machine Classifier Rong-sheng LI, Fei-fei LEE *, Yan YAN and Qiu CHEN

Normalized Texture Motifs and Their Application to Statistical Object Modeling

Object Purpose Based Grasping

Purna Prasad Mutyala et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (5), 2011,

Improving Latent Fingerprint Matching Performance by Orientation Field Estimation using Localized Dictionaries

Gesture Recognition using Neural Networks

Inital Starting Point Analysis for K-Means Clustering: A Case Study

Data Collection, Preprocessing and Implementation

Disguised Face Identification Based Gabor Feature and SVM Classifier

Fast Efficient Clustering Algorithm for Balanced Data

The USC 2 T Project The USC Clever Transportation Project

Linear Discriminant Analysis in Ottoman Alphabet Character Recognition

Fractional Discrimination for Texture Image Segmentation

Content Based Image Retrieval system with a combination of Rough Set and Support Vector Machine

Dimensionality Reduction, including by Feature Selection.

Research Article Path Planning Using a Hybrid Evolutionary Algorithm Based on Tree Structure Encoding

The Un-normalized Graph p-laplacian based Semi-supervised Learning Method and Speech Recognition Problem

Bottom-up Integration Testing with the Technique of Metamorphic Testing

Hand gesture recognition with Leap Motion and Kinect devices

Structure-adaptive Image Denoising with 3D Collaborative Filtering

Noise-based Feature Perturbation as a Selection Method for Microarray Data

Development of real-time motion capture system for 3D on-line games linked with virtual character

Sensor Based Time Series Classification of Body Movement

Transcription:

Feature Subset Selection Utilizing BioMechanical Characteristics for Hand Gesture Recognition Farid Parvini Computer Science Department University of Southern California Los Angeles, USA Dennis McLeod Computer Science Department University of Southern California Los Angeles, USA Abstract Feature Subset Selection has become the focus of much research in areas of application for Multivariate Time Series (MTS). MTS data sets are common in many multimedia and medical applications such as gesture recognition, video sequence matching and EEG/ECG data analysis. MTS data sets are high dimensional as they consist of a series of observations of many variables at a time. The objective of feature subset selection is two-fold: providing a faster and more cost-effective process and a better understanding of the underlying process that generated the data. We propose a subset selection approach based on biomechanical characteristics, a simple yet effective technique for MTS. We apply our approach for recognizing ASL static signs using Neural Network and Multi-Layer Neural Network and show that we can maintain the same accuracy by selecting just 50% of the generated data. 1. Introduction With the potential for many interactive applications, automatic hand gesture recognition has been actively investigated in the computer pattern recognition and Bio-Informatics communities. Hand gestures are often subtle, can happen at various timescales, and may exhibit long-range dependencies. All these issues make hand gesture recognition a challenging problem. Sometimes, using all the features available has a detrimental effect on accuracy when compared to using some subset of the features. This is because some features are dependent on other features and often noisy, hence, there is no need to include those features. Furthermore, the more features there are, the more likely that some feature will randomly fit the data and hence the probability of over fitting increases. Thus removing these features would lead to both improved accuracy, and clearer descriptions of learned concepts, since there are less features with which the user has to deal with. However, there is a problem: feature subset selection is a difficult problem. If there are n features, then there are 2 n possible feature subsets. Searching this space for a good set of features can be extremely time-consuming.[7] We present a new method for finding the subset data that are well suited for hand gesture recognition. Our method is based on evaluating data received from each sensor and calculating the range of motion for each sensor individually. Our approach is distinct and novel in the following two aspects. First, to the best of our knowledge, our approach utilizing a bio-mechanical characteristics for feature subset selection is unique among all the studies which have intended to apply FSS to Multivariate Time Series for analyzing the collected raw data for gesture recognition. Second, our approach addresses the major challenges involved in FSS: accuracy and time complexity. The remainder of this paper is organized as follows. Section 2 discusses related work. We present our approach in Section 3. The results of our experiments are reported in Section 4. Finally, Section 5 concludes this paper and discusses our future research plans. Figure 1. ASL Alphabets

2. Related Work There are two possible approaches to feature subset selection: the wrapper approach and the filter approach [1]. In the wrapper approach, the learner is applied to subsets of features and tested on a hold-out set (or, if execution time is not an issue, using cross-validation). From the results of these tests, a good subset of features is selected. Yand et al [9] used a wrapper genetic algorithm to select such subsets, achieving multi-criteria optimization in terms of generalization accuracy and costs associated with the features. Bin-Bin Qu [8] adopted an approximation quality concept and hierarchy structure for feature subset selection. The other approach to feature subset selection is the filter approach, which looks at correlations between features and other performance measures to decide a priori, independent of any particular learner, to find what a good subset of features is. Although such techniques do not perform nearly as well as the wrapper approach, they are much faster. Many approaches have been proposed in the area of Multivariate Time Series. Chakraborty [2] summarizes the current techniques for feature subset selection and classification for MTS data sets. Shababi et. al. [3]proposed a family of unsupervised methods for feature subset selection from Multivariate Time Series based on Common Principal Component Analysis. They also proposed an approach for MTS with Extremely Large Spatial Features [4].Tucker [5]presented a method for decomposing high dimensional MTS into mutually exclusive subsets of variables where within-group dependencies are high and between group dependencies are low. Yang [6] utilized the properties of the principal components to retain the correlation information among original features. of one possible starting posture and the posture representing ASL [16] sign L are shown in Figure 2. Figure 2. ASL sign L and its representation with k=4 The process of gesture recognition starts with collecting data from the sensors attached to the hand of a user. At each sensor clock, the sensory device driver captures one sample by acquiring data from all of the n sensors of the device. Each sample is stored in a tuple with n fields where each field is associated with one sensor. We represent a sample at time t by St = (s1, s2,..., sn ), where each si is a real number indicating the value that is acquired from sensor i at time t. As time evolves, a data set of samples is acquired and generated. An example of such a data set of samples is shown in Figure 3. 3. Feature Subset Selection Based on the Range of Motion Gesture recognition by utilizing bio-mechanical characteristics, originally proposed in [11], is inspired by the observation that all forms of hand signs include finger-joint movements from a starting posture to a final posture. We utilize the concept of range of motion from the BioMechanical literature at each joint to abstract this movement. Range of motion (ROM) [15] is a quantity that defines the joint movement by measuring the angle from the starting position of an axis to its position at the end of its full range of the movement. We compute the range of motion per joint by using the sensor values acquired by the sensory device attached to a human hand. Suppose that a user is making the sign by wearing a sensory device. The user is required to start making the sign from a starting posture toward a final posture. An example Figure 3. A snapshot of MTS generated by sensory device Suppose St0 = (s1,..., sn ) and St = (s01,..., s0n ) represent the samples of the start and final postures for making a sign, respectively. We calculate the range of motion tuple R4t as follows : R4t = St St0 = (r1,..., rn ) i (1,..., n), ri = s0i si and 4t = t t0 R4t is a tuple consisting of n positive or negative real numbers depending on the direction of the movement. In order to normalize the values of the movement, we first construct R = ( r1,..., rn ). The rationale behind using

Figure 4. ASL alphabets and N R presentation for each sensor Figure 5. Cyberglove with numbered sensors absolute values is that smaller values (i.e., larger negative numbers) in R do not necessarily mean less movement. To capture the direction of the movement and differentiate between movements in opposite directions, for each R, we calculate D which holds directional information as follows: D t = (d 1,..., d n ) where { 1 if ri 0 d i = 1 otherwise Subsequently, we find the maximum and minimum values within R and represent them as M(R) and m(r), respectively. We then normalize each value in R by subtracting m(r) and dividing the result by (M(R) m(r)). We represent the result of this normalized R which consists of values between 0 and 1 with NR. Finally, we discretize the values of NR with a given discretization parameter k(> 1). For example, if k = 2, we replace each value of NR with 0 if its value is less than 0.5, and 1 otherwise. The resulting NR with k = 4 for all ASL alphabets are displayed in Figure 4 Our hypothesis is that there is a direct correlation between the range of motion and the signature that represents the sign. Consequently, subset selection needs to be based on the maximum range of motion. To this end, we propose eliminating the data received from the sensors whose range of motion are minimal compared to other sensors.

Figure 6. Comprehensive Experiment Result for Three Sets of Experiment In the next section, we evaluate our approach and provide the result of our experiments. 4. Experimental Results and Observations For our experiments, we used the CyberGlove [17] as a virtual reality user interface for acquiring data. CyberGlove is a glove that provides up to 22 joint-angle measurements. It uses proprietary resistive bend-sensing technology to transform hand and finger motions into real-time digital joint-angle data. A picture of this glove and the location of each sensor are shown in Figure 5. We initiated our experiments by collecting data from ten signers wearing the glove and making all 24 ASL static signs representing the 24 English alphabet from the starting posture as shown in Figure 1. Note that signs for letter J and Z involve the movement of the hand and hence considered as dynamic signs and consequently are excluded from our experiments. Figure 1 shows there are not enough sensors in the glove to capture the relationship between the index and middle fingers which is essential to differentiate R from U. Thus we omitted the signs H and R from of our experiments. For some data files, complete sensor values are not captured for some lines. We also excluded all the data lines that did not have all 22 sensor values. For each sign, we collected 140 tuples from the starting posture to the final posture, each including 22 sensor values. For the first set of experiments, we recognized all 24 signs using Neural Network [12], [13] and Multi-layer Neural Network [14]. We used nine data sets for training and tuning and one data set for testing the system and repeated the experiment 10 times. The result of this set of experiment is displayed in the first two columns of Figure 2. We then sorted the sensors based on their contribution to the total range of motion while making all signs. The result are displayed in Figure 4. This figure clearly states that since the cumulative range of motion for sensors 3,7,8,10 and 11 is zero, they have no role in making signs and can easily be omitted. So in the next set of experiments, we omitted data received

from these sensors and repeated the exact experiment as the first step (9 sets for training, 9 sets for tuning and 1 set for testing, repeating 10 times). The result of this set of experiments are display in Figure 6. As can be observed, the overall accuracy did not drop and is very close to the results obtained from the experiment including all sensors. In the last set of experiments, we omitted 5 mores sensors (12,1,4,2,9). These are the sensors that have the minimum cumulative range of motion while making ASL signs. We again repeated the experiment as step one, except we omitted ten sensors from our dataset. The result of this set of experiments is shown in Figure 6. 5. Conclusion & Future Work We proposed an approach for feature subset selection for hand gestures recognition based on the bio-mechanical characteristics of the hand during the formation of the gesture. Our approach has the following advantages over previous studies: It is simple It is application-independent Time complexity is very low Our approach also provides a very good understanding of the process and explains intuitively why some of the sensors can be omitted, the feature that most FSS are lacking. Our experimental results by recognizing static ASL signs confirm the effectiveness of our approach when applied to MTS generated by sensors attached to human hands. We plan to extend this work in two directions. First, we intend to extend our technique for recognizing hand gestures with out sensory device, e.g. vision based recognition. Second, we would like to show that in general, utilizing any other characteristic that defines the system on a higher level or abstraction rather than data layer (e.g. a bio-mechanical characteristic) provides both higher accuracy on result and less dependency on the data gathering process. References [1] G. H. John, R. Kohavi, and K. Pfleger : Irrelevant features and the subset selection problem. In Proceedings of the International Conference on Machile Learning 1994, pages 121-129, 1994 [2] B. Chakraborty : Feature Selection and Classification Techniques for Multivariate Time Series, Innovative Computing,Information and Control, International Conference on, pp. 42, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007), 2007 [4] H. Yoon; C. Shahabi: Feature Subset Selection on Multivariate Time Series with Extremely Large Spatial Features Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on Volume, Issue, Dec. 2006 Page(s):337-342 Digital Object Identifier 10.1109/ICDMW.2006.81 [5] A. Tucker, S. Swift, X. Liu: Variable grouping in multivariate time series via correlation. IEEE Trans. on Systems, Man, and Cybernetics, Part B 31 (2001) [6] K. Yang, H. Yoon, C. Shahabi: Clever: a feature subset selection technique for multivariate time series. Technical report, University of Southern California (2005) [7] E. Cantu-Paz, S. Newsam, C. Kamath: Feature Selection in Scientific Applications, Proceedings, ACM International Conference on Knowledge Discovery and Data Mining, pp 788-793, August 22-25, 2004, Seattle, WA. UCRL-CONF-202657 [8] J. Bi, K. Bennett, M. Embrechts, C. Breneman, and M. Song.: Dimensionality reduction via sparse support vector machines. JMLR, 3:12291243, 2003 [9] J. Yang, V. Honavar: Feature subset selection using a genetic algorithm Intelligent Systems and their Applications, IEEE Volume 13, Issue 2, Mar/Apr 1998 Page(s):44-49 [10] B.B. Qu, Y. Lu : A hierarchy reduct algorithm for feature subset selection, Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on Volume 2, Issue, 26-29 Aug. 2004 Page(s): 1157-1161 vol.2 [11] F. Parvini, C. Shahabi: An algorithmic approach for static and dynamic gesture recognition utilising mechanical and biomechanical characteristics, International Journal of Bioinformatics Research and Applications, Vol. 3, No. 1, 2007 [12] K. Boehm,W. Broll, and M. Sokolewicz: Dynamic gesture recognition using neural networks; a fundament for advanced interaction construction. SPIE Conference Electronic Imaging Science and Technology, San Jose, CA,1994. [13] K. Murakami, H. Taguchi: Gesture recognition using recurrent neural networks. CHI 91 Conference Proceedings, pages 237242,1991. [14] J. Eisenstein, S. Ghandeharizadeh, L. Golubchik, C. Shahabi, D. Yan and R. Zimmermann: Device Independence and Extensibility in Gesture Recognition, IEEE Virtual Reality Conference (VR), LA, CA, 2003 [15] N. B. Reese: Joint Range of Motion and Muscle Length Testing, ISBN 0721689426, Saunders Publication, 2001. [16] http://www.lifeprint.com/ [17] http://www.cyberglovesystems.com/products/cyberglove.php [3] H. Yoon, K. Yang, and C. Shahabi: Feature subset selection and feature ranking for multivariate time series. IEEE Trans. Knowl. Data Eng., 17(9):11861198, 2005