Sensor Based Time Series Classification of Body Movement

Similar documents
Activity Recognition Using Cell Phone Accelerometers

A novel approach to classify human-motion in smart phone using 2d-projection method

HOT asax: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery

Chapter 1, Introduction

Implementation of an IoT Sensor Data Collection and Analysis Library

Mobile Health Monitoring Based On New Power Management Approach

Namita Lokare, Daniel Benavides, Sahil Juneja and Edgar Lobaton

International Journal of Advanced Research in Computer Science and Software Engineering

FEATURE EVALUATION FOR EMG-BASED LOAD CLASSIFICATION

Web page recommendation using a stochastic process model

PIECEWISE LINEAR DYNAMICAL MODEL FOR HUMAN ACTIONS CLUSTERING FROM INERTIAL BODY SENSORS WITH CONSIDERATIONS OF HUMAN FACTORS

Human Activity Recognition in WSN: A Comparative Study

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

Deploying Distributed Real-time Healthcare Applications on Wireless Body Sensor Networks. Sameer Iyengar Allen Yang

Fall 2017 ECEN Special Topics in Data Mining and Analysis

GENERATING HIGH LEVEL CONTEXT FROM SENSOR DATA FOR MOBILE APPLICATIONS

Tutorial on Machine Learning Tools

Using Weka for Classification. Preparing a data file

Visualization and Statistical Analysis of Multi Dimensional Data of Wireless Sensor Networks Using Self Organising Maps

SCENARIO BASED ADAPTIVE PREPROCESSING FOR STREAM DATA USING SVM CLASSIFIER

Activity recognition and energy expenditure estimation

Performance Analysis of Data Mining Classification Techniques

Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection

Real-time interactive medical consultation using a pervasive healthcare architecture

Tour-Based Mode Choice Modeling: Using An Ensemble of (Un-) Conditional Data-Mining Classifiers

Detecting Harmful Hand Behaviors with Machine Learning from Wearable Motion Sensor Data

IJMIE Volume 2, Issue 9 ISSN:

WARD: A Wearable Action Recognition Database

CS229 Final Project: Predicting Expected Response Times

Event Detection using Archived Smart House Sensor Data obtained using Symbolic Aggregate Approximation

i-eeg: A Software Tool for EEG Feature Extraction, Feature Selection and Classification

Real-Time, Automatic and Wireless Bridge Monitoring System Based on MEMS Technology

COMPARATIVE ANALYSIS OF EYE DETECTION AND TRACKING ALGORITHMS FOR SURVEILLANCE

International Journal of Modern Trends in Engineering and Research e-issn No.: , Date: 2-4 July, 2015

TUBE: Command Line Program Calls

Fall Detection System for Elderly Based on Android Smartphone

Validation of Automated Mobility Assessment using a Single 3D Sensor

Programming-By-Example Gesture Recognition Kevin Gabayan, Steven Lansel December 15, 2006

TEMPORAL data mining is a research field of growing

Using Decision Boundary to Analyze Classifiers

Bayes Classifiers and Generative Methods

The Impact of Clustering on the Average Path Length in Wireless Sensor Networks

Supervised and unsupervised classification approaches for human activity recognition using body-mounted sensors

Improving Time Series Classification Using Hidden Markov Models

Context- and cost-aware feature selection in ultra-low-power sensor interfaces

A REVIEW ON LEACH-BASED HIERARCHICAL ROUTING PROTOCOLS IN WIRELESS SENSOR NETWORK

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

WIRELESS VEHICLE WITH ANIMATRONIC ROBOTIC ARM

An Effective Performance of Feature Selection with Classification of Data Mining Using SVM Algorithm

NORMALIZATION INDEXING BASED ENHANCED GROUPING K-MEAN ALGORITHM

Survivability Evaluation in Wireless Sensor Network

Digital Image Steganography Techniques: Case Study. Karnataka, India.

Distributed Bottom up Approach for Data Anonymization using MapReduce framework on Cloud

Symbolic Representation and Clustering of Bio-Medical Time-Series Data Using Non-Parametric Segmentation and Cluster Ensemble

Invariant Recognition of Hand-Drawn Pictograms Using HMMs with a Rotating Feature Extraction

A SIMPLE HIERARCHICAL ACTIVITY RECOGNITION SYSTEM USING A GRAVITY SENSOR AND ACCELEROMETER ON A SMARTPHONE

A STUDY OF SOME DATA MINING CLASSIFICATION TECHNIQUES

A Longitudinal Control Algorithm for Smart Cruise Control with Virtual Parameters

Road Detection Using Support Vector Machine based on Online Learning and Evaluation

Image Classification Using Text Mining and Feature Clustering (Text Document and Image Categorization Using Fuzzy Similarity Based Feature Clustering)

The Role of Biomedical Dataset in Classification

ACTIVITY/POSTURE RECOGNITION USING WEARABLE SENSORS PLACED ON DIFFERENT BODY LOCATIONS

Traffic Signs Recognition using HP and HOG Descriptors Combined to MLP and SVM Classifiers

Dominating Set & Clustering Based Network Coverage for Huge Wireless Sensor Networks

Non-intrusive Movement Detection in CARA Pervasive Healthcare Application

Individualized Error Estimation for Classification and Regression Models

Hydraulic pump fault diagnosis with compressed signals based on stagewise orthogonal matching pursuit

SOCIAL MEDIA MINING. Data Mining Essentials

Keywords: wearable system, flexible platform, complex bio-signal, wireless network

Flow-based Anomaly Intrusion Detection System Using Neural Network

Reduced Image Noise on Shape Recognition Using Singular Value Decomposition for Pick and Place Robotic Systems

Video Inter-frame Forgery Identification Based on Optical Flow Consistency

Network Coding Efficiency In The Presence Of An Intermittent Backhaul Network

Research on Evaluation Method of Video Stabilization

International Journal of Computer Science Trends and Technology (IJCST) Volume 5 Issue 4, Jul Aug 2017

Prognosis of Lung Cancer Using Data Mining Techniques

HUMAN COMPUTER INTERFACE BASED ON HAND TRACKING

ECE 285 Class Project Report

Object Detection in Video Streams

Correlation Based Feature Selection with Irrelevant Feature Removal

3-D MRI Brain Scan Classification Using A Point Series Based Representation

A ProM Operational Support Provider for Predictive Monitoring of Business Processes

Face Detection using Hierarchical SVM

Louis Fourrier Fabien Gaie Thomas Rolf

2. Design Methodology

Combination of PCA with SMOTE Resampling to Boost the Prediction Rate in Lung Cancer Dataset

Performance Evaluations for Parallel Image Filter on Multi - Core Computer using Java Threads

Performance Evaluation of Monitoring System Using IP Camera Networks

Supervised Learning for Image Segmentation

Artificial Neural Networks Lab 2 Classical Pattern Recognition

Temporal Weighted Association Rule Mining for Classification

Accelerometer Gesture Recognition

Multiresolution Motif Discovery in Time Series

Data Mining with Oracle 10g using Clustering and Classification Algorithms Nhamo Mdzingwa September 25, 2005

Discovery of Agricultural Patterns Using Parallel Hybrid Clustering Paradigm

A Data Classification Algorithm of Internet of Things Based on Neural Network

Estimating Data Center Thermal Correlation Indices from Historical Data

Social Behavior Prediction Through Reality Mining

Preface to the Second Edition. Preface to the First Edition. 1 Introduction 1

Computer Based Image Algorithm For Wireless Sensor Networks To Prevent Hotspot Locating Attack

Transcription:

Sensor Based Time Series Classification of Body Movement Swapna Philip, Yu Cao*, and Ming Li Department of Computer Science California State University, Fresno Fresno, CA, U.S.A swapna.philip@gmail.com, mingli@csufresno.edu *College of Engineering and Computer Science University of Tennessee at Chattanooga Chattanooga, TN, U.S.A yu-cao@utc.edu Abstract. Advances in sensing and monitoring technology are being incorporated into today s healthcare practice. As a result, the concept of Body Sensor Networks (BSN) has been proposed to describe the wearable/wireless devices for healthcare applications. One of the major application scenarios for BSN is to detect and classify the body movements for long-term lifestyle and healthcare monitoring. This paper introduces a new approach for analyzing the time series obtained from BSN. In our research, the BSN record the acceleration data of the volunteer s movement while performing a set of activities such as jogging, walking, resting, and transitional activities. The main contribution of this paper is the proposed time series approximation and feature extraction algorithm that can convert the sensor-based time series data into a density map. We have performed extensive experiments to compare the accuracy in classifying the time series into different activities. It is concluded that the proposed approach would aid greatly the development of efficient health monitoring systems in the future. To the best of our knowledge, no similar research has been reported in the BSN field and we expect our research could provide useful insights for further investigation. Keywords: Body Sensor Networks, Time Series Analysis, Human body movement, machine learning, classification. 1 Introduction Body sensor networks (BSN) [1, 2] enable an inexpensive approach for data collection. The BSN include sensor nodes that communicate wirelessly and they function with less consumption of power and less memory capacity. These sensor networks are very useful especially in the field of health monitoring as they are capable of monitoring temperature, humidity, force, acceleration and heartbeat [3, 4]. The sensor networks can thus be specifically used in interpreting human motion and thus in health care for the elderly. The BSN have their own advantage over other

types of motion or health monitoring systems as they are wearable, less expensive and more accurate with negligible environmental disturbances affecting data collection. While monitoring the health of patients, it is necessary to deal with streaming data. However, the streaming data is usually too large to process as raw data in BSN environment because the nodes in BSN have a small memory and processing capacity. Hence for the classification of patients activities, we need the suitable time series approximation and feature extraction techniques which can be easily deployed and the patterns can be analyzed in real-time. New approximation and feature analysis algorithms will make it easier to distinguish between activities such as jogging, resting, walking and transitional activities. The results from approximation and feature extraction will be feed into the machine learning based-classification algorithms. Some approaches have been proposed to deal with the classification of sensor data with real time approximation of the series [5], but it has not been tested in the area of BSN. There has been work done in the automatic segmentation of the time series for classification [6]. The symbolic time series representation SAX [7] has been a popular approach for a wide variety of time series data. The SAX representation allows the application of string manipulation algorithms for feature extraction. One such feature extraction method is the time series bitmap [8], which has previously been applied to sensor data analysis. In this paper, we proposed new approximation and feature extraction algorithms for BSN. We also test the accuracy of classification of the BSN-based time series data. The activities of four classes (jogging, resting, walking and transitional activities) are recorded in segments and applied with the SAX representation. The time series density map is then obtained from the symbolic representation. This map is then input to the classification algorithms. The classification algorithms are consisted of two steps: primary classification and secondary classification. The effectiveness of our proposed approach is validated using the percentage accuracy obtained from cross-validation. 2 Proposed Approach The proposed approach in our research is divided into three steps: data collection, feature extraction, and classification. The BSN we use include two types of nodes: the sender and the receiver which communicate wirelessly with each other [9]. The sender has the accelerometer sensor attached to it, and it is controlled with the use of a sampling program available with the BSN. The sender node is fixed to the volunteer s hand and the acceleration data of the hand movement is recorded for a period of 8 seconds while he/she performs the following activities: jogging, walking, resting, picking up and putting down things, transitional activities (lying to sitting and, sitting to standing). This data is stored offline for further use. Fig. 1 shows the steps involved in this approach.

Fig. 1 The overview of our proposed approach 2.1 Data collection from BSN This is the first step in our proposed approach. The process of data collection involves setting up the BSN and recording of the acceleration of the volunteer s hand movements while the four classes of activity (jogging, walking, resting, and transitional) are being performed. The X, Y and Z acceleration data are recorded in a CSV file for as long as eight seconds. The yellow curve (Series 3) of Figure 2 represents the X acceleration. The pink curve (Series 2) of Figure 2 indicates the Y acceleration. The blue curve (Series 1) of Figure 2 is the Z acceleration. Thus for each class we record a collection of 100 instances. A sample time chart of the jogging activity is shown in Fig. 2. 2.2 Preprocessing and approximation Our second step is to adopt SAX like algorithm [7] to the time series of each activity. The Sax algorithm converts the time series into a symbolic representation. The advantages of using SAX include dimensionality reduction, discretization, lower distance bounding and numerosity reduction which makes it apt for the body sensor network data. The procedure is as follows: first the time series is normalized to remove noise, the time series of length n is divided into w equal parts and the average 9000 8000 7000 6000 5000 4000 3000 2000 1000 0 1 9 17 25 33 41 49 57 65 73 81 89 97 105 113 121 129 Series3 Series2 Series1 Fig. 2 The time chart for jogging activity. The yellow curve (Series 3) represents the X acceleration. The pink curve (Series 2) indicates the Y acceleration. The blue curve (Series 1) is the Z acceleration.

of each part is calculated. This is piecewise aggregate approximation (PAA) of the sequence. The cutpoints are determined from the Gaussian look up table. Usually the number of cutpoints is set as 3 and hence 4 symbols would be used for time series. Then the PAA which lies in the specified cut point slot is assigned a unique symbol or character. Thus we obtain the symbolic representation of the time series. For more details of the SAX algorithm, please refer to the original paper [7]. 2.3 Feature extraction In the third step of our proposed approach, the SAX approximation is converted to a time series density map or bitmap. This approach is particularly useful while streaming real time data for classification. It is small and efficient enough to be handled by the BSN process or itself. The algorithm is described as follows. The Sax representation from the previous step (second step) has its alphabet size as four. We assign each alphabet a key. For example if A, G, C and T were the symbols used, they are assigned key values 1, 2, 3 and 4. Now the SAX representation is converted into a time series density map by moving a sliding window across the pattern and updating a frequency matrix of length 2^l x 2^l. The value of l is set as three since it has been found to work best in many experiments. It is also the length of the combination of symbols used in the Sax representation. An example of the matrix for length l=1, 2 and 3 is shown in Fig. 3. Suppose l=4, the first word in the sequence of the example string GATCACAGGTCTATCACCC is GATC. Let k0 = G, k1 = A, k2 = T and k3 = C be the key values. To map this word to a bitmap we use the formula, l 1 l 1 l n 1 l n col = ( * 2 ) mod2 = ( / 2 ) k n n= 0 Fig. 3 The mapping of words of l = 1, 2 and 3 l n 1 row k n * 2. Thus we obtain n= 0 the density map of patterns in the sequence or the bitmap of the sequence, by sliding a window of length l through it and updating the matrix based on the frequency of the

possible combinations in the sequence. The output is an 8 x 8 matrix of cells which is time series density map of the specific activity instance. A training file is thus created with the 64 values as attributes for all the 400 instances of activity, including their class specified. This is fed as input to machine learning-based classifier. 2.4 Classification This is the last step of our proposed approach. In order to evaluate the classification performance of the proposed approximation and feature extraction algorithms (step 2 and step 3), six machine learning-based classification algorithms were chosen. They are Bayes net, Naïve bayes multinomial, Simple logistic, Logit boost, Rotation forest, and Random forest. The classification was divided into primary and secondary steps. Primary classification performed the upper level classification for the four activities. The secondary classification performed lower division classification on the output of the primary i.e. the transitional activities were further classified into working (data pick up and put down) and transition activities (lying to sitting and sitting to stand). Training and cross validation testing are then performed on the training file. The evaluation of results is discussed next in the paper. 3. Experimental Result 3.1 Data input to classifier The training data file contained 400 instances in total with four class divisions and 64 attributes. This was for the primary classification. The file for the secondary classification contained 200 instances with two class divisions and 64 attributes. 3.2 Classification results The classification results for the primary and secondary are shown in Table 1 and Table 2. Six classifiers have been used. The X, Y, and Z classification have been performed separately and their average result is also shown for number of folds as 8 for primary and 12 for secondary. The primary classification receives around 90 percent accuracy for all classifiers, while the secondary classification achieves only 70 percent. This shows that if the actions are well defined the time series density map would make a good choice for feature extraction in body sensor networks.

Eight Folder Cross Validation Bayes Net Naïve bayes Multinomial Simple Logistic Logit Boost Rotation Forest accelerometer X 90 90 90 88 90 accelerometer Y 90 90 90 90 88 accelerometer Z 90 88 91 89 89 average 90 89 90 89 89 Table 1 Percentage classified correctly for primary classification Random Forest 90 89 89 89 Eight Folder Cross Bayes Naïve bayes Simple Logit Rotation Random Validation Net Multinomial Logistic Boost Forest Forest accelerometer Table 2 Percentage X classified 67 71 correctly for secondary 74 classification 71 76 75 accelerometer Y 48 63 85 78 82 77 accelerometer Z 48 57 62 58 67 68 average Fig. 6 Sample output 54 from Weka 64 for random 74 forest 869 fold cross 75 validation 73 testing (Primary) Table 2 Percentage classified correctly for secondary classification 4. Conclusion We have shown in this paper that the time series density maps feature extraction method works well for the body sensor network data. It has also been estimated that this method has a high percentage of accuracy when it comes to classifying body movements and activities of BSN data. The feature extraction technique used here can be easily deployed on the BSN, enabling quick, time and space efficient and accurate activity classification with streaming sensor data. This can be implemented as future work, where we can use more sensors and nodes and improve the accuracy further by using combined data. We will also investigate the new feature extraction and classification algorithms under challenging environments, such missing sensor data, low quality of service, and unreliable sensor data. Our ultimate goal is to develop effective and efficient classifier for real-world clinical applications and thus serve a good purpose in health monitoring systems. References [1] J. Yicka, et al., "Wireless sensor network survey," Computer Networks, Elsevier, vol. 5, pp. 2292-2330, 2008. [2] W. Su, et al., "Wireless sensor network survey," Wireless sensor networks: a survey, vol. 38, pp. 393-422, 2002. [3] G.-Z. Yang, Body Sensor Networks. New York, USA: Springer Science+Business Media LLC, 2006. [4] S. R, et al., "Body Area Network BAN--a key infrastructure element for patientcentered medical applications," Biomedizinische Technik. Biomedical engineering, vol. 47, pp. 365-368, 2002.

[5] S. Kasetty, et al., "Real-Time Classification of Streaming Sensor Data," in Proc. of 20th IEEE Int'l Conference on Tools with Artificial Intelligence, 2008. [6] E. Guenterberg, et al., "An Automatic Segmentation Technique in Body Sensor Networks Based on Signal Energy," in Proc. of The Fourth International Conference on Body Area Networks (BodyNets), 2009. [7] J. Lin, et al., "A symbolic representation of time series, with implications for streaming algorithms," in Proc. of the 8th SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, San Diego, California, USA, 2003. [8] N. Kumar, et al., "Time-series Bitmaps: a Practical Visualization Tool for Working with Large Time Series Databases," in Proc. of SIAM 2005 Data Mining Conference Newport Beach, Caliornia, USA, 2005. [9] G.-Z. Yang, et al. (2010). Tutorial on Body Sensor Networks. Available: http://vip.doc.ic.ac.uk/bsn/tutorial