Motion Interpretation and Synthesis by ICA

Similar documents
Unsupervised Learning

Generating Different Realistic Humanoid Motion

Thiruvarangan Ramaraj CS525 Graphics & Scientific Visualization Spring 2007, Presentation I, February 28 th 2007, 14:10 15:00. Topic (Research Paper):

Human Motion Synthesis by Motion Manifold Learning and Motion Primitive Segmentation

Motion Texture. Harriet Pashley Advisor: Yanxi Liu Ph.D. Student: James Hays. 1. Introduction

Synthesis and Editing of Personalized Stylistic Human Motion

Learnt Inverse Kinematics for Animation Synthesis

Graph-based High Level Motion Segmentation using Normalized Cuts

Motion Synthesis and Editing. Yisheng Chen

CIS 520, Machine Learning, Fall 2015: Assignment 7 Due: Mon, Nov 16, :59pm, PDF to Canvas [100 points]

Network Traffic Measurements and Analysis

Spectral Style Transfer for Human Motion between Independent Actions

Statistical Learning of Human Body through Feature Wireframe

Term Project Final Report for CPSC526 Statistical Models of Poses Using Inverse Kinematics

CSE 481C Imitation Learning in Humanoid Robots Motion capture, inverse kinematics, and dimensionality reduction

Epitomic Analysis of Human Motion

Human Action Recognition Using Independent Component Analysis

GRAPH-BASED APPROACH FOR MOTION CAPTURE DATA REPRESENTATION AND ANALYSIS. Jiun-Yu Kao, Antonio Ortega, Shrikanth S. Narayanan

10-701/15-781, Fall 2006, Final

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

CSC 411: Lecture 14: Principal Components Analysis & Autoencoders

Style-based Inverse Kinematics

Segment-Based Human Motion Compression

3D Human Motion Analysis and Manifolds

Clustering and Visualisation of Data

THE capability to precisely synthesize online fullbody

Motion Style Transfer in Correlated Motion Spaces

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2016

( ) =cov X Y = W PRINCIPAL COMPONENT ANALYSIS. Eigenvectors of the covariance matrix are the principal components

FMRI data: Independent Component Analysis (GIFT) & Connectivity Analysis (FNC)

PCA vs. ICA Decomposition of High Resolution SAR Images: Application to Urban Structures Recognition

Data Mining Chapter 3: Visualizing and Exploring Data Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Recognition: Face Recognition. Linda Shapiro EE/CSE 576

Learning Alignments from Latent Space Structures

CS 534: Computer Vision Segmentation and Perceptual Grouping

INDEPENDENT COMPONENT ANALYSIS APPLIED TO fmri DATA: A GENERATIVE MODEL FOR VALIDATING RESULTS

Graph-Based Action Models for Human Motion Classification

Human Motion Database with a Binary Tree and Node Transition Graphs

Facial Motion Capture Editing by Automated Orthogonal Blendshape Construction and Weight Propagation

Feature Selection Using Principal Feature Analysis

CSE 6242 A / CS 4803 DVA. Feb 12, Dimension Reduction. Guest Lecturer: Jaegul Choo

Unsupervised learning in Vision

Grundlagen der Künstlichen Intelligenz

Human Activity Recognition Based on Weighted Sum Method and Combination of Feature Extraction Methods

Stylistic Motion Decomposition

Applying Supervised Learning

Dimension Reduction CS534

Unsupervised Learning

DI TRANSFORM. The regressive analyses. identify relationships

Clustering K-means. Machine Learning CSEP546 Carlos Guestrin University of Washington February 18, Carlos Guestrin

Feature selection. Term 2011/2012 LSI - FIB. Javier Béjar cbea (LSI - FIB) Feature selection Term 2011/ / 22

Dimension reduction : PCA and Clustering

Unsupervised Learning for Speech Motion Editing

22 October, 2012 MVA ENS Cachan. Lecture 5: Introduction to generative models Iasonas Kokkinos

Motion Track: Visualizing Variations of Human Motion Data

Human pose estimation using Active Shape Models

Dimensionality Reduction, including by Feature Selection.

Static Gesture Recognition with Restricted Boltzmann Machines

CS 231. Motion Capture Data I. The Pipeline. Bodenheimer et al

General Instructions. Questions

Optimal Segmentation and Understanding of Motion Capture Data

Image Processing. Image Features

Robust Face Recognition via Sparse Representation

Clustering & Dimensionality Reduction. 273A Intro Machine Learning

Full Body Tracking Using an Agent-based Architecture

Slides adapted from Marshall Tappen and Bryan Russell. Algorithms in Nature. Non-negative matrix factorization

7.1 INTRODUCTION Wavelet Transform is a popular multiresolution analysis tool in image processing and

Understanding Faces. Detection, Recognition, and. Transformation of Faces 12/5/17

DISTANCE MAPS: A ROBUST ILLUMINATION PREPROCESSING FOR ACTIVE APPEARANCE MODELS

Video based Animation Synthesis with the Essential Graph. Adnane Boukhayma, Edmond Boyer MORPHEO INRIA Grenoble Rhône-Alpes

Final Exam Schedule. Final exam has been scheduled. 12:30 pm 3:00 pm, May 7. Location: INNOVA It will cover all the topics discussed in class

CSE 258 Lecture 5. Web Mining and Recommender Systems. Dimensionality Reduction

Object and Action Detection from a Single Example

NIC FastICA Implementation

Component-based Locomotion Composition

CS 534: Computer Vision Segmentation and Perceptual Grouping

Training-Free, Generic Object Detection Using Locally Adaptive Regression Kernels

Achieving Good Connectivity in Motion Graphs

Locality Preserving Projections (LPP) Abstract

CSE 40171: Artificial Intelligence. Learning from Data: Unsupervised Learning

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017

Nonlinear dimensionality reduction of large datasets for data exploration

Linear Methods for Regression and Shrinkage Methods

SELECTION OF THE OPTIMAL PARAMETER VALUE FOR THE LOCALLY LINEAR EMBEDDING ALGORITHM. Olga Kouropteva, Oleg Okun and Matti Pietikäinen

Hierarchical Gaussian Process Latent Variable Models

Analysis of Functional MRI Timeseries Data Using Signal Processing Techniques

Expanding gait identification methods from straight to curved trajectories

CSE 255 Lecture 5. Data Mining and Predictive Analytics. Dimensionality Reduction

Gaussian Process Dynamical Models

Note Set 4: Finite Mixture Models and the EM Algorithm

A Study on Different Challenges in Facial Recognition Methods

Computer vision: models, learning and inference. Chapter 13 Image preprocessing and feature extraction

Image Restoration and Background Separation Using Sparse Representation Framework

Dimension Reduction of Image Manifolds

ECG782: Multidimensional Digital Signal Processing

Infographics and Visualisation (or: Beyond the Pie Chart) LSS: ITNPBD4, 1 November 2016

Does Dimensionality Reduction Improve the Quality of Motion Interpolation?

Stylized synthesis of facial speech motions

PATTERN RECOGNITION USING NEURAL NETWORKS

Human Gait Recognition Using Bezier Curves

Transcription:

Motion Interpretation and Synthesis by ICA Renqiang Min Department of Computer Science, University of Toronto, 1 King s College Road, Toronto, ON M5S3G4, Canada Abstract. It is known that high-dimensional human motion data lies in a low dimensional space. Looking for independent sub-motions that contribute to generating complicated human motions is an interesting task on its own merits. Independent Component Analysis (ICA) can extract independent factors of an observed signal. In this paper, ICA is applied to the reconstructed human motion data by PCA. Experimental results show that, if the low dimenality is appropriately chosen, ICA can find meaningful sub-motions. Independent sub-motions make it easy to complete missing human motion data. Preliminary results on synthesizing missing human motion data by self-regression using Gassian Process will also be discussed in the paper. Key words Human Motion Interpretation, Motion Synthesis, Independent Componenet Analysis, Gaussian Process 1 Introduction In computer graphics, a lot of motion capture data has been recorded for studying different kinds of human motions such as walking, running, jumping, and dancing ect. These raw data is often high-dimensional in their original formats due to the large number of markers. And it is known that these high-dimensional data often lies in a low dimensional space. PCA is often used to do dimensionality reduction thereby to get a compact representation of the original motion data. However, the low dimensions calculated by PCA only account for the variance of the data on some orthogonal directions. Sometimes, we are interested in a small set of independent factors that contribute to generating different kinds of human motions. For such tasks, ICA becomes a natural choice. Obtaining independent factors of human motion data is interesting for several reasons. First, independent factors suggest possible independent sub-motions that contribute to generating complicated motions. It gives us meaningful interpretations of human motions. This task is interesting on its own merits. Second, it makes motion style transfer easy. Given some different styles of motions, we can extract the corresponding independent components that are specific to respective styles. By replacing the set of style-specific independent components in one motion with a desired set of style-specific components, we

will get the motion with the desired style, which has been done by [?]. We can also arrange the motion data with each arrangement dimension representing different features, for e. g., features can be identity and style. Then we can apply multi-linear ICA to the data to extract feature-specific indpendent factors for style transfer. A similar idea for doing face transfer using multi-linear PCA is described in [3]. Third, obtaining independent factors of human motion data suggests new ways of synthesizing and predicting missing segments of motions. For some complicated motions like dancing, if some segments of the data are missing, it is hard to synthesize and predict them directly using a bruteforce approach. Nevertheless, if the orignal motion data can be decomposed into several independent sub-motions, especially periodic, we can handle each sub-motion seperately, and it will make the synthesis/prediction task much easier. In contrast, most existing approaches to motion synthesis/prediction take the high-dimensional motion as a whole and generate missing motions by approximation. Some recent algorithms for motion synthesis create novel motions by choosing pieces from a motion database and putting them together with smoothing to form a new motion [?]. In the picking pieces process, some constraints such as continuity constraints, path constraints, frame constraints, and ect, might be satisfied depending on different goals. All of these systems are based on subsequence matching. A recent system decribed in [6] is very successful in synthesizing motions, it is still a matching based system but implemented in a hierarchical fashion. In [7], a two-layer time-series graphical model based on Gaussian Processes (GP) is used to model the latent low dimensional dynamics of motions, and its application to fitting missing data and to predict short range subsequent motions is discussed. In this paper, some preliminary results for synthesizing missing motion data by self-regression for each independent component will be presented. Although the results are not very good, I believe that powerful methods can be used to produce good results by making better use of the advantage given by the independence of sub-motions. This paper is organized as follows: in section, I will briefly compare PCA and ICA first, then I will describe in details the approach to human motion interpretation based on PCA and ICA and motion synthesis based on independent components; in section 3, I will present some experimental results about the interpretation of different styles of walking; in section 4, I will show the preliminary results on synthesizing missing data using self-regression; I will conclude the paper with some disscussions and future work in section 5. Motion interpretation and synthesis using PCA and ICA.1 PCA and ICA PCA produces a linear mapping by generating orthogonal directions preserving the variance of data as much as possible. Suppose there is a motion capture

data set X = [x (1),,x (N) ], where x (i) is an n-dimensional column vector, n is number of markers, and N is the number of frames. The mean of the data vectors is µ. Suppose V contains the eigenvectors of the covariance matrix of X, we can get the projection y of a data vector x as follows: y = V(x µ) (1) If ˆV contains the eigenvectors corresponding to the largest several eigenvectors, we can reconstruct a high-dimensional data vector ˆx from a low-dimensional vector ŷ as follows: ˆx = V T ŷ + µ () Standard ICA also gives a linear mapping. Suppose we also have the data matrix X above, ICA assumes that the data matrix X is generated by mixing some latent components having independent non-gaussian distributions. Unlike that PCA can only find correlations between components of data matrix, ICA can capture higher-order statistics in the data. ICA unmixes the data matrix X as follows: X = AS (3) Where A is the mixing matrix and S contains the independent components. There are several approaches to implementing ICA from different pointviews such as by minimizing mutual information between different components and by projection pursuit which minimizes the non-gaussianity of each component. In this paper, FastICA [8], which is an iterative procedure combining the above listed pointviews, is used to extract independent factors of human motion data.. Motion interpretation ICA can be applied directly to the motion data to extract independent components as described in section.1. Due to the high-dimensions of the original data, squre ICA will output a lot of independent factors, in which each factor only conveys a tiny bit of information. Thus, it is very hard to interpretate each independent factor. And because the factors are independent to each other, even if k-mean clustering is run on them, meaningful interpretation for each cluster still cannot always be obtained. If under-complete ICA is applied, the obtained independent components cannot reconstruct the original motion data well, and it makes it hard for motion synthesis to use the obtained independent components. In this paper, PCA is applied to the motion data X to do dimensionality reduction first. In this phase, the most important information is preserved and some noisy information is removed. The reconstructed motion data is claculated by Equation. In the second phase, ICA is applied to the reconstructed motion data generated by PCA. Since the reconstructed data lies in a low dimensional space, it is very easy for ICA to unmix the reconstructed data to extract some independent components S. These independent components often correspond to meaningful interpretations as shown in section 3. 3

.3 Synthesize and predict missing motion data Given a sequence of human motion data with some missing segments, we can synthesize and predict the missing data using the decomposed independent components. Since the components are independent and each component represents a sub-motion, we can consider one component one time during the synthesis. The extracted components are often periodic, thus, the missing segments of each component can be predicted by k-nn or by parametric periodic functions. In this paper, we will try to use a simple idea to predict the missing segments. On each component, we learn a self-regression model based on Gaussian Process (GP). That is, for each component, we get many input-output pairs (C t, C t+1 ) with the component value at time t as input and the component value at t+1 as output, where t = 1,,N 1. N is the total number of frames. 3 Experimental results on human motion interpretation In the experiments, the CMU motion capture dataset is used. The dimensionality of the original motion data is 6. At each frame, the coordinates and the oritentations of the root and the joint angles are recorded. Experiments on two styles of walking are performed to extract meaningful submotions. When the dimension corresponding to the tranlation of the root along a line is left unchanged, several independent factors seem to explain the main translation of the root. In all the subsequent experiments discussed, the main translation of the root is set to zero in order to focus on the movement of the joints. Using the approach described in section., when the dimensionality of a sequence of walking data is reduced to 1, each independent component (IC) extracted by ICA seems to correspond to a meaningful sub-motion of walking. IC1 corresponds to moving the right foot backward periodically. IC corresponds to the preparation stage for moving the left foot backward. IC3 corresponds to moving the left feet backward. IC4 and IC9 model the vertical movement of the root during the walking. IC5 and IC6 seem to model the movement of the right hand with the corresponding movement of the feet with the left hand fixed. IC7 seems to model the movement of the right upper leg and IC8 seems to model the movement of the left upper leg. IC1 models the movement of the right hand with the corresponding movement of the legs with the left hand fixed. When the dimensionality is reduced to 5, then ICA is applied. The extracted components seem to correspond to the combinations of the above extracted components. The extracted ICs still show the independent sub-motions, but they do not reflect the sub-motions of body parts so well as the ICs when the dimensionality is 1. The movies produced cannot be shown here. Currently, they are still in the amc style. 4

4 Experimental results on missing motion synthesis and prediciton Following the discussion in section.3, I will use a self-regression model to synthesize and predict the missing motion data. The experiment was performed on a sequence of walking motion data. The total number of frames is 34. The first 8 frames are used as training data, and I try to predict the following 6 frames. The training data is reduced to 3 dimensions by PCA first, and on each of the 3 independent components, a self-regression model based on GP is trained. Figure 1 and Figure shows the components corresponding to PCA and ICA respectively. 5 PCA pc1 5 5 1 15 5 3 35 5 pc 5 5 1 15 5 3 35 1 5 pc3 5 1 5 1 15 5 3 35 Fig.1. The principal components of the walking data produced by PCA. 1 IC 1 1 5 1 15 5 3 1 IC 1 5 1 15 5 3 1 IC 3 1 5 1 15 5 3 Fig.. The independent components of the walking data produced by ICA. 5

From the above two figures, we can easily find that the components generated by ICA is more periodical than the ones generated by PCA. Thus, it s reasonable to believe that the independent components can provide a big advantage to predicting missing data. In the experiment, the self-regression model based on ICA can predict about half of a walking cycle whereas the self-regression model based on PCA alone can predict nothing. The above experiment is just a toy experiment. To synthesize and predict missing motion data, the dimensionality should be made slighly bigger, for e. g., 1. But the experimental result still shows that prediction based on ICA is better, and it also shows that the self-regression model fails to work well on missing data synthesis and prediciton. Because the input-output pairs (C t, C t+1 ) are used to train the self-regression model, but C t and C t+1 ) are often very close to each other, it is very hard for the self-regression model to capture the overall shape of the distribution of the component data. It s very likely that k-nn or parametric periodic functions will do a much better job on synthesizing the missing segments of each independent component. 5 Discussion and future work In this report, ICA is used to extract the independent sub-motions of human motion data. It shows that it is hard to interpretate each independent component if square ICA or under-complete ICA is applied directly to the motion data; if PCA is used to do dimensionality reduction on the motion data first and the dimensionality is appropriately chosen, the extracted independent components correspond to meaningful sub-motions of body parts. It also shows that the independent components extracted by ICA have some advantages over principal components when they are used for synthesizing or predicting missing motion data. The self-regression model based on GP fails to give good results in this paper because only C t is used to regress C t+1. If we use C t k:t to regress C t+1, the performance might improve. And k-nn and parametric periodic functions might also be promissing candidate tools. Besides, we can also synthesize the missing segments of each component by maximizing the likelyhood of the whole sequence given the observed data. Unlike k-nn, a powerful probabilistic model can be used here to estimate the missing segments of each component simultaneously. In the future, annotations of motion data can be incorporated to assist motion transfer and motion synthesis based on the independent factors extracted by ICA. A probabilistic model can be used to model each component. When predicting missing data, the anotation labels provide some prior information about the missing segments. References 1. Y. Cao, P. Faloutsos, and F. Pighin. Unsupervised learning for speech motion editing. SIGGRAPH 3. 6

. A. Shapiro, Y. Cao, and P. Faloutsos. Interactive Motion Decomposition. 3. D. Vlastic, M. Brand, H. Pfister, and J. Popovic. Facer transfer with multilinear models. SIGGRAPH 5. 4. J. Lee, J. Chai, P. Reitsma, J. Hodgins, and N. Polland. Interactive control of avatars animated with human motion data. SIGGRAPH. 5. Y. Li, T. Wang, and H.-Y. Shum. Motion texture: a two-level statistical model for character motion synthesis. SIGGRAPH. 6. Arikan, D. Forsyth, and J. O Brien. Motion synthesis from annotations. ACM SIG- GRAPH 3), Vol: 33, No: 3, pp 4 48, 3 7. J. Wang, D. Fleet, and A. Hertzman. Gaussian process dynamical models. Advances in Neural Information Processing Systems, 5 8. A. Hyvarinen. Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Transactions on Neural Networks 1(3):66-634, 1999. This article was processed using the L A TEX macro package with LLNCS style 7