Visual Tracking of Human Body with Deforming Motion and Shape Average
|
|
- Basil Davidson
- 5 years ago
- Views:
Transcription
1 Visual Tracking of Human Body with Deforming Motion and Shape Average Alessandro Bissacco UCLA Computer Science Los Angeles, CA UCLA CSD-TR # Abstract In this work we present a novel approach for tracking human body in video sequences. We model the human skeleton as a kinematic chain of body parts which undergo a transformation composed of rigid motion and shape variation. Tracking is formulated as the problem of minimizing a cost functional with respect to the unknown position and shape of the body parts. 1 Introduction The problem of tracking humans in video streams has received great attention in the last years. The recent availability of commercial hardware for the capture, transmission and processing of full resolution video data has opened a wide range of new applications in this domain. Image-based human tracking might play a prominent role in the next generation of surveillance systems and human computer interfaces. Systems for measuring human motion from video data can also constitute a valuable support for various fields, ranging from kinesiology, rehabilitation in biomechanics, to technical training in sports and performing arts. Estimating the pose of the human body in a video stream is a difficult problem because of the significant variations in the appearance of the object throughout the sequence. Illumination, viewing conditions, relative position and orientation, self-occlusions, all contribute to make the task of matching human body parts between image frames remarkably difficult. In this work we propose an approach to visual tracking that models the human skeleton as a kinematic chain of rigid links. We formulate the problem as the minimization of a cost functional with respect to rigid motion and shape of the body parts. 2 Related work Different approaches have been proposed for the problem of visual tracking of human motion (see [1, 9] for a survey). They can be classified in two main types, depending on whether a priori models of the shape of the human body are used. Approaches that do not use shape models, such as in [21] and [14] usually rely on heuristic procedures to find correspondences of body parts between frames of video sequences. In [18] a variational approach exploiting motion information is used for detection and tracking of arbitrary objects. Model-based approaches can be divided in single view [12, 22, 25] and multi-view [7, 26, 8], 2D motion models [15] and 3D models [20, 5]. Most of these approaches require manual initialization in the first frame. Soatto et al. [23] propose a framework for modeling the motion of deforming objects. A nonrigid transformation is seen as the composition of a group action g on a particular object, on top of which a local deformation is applied. In this setting the notion of average shape is defined as the one that minimizes the deformations. Bregler and al. [24] have proposed a variety of methods to model non-rigid motions. Such models are built as linear combinations of a collection of key poses, learned using principal component analysis from motion capture data. Local representations of motion based on optical flow have been exploited in [3, 16], and view-based methods are proposed in [2, 10]. Other approaches are based on principal component analysis [27]. In [6] a mixed-state statistical model for the representation of motion has been proposed. In this Switching Linear Dynamic Model a stochastic finite-state automata at the highest level switches between local linear Gaussian models. Estimation and recognition is performed
2 with expectation maximization approaches, using particle filters [17, 4] or structured variational inference techniques [19]. 3 Modeling Human Body Motion In this paper we focus on the problem of estimating the pose of a human body in a video sequence. The ultimate goal is to build a system that, if properly initialized, can reliably track the configuration of an articulated object such as a human body from a sequence of monocular images. We do not consider the issue of model initialization, instead we assume that the configuration of the object in the first frame is given, for example, by manual initialization. We model the human skeleton as a kinematic chain of rigid bodies. Each body segment is represented by a rigid link with ellipsoidal or conic support, and the links are connected together by joints. To restrict the set of admissible motions and reduce the ambiguities in the estimation, we assume that each joint allows for one single degree of freedom, a rotation around its axis. This constraint is justified by the fact that typically the motion of the limbs in a walking gaits can be approximated as planar around an axis perpendicular to the direction of walking. Obviously the assumption of rigid object is not met by body parts. Various factors contribute to change the appearance of the limbs in the sequence, such as illumination, viewing angle, occlusions, and so forth. Because of these large variations any standard template matching technique is doomed to fail if directly applied to this problem. Our solution is to model the transformations undergone by body limbs as composition of rigid motions and shape variations. We formulate a cost functional written in terms of the position of the links of the kinematic chain and their shape, represented by a weight function. 3.1 Exponential maps for motion representation Among the possible representations of rigid motion, a sensible choice for our application is the one based on exponential maps. In this parameterization, arbitrary 3D motion can be encoded in a 6 dimensional vector ξ, called twist: ξ = v x v y v z ω x ω y ω z The twist describes a rotation around an arbitrary axis in the space: the axis and amount of rotation is given by the vector ω = ω x ω y, the location of the rotation axis and the amount of translation along this axis is given by the remaining three components v = v x v y. v z The matrix form G SE(3) of the rigid motion represented by the twist ξ is given by: G = eˆξ, ˆξ = ω z 0 ω z ω y v x ω z 0 ω x v y ω y ω x 0 v z Exponential maps have several advantages over other parameterizations of 3D rotations. They do not suffer the problem of singular configurations as the Euler angles, and as opposed to quaternion or matrix representations it is not necessary to constrain a number of parameters larger than the degrees of freedom to a set of admissible values. Derivatives of exponential maps with respect to their parameters can be computed in closed form but do not have simple expressions. We refer the reader to [11] for the details. 3.2 From kinematic chains to images Given the coordinates of a point on a link of kinematic chain, we want to compute its projection on the image plane. If the variation in depth of the points on the articulated object is small compared to the distance from the camera, we can approximate the transformation to scaled orthographic projection. This conditions are generally met in video sequences of walking people. In the following we consider the reference frames associated with links centered on the joints and having the z axis oriented along the direction of the joint axis. Let p o = [ x o y o z o 1 ] T be the homogeneous coordinates of a point relative to the reference frame of link l. The coordinates (x, y) of its projection onto the image plane are given by:
3 [ x y ] = g l (p o, Θ) = g l (p o, ξ 1,..., ξ L, s) (1) = sπeˆξ 1 g 12 eˆξ 2 g 23 eˆξ3 eˆξ l p o (2) [ ] where Π =, ξ 1 = [ ] v x v y v z ω x ω y ω z gives position and orientation of the first link, g ij is the transformation from reference of link i to reference of link j, and ξ i = [ ] ω z gives the rotation of link i around the axis of the associated joint. 3.3 Matching deforming regions Consider the problem of tracking the position of a rigid object in a sequence of images. A standard approach is to minimize the sum of squared differences between the intensity of the pixels in the model M and the corresponding pixels in the images I i. Here the model can be the first frame where the position of the object is given. Let Ω be the set of points on the model that belongs to the object and g(., Θ i ) the transformation that maps points on the model to corresponding points on the image I i. Then the function to minimize with respect to the motion parameters Θ i is: E = n f i=1 Ω (I i (g(p, Θ i )) M(p)) 2 dp (3) As previously mentioned, this simple solution cannot cope with the significant changes in appearance present in the case of motion of body parts. Our approach is to allow for deviations from the original template by introducing a weight map W (x) that defines the shape of the object. W (p) ranges between 0 and 1, being 1 for points inside the object and 0 for points outside. Then we can estimate the position of the object and its shape my minimizing the cost functional written in terms of the motion Θ i and the weight map W : E = n f i=1 Ω (I i(g(p, Θ i )) M(p)) 2 W (p)dp Ω W (p)dp (4) Notice that we have introduced a term at denominator in the cost function. This term prevents to have W 0 as solution to the minimization problem. 3.4 Modeling Self-Occlusions Visual tracking of complex articulated objects such as the human body demands for explicit modeling of selfocclusions. This can be done by introducing a visibility function V in the cost functional. Consider the case of an articulated object with L links, we have: E = n f = n f i=1 i=1 Ei(Θi, W ) = Ll=1 (I i (g l (p,θ i )) M l (p)) 2 W l (p)v l (p,θ i )dp Ll=1 W l (p)v l (p,θ i )dp Where W = (W 1, W 2..., W L ) and: { 1 if p link l is visible in pose Θ V l (p, Θ) = 0 otherwise In order to compute derivatives of the energy with respect to motion parameters Θ, we need to find an analytical expression for the visibility function V l (p, Θ). To this purpose we can use the signed distance function of a point p from a closed curve. It is defined as the minimum distance of p from a point on the curve, with the plus sign if p is outside and the minus sign if p is inside the curve. In the case at hand the shapes are simple and the distance can be analytically computed from the its parameters. Let d l (p) the signed distance function of the projection of p from the contour of the link l on the image. Then we can write V as: V l (p, Θ) = H(d j (p, Θ)) j F (l) where F (l) = {j : link j is in front of link l} and H(.) is the heaviside function: { 0 if x < 0 H(x) = 1 if x 0 Assuming that the order of visibility defined by F (.) does not change during the motion, we can compute the derivatives of V as: V l (p, Θ) Θ = k F (l) δ(d k (p, Θ)) d k Θ (p, Θ) 4 Tracking Algorithm j F (l),j k (5) H(d j (p, Θ)) The first step of the algorithm is to build a model of the appearance of the human body in the sequence. We use a kinematic chain manually initialized to match the pose of the subject in the first frame. We extract M l, the appearance model of link l, and its domain by picking the region in the first frame corresponding to the projection of link l on the image. (6)
4 Then we perform tracking by minimizing the energy functional in (5) with respect to Θ i and W. We use an alternating minimization scheme: given and initial guess for W minimize with respect to the motions Θ i, then fix Θ i to these optimal values and minimize with respect to W. The minimizations are performed using a gradient descent scheme. The gradient of (5) with respect to Θ i is: E = 1 ( L I i(p, Θ i, l) Θ i A l=1 L V l (x, Θ i)w l (p)dp + where E i(θ i, W ) A = L l=1 L l=1 ( I T i (g l(p, Θ g ) l i) (Θ i)p Θ i I i(p, Θ i, l) 2 V l (x, Θ i)w l (p)dp l=1 Θ i ) V l (p, Θ i)w l (p)dp Θ i W (p)v l (p, Θ i )dp I i (p, Θ, l) = I i (g l (p, Θ) M l (p) and: The derivative with respect to the support map W (.) is: Wl E(p k ) = i δ(d l (p k )) V l(p k, Θ i) A (7) ( ) I i(p k, Θ i, l) 2 E i(θ i, W ) This procedure is applied to blocks of n f frames of the sequence to be tracked. n f is a parameter of the algorithm and determines how many frames are used for estimating the shapes W l. Setting it to 1 is not a good idea because it means using the matching on one single image to update the shape of the object. Using large values such as the number of frames in the sequence is also not advisable because the appearance of body parts in distant frames can vary considerably and this would negatively affect the shape estimation. It must be pointed out that in this formulation we do not exploit the temporal continuity of the motions Θ i. That is, we would obtain the same results if we applied the algorithm to a sequence where the order of the frames is scrambled. In practice, since the derivative of (5) with respect to Θ i is independent from Θ j, we exploit continuity by performing the minimization separately for each Θ i and by using the optimal value of Θ i to initialize the minimization on Θ i+1. Also we do not constrain the parameters of the kinematic chain to represent configurations physically feasible for the human body. Adding to the cost function some terms that model spatial constraints or joint dynamics would possibly improve the results. (8) Figure 1: Example of kinematic chain model of the human body used for tracking 5 Experimental results In our preliminary experiments we tracked the position the body of two subjects performing a walking gait. The sequences have length of 100 and 43 frames. In both sequences we tracked the position of torso, one leg and one arm using a kinematic chain model with 5 links, pictured in figure 1. It can be seen that the chain model has links of two different shapes: ellipsoids and conic sections. For computational efficiency the projection on the image of a conic section is approximated with a trapezoid. The model has been manually initialized in the first frame of the sequence by specifying the geometry of the links and by clicking on their position in the image. Once this initialization step is complete the system performs the tracking task by minimizing (5), alternating between motions parameters Θ and shapes W l. In the minimization over the motion parameters numerical derivatives have been used. The results are obtained by performing the minimization on blocks of n f = 5 images. The results are shown in figures 5. In the first and forth row we can see some keyframes of the original sequences, in the second and fifth we have frames with superimposed the estimated kinematic chain and in the third and sixth superimposed are the estimated shape maps W l. We have also included movie clips of original and tracked sequences. References [1] J. K. Aggarwal and Q. Cai Human Motion Analysis: A Review 1999 [2] M. J. Black. Eigentracking: robust matching and tracking of articulated objects
5 Re- sults of the tracking. The first and forth rows show frames from the original sequences. The second and fifth rows display the estimated pose of the kinematic chain. The third and last show the estimated weight maps. [3] M. J. Black. Explaining optical flow events with parameterized spatio-temporal models. In Proc. of Conference on Computer Vision and Pattern Recognition, volume 1, pages , [4] M. J. Black and A. D. Jepson. A Probabilistic framework for matching temporal trajectories: Condensation-based recognition of gestures and expressions. In Proc. of European Conference on Computer Vision, volume 1, pages , [5] C. Bregler. Tracking people with twists and exponential maps. In Proc. International Conference on Computer Vision and Pattern Recognition, [6] C. Bregler. Learning and recognizing human dynamics in video sequences. In Proc. of the Conference on Computer Vision and Pattern Recognition, pages , [7] J. Deutscher, A. Blake and I. Reid Articulated motion capture by annealing particle filering In Proc. CVPR, pp , [8] D. M. Gavrila and L. S. Davis. Tracking of humans in action: a 3-d model-based approach
6 [9] D. M. Gavrila. The visual analysis of human movement: A survey. In Computer Vision and Image Understanding, volume 73, pages 82 98, [10] M. A. Giese and T. Poggio. Morphable models for the analysis and synthesis of complex motion patterns. In International Journal of Computer Vision, volume 38(1), pages , [11] F. S. Grassia. Practical parameterization of rotations using the exponential map. Journal of Graphics Tools, 3(3):29-48, [12] N. R. Howe, M. E. Leventon and W. T. Freeman Bayesian reconstruction of 3D human motion from single-camera video In Proc. of NIPS, 12, pp , 2000 [13] M. Isard and A. Blake. Condensation - conditional density propagation for visual tracking International Journal of Computer Vision 29(1), pp. 5 28, [14] I. A. Kakadiaris and D. Metaxas Model based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In Proc. of CVPR, pp , 1995 [22] H. Sidenbladh, M. Black and D. Fleet Stochastic tracking of 3d human figures using 2d image motion In Proc. of ECCV, II pp , [23] S. Soatto and A. Yezzi Deformotion: deforming motions and shape averages. In Proc. of the ECCV, LNCS, Springer Verlag, May [24] L. Torresani, D. Yang, G. Alexander and C. Bregler Tracking and Modelling Non-Rigid Objects with Rank Constraints In Proc. International Conference on Computer Vision and Pattern Recognition, [25] S. Wachter and H. Nagel Tracking of persons in monocular image sequences In CVIU, 74(3), pp , 1999 [26] C. Wren. Dynamic models of human motion [27] Y. Yacoob and M. J. Black. Parameterized modeling and recognition of activities. In Computer Vision and Image Understanding, volume 73(2), pages , [15] M. K. Leung and Y. H. Yang First sight: A human body outline labeling system In IEEE Trans. on PAMI, 17(4),pp , [16] J. J. Little and J. E. Boyd. Recognizing people by their gait: the shape of motion [17] B. North and A. Blake and M. Isard and J. Rittscher. Learning and classification of complex dynamics. In IEEE Transaction on Pattern Analysis and Machine Intelligence, volume 22(9), pages , [18] N. Paragios and R. Deriche Geodesic active contours and level sets for the detection and tracking of moving objects In PAMI, 22(3), , 2000 [19] V. Pavlovic and J. Rehg and J. MacCormick. Impact of Dynamic Model Learning on Classification of Human Motion In Proc. International Conference on Computer Vision and Pattern Recognition, [20] J. M. Regh and T. Kanade. Model-based tracking of self-occluding articulated objects. In ICCV, [21] A. Shio and J. Sklansky Segmentation of people in motion In Proc. of IEEE Workshop on Visual Motion, pp , 1991
MODELING HUMAN GAITS WITH SUBTLETIES. Alessandro Bissacco Payam Saisan Stefano Soatto
MODELING HUMAN GAITS WITH SUBTLETIES Alessandro Bissacco Payam Saisan Stefano Soatto Department of Computer Science, University of California, Los Angeles, CA 90095 {bissacco,soatto}@cs.ucla.edu Department
More informationReal-time tracking of highly articulated structures in the presence of noisy measurements
Real-time tracking of highly articulated structures in the presence of noisy measurements T. Drummond R. Cipolla Department of Engineering University of Cambridge Cambridge, UK CB2 1PZ Department of Engineering
More informationTracking of Human Body using Multiple Predictors
Tracking of Human Body using Multiple Predictors Rui M Jesus 1, Arnaldo J Abrantes 1, and Jorge S Marques 2 1 Instituto Superior de Engenharia de Lisboa, Postfach 351-218317001, Rua Conselheiro Emído Navarro,
More informationHuman Upper Body Pose Estimation in Static Images
1. Research Team Human Upper Body Pose Estimation in Static Images Project Leader: Graduate Students: Prof. Isaac Cohen, Computer Science Mun Wai Lee 2. Statement of Project Goals This goal of this project
More informationProbabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences
Probabilistic Tracking and Reconstruction of 3D Human Motion in Monocular Video Sequences Presentation of the thesis work of: Hedvig Sidenbladh, KTH Thesis opponent: Prof. Bill Freeman, MIT Thesis supervisors
More informationBayesian Reconstruction of 3D Human Motion from Single-Camera Video
Bayesian Reconstruction of 3D Human Motion from Single-Camera Video Nicholas R. Howe Cornell University Michael E. Leventon MIT William T. Freeman Mitsubishi Electric Research Labs Problem Background 2D
More informationVisual Motion Analysis and Tracking Part II
Visual Motion Analysis and Tracking Part II David J Fleet and Allan D Jepson CIAR NCAP Summer School July 12-16, 16, 2005 Outline Optical Flow and Tracking: Optical flow estimation (robust, iterative refinement,
More informationCardboard People: A Parameterized Model of Articulated Image Motion
Cardboard People: A Parameterized Model of Articulated Image Motion Shanon X. Ju Michael J. Black y Yaser Yacoob z Department of Computer Science, University of Toronto, Toronto, Ontario M5S A4 Canada
More informationMulti-camera Tracking of Articulated Human Motion using Shape and Motion Cues
IEEE TRANSACTIONS ON IMAGE PROCESSING 1 Multi-camera Tracking of Articulated Human Motion using Shape and Motion Cues Aravind Sundaresan, Student Member, IEEE, and Rama Chellappa, Fellow, IEEE Abstract
More informationA Dynamic Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space
A Dynamic Human Model using Hybrid 2D-3D Representations in Hierarchical PCA Space Eng-Jon Ong and Shaogang Gong Department of Computer Science, Queen Mary and Westfield College, London E1 4NS, UK fongej
More information3D Human Motion Analysis and Manifolds
D E P A R T M E N T O F C O M P U T E R S C I E N C E U N I V E R S I T Y O F C O P E N H A G E N 3D Human Motion Analysis and Manifolds Kim Steenstrup Pedersen DIKU Image group and E-Science center Motivation
More informationInternational Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol. XXXIV-5/W10
BUNDLE ADJUSTMENT FOR MARKERLESS BODY TRACKING IN MONOCULAR VIDEO SEQUENCES Ali Shahrokni, Vincent Lepetit, Pascal Fua Computer Vision Lab, Swiss Federal Institute of Technology (EPFL) ali.shahrokni,vincent.lepetit,pascal.fua@epfl.ch
More informationMulti-camera Tracking of Articulated Human Motion Using Motion and Shape Cues
Multi-camera Tracking of Articulated Human Motion Using Motion and Shape Cues Aravind Sundaresan and Rama Chellappa Department of Electrical and Computer Engineering, University of Maryland, College Park,
More informationInferring 3D Body Pose from Silhouettes using Activity Manifold Learning
CVPR 4 Inferring 3D Body Pose from Silhouettes using Activity Manifold Learning Ahmed Elgammal and Chan-Su Lee Department of Computer Science, Rutgers University, New Brunswick, NJ, USA {elgammal,chansu}@cs.rutgers.edu
More informationradius length global trans global rot
Stochastic Tracking of 3D Human Figures Using 2D Image Motion Hedvig Sidenbladh 1 Michael J. Black 2 David J. Fleet 2 1 Royal Institute of Technology (KTH), CVAP/NADA, S 1 44 Stockholm, Sweden hedvig@nada.kth.se
More informationModel-Based Human Motion Capture from Monocular Video Sequences
Model-Based Human Motion Capture from Monocular Video Sequences Jihun Park 1, Sangho Park 2, and J.K. Aggarwal 2 1 Department of Computer Engineering Hongik University Seoul, Korea jhpark@hongik.ac.kr
More informationStochastic Tracking of 3D Human Figures Using 2D Image Motion
Stochastic Tracking of 3D Human Figures Using 2D Image Motion Hedvig Sidenbladh 1, Michael J. Black 2, and David J. Fleet 2 1 Royal Institute of Technology (KTH), CVAP/NADA, S 1 44 Stockholm, Sweden hedvig@nada.kth.se
More informationA Unified Spatio-Temporal Articulated Model for Tracking
A Unified Spatio-Temporal Articulated Model for Tracking Xiangyang Lan Daniel P. Huttenlocher {xylan,dph}@cs.cornell.edu Cornell University Ithaca NY 14853 Abstract Tracking articulated objects in image
More informationPart I: HumanEva-I dataset and evaluation metrics
Part I: HumanEva-I dataset and evaluation metrics Leonid Sigal Michael J. Black Department of Computer Science Brown University http://www.cs.brown.edu/people/ls/ http://vision.cs.brown.edu/humaneva/ Motivation
More informationAutomatic Detection and Tracking of Human Motion with a View-Based Representation
Automatic Detection and Tracking of Human Motion with a View-Based Representation Ronan Fablet and Michael J. Black Department of Computer Science Brown University, Box 19 Providence, RI02912, USA {rfablet,black}@cs.brown.edu
More informationAn Adaptive Appearance Model Approach for Model-based Articulated Object Tracking
An Adaptive Appearance Model Approach for Model-based Articulated Object Tracking Alexandru O. Bălan Michael J. Black Department of Computer Science - Brown University Providence, RI 02912, USA {alb, black}@cs.brown.edu
More informationEigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation
EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation Michael J. Black and Allan D. Jepson Xerox Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto,
More informationAutomatic detection and tracking of human motion with a view-based representation
Automatic detection and tracking of human motion with a view-based representation Ronan Fablet and Michael J. Black Department of Computer Science Brown University, Box 19 Providence, RI02912, USA frfablet,blackg@cs.brown.edu
More informationFace Tracking. Synonyms. Definition. Main Body Text. Amit K. Roy-Chowdhury and Yilei Xu. Facial Motion Estimation
Face Tracking Amit K. Roy-Chowdhury and Yilei Xu Department of Electrical Engineering, University of California, Riverside, CA 92521, USA {amitrc,yxu}@ee.ucr.edu Synonyms Facial Motion Estimation Definition
More informationLayered Active Contours for Tracking
Layered Active Contours for Tracking Gallagher Pryor, Patricio Vela, Tauseef ur Rehman, and Allen Tannenbaum Georgia Institute of Technology, Atlanta GA, USA Abstract This paper deals with the task of
More informationDescriptive and Generative Representations for Modeling and Interpreting Human Movement
Descriptive and Generative Representations for Modeling and Interpreting Human Movement Paper ID 38 Abstract Human movement analysis requires representations that allow for effective encoding and interpretation
More information3D Tracking for Gait Characterization and Recognition
3D Tracking for Gait Characterization and Recognition Raquel Urtasun and Pascal Fua Computer Vision Laboratory EPFL Lausanne, Switzerland raquel.urtasun, pascal.fua@epfl.ch Abstract We propose an approach
More informationNonrigid Surface Modelling. and Fast Recovery. Department of Computer Science and Engineering. Committee: Prof. Leo J. Jia and Prof. K. H.
Nonrigid Surface Modelling and Fast Recovery Zhu Jianke Supervisor: Prof. Michael R. Lyu Committee: Prof. Leo J. Jia and Prof. K. H. Wong Department of Computer Science and Engineering May 11, 2007 1 2
More informationMarkerless Motion Capture from Single or Multi-Camera Video Sequence
Markerless Motion Capture from Single or Multi-Camera Video Sequence F. Remondino IGP - ETH Zurich, CH fabio@geod.baug.ethz.ch www.photogrammetry.ethz..ch N. D Apuzzo Homometrica Consulting nda@homometrica.ch
More informationRobust Model-Free Tracking of Non-Rigid Shape. Abstract
Robust Model-Free Tracking of Non-Rigid Shape Lorenzo Torresani Stanford University ltorresa@cs.stanford.edu Christoph Bregler New York University chris.bregler@nyu.edu New York University CS TR2003-840
More informationRecognition Rate. 90 S 90 W 90 R Segment Length T
Human Action Recognition By Sequence of Movelet Codewords Xiaolin Feng y Pietro Perona yz y California Institute of Technology, 36-93, Pasadena, CA 925, USA z Universit a dipadova, Italy fxlfeng,peronag@vision.caltech.edu
More informationA novel approach to motion tracking with wearable sensors based on Probabilistic Graphical Models
A novel approach to motion tracking with wearable sensors based on Probabilistic Graphical Models Emanuele Ruffaldi Lorenzo Peppoloni Alessandro Filippeschi Carlo Alberto Avizzano 2014 IEEE International
More informationHuman Action Recognition Using Independent Component Analysis
Human Action Recognition Using Independent Component Analysis Masaki Yamazaki, Yen-Wei Chen and Gang Xu Department of Media echnology Ritsumeikan University 1-1-1 Nojihigashi, Kusatsu, Shiga, 525-8577,
More informationGrasp Recognition using a 3D Articulated Model and Infrared Images
Grasp Recognition using a 3D Articulated Model and Infrared Images Koichi Ogawara Institute of Industrial Science, Univ. of Tokyo, Tokyo, Japan Jun Takamatsu Institute of Industrial Science, Univ. of Tokyo,
More informationDynamic Time Warping for Binocular Hand Tracking and Reconstruction
Dynamic Time Warping for Binocular Hand Tracking and Reconstruction Javier Romero, Danica Kragic Ville Kyrki Antonis Argyros CAS-CVAP-CSC Dept. of Information Technology Institute of Computer Science KTH,
More informationSegmentation and Tracking of Partial Planar Templates
Segmentation and Tracking of Partial Planar Templates Abdelsalam Masoud William Hoff Colorado School of Mines Colorado School of Mines Golden, CO 800 Golden, CO 800 amasoud@mines.edu whoff@mines.edu Abstract
More informationA New Hierarchical Particle Filtering for Markerless Human Motion Capture
A New Hierarchical Particle Filtering for Markerless Human Motion Capture Yuanqiang Dong, Guilherme N. DeSouza Electrical and Computer Engineering Department University of Missouri, Columbia, MO, USA Abstract
More informationAUTOMATIC 3D HUMAN ACTION RECOGNITION Ajmal Mian Associate Professor Computer Science & Software Engineering
AUTOMATIC 3D HUMAN ACTION RECOGNITION Ajmal Mian Associate Professor Computer Science & Software Engineering www.csse.uwa.edu.au/~ajmal/ Overview Aim of automatic human action recognition Applications
More informationDynamic Human Shape Description and Characterization
Dynamic Human Shape Description and Characterization Z. Cheng*, S. Mosher, Jeanne Smith H. Cheng, and K. Robinette Infoscitex Corporation, Dayton, Ohio, USA 711 th Human Performance Wing, Air Force Research
More informationRepresentation and Matching of Articulated Shapes
Representation and Matching of Articulated Shapes Jiayong Zhang, Robert Collins and Yanxi Liu The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 11 {zhangjy,rcollins,yanxi}@cs.cmu.edu Abstract
More informationSingularity Analysis for Articulated Object Tracking
To appear in: CVPR 98, Santa Barbara, CA, June 1998 Singularity Analysis for Articulated Object Tracking Daniel D. Morris Robotics Institute Carnegie Mellon University Pittsburgh, PA 15213 James M. Rehg
More informationPredicting 3D People from 2D Pictures
Predicting 3D People from 2D Pictures Leonid Sigal Michael J. Black Department of Computer Science Brown University http://www.cs.brown.edu/people/ls/ CIAR Summer School August 15-20, 2006 Leonid Sigal
More informationParticle Filtering. CS6240 Multimedia Analysis. Leow Wee Kheng. Department of Computer Science School of Computing National University of Singapore
Particle Filtering CS6240 Multimedia Analysis Leow Wee Kheng Department of Computer Science School of Computing National University of Singapore (CS6240) Particle Filtering 1 / 28 Introduction Introduction
More informationArticulated Structure from Motion through Ellipsoid Fitting
Int'l Conf. IP, Comp. Vision, and Pattern Recognition IPCV'15 179 Articulated Structure from Motion through Ellipsoid Fitting Peter Boyi Zhang, and Yeung Sam Hung Department of Electrical and Electronic
More informationFace detection in a video sequence - a temporal approach
Face detection in a video sequence - a temporal approach K. Mikolajczyk R. Choudhury C. Schmid INRIA Rhône-Alpes GRAVIR-CNRS, 655 av. de l Europe, 38330 Montbonnot, France {Krystian.Mikolajczyk,Ragini.Choudhury,Cordelia.Schmid}@inrialpes.fr
More informationInternal Organ Modeling and Human Activity Analysis
Internal Organ Modeling and Human Activity Analysis Dimitris Metaxas dnm@cs.rutgers.edu CBIM Center Division of Computer and Information Sciences Rutgers University 3D Motion Reconstruction and Analysis
More informationVisual Recognition: Image Formation
Visual Recognition: Image Formation Raquel Urtasun TTI Chicago Jan 5, 2012 Raquel Urtasun (TTI-C) Visual Recognition Jan 5, 2012 1 / 61 Today s lecture... Fundamentals of image formation You should know
More informationArticulated Body Motion Capture by Annealed Particle Filtering
Articulated Body Motion Capture by Annealed Particle Filtering Jonathan Deutscher University of Oxford Dept. of Engineering Science Oxford, O13PJ United Kingdom jdeutsch@robots.ox.ac.uk Andrew Blake Microsoft
More informationProbabilistic Detection and Tracking of Motion Discontinuities
Probabilistic Detection and Tracking of Motion Discontinuities Michael J. Black David J. Fleet Xerox Palo Alto Research Center 3333 Coyote Hill Road Palo Alto, CA 94304 black,fleet @parc.xerox.com http://www.parc.xerox.com/
More informationTracking Human Body Parts Using Particle Filters Constrained by Human Biomechanics
Tracking Human Body Parts Using Particle Filters Constrained by Human Biomechanics J. Martínez 1, J.C. Nebel 2, D. Makris 2 and C. Orrite 1 1 Computer Vision Lab, University of Zaragoza, Spain 2 Digital
More informationVideo Alignment. Literature Survey. Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin
Literature Survey Spring 2005 Prof. Brian Evans Multidimensional Digital Signal Processing Project The University of Texas at Austin Omer Shakil Abstract This literature survey compares various methods
More informationA Kinematic Model for Human Motion and Gait Analysis
A Kinematic Model for Human Motion and Gait Analysis Shiloh L. Dockstader and A. Murat Tekalp * Dept. of Electrical and Computer Engineering, University of Rochester, Rochester, NY 1467 * College of Engineering,
More informationNotes 9: Optical Flow
Course 049064: Variational Methods in Image Processing Notes 9: Optical Flow Guy Gilboa 1 Basic Model 1.1 Background Optical flow is a fundamental problem in computer vision. The general goal is to find
More informationSingularities in Articulated Object Tracking with 2-D and 3-D Models
Singularities in Articulated Object Tracking with 2-D and 3-D Models James M. Rehg Daniel D. Morris CRL 9/8 October 199 Cambridge Research Laboratory The Cambridge Research Laboratory was founded in 198
More informationHuman pose estimation using Active Shape Models
Human pose estimation using Active Shape Models Changhyuk Jang and Keechul Jung Abstract Human pose estimation can be executed using Active Shape Models. The existing techniques for applying to human-body
More informationStructure from Motion. Prof. Marco Marcon
Structure from Motion Prof. Marco Marcon Summing-up 2 Stereo is the most powerful clue for determining the structure of a scene Another important clue is the relative motion between the scene and (mono)
More informationLearning Articulated Skeletons From Motion
Learning Articulated Skeletons From Motion Danny Tarlow University of Toronto, Machine Learning with David Ross and Richard Zemel (and Brendan Frey) August 6, 2007 Point Light Displays It's easy for humans
More informationInferring 3D People from 2D Images
Inferring 3D People from 2D Images Department of Computer Science Brown University http://www.cs.brown.edu/~black Collaborators Hedvig Sidenbladh, Swedish Defense Research Inst. Leon Sigal, Brown University
More informationModel-based Motion Capture for Crash Test Video Analysis
Model-based Motion Capture for Crash Test Video Analysis Juergen Gall 1, Bodo Rosenhahn 1, Stefan Gehrig 2, and Hans-Peter Seidel 1 1 Max-Planck-Institute for Computer Science, Campus E1 4, 66123 Saarbrücken,
More informationChapter 9 Object Tracking an Overview
Chapter 9 Object Tracking an Overview The output of the background subtraction algorithm, described in the previous chapter, is a classification (segmentation) of pixels into foreground pixels (those belonging
More informationMixture Models and EM
Mixture Models and EM Goal: Introduction to probabilistic mixture models and the expectationmaximization (EM) algorithm. Motivation: simultaneous fitting of multiple model instances unsupervised clustering
More informationIntroduction to Computer Vision
Introduction to Computer Vision Michael J. Black Nov 2009 Perspective projection and affine motion Goals Today Perspective projection 3D motion Wed Projects Friday Regularization and robust statistics
More informationAutomatic Kinematic Chain Building from Feature Trajectories of Articulated Objects
Automatic Kinematic Chain Building from Feature Trajectories of Articulated Objects Jingyu Yan and Marc Pollefeys Department of Computer Science The University of North Carolina at Chapel Hill Chapel Hill,
More informationHuman Body Model Acquisition and Tracking using Voxel Data
Submitted to the International Journal of Computer Vision Human Body Model Acquisition and Tracking using Voxel Data Ivana Mikić 2, Mohan Trivedi 1, Edward Hunter 2, Pamela Cosman 1 1 Department of Electrical
More informationThe Template Update Problem
The Template Update Problem Iain Matthews, Takahiro Ishikawa, and Simon Baker The Robotics Institute Carnegie Mellon University Abstract Template tracking dates back to the 1981 Lucas-Kanade algorithm.
More informationHand-Eye Calibration from Image Derivatives
Hand-Eye Calibration from Image Derivatives Abstract In this paper it is shown how to perform hand-eye calibration using only the normal flow field and knowledge about the motion of the hand. The proposed
More informationExtracting Spatio-temporal Local Features Considering Consecutiveness of Motions
Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions Akitsugu Noguchi and Keiji Yanai Department of Computer Science, The University of Electro-Communications, 1-5-1 Chofugaoka,
More informationExplaining Optical Flow Events with Parameterized Spatio-temporal Models
Explaining Optical Flow Events with Parameterized Spatio-temporal Models Michael J. Black * Xerox Palo Alto Research Center 3333 Coyote Hill Road, Palo Alto, CA 94304 black@parc.xerox.com, http://www.parc.xerox.com/black/
More informationOcclusion Robust Multi-Camera Face Tracking
Occlusion Robust Multi-Camera Face Tracking Josh Harguess, Changbo Hu, J. K. Aggarwal Computer & Vision Research Center / Department of ECE The University of Texas at Austin harguess@utexas.edu, changbo.hu@gmail.com,
More informationDynamical Models for Human Gait Synthesis
Dynamical Models for Human Gait Synthesis Alessandro Bissacco Payam Saisan Stefano Soatto University of California, Los Angeles CA 90095 {saisan,bissacco,soatto}@cs.ucla.edu UCLA CSD-TR # 03005 Abstract
More informationActivityRepresentationUsing3DShapeModels
ActivityRepresentationUsing3DShapeModels AmitK.Roy-Chowdhury RamaChellappa UmutAkdemir University of California University of Maryland University of Maryland Riverside, CA 9252 College Park, MD 274 College
More informationExpanding gait identification methods from straight to curved trajectories
Expanding gait identification methods from straight to curved trajectories Yumi Iwashita, Ryo Kurazume Kyushu University 744 Motooka Nishi-ku Fukuoka, Japan yumi@ieee.org Abstract Conventional methods
More informationFace Recognition At-a-Distance Based on Sparse-Stereo Reconstruction
Face Recognition At-a-Distance Based on Sparse-Stereo Reconstruction Ham Rara, Shireen Elhabian, Asem Ali University of Louisville Louisville, KY {hmrara01,syelha01,amali003}@louisville.edu Mike Miller,
More informationMarkerless Motion Capture from Monocular Videos
Markerless Motion Capture from Monocular Videos Vishal Mamania Appu Shaji Sharat Chandran Department of Computer Science &Engineering Indian Institute of Technology Bombay Mumbai, India 400 076 {vishalm,appu,sharat}@cse.iitb.ac.in
More informationUnsupervised Motion Classification by means of Efficient Feature Selection and Tracking
Unsupervised Motion Classification by means of Efficient Feature Selection and Tracking Angel D. Sappa Niki Aifanti Sotiris Malassiotis Michael G. Strintzis Computer Vision Center Informatics & Telematics
More informationHuman Detection and Tracking for Video Surveillance: A Cognitive Science Approach
Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach Vandit Gajjar gajjar.vandit.381@ldce.ac.in Ayesha Gurnani gurnani.ayesha.52@ldce.ac.in Yash Khandhediya khandhediya.yash.364@ldce.ac.in
More informationReducing Particle Filtering Complexity for 3D Motion Capture using Dynamic Bayesian Networks
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Reducing Particle Filtering Complexity for 3D Motion Capture using Dynamic Bayesian Networks Cédric Rose Diatélic SA INRIA
More informationInferring 3D from 2D
Inferring 3D from 2D History Monocular vs. multi-view analysis Difficulties structure of the solution and ambiguities static and dynamic ambiguities Modeling frameworks for inference and learning top-down
More information3D Human Body Tracking using Deterministic Temporal Motion Models
3D Human Body Tracking using Deterministic Temporal Motion Models Raquel Urtasun and Pascal Fua Computer Vision Laboratory EPFL CH-1015 Lausanne, Switzerland raquel.urtasun, pascal.fua@epfl.ch Abstract.
More informationModeling 3D Human Poses from Uncalibrated Monocular Images
Modeling 3D Human Poses from Uncalibrated Monocular Images Xiaolin K. Wei Texas A&M University xwei@cse.tamu.edu Jinxiang Chai Texas A&M University jchai@cse.tamu.edu Abstract This paper introduces an
More informationDepth. Common Classification Tasks. Example: AlexNet. Another Example: Inception. Another Example: Inception. Depth
Common Classification Tasks Recognition of individual objects/faces Analyze object-specific features (e.g., key points) Train with images from different viewing angles Recognition of object classes Analyze
More informationHuman motion analysis: methodologies and applications
Human motion analysis: methodologies and applications Maria João M. Vasconcelos, João Manuel R. S. Tavares maria.vasconcelos@fe.up.pt CMBBE 2008 8th International Symposium on Computer Methods in Biomechanics
More informationMotion Estimation for Video Coding Standards
Motion Estimation for Video Coding Standards Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Introduction of Motion Estimation The goal of video compression
More informationLearning Parameterized Models of Image Motion
Learning Parameterized Models of Image Motion Michael J. Black Yaser Yacoob Allan D. Jepson David J. Fleet Xerox Palo Alto Research Center, 3333 Coyote Hill Road, Palo Alto, CA 9434 Computer Vision Laboratory,
More informationFeature Transfer and Matching in Disparate Stereo Views through the use of Plane Homographies
Feature Transfer and Matching in Disparate Stereo Views through the use of Plane Homographies M. Lourakis, S. Tzurbakis, A. Argyros, S. Orphanoudakis Computer Vision and Robotics Lab (CVRL) Institute of
More informationHuman Detection. A state-of-the-art survey. Mohammad Dorgham. University of Hamburg
Human Detection A state-of-the-art survey Mohammad Dorgham University of Hamburg Presentation outline Motivation Applications Overview of approaches (categorized) Approaches details References Motivation
More informationInternational Journal of Advanced Research in Electronics and Communication Engineering (IJARECE) Volume 4, Issue 4, April 2015
A Survey on Human Motion Detection and Surveillance Miss. Sapana K.Mishra Dept. of Electronics and Telecommunication Engg. J.T.M.C.O.E., Faizpur, Maharashtra, India Prof. K.S.Bhagat Dept. of Electronics
More informationCombined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition
Combined Shape Analysis of Human Poses and Motion Units for Action Segmentation and Recognition Maxime Devanne 1,2, Hazem Wannous 1, Stefano Berretti 2, Pietro Pala 2, Mohamed Daoudi 1, and Alberto Del
More informationMotion Tracking and Event Understanding in Video Sequences
Motion Tracking and Event Understanding in Video Sequences Isaac Cohen Elaine Kang, Jinman Kang Institute for Robotics and Intelligent Systems University of Southern California Los Angeles, CA Objectives!
More informationCompositing a bird's eye view mosaic
Compositing a bird's eye view mosaic Robert Laganiere School of Information Technology and Engineering University of Ottawa Ottawa, Ont KN 6N Abstract This paper describes a method that allows the composition
More informationHierarchical Part-Based Human Body Pose Estimation
Hierarchical Part-Based Human Body Pose Estimation R. Navaratnam A. Thayananthan P. H. S. Torr R. Cipolla University of Cambridge Oxford Brookes Univeristy Department of Engineering Cambridge, CB2 1PZ,
More informationExponential Maps for Computer Vision
Exponential Maps for Computer Vision Nick Birnie School of Informatics University of Edinburgh 1 Introduction In computer vision, the exponential map is the natural generalisation of the ordinary exponential
More informationCOMPUTER AND ROBOT VISION
VOLUME COMPUTER AND ROBOT VISION Robert M. Haralick University of Washington Linda G. Shapiro University of Washington T V ADDISON-WESLEY PUBLISHING COMPANY Reading, Massachusetts Menlo Park, California
More informationContinuous and Discrete Optimization Methods in Computer Vision. Daniel Cremers Department of Computer Science University of Bonn
Continuous and Discrete Optimization Methods in Computer Vision Daniel Cremers Department of Computer Science University of Bonn Oxford, August 16 2007 Segmentation by Energy Minimization Given an image,
More informationAutomatic Tracking of Moving Objects in Video for Surveillance Applications
Automatic Tracking of Moving Objects in Video for Surveillance Applications Manjunath Narayana Committee: Dr. Donna Haverkamp (Chair) Dr. Arvin Agah Dr. James Miller Department of Electrical Engineering
More informationLearning A Highly Structured Motion Model for 3D Human Tracking
ACCV2002: The 5th Asian Conference on Computer Vision, 23 25 January 2002, Melbourne, Australia. Learning A Highly Structured Motion Model for 3D Human Tracking Tao Zhao Tianshu Wang Heung-Yeung Shum University
More informationA Bayesian Framework for Real-Time 3D Hand Tracking in High Clutter Background
A Bayesian Framework for Real-Time 3D Hand Tracking in High Clutter Background Hanning Zhou, Thomas S. Huang University of Illinois at Urbana-Champaign 45 N. Mathews, Urbana, IL 6181, U.S.A hzhou,huang@ifp.uiuc.edu
More informationArticulated Model Based People Tracking Using Motion Models
Articulated Model Based People Tracking Using Motion Models Huazhong Ning, Liang Wang, Weiming Hu and Tieniu Tan National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences,
More informationParameter estimation. Christiano Gava Gabriele Bleser
Parameter estimation Christiano Gava Christiano.Gava@dfki.de Gabriele Bleser gabriele.bleser@dfki.de Introduction Previous lectures: P-matrix 2D projective transformations Estimation (direct linear transform)
More informationComputer Vision in a Non-Rigid World
2 nd Tutorial on Computer Vision in a Non-Rigid World Dr. Lourdes Agapito Prof. Adrien Bartoli Dr. Alessio Del Bue Non-rigid Structure from Motion Non-rigid structure from motion To extract non-rigid 3D
More information