Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Similar documents
SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Discriminative Dictionary Learning with Pairwise Constraints

Prof. Feng Liu. Spring /24/2017

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping.

Human Skeleton Reconstruction for Optical Motion Capture

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction

Machine Learning: Algorithms and Applications

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

Support Vector Machines

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm

Multi-View Face Alignment Using 3D Shape Model for View Estimation

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Calibration of an Articulated Camera System with Scale Factor Estimation

A Binarization Algorithm specialized on Document Images and Photos

Kinematics of pantograph masts

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga

Adaptive Transfer Learning

Unsupervised Learning and Clustering

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Analysis of Continuous Beams in General

The Shortest Path of Touring Lines given in the Plane

Reducing Frame Rate for Object Tracking

Contours Planning and Visual Servo Control of XXY Positioning System Using NURBS Interpolation Approach

Learning a Class-Specific Dictionary for Facial Expression Recognition

Cluster Analysis of Electrical Behavior

Smoothing Spline ANOVA for variable screening

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Geometric Primitive Refinement for Structured Light Cameras

A high precision collaborative vision measurement of gear chamfering profile

Concurrent models of computation for embedded software

Routability Driven Modification Method of Monotonic Via Assignment for 2-layer Ball Grid Array Packages

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Estimating Human Body Pose from a Single Image via the Specialized Mappings Architecture

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris

Generalized Video Deblurring for Dynamic Scenes

Local Quaternary Patterns and Feature Local Quaternary Patterns

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

Multi-view 3D Position Estimation of Sports Players

Machine Learning 9. week

Distance Calculation from Single Optical Image

Calibration of an Articulated Camera System

Palmprint Feature Extraction Using 2-D Gabor Filters

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

An Accurate Evaluation of Integrals in Convex and Non convex Polygonal Domain by Twelve Node Quadrilateral Finite Element Method

Feature-based image registration using the shape context

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Maintaining temporal validity of real-time data on non-continuously executing resources

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Calibration of an Articulated Camera System

Programming in Fortran 90 : 2017/2018

Integrated Expression-Invariant Face Recognition with Constrained Optical Flow

Discrete-Continuous Depth Estimation from a Single Image

Biostatistics 615/815

Depth error correction for projector-camera based consumer depth cameras

METRIC ALIGNMENT OF LASER RANGE SCANS AND CALIBRATED IMAGES USING LINEAR STRUCTURES

LECTURE : MANIFOLD LEARNING

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

A Hierarchical Deformable Model Using Statistical and Geometric Information

Nonlocal Mumford-Shah Model for Image Segmentation

Visual Hand Tracking Using Nonparametric Belief Propagation

Available online at ScienceDirect. Procedia Environmental Sciences 26 (2015 )

Segmentation and Tracking of Multiple Humans in Crowded Environments

Unsupervised Learning

3D Point Cloud Video Segmentation Based on Interaction Analysis

Simplification of 3D Meshes

News. Recap: While Loop Example. Reading. Recap: Do Loop Example. Recap: For Loop Example

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature

Parallel Inverse Halftoning by Look-Up Table (LUT) Partitioning

1. Introduction. Abstract

3D vector computer graphics

New Appearance Models for Natural Image Matting

Fingerprint matching based on weighting method and SVM

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

UNIT 2 : INEQUALITIES AND CONVEX SETS

Articulated Motion Capture from Visual Hulls in High Dimensional Configuration Spaces

Index Terms Object tracking, Extended Kalmanfiter, Particle filter, Color matching.

3D Modeling Using Multi-View Images. Jinjin Li. A Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Science

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

A Robust Method for Estimating the Fundamental Matrix

Radial Basis Functions

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

A VR-BASED HYPER INTERACTION PLATFORM. Rong-Chi Chang

Attributed Relational Graph Based Feature Extraction of Body Poses In Indian Classical Dance Bharathanatyam

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Object-driven content-based image retrieval

A novel framework for motion segmentation and tracking by clustering incomplete trajectories

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

UAV global pose estimation by matching forward-looking aerial images with satellite images

A NEW IMPLEMENTATION OF THE ICP ALGORITHM FOR 3D SURFACE REGISTRATION USING A COMPREHENSIVE LOOK UP MATRIX

Transcription:

Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs 2 Aalto Unversty {ssrdhar,frmueller,mzollhoef,dcasas,theobalt}@mp-nfmpgde {anttoulasvrta}@aaltof Supplementary Document Fg 1 Lve tracng results for three dfferent subjects In ths document we tae a deeper loo at our artculated Gaussan mxture algnment strategy and show more qualtatve results of our lve capture setup that allows to trac hand-object nteractons at frame rate In addton, we provde detals on our benchmar dataset and the error metrc used n the ground truth evaluaton Fnally, we gve the gradents of all components of our objectve functon For further results, e nfluence of the dfferent components and vdeo footage of lve tracng sessons, we refer to the supplemental vdeo 1 Algnment Objectve In ths secton, we tae a deeper loo at the desgn of our algnment objectve E a and explore ts connecton to pont set regstraton methods that are based on Gaussan mxtures [1 Note, the algnment objectve s just a small component of our complete energy functon that also ncludes novel contact and occluson handlng constrants Let us assume the model as well as the nput depth data

2 S Srdhar, F Mueller, M Zollhöfer, D Casas, A Oulasvrta, C Theobalt are represented each as a Gaussan mxture: M(x) M w G(x µ, σ ), I(x) I w G(x µ, σ ) Here, the set M contans the ndces of all model Gaussans and the set I of all mage Gaussans, respectvely Each Gaussan s sotropc wth standard devaton σ R and mean µ R 3 For smplcty let us assume all mxng weghts to be one (w 1) We then defne an l 2 -dssmlarty measure between the two Gaussan mxtures, also see [1 for more detals: E a [M(x) I(x) 2 dx The expanson of Equaton 1 splts the objectve n three dstnct parts: E a [M(x) I(x) 2 dx [ M(x) 2 2M(x)I(x) + I(x) 2 dx M(x) 2 dx 2 M(x)I(x)dx + I(x) 2 dx (a) (b) (c) Note, (c) s constant n the presented tracng scenaro, snce we only optmze for the postons of the model Gaussans The terms (a) and (b) are ntegrals over products of Gaussan Mxtures Let us frst consder (b): M(x)I(x)dx M ( )( ) G(x µ, σ ) G(x µ j, σ j ) dx M j I [ G(x µ, σ )G(x µ j, σ j ) dx M j I [ j I G(x µ, σ )G(x µ j, σ j ) dx S,j Snce S,j s the ntegral over a product of Gaussans, t has a closed form expresson [2: ( ) S,j (2π) 3 2 (σ 2 σ2 j ) 3 2 (σ 2 + σ2 j ) 3 2 exp µ µ j 2 2 2(σ 2 + σ2 j ) Its gradent can be easly derved n closed form; the same holds for (a)

Real-tme Jont Hand and Object Tracng from RGB-D Input 3 Fg 2 We are able to trac complex shapes le a toy car Our contact ponts term (contacts are crcled n blue) maes fngers hold the car even n the presence of severe occluson 2 Lve Tracng Results Our real-tme approach uses the color and depth data from a sngle Creatve Senz3D tme-of-flght (TOF) sensor Note, we also support other depth sensors le the Intel RealSense, Knect and Prmesense Carmne The used color and depth resolutons are 640 480 and 320 240, both captured at 30 Hz We show compellng lve tracng results for three dfferent subjects n a close nteracton range of 15 to 100 cm away from the camera, see Fg 1 In addton, Fg 2 presents a tracng result of a complex object (toy car) Tracng s robust even f hands closely nteract wth objects due to the proposed contact and occluson constrants Our approach s robust even f a second hand s vsble Ths enables nterestng and new nteracton possbltes as shown n Fg 1 For addtonal lve footage, we refer to the supplemental vdeo 3 Error Measure We provde a new benchmar wth 3014 frames (6 sequences) wth ground truth annotatons to evaluate hand-object tracng methods, see Fg 3 For each frame, we annotated 8 dstnct landmars (5 fngertp postons and 3 corners of the object) If a locaton s not vsble, the correspondng landmar s set to be nvald and s not consdered n the error measure For the object (cubod), the 3 landmars span a coordnate system along the cubod s two domnant axes Ths unquely defnes the cubod wth respect to an axs of symmetry For evaluaton, we employ the followng error metrc to compare our tracng results wth the ground truth annotatons: [ 1 E V + 1 M V X G + 1 M 3 m M X m G m where V denotes the set of all un-occluded fngertp postons n the ground truth, M denotes matched cubod corners, and X and G denote estmated and ground truth postons, respectvely The ndcator functon 1 M s 1 f M 3 and 0 otherwse Fngertp postons are compared wth the correspondng landmars based on the dstance n 3D Eucldean space To ths end, the 2D annotatons are bac-projected based on depth and nverse camera ntrnscs Matched cubod,

4 S Srdhar, F Mueller, M Zollho fer, D Casas, A Oulasvrta, C Theobalt Fg 3 The sx sequences of our novel ground-truth hand-object benchmar corners refers to corners n the estmated cubod that are closest to the ground truth If one of the cubod corners s occluded, then the set M s empty as the cubod cannot be unquely postoned 4 Gradents Here, we gve analytcal expressons for the gradents of all energy terms The used mathematcal notaton s defned n the man document Spatal Algnment Term Ea : µ µ µ X Xh µj Ea j S,j 2 σ + σj2 M j M µ µ µ X Xh j 2 S,j 2 σ + σj2 M j I Semantc Algnment Term Es : XX Es µ 2 α,j (µ µj ) x x M j I Anatomcal Plausblty Regularzer Ep : f xl x xu 0 Ep u 2 (x x ) f x > xu 2 (x xl ) f x < xl

Real-tme Jont Hand and Object Tracng from RGB-D Input 5 Temporal Smoothness Regularzer E t : Contact Ponts Term E c : E c x (j,l,t d ) T Object Occluson Term E o : References E t 2 (x (t) x 2x(t 1) + x (t 2) ) ( µj 4 ( µ j µ l 2 2 t 2 d) (µ j µ l ) µ ) l x x E o 2 (1 x ˆf ) (x x old ) H 1 Jan, B, Vemur, BC: Robust pont set regstraton usng gaussan mxture models Pattern Analyss and Machne Intellgence, IEEE Transactons on 33(8), 1633 1645 (2011) 2 Stoll, C, Hasler, N, Gall, J, Sedel, H, Theobalt, C: Fast artculated moton tracng usng a sums of gaussans body model In: Proc IEEE ICCV pp 951 958 (2011)