Robust Visual Tracking via Structured Multi-Task Sparse Learning

Size: px
Start display at page:

Download "Robust Visual Tracking via Structured Multi-Task Sparse Learning"

Transcription

1 Int J Comput Vs (203) 0: DOI 0.007/s z Robust Vsual Trackng va Structured Mult-Task Sparse Learnng Tanzhu Zhang Bernard Ghanem S Lu Narendra Ahuja Receved: 2 Aprl 202 / Accepted: 30 September 202 / Publshed onlne: 9 November 202 Sprnger Scence+Busness Meda New York 202 Abstract In ths paper, we formulate object trackng n a partcle flter framework as a structured mult-task sparse learnng problem, whch we denote as Structured Mult-Task Trackng (S-MTT). Snce we model partcles as lnear combnatons of dctonary templates that are updated dynamcally, learnng the representaton of each partcle s consdered a sngle task n Mult-Task Trackng (MTT). By employng popular sparsty-nducng l p,q mxed norms (specfcallyp {2, } and q = ), we regularze the representaton problem to enforce jont sparsty and learn the partcle representatons together. As compared to prevous methods that handle partcles ndependently, our results demonstrate that mnng the nterdependences between partcles mproves trackng Electronc supplementary materal The onlne verson of ths artcle (do:0.007/s z) contans supplementary materal, whch s avalable to authorzed users. T. Zhang Advanced Dgtal Scences Center (ADSC), Fusonopols Way, #08-0 Connexs North Tower, Sngapore38632, Sngapore e-mal: tzzhang0@gmal.com B. Ghanem Kng Abdullah Unversty of Scence and Technology (KAUST), Al Khwarzm Buldng #2224, Thuwal, Kngdom of Saud Araba e-mal: bernard.ghanem@kaust.edu.sa S. Lu (B) Department of Electrcal and Computer Engneerng, Natonal Unversty of Sngapore, 4 Engneerng Drve 3, Sngapore 7576, Sngapore e-mal: dcslus@nus.edu.sg N. Ahuja Department of Electrcal and Computer Engneerng, Beckman Insttute, and Coordnated Scence Laboratory, Unversty of Illnos at Urbana-Champagn, 204 Beckman Insttute, 405 N. Mathews Ave., Urbana, IL 680, USA e-mal: ahuja@vson.a.uuc.edu performance and overall computatonal complexty. Interestngly, we show that the popular L tracker (Me and Lng, IEEE Trans Pattern Anal Mach Intel 33(): , 20) s a specal case of our MTT formulaton (denoted as the L tracker) when p = q =. Under the MTT framework, some of the tasks (partcle representatons) are often more closely related and more lkely to share common relevant covarates than other tasks. Therefore, we extend the MTT framework to take nto account parwse structural correlatons between partcles (e.g. spatal smoothness of representaton) and denote the novel framework as S-MTT. The problem of learnng the regularzed sparse representaton n MTT and S-MTT can be solved effcently usng an Accelerated Proxmal Gradent (APG) method that yelds a sequence of closed form updates. As such, S-MTT and MTT are computatonally attractve. We test our proposed approach on challengng sequences nvolvng heavy occluson, drastc llumnaton changes, and large pose varatons. Expermental results show that S-MTT s much better than MTT, and both methods consstently outperform state-of-the-art trackers. Keywords Vsual trackng Partcle flter Graph Structure Sparse representaton Mult-task learnng Introducton The problem of trackng a target n vdeo arses n many mportant applcatons such as automatc survellance, robotcs, human computer nteracton, etc. For a vsual trackng algorthm to be useful n real-world scenaros, t should be desgned to handle and overcome cases where the target s appearance changes from frame-to-frame. Sgnfcant and rapd appearance varaton due to nose, occluson, varyng

2 368 Int J Comput Vs (203) 0: Fg. (Color onlne) Frames from a shakng sequence. The ground truth track of the head s desgnated n green. Due to fast moton, occluson, cluttered background, and changes n llumnaton, scale, and pose, vsual object trackng s a dffcult problem vewponts, background clutter, and llumnaton and scale changes pose major challenges to any tracker as shown n Fg.. Over the years, a plethora of trackng algorthms have been proposed to overcome these challenges. For a survey of many of these algorthms, we refer the reader to Ylmaz et al. (2006). Recently, sparse representaton (Candès et al. 2006) has been successfully appled to vsual trackng (Me and Lng 20; Me et al. 20; Lu et al. 200, 20). In ths case, the tracker represents each target canddate as a sparse lnear combnaton of dctonary templates that can be dynamcally updated to mantan an up-to-date target appearance model. Ths representaton has been shown to be robust aganst partal occlusons, whch leads to mproved trackng performance. However, sparse codng based trackers perform computatonally expensve l mnmzaton at each frame. In a partcle flter framework, computatonal cost grows lnearly wth the number of sampled partcles. It s ths computatonal bottleneck that precludes the use of these trackers n realtme scenaros. Consequently, very recent efforts have been made to speedup ths trackng paradgm (Me et al. 20; L et al. 20). More mportantly, these methods learn sparse representatons of partcles separately. Ignorng the relatonshps that ultmately constran partcle representatons tend to make the tracker more prone to drftng away from the target, especally n cases of sgnfcant changes n appearance. In ths paper, we propose a computatonally effcent mult-task sparse learnng approach for vsual trackng n a partcle flter framework. Here, learnng the representaton of each partcle s vewed as an ndvdual task. Inspred by the above work, the next target state s selected to be the partcle that has the hghest smlarty wth a dctonary of target templates. Unlke prevous methods, we explot smlartes among partcles and, therefore, seek an accurate, jont representaton of these partcles w.r.t. the dctonary. In our mult-task approach, partcle representatons are jontly sparse only a few (but the same) dctonary templates should be used to represent all the partcles at each frame. As opposed to sparse codng based trackers (Me and Lng 20; Me et al. 20; Lu et al. 200, 20) that handle partcles separately, our use of jont sparsty ncorporates the benefts of a sparse partcle representaton (e.g. partal occluson handlng), whle respectng the underlyng relatonshp between partcles, whch nherently yelds a tracker that s more robust aganst varous sources of appearance change. Therefore, we propose a mult-task formulaton (denoted as Mult-Task Trackng or MTT) for the robust object trackng problem. We explot nterdependences among the appearances of dfferent partcles to obtan ther representatons jontly. Jont sparsty s mposed on partcle representatons through an l p,q mxed-norm regularzer, whch s optmzed usng an Accelerated Proxmal Gradent (APG) method that guarantees fast convergence. In fact, jont sparsty can be vewed as a global form of structural regularzaton that nfluences all partcle representatons together. Furthermore, to extend the MTT framework to enforce local structure, we observe that some tasks (partcle representatons) are often more closely related and more lkely to share common relevant covarates than other tasks. Therefore, we expand the MTT framework to consder parwse structural correlatons between partcles (e.g. spatal smoothness of representaton) and denote the novel framework as Structured Mult-Task Trackng abbrevated as S-MTT. A prelmnary conference verson of ths work can be referred to n Zhang et al. (202b). Contrbutons: The contrbutons of ths work are three-fold.. We propose a mult-task sparse learnng method for object trackng, whch s a robust sparse codng method that mnes relatonshps between dfferent tasks to obtan better trackng results than learnng each task ndvdually. Ths s done by explotng both global and local structure among tasks. To the best of our knowledge, ths s the frst work to explot mult-task learnng n object trackng. 2. We show that the popular L tracker (Me and Lng 20) s a specal case of the proposed MTT framework. 3. Snce we learn partcle representatons jontly, we can solve the S-MTT and MTT problems effcently usng an APG method. Ths makes our trackng method computatonally attractve n general and sgnfcantly faster than the tradtonal L tracker n partcular. The rest of the paper s organzed as follows. In Sect. 2, we summarze the works most related to ours. The partcle flter algorthm s revewed n Sect. 3. Secton 4 gves a detaled descrpton of the proposed trackng approach, wth

3 Int J Comput Vs (203) 0: the optmzaton detals presented n Sect Expermental results are reported and analyzed n Sect. 5. We conclude the paper n Sect Related Work Vsual trackng s an mportant topc n computer vson and t has been studed for several decades. There s extensve lterature on vsual object trackng. In what follows, we only brefly revew nomnal trackng methods and those that are the most related to our own. We focus specfcally on trackng methods that use partcle flters and sparse representaton, as well as, general mult-task learnng methods. For a more thorough survey of trackng methods, we refer the readers to Ylmaz et al. (2006). 2. Object Trackng In general, object trackng methods can be categorzed as ether generatve or dscrmnatve. 2.. Generatve Trackers These methods adopt an appearance model to descrbe the target observatons. Here, the am of trackng s to search for the target locaton that has the most smlar appearance to the generatve model. Examples of generatve methods are egentracker (Black and Jepson 998), mean shft tracker (Comancu et al. 2003), appearance model based tracker (Jepson et al. 2003), context-aware tracker (Yang et al. 2009), fragment-based tracker (Frag) (Adam et al. 2006), ncremental tracker (IVT) (Ross et al. 2008), and VTD tracker (Kwon and Lee 200). In Black and Jepson (998), a vew-based representaton s used for trackng rgd and artculated objects. Ths approach bulds on and extends work on egenspace representatons, robust estmaton technques, and parameterzed optcal flow estmaton. The mean shft tracker (Comancu et al. 2003) s a popular mode-fndng method, whch successfully copes wth camera moton, partal occlusons, clutter, and target scale varatons. In Jepson et al. (2003), a robust and adaptve appearance model s learned for moton-based trackng of natural objects. The model adapts to slowly changng object appearance, and t mantans an acceptable measure of stablty n the observed mage structure durng trackng. Moreover, the context-aware tracker (Yang et al. 2009) focuses on an object s context for robust vsual trackng. Specfcally, ths method ntegrates nto the trackng process a set of auxlary objects that are automatcally dscovered n the vdeo va data mnng technques. Furthermore, the trackng method proposed n Ross et al. (2008) ncrementally learns a low-dmensonal subspace representaton, and effcently adapts to onlne changes n target appearance. To adapt to varatons n appearance (e.g. due to changes n llumnaton and pose), the appearance model can be dynamcally updated. The Frag tracker (Adam et al. 2006) ams to solve partal occluson wth a representaton based on hstograms of local patches. The trackng task s carred out by accumulatng votes from matchng local patches usng a template. However, ths template s not updated and, thus, t s not expected to handle changes n object appearance that can be due to scale and shape varatons. In the IVT tracker (Ross et al. 2008), an adaptve appearance model s constructed to account for appearance varaton due to rgd or lmted deformable moton. Although t has been shown to perform well when target objects undergo lghtng and pose varaton, IVT s less effectve n handlng heavy occluson or non-rgd dstorton as a result of the adopted holstc appearance model. Fnally, the VTD tracker (Kwon and Lee 200) effectvely extends the conventonal partcle flter framework wth multple moton and observaton models to account for appearance varaton caused by changes n pose, lghtng, and scale as well as partal occluson. Nevertheless, as a result of the adopted generatve representaton scheme, ths tracker s not equpped to dstngush between the target and ts context (background) Dscrmnatve Trackers These methods formulate vsual object trackng as a bnary classfcaton problem, whch seeks the target locaton that can best separate the target from ts background. Examples of dscrmnatve methods are on-lne boostng (OAB) (Grabner et al. 2006), sem-onlne boostng (Grabner et al. 2008), ensemble trackng (Avdan 2005), co-tranng trackng (Lu et al. 2009), onlne mult-vew forests for trackng (Lestner et al. 200), adaptve metrc dfferental trackng (Jang et al. 20) and onlne multple nstance learnng trackng (Babenko et al. 2009). In the OAB tracker (Grabner et al. 2006), onlne AdaBoost s adopted to select useful features for object trackng. Its performance s affected by background clutter, and the tracker can easly drft. The ensemble tracker (Avdan 2005) formulates the trackng task as a pxel based bnary classfcaton problem. Although ths method s able to dfferentate between target and background, the pxel-based representaton s rather lmted and thereby constrans ts ablty to handle heavy occluson and clutter. In the MIL tracker (Babenko et al. 2009), the multple nstance learnng method s extended to an onlne settng for object trackng. Whle t s capable of reducng tracker drft, ths method s unable to handle large nonrgd shape deformaton. In ensemble trackng (Avdan 2005), a feature vector s constructed for every pxel n the reference mage and an adaptve ensemble of classfers s traned to separate pxels that belong to the object from pxels that belong to the background. In Collns and Lu (2003), a target confdence map

4 370 Int J Comput Vs (203) 0: s bult by fndng the most dscrmnatve RGB color combnaton n each frame. Moreover, a hybrd approach that combnes a generatve model and a dscrmnatve classfer s proposed n Yu et al. (2008) to capture appearance changes and allow reacquston of an object after total occluson. Global mode seekng can be used to detect and rentalze the tracked object after total occluson (Yn and Collns 2008). Yet another approach uses mage fuson to determne the most dscrmnatve appearance model and then a generatve approach for dynamc target updates (Blasch and Kahler 2005). 2.2 Partcle Flters for Object Trackng Partcle flters (also known as condensaton or sequental Monte Carlo models) were ntroduced to vsual trackng (Isard and Blake 998). Snce then and over the last decade, t has become a popular trackng framework due prmarly to ts excellent performance n the presence of nonlnear target moton and to flexblty to dfferent object representatons (Wu and Huang 2004). In general, when more partcles are sampled and a better target representaton s constructed, partcle flter based trackng algorthms are more lkely to perform relably n cluttered and nosy envronments. However, the computatonal cost of partcle flter trackers tends to ncrease lnearly wth the number of partcles. Consequently, researchers have proposed varous means of speedng up the partcle flter framework. In Yang et al. (2005), tracked objects are descrbed usng color and edge orentaton hstogram features, and the observaton lkelhood s computed n a coarse-to-fne manner, whch allows the computaton to quckly focus on the more promsng regons. In Khan et al. (2004), subspace representatons are used n a partcle flter for trackng. Ths tracker s made effcent by applyng Rao-Blackwellzaton to the subspace coeffcents n the state vector. In Zhou et al. (2004), the number of partcle samples s adjusted accordng to an adaptve nose component. 2.3 Sparse Representaton for Object Trackng Recently, sparse representaton has been ntroduced to partcle flter based object trackng and has yelded noteworthy performance (Me and Lng 20; Me et al. 20; Lu et al. 200; L et al. 20; Bao et al. 202; Zhang et al. 202a). In Me and Lng (20), a trackng canddate s represented as a sparse lnear combnaton of object templates and trval templates. For each partcle, sparse representaton s computed by solvng a constraned l mnmzaton problem wth nonnegatvty constrants, thus, solvng the nverse ntensty pattern problem durng trackng. Although ths method yelds good trackng performance, t comes at the computatonal expense of multple l mnmzaton problems that are ndependently solved. In fact, the computatonal cost grows lnear wth the number of partcle samples. In Me et al. (20), an effcent L tracker wth mnmum error bound and occluson detecton s proposed. The mnmum error bound s quckly calculated from a lnear least squares equaton, and serves as a gude for partcle resamplng n a partcle flter framework. Wthout loss of precson durng resamplng, most of the rrelevant samples are removed before solvng the computatonally expensve l mnmzaton functon. In Lu et al. (200), dynamc group sparsty s ntegrated nto the trackng problem and hgh dmensonal mage features are used to mprove trackng robustness. In L et al. (20), dmensonalty reducton and a customzed orthogonal matchng pursut algorthm are adopted to accelerate the L tracker (Me and Lng 20). In Bao et al. (202), APG based soluton s used to mprove the L tracker (Me and Lng 20). In Zhang et al. (202a), low-rank sparse learnng s adopted to consder the correlatons among partcles for robust trackng. Inspred by these works, we should solve two problems, whch are how to consder the correlatons among partcles and how to make the tracker be fast. Therefore, we propose the S-MTT trackng method. 2.4 Mult-Task Learnng Mult-task learnng (MTL, Chen et al. 2009) has recently receved much attenton n machne learnng and computer vson. It captalzes on shared nformaton between related tasks to mprove the performance of each ndvdual task, and t has been successfully appled to popular vson problems such as mage classfcaton [(Yuan and Yan 200) and mage annotaton (Quatton et al. 2009]. The underlyng assumpton behnd many MTL algorthms s that the tasks are related. Thus, a key ssue les n how relatonshps between tasks are ncorporated n the learnng framework. Inspred by the above works, we want to mprove computatonal effcency and captalze on the nterdependence among partcle appearances (for addtonal robustness n trackng). To make ths come true, we propose a mult-task sparse representaton method for robust object trackng. 3 Partcle Flter The partcle flter (Doucet et al. 200) s a Bayesan sequental mportance samplng technque, whch recursvely approxmates the posteror dstrbuton usng a fnte set of weghted samples for estmatng the posteror dstrbuton of state varables characterzng a dynamc system. It provdes a convenent framework for estmatng and propagatng the posteror probablty densty functon of state varables regardless of the underlyng dstrbuton through a sequence of predcton and update steps. Let s t and y t denote the state varable descrbng the parameters of an

5 Int J Comput Vs (203) 0: object at tme t (e.g. locaton or moton parameters) and ts observaton respectvely. The predcton stage uses the probablstc system transton model p(s t s t ) to predct the posteror dstrbuton of s t gven all avalable observatons y :t = {y, y 2,...,y t } up to tme t s computed n Eq. (). p(s t y :t ) = p(s t s t )p(s t y :t )ds t () At tme t, the observaton y t s avalable and the state vector s updated usng Bayes rule, as n Eq. (2), where p(y t s t ) denotes the observaton lkelhood. p(s t y :t ) = p(y t s t )p(s t y :t ) (2) p(y t y :t ) In the partcle flter framework, the posteror p(s t y :t ) s approxmated by a fnte set of n samples { s n t} (called partcles) wth mportance weghts w. The partcle samples s t = are drawn from an mportance dstrbuton q(s t s :t, y :t ) and the mportance weghts are updated accordng to Eq. (3). wt = wt p ( y t s ( t) p s t s ) t (3) q(s t s :t, y :t ) To avod degeneracy, partcles are resampled accordng to the mportance weghts so as to generate a set of equally weghted partcles. For smplcty, n the case of the bootstrap flter (Doucet et al. 200), we set q(s t s :t, y :t ) = p(s t s t ), so that the weghts are updated by the observaton lkelhood p(y t s t ). Partcle flters have been used extensvely n object trackng (Ylmaz et al. 2006). In ths paper, we also employ partcle flters to track the target object. Smlar to Me and Lng (20), we assume an affne moton model between consecutve frames. Therefore, the state varable s t conssts of the sx parameters of the affne transformaton (2D lnear transformaton and a 2D translaton). By applyng an affne transformaton usng s t as parameters, we crop the regon of nterest y t from the mage and normalze t to the sze of the target templates n our dctonary. The state transton dstrbuton p(s t s t ) s modeled to be Gaussan wth the dmensons of s t assumed ndependent. The observaton model p(y t s t ) reflects the smlarty between a target canddate (partcle) and dctonary templates. In ths paper, p(y t s t ) s nversely proportonal to the reconstructon error obtaned by lnearly representng y t usng the dctonary of templates. 4 Structured Mult-Task Trackng (S-MTT) In ths secton, we gve a detaled descrpton of our partcle flter based trackng method that makes use of structured mult-task learnng to represent partcle samples. 4. Structured Mult-Task Representaton of Partcles In the MTL framework, tasks that share dependences n features or learnng parameters are jontly solved n order to captalze on ther nherent relatonshps. Many works n ths doman have shown that MTL can be appled to classcal problems [(e.g. mage annotaton (Quatton et al. 2009) and mage classfcaton (Yuan and Yan 200)] and outperform state-of-the-art methods that resort to ndependent learnng. In ths paper, we formulate object trackng as an MTL problem, where learnng the representaton of a partcle s vewed as a sngle task. Usually, partcle representatons n trackng are computed separately (e.g. L tracker, Me and Lng 20). In ths paper, we show that by representng partcles jontly n an MTL settng, trackng performance and trackng speed can be sgnfcantly mproved. In our partcle flter based trackng method, partcles are randomly sampled around the current state of the tracked object accordng to a zero-mean Gaussan dstrbuton. At nstance t, we consder n partcle samples, whose observatons (pxel color values) n the tth frame are denoted n matrx form as: X = [x, x 2,...,x n ], where each column s a partcle n R d. In the noseless case, each partcle x s represented as a lnear combnaton z of templates that form a dctonary D t = [d, d 2,...,d m ], such that X = D t Z. The dctonary columns comprse the templates that wll be used to represent each partcle. These templates nclude vsual observatons of the tracked object (called target templates) possbly under a varety of appearance changes. Snce our representaton s constructed at the pxel level, msalgnment between dctonary templates and partcles mght lead to degraded performance. To allevate ths problem, one of two strateges can be employed. () D t can be constructed from a dense samplng of the target object, whch can also nclude transformed versons of these samples. () Columns of X can be algned to columns of D t as n Peng et al. (202) to solve the geometrc transformaton. In ths paper, we employ the frst strategy, whch leads to a larger m but a lower overall computatonal cost. We denote D t wth a subscrpt because the dctonary templates wll be progressvely updated to ncorporate varatons n object appearance due to changes n llumnaton, vewpont, etc. Our dctonary update scheme s adopted from the work n Me and Lng (20), but for completeness, we present ts detals n Sect In many vsual trackng scenaros, target objects are often corrupted by nose or partally occluded. As n Me and Lng (20), ths nose can be modeled as sparse addtve nose that can take on large values anywhere n ts support. Therefore, n the presence of nose, we can stll represent the partcle observatons X as a lnear combnaton of templates, where the dctonary s augmented wth trval (or occluson) templates I d (dentty matrx of R d d ), as shown n Eq. (4). The representaton error e of partcle usng dctonary D t s the

6 372 Int J Comput Vs (203) 0: th column n E. The nonzero entres of e ndcate the pxels n x that are corrupted or occluded. The nonzero support of e can be dfferent among partcles and s unknown a pror. [ ] Z X = [D t I d ] X = BC (4) E 4.. Imposng Jont Sparsty va l p,q Mxed-Norm Because most partcles are densely sampled around the current target state, ther representatons wth respect to D t wll be sparse (few templates are requred to represent them) and smlar to each other (the support of partcle representatons s smlar) n general. These two propertes culmnate n ndvdual partcle representatons (sngle tasks) beng jontly sparse. In other words, jont sparsty wll encourage all partcle representatons to be ndvdually sparse and share the same (few) dctonary templates that relably represent them. Ths yelds a more robust representaton for the ensemble of partcles. In fact, jont sparsty has been recently employed to address MTL problems (Quatton et al. 2009; Yuan and Yan 200). A common technque to explctly enforce jont sparsty n MTL s the use of sparsty-nducng norms to regularze the parameters shared among the ndvdual tasks. In ths paper, we nvestgate the use of convex l p,q mxed norms (.e. p and q ) to address the problem of MTL n partcle flter based trackng (denoted as MTT). Therefore, we need to solve the convex optmzaton problem n Eq. (5), where λ s a tradeoff parameter between relable reconstructon and jont sparsty regularzaton. mn C 2 X BC 2 F + λ C p,q (5) Note that we defne C p,q as n Eq. (6), where C p s the l p norm of C and C s the th row of matrx C. C p,q = ( m+d ) /q ( C p ) q = As n Eq. (5), gven a dctonary B, for the n tasks X = [x, x 2,...,x n ] (each column s a partcle), we am to dscover, across these n tasks, a few common templates that are the most useful for partcle representaton. In ths settng, the constrant of jont sparsty across dfferent tasks s valuable snce dfferent tasks may favor dfferent sparse reconstructon coeffcents, yet the jont sparsty enforces the robustness n coeffcent estmaton. Moreover, jont sparsty explots correlatons among dfferent tasks to obtan better generalzaton performance as compared to learnng each task ndvdually. The goal of jont sparsty s acheved by mposng an l p,q mxed-norm penalty on the reconstructon coeffcents. In fact, jont sparsty can be vewed as a global form of structural regularzaton that nfluences all partcle (6) representatons together. In the next secton, we extend the MTT framework to enforce local structure as well Imposng Structure va Graph Regularzaton Enforcng jont sparsty usng the l p,q mxed-norm explots the global structure nherent to partcle representatons n any gven frame. However, n partcle based MTT, some of the tasks are often more closely related and more lkely to share common relevant covarates than other tasks. Ths nduces another layer of structure, whch affects partcle representatons locally. Therefore, we expand the MTT framework to consder parwse structural correlatons between partcles (e.g. spatal smoothness of representaton) and denote the novel framework as Structured Mult-Task Trackng abbrevated as S-MTT. The S-MTT formulaton can be vewed as a generalzaton of MTT, snce local structural nformaton endows MTT wth another layer of robustness n trackng. In fact, our experments show that ncorporatng such local structural nformaton sgnfcantly mproves the performance of MTT. We assume that the learned representatons C can be related through parwse nteractons, whch are consdered local structural prors to C. In ths paper, we use these structural prors to enforce spatal smoothness among partcle representatons. In other words, partcles that are spatally located near each other n the same frame should have smlar representatons. In general, hgher order relatonshps can be added to the S-MTT framework; however, such relatonshps sgnfcantly ncrease the complexty of learnng the optmal C. Therefore, we restrct ourselves to parwse relatonshps that are defned as edges n a graph, whose nodes consttute the partcle representatons (.e. columns of C). As such, we ncorporate these local parwse relatonshps nto Eq. (5)by addng a sutable graph regularzaton term. To do ths, we nvestgate the use of the well-known and wdely used, normalzed graph smoothness regularzer, whch s a weghted sum of parwse dstances between the normalzed representatons n C. The weght of each dstance term reflects how strongly the correspondng parwse relatonshp should be enforced. Snce the normalzed verson of ths regularzer has been shown to produce better and more stable results n many learnng problems, we prefer t over ts unnormalzed counterpart (Zhu 2008). In Eq. (7), we formalze the graph regularzer, denoted as G(C). Here, we defne a symmetrc weght matrx W R n n + that descrbes the smlarty measure between every par of partcle representatons. In fact, W represents the weghts of all edges n the graph. Therefore, W j s the smlarty measure between the th partcle c and the jth partcle c j. Here, we denote ˆd = n = W j, the sum of the elements of the th row of W, and ˆD = dag( ˆd, ˆd 2,..., ˆd n ). In graph

7 Int J Comput Vs (203) 0: Fg. 2 (Color onlne) Schematc example of the L 2 tracker. The representaton C of all partcles X w.r.t. dctonary B (set of target and occluson templates) s learned by solvng Eq. (9) wth p = 2 and q =. Notce that the columns of C are jontly sparse,.e. a few (but the same) dctonary templates are used to represent all the partcles together. The partcle x s selected among all other partcles as the trackng result, snce x s represented the best by object templates only theory, ˆD s called the degree of the graph. In the last step, we denote L = ˆD W as the Laplacan of the graph and Tr(A) as the trace of matrx A. G(C) = 2 = Tr n n W j c c j ˆd ˆd j 2 ( C ˆLC ) T where ˆL = ˆD 2 L ˆD 2 (7) = j= We defne W j to decrease exponentally wth the dstance between the spatal locatons of the th and jth partcles, as n Eq. (8). Here, we denote l as the 2D locaton of the center of the th partcle n the current frame and δ as a smoothng factor. The δ s the average value of all dstances between l and l j. Note that other smlarty measures can be used to descrbe W j ncludng the PASCAL overlap score. ( W j = exp l ) l j 2 2 2δ 2 (8) Therefore, the partcle representatons C can be computed by solvng Eq. (9), whch smply adds the graph regularzer G(C) to Eq. (5). Here, ˆL s the normalzed Laplacan matrx, and λ and λ 2 are two parameters that quantfy the tradeoff between local and global structural regularzaton. mn C 2 X BC 2 F + λ 2 Tr(C ˆLC T ) + λ 2 C p,q (9) Dscusson To encourage a sparse number of dctonary templates to be selected for all partcles, we restrct our choce of l p,q mxed norms to the case of q=, thus, C p, = m+d = C p. Among ts convex optons, we select three popular and wdely studed l p, norms: p {, 2, }. The S-MTT objectve n Eq. (9) s composed of a convex 2 quadratc term and a non-smooth regularzer, and thus we conventonally adopt the APG method (Tseng 2008) for optmzaton. The soluton to Eq. (9) for these choces of p and q s descrbed n Sect Note that each choce of p yelds a dfferent S-MTT tracker, whch we wll denote as the L p tracker. To dscrmnate between S-MTT and MTT trackers, we denote the MTT tracker (.e. when λ = 0nEq.(9)) usng the l p, mxed norm as the L p tracker. In Sect. 5.6, we show that L p trackers can lead to sgnfcant mprovement n trackng performance over L p trackers, n general. In Fg. 2, we present an example of how the L 2 tracker works. Gven all partcles X (sampled around the tracked car) and based on a dctonary B, we learn the representaton matrx C by solvng Eq. (9). Note that smaller values are darker n color. Clearly, columns of C are jontly sparse,.e. a few (but the same) dctonary templates are used to represent all the partcles together. Partcle x s chosen as the current trackng result y t because t has the smallest reconstructon error w.r.t. to the target templates D t. Snce partcles x j and x k are msalgned The score s the rato of the ntersecton to the unon of two boundng boxes. In our case, t would be the rato of the ntersecton of the ground truth and the predcted tracks to ther unon n each frame. 2 Snce the degree matrx ˆD s dagonal and non-negatve and snce the Laplacan L of any graph s postve sem-defnte, the normalzed Laplacan ˆL s postve sem-defnte. Thus, G(C) s convex n C.

8 374 Int J Comput Vs (203) 0: versons of the car, they are not represented well by D t (.e. z j and z k have small values). Ths precludes the tracker from drftng nto the background. As for Eq. (6), when p = q =, we note that C, = m+d = C = n c, (0) = where c and C represent the th column and th row n C respectvely. Ths equvalence property between rows and columns (.e. the sum of the l p norms of rows and that of columns are the same) only occurs when p =. In ths case, MTT (Eq. (5)) and S-MTT (Eq. (9) when λ = 0) become equvalent to Eq. (0). Ths optmzaton problem s no longer an MTL problem, snce the n representaton tasks are unrelated and are solved separately. Interestngly, Eq. (0) s the same formulaton used n the popular L tracker (Me and Lng 20), whch can be vewed as a specal case of our proposed famly of S-MTT algorthms, namely the L tracker. In fact, usng the optmzaton technque n Sect. 4.3, we observe that our L mplementaton s two orders of magntude faster than the tradtonal L tracker. mn c,...,c n n j= ( ) 2 x j Bc j λ 2 c j () A detaled overvew of the proposed S-MTT trackng method s shown n Algorthm. Based on the prevous state s t, we can use the mportance samplng approach (Isard and Blake 998) to obtan new partcles and crop the correspondng mage patches to obtan ther observatons X. Then, we learn ther representatons C by solvng Eq. (9), to be shown n Sect The partcle that has the smallest reconstructon error s selected to be the current trackng result. Fnally, the dctonary templates n D t are updated adaptvely, to be shown n Sect Solvng Eq. (9) In S-MTT, we need to solve Eq. (9) when q= and p {, 2, }. Clearly, the overall objectve s non-smooth due to the non-smoothness of the l p, mxed norm. If straghtforward frst-order subgradent methods were used to solve the S-MTT problem, only sublnear convergence (.e. convergence to an ɛ-accurate soluton n O ( ) ) ɛ teratons s 2 guaranteed. To obtan a better convergence rate, we explot recent developments n non-smooth convex optmzaton. The unpublshed manuscrpt by Nesterov (2007) consders the problem of mnmzng a convex objectve composed of a smooth convex term and a smple non-smooth convex term. Here, smple means that the proxmal mappng 3 of the nonsmooth term can be computed effcently. In ths case, an APG method can be devsed to solve the non-smooth convex program wth guaranteed quadratc convergence. Because of ts attractve convergence property, ths APG method has been extensvely used to effcently solve smooth convex optmzaton problems wth non-smooth norm regularzers (e.g. MTL problems, Chen et al. 2009). It should be noted that Beck and Teboulle (2009) ndependently proposed the ISTA algorthm for solvng lnear nverse problem wth the same convergence rate. Ths work was further extended to convex-concave optmzaton n Tseng (2008). In general, APG terates between updatng the current representaton matrx C (k) and an aggregaton matrx V (k). Each APG teraton conssts of two steps: () a generalzed gradent mappng step that updates C (k) keepng V (k) fxed, and (2) an aggregaton step that updates V (k) by lnearly combnng C (k+) and C (k). To ntalze V (0) and C (0), we set them to Gradent Mappng Step Gven the current estmate V (k), we obtan C (k+) by solvng Eq. (2), whch s nothng but the proxmal mappng of λ Y p,. Here, η s a small step parameter, and λ = ηλ 2. C (k+) = arg mn Y 2 Y H λ Y p,q, (2) The temporal parameter H s an η step from the current estmate V (k) along the negatve gradent of the smooth term n Eq. (9) and s calculated n Eq. (3). 3 The proxmal mappng of a non-smooth convex functon h(.) s defned as: prox h (x) = arg mn u ( h(u) + 2 u x 2 2).

9 Int J Comput Vs (203) 0: H = V (k) η s (k) [ ] = V (k) η B T BV (k) + λ V (k) ˆL B T X. (3) Takng jont sparsty nto consderaton and snce q =, Eq. (2) decouples nto (m + d) dsjont subproblems (one for each row vector C ), asshownneq.(4), where Y and H denote the th row of the matrx Y and H, respectvely. C (k+) = arg mn Y 2 Y H λ Y p, (4) Each subproblem s the proxmal mappng of the l p vector norm, whch s a varant of the vector projecton problem unto the l p ball. The soluton to each subproblem and ts tme complexty depends on p. In Sect , we provde the soluton of ths subproblem for popular l p norms, namely p {, 2, } Aggregaton Step In ths step, we construct a lnear combnaton of C (k) and C (k+) to update V (k+) as follows: V (k+) = C (k+) + α k+ ( α k ) ( C (k+) C (k)), (5) α k where α k s conventonally set to k+3 2. Our overall APG algorthm s summarzed n Algorthm 2. Note that convergence s acheved when the relatve change n soluton or objectve functon falls below a predefned tolerance. S λ (a) = sgn(a) max(0, a λ). Ths operator s appled elementwse on H. C (k+) = arg mn Y 2 Y H λ Y = S λ (H ) (6) For p=2: Followng Chen et al. (2009), the update C (k+) s computed n closed form n Eq. (7). C (k+) = arg mn 2 Y H λ Y 2 ( ) = max 0, λ H 2 H (7) Y For p= The update C (k+) s computed va a projecton onto the l ball that can be done by a smple sortng procedure (Chen et al. 2009). In ths case, the soluton s gven n Eq. (8). ( C (k+) = max 0, λ ) a, (8) H where the jth element of vector a s defned as a j = sgn(h j ) mn H j, ĵ u r λ ĵ j =,...,n. r= The temporary parameters u r and ĵ are obtaned as follows. We set u j = C j j and sort these values n decreasng order: u u 2 u n. Then, we set ĵ = max{ j : j r= (u r u j )< λ}. Here, C j and H j denote the element n the th row and jth column of matrces C and H, respectvely Computatonal Complexty of Algorthm Solvng Eq. (4) for p {, 2, } The soluton to Eq. (4) depends on the value of p. For p {, 2, }, we show that ths soluton has a closed form. Note that these solutons can be extended to l p norms beyond the three that we consder here. For p=: The soluton of Eq. (4) s equvalent to the L tracker soluton, as shown n Eq. (). Here, the update C (k+) s computed n closed form n Eq. (6), where S λ s the soft-thresholdng operator defned n scalar form as The computatonal complexty of each teraton n Algorthm 2 s domnated by the gradent computaton n Step 3 and the update of C (k+) n Step 4. Explotng the structure of B, the complexty of Step 3 s O(mnd), whle that of Step 4 depends on p. The latter complexty s O(n(m+d)) for p {, 2} and O(n(m + d)( + log n)) for p =. Snce d m, the perframe complexty of the proposed S-MTT and MTT trackers s O(mndɛ 2 ), where the number of teratons s O(ɛ 2 ) for an ɛ-accurate soluton. In comparson, the tme complexty of the L tracker (that s equvalent to our L tracker) s at least O ( nd 2). In our experments, we observe that the L tracker (that uses APG) s two orders of magntude faster than the L tracker (that solves n Lasso problems ndependently) n general. For example, when m =, n = 200, and d = 32 32, the average per-frame run-tme for L and L are. and

10 376 Int J Comput Vs (203) 0: s respectvely. Ths s on par wth the accelerated realtme mplementaton of the L tracker n L et al. (20). 4.4 Dctonary Update Target appearance remans the same only for a certan perod of tme, but eventually the object templates n D t are no longer an accurate representaton of the target s appearance. A fxed appearance template s prone to the trackng drft problem, snce t s ncapable of handlng appearance changes over tme. In ths paper, our dctonary update scheme s adopted from the work n Me and Lng (20). Each target template n D t s assgned a weght that s ndcatve of how representatve the template s. In fact, the more a template s used to represent trackng results, the hgher ts weght s. When D t cannot represent some partcles well (up to a predefned threshold), the target template wth the smallest weght s replaced by the current trackng result. To ntalze the m target templates, we sample equal-szed patches at and around the ntal poston of the target. In our experments, we shft the ntal boundng box by 3 pxels n each drecton, thus, resultng n m = object templates as n Me and Lng (20). All dctonary templates are normalzed. 5 Expermental Results In ths secton, we present extensve expermental results that valdate the effectveness and effcency of our proposed S-MTT and MTT methods. We also conduct a thorough comparson between our proposed trackers and state-of-the-art trackng methods where applcable. The expermental results are organzed as follows. In Sect. 5., we gve an overvew of the vdeo dataset that we test our S-MTT/MTT trackers. Secton 5.2 enumerates the sx state-of-the-art trackers that we compare aganst. The mplementaton detals of our proposed trackers are hghlghted n Sect In Sect. 5.4, we report the average runtme of the S-MTT trackers wth varyng parameter settngs, as well as, compare t to the runtme of the L tracker. Qualtatve and quanttatve comparsons between S-MTT/MTT and the state-of-the-art trackers are made n Sects. 5.5 and 5.6 respectvely. The comparatve results demonstrate that our method provdes more robust and accurate trackng results than the state-of-the-art. Several vdeos for the trackng results can be found n the supplementary materal. The vdeos and codes wll be made avalable on our project webste Datasets To evaluate our proposed trackers, we comple a set of 5 challengng trackng sequences (denoted as car4, car, davd ndoor, sylv, trells70, grl, coke, faceocc2, shakng, football, snger, snger(low frame rate), skatng, skatng(low frame rate), soccer. The vdeo sequences car4, car, davd ndoor, sylv and trells70 can be downloaded from an onlne source. 5 The vdeo sequences grl, coke and faceocc2 can be downloaded from an onlne source. 6 The other vdeo sequences shakng, football, snger, snger(low frame rate), skatng, skatng(low frame rate) and soccer can be downloaded from an onlne source. 7 These vdeos are recorded n ndoor and outdoor envronments and nclude most challengng factors n vsual trackng: complex background, movng camera, fast movement, large varaton n pose and scale, occluson, as well as shape deformaton and dstorton (see Fgs. 3, 4). 5.2 Baselnes We compared the proposed algorthms (MTT and S-MTT) wth sx state-of-the-art vsual trackers: VTD tracker (Kwon and Lee 200), L tracker (Me and Lng 20), IVT tracker (Ross et al. 2008), MIL tracker (Babenko et al. 2009), Fragments-based tracker (Frag) (Adam et al. 2006), and OAB tracker (Grabner et al. 2006). We use the publcly avalable source codes or bnares provded by the authors themselves wth the same ntalzaton and parameter settngs to generate the comparatve results. In our experments, our proposed trackng methods use the same parameters for all the test sequences. 5.3 Implementaton Detals All our experments are done usng MATLAB R2008b on a 2.66 GHZ Intel Core2 Duo PC wth 6 GB RAM. The template sze d s set to half the sze of the target ntalzaton n the frst frame. Usually, d s n the order of several hundreds of pxels. For all experments, we model p (s t s t ) N (0, dag(σ )), where σ = [0.005, , , 0.005, 4, 4] T. We set the number of partcles n = 400, the total number of target templates m = [The templates are obtaned wth 3 pxels shft around the target poston as the same as the L tracker (Me and Lng 20)], and the number of occluson templates to d. In Algorthm 2, we set η = 0.0, λ =, λ (by cross-valdaton) to {0.0, 0.005, 0.2} for L 2, L and L respectvely, and λ to {0.005, 0.00, 0.2} for L 2, L and

11 Int J Comput Vs (203) 0: Table Average per-frame runtme (n seconds) of 4 trackers (L 2, L, L, and L ) wth varyng template szes d and number of partcles n n d L L 2 L L L L 2 L L L L 2 L L L L 2 L L L respectvely. Each tracker uses the same parameters for all vdeo sequences. In all cases, the ntal poston of the target s selected manually. 5.4 Computatonal Cost The popular L tracker that uses a smlar sparse representaton model for partcle appearance has shown to acheve better trackng performance than state-of-the-art trackers (Me and Lng 20). However, ts computatonal cost grows proportonally as the number of partcle samples n and template sze d ncrease. Due to the nherent smlarty between the L tracker and the proposed trackers (MTT and S-MTT), we compare ther average runtmes n Table. S-MTT and MTT has very smlar computatonal costs, so for smplcty, we just report the runtme results of S-MTT (L 2, L, L ) and L n Table. Based on the results, t s clear that our trackers are much more effcent than the L tracker. For example, when m =, n = 400, and d = 32 32, the average per-frame run-tme for L 2, L, L, and L trackers are 4.69, 5.39, 2.74, and s, respectvely. Interestngly, our L tracker, whch s smlar to L trackng but solved usng the APG method, s about 20 tmes faster than the L tracker. As we know, ncreasng n and d wll mprove trackng performance. For L trackng, the runtme cost ncreases dramatcally wth both n and d; however, ths ncrease s much more reasonable wth our trackers. Note that the computatonal complexty of S-MTT s derved n Sect Qualtatve Comparson The car4 sequence s captured n an open road scenaro. Trackng results at frames {20, 86, 235, 305, 466, 64} for all 2 methods are shown n Fg. 3a. The dfferent trackng methods are color-coded. OAB, Frag, and VTD start to drft from the target at frame 86, whle MIL starts to show some target drftng at frame 200 and fnally loses the target at frame 300. IVT and L track the target qute well. The target s successfully tracked throughout the entre sequence by our L, L 2, L, L, L 2, and L methods. In the car sequence, a car s drven nto a very dark envronment, whle beng vdeotaped from another movng car. Trackng results for frames {0, 0, 200, 250, 309, 393} are presented n Fg. 3b. Frag starts to drft around frame 60. Due to changes n lghtng, MIL starts to undergo target drft from frame 20. OAB and L methods start to fal n frame 284. IVT and VTD can track the target through the whole vdeo sequence; however, these tracks are not as robust or accurate as the proposed L 2, L, L 2 and L trackers. The coke sequence contans frequent occlusons and fast moton, whch cause moton blur. The S-MTT, MTT, L, OAB, and MIL trackers can track the target accurately almost throughout the entre sequence. The other trackers fals due to pose change and occluson as shown n Fg. 3c. In thedavd sequence, a movng face s tracked. The trackng results at frames {354, 423, 465, 502, 588, 760} are shown n Fg. 3d. Frag and VTD fal around frames 423 and 465 respectvely. OAB starts to drft at frame 550. MIL and L adequately track the face, but experence target drft, especally at frames 690 and 500, respectvely. The IVT, S-MTT and MTT methods track the movng face accurately. In the faceocc sequence, a movng face s tracked, whch can evaluate the robustness to occlusons of dfferent methods. The trackng results at frames {00, 23, 34, 474, 57, 865} are shown n Fg. 3e. Because there s only occluson by a book and no changes n llumnaton and moton, most of the methods can track the face accurately except OAB and MIL, whch encounter mnor drft. Results on the faceocc2 sequence are shown n Fg. 3f. Most trackers start drftng from the man s face when t s almost fully occluded by the book. Because the L, MTT and S-MTT methods explctly handle partal occlusons, and update the object dctonary progressvely, they handle the

12 378 Int J Comput Vs (203) 0: (g) football (f) faceocc2 (e) faceocc (d) davd ndoor (c) coke (b) car (a) car4 Fg. 3 (Color onlne) Trackng results of 2 trackers on 7 vdeo sequences denoted wth dfferent colors. Frame numbers are overlayed n red. See text for detals appearance changes n ths sequence very well and contnue trackng the target durng and after the occluson. The football sequence ncludes severe background clutter, whch s smlar n appearance to the tracked target. For the other methods, trackng drfts from the ntended object (helmet) to other smlar lookng objects n the vcnty. Ths s especally the case when the two football players collde at frame 362 (refer to Fg. 3g). The proposed trackers (such asl 2, L and L 2 ) overcome ths problem and successfully track the target because they explot structural relatonshps between partcle representatons. Fgure 4a shows trackng results for the grl sequence. Performance on ths sequence exemplfes the robustness of MTT to occluson (complete occluson of the grl s face as she swvels n the char) and large pose change (the face undergoes sgnfcant 3D rotaton). S-MTT, MTT and L are capable of trackng the target durng the entre sequence. Other trackers experence drft at dfferent nstances: Frag at frame 248, OAB and IVT at frame 436, and VTD at frame 477. In the onelsr sequence, the background color s smlar to the color of the woman s trousers, and the man s shrt and pants have a smlar color to the woman s coat. In addton, the woman undergoes partal occluson when the man n the scene walks behnd her. Some results are shown n Fg. 4b. Whle trackng the woman, IVT, MIL, Frag, OAB, and VTD start trackng the man when the woman s partally occluded

13 Int J Comput Vs (203) 0: (h) trells70 (g) sylv (f) soccer (e) skatng (d) snger (c) shakng (b) onelsr (a) grl Fg. 4 (Color onlne) Trackng results of 2 trackers on 8 vdeo sequences delneated by dfferent colors. Frame numbers are overlayed n red. See text for detals around frame 200, and are unable to recover from ths falure after that. The L tracker tracks the woman qute well. Compared wth other trackers, our L 2, L, L 2 and L trackers are more robust to the occluson. In addton, our L 2, L, L 2 and L trackers are much better than L, L, and L, whch demonstrate that mposng jont sparsty between partcle representatons s helpful for robust object trackng. In the shakng sequence, the tracked object s subject to changes n llumnaton and pose. Whle the stage lghtng condton s drastcally changed, and the pose of the object s severely vared due to head shakng, our method successfully tracks the object (refer to Fg. 4c). Compared wth L, L and L, L 2, L, L 2 and L perform better because ther jont sparse partcle representaton s more robust to rapd changes. Other methods (OAB, IVT, L, and Frag) fal

14 380 Int J Comput Vs (203) 0: to track the object when these changes occur. VTD and MIL methods can track the object qute well except for some errors around frame 60. The snger(l) sequence contans abrupt object moton wth sgnfcant llumnaton and scale changes, whch cause most of the trackers to drft as shown n Fg. 4d. S-MTT, MTT and VTD handle these changes well. Compared wth MTT (L 2, L and L ), S-MTT (L 2, L and L ) obtans much better performance, whch shows that harnessng local structure between partcle representatons s useful for object trackng. In the skatngl sequence, there are abrupt object moton, severe llumnaton and scale changes, vewpont changes and occlusons, whch lead most of the trackers to fal. Our proposed trackers (MTT and S-MTT) and VTD handle these changes well as shown n Fg. 4e. Note that, n the 353th frame n the Fg. 4e, our proposed trackers are slghtly better than the VTD method, whch s the most recent state-of-theart trackng method that can cope wth abrupt moton and appearance changes. Results on the soccer sequence are shown n Fg. 4f. They demonstrate how our proposed method outperforms most of the state-of-the-art trackers when the target s severely occluded by other objects. The L 2, L, L, L 2 and L methods accurately track the player s face despte scale and pose changes as well as occluson/nose from the confett ranng around hm. Other methods (IVT, L, L, OAB, MIL, and Frag) fal to track the object relably. The VTD tracker can track the target n ths sequence qute well. Results on the sylv sequence are shown n Fg. 4g. In ths sequence, a stuffed anmal s beng moved around, thus, leadng to challengng pose, lghtng, and scale changes. IVT fals around frame 623 as a result of a combnaton of pose and llumnaton change. The rest of the trackers are able to track the target throughout the sequence, although the Frag, MIL, VTD, OAB, L and L encounter mnor drft from the target. The trells70 sequence s captured n an outdoor envronment where lghtng condtons change drastcally. The vdeo s acqured when a person walks underneath a trells covered n vnes. As shown n Fg. 4h, the cast shadow changes the appearance of the target face sgnfcantly. Furthermore, the combned effects of pose and lghtng varatons along wth a low frame rate make vsual trackng extremely dffcult. Nevertheless, the L 2, L, L 2 and L trackers can follow the target accurately and robustly, whle the other trackng methods perform below par n ths case. VTD and Frag fal around frame 85. L starts drftng at frame 287, whle MIL and OAB fal at frame 323. IVT starts drftng at frame Quanttatve Comparson To gve a quanttatve comparson between the 2 trackers, we obtan ground truth for all 5 sequences. Here, we note that ground truth for some of the vdeo sequences s readly avalable. We manually label the other sequences. Tracker performance s evaluated accordng to the average per-frame dstance (n pxels) between the center of the trackng result and that of ground truth as used n Babenko et al. (2009) and Me and Lng (20). Clearly, ths dstance 8 should be small. In Fg. 5, we plot the dstance of each tracker over tme on 2 sequences for smplcty. From ths fgure, we see that MTT and S-MTT trackers consstently produce a smaller dstance than other trackers n general. Ths mples that MTT and S-MTT can accurately track the target despte severe occlusons, pose varatons, llumnaton changes, and abrupt motons. In Table 2, we show the average center dstance for each tracker over the 5 sequences. It s clear that the S-MTT and MTT methods are consstently better than the other trackers n most sequences. Among the MTT methods, L 2 outperforms L and L n general. For S-MTT method, L 2 are much better than L and L. In fact, except for the faceocc, football, shakng and skatngl sequences, n whch we obtan smlar results as IVT and VTD, the L 2 and L 2 trackers do outperform the other methods. Frag and L perform well under partal occluson but tend to fal under severe llumnaton and pose changes. The IVT tracker s hardly affected by changes n appearance except those due to llumnaton. OAB s effected by background clutter and easly drfts from the target. MIL performs well except when severe llumnaton changes force the tracker to drft nto the background. VTD tends to be robust aganst llumnaton change, but t cannot handle severe occlusons and vewpont changes adequately. Now, we compare the performance of S-MTT and MTT methods. Based on the results n Table 2, L 2 and L outperform L. Ths s due to the fact that the L tracker learns partcle representatons separately, whle L 2 and L captalze on dependences between dfferent partcles to obtan more robust jontly sparse representatons. These results demonstrate that t s useful for vsual trackng to mpose jont sparsty among partcle representatons. For S-MTT, the performance of the correspondng three trackers (L 2, L and L ) has smlar tendences as MTT. However, S-MTT s reasonably better than MTT. Ths valdates the mpact of usng local graph structure to regularze partcle representatons and yeld more robust object trackng. In addton, we compare MTT and S-MTT wth the L tracker, whch s the most related tracker to ours and has shown state-of-the-art performance (Me and Lng 20). Based on the results n Table 2, MTT(L 2 and L ) and S-MTT (L 2 and L ) outperform the L tracker. That s because L trackng represents partcles separately, whle the 8 Ths dssmlarty measure s used often to compare trackng performance. Other measures can be used, ncludng the PASCAL overlap score.

15 Int J Comput Vs (203) 0: Fg. 5 (Color onlne) Center dstance (n pxels) between trackng result and ground truth over tme for 2 trackers appled to 5 vdeo sequences proposed trackers captalze on dependences between dfferent partcle representatons to obtan a more robust jontly sparse representaton. Our results demonstrate that t s useful for vsual trackng to mne partcle relatonshps. L outperforms the L and L trackers, snce the L tracker makes use of the local graph structure. Moreover, n theory, the L

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Partial Occlusion Handling for Visual Tracking via Robust Part Matching

Partial Occlusion Handling for Visual Tracking via Robust Part Matching Partal Occluson Handlng for Vsual Trackng va Robust Part Matchng Tanzhu Zhang, Ku Ja, Changsheng Xu, Y Ma, Narendra Ahuja Insttute of Automaton, Chnese Academy of Scences, P. R. Chna Advanced Dgtal Scences

More information

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors

Online Detection and Classification of Moving Objects Using Progressively Improving Detectors Onlne Detecton and Classfcaton of Movng Objects Usng Progressvely Improvng Detectors Omar Javed Saad Al Mubarak Shah Computer Vson Lab School of Computer Scence Unversty of Central Florda Orlando, FL 32816

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

AGOOD appearance model is one of the most critical

AGOOD appearance model is one of the most critical 1510 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 24, NO. 5, MAY 2015 Robust Dscrmnatve Trackng va Landmark-Based Label Propagaton Yuwe Wu, Mngtao Pe, Mn Yang, Junsong Yuan, Member, IEEE, and Yunde Ja,

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking 2011 Internatonal Conference on Informaton and Intellgent Computng IPCSIT vol.18 (2011) (2011) IACSIT Press, Sngapore Robust Inler Feature Trackng Method for Multple Pedestran Trackng Young-Chul Lm a*

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Structure from Motion

Structure from Motion Structure from Moton Structure from Moton For now, statc scene and movng camera Equvalentl, rgdl movng scene and statc camera Lmtng case of stereo wth man cameras Lmtng case of multvew camera calbraton

More information

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Robust Mean Shift Tracking with Corrected Background-Weighted Histogram

Robust Mean Shift Tracking with Corrected Background-Weighted Histogram Robust Mean Shft Trackng wth Corrected Background-Weghted Hstogram Jfeng Nng, Le Zhang, Davd Zhang and Chengke Wu Abstract: The background-weghted hstogram (BWH) algorthm proposed n [] attempts to reduce

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET

APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET Jae-young Lee, Shahram Payandeh, and Ljljana Trajovć School of Engneerng Scence Smon Fraser Unversty 8888 Unversty

More information

Robust visual tracking based on Informative random fern

Robust visual tracking based on Informative random fern 5th Internatonal Conference on Computer Scences and Automaton Engneerng (ICCSAE 205) Robust vsual trackng based on Informatve random fern Hao Dong, a, Ru Wang, b School of Instrumentaton Scence and Opto-electroncs

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros.

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros. Fttng & Matchng Lecture 4 Prof. Bregler Sldes from: S. Lazebnk, S. Setz, M. Pollefeys, A. Effros. How do we buld panorama? We need to match (algn) mages Matchng wth Features Detect feature ponts n both

More information

APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET

APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET APPLICATION OF PREDICTION-BASED PARTICLE FILTERS FOR TELEOPERATIONS OVER THE INTERNET Jae-young Lee, Shahram Payandeh, and Ljljana Trajovć School of Engneerng Scence Smon Fraser Unversty 8888 Unversty

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

A Bilinear Model for Sparse Coding

A Bilinear Model for Sparse Coding A Blnear Model for Sparse Codng Davd B. Grmes and Rajesh P. N. Rao Department of Computer Scence and Engneerng Unversty of Washngton Seattle, WA 98195-2350, U.S.A. grmes,rao @cs.washngton.edu Abstract

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

Towards Semantic Knowledge Propagation from Text to Web Images

Towards Semantic Knowledge Propagation from Text to Web Images Guoun Q (Unversty of Illnos at Urbana-Champagn) Charu C. Aggarwal (IBM T. J. Watson Research Center) Thomas Huang (Unversty of Illnos at Urbana-Champagn) Towards Semantc Knowledge Propagaton from Text

More information

Learning a Class-Specific Dictionary for Facial Expression Recognition

Learning a Class-Specific Dictionary for Facial Expression Recognition BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 4 Sofa 016 Prnt ISSN: 1311-970; Onlne ISSN: 1314-4081 DOI: 10.1515/cat-016-0067 Learnng a Class-Specfc Dctonary for

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Multi-view 3D Position Estimation of Sports Players

Multi-view 3D Position Estimation of Sports Players Mult-vew 3D Poston Estmaton of Sports Players Robbe Vos and Wlle Brnk Appled Mathematcs Department of Mathematcal Scences Unversty of Stellenbosch, South Afrca Emal: vosrobbe@gmal.com Abstract The problem

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Classification / Regression Support Vector Machines

Classification / Regression Support Vector Machines Classfcaton / Regresson Support Vector Machnes Jeff Howbert Introducton to Machne Learnng Wnter 04 Topcs SVM classfers for lnearly separable classes SVM classfers for non-lnearly separable classes SVM

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

High resolution 3D Tau-p transform by matching pursuit Weiping Cao* and Warren S. Ross, Shearwater GeoServices

High resolution 3D Tau-p transform by matching pursuit Weiping Cao* and Warren S. Ross, Shearwater GeoServices Hgh resoluton 3D Tau-p transform by matchng pursut Wepng Cao* and Warren S. Ross, Shearwater GeoServces Summary The 3D Tau-p transform s of vtal sgnfcance for processng sesmc data acqured wth modern wde

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Object Contour Tracking Using Multi-feature Fusion based Particle Filter

Object Contour Tracking Using Multi-feature Fusion based Particle Filter Object Contour Tracng Usng Mult-feature Fuson based Partcle Flter Xaofeng Lu 1,3, L Song 1,2, Songyu Yu 1, Nam Lng 2 Insttute of Image Communcaton and Informaton Processng 1 Shangha Jao Tong Unversty,

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

Recognizing Faces. Outline

Recognizing Faces. Outline Recognzng Faces Drk Colbry Outlne Introducton and Motvaton Defnng a feature vector Prncpal Component Analyss Lnear Dscrmnate Analyss !"" #$""% http://www.nfotech.oulu.f/annual/2004 + &'()*) '+)* 2 ! &

More information

Face Tracking Using Motion-Guided Dynamic Template Matching

Face Tracking Using Motion-Guided Dynamic Template Matching ACCV2002: The 5th Asan Conference on Computer Vson, 23--25 January 2002, Melbourne, Australa. Face Trackng Usng Moton-Guded Dynamc Template Matchng Lang Wang, Tenu Tan, Wemng Hu atonal Laboratory of Pattern

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Scale and Orientation Adaptive Mean Shift Tracking

Scale and Orientation Adaptive Mean Shift Tracking Scale and Orentaton Adaptve Mean Shft Trackng Jfeng Nng, Le Zhang, Davd Zhang and Chengke Wu Abstract A scale and orentaton adaptve mean shft trackng (SOAMST) algorthm s proposed n ths paper to address

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms

Categories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms 3. Fndng Determnstc Soluton from Underdetermned Equaton: Large-Scale Performance Modelng by Least Angle Regresson Xn L ECE Department, Carnege Mellon Unversty Forbs Avenue, Pttsburgh, PA 3 xnl@ece.cmu.edu

More information

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat

2 ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE (a) Ambgutes from person re dentfcaton n solaton (b) Assocatng groups of people may reduce ambgutes n mat ZHENG et al.: ASSOCIATING GROUPS OF PEOPLE 1 Assocatng Groups of People We-Sh Zheng jason@dcs.qmul.ac.uk Shaogang Gong sgg@dcs.qmul.ac.uk Tao Xang txang@dcs.qmul.ac.uk School of EECS, Queen Mary Unversty

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Three supervised learning methods on pen digits character recognition dataset

Three supervised learning methods on pen digits character recognition dataset Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm Resolvng Ambguty n Depth Extracton for Moton Capture usng Genetc Algorthm Yn Yee Wa, Ch Kn Chow, Tong Lee Computer Vson and Image Processng Laboratory Dept. of Electronc Engneerng The Chnese Unversty of

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Parallel Numerics. 1 Preconditioning & Iterative Solvers (From 2016)

Parallel Numerics. 1 Preconditioning & Iterative Solvers (From 2016) Technsche Unverstät München WSe 6/7 Insttut für Informatk Prof. Dr. Thomas Huckle Dpl.-Math. Benjamn Uekermann Parallel Numercs Exercse : Prevous Exam Questons Precondtonng & Iteratve Solvers (From 6)

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd

More information

Private Information Retrieval (PIR)

Private Information Retrieval (PIR) 2 Levente Buttyán Problem formulaton Alce wants to obtan nformaton from a database, but she does not want the database to learn whch nformaton she wanted e.g., Alce s an nvestor queryng a stock-market

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Adaptive Transfer Learning

Adaptive Transfer Learning Adaptve Transfer Learnng Bn Cao, Snno Jaln Pan, Yu Zhang, Dt-Yan Yeung, Qang Yang Hong Kong Unversty of Scence and Technology Clear Water Bay, Kowloon, Hong Kong {caobn,snnopan,zhangyu,dyyeung,qyang}@cse.ust.hk

More information

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS Po-Lun La and Alper Ylmaz Photogrammetrc Computer Vson Lab Oho State Unversty, Columbus, Oho, USA -la.138@osu.edu,

More information

Video Object Tracking Based On Extended Active Shape Models With Color Information

Video Object Tracking Based On Extended Active Shape Models With Color Information CGIV'2002: he Frst Frst European Conference Colour on Colour n Graphcs, Imagng, and Vson Vdeo Object rackng Based On Extended Actve Shape Models Wth Color Informaton A. Koschan, S.K. Kang, J.K. Pak, B.

More information