IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1

Size: px
Start display at page:

Download "IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1"

Transcription

1 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 Adapive Appearance Model and Condensaion Algorihm for Robus Face Tracking Yui Man Lui, Suden Member,, J. Ross Beveridge, Member,, and L. Darrell Whiley Absrac We presen an adapive framework for condensaion algorihms in he conex of human-face racking. We aack he face racking problem by making facored sampling more efficien and appearance updae more effecive. An adapive affine cascade facored sampling sraegy is inroduced o sample he parameer space such ha coarse face locaions are locaed firs, followed by a fine facored sampling wih a small number of paricles. In addiion, he local lineariy of an appearance manifold is used in conjuncion wih a new crierion o selec a angen plane for updaing an appearance in face racking. Our proposed mehod seeks he bes linear variey from he seleced angen plane o form a reference image. We demonsrae he effeciveness and efficiency of he proposed mehod on a number of challenging videos. These es video sequences show ha our mehod is robus o illuminaion, appearance, and pose changes, as well as emporary occlusions. Quaniaively, our mehod achieves he average roo-mean-square error a 4.98 on he well-known dudek video sequence while mainaining a proficien speed a 8.74 f/s. Finally, while our algorihm is adapive during execuion, no raining is required. Index Terms Adapive appearance model, adapive condensaion algorihm, face racking, angen-plane selecion. I. INTRODUCTION OBJECT racking is an acive compuer vision research opic [1]. Among many visual objecs, human faces are ofen he subjec of ineres. Tracking human faces is paricularly imporan in he racking communiy, and i has led o many applicaions. These applicaions include video surveillance, human compuer inerface, biomerics, ec. However, face racking coninues o be a challenging problem due, in par, o nonrigid moion, appearance variaions, illuminaion changes, and occlusions. Some examples of hese challenges are shown in Fig. 1. For visual racking, observaions ake place sequenially, and predicions are deermined as soon as image frames arrive. Moreover, real-world daa are ofen complex. To succeed, a racking algorihm needs o be boh efficien and effecive. Recenly, sequenial Mone Carlo (SMC) echniques [2] [4] have received aenion because of heir abiliy o escape from local minima and heir applicabiliy o non-gaussian daa. For example, hese echniques have been demonsraed o be robus relaive o parial occlusion. Manuscrip received November 29, 2008; revised May 27, This work was suppored by he Naional Science Foundaion under Gran This paper was recommended by Gues Edior K. W. Bowyer. The auhors are wih he Deparmen of Compuer Science, Colorado Sae Universiy, For Collins, CO USA ( lui@cs.colosae.edu; ross@cs.colosae.edu; whiley@cs.colosae.edu). Color versions of one or more of he figures in his paper are available online a hp://ieeexplore.ieee.org. Digial Objec Idenifier /TSMCA Fig. 1. Challenges of face racking. (a) Differen expressions. (b) Differen poses. (c) Differen illuminaions. (d) Occlusion. A condensaion algorihm [5], also known as boosrap filers [3], is an example of an SMC mehod. This echnique adops facored sampling o form a se of weighed samples. As such, he mos probable objec movemens can be locaed and racked accordingly. However, mos exising condensaion algorihms apply sequenial imporance sampling wih a fixed Gaussian envelope [5], [6] and ignore he emporal informaion and mach qualiies. This canonical sampling sraegy may be widespread raher han concenraing on he mos probable regions. Hence, i requires more paricles o sample he space. In his paper, we inroduce an adapive condensaion algorihm which uses cascade facored sampling o sample he search space. Facial appearance can change dramaically during a racking process because of various expressions, poses, and illuminaions. An appearance model for face racking is o capure hese variaions and models i for a novel image (incoming frame) /$

2 2 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS For online racking which does no have any prior raining, he appearance model mus learn he facial variaions using previous frames. Therefore, i is considered as a harder problem bu could poenially have more applicaions. Appearance models can be represened in a variey of ways. The common appearance models are emplae-based [7], [8], view-based [9], [10], and feaure-based models [11], [12]. Our appearance model may be regarded as an online emplae-based model. Incremenal subspace learning [6], [13], [14] is a popular mehod for online visual racking. This echnique updaes a subspace sequenially, and he racked objec is declared when he minimum disance is found beween he observaion and he subspace. While hese mehods updae a subspace efficienly, hey can only express an image in a linear fashion. I has been argued ha images acually reside in a nonlinear space [15], [16]. In his paper, we mark he curved naure of an image space, which we call appearance manifold, and characerize local lineariy o form a angen plane in which he emplae is buil adapively. A condensaion algorihm uilizes a facored sampling sraegy o form muliple hypoheses and compues he likelihood for each predicion in accordance wih an observaion. For human-face racking using a condensaion algorihm, wo facors play a vial role in robusness. The firs is o have effecive sampling. This facor deermines he geomery of he face, including where i appears in he image. The second is o updae he appearance effecively. Even a good sampling sraegy will fail if he appearance model does no associae a good prediced appearance wih an essenially correc locaion. Therefore, he appearance model should adap over ime, be able o generae high-fideliy maches where a rue face appears, and avoid generaing false maches for hypohesized facial geomeries ha are parially or oally in error. In his paper, we propose a new adapive framework for condensaion algorihms. We inroduce an adapive affine cascade facored sampling and adapive likelihoods o make facored sampling more efficien. Moreover, we model he appearance changes beween video frames by exploiing he local lineariy of an appearance manifold. A new angen-plane-selecion crierion is proposed, and a reference image is buil using he bes linear variey on he seleced angen plane. This adapive appearance model makes image maching robus o various appearances and poses. The proposed mehod can be characerized as an online learning paradigm which does no require prior raining daa. Unlike many racking algorihms [5], [17], [18] which employ complex dynamical models, we model he sae dynamics using he Brownian moion. This makes our mehod generic. Oher characerisics of he proposed mehod include he following: 1) adapive facored sampling; 2) adapive noise; 3) adapive likelihoods; and 4) an adapive appearance model. As our appearance model, we consider appearances on a manifold and uilize is geomerical properies accordingly. This adapive framework is he core conribuion of his paper. Four video sequences, namely, dudek [19], davidin300 [19], Rams, and Smiley, are employed o assess he performance of he proposed mehod. The res of his paper is organized as follows. Secions II and III review relaed works and he elemens of condensaion algorihms. The frameworks of adapive sampling and adapive appearance are given in Secions IV and V, respecively. Experimenal resuls are described in Secion VI. Discussion and conclusions are provided in Secions VII and VIII, respecively. II. RELATED WORK Appearance changes can be caused by pose and/or illuminaion variaions. Isard and Blake [5] propose a condensaion algorihm for visual racking using acive conours paramerized by low-dimensional vecors. The key idea of condensaion algorihms is he ieraive use of facored sampling o generae muliple hypoheses on video sequences. MacCormick and Isard [20] propose a pariioned sampling for a condensaion algorihm which performs he sampling in a hierarchical fashion using a survival rae. Unlike pariioned sampling, our adapive sampling is a wo-sage cascade facored sampling where a coarse sampling is applied o locae he approximaed face locaions, followed by a fine sampling o fine une he face posiions. The ask of visual racking can be considered as an opimizaion problem. Hager and Belhumeur [7] propose a racking algorihm uilizing a parameric model. This algorihm employs a low-dimensional linear subspace represenaion of appearance based upon a se of images acquired prior o he sar of racking. A gradien-based opimizaion procedure is used during racking o adjus appearance. Cooes and Edwards [21] inroduce acive appearance models (AAMs) for deformable objecs. This model applies principal componen analysis o encode shape and exure informaion. Typically, a Gauss Newon mehod is used o find model parameers ha adjus he model o mach closely he curren video frame. The major drawback of his model is is generalizaion o a generic person; in addiion, prior raining is needed. Jepson e al. [22] propose a WSL racker employing a mixure model o accoun for appearance changes. The image feaures are exraced using wavele filers and are modeled using an online expecaion-maximizaion algorihm during he racking process. However, he appearance updae sraegy for hese mixure models depends upon a Gaussian assumpion which may no be valid for real-world daa. Insead, we model observaions on an appearance manifold. Visual racking can be viewed as a classificaion problem. Avidan [23] formulaes he visual racking problem as a classificaion ask using suppor vecor machines (SVM), where he arge is aced as posiive examples and background is regarded as negaive examples. Song e al. [24] combine ensemble classifiers wih paricle filers for muliarge visual racking. These classificaion-based racking mehods usually require eiher offline raining or heavy compuaional load due o a complicaed online learning scheme. Yu e al. [25] inegrae a generaive racker, which employs muliple subspaces o represen an objec, and a discriminaive racker, which is online SVMs. Subspaces are learned and merged online for all appearance variaions; in addiion, an online SVM classifier is used o focus on recen appearance changes. This mehod yields very good racking resuls. However, i requires heavy compuaion for subspace merging and updaing SVMs and only obains wo frames per second in a C++ implemenaion. Adaping o environmenal changes is key for visual racking. Zhou e al. [17] employ a paricle filer for racking and recogniion. Momen images are used o build a mixure appearance

3 LUI e al.: ADAPTIVE APPEARANCE MODEL AND CONDENSATION ALGORITHM FOR ROBUST FACE TRACKING 3 model for various appearances. The adapive velociy is compued based on he difference beween wo successive frames. The adapive noise is deermined based on he qualiy of predicion. The number of paricles is also varied based upon he esimaed noise variance. Furhermore, he auhors rea occluded pixels as ouliers in which he properies of robus saisics are assumed. Conrasingly, we do no assume any saisical properies in order o handle occlusion. Li e al. [26] propose a cascade paricle filer using hree discriminaive observers. These discriminaive observers are rained using differen inervals beween video frames. A cascade model is applied for imporance sampling. There are hree sages of sampling corresponding o hese hree observers, and hey employ 3000, 600, and 200 paricles, respecively. In conras, our cascade facored sampling only needs o apply 400 o 600 paricles. Incremenally updaing a subspace is a popular echnique for online racking. Ho e al. [14] inroduce an online subspace learning mehod for visual racking. The bases of a subspace are consruced from a se of local means. The local means are compued from a se of consecuive frames, and a new observaion is consrained o remain wihin a prese disance of is local mean. The appearance subspace is incremenally updaed. This mehod mainains he emporal neighborhood (recen frames) as he bases for racking. The local consrain may evenually cause he racker o lose valuable informaion and consequenly fail o keep racking he face. In conras, our mehod keeps rack of he spaial neighborhood, and consequenly, key frames can be used o cope wih drifing and various appearance changes. Lee and Kriegman [27] uilize a prerained generic appearance represenaion in conjuncion wih an online personspecific appearance model. The auhors approximae an appearance manifold using a low-dimensional linear subspace and a probabilisic sae-ransiion Bayesian framework. The pose subspaces are incremenally updaed during racking. Recenly, Ross e al. [6] applied a paricle filer and exended he sequenial Karhunrn Loeve algorihm. This sequenial algorihm incremenally updaes a mean image and associaed eigenspace ha characerizes he face being racked. A forgeing facor is suggesed o enhance he robusness of racking. The auhors demonsrae ha heir mehod ouperforms he WSL [22] and he mean shif [28] rackers on he dudek video sequence. Neverheless, his mehod needs o employ los of paricles o achieve good resuls. Generally speaking, drifing is he challenge o be overcome by online visual rackers. Mahews e al. [8] discuss he emplae updae problem for drifing. The auhors proposed a simple mechanism for he drifing problem, which aligns he updae emplae wih he firs frame. Then, hey formulaed he racking problem as a search problem for AAMs. In his paper, we aack he drifing problem by adaping he angen plane and no allowing small perurbaions o conaminae our appearance model. III. CONDENSATION ALGORITHM To review, he objecive of condensaion algorihms [5] is o esimae he poserior disribuion p(x Y 1: ), where x is a sae a ime and Y is he observaion from ime 1 o ime. By Bayes rule, his poserior disribuion can be esimaed recursively [4] p(x Y 1: ) = κp(y x )p(x Y 1: 1 ) (1) where κ is a normalizing consan, p(y x ) is an observaion condiional densiy, and p(x Y 1: 1 ) is a prior densiy. In pracice, p(y x ) is generally mulimodal, and p(x Y 1: ) canno be compued in closed form. However, i is assumed ha p(y x ) can be evaluaed a poins, and herefore, he poserior disribuion can be approximaed using facored sampling. The key o facored sampling is o generae n samples {s 1,s 2,...,s n } from p(x ), where sample i {1, 2,...,n} is chosen in accordance wih probabiliy π i described as follows: π i = p(y x = s i ) n j=1 p(y x = s j ). (2) Once he poserior disribuion is approximaed, he maximum a poseriori esimae can be used o draw probabilisic inferences. Noe ha increasing n improves he esimae and he approximaed poserior disribuion is weakly convergen o he rue poserior disribuion using facored sampling [5]. A. Elemens of he Condensaion Algorihms Condensaion algorihms ieraively exploi propagaion and facored sampling. The poserior disribuion is sampled from a se of paricles {s (p),π (p) } n p=1 wih p(x 1 Y 1: 1 ). Samples wih high probabiliies will be sampled muliple imes and processed by propagaion and observaion seps. Four elemens of condensaion algorihms may be summarized as follows. 1) Iniializaion: The firs sep for a condensaion algorihm is o form a se of poenial saes. Each poenial sae x 0 has an associaed likelihood π 0.Len be he iniial number of paricles; hen, his se of iniial paricles can be expressed as {s (p) 0,π(p) 0 }n p=1, where he superscrip denoes he index of a paricle and he subscrip denoes he ime index. 2) Propagaion: Propagae samples using a sae-ransiion equaion s (p) = s (p) 1 + N(0, Σ), where Σ is he observaion noise. 3) Observaion: Compue likelihoods from observaions π (p) =p(y (p) s (p) ), and normalize such ha n p=1 π(p) =1. 4) Resampling: Resample s (p) wih he probabiliy π (p) using facored sampling. The propagaion, observaion, and resampling process will be ieraed in each ime sep. IV. ADAPTIVE SAMPLING The canonical condensaion algorihm employs a uniform approach o perform sampling. In oher words, he same number of new paricles are creaed upon each ieraion. In our algorihm, boh he number of new paricles and he amoun of noise vary in response o emporal differences and mach qualiies. This process is described as follows, beginning wih a formal definiion of he sae space.

4 4 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS imaely locae he face. The number of paricles used in coarse sampling is beween 400 and 600. Paricles wih high probabiliies are seleced for fine sampling wih all six affine parameers. In our experimens, we selec he op 100 paricles o perform fine sampling. In he fine sampling sep, he noise of [sx sy dx dy] is se o be small consans such ha small adjusmen is made, and he [θ q] noise depends upon a mach qualiy esimae, as described in he nex secion. Fig. 2. Illusraion of our cascade facored sampling. Every ime sep consiss of a coarse and fine sampling. A. Sae Space Model Face racking is accomplished by fiing a recangular bounding box o a region of he image cenered upon he face. The posiion, size, and shape of he box relaive o he curren frame of video are deermined by an affine ransformaion defined by he following: u cos(θ) sin(θ) dx v = sin(θ) cos(θ) dy 1 q sx sy 0 x y. (3) In his equaion, x and y are he 2-D homogeneous inpu coordinaes, and u and v are he affine ransformed coordinaes. The space of affine ransformaions is he sae space searched by he face racking algorihm. As (3) shows, affine ransformaions can be characerized by six conrol parameers [sx,sy,θ,q,dx,dy], where sx is he horizonal scaling, sy is he verical scaling, θ is a roaed angle, q is a skew parameer, dx is he horizonal ranslaion, and dy is he verical ranslaion. Hence, our sae vecor denoed as s a ime consiss of six parameers of an affine ransformaion, and he objecive of face racking is o esimae hese hidden variables. B. Adapive Affine Cascade Facored Sampling Tradiional condensaion algorihms [5], [6], [26] employ many paricles o sample a sae space, making condensaion inefficien and impracical for large spaces. To make sampling effecive and efficien, we propose a cascade facored sampling wih coarse and fine facored sampling sraegies shown in Fig. 2. As Fig. 2 shows, our cascade facored sampling consiss of wo levels of propagaion and observaion in each ime sep. The purpose of coarse 1 sampling is o approximaely locae he face posiion; hen, he face posiion is fine uned during he fine sampling process. Specifically, he [sx sy dx dy] parameers are coarsely sampled while [θ q] remains unchanged in order o approx- 1 For he sake of breviy, here and in fuure, we will drop he adjecive facored in fron of sampling. C. Adapive Noise Samples are propagaed by adding observaion noise o saes. Unlike radiional condensaion algorihms, which add a fixed amoun of noise o every image frame, we deermine he amoun of observaion noise based on he emporal difference of each sae parameer. Le u (i) be he elemen of one of he sae parameers in [sx sy dx dy], where he superscrip indicaes he index posiion. In he coarse sampling sep, he amoun of observaion noise is deermined based on he following crierion: a 1 b du (1) a Σ u =max 2 0 b, du (2) a b 2 0 du (3), a b 4 du (4) du (1) u (1) u (1) 1 du (2) where du (3) = u (2) u (2) 1 u (3) du (4) u (3). (4) 1 u (4) u (4) In his conex, max is an elemenwise maximum operaion. The erm a i ensures ha a minimum amoun of noise will always be added o an observaion. The erm b i weighs he L 1 norm of he emporal difference. In our experimens, we se a o [0.03, 0.03, 2, 2] T and diag(b) o [0.5, 0.5, 1, 1] T. While coarse sampling is concerned wih scale and ranslaion, and uses emporal differences o adjus he amoun of noise, fine sampling will add a noise erm for roaion and skew based upon he qualiy of he mach. Formally, le v be he elemen of one of he sae parameers in [θ q]. In he fine sampling sep, he amoun of observaion noise is deermined as follows: [ ] c1 Σ v = τ (5) 1 where c i is deermined empirically, and τ is a mach qualiy measure defined in he nex secion. In our experimens, we se c as [0.003, 0.003] T. In general, he parameers a, b, and c conrol he range of sampling. The specific seings for a, b, and c jus described were chosen based upon pilo experimens conduced using he dudek video sequence [19]. These seings are hen held consan in all he racking experimens presened in Secion VI. Also in a pilo experimen, he imporance of adapive sampling and adapive noise was esed by running a side-by-side comparison of our algorihm using boh and a varian of our algorihm wih boh disabled. The es was also run on he dudek video sequence. From his comparison, we noe ha he number c 2

5 LUI e al.: ADAPTIVE APPEARANCE MODEL AND CONDENSATION ALGORITHM FOR ROBUST FACE TRACKING 5 of paricles had o be increased o 1000 o obain reasonable racking performance, whereas he adapive version of our algorihm uses beween 400 and 600 paricles. Even wih he addiional paricles, he average roo-mean-square error (rmse) increased by 10% relaive o he resul for our adapive algorihm. This oucome clearly indicaes ha he adapive componens of our algorihm no only resul in a more efficien racking algorihm bu also improve racking accuracy. V. A DAPTIVE APPEARANCE Every paricle has an associaed affine ransformaion, and his is used o warp a video frame o a canonical geomery. The canonical geomery is defined as he face chip from he firs frame which is seleced manually in our experimens. Formally, le y (p) R m be an observaion given by a sae s (p) a ime, defined as y (p) ( ) = W I,s (p) where W is a warp operaion, 2 I is an image frame a ime, and s (p) is he ph sae vecor a ime. In general, he likelihood of an observaion depends upon he qualiy of he mach beween he warped and reference images. In he conex of our face racking, a reference image is a emplae buil from an appearance model, and i is used o mach all possible face candidaes. Before we discuss how o adapively build his reference image, we firs define our likelihood funcion and demonsrae our sraegy for narrowing he envelope of he likelihood funcion when a sae is found o be a good mach. The raionale is o focus he search more finely in he regions of sae space where maches are good. Then, we will describe he appearance model and our procedure for adaping he model. The model is based on a local linear approximaion o he appearance manifold. The adapaion process is based on a new angen-plane-selecion crierion. A. Adapive Likelihood Model A likelihood funcion measures he qualiy of an observaion. As is common, he likelihood depends upon he L 2 disance beween he observaion y (p) a ime and he reference image z 1 creaed from he appearance model a ime 1. Specifically, he likelihood is defined as ( p y (p) ) (p) z 1 y (p) 2 s =exp. (7) Wha makes our likelihood adapive is our inroducion of a dependence beween σ 2 and he bes mach among he curren observaions hrough an inermediae variable τ. Specifically, τ represens he smalles L 2 disance beween he reference image and he observaions a ime τ =min p σ 2 (6) z 1 y (p) 2. (8) 2 In his paper, he warp operaion is defined as an affine ransformaion, (3), followed by a cropping operaion. Fig. 3. Essence of adapive likelihood sampling is adjusing he variance of he noise model. (a) Inensify sampling when confidence in mach is high. (b) Diversify sampling when confidence in mach is low. Then, he adapive variance is defined as σ 2 = max(τ,t) σ 2 0. (9) The iniial variance σ0 2 and minimum noise hreshold T are user-defined parameers. As (9) shows, he adapive variance is proporional o he mach qualiy τ, and he mach hreshold T ensures ha a leas some minimum amoun of variaion is always applied o he sampling process. In condensaion algorihms, he purpose of compuing likelihood probabiliies p(y (p) s (p) ) is o govern he facored sampling process where paricles are resampled in order o concenrae paricles in he regions of he search space wih he highes likelihood. The adapive variance σ 2 inensifies search over a narrow range of affine ransformaions when a mach qualiy is high and broadens he range of ransformaions when a mach qualiy is low. This relaionship beween inensificaion and diversificaion is shown in Fig. 3. We also increase he number of paricles by a facor of 1.5 when τ is larger or equal o T, indicaing a low mach qualiy. The raionale is ha he mach qualiy will degrade if he racker is saring o fail o keep up wih a moving face, and increasing he number of paricles increases he chances of finding a beer affine ransformaion and reacquiring he face. B. Adapive Appearance Model The proposed adapive appearance model is moivaed by he opology of a space where images are viewed as residing on an absrac image space [16]. We call his image space as he appearance manifold. One advanage of making he opology of he appearance manifold explici is ha i allows us o ake advanage of adjacen appearances in he sense of spaial neighborhood raher han emporal neighborhood. The spaial coherence admis local lineariy even hough he appearance manifold is globally curved. Since images on an appearance manifold exhibi srong spaial coherence, he appearance model can reconsruc a beer reference image. Before showing he benefis of spaial neighborhood, we reveal how he appearance manifold is modeled. Key operaions for our racking algorihm include forming an appearance manifold, selecing a angen plane, and building a reference image based upon he angen plane. These operaions are ieraively applied o every frame during racking such ha he appearance is adaped over ime. The nex hree secions describe hese hree operaions. 1) Appearance Manifold Formaion: Alhough frame-oframe changes in face appearance are usually small, over

6 6 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS many frame appearance changes and rackers relian upon fixed emplaes will perform poorly. Mahews e al. [8] discuss he emplae updae problem by which a racker may adap over ime. We apply a similar drif correcion echnique in our appearance manifold updae scheme. Formally, le us define a locaion-independen sae as ŝ (p) = [sx, sy, θ, q] (p) and he bes observaion Y o be he y (p) associaed wih he paricle p ha yields he highes likelihood p(y (p) s (p) ) a ime. We declare an observaion Y as an appearance image (key frame) and place i on he appearance manifold when he following condiions are saisfied: τ<t ɛ 1 < min j ŝ (p) ŝ (j) 2 <ɛ2 (10) where ŝ (p) is associaed wih Y, and he index j ranges over he appearance images previously appended o he appearance manifold. The firs condiion examines he mach qualiy while he second condiion ensures ha he pose variabiliy is wihin a range defined by ɛ 1 and ɛ 2. Since he firs observaion y 0 is manually seleced, i is auomaically appended o he appearance manifold. In addiion, we replace he j h observaion on he appearance manifold when he number of observaions exceeds a predefined hreshold (40 in our experimens), where j is defined as j =argmin ŝ (p) ŝ (j) 2. (11) j This updae scheme essenially replaces he observaion on an appearance manifold ha is neares o he new observaion. 2) Tangen-Plane Selecion: The disance beween an image and a manifold can be characerized using a angen plane [15]. However, angen planes are no unique on an appearance manifold. Mos exising algorihms employ all available images on an appearance manifold o form a angen plane. In his paper, we inroduce a new crierion for angen-plane selecion. Firs, we formally define a angen plane. Le F be some unknown ransformaion acing on appearances. The firs-order approximaion of an appearance manifold is represened as F α = F 0 + Fα + H.O.T. (12) where F 0 is an appearance image on an appearance manifold, F denoes he angen vecors, and F α is he reconsruced appearance image. The purpose of angen disance is o seek a reconsruced image using angen vecors such ha he disance beween he reconsruced image and an observed image Y is minimized. In his conex, Y is he observaion y (p ) having he highes likelihood in (7). Mahemaically, his may be expressed as F0 min + 2 Fα Y. (13) α In he conex of face racking, here are many alernaive ways o selec F 0 and F given an observaion Y, and hese alernaives play an imporan role in building a reference image. To explore hree possible alernaives, a simplified illusraion is helpful. Fig. 4 shows an observaion Y in relaion o nine appearance images on he appearance manifold. These appearance images are denoed as g, g 1,...,g 8. Furhermore, in Fig. 4. Example of angen-plane selecion, where Y is an observaion and g k is he appearance on a manifold. his example, we assume ha hree angen vecors are sufficien o approximae a angen plane. 1) Se Y o be equal o F 0, and selec he hree neares neighbors (g, g 3, and g 4 )ofy o form he angen vecors. In his case, here is a rivial soluion (α =0)ha makes Y he reference image. This is known as naive updae [8] and has been shown o drif easily. 2) Se g o be equal o F 0 and hree of is neares neighbors (g 1, g 3, and g 4 ) o form he angen vecors. As Fig. 4 shows, hese appearance images may no be he bes images o reconsruc Y. We call his direc appearance updae. 3) Se g o be equal o F 0 and hree neares neighbors (g 3, g 4, and g 8 )ofy o form he angen vecors around g.as Fig. 4 shows, he neares images of Y are surrounded in a blue box. These appearance images ypically provide a much beer reconsrucion. We call his adapive appearance updae. Based on hese observaions, we choose he adapive appearance updae as our angen-plane-selecion crierion. This crierion provides beer spaial coherence of images on an appearance manifold. Our experimenal resuls conclude ha he proposed adapive appearance framework is resilien o drifing. Furher analysis is given in Secion VII-B. 3) Reference Image Updae: To updae he reference image, we firs selec he base image g on he appearance manifold, which is he neares observaion of Y shown as follows: g =argmin Y g j 2 (14) j where g j is an appearance image on he appearance manifold. Because local lineariy is a reasonable presumpion for he appearance manifold, we can form a angen plane using a se of k neares neighbors expressed as T g M : g + span{g g 1,g g 2,...,g g k } (15) where {g 1,g 2,...,g k } are he neares k neighbors of Y, and {g g 1,g g 2,...,g g k } are he angen vecors around g. In our experimen, we se k o be equal o 15. In he case where he number of images on he appearance manifold is less han k, we use all he images ha are currenly available on he appearance manifold. As discussed in he previous secion, angen-plane selecion plays an imporan role in building a reference image. We should noe ha he reference image resides on a angen plane T g M. More specifically, he reference image is formed by

7 LUI e al.: ADAPTIVE APPEARANCE MODEL AND CONDENSATION ALGORITHM FOR ROBUST FACE TRACKING 7 Fig. 5. Direc appearance updae, he adapive appearance updae, and an appearance manifold. The blue do represens a reference image residing on a angen plane. Differen appearance updaes selec differen ses of angen bases; differen angen bases form a differen angen plane and, herefore, a differen reference image. (a) Direc updae. (b) Adapive updae. seeking he bes linear combinaion of he angen vecors a g relaive o he observaion Y. Thus, he k neares neighbors (a dashed ellipse in Fig. 5) govern he qualiy of he reference image. Fig. 5 shows how he reference image changes when we apply alernaive angen-plane-selecion crieria. Fig. 5(a) shows he direc appearance updae and furher shows how he se of basis images may no be spaially closes o he observaion Y. In his case, he reference image may no be coheren relaive o he opology of he appearance manifold. The adapive appearance updae shown in Fig. 5(b) considers he spaial arrangemen on he appearance manifold and builds he reference image using a se of neares bases. This approach, which respecs he opology of he appearance manifold, is much more likely o creae a reference image from a coheren choice of basis images. Recall ha a linear variey allows a hyperplane o shif from is origin. In his paper, we furher exend he linear combinaion o he bes linear variey such ha he angen plane would be shifed closer o he observaion. In oher words, changes in he overall image brighness are rivially accommodaed. This exension can be wrien in a marix form as z 1 z 2.. z m = g 1 g 2. g m + 1 V (1) 1 V (1).. 1 V m (1) 1...V (k) V (k) 2...V (k) m α 0 α 1.. α k (16) where V (i) denoes {g g i i =1,...,k}, and he addiional column 1 is augmened for brighness compensaion; hus, any linear variey can be reconsruced wih a suiable α. Now, he goal is o seek he bes α such ha he difference beween he linear variey and an observaion Y is minimized as follows: min g + Vα Y 2. (17) α Taking a derivaive wih respec o α, he soluion becomes α = V 1 (Y g). (18) Subsiuing α back o (16), he reconsruced appearance image z a ime is he bes linear variey agains Y and will be used as a reference image for compuing likelihoods a ime +1. To summarize our adapive condensaion algorihm, we skech our mehod in Algorihm 1. As Algorihm 1 reveals, our mehod exhibis four major elemens. They are propagaion and observaion, angen-plane selecion, reference image updae, and appearance manifold updae. As such, he propagaion and observaion sep can be considered as an adapive sampling process, while he angen-plane selecion, reference image updae, and appearance manifold updae can be regarded as an adapive appearance framework. Boh elemens are inerdependen and key ingrediens for our face racker. Algorihm 1 Our Adapive Condensaion Algorihm 1: Iniialize paricles 2: while frame empy do 3: Apply propagaion and observaion 4: Coarse facored sampling 5: Compue likelihoods (7) 6: Normalize 7: Selec op paricles 8: Fine facored sampling 9: Compue likelihoods (7) 10: Normalize 11: Selec a angen plane 12: Updae a reference image 13: Solve α (18) 14: Build a reference image z (16) 15: Updae an appearance manifold 16: end while VI. EXPERIMENTAL RESULTS We demonsrae he effeciveness and robusness of our adapive condensaion algorihm using four challenging video sequences. These video sequences are acquired indoors using hand-held video cameras so ha boh camera moion and human moion are concurrenly revealed. The dudek video sequence [19] is a well-known benchmarked video for face racking and comes wih hand-labeled ground-ruh posiions. The second video sequence, davidin300 [19], has mixed shadowing

8 8 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS Fig. 6. Demonsraion of robusness o illuminaion, appearance and pose variaions, and occlusions. (a) Dudek video sequence. (b) Davidin300 video sequence. (c) Rams video sequence. (d) Smiley face video sequence. Fig. 7. Examples of occlusion recovery. and pose variaions. The hird and fourh video sequences were colleced a Colorado Sae Universiy, 3 namely, he Rams and Smiley video sequences. These video sequences also exhibi various appearances, poses, and occlusions. Example frames from hese videos are shown in Fig. 6. In our experimens, he iniial racking posiion is manually seleced. The green bounding boxes shown in Fig. 6 indicae where our racker has found he face in hese frames. 3 Available a hp:// vision/smc-facetracking/. In he nex wo secions, we will discuss specific aspecs of performance as i relaes o illuminaion, appearance, pose, and occlusion. This highlighs how aspecs of our algorihm relae o hese specific challenges. Then, in Secion VI-C, we presen our quaniaive evaluaion. A. Illuminaion, Appearances, and Poses Lighing, appearance, and pose variaions are he primary challenges in face racking. I is imperaive for a robus

9 LUI e al.: ADAPTIVE APPEARANCE MODEL AND CONDENSATION ALGORITHM FOR ROBUST FACE TRACKING 9 algorihm o handle such variaions. The sample racking resuls shown in Fig. 6(a) (d) show ha our mehod is successfully racking he human face under variaions in all hree of hese facors. In addiion, he video sequences include addiional confounding facors such as pronounced facial shadowing, occlusion of he face, and changes in facial expression and pose. Our appearance model is robus o lighing, appearance, and pose changes because i is adapive. This is accomplished by rebuilding he reference image in every frame. To be specific, our angen-plane selecion is driven by he observaion (see Secion V-B2) ha he lighing, appearance, and pose from he observaion are considered when we selec he angen vecors. To pu his anoher way, our reference image is always creaed from a se of prior images ha are mos similar o he curren image, even when ha means i is bes o sep over he recenly acquired images in favor of images seen less recenly bu exhibiing characerisics ha are more useful for explaining he appearance of he curren frame. Finally, from a pracical sandpoin, i is also imporan ha our mehod compensaes for he overall changes in brighness by using a linear variey ha enables he angen plane o shif away from he origin. B. Occlusion Focusing in more precisely on occlusion, Fig. 7 shows successive frames of a video sequence in which our algorihm is successfully handing occlusion. In Fig. 7, he whie bounding boxes are sampling posiions, and he green bounding box is he face posiion wih he highes poserior probabiliy. Observing in he firs few frames, where he face is fully visible, he number of sampling regions is small. As he hand moves in fron of he chin in he middle frame of he op row, he expansion of he sampling paern is immediaely eviden. Then, in successive frames where he face is parially occluded, he sampling paern coninues o expand. This adapive sampling allows he algorihm o reac and avoid losing a face even when he face is moving and/or emporarily occluded. There are wo primary reasons why our mehod is robus o emporary occlusion. Firs, our adapive sampling sraegy is able o follow he movemen of he face since we are using he emporal differences o adjus he amoun of noise. Second, because only observaions wih a qualiy score above a hreshold described in (10) are added o he model, occluded objecs will no ypically paricipae in he process of building a reference image, and consequenly, hey do no corrup he appearance model. Of course, hese sraegies have heir limis. The longer a face is occluded, he higher he risk ha he racker will expand is search region o such an exen ha i will lose he objec. I is also conceivable, alhough we have no observed such a case, ha very slow inroducion of ever more occluded images of a face migh allow parially covered insances of he face o be added o he reference model. C. Quaniaive Evaluaion Ground-ruh face locaion informaion is available for hree video sequences: dudek, Rams, and Smiley. For hese, we are able o carry ou a quaniaive evaluaion. Each video frame in hese sequences is annoaed wih seven fiducial posiions. We Fig. 8. Ground-ruh and esimaed posiions, where ground-ruh posiions are red and esimaed posiions are green. employ he rmse beween he ground-ruh and esimaed posiions o assess racking performance. An example of groundruh and esimaed posiions is shown in Fig. 8, where he red crosses are ground-ruh posiions and he green crosses are he esimaed posiions. We direcly compare our racking resuls wih he sae-ofhe-ar racking algorihms ha exploi he advanages of a learned subspace mehod [14] and incremenal visual racking (IVT) [6]. Boh hese mehods are online subspace learning echniques where he learned subspace mehod consrucs a se of local means as he bases of a subspace and he IVT learns he subspace incremenally wih a forgeing facor. The quaniaive resuls for he dudek, Rams, and Smiley video sequences are shown in Fig. 9 and summarized in Table I. As can be seen in Fig. 9(a), all hree algorihms successfully rack he face over he full duraion of he dudek video. This is no rue for he Rams video [Fig. 9(b)] or he Smiley video [Fig. 9(c)], where disinc poins in he video can be observed where an algorihm experiences a subsanial jump in rmse and hen never recovers. For example, in he Rams video [Fig. 9(b)], a disinc jump in he rmse for he IVT algorihm can be seen a frame 99. A similar jump can be seen for he learned subspace algorihm a frame 532. Noably, our mehod mainains a relaively sable, and comparaively low, rmse over he enire duraion of he video for boh he Rams and Smiley video sequences. Average rmse values over he enire dudek video for all hree algorihms are presened in he firs hree rows of Table I. The average rmse values for he learned subspace mehod and IVT are 6.3 and 5.32, 4 respecively, whereas our mehod achieves 4.98 average rmse. More imporanly, our mehod only employs 400 o 600 paricles and runs a 8.74 frames per second, whereas he IVT uses 4000 paricles and runs a 1.1 frames per second. Noe ha all experimens are run on he same machine, and so, his difference in ime required per frame is a reliable indicaion of jus how much more efficien is he adapive framework. Since boh he IVT and learned subspace algorihms lose rack of he face a some poin in boh he Rams and Smiley video sequences, Table I presens he average rmse values in wo ways. Firs, he average rmse values are presened over he full video sequences, and as a resul of he algorihms losing rack of he face, he average rmse is very high for boh algorihms on boh videos. To address he quesion of how 4 The MATLAB source code was downloaded from hp:// edu/~dross/iv/, and we used he bes parameer seings suggesed by he auhors.

10 10 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS Fig. 9. Quaniaive comparisons o learned subspace [14] and IVT [6] for face racking. (a) Dudek video sequence. (b) Rams video sequence. (c) Smiley video sequence. well he algorihms are doing relaive o each oher over he porion of he video where all hree algorihms remain inac in racking, Table I also shows he average rmse over jus he firs 99 and 273 frames of he Rams and Smiley videos, respecively. Even on hese runcaed video sequences, our mehod sill ouperforms he learned subspace mehod and IVT in boh Rams and Smiley video sequences, where our mehod achieves 3.51 and 6.43 average rmse values for he Rams and Smiley video sequences, respecively. The key differences beween our mehod and he subspace mehods esed in his paper are he daa represenaion of he appearance model and he updae sraegy. While boh subspace mehods build a subspace incremenally, we use key frames and represen hem on an appearance manifold. As we consider key frames on an appearance manifold, he geomeric inerpreaion is aken ino accoun. We apply he propery of local lineariy and make a local updae for a angen plane insead of an incremenal updae for a subspace like [6] and [14]. As such, our bases are a se of local 5 (similar) key frames raher han global bases. This spaial coherency allows us o selec a angen plane from adjacen appearances and builds a beer reference image. In conras, he subspace mehods such as [6] and [14] adop all he bases o form a subspace and compue he subspace disance for each observaion. Therefore, hese mehods ignore he spaial coherency which plays an imporan role in building a good reference image. Finally, we noe ha all he racking resuls are obained using MATLAB version on an HP-xw4600-Core2Duo- SATA 2x2.83G 64-b machine. The racking resuls, racking videos, and he landmark poins can be downloaded from hp:// VII. DISCUSSION A. Drif Analysis Templae updae is an essenial sep o accoun for various expression, pose, and illuminaion changes. However, i can easily resul in a racker drifing off he objec of ineres, in his case, a paricular person s face. To comba drifing, we need o firs undersand ha drif is caused by accumulaed error in a emplae posiion. Whenever he emplae is updaed, small errors accumulae o he posiion of he emplae. Once drifing sars, i can rapidly cause a racker o lose he face. The bes way o minimize drif is o avoid he small errors in he firs place, and his, in urn, requires ha he adapive model makes he correc balance beween generaliy and specificiy. Conrolling he observaions, images, which are added o he appearance manifold, and adapaion o he curren frame when consrucing he reference image ogeher help our approach srike his balance. Our approach is selecive abou wha observaions are added o he appearance manifold. As defined in (10), here are hree aspecs o he screening of images ha are imporan. Firs, here mus be a good mach beween he observaion and he reference image. Second, if he change in posiion, which is he magniude of he ransformaion beween frames, is very large, he observaion is no added o he appearance manifold. These seps ensure he inegriy of he appearance manifold. Third, only observaions wih a sufficien change in posiion are added o he appearance manifold. This proecs agains he appearance manifold becoming overwhelmed wih a se of nearly idenical images. The seps jus described do a lo o balance generaliy and specificiy in he consrucion of he appearance manifold. However, ha alone is no enough. Only observaions relaed o he curren frame should conribue o he consrucion of he reference image, and our algorihm accomplishes his hrough he angen-plane consrucion process described in Secion V-B2. Through his mechanism, our algorihm can mainain hisory abou differen aspecs of appearance, for example, parial profile versus head on, wihou making he 5 Local implies a spaial relaionship raher han a emporal relaionship.

11 LUI e al.: ADAPTIVE APPEARANCE MODEL AND CONDENSATION ALGORITHM FOR ROBUST FACE TRACKING 11 TABLE I QUANTITATIVE COMPARISONS IN TERMS OF RMSE AND SPEED Fig. 10. Quaniaive comparison for various appearance updae mehods using he dudek video sequence. misake of combining hese wo cases when maching o a specific frame. B. Updae Sraegy Analysis The selecion of angen plane plays a vial role in building a reference image. To furher analyze he effeciveness of our adapive appearance model, we compare he adapive appearance updae wih he direc and naive appearance updaes discussed in Secion V-B2. Fig. 10 shows he quaniaive resuls for various updae sraegies using he dudek video. The naive appearance updae loses rack afer he firs occlusion. The direc and adapive appearance updaes perform idenically in he beginning since, iniially, he same images are added o he appearance manifold for boh algorihms. I is only afer he firs 174 frames ha he performance diverges and our adapive appearance model can be seen o perform somewha beer. Over he enire video sequence, he average rmse is 5.8 for he direc appearance updae and 4.98 for he adapive appearance updae. The argumen of hese wo updaes is from various spaial consideraions beween he observaion and he appearances on a manifold. The experimenal resuls described in Fig. 10 shows he imporance of spaial coherency. Our adapive appearance updae considers he spaial arrangemen properly, and herefore, i is more robus o handle variabiliy and drifing. C. Limiaion While our algorihm performs well in face racking, he sae model is sill an affine ransformaion realized as a global alignmen. Adapively building a reference image o bes model he curren frame does go some ways oward handling nonlinear aspecs of facial movemen; however, i sill canno accoun for all nonridge facial expressions. Affine ransformaions do no have he expressive power o model he deails of various facial expressions, for insance, smiling or neural expression. One approach o handling nonlinear facors, such as expression variaions, would be o adop more sophisicaed appearance models. For example, AAMs [21] migh be used. However, he dimensions of he sae space would be remendously increased since i encodes boh shape and exure informaion. The increased expressive power of AAMs clearly comes a considerable exra compuaional expense. Anoher facor o consider in reviewing he quaniaive resuls presened is ha he ground-ruh posiions are manually annoaed; hus, here may be some noise in hese posiions. In fac, Jepson e al. [22] repor ha he lower bound of he average rmse for he dudek video is 3.1 pixel error. This baseline error has boh ground-ruh and fiducial-posiion errors ha canno be recovered by similariy ransformaions. Here oo, AAMs could break ou of he limis imposed by he assumpion of a global affine ransformaion bu, again, a a considerable addiional compuaional expense. Finally, i should be undersood ha, using our mehod or, indeed, any of he algorihms agains which we compared our mehod, face racking is limied o a single face and reacquisiion is needed when he objec is los. Our experimenal resuls show ha our mehod works reasonably well on shorerm occlusion where he lenghs of occlusion are 6, 20, and 22 frames for he dudek, Rams, and Smiley video sequences, respecively. However, racking failure would definiely occur when long ime occlusion is presen or he facial image appears significanly differen afer occlusion. This is because he assumpion of smooh ransiions beween frames is invalid. VIII. CONCLUSION AND FUTURE WORK This paper has presened an adapive framework for humanface racking. Our adapive affine cascade sraegy performs wo-sage facored sampling ha effecively reduces he search

12 12 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS space. In our es videos, we employ 400 o 600 paricles for coarse facored sampling and 100 paricles for fine facored sampling. This facored sampling sraegy uses fewer paricles and makes sampling more efficien. Equally imporan, we have adapively buil a reference image in every frame. This is achieved by properly selecing a angen plane and using a linear variey from an appearance manifold. Wih hese wo key ingrediens, our mehod is robus o variaion of illuminaion, appearance, pose, and emporary occlusions. Furhermore, hree video sequences wih ground-ruh posiions have been adoped o assess he performance of our mehod, and encouraging resuls have been obained. Finally, since our mehod is an online learning paradigm, no prior raining is required. Human-face racking is sill a difficul ask due, in par, o large freedom of face movemen and appearance changes. Our fuure work will focus on emporary disappearance and coninue o explore appearance manifolds. ACKNOWLEDGMENT The auhors hank David A. Ross from Universiy of Torono for he permission o use his videos and source code. REFERENCES [1] A. Yilmaz, O. Javed, and M. Shah, Objec racking: A survey, ACM J. Compu. Surv., vol. 38, no. 4, pp. 1 45, [2] N. Gordon, Bayesian mehods for racking, Ph.D. disseraion, Univ. London, London, U.K., [3] N. Gordon, D. Salmond, and A. Smih, A novel approach o non-linear and non-gaussian Bayesian sae esimaion, Proc. Ins. Elec. Eng. F, vol. 140, no. 2, pp , Apr [4] A. Douce, N. de Freias, and N. Gordon, Sequenial Mone Carlo Mehods in Pracice. New York: Springer-Verlag, [5] M. Isard and A. Blake, Condensaion Condiion densiy propagaion for visual racking, In. J. Compu. Vis., vol. 29, no. 1, pp. 5 28, Aug [6] D. Ross, J. Lim, R.-S. Lin, and M.-H. Yang, Incremenal learning for robus visual racking, In. J. Compu. Vis., vol. 77, no. 1 3, pp , May [7] G. Hager and P. Belhumeur, Efficien region racking wih parameric models of geomery and illuminaion, Trans. Paern Anal. Mach. Inell., vol. 20, no. 10, pp , Oc [8] I. Mahews, R. Ishikawa, and S. Baker, The emplae updae problem, Trans. Paern Anal. Mach. Inell., vol. 26, no. 6, pp , Jun [9] M. J. Black and A. D. Jepson, Eigenracking: Robus maching and racking of ariculaed objecs using a view-based represenaion, in Proc. ECCV, 1996, pp [10] S. Von Duhn, L. Yin, M. J. Ko, and T. Hung, Muliple-view face racking for modeling and analysis based on non-cooperaive video imagery, in Proc. CVPR, 2007, pp [11] K.-C. Lee and D. Kriegman, Face racking in video wih hybrid of Lucas Kanade and condensaion algorihm, in Proc. In. Conf. Mulimedia Expo, 2003, pp [12] W. Zhang, Q. Wang, and X. Tang, Real ime feaure based 3-D deformable face racking, in Proc. ECCV, 2008, pp [13] A. Levy and M. Lindenbaum, Sequenial Karhunen Loeve basis exracion and is applicaion o images, Trans. Image Process., vol. 9, no. 8, pp , Aug [14] J. Ho, K.-C. Lee, M.-H. Yang, and D. Kriegman, Visual racking using learned linear subspace, in Proc. CVPR, 2004, pp [15] P. Simard, Y. L. Cun, and J. Denker, Efficien paern recogniion using a new ransformaion disance, in Proc. NIPS, 1992, pp [16] H. Seung and D. Lee, The manifold ways of percepion, Science, vol. 290, no. 5500, pp , Dec [17] S. Zhou, R. Chellapa, and B. Moghaddam, Visual racking and recogniion using appearance-adapive models in paricle filers, Trans. Image Process., vol. 13, no. 11, pp , Nov [18] B. Norh, A. B. M. Isard, and J. Rischer, Learning and classificaion of complex dynamics, Trans. Paern Anal. Mach. Inell., vol. 22, no. 9, pp , Sep [19] D. Ross. [Online]. Available: hp:// [20] J. MacCormick and M. Isard, Pariioned sampling, ariculaed objecs, and inerface-qualiy hand racking, in Proc. ECCV, 2000, pp [21] T. Cooes and G. Edwards, Acive appearance models, Trans. Paern Anal. Mach. Inell., vol. 23, no. 6, pp , Jun [22] A. D. Jepson, D. J. Flee, and T. El-Maraghi, Robus online appearance models for visual racking, Trans. Paern Anal. Mach. Inell., vol. 25, no. 10, pp , Oc [23] S. Avidan, Suppor vecor racking, Trans. Paern Anal. Mach. Inell., vol. 26, no. 8, pp , Aug [24] X. Song, J. Cui, H. Zha, and H. Zhao, Vision-based muliple ineracing arges racking via on-line supervised learning, in Proc. ECCV, 2008, pp [25] Q. Yu, T. Dinh, and G. Medioni, Online racking and reacquisiion using co-rained generaive and discriminaive rackers, in Proc. ECCV, 2008, pp [26] Y. Li, H. Ai, T. Yamashia, S. Lao, and M. Kawade, Tracking in low frame rae video: A cascade paricle filer wih discriminaive observers of differen lifespans, in Proc. CVPR, 2007, pp [27] K.-C. Lee and D. Kriegman, Online learning of probabilisic appearance manifolds for video-based recogniion and racking, in Proc. CVPR, 2005, pp [28] D. Comaniciu, V. Ramesh, and P. Meer, Kernel-based objec racking, Trans. Paern Anal. Mach. Inell., vol. 25, no. 5, pp , May Yui Man Lui (S 07) is currenly working oward he Ph.D. degree in he Deparmen of Compuer Science, Colorado Sae Universiy, For Collins. His curren research ineress include special manifolds, face recogniion, face racking, and acion classificaion. Mr. Lui is he recipien of he Honeywell Bes Suden Paper Award from he 2009 Inernaional Conference on Biomerics: Theory, Applicaions and Sysems (BTAS); he coauhor of he Bes Paper Award from he 2008 Inernaional Conference on Auomaic Face and Gesure Recogniion; and he recipien of an honorable menion for he Honeywell Bes Suden Paper Award from BTAS in J. Ross Beveridge (M 83) received he Ph.D. degree from he Universiy of Massachuses, Amhers, in He is currenly an Associae Professor wih he Deparmen of Compuer Science, Colorado Sae Universiy, For Collins. He has served as an Associae Edior for Paern Recogniion and is currenly an Associae Edior for Image and Vision Compuing. Dr. Beveridge has served as an Associae Edior for he TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE.Hewas he Program Cochair for he 1999 Conference on Compuer Vision and Paern Recogniion. He was he recipien of he Ousanding Reviewer Awards in 2007 and 2008 and he Bes Paper Award from he Inernaional Conference on Auomaic Face and Gesure Recogniion in L. Darrell Whiley received he Ph.D. degree from Souhern Illinois Universiy in He is currenly he Chair wih he Deparmen of Compuer Science, Colorado Sae Universiy, For Collins. He serves on he ediorial board of he journals Arificial Inelligence and Evoluionary Compuaion. He also serves as he Chair of he Associaion for Compuing Machinery Special Ineres Group on Geneic and Evoluionary Compuaion, Sigevo.

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto Visual Percepion as Bayesian Inference David J Flee Universiy of Torono Basic rules of probabiliy sum rule (for muually exclusive a ): produc rule (condiioning): independence (def n ): Bayes rule: marginalizaion:

More information

Learning in Games via Opponent Strategy Estimation and Policy Search

Learning in Games via Opponent Strategy Estimation and Policy Search Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen

More information

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report) Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,

More information

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS Soumya Hamlaoui & Franck Davoine HEUDIASYC Mixed Research Uni, CNRS / Compiègne Universiy of Technology BP 20529, 60205 Compiègne

More information

STEREO PLANE MATCHING TECHNIQUE

STEREO PLANE MATCHING TECHNIQUE STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo

More information

Video-Based Face Recognition Using Probabilistic Appearance Manifolds

Video-Based Face Recognition Using Probabilistic Appearance Manifolds Video-Based Face Recogniion Using Probabilisic Appearance Manifolds Kuang-Chih Lee Jeffrey Ho Ming-Hsuan Yang David Kriegman klee10@uiuc.edu jho@cs.ucsd.edu myang@honda-ri.com kriegman@cs.ucsd.edu Compuer

More information

CENG 477 Introduction to Computer Graphics. Modeling Transformations

CENG 477 Introduction to Computer Graphics. Modeling Transformations CENG 477 Inroducion o Compuer Graphics Modeling Transformaions Modeling Transformaions Model coordinaes o World coordinaes: Model coordinaes: All shapes wih heir local coordinaes and sies. world World

More information

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same

More information

Probabilistic Detection and Tracking of Motion Discontinuities

Probabilistic Detection and Tracking of Motion Discontinuities Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/

More information

Definition and examples of time series

Definition and examples of time series Definiion and examples of ime series A ime series is a sequence of daa poins being recorded a specific imes. Formally, le,,p be a probabiliy space, and T an index se. A real valued sochasic process is

More information

An Improved Square-Root Nyquist Shaping Filter

An Improved Square-Root Nyquist Shaping Filter An Improved Square-Roo Nyquis Shaping Filer fred harris San Diego Sae Universiy fred.harris@sdsu.edu Sridhar Seshagiri San Diego Sae Universiy Seshigar.@engineering.sdsu.edu Chris Dick Xilinx Corp. chris.dick@xilinx.com

More information

A Matching Algorithm for Content-Based Image Retrieval

A Matching Algorithm for Content-Based Image Retrieval A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using

More information

EECS 487: Interactive Computer Graphics

EECS 487: Interactive Computer Graphics EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified

More information

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates A Fas Sereo-Based Muli-Person Tracking using an Approximaed Likelihood Map for Overlapping Silhouee Templaes Junji Saake Jun Miura Deparmen of Compuer Science and Engineering Toyohashi Universiy of Technology

More information

Tracking Appearances with Occlusions

Tracking Appearances with Occlusions Tracking ppearances wih Occlusions Ying Wu, Ting Yu, Gang Hua Deparmen of Elecrical & Compuer Engineering Norhwesern Universiy 2145 Sheridan oad, Evanson, IL 60208 {yingwu,ingyu,ganghua}@ece.nwu.edu bsrac

More information

4.1 3D GEOMETRIC TRANSFORMATIONS

4.1 3D GEOMETRIC TRANSFORMATIONS MODULE IV MCA - 3 COMPUTER GRAPHICS ADMN 29- Dep. of Compuer Science And Applicaions, SJCET, Palai 94 4. 3D GEOMETRIC TRANSFORMATIONS Mehods for geomeric ransformaions and objec modeling in hree dimensions

More information

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz

More information

Design Alternatives for a Thin Lens Spatial Integrator Array

Design Alternatives for a Thin Lens Spatial Integrator Array Egyp. J. Solids, Vol. (7), No. (), (004) 75 Design Alernaives for a Thin Lens Spaial Inegraor Array Hala Kamal *, Daniel V azquez and Javier Alda and E. Bernabeu Opics Deparmen. Universiy Compluense of

More information

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors We are InechOpen, he world s leading publisher of Open Access books Buil by scieniss, for scieniss 4,000 116,000 120M Open access books available Inernaional auhors and ediors Downloads Our auhors are

More information

J. Vis. Commun. Image R.

J. Vis. Commun. Image R. J. Vis. Commun. Image R. 20 (2009) 9 27 Conens liss available a ScienceDirec J. Vis. Commun. Image R. journal homepage: www.elsevier.com/locae/jvci Face deecion and racking using a Boosed Adapive Paricle

More information

Improved TLD Algorithm for Face Tracking

Improved TLD Algorithm for Face Tracking Absrac Improved TLD Algorihm for Face Tracking Huimin Li a, Chaojing Yu b and Jing Chen c Chongqing Universiy of Poss and Telecommunicaions, Chongqing 400065, China a li.huimin666@163.com, b 15023299065@163.com,

More information

Robust Visual Tracking for Multiple Targets

Robust Visual Tracking for Multiple Targets Robus Visual Tracking for Muliple Targes Yizheng Cai, Nando de Freias, and James J. Lile Universiy of Briish Columbia, Vancouver, B.C., Canada, V6T 1Z4 {yizhengc, nando, lile}@cs.ubc.ca Absrac. We address

More information

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking Muliple View Discriminaive ing wih IMCMC for Disribued Tracking Sanhoshkumar Sunderrajan, B.S. Manjunah Deparmen of Elecrical and Compuer Engineering Universiy of California, Sana Barbara {sanhosh,manj}@ece.ucsb.edu

More information

AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION

AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION Chaper 3 AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION A. Koschan, V. R. Ayyagari, F. Boughorbel, and M. A. Abidi Imaging, Roboics, and Inelligen Sysems Laboraory, The Universiy of Tennessee, 334

More information

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly MITSUBISHI ELECTRIC RESEARCH LABORATORIES hp://www.merl.com Rao-Blackwellized Paricle Filering for Probing-Based 6-DOF Localizaion in Roboic Assembly Yuichi Taguchi, Tim Marks, Haruhisa Okuda TR1-8 June

More information

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008 MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o

More information

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction 58 IJCSNS Inernaional Journal of Compuer Science and Nework Securiy, VOL.6 No.10, Ocober 006 Occlusion-Free Hand Moion Tracking by Muliple Cameras and Paricle Filering wih Predicion Makoo Kao, and Gang

More information

Real Time Integral-Based Structural Health Monitoring

Real Time Integral-Based Structural Health Monitoring Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy

More information

Reinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi

Reinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi Reinforcemen Learning by Policy Improvemen Making Use of Experiences of The Oher Tasks Hajime Kimura and Shigenobu Kobayashi Tokyo Insiue of Technology, JAPAN genfe.dis.iech.ac.jp, kobayasidis.iech.ac.jp

More information

Nonparametric CUSUM Charts for Process Variability

Nonparametric CUSUM Charts for Process Variability Journal of Academia and Indusrial Research (JAIR) Volume 3, Issue June 4 53 REEARCH ARTICLE IN: 78-53 Nonparameric CUUM Chars for Process Variabiliy D.M. Zombade and V.B. Ghue * Dep. of aisics, Walchand

More information

In Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12].

In Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12]. In Proceedings of CVPR '96 Srucure and Moion of Curved 3D Objecs from Monocular Silhouees B Vijayakumar David J Kriegman Dep of Elecrical Engineering Yale Universiy New Haven, CT 652-8267 Jean Ponce Compuer

More information

Upper Body Tracking for Human-Machine Interaction with a Moving Camera

Upper Body Tracking for Human-Machine Interaction with a Moving Camera The 2009 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems Ocober -5, 2009 S. Louis, USA Upper Body Tracking for Human-Machine Ineracion wih a Moving Camera Yi-Ru Chen, Cheng-Ming Huang, and

More information

STRING DESCRIPTIONS OF DATA FOR DISPLAY*

STRING DESCRIPTIONS OF DATA FOR DISPLAY* SLAC-PUB-383 January 1968 STRING DESCRIPTIONS OF DATA FOR DISPLAY* J. E. George and W. F. Miller Compuer Science Deparmen and Sanford Linear Acceleraor Cener Sanford Universiy Sanford, California Absrac

More information

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image

More information

Visual Indoor Localization with a Floor-Plan Map

Visual Indoor Localization with a Floor-Plan Map Visual Indoor Localizaion wih a Floor-Plan Map Hang Chu Dep. of ECE Cornell Universiy Ihaca, NY 14850 hc772@cornell.edu Absrac In his repor, a indoor localizaion mehod is presened. The mehod akes firsperson

More information

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,

More information

Real-time 2D Video/3D LiDAR Registration

Real-time 2D Video/3D LiDAR Registration Real-ime 2D Video/3D LiDAR Regisraion C. Bodenseiner Fraunhofer IOSB chrisoph.bodenseiner@iosb.fraunhofer.de M. Arens Fraunhofer IOSB michael.arens@iosb.fraunhofer.de Absrac Progress in LiDAR scanning

More information

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions A Hierarchical Objec Recogniion Sysem Based on Muli-scale Principal Curvaure Regions Wei Zhang, Hongli Deng, Thomas G Dieerich and Eric N Morensen School of Elecrical Engineering and Compuer Science Oregon

More information

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Viewpoin Invarian 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Chaohui Wang 1,2, Yun Zeng 3, Loic Simon 1, Ioannis Kakadiaris 4, Dimiris Samaras 3, Nikos Paragios 1,2

More information

Robust 3D Visual Tracking Using Particle Filtering on the SE(3) Group

Robust 3D Visual Tracking Using Particle Filtering on the SE(3) Group Robus 3D Visual Tracking Using Paricle Filering on he SE(3) Group Changhyun Choi and Henrik I. Chrisensen Roboics & Inelligen Machines, College of Compuing Georgia Insiue of Technology Alana, GA 3332,

More information

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies RESEARCH Open Access Muli-camera muli-objec voxel-based Mone Carlo 3D racking sraegies Crisian Canon-Ferrer *, Josep R Casas, Monse Pardàs and Enric Mone Absrac This aricle presens a new approach o he

More information

Coded Caching with Multiple File Requests

Coded Caching with Multiple File Requests Coded Caching wih Muliple File Requess Yi-Peng Wei Sennur Ulukus Deparmen of Elecrical and Compuer Engineering Universiy of Maryland College Park, MD 20742 ypwei@umd.edu ulukus@umd.edu Absrac We sudy a

More information

Wheelchair-user Detection Combined with Parts-based Tracking

Wheelchair-user Detection Combined with Parts-based Tracking Wheelchair-user Deecion Combined wih Pars-based Tracking Ukyo Tanikawa 1, Yasuomo Kawanishi 1, Daisuke Deguchi 2,IchiroIde 1, Hiroshi Murase 1 and Ryo Kawai 3 1 Graduae School of Informaion Science, Nagoya

More information

Evaluation and Improvement of Region-based Motion Segmentation

Evaluation and Improvement of Region-based Motion Segmentation Evaluaion and Improvemen of Region-based Moion Segmenaion Mark Ross Universiy Koblenz-Landau, Insiue of Compuaional Visualisics, Universiässraße 1, 56070 Koblenz, Germany Email: ross@uni-koblenz.de Absrac

More information

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild IEEE TRANSACTIONS ON IMAGE PROCESSING, DRAFT 1 Robus LSTM-Auoencoders for Face De-Occlusion in he Wild Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan arxiv:1612.08534v1 [cs.cv] 27 Dec 2016

More information

Sam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes.

Sam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes. 8.F Baery Charging Task Sam wans o ake his MP3 player and his video game player on a car rip. An hour before hey plan o leave, he realized ha he forgo o charge he baeries las nigh. A ha poin, he plugged

More information

Computer representations of piecewise

Computer representations of piecewise Edior: Gabriel Taubin Inroducion o Geomeric Processing hrough Opimizaion Gabriel Taubin Brown Universiy Compuer represenaions o piecewise smooh suraces have become vial echnologies in areas ranging rom

More information

Video Content Description Using Fuzzy Spatio-Temporal Relations

Video Content Description Using Fuzzy Spatio-Temporal Relations Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer

More information

Image Content Representation

Image Content Representation Image Conen Represenaion Represenaion for curves and shapes regions relaionships beween regions E.G.M. Perakis Image Represenaion & Recogniion 1 Reliable Represenaion Uniqueness: mus uniquely specify an

More information

IntentSearch:Capturing User Intention for One-Click Internet Image Search

IntentSearch:Capturing User Intention for One-Click Internet Image Search JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO. 1, JANUARY 2010 1 InenSearch:Capuring User Inenion for One-Click Inerne Image Search Xiaoou Tang, Fellow, IEEE, Ke Liu, Jingyu Cui, Suden Member, IEEE, Fang

More information

NEWTON S SECOND LAW OF MOTION

NEWTON S SECOND LAW OF MOTION Course and Secion Dae Names NEWTON S SECOND LAW OF MOTION The acceleraion of an objec is defined as he rae of change of elociy. If he elociy changes by an amoun in a ime, hen he aerage acceleraion during

More information

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure SUBMITTED TO TRANSACTION ON IMAGE PROCESSING 1 Online Temporally Consisen Indoor Deph Video Enhancemen via Saic Srucure Lu Sheng, Suden Member, IEEE, King Ngi Ngan, Fellow, IEEE, Chern-Loon Lim and Songnan

More information

MOTION TRACKING is a fundamental capability that

MOTION TRACKING is a fundamental capability that TECHNICAL REPORT CRES-05-008, CENTER FOR ROBOTICS AND EMBEDDED SYSTEMS, UNIVERSITY OF SOUTHERN CALIFORNIA 1 Real-ime Moion Tracking from a Mobile Robo Boyoon Jung, Suden Member, IEEE, Gaurav S. Sukhame,

More information

Gauss-Jordan Algorithm

Gauss-Jordan Algorithm Gauss-Jordan Algorihm The Gauss-Jordan algorihm is a sep by sep procedure for solving a sysem of linear equaions which may conain any number of variables and any number of equaions. The algorihm is carried

More information

Detection and segmentation of moving objects in highly dynamic scenes

Detection and segmentation of moving objects in highly dynamic scenes Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France

More information

A non-stationary uniform tension controlled interpolating 4-point scheme reproducing conics

A non-stationary uniform tension controlled interpolating 4-point scheme reproducing conics A non-saionary uniform ension conrolled inerpolaing 4-poin scheme reproducing conics C. Beccari a, G. Casciola b, L. Romani b, a Deparmen of Pure and Applied Mahemaics, Universiy of Padova, Via G. Belzoni

More information

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA Audio Engineering Sociey Convenion Paper Presened a he 119h Convenion 2005 Ocober 7 10 New Yor, New Yor USA This convenion paper has been reproduced from he auhor's advance manuscrip, wihou ediing, correcions,

More information

4. Minimax and planning problems

4. Minimax and planning problems CS/ECE/ISyE 524 Inroducion o Opimizaion Spring 2017 18 4. Minima and planning problems ˆ Opimizing piecewise linear funcions ˆ Minima problems ˆ Eample: Chebyshev cener ˆ Muli-period planning problems

More information

Robust Multi-view Face Detection Using Error Correcting Output Codes

Robust Multi-view Face Detection Using Error Correcting Output Codes Robus Muli-view Face Deecion Using Error Correcing Oupu Codes Hongming Zhang,2, Wen GaoP P, Xilin Chen 2, Shiguang Shan 2, and Debin Zhao Deparmen of Compuer Science and Engineering, Harbin Insiue of Technolog

More information

Mobile Robots Mapping

Mobile Robots Mapping Mobile Robos Mapping 1 Roboics is Easy conrol behavior percepion modelling domain model environmen model informaion exracion raw daa planning ask cogniion reasoning pah planning navigaion pah execuion

More information

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding Indian Journal of Science and Technology, Vol 8(21), DOI: 10.17485/ijs/2015/v8i21/69958, Sepember 2015 ISSN (Prin) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Various Types of Bugs in he Objec Oriened

More information

Open Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u

Open Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u Send Orders for Reprins o reprins@benhamscience.ae The Open Biomedical Engineering Journal, 5, 9, 9-3 9 Open Access Research on an Improved Medical Image Enhancemen Algorihm Based on P-M Model Luo Aijing

More information

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs)

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs) 2016 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems (IROS) Daejeon Convenion Cener Ocober 9-14, 2016, Daejeon, Korea Muli-Targe Deecion and Tracking from a Single Camera in Unmanned Aerial

More information

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many

More information

An Adaptive Spatial Depth Filter for 3D Rendering IP

An Adaptive Spatial Depth Filter for 3D Rendering IP JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.3, NO. 4, DECEMBER, 23 175 An Adapive Spaial Deph Filer for 3D Rendering IP Chang-Hyo Yu and Lee-Sup Kim Absrac In his paper, we presen a new mehod

More information

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES Arun Kumar H. D. 1 and Prabhakar C. J. 2 1 Deparmen of Compuer Science, Kuvempu Universiy, Shimoga, India ABSTRACT

More information

Tracking Deforming Objects Using Particle Filtering for Geometric Active Contours

Tracking Deforming Objects Using Particle Filtering for Geometric Active Contours 1470 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 29, NO. 8, AUGUST 2007 Tracking Deforming Objecs Using Paricle Filering for Geomeric Acive Conours Yogesh Rahi, Member, IEEE, NamraaVaswani,

More information

Real time 3D face and facial feature tracking

Real time 3D face and facial feature tracking J Real-Time Image Proc (2007) 2:35 44 DOI 10.1007/s11554-007-0032-2 ORIGINAL RESEARCH PAPER Real ime 3D face and facial feaure racking Fadi Dornaika Æ Javier Orozco Received: 23 November 2006 / Acceped:

More information

A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER

A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER ABSTRACT Modern graphics cards for compuers, and especially heir graphics processing unis (GPUs), are designed for fas rendering of graphics.

More information

Simultaneous Localization and Mapping with Stereo Vision

Simultaneous Localization and Mapping with Stereo Vision Simulaneous Localizaion and Mapping wih Sereo Vision Mahew N. Dailey Compuer Science and Informaion Managemen Asian Insiue of Technology Pahumhani, Thailand Email: mdailey@ai.ac.h Manukid Parnichkun Mecharonics

More information

Spline Curves. Color Interpolation. Normal Interpolation. Last Time? Today. glshademodel (GL_SMOOTH); Adjacency Data Structures. Mesh Simplification

Spline Curves. Color Interpolation. Normal Interpolation. Last Time? Today. glshademodel (GL_SMOOTH); Adjacency Data Structures. Mesh Simplification Las Time? Adjacency Daa Srucures Spline Curves Geomeric & opologic informaion Dynamic allocaion Efficiency of access Mesh Simplificaion edge collapse/verex spli geomorphs progressive ransmission view-dependen

More information

DAGM 2011 Tutorial on Convex Optimization for Computer Vision

DAGM 2011 Tutorial on Convex Optimization for Computer Vision DAGM 2011 Tuorial on Convex Opimizaion for Compuer Vision Par 3: Convex Soluions for Sereo and Opical Flow Daniel Cremers Compuer Vision Group Technical Universiy of Munich Graz Universiy of Technology

More information

In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps

In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magneic Field Maps A. D. Hahn 1, A. S. Nencka 1 and D. B. Rowe 2,1 1 Medical College of Wisconsin, Milwaukee, WI, Unied

More information

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter Robo localizaion under percepual aliasing condiions based on laser refleciviy using paricle filer DongXiang Zhang, Ryo Kurazume, Yumi Iwashia, Tsuomu Hasegawa Absrac Global localizaion, which deermines

More information

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding Moving Objec Deecion Using MRF Model and Enropy based Adapive Thresholding Badri Narayan Subudhi, Pradipa Kumar Nanda and Ashish Ghosh Machine Inelligence Uni, Indian Saisical Insiue, Kolkaa, 700108, India,

More information

A Face Detection Method Based on Skin Color Model

A Face Detection Method Based on Skin Color Model A Face Deecion Mehod Based on Skin Color Model Dazhi Zhang Boying Wu Jiebao Sun Qinglei Liao Deparmen of Mahemaics Harbin Insiue of Technology Harbin China 150000 Zhang_dz@163.com mahwby@hi.edu.cn sunjiebao@om.com

More information

Parallel and Distributed Systems for Constructive Neural Network Learning*

Parallel and Distributed Systems for Constructive Neural Network Learning* Parallel and Disribued Sysems for Consrucive Neural Nework Learning* J. Flecher Z. Obradovi School of Elecrical Engineering and Compuer Science Washingon Sae Universiy Pullman WA 99164-2752 Absrac A consrucive

More information

A Fast Non-Uniform Knots Placement Method for B-Spline Fitting

A Fast Non-Uniform Knots Placement Method for B-Spline Fitting 2015 IEEE Inernaional Conference on Advanced Inelligen Mecharonics (AIM) July 7-11, 2015. Busan, Korea A Fas Non-Uniform Knos Placemen Mehod for B-Spline Fiing T. Tjahjowidodo, VT. Dung, and ML. Han Absrac

More information

Real-Time Non-Rigid Multi-Frame Depth Video Super-Resolution

Real-Time Non-Rigid Multi-Frame Depth Video Super-Resolution Real-Time Non-Rigid Muli-Frame Deph Video Super-Resoluion Kassem Al Ismaeil 1, Djamila Aouada 1, Thomas Solignac 2, Bruno Mirbach 2, Björn Oersen 1 1 Inerdisciplinary Cenre for Securiy, Reliabiliy, and

More information

Network management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional);

Network management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional); QoS in Frame Relay Frame relay characerisics are:. packe swiching wih virual circui service (virual circuis are bidirecional);. labels are called DLCI (Daa Link Connecion Idenifier);. for connecion is

More information

An Iterative Scheme for Motion-Based Scene Segmentation

An Iterative Scheme for Motion-Based Scene Segmentation An Ieraive Scheme for Moion-Based Scene Segmenaion Alexander Bachmann and Hildegard Kuehne Deparmen for Measuremen and Conrol Insiue for Anhropomaics Universiy of Karlsruhe (H), 76 131 Karlsruhe, Germany

More information

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves AML7 CAD LECTURE Space Curves Inrinsic properies Synheic curves A curve which may pass hrough any region of hreedimensional space, as conrased o a plane curve which mus lie on a single plane. Space curves

More information

(Structural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS

(Structural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS (Srucural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS Himadri Ghosh, Prajneshu and Savia Wadhwa I.A.S.R.I., Library Avenue, New Delhi-110 01 him_adri@iasri.res.in, prajnesh@iasri.res.in,

More information

Michiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters

Michiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters SHORT TERM PREDICTIONS A MONITORING SYSTEM by Michiel Helder and Marielle C.T.A Geurs Hoofdkanoor PTT Pos / Duch Posal Services Headquarers Keywords macro ime series shor erm predicions ARIMA-models faciliy

More information

Efficient Region Tracking With Parametric Models of Geometry and Illumination

Efficient Region Tracking With Parametric Models of Geometry and Illumination EEE TRANSACTONS ON PATTERN ANALYSS AND MACHNE NTELLGENCE, VOL. 2, NO. 1, OCTOBER 1998 1 Efficien Region Tracking Wih Parameric Models of Geomery and lluminaion Gregory D. Hager, Member, EEE, and Peer N.

More information

Robust parameterized component analysis: theory and applications to 2D facial appearance models

Robust parameterized component analysis: theory and applications to 2D facial appearance models Compuer Vision and Image Undersanding 91 (2003) 53 71 www.elsevier.com/locae/cviu Robus parameerized componen analysis: heory and applicaions o 2D facial appearance models Fernando De la Torre a, * and

More information

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15)

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15) IROS 2015 Workshop on On-line decision-making in muli-robo coordinaion () OPTIMIZATION-BASED COOPERATIVE MULTI-ROBOT TARGET TRACKING WITH REASONING ABOUT OCCLUSIONS KAROL HAUSMAN a,, GREGORY KAHN b, SACHIN

More information

Deformable Parts Correlation Filters for Robust Visual Tracking

Deformable Parts Correlation Filters for Robust Visual Tracking PAPER UNDER REVISION Deformable Pars Correlaion Filers for Robus Visual Tracking Alan Lukežič, Luka Čehovin, Member, IEEE, and Maej Krisan, Member, IEEE ha par-based models should be considered in a layered

More information

FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS

FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS Mohammed A. Aseeri and M. I. Sobhy Deparmen of Elecronics, The Universiy of Ken a Canerbury Canerbury, Ken, CT2

More information

Landmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases

Landmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases Lmarks: A New Model for Similariy-Based Paern Querying in Time Series Daabases Chang-Shing Perng Haixun Wang Sylvia R. Zhang D. So Parker perng@cs.ucla.edu hxwang@cs.ucla.edu Sylvia Zhang@cle.com so@cs.ucla.edu

More information

Track-based and object-based occlusion for people tracking refinement in indoor surveillance

Track-based and object-based occlusion for people tracking refinement in indoor surveillance Trac-based and objec-based occlusion for people racing refinemen in indoor surveillance R. Cucchiara, C. Grana, G. Tardini Diparimeno di Ingegneria Informaica - Universiy of Modena and Reggio Emilia Via

More information

Low-Cost WLAN based. Dr. Christian Hoene. Computer Science Department, University of Tübingen, Germany

Low-Cost WLAN based. Dr. Christian Hoene. Computer Science Department, University of Tübingen, Germany Low-Cos WLAN based Time-of-fligh fligh Trilaeraion Precision Indoor Personnel Locaion and Tracking for Emergency Responders Third Annual Technology Workshop, Augus 5, 2008 Worceser Polyechnic Insiue, Worceser,

More information

Gender Classification of Faces Using Adaboost*

Gender Classification of Faces Using Adaboost* Gender Classificaion of Faces Using Adaboos* Rodrigo Verschae 1,2,3, Javier Ruiz-del-Solar 1,2, and Mauricio Correa 1,2 1 Deparmen of Elecrical Engineering, Universidad de Chile 2 Cener for Web Research,

More information

THE goal of this work is to develop statistical models for

THE goal of this work is to develop statistical models for IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 32, NO. 4, APRIL 2010 579 Nonsaionary Shape Aciviies: Dynamic Models for Landmark Shape Change and Applicaions Samarji Das, Suden Member,

More information

The Impact of Product Development on the Lifecycle of Defects

The Impact of Product Development on the Lifecycle of Defects The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes

More information

Rule-Based Multi-Query Optimization

Rule-Based Multi-Query Optimization Rule-Based Muli-Query Opimizaion Mingsheng Hong Dep. of Compuer cience Cornell Universiy mshong@cs.cornell.edu Johannes Gehrke Dep. of Compuer cience Cornell Universiy johannes@cs.cornell.edu Mirek Riedewald

More information

A Review on Block Matching Motion Estimation and Automata Theory based Approaches for Fractal Coding

A Review on Block Matching Motion Estimation and Automata Theory based Approaches for Fractal Coding Regular Issue A Review on Block Maching Moion Esimaion and Auomaa Theory based Approaches for Fracal Coding Shailesh D Kamble 1, Nileshsingh V Thakur 2, and Preei R Bajaj 3 1 Compuer Science & Engineering,

More information

Optimal Crane Scheduling

Optimal Crane Scheduling Opimal Crane Scheduling Samid Hoda, John Hooker Laife Genc Kaya, Ben Peerson Carnegie Mellon Universiy Iiro Harjunkoski ABB Corporae Research EWO - 13 November 2007 1/16 Problem Track-mouned cranes move

More information

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game Deecion Tracking and Recogniion of Human Poses for a Real Time Spaial Game Feifei Huo, Emile A. Hendriks, A.H.J. Oomes Delf Universiy of Technology The Neherlands f.huo@udelf.nl Pascal van Beek, Remco

More information

Joint Feature Learning With Robust Local Ternary Pattern for Face Recognition

Joint Feature Learning With Robust Local Ternary Pattern for Face Recognition Join Feaure Learning Wih Robus Local Ternary Paern for Face Recogniion Yuvaraju.M 1, Shalini.S 1 Assisan Professor, Deparmen of Elecrical and Elecronics Engineering, Anna Universiy Regional Campus, Coimbaore,

More information