J. Vis. Commun. Image R.

Size: px
Start display at page:

Download "J. Vis. Commun. Image R."

Transcription

1 J. Vis. Commun. Image R. 20 (2009) 9 27 Conens liss available a ScienceDirec J. Vis. Commun. Image R. journal homepage: Face deecion and racking using a Boosed Adapive Paricle Filer Wenlong Zheng a, *, Suchendra M. Bhandarkar b a Deparmen of Informaion Sysems, Norhwes Missouri Sae Universiy, 800 Universiy Drive, Maryville, MO 64468, USA b Deparmen of Compuer Science, The Universiy of Georgia, Ahens, GA , USA aricle info absrac Aricle hisory: Received 30 March 2007 Acceped 9 Sepember 2008 Available online 7 Sepember 2008 Keywords: Adapive Paricle Filer Paricle filer Face deecion Video racking Image analysis Boosed learning Boosed Adapive Paricle Filer Adapive Learning Consrain A novel algorihm, ermed a Boosed Adapive Paricle Filer (BAPF), for inegraed face deecion and face racking is proposed. The proposed algorihm is based on he synhesis of an adapive paricle filering algorihm and he AdaBoos face deecion algorihm. An Adapive Paricle Filer (APF), based on a new sampling echnique, is proposed. The APF is shown o yield more accurae esimaes of he proposal disribuion and he poserior disribuion han he sandard Paricle Filer hus enabling more accurae racking in video sequences. In he proposed BAPF algorihm, he AdaBoos algorihm is used o deec faces in inpu image frames, whereas he APF algorihm is designed o rack faces in video sequences. The proposed BAPF algorihm is employed for face deecion, face verificaion, and face racking in video sequences. Experimenal resuls show ha he proposed BAPF algorihm provides a means for robus face deecion and accurae face racking under various racking scenarios. Ó 2008 Elsevier Inc. All righs reserved.. Inroducion Face deecion is imporan in several auomaed sysems ha ake as inpu images of he human face. Examples include fully auomaic face recogniion sysems, video-based surveillance and warning sysems, human face/body racking sysems and percepual human compuer inerfaces. Mos face deecion algorihms can be classified as feaure-based or appearance-based. In recen years, appearance-based face deecion algorihms ha employ machine learning and saisical esimaion mehods have demonsraed excellen resuls among all exising face deecion mehods. Examples of appearance-based face deecion echniques include he AdaBoos algorihm [,2], he FloaBoos algorihm [3], he S-AdaBoos algorihm [4], neural neworks [5,6], Suppor Vecor Machines (SVM) [7,8], Hidden Markov Models [9], and he Bayes classifier [0,]. Viola and Jones [,2] propose a robus AdaBoos face deecion algorihm, which can deec faces in a rapid and robus manner wih a high deecion rae. Li e al. [3] propose he Floa- Boos algorihm, an improved version of he AdaBoos algorihm, for learning a boosed classifier wih minimum error rae. The Floa- Boos algorihm uses a backracking mechanism o improve he face deecion rae afer each ieraion of he AdaBoos procedure. However his mehod is compuaionally more inensive han he AdaBoos algorihm. Jiang and Loe [4] propose he S-AdaBoos * Corresponding auhor. addresses: zheng@nwmissouri.edu (W. Zheng), suchi@cs.uga.edu (S.M. Bhandarkar). algorihm, a varian of he AdaBoos algorihm, for handling ouliers in paern deecion and classificaion. Since he S-AdaBoos algorihm uses differen classifiers in differen phases of compuaion, i suffers from compuaional inefficiency and lack of accuracy. Among he various neural nework-based approaches o face deecion, he work of Rowley e al. [5] is paricularly well known. Rowley e al. [5] employ a mulilayer neural nework o learn he face and non-face paerns from raining ses consising of face and non-face images. A significan drawback of heir echnique is ha he deecion is limied o primarily uprigh fronal faces. Alhough Rowley e al. [5] furher generalize heir mehod o deec roaed face images, he repored resuls are no promising because of he resuling low deecion rae. Face deecion echniques based on Suppor Vecor Machines (SVMs) use srucural risk minimizaion o minimize he upper bound of he expeced generalizaion error [7,8]. The major disadvanages of SVMs include inensive compuaion during he learning process and high memory requiremen. Face deecion echniques based on Hidden Markov Models (HMMs) assume ha face and non-face paerns can be characerized as oupus of a parameric random Markovian process whose parameers can be learned using a well-defined esimaion procedure [9]. The goal of raining an HMM is o esimae he appropriae parameers in he HMM model ha maximize he probabiliy or likelihood of he observed raining daa. Schneiderman and Kanade [0] presen a naive Bayes classifier for face deecion, which is based on he esimaion of he join probabiliy of he local appearance and posiion of a face paern a muliple scales. However, he performance of he naive Bayes /$ - see fron maer Ó 2008 Elsevier Inc. All righs reserved. doi:0.06/j.jvcir

2 0 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) 9 27 classifier is repored o be poor [0]. To address his problem, Schneiderman [] has proposed a resriced Bayesian nework for face deecion based on performing a search in he large space of possible nework srucures o deermine he opimal srucure of a Bayesian nework-based classifier. Objec racking in video has also been sudied exensively by researchers in compuer vision because of various compuer vision applicaions such as auonomous robos [2], video surveillance [3], human eye racking [4] and human face racking [5] ha use racking algorihms exensively. Objec racking algorihms designed o operae under more general and less srucured siuaions need o deal wih complex issues of uncerainy and error arising from occlusion, and changes in illuminaion, viewpoin and objec scale [6]. Consequenly, many echniques have been developed o ackle he various aforemenioned issues in visual objec racking and repored in he lieraure over he pas decade. The various echniques for visual objec racking can be classified as image (region)-based, conour-based or filering-based. Image (region)-based racking mehods ypically exrac generic regionbased feaures from he inpu images and hen combine hese feaures using high-level scene informaion [6]. Inille e al. [6] propose a blob-racker for human racking in real ime wherein he background is subraced o exrac foreground regions. The foreground regions are hen divided ino blobs based on color. This approach exhibis good run-ime performance, bu suffers from a major disadvanage in erms of merging of blobs when he objecs in he scene approach each oher. Conour-based racking mehods assume ha he racked objecs are bounded by conours wih known properies [7 9]. The conour pixels are racked from one image frame o he nex using a predefined conour shape model. Dynamical or elasic conour models are used o handle changes in objec shape due o deformaion or change in scale. Filering-based mehods are based on predicion and updaing of objec feaures over ime, i.e., over successive frames in he video sequence. Tracking of objec shape and objec locaion over ime is ackled adequaely by he radiional Kalman filer in cases where he racking problem can be effecively modeled as a linear dynamic sysem [20]. The Exended Kalman Filer (EKF) is an exension of he radiional Kalman filer o a non-linear bu unimodal process where he non-linear behavior can be approximaed by local linearizaion [2]. However, i is widely acceped ha he paricle filer is superior o he radiional Kalman filer in erms of racking performance [22], since he paricle filer provides a robus objec racking framework wihou being resriced o a linear sysem model. Paricle filers, also known as sequenial Mone Carlo filers, have been widely used in visual racking o address limiaions arising from non-lineariy and non-normaliy of he moion model [23,24]. The basic idea of he paricle filer is o approximae he poserior densiy using a recursive Bayesian filer based on a se of paricles wih cerain assigned weighs. The Condensaion algorihm, a simple paricle filer, proposed by Isard [25] is designed o solve racking problems arising from non-lineariy and non-normaliy of he moion model. During he sampling sep, he Condensaion algorihm uses a simple proposal disribuion o draw a se of paricles, which defines he condiional disribuion on he paricle sae in he previous frame. The proposal disribuion is hen used o approximae he arge a poseriori disribuion. A commonly observed shorcoming of he Condensaion algorihm is ha he proposal disribuion does no incorporae he informaion from he curren frame hus resuling in longer run ime needed for convergence o he desired a poseriori disribuion. Various approaches have been developed o improve he racking performance of he convenional paricle filer. Li e al. [23] propose a Kalman paricle filer (KPF) and an unscened paricle filer (UPF) o improve he paricle sampling procedure in he conex of visual conour racking. This approach makes use of a Kalman filer or an unscened Kalman filer o incorporae he curren observaion in he esimaion of he proposal disribuion. The Kalman filer or he unscened Kalman filer is shown o seer he se of paricles o regions of high likelihood in he search space, and hus reducing he number of paricles needed o esimae he proposal disribuion [23]. To address he occlusion problem, Wang and Cheong [26] propose a paricle filer wih a Markov random field (MRF)-based represenaion of he racked objec wihin a dynamic Bayesian framework. This mehod ransforms an objec ino a composie of muliple MRF regions o improve he modeling and racking accuracy. Chang e al. [22] presen a kernel paricle filer o improve he sampling efficiency for muliple objec racking using daa associaion echniques. This scheme invokes kernels o approximae coninuously he poserior densiy, where he kernels for objec represenaion and localizaion are allocaed based on he value of he gradien derived from he kernel densiy. However, his mehod can no handle siuaions in which he moion paern of objecs in one group changes drasically. Rahi e al. [9] formulae geomeric acive conours as a parameerizaion echnique o deal wih deformable objecs. However, heir echnique performs poorly when he racked objec is compleely occluded over several frames. Isard and MacCormick [27] propose a Bayesian muliple blob-racker (BraMBLe), an early varian of a paricle filer, in which he number of racked objecs is allowed o vary during he racking process. Noneheless, his approach relies on modeling a fixed background o idenify foreground objecs, a siuaion ha is no always pracical in real-world racking scenarios. To address his problem, Okuma e al. [24] relax he assumpion of a fixed background and allow he background o vary in order o handle real-world image sequences. Okuma e al. [24] also propose a boosed paricle filer (BPF) for muliple objec deecion and racking, which inerleaves he AdaBoos algorihm wih a simple paricle filer based on he Condensaion algorihm. However, his mehod does no presen a sysemaic way o incorporae objec models o guaranee accurae approximaion of he proposal disribuion, nor does i address he occlusion problem. In his paper, we propose a novel paricle filering scheme, ermed as an Adapive Paricle Filer (APF), o enable esimaion of he proposal disribuion and he poserior disribuion wih a much higher degree of accuracy. Based on he previous work of Isard [25], Li e al. [23], Vermaak e al. [28] and Okuma e al. [24], we also propose a novel scheme for inegraion of face deecion and face racking by combining he APF-based racking algorihm wih he AdaBoos face deecion algorihm. We erm he combinaion of he proposed APF algorihm and he AdaBoos algorihm as a Boosed Adapive Paricle Filer (BAPF). In he proposed BAPF-based face racking scheme, he AdaBoos algorihm is used o deec faces and also verify he exisence of a racked face in an inpu image frame, whereas he APF algorihm is designed o rack he deeced faces across he image frames in he video sequence. The BAPF algorihm is shown o yield very good racking resuls in siuaions where he racked objecs are severely occluded. Experimenal resuls show ha he proposed BAPF scheme provides robus face deecion and accurae face racking in various racking scenarios. 2. Saisical model A glossary of mahemaical noaion used in he formulaion of he saisical model and he paricle filering algorihm is given in Appendix A. 2.. Observaion model We denoe a sae vecor for an objec by x, and an observaion vecor by y. I is imporan for conour racking o obain an accurae

3 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) 9 27 esimae of he observaion likelihood (also ermed as he observaion densiy) p(y x). Blake e al. [29], Isard[25], andmaccormick andblake [8,30] inroduce saisical models for esimaion of he observaion densiy p(y x). These models use specific image feaures ha are colleced along a se of normals o a hypohesized conour. A finie numberofsamplepoins, calledconrol poins, are generaed on he hypohesized conour. We follow he general direcion provided by he aforemenioned models for modeling he observaion process, bu specifically follow he model proposed by MacCormick [8]. Fig. shows an observed conour and image feaures exraced along a measuremen line. We denoe he finie number of sample poins on a hypohesized conour by a se {x i ; i =,2,..., n}. We erm he normals o he above conour as measuremen lines, and denoe hem by a se {s i ; i =,2,..., n}. The lengh of he measuremen lines is fixed a a value T. The Canny edge deecor is applied o he measuremen line s i (i =,2,..., n) in order o obain he posiions of he edge feaures fz ðmþ i ; m ¼ ; 2;...; m i g where m is he index of he deeced feaures, and m i is he number of deeced feaures. As can be seen, each feaure is joinly generaed by he boundary of an objec and random cluer presen in he image. The cluer feaures on he measuremen line s i (i =,2,..., n) are assumed o obey a Poisson disribuion wih densiy parameer k given by p T ðm i Þ¼ ðktþm i e kt m i! where m i is he number of deeced cluer feaures. A boundary densiy funcion is assumed o obey a Gaussian disribuion; hus he generic likelihood funcion of an observaion a a sample poin x i (i =,2,...,n) can be described by [8]: p xi ðzjv ¼fx i gþ!! ¼ ðktþm i e kt q m i! 0 þ q X m i pffiffiffiffiffiffi exp ðzðmþ i x i Þ 2 k 2p r 2r 2 where q 0 is he probabiliy of undeeced feaures for an objec boundary, and q is he probabiliy of deeced feaures for an objec boundary. Assuming ha he sample poins are independen and idenically disribued, he overall likelihood funcion of he observaion p(y x) can be represened as pðyjxþ ¼ Yn ðktþ m i m i! q 0 þ q k Y e kt meas: line s i inersecing x!! pffiffiffiffiffiffi exp ðzðmþ i x i Þ 2 2p r 2r 2 X m i ðþ ð2þ ð3þ 2.2. Dynamical model Generally speaking, a paricle filer algorihm requires a dynamical model o describe he evoluion of a racking sysem over ime. An auo-regressive process (ARP) model has been widely used for he purpose of formulaing such a dynamical model [8,3]. Blake e al. [29,32] model he objec dynamics as a second-order process. Isard and Blake [33] and Li e al. [23] also follow he dynamical model of Blake e al. [29,32] in heir work on objec racking. Following he previous works of Blake e al. [29,32], Isard and Blake [33] and Li e al. [23], his paper also employs a second-order ARP as he dynamical model for face racking. I is widely acceped ha he secondorder ARP capures he various moions of ineres ha are relevan o visual racking [8]. The parameers of he dynamical model in a ypical real-world applicaion can be obained via learning from he inpu raining daa. The second-order ARP represens he sae x a ime as a linear combinaion of he previous wo saes and addiive Gaussian noise. The dynamical model can be represened as a second-order linear difference equaion: x x ¼ Aðx xþþbx where x is Gaussian noise ha is independen of he sae vecor x, x denoes he mean value of he sae vecor, and A and B are marices describing he deerminisic componen and he sochasic componen of he dynamical model, respecively. The sae vecor x encodes he knowledge of he objec conour in he curren sae and he previous sae and is represened by x ¼ X : X In mos real-world applicaions, one can se some reasonable defaul values for he parameers A, B and x of he dynamical model. However, i is more effecive o approximae hese values via observaion of video sequences, in which he racked objec undergoes some ypical represenaive moions [29,34]. The dynamical model can also be represened by a emporal Markov chain [33] given by pðx jx Þ¼Cexp 2 jb ððx xþ Aðx xþþj 2 ð5þ where C is a consan, and denoes he Euclidean norm. ð4þ Fig.. (a) Observaion process: he ellipse is a hypohesized conour in an image. (b) The image feaures on he measuremen line.

4 2 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) Face racking using paricle filering 3.. The filering disribuion We denoe a sae vecor for an objec a ime by x, and is hisory up o ime by x : ={x,x 2,...,x }. Likewise, an observaion vecor a ime is denoed by y and is hisory up o ime is denoed by y : ={y,y 2,...,y }. The sandard problem of arge racking, in he erminology of saisical paern recogniion, is o esimae he sae x of he objecs a ime, using a se of observaions y from a sequence of inpu images. The a poserior densiy p(x y : ) represens all he informaion abou x a ime ha is poenially deducible from he se of observaions y up o ha ime. We assume ha he objec dynamics consiue a emporal Markov process and ha he observaions y are independen. Therefore, he dynamics are deermined by a ransiion prior p(x x ). Given he ransiion prior p(x x ) and he observaion densiy p(y x ), he poserior densiy p(x y : ) can be compued by applying Bayes rule [35] for inferring he poserior sae densiy from ime-varying observaions. The poserior densiy is esimaed recursively via Bayesian filering [33,36] as follows pðx jy : Þ¼ pðy jx ; y : Þpðx jy : Þ ¼ pðy jx Þpðx jy : Þ pðy jy : Þ pðy jy : Þ where Z pðx jy : Þ¼ pðx jx Þpðx jy : Þdx The poserior densiy p(x y : ) is generally evaluaed in wo seps, namely he predicion sep and he updaing sep. Firs, an effecive prior p(x y : ) shown in Eq. (7) is prediced from he poserior densiy p(x y : ) via he ransiion prior p(x x ). Second, he poserior densiy p(x y : ) is updaed based upon he new observaion y a ime, as given by Eq. (6). The observaion prior p(y y : ), which is he denominaor in Eq. (6), can be represened by pðy jy : Þ¼ X x pðy ; x jy : Þ¼ X x pðy jx Þpðx jy : Þ ð8þ Furhermore, he observaion prior p(y y : ) can be represened by he following inegral: Z pðy jy : Þ¼ pðy jx Þpðx jy : Þdx ð9þ ð6þ ð7þ Eqs. (0) and () represen an opimal soluion o he sandard problem of objec racking via recursive Bayesian filering. Obviously, his soluion involves he compuaion of high-dimensional inegrals, and dealing wih he non-lineariy and non-normaliy of he moion model in many racking scenarios. High-dimensional inegraions usually canno be easily compued in a closed analyical form. Thus a paricle filer, also known as a sequenial Mone Carlo filer, is adoped as a pracical soluion o he oherwise inracable problem of objec racking via recursive Bayesian filering The sandard paricle filer A sandard paricle filer uses Mone Carlo simulaion o esimae he poserior probabiliy densiy funcion p(x y : ) given in Eq. (0). Paricle filering uses random sampling sraegies o model a complex poserior probabiliy densiy funcion p(x y : ). I uses N weighed discree paricles o approximae he poserior probabiliy densiy funcion p(x y : ) via observaion of he daa. Each paricle consiss of a sae vecor x and a weigh w. The weighed paricle se is given by fðx ðiþ ; Þ; i ¼ ; 2;...; Ng. Paricle filering samples he space spanned by x wih N discree paricles and approximaes he disribuion using he discree poins sampled by he paricles and heir associaed weighs. Specifically, we assume ha N paricles are used in he sampling procedure o approximae he poserior probabiliy densiy funcion p(x y : ), and ha he N discree sample poins in he space of x are given by x ; x2 ;...; xn, respecively. Thus we have pðx jy : Þ¼ XN w i dðx x i Þ ð3þ Since i is no feasible o draw samples direcly from he poserior disribuion, a proposal disribuion q(x x,y : ) is used o draw he samples easily for approximaion of he poserior probabiliies. Using he proposal disribuion q(x x,y : ), a paricle filer generaes he ih paricle x ðiþ, where (i =,2,...,N), and compues he weigh for x ðiþ ¼ pðy jx ðiþ Þpðx ðiþ jx ðiþ Þ qðx ðiþ jx ðiþ ; y :Þ from x ðiþ using he following equaion The poserior disribuion p(x y : ) can be hus approximaed as pðx jy : Þ XN ð4þ dðx x ðiþ Þ ð5þ The esimae of he funcion f(x ) of he sae vecor could be hen compued as Thus Eq. (6) becomes pðx jy : Þ¼ pðy jx Þpðx jy : Þ R pðy jx Þpðx jy : Þdx ð0þ E½f ðx ÞŠ XN f ðx ðiþ Þ: ð6þ Using Eq. (7), we subsiue he expression for he effecive prior p(x y : ) in Eq. (0) o obain pðx jy : Þ¼ pðy jx Þ R pðx jx Þpðx jy : Þdx R pðy jx Þ R pðx jx Þpðx jy : Þdx dx ðþ In addiion o he esimae of he poserior densiy p(x y : ), an esimae of a funcion f(x ) of objec sae vecor is also required under many siuaions. The expeced value or esimae of he funcion f(x ) is approximaed as Z E½f ðx ÞŠ ¼ f ðx Þpðx jy : Þdx ð2þ 3.3. The Adapive Paricle Filer One of he acive research areas in paricle filering in recen years is he generaion of a good proposal disribuion q(x x,y : ) ha would enable a more accurae esimae of he poserior disribuion p(x y : ). The aim is o obain as close an approximaion o he poserior probabiliy disribuion as possible. The sandard paricle filer based on he Condensaion algorihm [33] does no use he knowledge obained from he curren image frame, which leads o higher inaccuracy in he esimae of he poserior disribuion. Douce e al. [36] propose an opimal proposal disribuion (OPD) for sae esimaion of jump Markov linear sysems, and recursively compue opimal sae esimaes based on he selecion of he minimum variance of weighs

5 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) ði ¼ ; 2;...; NÞ. To overcome he problem of inefficien compuaion of he OPD, Li e al. [23] propose a Kalman paricle filer (KPF) and an unscened paricle filer (UPF) o drive a se of paricles o he high-likelihood regions in he search space. Li e al. [23] propose a local linearizaion of he OPD o esimae he proposal disribuion, which is assumed o be a Gaussian disribuion. Therefore, he proposal disribuion obained by Li e al. [23] can be represened as u l ðxþ ¼qðx ðiþ jx ðiþ ; y :kþ ¼Nð^x ðiþ ; b P ðiþ Þ i ¼ ; 2;...; N: ð7þ where ^x ðiþ and P bðiþ denoe he mean and covariance, respecively, of he Gaussian disribuion Nð^x ðiþ ; P bðiþ Þ. In his paper, we propose a new paricle filering scheme, ermed as an Adapive Paricle Filer (APF), which enables significanly more accurae esimaion of he proposal disribuion and he poserior disribuion. The proposed APF can be viewed as an exension of he Condensaion algorihm and he Kalman paricle filer o obain an accurae approximaion of he proposal disribuion and he poserior disribuion. The proposed APF algorihm is depiced in Fig. 2. In he sampling sep of he APF algorihm (Fig. 2), a new sampling sraegy is used o improve he accuracy of he approximaion, which is disinc from he sampling sraegies used in oher paricle filers. The sampling sep is considered o be he mos imporan sep in a paricle filering algorihm. For each discree paricle x ðiþ ;l, he Adapive Paricle Filer generaes a new paricle x ðiþ ;l based on a proposal disribuion u l (x). We use he loop conrolled by he parameer l in he APF algorihm (Fig. 2) o implemen he proposed sampling sraegy. Le he parameer L denoe he fixed number of ieraions of loop l, where he value of L can be uned based on he needs of differen real-world applicaions. When L =, he APF is anamoun o he sandard paricle filer; hus he sandard paricle filer is a special case of he proposed APF. When L >, he APF performs more sampling ieraions han he sandard paricle filer. We prove in Appendix A ha he addiional ieraions of he APF resul in a lower esimaion error for he proposal disribuion and poserior disribuion. In order o enable more accurae esimaion of he proposal disribuion, we ierae he sampling procedure subjec o a consrain ermed as he Adapive Learning Consrain (ALC) described using Eq. (7). A more deailed explanaion and derivaion of he ALC is provided in Appendix A. K l max l a K l min l where max l ¼ maxfm jn ðiþ in min l ¼ minfm jn ðiþ in xðiþ j; M ;l 2jn ðiþ 2 xðiþ j; m ;l 2jn ðiþ 2 xðiþ ;l jg xðiþ ;l jg ð8þ K l, K l are consans, and 0 < a <. As long as he proposed ALC is saisfied, he curren ieraion ha generaes new paricles for racking in he curren image frame will coninue. The curren ieraion in he sampling sep is haled when he proposed ALC ceases o be saisfied or he predefined loop hreshold is reached. From boh, a heoreical and pracical viewpoin, he paricles (and heir associaed weighs and sae vecor) resuling from he laes ieraion of he ALC represen a beer approximaion o he proposal disribuion and he poserior disribuion. The heoreical analysis and experimenal resuls presened in he following secions of he paper confirm ha he performance of he APF algorihm is indeed superior o ha of he convenional paricle filering algorihm. The oher seps in he proposed APF algorihm are similar o hose in he case of oher paricle filers such as he KPF and he UPF. The iniializaion sep in he proposed APF algorihm akes advanage of he informaion derived from he resuls of he Ada- Boos face deecion algorihm. 4. The Boosed Adapive Paricle Filer The proposed Boosed Adapive Paricle Filer (BAPF) for face deecion and racking employs wo objec models: he conourbased model used in he Adapive Paricle Filer (APF) and he region-based model used in he AdaBoos face deecion procedure. The objec models used in he conex of racking can be ypically classified ino hree general caegories [37]: conour-based models [23,38,39], region-based models [5,27,40], and feaure poinbasedmodels [4 43,45]. Since he proposed BAPF algorihm uses wo disinc models for face deecion and racking, i has disinc advanages over he convenional paricle filer. The incorporaion of he AdaBoos algorihm wihin he APF algorihm is shown o subsanially improve he robusness of he resuling BAPF algorihm. The Ada- Boos algorihm provides a naural mechanism for inegraion of he conour- and region-based represenaions. This makes he proposed BAPF algorihm more powerful han he naïve K-means clusering mehod proposed by Vermaak e al. [28] in heir disribuion mixure racking algorihm. The proposed BAPF algorihm also performs beer han he disribuion mixure represenaion scheme of Okuma e al. [24] since he proposed approach employs a more effecive paricle filering algorihm, i.e., he APF algorihm. In he proposed BAPF scheme, he AdaBoos algorihm allows for explici deecion of faces enering and leaving he field of view of he camera, hus providing a means for (re)iniializaion of he APF-based racking algorihm. The AdaBoos algorihm provides a means of verificaion of he predicions made by he APF-based racking algorihm whereas he APF-based racking algorihm, in urn, provides he focus-of-aenion regions for he AdaBoos algorihm. The resuling BAPF algorihm is seen o provide robus face deecion and accurae face racking under various racking scenarios. 4.. Face deecion using adaboos Among he various face deecion mehods described in he published research lieraure, he boosed learning-based face deecion mehods have demonsraed excellen resuls. Building on he previous work of Tieu and Viola [44] and Schneiderman [], Viola and Jones [,2] have proposed a robus AdaBoos face deecion algorihm, which can deec faces in an inpu image in a rapid and robus manner wih a high deecion rae. The face deecion echnique in AdaBoos is comprised of hree aspecs: he inegral image, a srong classifier comprising of weak classifiers based on he AdaBoos learning algorihm, and an archiecure comprising of a cascade of a number of srong classifiers. We employ he AdaBoos scheme of Viola and Jones [,2] for face deecion. A 25-layer cascade of boosed classifiers is rained o deec muli-view faces. A se of sample face and non-face (ermed as background) images are used for raining. Anoher se of non-face sample images is colleced from video sequences conaining no faces. The resuls of he AdaBoos raining procedure and he AdaBoos face deecion procedure are presened in Secion Inegraing he Adapive Paricle Filer wih adaboos The proposed face deecion and racking scheme is based on he inegraion of wo disinc models: an AdaBoos face deecion model and an Adapive Paricle Filer (APF)-based face racking

6 4 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) 9 27 Fig. 2. The algorihm describing he Adapive Paricle Filer. model. The AdaBoos face deecion model performs muli-view face deecion based on he rained AdaBoos algorihm. The APF model performs visual conour racking using he adapive paricle filering algorihm described in Secion 3.3. Fig. 3 shows he closed-loop conrol sysem view of he proposed inegraed face deecion and face racking scheme.

7 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) Single Frame Adapive Paricle Filer Face Tracking Predicion scene condiions deermined by he ambien illuminaion, cluer, and iner-objec and background occlusions. Iniializaion Verificaion Definiion The process for face deecion and racking conains wo phases: an iniializaion phase and a racking phase. In he iniializaion phase of he BAPF, he AdaBoos face deecion model is used o provide he iniial parameers for he APF face racking model based on observaions of he image frames in he video sream over a cerain ime inerval. During he racking phase, he AdaBoos face deecion model and he APF face racking model improve he racking performance via synergeic ineracion. The AdaBoos face deecion model helps he APF face racking model o find and define new objecs (faces), and o verify he curren saes of he objecs (faces) being racked. On he oher hand, he APF face racking model provides focus-of-aenion regions wihin he image o speed up he AdaBoos face deecion procedure. Afer performing he AdaBoos face deecion procedure on an image, we obain a confidence measure g for each deeced face in he image. Likewise, using he APF racking algorihm, he esimae of f(x ) generaed by he Adapive Paricle Filer a each sample poin along he conour is given by E½f ðx ÞŠ XN AdaBoos Face Deecion f ðx ðiþ Þ ð9þ The resuls of he AdaBoos algorihm and he APF algorihm are combined o updae he posiion of a sampled poin, as follows E c ðf ðx ÞÞ ¼ ð cþeðf ðx ÞÞ þ c g d Focus-of-aenion Fig. 3. Inegraion of he APF wih AdaBoos wihin a single feedback conrol sysem. ð20þ where E c (.) represens he esimaed posiion of a sampled poin on he conour combining he esimaion values from he APF and he AdaBoos algorihm, he parameer c is he weigh assigned o he Adaboos deecion procedure, he parameer g is a confidence measure assigned o each deeced face in he image, d is he disance beween he cener of a deeced face from he AdaBoos algorihm and he cener of a sampled emplae conour from he APF algorihm, and f(x ) represens he locaion of he conour of he racked face. In pracice, he oupu of he Adaboos face deecion procedure is averaged over he previous F frames before combining i wih he APF oupu using Eq. (9). The value of E c (f(x )) is fed back o he APF for furher processing in successive ieraions. Eq. (9) demonsraes ha he combinaion of he AdaBoos algorihm and he APF algorihm can be employed o obain he esimaed posiion of a sampled poin on he conour. The acual esimaed posiion was compued by adding a weighed esimae of he APF and a weighed disance beween he cener of a face deeced using he AdaBoos algorihm and he cener of a sampled emplae conour from he APF algorihm. The firs erm represens he influence of he APF, whereas he second erm represens he influence of he AdaBoos algorihm. The parameer c in Eq. (9) can be fine-uned wihou affecing he convergence of he APF. When c = 0, he BAPF degeneraes o he pure APF. By increasing he value of c, we lay a greaer emphasis on he oupu of he Ada- Boos face deecion algorihm. On he oher hand, when c =, he BAPF degeneraes o he pure AdaBoos algorihm, which does no ake ino accoun he oupu of he APF-based racker. In pracice, one could adjus he value of he parameer c based on differen 5. Experimenal resuls 5.. Adaboos face deecion The AdaBoos scheme of Viola and Jones [,2] is used o deec faces in inpu images. In our experimen, we rain a 25-layer cascade of srong classifiers o deec muli-view faces in video sequences. The raining daa se is composed of face and nonface images of size pixels. A se of 6230 muli-view face images of eigh persons is colleced from video sequences under differen condiions of ambien scene illuminaion, surface reflecion, facial pose, facial expression and background composiion in order o make face deecion scheme more robus in differen scenarios. The face images are cropped and scaled o a resoluion of pixels. Anoher se of 6598 non-face sample images of size pixels are colleced from video sequences conaining no faces. The non-face sample images are of he same size as he video frames acquired by he video camera for realime face racking, bu his is no a sric requiremen. Fig. 4 shows some random face samples used for he raining, and Fig. 5 shows some random non-face samples used for raining. A larger raining se of face and non-face examples ypically leads o beer deecion resuls, alhough deecion failures sill exis in he inpu image regions ha conain overlaps or occlusions, and are characerized by high scene cluer. Some resuls of face deecion using our rained AdaBoos face deecion procedure are illusraed in Fig. 6. AdaBoos face deecion performs well in mos cases, bu generaes false posiives in very busy images ha conain overlaps or occlusions, and are characerized by high scene cluer Boosed Adapive Paricle Filer The proposed Boosed Adapive Paricle Filer (BAPF) is implemened using C++ under he Microsof Visual C++.NET environmen on a.6 GHz Penium-M worksaion. Video sequences are of size pixels and are sampled a 30 frames per second. In he beginning, he AdaBoos face deecion algorihm iniializes he objec sae(s) in he Adapive Paricle Filer (APF)-based face racking algorihm. The iniializaion is done using observaions of he image frames in he inpu video sequence over a cerain ime inerval. Since he conour defining he appearance of he face in he video sequences is roughly circular or ellipical in shape, we use a simple parameerized model o represen he conour i.e., Ax 2 + By 2 + C = 0. Noe ha he proposed BAPF algorihm can also be used in he case of more complex conours ha are modeled using a B-spline represenaion. The proposed BAPF algorihm has been applied o various racking scenarios shown in Fig. 7 hrough Fig. 2. The racking resuls are presened for hree es video sequences ha are capured under varying condiions defined by ambien scene illuminaion, surface reflecion, objec scale, facial pose (consising of boh, in-plane and ou-of-plane roaions), facial expression, occlusions and background composiion. In wo of he es videos (es videos and 3) here is a single face in he scene. Tes video is used in experimens comprising of differen racking scenarios, whereas es video 3 is used o compare he racking accuracy of he proposed BAPF algorihm wih ha of he Condensaion algorihm (i.e., he convenional paricle filer). The hird es video (es video 2), in which here are wo faces in he scene, is used in experimens ha cover differen muli-face racking scenarios. All racking resuls are obained using N = 000 paricles in he APF, BAPF and Condensaion algorihms.

8 6 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) 9 27 Fig. 4. Face examples. Fig. 5. Non-face examples. Fig. 6. Resuls of fronal face deecion and muli-view face deecion.

9 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) Fig. 7. Tracking resuls wih scale changes on es video. From lef o righ, he frame numbers are 98, 043 and 067. Fig. 8. Tracking resuls wih illuminaion changes on es video. From lef o righ, he frame numbers are 866, 954 and Descripion of BAPF experimens In Fig. 7 hrough 2, a yellow ellipse around a face implies he absence of occlusion, whereas a red ellipse indicaes he presence of occlusion. Fig. 7 shows snapshos of a single-face racking experimen on es video where he scale of he racked face is observed o change significanly. The racking resuls show ha he proposed BAPF racking algorihm can handle significan scale changes in he objec appearance. Fig. 8 shows snapshos of a single-face racking experimen on es video under changing illuminaion condiions. I demonsraes ha he proposed BAPF algorihm is robus o various changes in ambien illuminaion condiions which can be aribued o he inheren robusness of he APF algorihm and he inegraion of he saisical learning procedure in AdaBoos wih he APF-based racker. Fig. 9 shows snapshos of a single-face racking experimen on es video under changes in viewpoin and facial pose arising from ou-of-plane roaions of he face. I shows ha he proposed BAPF algorihm can handle muli-view face deecion and racking. Fig. 0 shows snapshos of a single-face racking experimen on es video wih in-plane roaions of he face. I shows ha he proposed BAPF algorihm can handle objec appearance changes due o in-plane objec roaions. Fig. shows snapshos of a single-face racking experimen on es video where he face is periodically occluded. I confirms ha he proposed BAPF algorihm performs correcly in he presence of occlusions because of he robusness of he APF. Fig. 2 presens snapshos of a wo-face racking experimen on es video 2 which conains insances of iner-objec (i.e., inerface) occlusion. I can be observed ha he proposed BAPF algorihm can deal wih and recover from insances of iner-objec (iner-face) occlusion in he video sream. The racking performance of he proposed BAPF algorihm is compared wih ha of he Condensaion algorihm [33], which is a well-known implemenaion of a convenional paricle filerbased racker. Boh algorihms employ N = 000 paricles for face racking in es video 3. The experimenal resuls show ha he racking accuracy of he proposed BAPF algorihm is superior o ha of he Condensaion algorihm. Thus, in he conex of objec racking, he proposed BAPF algorihm can be deemed o yield beer performance han he convenional paricle filer. However, he beer performance of he proposed BAPF algorihm comes a he price of compuaional efficiency. The BAPF algorihm acually consumes more compuing resources han he relaively sraighforward Condensaion algorihm since he BAPF algorihm performs more compuaion per ieraion in order o yield more accurae non-linear esimaions of he objec sae(s). The racking accuracy of he proposed BAPF algorihm and he Condensaion algorihm can be visually compared using he snapshos shown in Figs. 3 and 4, respecively BAPF performance analysis Using racking accuracy and compuaion ime as he performance merics, we quaniaively analyze and compare he performance of he BAPF algorihm, APF algorihm, and Condensaion algorihm. The racking accuracy is defined in erms of he displacemen error beween he cenroid of a ground ruh face and he cenroid of a racked face in a video sequence. All hree aforemenioned racking algorihms employ N = 000 paricles for face racking and are esed on es video 3. In he following empirical comparison, he performance of he APF algorihm is compared o ha of he Condensaion algorihm, he performance of he BAPF algorihm is compared o ha of he APF algorihm, he performance of he APF algorihm is analyzed for differen values of he parameer L, he performance of he BAPF algorihm is analyzed for differen values of he parameer F, and he performance of he BAPF algorihm is analyzed for differen values of he parameer c. We firs compare he performance of he APF algorihm o ha of he Condensaion algorihm. Boh algorihms employ N = 000 paricles for face racking in he es video 3. In he APF algorihm, he upper bound on he number of ieraions of he loop conrolled by parameer l is L = 3. The experimenal resuls, as shown in Fig. 5 and Table, demonsrae ha he racking accuracy of he APF algorihm is superior o ha of he Condensaion algorihm. I can be seen from Table ha he mean displacemen error in he case of he APF algorihm is significanly lower han ha in he case of he Condensaion algorihm. However, he racking speed (in frames/s) of he APF algorihm is observed o be slower han ha of he Condensaion algorihm. As menioned previously, he slower racking speed of he APF algorihm can be aribued o he fac ha he APF algorihm performs more compuaion per

10 8 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) 9 27 Fig. 9. Tracking resuls in he presence of muliple facial views and ou-of-plane face roaions on es video. The yellow ellipse implies ha no occlusion has occurred, whereas he red ellipse implies ha occlusion has occurred. From op lef o boom righ, he frame numbers are 55, 59, 524, 530, 533, 544, 566, 573 and 585. Fig. 0. Tracking resuls in he presence of in-plane roaions on es video. From lef o righ, he frame numbers are 04, 35 and 52. Fig.. Tracking resuls in he presence of occlusions on es video. The yellow ellipse implies ha no occlusion has occurred, whereas he red ellipse implies ha occlusion has occurred. From op lef o boom righ, he frame numbers are 356, 359, 362, 366, 382 and 397.

11 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) Fig. 2. Tracking resuls in he presence of wo faces on es video 2. The yellow ellipse implies ha no occlusion has occurred, whereas he red ellipse implies ha occlusion has occurred. From op lef o boom righ, he frame numbers are 2, 4, 25, 38, 78 and 38. Fig. 3. Tracking resuls wih he BAPF a six differen imes in es video 3. From op lef o boom righ, he frame numbers are 2, 40, 6, 36, 58 and 80. Fig. 4. Tracking resuls wih he Condensaion algorihm a same imes as in Fig. 4. From op lef o boom righ, he frame numbers are 2, 40, 6, 36, 58 and 80.

12 20 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) APF vs. Condensaion Displacemen (pixels) APF Condensaion Frame number Fig. 5. Tracking resuls for he APF and Condensaion algorihms. Table Summary of racking resuls for he APF and Condensaion algorihms Mean displacemen error (pixels) Sandard deviaion (pixels) Speed (frames/s) ieraion han does he Condensaion algorihm in order o yield more accurae esimaes of he proposal disribuion and he poserior disribuion. In summary, he APF algorihm can be seen o offer significanly more accurae racking han he convenional paricle filer, albei a he cos of a moderae hough olerable loss in racking speed. Nex, he performance of he BAPF algorihm is compared o ha of he APF algorihm. Boh algorihms employ N = 000 paricles for face racking in es video 3. The upper bound on he number of ieraions of he loop conrolled by parameer l is L = 3 for boh algorihms. In he BAPF algorihm, he weigh assigned o he resul of AdaBoos face deecion is c = 0.8. The number of successive frames F over which he oupu of he AdaBoos face deecion algorihm is averaged before combining i wih he APF oupu is F =. The experimenal resuls, shown in Fig. 6 and Table 2, demonsrae ha he racking accuracy of he BAPF algorihm is higher han ha of he APF algorihm. I can be seen from Table ha he mean displacemen error in he case of he BAPF algorihm is significanly lower han ha in he case of he APF algorihm. However, he racking speed of he BAPF algorihm is slighly slower han ha of he APF algorihm since he BAPF algorihm runs he AdaBoos face deecion procedure on each inpu frame of he video sequence. Thus, he BAPF algorihm can be seen o significanly improve he racking accuracy of he APF algorihm, albei a he cos of a sligh decrease in racking speed compared o he APF algorihm. The performance of he APF algorihm is analyzed using differen values of he parameer L which is he upper bound on he number of ieraions of he loop conrolled by he variable l in APF Condensaion he APF algorihm (Fig. 2). The APF algorihm employs N = 000 paricles for face racking in he es video 3. The value of L is varied from L = o L = 4 in he experimens. The experimenal resuls, as shown in Fig. 7 and Table 3, demonsrae ha he racking accuracy of he APF algorihm improves as he value of L increases. I can be seen from Table 3 ha he mean displacemen error in he case of he APF algorihm decreases wih increasing values of L. Thus, he APF algorihm wih a larger value of L provides more accurae racking. However, he racking speed of he APF algorihm also decreases wih increasing values of L since he APF algorihm performs more compuaion per ieraion o esimae he poserior disribuion. From Fig. 7 and Table 3, i can be seen ha as he value of L is increased beyond 3, he accuracy of esimaion of he poserior disribuion improves only slighly whereas he compuaion ime increases significanly. In order o srike a reasonable balance beween racking accuracy and racking speed for real-ime applicaions, we choose L = 3 for all of our experimens wih he APF algorihm and he BAPF algorihm. The performance of he BAPF algorihm is also analyzed for differen values of he parameer F. In Eq. (9), we combine he resuls of he APF algorihm and he AdaBoos algorihm o obain he curren conour of he racked face in he curren frame. The AdaBoos face deecion procedure is performed for each frame of he inpu video sequence. Using he AdaBoos face deecion algorihm, we obain he esimaed posiion of a deeced face by averaging he oupu of he AdaBoos face deecion algorihm over F successive frames including he curren frame (i.e., he curren frame and F previous frames). For example, for F = 3, we average he posiions of he deeced face in he curren frame and he previous wo frames (using he AdaBoos face deecion algorihm) o obain an esimaed posiion of he face in he curren frame, which is hen combined wih he oupu of he APF algorihm using Eq. (9). In his experimen, he weigh assigned o he resul of Ada- Boos face deecion procedure in he BAPF algorihm is c = 0.8. The value of F is varied wihin a se of values S F = {, 3, 5, 0} whereas he upper bound on he number of sampling ieraions L is fixed a 3. The BAPF algorihm employs N = 000 paricles for

13 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) BAPF vs. APF Displacemen (pixels) BAPF APF Frame number Fig. 6. Tracking resuls of he BAPF algorihm and he APF algorihm. Table 2 Summary of racking resuls of he BAPF and he APF algorihms BAPF Mean displacemen error (pixels) Sandard deviaion (pixels) Speed (frames/s) APF Table 3 Summary of racking resuls for he APF algorihm for differen values of L L =4 L =3 L =2 L = Mean displacemen error (pixels) Sandard deviaion (pixels) Speed (frames/s) face racking in es video 3. The experimenal resuls in Fig. 8 and Table 4, show ha racking accuracy of he BAPF algorihm decreases wih increasing values of F. As can be seen from Table 4, he mean displacemen error in BAPF algorihm is an increasing 45 APF Displacemen (pixels) L=4 L=3 L=2 L= Frame number Fig. 7. Tracking resuls of he APF algorihm for differen values of L.

14 22 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) BAPF 20 Displacemen (pixels) F=0 F=5 F=3 F= Frame number Fig. 8. Tracking resuls of he BAPF algorihm for differen values of F. Table 4 Summary of racking resuls of he BAPF for differen values of F F =0 F =5 F =3 F = Mean displacemen error (pixels) Sandard deviaion (pixels) Speed (frames/s) funcion of F. The racking speed of he BAPF algorihm does no depend on he value of he parameer F. Hence, we choose F = for he BAPF algorihm in our experimens, i.e., we consider he resul of he AdaBoos face deecion procedure only for he curren frame before inegraing he resuls of he AdaBoos face deecion procedure and he APF algorihm. The performance of he BAPF algorihm is analyzed using differen values of he parameer c (denoed as gamma in Fig. 9), where c is a weigh assigned o he resul of he AdaBoos face deecion procedure in he BAPF algorihm as shown in Eq. (9). The weigh c is varied wihin a se of values consising of Sc = {0, 0.5, 0.8,.0}. When c = 0, he BAPF algorihm is equivalen o he pure APF algorihm. By increasing he value of c, we lay greaer emphasis on he resul of he AdaBoos face deecion procedure. When c =, he BAPF algorihm is equivalen o he pure AdaBoos face deecion algorihm. The BAPF algorihm employs N = 000 paricles for face racking in he es video 3 wih parameers F = and L = 3. The experimenal resuls in Fig. 9 and Table 5, show ha racking accuracy of he BAPF algorihm varies significanly wih differen values of he parameer c. I can be seen from Table 5 ha he mean displacemen error in he case of he BAPF algorihm wih c = 0.8 is he leas, and wih c = 0 is he highes. Thus he experimenal resuls show ha he performance of he BAPF-based racker ha simply uses he AdaBoos face deecion algorihm in every single frame is he wors. This can be aribued o he fac ha he AdaBoos face deecion algorihm by iself is prone o deecion of false faces or missing he racked faces in he video sequence. In he case of he BAPF algorihm where 0 < c <, he APF racking algorihm provides regions of ineres o he AdaBoos face deecion algorihm, whereas he AdaBoos face deecion algorihm, in urn, provides a means for iniializaion and verificaion of he oupu of he APF racking algorihm via he combinaion funcion in Eq. (9). Consequenly, on accoun of he synergeic ineracion beween he APF racking algorihm and he AdaBoos face deecion algorihm, he performance of he BAPF algorihm is superior o ha of eiher he APF algorihm or he AdaBoos algorihm used in isolaion. Table 5 shows ha he racking speed of he BAPF algorihm does no vary for differen values of c since he APF algorihm and he AdaBoos algorihm are boh performed for each frame of he inpu video sequence. In our experimens, we choose c = 0.8 for he BAPF algorihm Discussion of BAPF experimenal resuls The BAPF-based face racker was observed o be successful in racking he faces hroughou each of he es video sequences excep when he face being racked is compleely occluded for a long ime duraion, as shown in Fig. 20. In his case, he occluded face canno be disinguished from he foreground or he background using he AdaBoos face deecor. In he case where he face is occluded for a shor ime duraion, we assume ha he occluded face is saionary over he ime period of occlusion. In such an insance, he BAPF algorihm is able o recover from emporary racking failure since he AdaBoos face deecion algorihm is capable of reiniializing he APF racking algorihm. However, he above assumpion does no hold in he case of occlusion ha lass for a longer ime period, since he person wih he occluded face could well exi he scene during he ime period of occlusion. When he faces of hree people are occluded and are aligned wih he opical axis of he camera as shown in Fig. 20b, i is hard o deec and rack he faces of he wo people ha are farhes from he camera, which resuls in racking failure as well. From boh he cases shown in Fig. 20, i is clear ha from he appearance of he face alone i is no possible o reliably deduce he rue locaion of he occluded face. For more robus and accurae racking, we need o exploi furher sources of informaion such as he appearance of he body or he limbs o augmen he BAPF algorihm in order o handle cases of complee occlusion over long ime duraions. However, he augmened appearance model for accurae racking

15 W. Zheng, S.M. Bhandarkar / J. Vis. Commun. Image R. 20 (2009) BAPF 00 Displacemen (pixels) gamma=0 gamma=0.5 gamma=0.8 gamma=.0 Frame number Fig. 9. Tracking resuls of he BAPF algorihm wih differen values of he parameer c. Table 5 Summary of racking resuls of he BAPF wih differen values of he parameer c would increase he complexiy of he dynamical model, which, in urn, may reduce he robusness of he racker in oher scenarios. 6. Conclusions c =0 c = 0.5 c = 0.8 c =.0 Mean displacemen error (pixels) Sandard deviaion (pixels) Speed (frames/s) This paper proposes a novel algorihm for face deecion and racking based on a combinaion of a novel adapive paricle filering algorihm and he AdaBoos face deecion algorihm. The proposed algorihm provides a general framework for deecion and racking of faces in video sequences. The proposed framework is also applicable o he racking of oher ypes of objecs such as deformable and elasic objecs if appropriae conour models such as B-splines are used. The proposed Adapive Paricle Filer (APF) uses a new sampling echnique o obain accurae esimaes of he proposal disribuion and he poserior disribuion for improving racking accuracy in video sequences. The proposed scheme ermed as he Boosed Adapive Paricle Filer (BAPF) combines he APF wih he AdaBoos face deecion algorihm. The AdaBoos face deecion algorihm is used o deec faces in he inpu images, whereas he APF is used o rack he faces in he video sequences. The proposed BAPF algorihm is employed for face deecion, face verificaion, and face racking in video sequences. I is experimenally shown ha he performance of face deecion and face racking can be muually improved via synergeic ineracion in he proposed BAPF scheme resuling in a racker ha is boh accurae and compuaionally efficien. Fig. 20. (a) Tracking failure in case of occlusion for a long ime duraion. (b) Tracking failure in case of hree people overlapping.

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto Visual Percepion as Bayesian Inference David J Flee Universiy of Torono Basic rules of probabiliy sum rule (for muually exclusive a ): produc rule (condiioning): independence (def n ): Bayes rule: marginalizaion:

More information

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS Soumya Hamlaoui & Franck Davoine HEUDIASYC Mixed Research Uni, CNRS / Compiègne Universiy of Technology BP 20529, 60205 Compiègne

More information

Learning in Games via Opponent Strategy Estimation and Policy Search

Learning in Games via Opponent Strategy Estimation and Policy Search Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen

More information

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report) Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,

More information

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same

More information

Probabilistic Detection and Tracking of Motion Discontinuities

Probabilistic Detection and Tracking of Motion Discontinuities Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/

More information

Real Time Integral-Based Structural Health Monitoring

Real Time Integral-Based Structural Health Monitoring Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy

More information

Improved TLD Algorithm for Face Tracking

Improved TLD Algorithm for Face Tracking Absrac Improved TLD Algorihm for Face Tracking Huimin Li a, Chaojing Yu b and Jing Chen c Chongqing Universiy of Poss and Telecommunicaions, Chongqing 400065, China a li.huimin666@163.com, b 15023299065@163.com,

More information

STEREO PLANE MATCHING TECHNIQUE

STEREO PLANE MATCHING TECHNIQUE STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo

More information

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates A Fas Sereo-Based Muli-Person Tracking using an Approximaed Likelihood Map for Overlapping Silhouee Templaes Junji Saake Jun Miura Deparmen of Compuer Science and Engineering Toyohashi Universiy of Technology

More information

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking Muliple View Discriminaive ing wih IMCMC for Disribued Tracking Sanhoshkumar Sunderrajan, B.S. Manjunah Deparmen of Elecrical and Compuer Engineering Universiy of California, Sana Barbara {sanhosh,manj}@ece.ucsb.edu

More information

Tracking Deforming Objects Using Particle Filtering for Geometric Active Contours

Tracking Deforming Objects Using Particle Filtering for Geometric Active Contours 1470 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 29, NO. 8, AUGUST 2007 Tracking Deforming Objecs Using Paricle Filering for Geomeric Acive Conours Yogesh Rahi, Member, IEEE, NamraaVaswani,

More information

Tracking Appearances with Occlusions

Tracking Appearances with Occlusions Tracking ppearances wih Occlusions Ying Wu, Ting Yu, Gang Hua Deparmen of Elecrical & Compuer Engineering Norhwesern Universiy 2145 Sheridan oad, Evanson, IL 60208 {yingwu,ingyu,ganghua}@ece.nwu.edu bsrac

More information

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly MITSUBISHI ELECTRIC RESEARCH LABORATORIES hp://www.merl.com Rao-Blackwellized Paricle Filering for Probing-Based 6-DOF Localizaion in Roboic Assembly Yuichi Taguchi, Tim Marks, Haruhisa Okuda TR1-8 June

More information

Real-time 2D Video/3D LiDAR Registration

Real-time 2D Video/3D LiDAR Registration Real-ime 2D Video/3D LiDAR Regisraion C. Bodenseiner Fraunhofer IOSB chrisoph.bodenseiner@iosb.fraunhofer.de M. Arens Fraunhofer IOSB michael.arens@iosb.fraunhofer.de Absrac Progress in LiDAR scanning

More information

Robust Visual Tracking for Multiple Targets

Robust Visual Tracking for Multiple Targets Robus Visual Tracking for Muliple Targes Yizheng Cai, Nando de Freias, and James J. Lile Universiy of Briish Columbia, Vancouver, B.C., Canada, V6T 1Z4 {yizhengc, nando, lile}@cs.ubc.ca Absrac. We address

More information

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz

More information

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction 58 IJCSNS Inernaional Journal of Compuer Science and Nework Securiy, VOL.6 No.10, Ocober 006 Occlusion-Free Hand Moion Tracking by Muliple Cameras and Paricle Filering wih Predicion Makoo Kao, and Gang

More information

Robust Multi-view Face Detection Using Error Correcting Output Codes

Robust Multi-view Face Detection Using Error Correcting Output Codes Robus Muli-view Face Deecion Using Error Correcing Oupu Codes Hongming Zhang,2, Wen GaoP P, Xilin Chen 2, Shiguang Shan 2, and Debin Zhao Deparmen of Compuer Science and Engineering, Harbin Insiue of Technolog

More information

Nonparametric CUSUM Charts for Process Variability

Nonparametric CUSUM Charts for Process Variability Journal of Academia and Indusrial Research (JAIR) Volume 3, Issue June 4 53 REEARCH ARTICLE IN: 78-53 Nonparameric CUUM Chars for Process Variabiliy D.M. Zombade and V.B. Ghue * Dep. of aisics, Walchand

More information

The Impact of Product Development on the Lifecycle of Defects

The Impact of Product Development on the Lifecycle of Defects The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes

More information

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15)

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15) IROS 2015 Workshop on On-line decision-making in muli-robo coordinaion () OPTIMIZATION-BASED COOPERATIVE MULTI-ROBOT TARGET TRACKING WITH REASONING ABOUT OCCLUSIONS KAROL HAUSMAN a,, GREGORY KAHN b, SACHIN

More information

Video-Based Face Recognition Using Probabilistic Appearance Manifolds

Video-Based Face Recognition Using Probabilistic Appearance Manifolds Video-Based Face Recogniion Using Probabilisic Appearance Manifolds Kuang-Chih Lee Jeffrey Ho Ming-Hsuan Yang David Kriegman klee10@uiuc.edu jho@cs.ucsd.edu myang@honda-ri.com kriegman@cs.ucsd.edu Compuer

More information

A Face Detection Method Based on Skin Color Model

A Face Detection Method Based on Skin Color Model A Face Deecion Mehod Based on Skin Color Model Dazhi Zhang Boying Wu Jiebao Sun Qinglei Liao Deparmen of Mahemaics Harbin Insiue of Technology Harbin China 150000 Zhang_dz@163.com mahwby@hi.edu.cn sunjiebao@om.com

More information

An Improved Square-Root Nyquist Shaping Filter

An Improved Square-Root Nyquist Shaping Filter An Improved Square-Roo Nyquis Shaping Filer fred harris San Diego Sae Universiy fred.harris@sdsu.edu Sridhar Seshagiri San Diego Sae Universiy Seshigar.@engineering.sdsu.edu Chris Dick Xilinx Corp. chris.dick@xilinx.com

More information

Coded Caching with Multiple File Requests

Coded Caching with Multiple File Requests Coded Caching wih Muliple File Requess Yi-Peng Wei Sennur Ulukus Deparmen of Elecrical and Compuer Engineering Universiy of Maryland College Park, MD 20742 ypwei@umd.edu ulukus@umd.edu Absrac We sudy a

More information

Upper Body Tracking for Human-Machine Interaction with a Moving Camera

Upper Body Tracking for Human-Machine Interaction with a Moving Camera The 2009 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems Ocober -5, 2009 S. Louis, USA Upper Body Tracking for Human-Machine Ineracion wih a Moving Camera Yi-Ru Chen, Cheng-Ming Huang, and

More information

A Matching Algorithm for Content-Based Image Retrieval

A Matching Algorithm for Content-Based Image Retrieval A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using

More information

THE goal of this work is to develop statistical models for

THE goal of this work is to develop statistical models for IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 32, NO. 4, APRIL 2010 579 Nonsaionary Shape Aciviies: Dynamic Models for Landmark Shape Change and Applicaions Samarji Das, Suden Member,

More information

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES Arun Kumar H. D. 1 and Prabhakar C. J. 2 1 Deparmen of Compuer Science, Kuvempu Universiy, Shimoga, India ABSTRACT

More information

Reinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi

Reinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi Reinforcemen Learning by Policy Improvemen Making Use of Experiences of The Oher Tasks Hajime Kimura and Shigenobu Kobayashi Tokyo Insiue of Technology, JAPAN genfe.dis.iech.ac.jp, kobayasidis.iech.ac.jp

More information

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 Adapive Appearance Model and Condensaion Algorihm for Robus Face Tracking Yui Man Lui, Suden Member,, J. Ross Beveridge, Member,,

More information

Video Content Description Using Fuzzy Spatio-Temporal Relations

Video Content Description Using Fuzzy Spatio-Temporal Relations Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer

More information

IntentSearch:Capturing User Intention for One-Click Internet Image Search

IntentSearch:Capturing User Intention for One-Click Internet Image Search JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO. 1, JANUARY 2010 1 InenSearch:Capuring User Inenion for One-Click Inerne Image Search Xiaoou Tang, Fellow, IEEE, Ke Liu, Jingyu Cui, Suden Member, IEEE, Fang

More information

Mobile Robots Mapping

Mobile Robots Mapping Mobile Robos Mapping 1 Roboics is Easy conrol behavior percepion modelling domain model environmen model informaion exracion raw daa planning ask cogniion reasoning pah planning navigaion pah execuion

More information

Network management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional);

Network management and QoS provisioning - QoS in Frame Relay. . packet switching with virtual circuit service (virtual circuits are bidirectional); QoS in Frame Relay Frame relay characerisics are:. packe swiching wih virual circui service (virual circuis are bidirecional);. labels are called DLCI (Daa Link Connecion Idenifier);. for connecion is

More information

Definition and examples of time series

Definition and examples of time series Definiion and examples of ime series A ime series is a sequence of daa poins being recorded a specific imes. Formally, le,,p be a probabiliy space, and T an index se. A real valued sochasic process is

More information

Real time 3D face and facial feature tracking

Real time 3D face and facial feature tracking J Real-Time Image Proc (2007) 2:35 44 DOI 10.1007/s11554-007-0032-2 ORIGINAL RESEARCH PAPER Real ime 3D face and facial feaure racking Fadi Dornaika Æ Javier Orozco Received: 23 November 2006 / Acceped:

More information

M(t)/M/1 Queueing System with Sinusoidal Arrival Rate

M(t)/M/1 Queueing System with Sinusoidal Arrival Rate 20 TUTA/IOE/PCU Journal of he Insiue of Engineering, 205, (): 20-27 TUTA/IOE/PCU Prined in Nepal M()/M/ Queueing Sysem wih Sinusoidal Arrival Rae A.P. Pan, R.P. Ghimire 2 Deparmen of Mahemaics, Tri-Chandra

More information

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Viewpoin Invarian 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Chaohui Wang 1,2, Yun Zeng 3, Loic Simon 1, Ioannis Kakadiaris 4, Dimiris Samaras 3, Nikos Paragios 1,2

More information

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding Indian Journal of Science and Technology, Vol 8(21), DOI: 10.17485/ijs/2015/v8i21/69958, Sepember 2015 ISSN (Prin) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Various Types of Bugs in he Objec Oriened

More information

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors We are InechOpen, he world s leading publisher of Open Access books Buil by scieniss, for scieniss 4,000 116,000 120M Open access books available Inernaional auhors and ediors Downloads Our auhors are

More information

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many

More information

Evaluation and Improvement of Region-based Motion Segmentation

Evaluation and Improvement of Region-based Motion Segmentation Evaluaion and Improvemen of Region-based Moion Segmenaion Mark Ross Universiy Koblenz-Landau, Insiue of Compuaional Visualisics, Universiässraße 1, 56070 Koblenz, Germany Email: ross@uni-koblenz.de Absrac

More information

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,

More information

A High-Speed Adaptive Multi-Module Structured Light Scanner

A High-Speed Adaptive Multi-Module Structured Light Scanner A High-Speed Adapive Muli-Module Srucured Ligh Scanner Andreas Griesser 1 Luc Van Gool 1,2 1 Swiss Fed.Ins.of Techn.(ETH) 2 Kaholieke Univ. Leuven D-ITET/Compuer Vision Lab ESAT/VISICS Zürich, Swizerland

More information

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs)

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs) 2016 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems (IROS) Daejeon Convenion Cener Ocober 9-14, 2016, Daejeon, Korea Muli-Targe Deecion and Tracking from a Single Camera in Unmanned Aerial

More information

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding Moving Objec Deecion Using MRF Model and Enropy based Adapive Thresholding Badri Narayan Subudhi, Pradipa Kumar Nanda and Ashish Ghosh Machine Inelligence Uni, Indian Saisical Insiue, Kolkaa, 700108, India,

More information

CENG 477 Introduction to Computer Graphics. Modeling Transformations

CENG 477 Introduction to Computer Graphics. Modeling Transformations CENG 477 Inroducion o Compuer Graphics Modeling Transformaions Modeling Transformaions Model coordinaes o World coordinaes: Model coordinaes: All shapes wih heir local coordinaes and sies. world World

More information

An Iterative Scheme for Motion-Based Scene Segmentation

An Iterative Scheme for Motion-Based Scene Segmentation An Ieraive Scheme for Moion-Based Scene Segmenaion Alexander Bachmann and Hildegard Kuehne Deparmen for Measuremen and Conrol Insiue for Anhropomaics Universiy of Karlsruhe (H), 76 131 Karlsruhe, Germany

More information

Simultaneous Localization and Mapping with Stereo Vision

Simultaneous Localization and Mapping with Stereo Vision Simulaneous Localizaion and Mapping wih Sereo Vision Mahew N. Dailey Compuer Science and Informaion Managemen Asian Insiue of Technology Pahumhani, Thailand Email: mdailey@ai.ac.h Manukid Parnichkun Mecharonics

More information

Robust 3D Visual Tracking Using Particle Filtering on the SE(3) Group

Robust 3D Visual Tracking Using Particle Filtering on the SE(3) Group Robus 3D Visual Tracking Using Paricle Filering on he SE(3) Group Changhyun Choi and Henrik I. Chrisensen Roboics & Inelligen Machines, College of Compuing Georgia Insiue of Technology Alana, GA 3332,

More information

Improving the Efficiency of Dynamic Service Provisioning in Transport Networks with Scheduled Services

Improving the Efficiency of Dynamic Service Provisioning in Transport Networks with Scheduled Services Improving he Efficiency of Dynamic Service Provisioning in Transpor Neworks wih Scheduled Services Ralf Hülsermann, Monika Jäger and Andreas Gladisch Technologiezenrum, T-Sysems, Goslarer Ufer 35, D-1585

More information

FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS

FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS FIELD PROGRAMMABLE GATE ARRAY (FPGA) AS A NEW APPROACH TO IMPLEMENT THE CHAOTIC GENERATORS Mohammed A. Aseeri and M. I. Sobhy Deparmen of Elecronics, The Universiy of Ken a Canerbury Canerbury, Ken, CT2

More information

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game Deecion Tracking and Recogniion of Human Poses for a Real Time Spaial Game Feifei Huo, Emile A. Hendriks, A.H.J. Oomes Delf Universiy of Technology The Neherlands f.huo@udelf.nl Pascal van Beek, Remco

More information

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter Robo localizaion under percepual aliasing condiions based on laser refleciviy using paricle filer DongXiang Zhang, Ryo Kurazume, Yumi Iwashia, Tsuomu Hasegawa Absrac Global localizaion, which deermines

More information

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies RESEARCH Open Access Muli-camera muli-objec voxel-based Mone Carlo 3D racking sraegies Crisian Canon-Ferrer *, Josep R Casas, Monse Pardàs and Enric Mone Absrac This aricle presens a new approach o he

More information

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions A Hierarchical Objec Recogniion Sysem Based on Muli-scale Principal Curvaure Regions Wei Zhang, Hongli Deng, Thomas G Dieerich and Eric N Morensen School of Elecrical Engineering and Compuer Science Oregon

More information

Visual Indoor Localization with a Floor-Plan Map

Visual Indoor Localization with a Floor-Plan Map Visual Indoor Localizaion wih a Floor-Plan Map Hang Chu Dep. of ECE Cornell Universiy Ihaca, NY 14850 hc772@cornell.edu Absrac In his repor, a indoor localizaion mehod is presened. The mehod akes firsperson

More information

Dynamic Route Planning and Obstacle Avoidance Model for Unmanned Aerial Vehicles

Dynamic Route Planning and Obstacle Avoidance Model for Unmanned Aerial Vehicles Volume 116 No. 24 2017, 315-329 ISSN: 1311-8080 (prined version); ISSN: 1314-3395 (on-line version) url: hp://www.ijpam.eu ijpam.eu Dynamic Roue Planning and Obsacle Avoidance Model for Unmanned Aerial

More information

Wheelchair-user Detection Combined with Parts-based Tracking

Wheelchair-user Detection Combined with Parts-based Tracking Wheelchair-user Deecion Combined wih Pars-based Tracking Ukyo Tanikawa 1, Yasuomo Kawanishi 1, Daisuke Deguchi 2,IchiroIde 1, Hiroshi Murase 1 and Ryo Kawai 3 1 Graduae School of Informaion Science, Nagoya

More information

A METHOD OF MODELING DEFORMATION OF AN OBJECT EMPLOYING SURROUNDING VIDEO CAMERAS

A METHOD OF MODELING DEFORMATION OF AN OBJECT EMPLOYING SURROUNDING VIDEO CAMERAS A METHOD OF MODELING DEFORMATION OF AN OBJECT EMLOYING SURROUNDING IDEO CAMERAS Joo Kooi TAN, Seiji ISHIKAWA Deparmen of Mechanical and Conrol Engineering Kushu Insiue of Technolog, Japan ehelan@is.cnl.kuech.ac.jp,

More information

DAGM 2011 Tutorial on Convex Optimization for Computer Vision

DAGM 2011 Tutorial on Convex Optimization for Computer Vision DAGM 2011 Tuorial on Convex Opimizaion for Compuer Vision Par 3: Convex Soluions for Sereo and Opical Flow Daniel Cremers Compuer Vision Group Technical Universiy of Munich Graz Universiy of Technology

More information

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image

More information

Detection and segmentation of moving objects in highly dynamic scenes

Detection and segmentation of moving objects in highly dynamic scenes Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France

More information

Image Based Computer-Aided Manufacturing Technology

Image Based Computer-Aided Manufacturing Technology Sensors & Transducers 03 by IFSA hp://www.sensorsporal.com Image Based Compuer-Aided Manufacuring Technology Zhanqi HU Xiaoqin ZHANG Jinze LI Wei LI College of Mechanical Engineering Yanshan Universiy

More information

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild IEEE TRANSACTIONS ON IMAGE PROCESSING, DRAFT 1 Robus LSTM-Auoencoders for Face De-Occlusion in he Wild Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan arxiv:1612.08534v1 [cs.cv] 27 Dec 2016

More information

Gender Classification of Faces Using Adaboost*

Gender Classification of Faces Using Adaboost* Gender Classificaion of Faces Using Adaboos* Rodrigo Verschae 1,2,3, Javier Ruiz-del-Solar 1,2, and Mauricio Correa 1,2 1 Deparmen of Elecrical Engineering, Universidad de Chile 2 Cener for Web Research,

More information

4.1 3D GEOMETRIC TRANSFORMATIONS

4.1 3D GEOMETRIC TRANSFORMATIONS MODULE IV MCA - 3 COMPUTER GRAPHICS ADMN 29- Dep. of Compuer Science And Applicaions, SJCET, Palai 94 4. 3D GEOMETRIC TRANSFORMATIONS Mehods for geomeric ransformaions and objec modeling in hree dimensions

More information

TrackNet: Simultaneous Detection and Tracking of Multiple Objects

TrackNet: Simultaneous Detection and Tracking of Multiple Objects TrackNe: Simulaneous Deecion and Tracking of Muliple Objecs Chenge Li New York Universiy cl2840@nyu.edu Gregory Dobler New York Universiy greg.dobler@nyu.edu Yilin Song New York Universiy ys1297@nyu.edu

More information

Gauss-Jordan Algorithm

Gauss-Jordan Algorithm Gauss-Jordan Algorihm The Gauss-Jordan algorihm is a sep by sep procedure for solving a sysem of linear equaions which may conain any number of variables and any number of equaions. The algorihm is carried

More information

A Framework for Applying Point Clouds Grabbed by Multi-Beam LIDAR in Perceiving the Driving Environment

A Framework for Applying Point Clouds Grabbed by Multi-Beam LIDAR in Perceiving the Driving Environment Sensors 215, 15, 21931-21956; doi:1.339/s15921931 Aricle OPEN ACCESS sensors ISSN 1424-822 www.mdpi.com/journal/sensors A Framewor for Applying Poin Clouds Grabbed by Muli-Beam LIDAR in Perceiving he Driving

More information

(Structural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS

(Structural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS (Srucural Time Series Models for Describing Trend in All India Sunflower Yield Using SAS Himadri Ghosh, Prajneshu and Savia Wadhwa I.A.S.R.I., Library Avenue, New Delhi-110 01 him_adri@iasri.res.in, prajnesh@iasri.res.in,

More information

Deformable Parts Correlation Filters for Robust Visual Tracking

Deformable Parts Correlation Filters for Robust Visual Tracking PAPER UNDER REVISION Deformable Pars Correlaion Filers for Robus Visual Tracking Alan Lukežič, Luka Čehovin, Member, IEEE, and Maej Krisan, Member, IEEE ha par-based models should be considered in a layered

More information

A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER

A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER A GRAPHICS PROCESSING UNIT IMPLEMENTATION OF THE PARTICLE FILTER ABSTRACT Modern graphics cards for compuers, and especially heir graphics processing unis (GPUs), are designed for fas rendering of graphics.

More information

Time Expression Recognition Using a Constituent-based Tagging Scheme

Time Expression Recognition Using a Constituent-based Tagging Scheme Track: Web Conen Analysis, Semanics and Knowledge Time Expression Recogniion Using a Consiuen-based Tagging Scheme Xiaoshi Zhong and Erik Cambria School of Compuer Science and Engineering Nanyang Technological

More information

EECS 487: Interactive Computer Graphics

EECS 487: Interactive Computer Graphics EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified

More information

Weighted Voting in 3D Random Forest Segmentation

Weighted Voting in 3D Random Forest Segmentation Weighed Voing in 3D Random Fores Segmenaion M. Yaqub,, P. Mahon 3, M. K. Javaid, C. Cooper, J. A. Noble NDORMS, Universiy of Oxford, IBME, Deparmen of Engineering Science, Universiy of Oxford, 3 MRC Epidemiology

More information

Image Content Representation

Image Content Representation Image Conen Represenaion Represenaion for curves and shapes regions relaionships beween regions E.G.M. Perakis Image Represenaion & Recogniion 1 Reliable Represenaion Uniqueness: mus uniquely specify an

More information

Michiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters

Michiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters SHORT TERM PREDICTIONS A MONITORING SYSTEM by Michiel Helder and Marielle C.T.A Geurs Hoofdkanoor PTT Pos / Duch Posal Services Headquarers Keywords macro ime series shor erm predicions ARIMA-models faciliy

More information

COSC 3213: Computer Networks I Chapter 6 Handout # 7

COSC 3213: Computer Networks I Chapter 6 Handout # 7 COSC 3213: Compuer Neworks I Chaper 6 Handou # 7 Insrucor: Dr. Marvin Mandelbaum Deparmen of Compuer Science York Universiy F05 Secion A Medium Access Conrol (MAC) Topics: 1. Muliple Access Communicaions:

More information

LAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation

LAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation Submission o Special Issue of CVIU on Model-based and Image-based 3D Scene Represenaion for Ineracive Visualizaion LAMP: 3D Layered, Adapive-resoluion and Muliperspecive Panorama - a New Scene Represenaion

More information

Optimal Crane Scheduling

Optimal Crane Scheduling Opimal Crane Scheduling Samid Hoda, John Hooker Laife Genc Kaya, Ben Peerson Carnegie Mellon Universiy Iiro Harjunkoski ABB Corporae Research EWO - 13 November 2007 1/16 Problem Track-mouned cranes move

More information

SENSING using 3D technologies, structured light cameras

SENSING using 3D technologies, structured light cameras IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 39, NO. 10, OCTOBER 2017 2045 Real-Time Enhancemen of Dynamic Deph Videos wih Non-Rigid Deformaions Kassem Al Ismaeil, Suden Member,

More information

Why not experiment with the system itself? Ways to study a system System. Application areas. Different kinds of systems

Why not experiment with the system itself? Ways to study a system System. Application areas. Different kinds of systems Simulaion Wha is simulaion? Simple synonym: imiaion We are ineresed in sudying a Insead of experimening wih he iself we experimen wih a model of he Experimen wih he Acual Ways o sudy a Sysem Experimen

More information

Track and Cut: simultaneous tracking and segmentation of multiple objects with graph cuts

Track and Cut: simultaneous tracking and segmentation of multiple objects with graph cuts INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Track and Cu: simulaneous racking and segmenaion of muliple objecs wih graph cus Aurélie Bugeau Parick Pérez N 6337 Ocober 2007 Thèmes COM

More information

Improving Occupancy Grid FastSLAM by Integrating Navigation Sensors

Improving Occupancy Grid FastSLAM by Integrating Navigation Sensors Improving Occupancy Grid FasSLAM by Inegraing Navigaion Sensors Chrisopher Weyers Sensors Direcorae Air Force Research Laboraory Wrigh-Paerson AFB, OH 45433 Gilber Peerson Deparmen of Elecrical and Compuer

More information

In Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12].

In Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12]. In Proceedings of CVPR '96 Srucure and Moion of Curved 3D Objecs from Monocular Silhouees B Vijayakumar David J Kriegman Dep of Elecrical Engineering Yale Universiy New Haven, CT 652-8267 Jean Ponce Compuer

More information

Terrain Based GPS Independent Lane-Level Vehicle Localization using Particle Filter and Dead Reckoning

Terrain Based GPS Independent Lane-Level Vehicle Localization using Particle Filter and Dead Reckoning Terrain Based GPS Independen -Level Vehicle Localizaion using Paricle Filer and Dead Reckoning Hamad Ahmed and Muhammad Tahir Deparmen of Elecrical Engineering, Lahore Universiy of Managemen Sciences,

More information

Tracking a Large Number of Objects from Multiple Views

Tracking a Large Number of Objects from Multiple Views Tracking a Large Number of Objecs from Muliple Views Zheng Wu 1, Nickolay I. Hrisov 2, Tyson L. Hedrick 3, Thomas H. Kun 2, Margri Beke 1 1 Deparmen of Compuer Science, Boson Universiy 2 Deparmen of Biology,

More information

SLAM in Large Indoor Environments with Low-Cost, Noisy, and Sparse Sonars

SLAM in Large Indoor Environments with Low-Cost, Noisy, and Sparse Sonars SLAM in Large Indoor Environmens wih Low-Cos, Noisy, and Sparse Sonars Teddy N. Yap, Jr. and Chrisian R. Shelon Deparmen of Compuer Science and Engineering Universiy of California, Riverside, CA 92521,

More information

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure SUBMITTED TO TRANSACTION ON IMAGE PROCESSING 1 Online Temporally Consisen Indoor Deph Video Enhancemen via Saic Srucure Lu Sheng, Suden Member, IEEE, King Ngi Ngan, Fellow, IEEE, Chern-Loon Lim and Songnan

More information

Precise Voronoi Cell Extraction of Free-form Rational Planar Closed Curves

Precise Voronoi Cell Extraction of Free-form Rational Planar Closed Curves Precise Voronoi Cell Exracion of Free-form Raional Planar Closed Curves Iddo Hanniel, Ramanahan Muhuganapahy, Gershon Elber Deparmen of Compuer Science Technion, Israel Insiue of Technology Haifa 32000,

More information

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008 MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o

More information

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves AML7 CAD LECTURE Space Curves Inrinsic properies Synheic curves A curve which may pass hrough any region of hreedimensional space, as conrased o a plane curve which mus lie on a single plane. Space curves

More information

MOTION TRACKING is a fundamental capability that

MOTION TRACKING is a fundamental capability that TECHNICAL REPORT CRES-05-008, CENTER FOR ROBOTICS AND EMBEDDED SYSTEMS, UNIVERSITY OF SOUTHERN CALIFORNIA 1 Real-ime Moion Tracking from a Mobile Robo Boyoon Jung, Suden Member, IEEE, Gaurav S. Sukhame,

More information

Robust Segmentation and Tracking of Colored Objects in Video

Robust Segmentation and Tracking of Colored Objects in Video IEEE TRANSACTIONS ON CSVT, VOL. 4, NO. 6, 2004 Robus Segmenaion and Tracking of Colored Objecs in Video Theo Gevers, member, IEEE Absrac Segmening and racking of objecs in video is of grea imporance for

More information

Sam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes.

Sam knows that his MP3 player has 40% of its battery life left and that the battery charges by an additional 12 percentage points every 15 minutes. 8.F Baery Charging Task Sam wans o ake his MP3 player and his video game player on a car rip. An hour before hey plan o leave, he realized ha he forgo o charge he baeries las nigh. A ha poin, he plugged

More information

Tracking a Large Number of Objects from Multiple Views

Tracking a Large Number of Objects from Multiple Views Boson Universiy Compuer Science Deparmen Technical Repor BUCS-TR 2009-005 Tracking a Large Number of Objecs from Muliple Views Zheng Wu 1, Nickolay I. Hrisov 2, Tyson L. Hedrick 3, Thomas H. Kun 2, Margri

More information

A Novel Approach for Monocular 3D Object Tracking in Cluttered Environment

A Novel Approach for Monocular 3D Object Tracking in Cluttered Environment Inernaional Journal of Compuaional Inelligence Research ISSN 0973-1873 Volume 13, Number 5 (2017), pp. 851-864 Research India Publicaions hp://www.ripublicaion.com A Novel Approach for Monocular 3D Objec

More information