Detection of salient objects with focused attention based on spatial and temporal coherence

Size: px
Start display at page:

Download "Detection of salient objects with focused attention based on spatial and temporal coherence"

Transcription

1 ricle Informaion Processing Technology pril 2011 Vol.56 No.10: doi: /s SPECIL TOPICS: Deecion of salien objecs wih focused aenion based on spaial and emporal coherence WU Yang 1, ZHENG NanNing 1, YUN ZeJian 1, JING HuaiZu 1 & LIU Tie 2 1 Insiue of rificial Inelligence and Roboics, Xi an Jiaoong Universiy, Xi an , China; 2 IBM Research-China, Beijing , China Received Ocober 8, 2010; acceped November 8, 2010 The undersanding and analysis of video conen are fundamenally imporan for numerous applicaions, including video summarizaion, rerieval, navigaion, and ediing. n imporan par of his process is o deec salien (which usually means imporan and ineresing) objecs in video segmens. Unlike exising approaches, we propose a mehod ha combines he saliency measuremen wih spaial and emporal coherence. The inegraion of spaial and emporal coherence is inspired by he focused aenion in human vision. In he proposed mehod, he spaial coherence of low-level visual grouping cues (e.g. appearance and moion) helps per-frame objec-background separaion, while he emporal coherence of he objec properies (e.g. shape and appearance) ensures consisen objec localizaion over ime, and hus he mehod is robus o unexpeced environmen changes and camera vibraions. Having developed an efficien opimizaion sraegy based on coarse-o-fine muli-scale dynamic programming, we evaluae our mehod using a challenging daase ha is freely available ogeher wih his paper. We show he effeciveness and complemenariness of he wo ypes of coherence, and demonsrae ha hey can significanly improve he performance of salien objec deecion in videos. visual aenion, focused aenion, salien objec deecion, spaial and emporal coherence Ciaion: Wu Y, Zheng N N, Yuan Z J, e al. Deecion of salien objecs wih focused aenion based on spaial and emporal coherence. Chinese Sci Bull, 2011, 56: , doi: /s The rapid developmen of neworks and sorage devices has grealy encouraged he capure, spread and sharing of large quaniies of video daa. However, because hese exensive videos ofen conain unimporan or unineresing conen, searching for a desired video segmen in a large daabase becomes a very difficul and ime-consuming ask. To solve his problem, a plausible approach is o model visual aenion of human vision, as i can filer ou unwaned informaion in a scene [1]. In he lieraure, visual aenion has been widely modeled as a visual saliency esimaion problem [2 5], which has been exensively invesigaed using boh saic images and video segmens. Insead of disregarding he conen when dealing wih he generic visual saliency esimaion Corresponding auhors ( wuyang0321@gmail.com; nnzheng@mail.xju.edu.cn) problem in video [5], we focus on deecing a salien objec in video segmens. Video segmens wih a paricular focus are found in many kinds of videos, and research on hese has very imporan applicaions such as video summarizaion, rerieval, navigaion and ediing. s argued by cogniive scienis Sephen Palmer in his book [6], here are wo ypes of visual aenion: disribued aenion and focused aenion. The former relaes o visual signal processing ha occurs when subjecs are prepared for he arge o appear in any locaion, while he laer occurs when hey have seleced a single percepual objec. For he problem of deecing a salien objec in a video clip, focused aenion of he human vision sysem should play a criical role. Therefore, he percepive properies of focused aenion become our inspiraion for solving he problem. Mos of he relaed works focus on he inegraion of The uhor(s) This aricle is published wih open access a Springerlink.com csb.scichina.com

2 1056 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No.10 various boom-up cues for esimaing saliency [7 9], suppored by feaure inegraion heory [10]. Usually such cues are represened by conras-based feaures, including boh low-level measures of conras (e.g. pixel-wise color, gradien conras and regional appearance conras) and highlevel measures (e.g. objecness [11]). When human eyefixaion records or hand-labeled bounding boxes of salien regions are available for model learning [3 5], hese op-down priors can be used o rain a saliency model, whose properies depend largely on he raining daa. n exreme case is o rain class-specific saliency esimaors ha can be used o deec ineresing objec classes [12]. There are also some sudies on he combinaion of boom-up and op-down models for beer deecion performance [5,13]. Irrespecive of which cues or priors have been used, mos of hese effors are aimed a generaing a saliency map, wihou explicily modeling he coherence of he resuls, eiher spaially or emporally. lhough spaial and emporal informaion can somehow be represened by regional conras, we argue ha coherence is no equivalen o regional conras, because i emphasizes he absolue consisency of he objec iself, and no he relaive differences agains he changing background. Spaial coherence has been used widely in image and moion segmenaion [14,15], while emporal coherence plays a criical role in objec racking [16]. Though boh of hese have also been used for moving objec deecion [17,18], hey are usually modeled separaely. s far as we are aware, here is no prior work ha uses boh of hese facors o deec salien objecs in videos. However, when humans focus heir aenion on an objec, i means ha he objec remains somewhere consisenly in he spaial and emporal space. Therefore, spaial and emporal coherence are indispensable properies for deecing he objec. This paper proposes a novel mehod ha combines spaial and emporal coherence sysemaically wih he saliency measuremen for salien objec deecion. We presen encouraging resuls ha demonsrae he imporance and complemenariness of hese wo ypes of coherence. 1 Problem formulaion We formulae salien objec deecion as a binary labeling problem, where he salien objec is represened as a binary mask { = ax } for each frame I, {1,,T} in a video segmen I=I 1:T. In he mask, a x {0,1} is a binary label for each pixel x in frame I and T is he number of frames in he video segmen. More concreely, a x = 1 indicaes ha pixel x belongs o he salien objec, while a x = 0 means he opposie. Following he widely used condiional random field model [19], he probabiliy of he predicion = 1:T can be modeled as a condiional disribuion: 1 P( I ) = P( 1: T I1: T) = exp ( E( 1: T I1: T), (1) Z where E( ) is an energy funcion and Z is he pariion funcion. Therefore, he opimal soluion for he labeling 1:T based on he maximum a poseriori esimaion can be obained by minimizing E( 1:T I 1:T ). If he Markov propery holds for he daa, he opimizaion can be wrien as: ( ) = arg min E( I ) = arg min E I,. (2) 1: 1:, 1 1 T T 1: T 1: T The energy funcion of he binary mask of each frame E( I, 1, 1 ) can furher be decomposed ino he following hree erms: E( I, 1, 1) = S(, I, M) + α CS(, I, M) Saliency T 1 1 Temporal coherence Spaial coherence + β C (,, I, I ), ( 3) where M is he opical flow field from frame I 1 o frame I, while α and β are he balancing weighs for he spaial and emporal coherence, respecively. Noe ha eq. (3) is he general form of he global energy funcion based on boh he saic image and dynamic moion informaion, where he opical flow field M and he saliency map S(,I,M ) can be obained using any mehod. In fac, hese compuaions are currenly wo acive research opics [5,20] wih new approaches consanly being proposed despie he exisence of many soluions. In our former work [4], we proposed he hree differen ypes of feaures defined below, o compue he saliency maps of saic images. (i) Local muli-scale conras. Given an image I, a Gaussian image pyramid of L levels is compued, and hen he muli-scale conras feaure is defined as 1) L l l ( x ) ( x) ( x ) f, I I I, c l= 1 x N ( x) where I l is he lh level image in he pyramid and N ( x) is he neighborhood of pixel x. Typically, L=6 and N ( x) is a 9 9 window. (ii) Regional cener-surround hisogram. Suppose he salien objec is bounded by a recangle R, we consruc a surrounding conour R S wih he same area as R. Given a pixel x, we find he mos disinc recangle R (x) cenered a x, from he candidaes wih differen sizes and aspec raios: ( ) ( x) ( ( ) S ( )) 2 R = χ R R R 2 (4) x arg max x, x. (5) 1) The proporion sign is necessary because he definiion is no normalized. fer normalizaion, he righ hand side is he desired feaure. The oher wo feaures are designed similarly.

3 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No The size of R varies wihin [0.1,0.7] min (w,h), where w,h are he image widh and heigh, respecively. The aspec raio is one of he five values {0.5, 0.75, 1.0, 1.5, 2.0}. The χ 2 disance measures he dissimilariy of he wo RGB color hisograms of he cener and surrounding recangles. The cener-surround hisogram feaure is defined as a weighed sum of he χ 2 disance a neighboring pixels: ( S ) 2 ( x ) χ ( x ) ( x ) fh, I wxx R, R, (6) { xx R ( x )} 2 2 where w exp( 0.5σ ) = xx x x x is a Gaussian falloff weigh wih variance σ 2 x se o one hird of he size of R (x ), making he weigh adapive o he daa. (iii) Global color spaial-disribuion. This feaure measures he spaial variance of he color. Suppose he colors in he image can be represened by a Gaussian mixure model: wn c ( Ix μc, Σc) p( c Ix ) =, C (7) wn(, ) c 1 c I μ = x c Σc where C is he number of color componens. Then, afer obaining he posiion variance V(c) of each color componen c, he color spaial-disribuion feaure can be defined as f x, I p c I 1 V c. (8) s ( ) ( x ) ( ( )) c Furher deails abou he definiion and compuaion of V(c) can be found in [4]. These feaures have been exensively evaluaed, wih promising resuls, using a large-scale image daase. The inegraed saliency map based hereon is F x = λ f x, I + λ f x, I + λ f x, I, (9) ( ) ( ) ( ) ( ) S c c h h s s where he learned opimal parameers are λ c =0.24, λ h =0.54, and λ s =0.22 [4]. Saic saliency may be confused by a cluered background especially in videos, so we have also proposed compuing he same feaures on he opical flow fields o represen he dynamic moion saliency [21]. However, because of he insabiliy of he opical flow exracion algorihm, he opical flow field was smoohed before feaure exracion. If he opical flow is compued by a saisfacory approach such as he one proposed in [22], he feaures can be exraced direcly from M wihou exra smoohing. In he experimens presened in Secion 4, opical flow is compued by he algorihm inroduced in [22], and he same parameers (λ c, λ h and λ s ) are adoped o combine he hree ypes of feaures ino he moion saliency map FM ( x ). Having obained boh hese saliency maps, we can combine hem ino an overall saliency map F using he adapive sraegy proposed in [21]. Then, he saliency erm in eq. (3) can be defined as S(, I, M) = F( x, I, M) + (1 F( x, I, M)). (10) x: ax= 0 x: ax= 1 Figure 1 illusraes he framework of our mehod based on eq. (3). Besides he saliency maps, spaial and emporal coherence are wo oher imporan componens for global opimizaion, as explained in deail in he following secion. 2 Spaial and emporal coherence 2.1 Spaial coherence The spaial relaionship beween neighboring pixels has widely been used o represen he inrinsic propery of random fields. The simples relaionship is based on he Figure 1 Overall framework of our mehod.

4 1058 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No.10 assumpion ha proximal pixels are more likely o have he same caegory labels. In condiional random field-based applicaions, however, informaion from he daa is usually aken ino accoun as well, resuling in a more general spaial consrain because neighboring pixels belonging o he same caegory are more likely o share he same feaures/properies. Such a spaial coherence consrain is also widely referred o as he smoohing erm or smoohing model in he lieraure because i ends o smoohen he feaures/properies wihin he same caegory. Consequenly, we formulae he spaial coherence as S δ x x x,x C (, I, M ) = ( a a ) D( x, I, M ), ( 11) where x and x are neighboring pixels and D is he smoohness measuremen of he image and moion fields a pixel x. The smooher he image and moion fields are a x, he larger he value of D(x,I,M ) is. Therefore, when C S is minimized, he boundary pixels separaing he foreground objec and he background regions end o be locaed a or close o he edge poins of he image and moion field. simple ye effecive choice for D is he disance ransform [23], which is a sof smoohness measuremen. If chamfer disance is used as he disance meric, i can be compued efficienly. The formulaion above is a generic form ha conains boh he saic appearance and dynamic moion informaion. Here, we discuss hese wo ypical cues, ogeher wih he combinaion hereof. (i) Saic cue: appearance. lhough in general he saic cue can be any image feaures, we focus on he mos widely used one: raw color informaion. If only his cue is considered, eq. (11) becomes S δ x x ch x,x C (, I ) = ( a a ) D ( x, EM( I )), (12) where EM(I ) is he edge map of I and D ch is he chamfer disance. The saic cue uses he image edges o guide he labeling, which should be beneficial for he resuls if he background is relaively clean. Unforunaely, here are many cases in which he background is cluered, especially oudoor scenes as shown in Figure 2, where row (d) illusraes he Figure 2 Effeciveness of spaial coherence. (a) Sample video frames wih ground ruh labels; (b) saliency maps; (c) resuls based on saliency maps only; (d), (e), and (f) disance ransform maps of he spaial coherence based on color cue, moion cue, and boh color and moion cues, respecively, where he darkes areas (valleys) are poenial objec boundaries; (g) deecion resuls using boh saliency and spaial coherence.

5 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No disance ransform resuls using he saic cue only. The valleys (darkes areas) are spread ou across he whole images because of he cluered background. In hese cases, opimizaion using appearance-based spaial coherence can easily ge confused because i canno disinguish he real objec boundaries from he background cluer. (ii) Dynamic cue: moion. If he foreground objec moves differenly o he cluered background, he opical flow field M is useful for deecing he objec. Similar o he appearance-based spaial coherence, we can define spaial coherence of he moion field as S δ x x ch x,x C (, M ) = ( a a ) D ( x, EM( M )), (13) where EM(M ) is he edge map of M and D ch is once again he chamfer disance. s far as we are aware, his is he firs ime ha he opical flow field has been used as he spaial coherence consrain o deec salien objecs in videos. The firs 4 frames of row (e) in Figure 2 demonsrae he effeciveness of spaial coherence when he cluered background moves uniformly ye differenly o he salien objec. (iii) Inegraing saic and dynamic cues. When he salien objec moves in a dynamic scene wih an uneven background moion field, he power of he dynamic cue is weakened. In hese cases, he saic cue may be helpful if he moving pars of he background have weaker saic boundaries han he salien objecs. Therefore, combining he wo cues migh be a beer choice han using only one of hem. Inspired by he way in which he spaial and emporal smoohing erms are combined for moving objec deecion in reference [24], we combined he wo cues for spaial coherence by defining he smoohness measuremen D(x,I,M ) in eq. (11) as ( x,, ) = 1 1 ηi ch(, ( ) ) x ηm Dch( x EM ( M) ) D I M D EM I 1,, (14) where η I [0,1] and η M [0,1] are wo parameers conrolling he influence of he saic and dynamic cues, respecively (where 0 means ineffecive and 1 means fully involved). Such a combinaion can reduce he risk of being confused by he cluer or movemen in he background. The las wo rows in Figure 2 show he combined disance ransform map (i.e. D( x, I, M) a each pixel) and he deecion resuls using he wo-cue based spaial coherence, for which η I and η M are boh se o 1. I can be seen ha he resuls are much igher han hose using only saliency maps. 2.2 Temporal coherence Temporal coherence is designed o represen he emporal similariy beween he salien objecs from wo consecuive frames. Such emporal similariy can be measured by boh shape and appearance as C (, I ) = β C (, ) Shape T 1, 1, S T 1 ppearance + CT ( 1,, I 1, ), Shape 1 CT ( 1, ) = ax ax, x ppearance 2 T 1, 1, = χ 1 1 ( ( ) ( )) C (, I ) H, I, H, I, (15) where H(,I) is he appearance hisogram of area in frame I (in our experimens, only color has been used as he appearance represenaion). In pracice, we find ha seing he 0.5 w h balancing weigh o βs = 100 works well wih w+ h wh, denoing he image widh and heigh, respecively. Figure 3 shows an example of how emporal coherence works. The five consecuive frames are prone o a shor and irregular vibraion of he camera, causing he saliency maps o be heavily influenced by he disracing moion. However, he emporal coherence can smooh ou he randomness wihin he saliency maps and generae a sable sequence of objec bounding boxes. 3 Opimizaion sraegy Given he problem in eqs. (2) and (3), he opimizaion involves finding he bes labeling for each pixel x in each frame I of he video segmen I=I 1:T. This is compuaionally infeasible as he search space is 2 wht, where w,h denoe he image widh and heigh, respecively. In objec deecion, however, pixel-wise segmenaion resuls are usually unnecessary and he bounding boxes suffice. If he binary mask is resriced o a recangle, i can be represened by four parameers: cener x and size s, boh of which are wo-dimensional. The sae space hus decreases dramaically o no more han (w 2 h 2 ) T. Meanwhile, he emporal coherence based on shape can be simplified as Shape CT ( 1, ) = x x + γ s s, (16) where γ is a weigh balancing he locaion and scale differences according o [25]. The problem defined by eqs. (2) and (3) can be decomposed ino a se of subproblems, he soluions of which enable he original opimizaion problem o be solved quickly. Moreover, hese subproblems overlap, i.e., heir local soluions can be used muliple imes in he global opimizaion. Therefore, he opimizaion can be solved efficienly by dynamic programming. Le B ( ) be he summarized energy of frames 1 o, hen hese subproblems can be wrien recursively as

6 1060 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No.10 Figure 3 Effeciveness of emporal coherence. (a) Sample video frames wih ground ruh labels; (b) saliency maps; (c) resuls using saliency maps only; (d) resuls using boh saliency and emporal coherence. ( ) = ( T) = (, ) + α S (, ), ( ) = min { 1( 1) + E( I, 1, 1) } ( ) B E I S I C I : B B 1 N DP (,, ) α S (,, ) ( B 1 ( 1) βct ( 1,, I 1, ) ) N ( ) = S I M + C I M + min +. (17) 1 DP Noe ha we narrow down he search space of 1 by enforcing i o be in he neighborhood of, which coincides wih he emporal coherence requiremen. fer compuing all possible B ( ) for each frame I, he final soluion for he overall problem can be obained by ( ) = arg min B. (18) T T T T Then we race back he sequence o obain he opimal soluion for each of he res frames, {1,,T 1}: ( ( ) ( )) = arg min B,, + βct + I ( 1) 1, + 1. (19) N DP + Suppose he number of possible in I is N (no more han w 2 h 2 ), and NT is no very large, he index of can be cached o a able of every possible +1, wih size NT. Then he race back operaion can be done in O(T) ime by merely consuling he lookup able T( + 1) =. The compuaional expense of he algorihm is dominaed by he compuaion of B ( ), {1,,T}. Suppose here are n candidaes in he neighborhood space N DP ( ), hen he ime cos is O(nNT). Because of he variaions in objec size, i is no easy o se a proper neighborhood size n. If n is oo small, i may fail o represen he large movemens of big objecs, whereas conversely, an overlarge n loses he efficiency of he neighborhood search. To solve his problem and furher speed up he compuaion, a coarse-o-fine sraegy can be used [21]. ll he saliency maps are down-sampled o generae a pyramid (for example 6 layers) and hen, afer searching he enire sae space a he coarses scale, he opimizaion resuls serve as he iniial soluion for he finer scale. By doing his, he neighbor space can be limied o a small range, such as a circle wih a radius of 2 pixels. 4 Experimenal resuls To demonsrae quaniaively he advanages of using spaial and emporal coherence, we carried ou our experimens on 32 video segmens wih a oal of 4820 frames colleced from he inerne 2). Each video segmen conains a single salien objec, which ranges from humans performing various aciviies and animals in he wild, o vehicles boh on he ground and in he air. ll he frames are annoaed wih objec bounding boxes, and he deecion performance is evaluaed in erms of mean precision (P), recall (R), F-measure (F 0.5 ), and boundary displacemen errors (BDE) [3]. Four differen seings on eq. (3) have been adoped for performance comparison: saliency only, namely seing α = β = 0; involving spaial coherence (Saliency + SC), i.e. β = 0; involving emporal coherence (Saliency + TC), i.e. 2) This daase is freely available from he auhors.

7 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No α=0; and involving boh spaial and emporal coherence (Saliency + STC). For spaial coherence, we chose α=0.125 because oher values such as 0.025, 0.05, 0.25, 0.75 gave poorer resuls, while η I and η M were se o 1 because smaller values resuled in lower performance gain, and we found ha using boh cues was beer han using eiher of hem. For he emporal coherence, γ=2 and β=0.01 were found o be effecive. The mean performances of he four differen seings, illusraed in Figure 4, provide several valuable resuls. (i) Spaial coherence increases he precision significanly, bu a he same ime diminishes he recall somewha. Overall, i improves he performance in erms of boh he F-measure and BDE. Moreover, adding SC o Saliency yielded a 10.9% increase in precision and a 3.8% decrease in recall, whereas adding SC o Saliency + TC improved precision by 14.3% and reduced recall by 7.1%. The performance gains wih respec o he F-measure (BDE) for hese wo cases are 6.2% (10.3%) and 7.0% (10.1%), respecively. (ii) Temporal coherence increases boh he precision and he recall and herefore, significanly improves he F-mea- sure and BDE. Moreover, adding TC o Saliency improved precision by 9.2% and recall by 3.8%, whereas adding TC o Saliency + SC improved precision by 12.6% and recall by 0.3%. The performance improvemens in erms of F-measure (BDE) for hese wo cases are 7.6% (23.6%) and 8.3% (23.4%), respecively. (iii) Involving boh SC and TC is beer han using only one of hem, and can grealy improve he deecion performance. Compared wih Saliency, Saliency + STC improved he F-measure by 15.1% and reduced BDE by 31.3%. Figure 5 shows wo represenaive objec deecion resuls using hese four differen sraegies. The benefis of involving spaial and emporal coherence can be clearly seen. The experimenal resuls show ha spaial coherence and emporal coherence are in fac complemenary. The exisence of eiher of hem has no significan impac on he properies of he oher. lhough in our model he spaial coherence includes he beween-frame moion, i only requires he moion field wihin he objec o be as consisen as possible, whereas emporal coherence consrains he magniude of he moion vecors. Figure 4 Mean performance comparison for evaluaing he effeciveness of spaial and emporal coherence. The four differen seings on he horizonal axis are: 1, no coherence (saliency only); 2, involving spaial coherence; 3, involving emporal coherence; 4, involving boh spaial and emporal coherence. Figure 5 Qualiaive resuls using sampled video frames. (a) Inpu video segmens wih ground ruh labels; (b), (c), (d) and (e) deecion resuls using he four differen sraegies: Saliency, Saliency + SC, Saliency + TC and Saliency + STC, respecively.

8 1062 Wu Y, e al. Chinese Sci Bull pril (2011) Vol.56 No.10 5 Conclusion and discussion Inspired by he focused aenion inheren in human vision, we proposed a novel mehod for salien objec deecion ha incorporaes boh spaial and emporal coherence wih radiional saliency measuremens. Experimenal resuls demonsrae he effeciveness and complemenariness of he spaial and emporal coherence of saliency feaures. The mehod is simple and flexible in he sense ha any saliency compuaion models can be direcly embedded herein, and new cues on modeling he wihin-frame spaial coherence and beween-frame emporal coherence can easily be added. Moreover, dynamic programming and a coarse-o-fine search sraegy have been used o solve he problem efficienly. The limiaion of his work is ha i assumes ha he salien objec appears hroughou he whole video segmen, which implicily requires he segmen o be exraced in advance from a longer video. possible fuure work is o deec when he focused aenion sar and end, so ha he video segmen can be localized auomaically. This work was suppored by he Naional Naural Science Foundaion of China ( and ) and he Naional Basic Research Program of China (2007CB311005). 1 Ii L, Rees G, Tsosos J. Neurobiology of enion. San Diego: Elsevier, Treue S. Visual aenion: The where, wha, how and why of saliency. Curr Opin Neurobiol, 2003, 13: Liu T, Sun J, Zheng N N, e al. Learning o deec a salien objec. In: Proceedings of IEEE Compuer Sociey Conference on Compuer and Vision Paern Recogniion, Minneapolis, US, Liu T, Yuan Z J, Sun J, e al. Learning o deec a salien objec. IEEE Trans Paern nal Mach Inell, 02 Mar. 2010, doi: /TPMI Li J, Tian Y H, Huang T J, e al. Probabilisic muli-ask learning for visual saliency esimaion in video. In J Compu Vis, 2010, 90: Palmer S. Vision Science: Phoons o Phenomenology. Cambridge, M: The MIT Press, Ii L, Koch C, Niebur E. model of saliency-based visual aenion for rapid scene analysis. IEEE Trans Paern nal Mach Inell, 1998, 20: Ii L, Baldi P. principled approach o deecing surprising evens in video. In: Proceedings of he IEEE Compuer Sociey Conference on Compuer and Vision Paern Recogniion, San Diego, C, US, Walher D, Koch C. Modeling aenion o salien prooobjecs. Neural New, 2006, 19: Treisman M, Gelade G. feaure-inegraion heory of aenion. Cogn Psychol, 1980, 12: lexe B, Deselaers T, Ferrari V. Wha is an objec? In: Proceedings of IEEE Compuer Sociey Conference on Compuer and Vision Paern Recogniion, San Francisco, US, Moosmann F, Larlus D, Jurie F. Learning saliency maps for objec caegorizaion. In: Proceedings of ECCV Inernaional Workshop on he Represenaion and Use of Prior Knowledge in Vision, Graz, usria, Peers R J, Ii L. Beyond boom-up: incorporaing Task dependen influences ino a compuaional model of spaial aenion. In: Proceedings of IEEE Compuer Sociey Conference on Compuer and Vision Paern Recogniion, Minneapolis, US, Cao L L, Li F F. Spaially coheren laen opic model for concurren segmenaion and classificaion of objecs and scenes. In: Proceedings of IEEE Inernaional Conference on Compuer Vision, Rio de Janeiro, Brazil, Weiss Y, delson E H. unified mixure framework for moion segmenaion: incorporaing spaial coherence and esimaing he number of models. In: Proceedings of IEEE Compuer Sociey Conference on Compuer and Vision Paern Recogniion, San Francisco, US, Moscheni F, Dufaux F, Kun M. Objec racking based on emporal and spaial informaion. In: Proceedings of IEEE Conference on cousics, Speech, and Signal Processing, lana, US, Kim M, Choi J G, Kim D, e al. VOP generaion ool: uomaic segmenaion of moving objecs in image sequences based on spaio-emporal informaion. IEEE Trans Circuis Sys Video Technol, 1999, 9: Tsaig Y, verbuch. uomaic segmenaion of moving objecs in video sequences: region labeling approach. IEEE Trans Circuis Sys Video Technol, 2002, 12: Laffery J, McCallum, Pereira F. Condiional random fields: Probabilisic models for segmening and labeling sequence daa. In: Brodley C E, Danyluk P, eds. Proceedings of Inernaional Conference on Machine Learning, Williams College, Williamsown, M, US, 2001, Baker S, Roh S, Scharsein D, e al. daabase and evaluaion mehodology for opical flow. In: Proceedings of IEEE Inernaional Conference on Compuer Vision, Rio de Janeiro, Brazil, Liu T, Zheng N N, Ding W, e al. Video aenion: Learning o deec a salien objec sequence. In: Proceedings of Inernaional Conference on Paern Recogniion, Tampa, Florida, US, Brox T, Bruhn, Papenberg N, e al. High accuracy opical flow esimaion based on a heory for warping. In: Pajdla T, Maas J, eds. Proceedings of European Conference on Compuer Vision, Prague, Czech Republic, Rosenfeld, Pfalz J. Sequenial operaions in digial picure processing. J CM, 1966, 13: Yang H, Tian J, Chu Y, e al. Spaioemporal smooh models for moving objec deecion. IEEE Signal Process Le, 2008, 15: Sun J, Zhang W, Tang X, e al. Bidirecional racking using rajecory segmen analysis. In: Proceedings of IEEE Inernaional Conference on Compuer Vision, Beijing, China, Open ccess This aricle is disribued under he erms of he Creaive Commons ribuion License which permis any use, disribuion, and reproducion in any medium, provided he original auhor(s) and source are credied.

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding

Image segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same

More information

Video Content Description Using Fuzzy Spatio-Temporal Relations

Video Content Description Using Fuzzy Spatio-Temporal Relations Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer

More information

A Matching Algorithm for Content-Based Image Retrieval

A Matching Algorithm for Content-Based Image Retrieval A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using

More information

STEREO PLANE MATCHING TECHNIQUE

STEREO PLANE MATCHING TECHNIQUE STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo

More information

IntentSearch:Capturing User Intention for One-Click Internet Image Search

IntentSearch:Capturing User Intention for One-Click Internet Image Search JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO. 1, JANUARY 2010 1 InenSearch:Capuring User Inenion for One-Click Inerne Image Search Xiaoou Tang, Fellow, IEEE, Ke Liu, Jingyu Cui, Suden Member, IEEE, Fang

More information

Probabilistic Detection and Tracking of Motion Discontinuities

Probabilistic Detection and Tracking of Motion Discontinuities Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/

More information

Wheelchair-user Detection Combined with Parts-based Tracking

Wheelchair-user Detection Combined with Parts-based Tracking Wheelchair-user Deecion Combined wih Pars-based Tracking Ukyo Tanikawa 1, Yasuomo Kawanishi 1, Daisuke Deguchi 2,IchiroIde 1, Hiroshi Murase 1 and Ryo Kawai 3 1 Graduae School of Informaion Science, Nagoya

More information

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates

A Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates A Fas Sereo-Based Muli-Person Tracking using an Approximaed Likelihood Map for Overlapping Silhouee Templaes Junji Saake Jun Miura Deparmen of Compuer Science and Engineering Toyohashi Universiy of Technology

More information

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto

Visual Perception as Bayesian Inference. David J Fleet. University of Toronto Visual Percepion as Bayesian Inference David J Flee Universiy of Torono Basic rules of probabiliy sum rule (for muually exclusive a ): produc rule (condiioning): independence (def n ): Bayes rule: marginalizaion:

More information

Research Article Auto Coloring with Enhanced Character Registration

Research Article Auto Coloring with Enhanced Character Registration Compuer Games Technology Volume 2008, Aricle ID 35398, 7 pages doi:0.55/2008/35398 Research Aricle Auo Coloring wih Enhanced Characer Regisraion Jie Qiu, Hock Soon Seah, Feng Tian, Quan Chen, Zhongke Wu,

More information

Evaluation and Improvement of Region-based Motion Segmentation

Evaluation and Improvement of Region-based Motion Segmentation Evaluaion and Improvemen of Region-based Moion Segmenaion Mark Ross Universiy Koblenz-Landau, Insiue of Compuaional Visualisics, Universiässraße 1, 56070 Koblenz, Germany Email: ross@uni-koblenz.de Absrac

More information

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding

Analysis of Various Types of Bugs in the Object Oriented Java Script Language Coding Indian Journal of Science and Technology, Vol 8(21), DOI: 10.17485/ijs/2015/v8i21/69958, Sepember 2015 ISSN (Prin) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Various Types of Bugs in he Objec Oriened

More information

Learning in Games via Opponent Strategy Estimation and Policy Search

Learning in Games via Opponent Strategy Estimation and Policy Search Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen

More information

The Impact of Product Development on the Lifecycle of Defects

The Impact of Product Development on the Lifecycle of Defects The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes

More information

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL

CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz

More information

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES

MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image

More information

Visual Indoor Localization with a Floor-Plan Map

Visual Indoor Localization with a Floor-Plan Map Visual Indoor Localizaion wih a Floor-Plan Map Hang Chu Dep. of ECE Cornell Universiy Ihaca, NY 14850 hc772@cornell.edu Absrac In his repor, a indoor localizaion mehod is presened. The mehod akes firsperson

More information

Time Expression Recognition Using a Constituent-based Tagging Scheme

Time Expression Recognition Using a Constituent-based Tagging Scheme Track: Web Conen Analysis, Semanics and Knowledge Time Expression Recogniion Using a Consiuen-based Tagging Scheme Xiaoshi Zhong and Erik Cambria School of Compuer Science and Engineering Nanyang Technological

More information

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding

Moving Object Detection Using MRF Model and Entropy based Adaptive Thresholding Moving Objec Deecion Using MRF Model and Enropy based Adapive Thresholding Badri Narayan Subudhi, Pradipa Kumar Nanda and Ashish Ghosh Machine Inelligence Uni, Indian Saisical Insiue, Kolkaa, 700108, India,

More information

Open Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u

Open Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u Send Orders for Reprins o reprins@benhamscience.ae The Open Biomedical Engineering Journal, 5, 9, 9-3 9 Open Access Research on an Improved Medical Image Enhancemen Algorihm Based on P-M Model Luo Aijing

More information

Detection and segmentation of moving objects in highly dynamic scenes

Detection and segmentation of moving objects in highly dynamic scenes Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France

More information

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs)

Multi-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs) 2016 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems (IROS) Daejeon Convenion Cener Ocober 9-14, 2016, Daejeon, Korea Muli-Targe Deecion and Tracking from a Single Camera in Unmanned Aerial

More information

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes

A Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,

More information

4. Minimax and planning problems

4. Minimax and planning problems CS/ECE/ISyE 524 Inroducion o Opimizaion Spring 2017 18 4. Minima and planning problems ˆ Opimizing piecewise linear funcions ˆ Minima problems ˆ Eample: Chebyshev cener ˆ Muli-period planning problems

More information

Improved TLD Algorithm for Face Tracking

Improved TLD Algorithm for Face Tracking Absrac Improved TLD Algorihm for Face Tracking Huimin Li a, Chaojing Yu b and Jing Chen c Chongqing Universiy of Poss and Telecommunicaions, Chongqing 400065, China a li.huimin666@163.com, b 15023299065@163.com,

More information

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES

MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES Arun Kumar H. D. 1 and Prabhakar C. J. 2 1 Deparmen of Compuer Science, Kuvempu Universiy, Shimoga, India ABSTRACT

More information

Upper Body Tracking for Human-Machine Interaction with a Moving Camera

Upper Body Tracking for Human-Machine Interaction with a Moving Camera The 2009 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems Ocober -5, 2009 S. Louis, USA Upper Body Tracking for Human-Machine Ineracion wih a Moving Camera Yi-Ru Chen, Cheng-Ming Huang, and

More information

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions

A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions A Hierarchical Objec Recogniion Sysem Based on Muli-scale Principal Curvaure Regions Wei Zhang, Hongli Deng, Thomas G Dieerich and Eric N Morensen School of Elecrical Engineering and Compuer Science Oregon

More information

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies

Multi-camera multi-object voxel-based Monte Carlo 3D tracking strategies RESEARCH Open Access Muli-camera muli-objec voxel-based Mone Carlo 3D racking sraegies Crisian Canon-Ferrer *, Josep R Casas, Monse Pardàs and Enric Mone Absrac This aricle presens a new approach o he

More information

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)

Implementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report) Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,

More information

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game

Detection Tracking and Recognition of Human Poses for a Real Time Spatial Game Deecion Tracking and Recogniion of Human Poses for a Real Time Spaial Game Feifei Huo, Emile A. Hendriks, A.H.J. Oomes Delf Universiy of Technology The Neherlands f.huo@udelf.nl Pascal van Beek, Remco

More information

Object Trajectory Proposal via Hierarchical Volume Grouping

Object Trajectory Proposal via Hierarchical Volume Grouping Objec Trajecory Proposal via Hierarchical Volume Grouping Xu Sun 1, Yuanian Wang 1, Tongwei Ren 1,, Zhi Liu 2, Zheng-Jun Zha 3, and Gangshan Wu 1 1 Sae Key Laboraory for Novel Sofware Technology, Nanjing

More information

J. Vis. Commun. Image R.

J. Vis. Commun. Image R. J. Vis. Commun. Image R. 20 (2009) 9 27 Conens liss available a ScienceDirec J. Vis. Commun. Image R. journal homepage: www.elsevier.com/locae/jvci Face deecion and racking using a Boosed Adapive Paricle

More information

DAGM 2011 Tutorial on Convex Optimization for Computer Vision

DAGM 2011 Tutorial on Convex Optimization for Computer Vision DAGM 2011 Tuorial on Convex Opimizaion for Compuer Vision Par 3: Convex Soluions for Sereo and Opical Flow Daniel Cremers Compuer Vision Group Technical Universiy of Munich Graz Universiy of Technology

More information

A Face Detection Method Based on Skin Color Model

A Face Detection Method Based on Skin Color Model A Face Deecion Mehod Based on Skin Color Model Dazhi Zhang Boying Wu Jiebao Sun Qinglei Liao Deparmen of Mahemaics Harbin Insiue of Technology Harbin China 150000 Zhang_dz@163.com mahwby@hi.edu.cn sunjiebao@om.com

More information

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking

Multiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking Muliple View Discriminaive ing wih IMCMC for Disribued Tracking Sanhoshkumar Sunderrajan, B.S. Manjunah Deparmen of Elecrical and Compuer Engineering Universiy of California, Sana Barbara {sanhosh,manj}@ece.ucsb.edu

More information

Design Alternatives for a Thin Lens Spatial Integrator Array

Design Alternatives for a Thin Lens Spatial Integrator Array Egyp. J. Solids, Vol. (7), No. (), (004) 75 Design Alernaives for a Thin Lens Spaial Inegraor Array Hala Kamal *, Daniel V azquez and Javier Alda and E. Bernabeu Opics Deparmen. Universiy Compluense of

More information

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA

Audio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA Audio Engineering Sociey Convenion Paper Presened a he 119h Convenion 2005 Ocober 7 10 New Yor, New Yor USA This convenion paper has been reproduced from he auhor's advance manuscrip, wihou ediing, correcions,

More information

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors

We are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors We are InechOpen, he world s leading publisher of Open Access books Buil by scieniss, for scieniss 4,000 116,000 120M Open access books available Inernaional auhors and ediors Downloads Our auhors are

More information

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors

Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Viewpoin Invarian 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Chaohui Wang 1,2, Yun Zeng 3, Loic Simon 1, Ioannis Kakadiaris 4, Dimiris Samaras 3, Nikos Paragios 1,2

More information

In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps

In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magneic Field Maps A. D. Hahn 1, A. S. Nencka 1 and D. B. Rowe 2,1 1 Medical College of Wisconsin, Milwaukee, WI, Unied

More information

Definition and examples of time series

Definition and examples of time series Definiion and examples of ime series A ime series is a sequence of daa poins being recorded a specific imes. Formally, le,,p be a probabiliy space, and T an index se. A real valued sochasic process is

More information

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction

Occlusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction 58 IJCSNS Inernaional Journal of Compuer Science and Nework Securiy, VOL.6 No.10, Ocober 006 Occlusion-Free Hand Moion Tracking by Muliple Cameras and Paricle Filering wih Predicion Makoo Kao, and Gang

More information

Dynamic Route Planning and Obstacle Avoidance Model for Unmanned Aerial Vehicles

Dynamic Route Planning and Obstacle Avoidance Model for Unmanned Aerial Vehicles Volume 116 No. 24 2017, 315-329 ISSN: 1311-8080 (prined version); ISSN: 1314-3395 (on-line version) url: hp://www.ijpam.eu ijpam.eu Dynamic Roue Planning and Obsacle Avoidance Model for Unmanned Aerial

More information

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly

Rao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly MITSUBISHI ELECTRIC RESEARCH LABORATORIES hp://www.merl.com Rao-Blackwellized Paricle Filering for Probing-Based 6-DOF Localizaion in Roboic Assembly Yuichi Taguchi, Tim Marks, Haruhisa Okuda TR1-8 June

More information

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter

Robot localization under perceptual aliasing conditions based on laser reflectivity using particle filter Robo localizaion under percepual aliasing condiions based on laser refleciviy using paricle filer DongXiang Zhang, Ryo Kurazume, Yumi Iwashia, Tsuomu Hasegawa Absrac Global localizaion, which deermines

More information

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time

Quantitative macro models feature an infinite number of periods A more realistic (?) view of time INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many

More information

A new algorithm for small object tracking based on super-resolution technique

A new algorithm for small object tracking based on super-resolution technique A new algorihm for small objec racking based on super-resoluion echnique Yabunayya Habibi, Dwi Rana Sulisyaningrum, and Budi Seiyono Ciaion: AIP Conference Proceedings 1867, 020024 (2017); doi: 10.1063/1.4994427

More information

Landmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases

Landmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases Lmarks: A New Model for Similariy-Based Paern Querying in Time Series Daabases Chang-Shing Perng Haixun Wang Sylvia R. Zhang D. So Parker perng@cs.ucla.edu hxwang@cs.ucla.edu Sylvia Zhang@cle.com so@cs.ucla.edu

More information

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008

MATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008 MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o

More information

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves

AML710 CAD LECTURE 11 SPACE CURVES. Space Curves Intrinsic properties Synthetic curves AML7 CAD LECTURE Space Curves Inrinsic properies Synheic curves A curve which may pass hrough any region of hreedimensional space, as conrased o a plane curve which mus lie on a single plane. Space curves

More information

Video-Based Face Recognition Using Probabilistic Appearance Manifolds

Video-Based Face Recognition Using Probabilistic Appearance Manifolds Video-Based Face Recogniion Using Probabilisic Appearance Manifolds Kuang-Chih Lee Jeffrey Ho Ming-Hsuan Yang David Kriegman klee10@uiuc.edu jho@cs.ucsd.edu myang@honda-ri.com kriegman@cs.ucsd.edu Compuer

More information

Simple Network Management Based on PHP and SNMP

Simple Network Management Based on PHP and SNMP Simple Nework Managemen Based on PHP and SNMP Krasimir Trichkov, Elisavea Trichkova bsrac: This paper aims o presen simple mehod for nework managemen based on SNMP - managemen of Cisco rouer. The paper

More information

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine

FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS Soumya Hamlaoui & Franck Davoine HEUDIASYC Mixed Research Uni, CNRS / Compiègne Universiy of Technology BP 20529, 60205 Compiègne

More information

CONTEXT MODELS FOR CRF-BASED CLASSIFICATION OF MULTITEMPORAL REMOTE SENSING DATA

CONTEXT MODELS FOR CRF-BASED CLASSIFICATION OF MULTITEMPORAL REMOTE SENSING DATA ISPRS Annals of he Phoogrammery, Remoe Sensing and Spaial Informaion Sciences, Volume I-7, 2012 XXII ISPRS Congress, 25 Augus 01 Sepember 2012, Melbourne, Ausralia CONTEXT MODELS FOR CRF-BASED CLASSIFICATION

More information

Robust Visual Tracking for Multiple Targets

Robust Visual Tracking for Multiple Targets Robus Visual Tracking for Muliple Targes Yizheng Cai, Nando de Freias, and James J. Lile Universiy of Briish Columbia, Vancouver, B.C., Canada, V6T 1Z4 {yizhengc, nando, lile}@cs.ubc.ca Absrac. We address

More information

Multi-Scale Object Candidates for Generic Object Tracking in Street Scenes

Multi-Scale Object Candidates for Generic Object Tracking in Street Scenes Muli-Scale Objec Candidaes for Generic Objec Tracking in Sree Scenes Aljoša Ošep, Alexander Hermans, Francis Engelmann, Dirk Klosermann, Markus Mahias and Basian Leibe Absrac Mos vision based sysems for

More information

COSC 3213: Computer Networks I Chapter 6 Handout # 7

COSC 3213: Computer Networks I Chapter 6 Handout # 7 COSC 3213: Compuer Neworks I Chaper 6 Handou # 7 Insrucor: Dr. Marvin Mandelbaum Deparmen of Compuer Science York Universiy F05 Secion A Medium Access Conrol (MAC) Topics: 1. Muliple Access Communicaions:

More information

TrackNet: Simultaneous Detection and Tracking of Multiple Objects

TrackNet: Simultaneous Detection and Tracking of Multiple Objects TrackNe: Simulaneous Deecion and Tracking of Muliple Objecs Chenge Li New York Universiy cl2840@nyu.edu Gregory Dobler New York Universiy greg.dobler@nyu.edu Yilin Song New York Universiy ys1297@nyu.edu

More information

Real-Time Non-Rigid Multi-Frame Depth Video Super-Resolution

Real-Time Non-Rigid Multi-Frame Depth Video Super-Resolution Real-Time Non-Rigid Muli-Frame Deph Video Super-Resoluion Kassem Al Ismaeil 1, Djamila Aouada 1, Thomas Solignac 2, Bruno Mirbach 2, Björn Oersen 1 1 Inerdisciplinary Cenre for Securiy, Reliabiliy, and

More information

STRING DESCRIPTIONS OF DATA FOR DISPLAY*

STRING DESCRIPTIONS OF DATA FOR DISPLAY* SLAC-PUB-383 January 1968 STRING DESCRIPTIONS OF DATA FOR DISPLAY* J. E. George and W. F. Miller Compuer Science Deparmen and Sanford Linear Acceleraor Cener Sanford Universiy Sanford, California Absrac

More information

Weighted Voting in 3D Random Forest Segmentation

Weighted Voting in 3D Random Forest Segmentation Weighed Voing in 3D Random Fores Segmenaion M. Yaqub,, P. Mahon 3, M. K. Javaid, C. Cooper, J. A. Noble NDORMS, Universiy of Oxford, IBME, Deparmen of Engineering Science, Universiy of Oxford, 3 MRC Epidemiology

More information

Dynamic Depth Recovery from Multiple Synchronized Video Streams 1

Dynamic Depth Recovery from Multiple Synchronized Video Streams 1 Dynamic Deph Recoery from Muliple ynchronized Video reams Hai ao, Harpree. awhney, and Rakesh Kumar Deparmen of Compuer Engineering arnoff Corporaion Uniersiy of California a ana Cruz Washingon Road ana

More information

An Iterative Scheme for Motion-Based Scene Segmentation

An Iterative Scheme for Motion-Based Scene Segmentation An Ieraive Scheme for Moion-Based Scene Segmenaion Alexander Bachmann and Hildegard Kuehne Deparmen for Measuremen and Conrol Insiue for Anhropomaics Universiy of Karlsruhe (H), 76 131 Karlsruhe, Germany

More information

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15)

IROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15) IROS 2015 Workshop on On-line decision-making in muli-robo coordinaion () OPTIMIZATION-BASED COOPERATIVE MULTI-ROBOT TARGET TRACKING WITH REASONING ABOUT OCCLUSIONS KAROL HAUSMAN a,, GREGORY KAHN b, SACHIN

More information

EECS 487: Interactive Computer Graphics

EECS 487: Interactive Computer Graphics EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified

More information

Real-time 2D Video/3D LiDAR Registration

Real-time 2D Video/3D LiDAR Registration Real-ime 2D Video/3D LiDAR Regisraion C. Bodenseiner Fraunhofer IOSB chrisoph.bodenseiner@iosb.fraunhofer.de M. Arens Fraunhofer IOSB michael.arens@iosb.fraunhofer.de Absrac Progress in LiDAR scanning

More information

Gender Classification of Faces Using Adaboost*

Gender Classification of Faces Using Adaboost* Gender Classificaion of Faces Using Adaboos* Rodrigo Verschae 1,2,3, Javier Ruiz-del-Solar 1,2, and Mauricio Correa 1,2 1 Deparmen of Elecrical Engineering, Universidad de Chile 2 Cener for Web Research,

More information

Visually Summarizing the Web using Internal Images and Keyphrases

Visually Summarizing the Web using Internal Images and Keyphrases Visually Summarizing he Web using Inernal Images and Keyphrases M.V.Gedam, S. A. Taale Deparmen of compuer engineering, PUNE Universiy Vidya Praishhan s College of Engg., India Absrac Visual summarizaion

More information

Stereoscopic Neural Style Transfer

Stereoscopic Neural Style Transfer Sereoscopic Neural Syle Transfer Dongdong Chen 1 Lu Yuan 2, Jing Liao 2, Nenghai Yu 1, Gang Hua 2 1 Universiy of Science and Technology of China 2 Microsof Research cd722522@mail.usc.edu.cn, {luyuan,jliao}@microsof.com,

More information

Track-based and object-based occlusion for people tracking refinement in indoor surveillance

Track-based and object-based occlusion for people tracking refinement in indoor surveillance Trac-based and objec-based occlusion for people racing refinemen in indoor surveillance R. Cucchiara, C. Grana, G. Tardini Diparimeno di Ingegneria Informaica - Universiy of Modena and Reggio Emilia Via

More information

Robust Segmentation and Tracking of Colored Objects in Video

Robust Segmentation and Tracking of Colored Objects in Video IEEE TRANSACTIONS ON CSVT, VOL. 4, NO. 6, 2004 Robus Segmenaion and Tracking of Colored Objecs in Video Theo Gevers, member, IEEE Absrac Segmening and racking of objecs in video is of grea imporance for

More information

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure

ACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure SUBMITTED TO TRANSACTION ON IMAGE PROCESSING 1 Online Temporally Consisen Indoor Deph Video Enhancemen via Saic Srucure Lu Sheng, Suden Member, IEEE, King Ngi Ngan, Fellow, IEEE, Chern-Loon Lim and Songnan

More information

Mobile Robots Mapping

Mobile Robots Mapping Mobile Robos Mapping 1 Roboics is Easy conrol behavior percepion modelling domain model environmen model informaion exracion raw daa planning ask cogniion reasoning pah planning navigaion pah execuion

More information

Nonparametric CUSUM Charts for Process Variability

Nonparametric CUSUM Charts for Process Variability Journal of Academia and Indusrial Research (JAIR) Volume 3, Issue June 4 53 REEARCH ARTICLE IN: 78-53 Nonparameric CUUM Chars for Process Variabiliy D.M. Zombade and V.B. Ghue * Dep. of aisics, Walchand

More information

LAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation

LAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation Submission o Special Issue of CVIU on Model-based and Image-based 3D Scene Represenaion for Ineracive Visualizaion LAMP: 3D Layered, Adapive-resoluion and Muliperspecive Panorama - a New Scene Represenaion

More information

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 Adapive Appearance Model and Condensaion Algorihm for Robus Face Tracking Yui Man Lui, Suden Member,, J. Ross Beveridge, Member,,

More information

MOTION TRACKING is a fundamental capability that

MOTION TRACKING is a fundamental capability that TECHNICAL REPORT CRES-05-008, CENTER FOR ROBOTICS AND EMBEDDED SYSTEMS, UNIVERSITY OF SOUTHERN CALIFORNIA 1 Real-ime Moion Tracking from a Mobile Robo Boyoon Jung, Suden Member, IEEE, Gaurav S. Sukhame,

More information

Track and Cut: simultaneous tracking and segmentation of multiple objects with graph cuts

Track and Cut: simultaneous tracking and segmentation of multiple objects with graph cuts INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Track and Cu: simulaneous racking and segmenaion of muliple objecs wih graph cus Aurélie Bugeau Parick Pérez N 6337 Ocober 2007 Thèmes COM

More information

Multi-Vehicle Cooperative Local Mapping: A Methodology Based on Occupancy Grid Map Merging

Multi-Vehicle Cooperative Local Mapping: A Methodology Based on Occupancy Grid Map Merging Muli-Vehicle Cooperaive Local Mapping: Mehodology ased on Occupancy Grid Map Merging Hao Li, Manabu Tsukada, Fawzi Nashashibi, Michel Paren To cie his version: Hao Li, Manabu Tsukada, Fawzi Nashashibi,

More information

IAJIT First Online Publication

IAJIT First Online Publication An Improved Feaure Exracion and Combinaion of Muliple Classifiers for Query-by- ming Naha Phiwma and Parinya Sanguansa 2 Deparmen of Compuer Science, Suan Dusi Rajabha Universiy, Thailand 2 Faculy of Engineering

More information

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild

Robust LSTM-Autoencoders for Face De-Occlusion in the Wild IEEE TRANSACTIONS ON IMAGE PROCESSING, DRAFT 1 Robus LSTM-Auoencoders for Face De-Occlusion in he Wild Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan arxiv:1612.08534v1 [cs.cv] 27 Dec 2016

More information

NEWTON S SECOND LAW OF MOTION

NEWTON S SECOND LAW OF MOTION Course and Secion Dae Names NEWTON S SECOND LAW OF MOTION The acceleraion of an objec is defined as he rae of change of elociy. If he elociy changes by an amoun in a ime, hen he aerage acceleraion during

More information

Joint Feature Learning With Robust Local Ternary Pattern for Face Recognition

Joint Feature Learning With Robust Local Ternary Pattern for Face Recognition Join Feaure Learning Wih Robus Local Ternary Paern for Face Recogniion Yuvaraju.M 1, Shalini.S 1 Assisan Professor, Deparmen of Elecrical and Elecronics Engineering, Anna Universiy Regional Campus, Coimbaore,

More information

Robust Multi-view Face Detection Using Error Correcting Output Codes

Robust Multi-view Face Detection Using Error Correcting Output Codes Robus Muli-view Face Deecion Using Error Correcing Oupu Codes Hongming Zhang,2, Wen GaoP P, Xilin Chen 2, Shiguang Shan 2, and Debin Zhao Deparmen of Compuer Science and Engineering, Harbin Insiue of Technolog

More information

arxiv: v2 [cs.cv] 20 May 2018

arxiv: v2 [cs.cv] 20 May 2018 Sereoscopic Neural Syle Transfer Dongdong Chen 1 Lu Yuan 2, Jing Liao 2, Nenghai Yu 1, Gang Hua 2 1 Universiy of Science and Technology of China 2 Microsof Research cd722522@mail.usc.edu.cn, {jliao, luyuan,

More information

Rule-Based Multi-Query Optimization

Rule-Based Multi-Query Optimization Rule-Based Muli-Query Opimizaion Mingsheng Hong Dep. of Compuer cience Cornell Universiy mshong@cs.cornell.edu Johannes Gehrke Dep. of Compuer cience Cornell Universiy johannes@cs.cornell.edu Mirek Riedewald

More information

A time-space consistency solution for hardware-in-the-loop simulation system

A time-space consistency solution for hardware-in-the-loop simulation system Inernaional Conference on Advanced Elecronic Science and Technology (AEST 206) A ime-space consisency soluion for hardware-in-he-loop simulaion sysem Zexin Jiang a Elecric Power Research Insiue of Guangdong

More information

IDEF3 Process Description Capture Method

IDEF3 Process Description Capture Method IDEF3 Process Descripion Capure Mehod IDEF3 is par of he IDEF family of mehods developmen funded by he US Air Force o provide modelling suppor for sysems engineering and enerprise inegraion 2 IDEF3 Mehod

More information

arxiv: v1 [cs.cv] 25 Apr 2017

arxiv: v1 [cs.cv] 25 Apr 2017 Sudheendra Vijayanarasimhan Susanna Ricco svnaras@google.com ricco@google.com... arxiv:1704.07804v1 [cs.cv] 25 Apr 2017 SfM-Ne: Learning of Srucure and Moion from Video Cordelia Schmid Esimaed deph, camera

More information

A METHOD OF MODELING DEFORMATION OF AN OBJECT EMPLOYING SURROUNDING VIDEO CAMERAS

A METHOD OF MODELING DEFORMATION OF AN OBJECT EMPLOYING SURROUNDING VIDEO CAMERAS A METHOD OF MODELING DEFORMATION OF AN OBJECT EMLOYING SURROUNDING IDEO CAMERAS Joo Kooi TAN, Seiji ISHIKAWA Deparmen of Mechanical and Conrol Engineering Kushu Insiue of Technolog, Japan ehelan@is.cnl.kuech.ac.jp,

More information

4.1 3D GEOMETRIC TRANSFORMATIONS

4.1 3D GEOMETRIC TRANSFORMATIONS MODULE IV MCA - 3 COMPUTER GRAPHICS ADMN 29- Dep. of Compuer Science And Applicaions, SJCET, Palai 94 4. 3D GEOMETRIC TRANSFORMATIONS Mehods for geomeric ransformaions and objec modeling in hree dimensions

More information

An efficient approach to improve throughput for TCP vegas in ad hoc network

An efficient approach to improve throughput for TCP vegas in ad hoc network Inernaional Research Journal of Engineering and Technology (IRJET) e-issn: 395-0056 Volume: 0 Issue: 03 June-05 www.irje.ne p-issn: 395-007 An efficien approach o improve hroughpu for TCP vegas in ad hoc

More information

Real time 3D face and facial feature tracking

Real time 3D face and facial feature tracking J Real-Time Image Proc (2007) 2:35 44 DOI 10.1007/s11554-007-0032-2 ORIGINAL RESEARCH PAPER Real ime 3D face and facial feaure racking Fadi Dornaika Æ Javier Orozco Received: 23 November 2006 / Acceped:

More information

Scale Recovery for Monocular Visual Odometry Using Depth Estimated with Deep Convolutional Neural Fields

Scale Recovery for Monocular Visual Odometry Using Depth Estimated with Deep Convolutional Neural Fields Scale Recovery for Monocular Visual Odomery Using Deph Esimaed wih Deep Convoluional Neural Fields Xiaochuan Yin, Xiangwei Wang, Xiaoguo Du, Qijun Chen Tongji Universiy yinxiaochuan@homail.com,wangxiangwei.cpp@gmail.com,

More information

Difficulty-aware Hybrid Search in Peer-to-Peer Networks

Difficulty-aware Hybrid Search in Peer-to-Peer Networks Difficuly-aware Hybrid Search in Peer-o-Peer Neworks Hanhua Chen, Hai Jin, Yunhao Liu, Lionel M. Ni School of Compuer Science and Technology Huazhong Univ. of Science and Technology {chenhanhua, hjin}@hus.edu.cn

More information

Real Time Integral-Based Structural Health Monitoring

Real Time Integral-Based Structural Health Monitoring Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy

More information

Improving Ranking of Search Engines Results Based on Power Links

Improving Ranking of Search Engines Results Based on Power Links IPASJ Inernaional Journal of Informaion Technology (IIJIT) Web Sie: hp://www.ipasj.org/iijit/iijit.hm A Publisher for Research Moivaion... Email: edioriiji@ipasj.org Volume 2, Issue 9, Sepember 2014 ISSN

More information

Improving Occupancy Grid FastSLAM by Integrating Navigation Sensors

Improving Occupancy Grid FastSLAM by Integrating Navigation Sensors Improving Occupancy Grid FasSLAM by Inegraing Navigaion Sensors Chrisopher Weyers Sensors Direcorae Air Force Research Laboraory Wrigh-Paerson AFB, OH 45433 Gilber Peerson Deparmen of Elecrical and Compuer

More information

Large-scale 3D Outdoor Mapping and On-line Localization using 3D-2D Matching

Large-scale 3D Outdoor Mapping and On-line Localization using 3D-2D Matching Large-scale 3D Oudoor Mapping and On-line Localizaion using 3D-D Maching Takahiro Sakai, Kenji Koide, Jun Miura, and Shuji Oishi Absrac Map-based oudoor navigaion is an acive research area in mobile robos

More information