Viewpoint Invariant 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors
|
|
- Winfred McDowell
- 6 years ago
- Views:
Transcription
1 Viewpoin Invarian 3D Landmark Model Inference from Monocular 2D Images Using Higher-Order Priors Chaohui Wang 1,2, Yun Zeng 3, Loic Simon 1, Ioannis Kakadiaris 4, Dimiris Samaras 3, Nikos Paragios 1,2 1 Cener for Visual Compuing, Ecole Cenrale Paris, Châenay-Malabry, France 2 Equipe GALEN, INRIA Saclay - Ile-de-France, Orsay, France 3 Deparmen of Compuer Science, Sony Brook Universiy, NY, USA 4 Compuaional Biomedicine Lab, Universiy of Houson, TX, USA Absrac In his paper, we propose a novel one-sho opimizaion approach o simulaneously deermine boh he opimal 3D landmark model and he corresponding 2D projecions wihou explici esimaion of he camera viewpoin, which is also able o deal wih misdeecions as well as parial occlusions. To his end, a 3D shape manifold is buil upon fourh-order ineracions of landmarks from a raining se where pose-invarian saisics are obained in his space. The 3D-2D consisency is also encoded in such highorder ineracions, which eliminae he necessiy of viewpoin esimaion. Furhermore, he modeling of visibiliy improves furher he performance of he mehod by handling missing correspondences and occlusions. The inference is addressed hrough a MAP formulaion which is naurally ransformed ino a higher-order MRF opimizaion problem and is solved using a dual-decomposiion-based mehod. Promising resuls on sandard face benchmarks demonsrae he poenial of our approach. 1. Inroducion 3D model inference from 2D images is one of he mos challenging problems in compuer vision. This is due o he fac ha boh camera esimaion and 3D model opimizaion have o be addressed wihin a single framework. In he mos general case, he camera parameers are unknown, he 3D model iself usually inheris high complexiy (high degrees of freedom even for non-ariculaed objecs), while a he same ime image feaures can be ambiguous, occluded and noisy. There are numerous applicaions involving he above scenario, such as raffic monioring wih 3D model based racking [18], hand racking [9], facial analysis [4] and medical imaging [17]. Such an inference process usually involves hree seps: he firs aims o deermine a compac represenaion of he 3D model, he second o associae such a represenaion wih he 2D image observaion, and he las o recover he opimal parameers of he model. Modeling variaions of he 3D model requires a saisical parameric represenaion of he objec of ineres. Such represenaions in mos cases are pose-varian, i.e., all raining examples are regisered o a same referenial frame where saisics are hen buil from he raining daa. One can cie acive shape [8] and acive appearance [7] models (ASMs and AAMs), which offer a good compromise beween compuaional complexiy and model expressiveness poenial. Oher represenaions adop more complex saisical models ha can vary from Mixures of Gaussians (MoG) o non-parameric densiy funcions [3]. In such a conex, one has o address he curse of dimensionaliy (he dimensionaliy of he manifold o be learned versus he number of raining samples). Once he represenaion has been buil, he nex seps consis of defining an image likelihood and combining i wih a 3D model prior owards opimal esimaion of he 3D model. Since he image likelihood is relaed o boh he 3D model configuraion and he camera parameers, he model esimaion is ofen achieved hrough an alernaing search or EM-syle approach [14]. Given an iniial 3D-2D correspondence map, he camera parameers are firs esimaed and hen used o define he fiing error beween he model and he image. This error is o be opimized by gradiendriven mehods and ieraive search processes so as o esimae boh he correspondences and he opimal model configuraion. Despie he promising performance of such a scheme, he fac ha explici esimaion of he camera viewpoin parameers is required in he process is a major drawback, since coordinae-descen approaches are prone o be rapped in local minima and provide no guaranee on he opimaliy of he esimaions. Graphical models have become a dominan approach in compuer vision and have been employed o address a num IEEE Inernaional Conference on Compuer Vision /11/$26.00 c 2011 IEEE 319
2 ber of vision problems (e.g., [5, 19, 20]), which is mainly due o heir srengh in erms of he qualiy of he opimum. The use of higher-order models has raised more and more aenion (e.g., [15, 12, 22]), along wih he developmen of efficien opimizaion mehods. Higher-order ineracions can naurally inroduce invariance o cerain class of ransformaions like ranslaion/roaion/scale, while a he same ime hey can deal wih vision asks of imporan complexiy. The objecive of our paper is o ake benefi of heir srengh and propose a unified formulaion o esimae 3D models from 2D images wihou alernaing search. The main conribuion of his paper is a probabilisic inference approach ha does no require explici viewpoin esimaion, while being able o joinly opimize he pose parameers and he corresponding landmarks selecion as well as explicily handling missing correspondences and occlusions via a visibiliy modeling. To his end, we formulae he problem as a maximum a poseriori (MAP) esimaion ask which involves 3D pose parameers, associaed 2D correspondences and visibiliy saes. We derive a poserior probabiliy as he produc of an image likelihood, a visibiliy prior, a 3D geomeric prior and a projecion consisency prior consraining he 2D and 3D configuraions. In order o circumven he need of viewpoin esimaion, we adop a high-order decomposiion of he 3D model ha enables o deermine he projecion error beween a given 3D configuraion and he corresponding 2D landmark posiions in a disribued manner. Furhermore, an explici visibiliy modeling is also inroduced o cope wih misdeecions and ouliers. The MAP inference is hen naurally ransformed ino a higher-order MRF opimizaion problem and all he laen variables are inferred using a one-sho opimizaion over a facor graph [3] hrough dual-decomposiion [2, 16, 20, 22]. The proposed formulaion has been validaed in he conex of 3D facial pose esimaion from 2D images. Promising resuls on sandard face benchmarks demonsrae he poenial of our mehod. The remainder of his paper is organized as follows. In Sec. 2, we presen he probabilisic formulaion for he join esimaion of he 3D pose, is visibiliy saes and he 2D correspondences. The individual likelihoods wih respec o geomery, 3D-o-2D consisency, visibiliy, and image suppor are presened in Sec. 3 while he corresponding higherorder graphical model is discussed in Sec. 4. Experimenal resuls compose Sec. 5, while discussion and fuure work conclude he paper in Sec Probabilisic 3D-2D Inference Framework We consider a poin-disribuion shape model composed of a se V of landmarks locaed on he surface of he 3D objec of ineres. Le laen variable X i = (X i,x (2) i ) denoe he 3D and 2D posiions of a landmark i (i V). More specifically, X i and X (2) i, 3-dimensional and 2- dimensional vecors respecively, denoe he 3D posiion of landmark i in he model space and he 2D posiion in he observed image I. Each variable X i akes a value x i from is possible configuraion se X i = X i X (2) i, where X i and X (2) i denoe he 3D and 2D posiion candidae ses, respecively. Due o he fac ha landmarks may be invisible, we also inroduce a visibiliy variable O i for landmark i [19]. O i = 1 when he landmark is visible in he 2D image space, and O i = 0 oherwise. Given he observed image I, he esimaion of he 3D-2D posiions and he visibiliy of he landmarks is formulaed as a maximizaion of he poserior probabiliy of (X,O) = ((X i ) i V,(O i ) i V ) over heir domains X = i V X i and O = {0,1} V : (x,o) op = arg max p(x, o I) (1) (x,o) X O The poserior probabiliy p(x,o I) is: p(x,o I) = p(x,o,i)/p(i) p(x,o,i) = p(i x,o) p(x,o) = p(i x (2),x,o) p(x (2) x,o) p(o x ) p(x ) = p(i x (2),o) p(x (2) x,o) p(o) p(x ) }{{}}{{}}{{}}{{} Image Likelihood Projecion Prior Visibiliy Prior 3D Model Prior where p(i x (2),o) encodes he likelihood of he observed image given he 2D posiion configuraions x (2) and he visibiliy saes o of he landmarks, p(x (2) x,o) encodes he projecion prior from he 3D configuraion x o he 2D configuraion of he landmarks, p(o) denoes he visibiliy prior on he landmarks, and p(x ) denoes he prior on he 3D configuraions of he landmarks. Noe ha his probabilisic formulaion can be direcly applied o he esimaion of 3D (or 2D) configuraion of he landmarks given 2D (or 3D) configuraion, simply by insaniaing he variables whose configuraions are known. 3. Definiions of he Probabiliy Terms In his secion, we define all he probabiliy erms which are involved in he poserior probabiliy p(x,o I) (Eq. 2) Image Likelihood The image likelihood p(i x (2),o) measures he occurrence probabiliy of he observed image I, given he 2D posiion configuraions x (2) and he visibiliy saes o of he landmarks. If we assume, wihou loss of generaliy, ha he landmarks are independen in erms of appearance, hen we can define p(i x (2),o) as follows: p(i x (2),o) i V (2) p(i x (2) i,o i ) 320
3 Regarding p(i x (2) i,o i ), here are wo cases: 1. When O i = 1, he landmark s posiion is informaive and p(i x (2) i,o i ) denoes he likelihood of he observed image given ha landmark i is locaed a posiion x (2) i, which can be defined using he oupu of a classifier such as Randomized Fores [6]. 2. When O i = 0, he landmark s posiion is no informaive and p(i x (2) i,o i ) denoes a uniform disribuion, hus we assume ha p(i x (2) i,o i ) = ˆp (consan) Projecion Prior The projecion prior p(x (2) x,o) measures he occurrence possibiliy of he 2D posiions x (2) of he landmarks when he 3D posiions x and he visibiliy saes o are given, which is modeled using Gibbs disribuion: p(x (2) x,o) exp{ f(x,o) } (4) T where T is emperaure, and he energy funcion f(x,o) encodes inconsisency beween he 3D and 2D configuraions of he landmarks aking he visibiliy saes ino accoun (he smaller f(x, o) is, he beer is he correspondence beween x and x (2) ). Wihou loss of generaliy, we use he weak-perspecive camera configuraion [1] o model he projecion from 3D poins o 2D poins 1. Le us firs consider a riple T = { V and = 3} of landmarks ha are all visible. Their 3D-2D posiions x deermine a mos wo projecion mappings P (s) x (s {1,2}) [1, 11] corresponding o wo reflecive symmeric camera configuraions. Then for any addiional visible poin i, we can measure he error e x (x i ) beween is 2D posiion x (2) i and he value obained by projecing is 3D posiion x i, i.e.: e x (x i ) = min P (s) x (x i ) x (2) i (5) s {1,2} where denoes he Euclidean norm, and beween he wo feasible projecions we consider he mos prominen one wih respec o he considered 2D configuraion [1]. On he conrary, if one or more of hese four poins are invisible, we se a consan energy Ê as he projecion error e x (x i ), which can be undersood as an upper bound of he average projecion error which is allowed beween four poins. Therefore, we define he error funcion e x,o (x i,o i ) by aking he visibiliy saes ino accoun as: { ex (x i ) if o j = 1, j {i} e x,o (x i,o i ) = w Ê oherwise (6) 1 In he proposed framework, he weak-perspecive camera model can be easily replaced by oher camera models such as he perspecive model. where w is a confidence weigh for he error measure obained under he mapping deermined by he posiions of he poins in clique, which will be presened laer in his secion. And hen, he 3D-2D consisency beween a quadruple c of landmarks consiss of he sum of he errors which are deermined by aking all possible combinaions of riples wihin he quadruple and evaluaing he projecion error on he remaining poin: e(x c,o c ) = c e x,o (x c\,o c\ ) (7) Finally, we define he energy funcion f(x,o) as he sum of e(x c,o c ) over all he quadruple, i.e.: f(x,o) = c C e(x c,o c ) (8) where C = {c c V and c = 4} denoes he se of all quadruples. Las, we should noe ha we can furher combine oher cues in his projecion prior, such as regional exure similariy. Robus Confidence Weigh Since he projecion marix esimaion is unsable when considering riples of 3D poins ha are nearly collinear [1], we inroduce a confidence weigh w o modulae he error conribuion of each riple of poins. For a riangle x consising of a riple of poins wih 3D posiions x, we define he non-collinear coefficien NC(x ) us- ) and is perimeer ing he square roo of is area Area( x Perim( x ) as follows: NC(x ) = Area 1 2 ( x ) Perim( x ) We can observe ha NC(x ) = 1 for an equilaeral riangle and NC(x ) = 0 when he hree poins are collinear. Then we learn he confidence weigh w by averaging he non-collinear coefficiens for each riple over he raining daa: w = 1 M M m=1 (9) NC(x,m ) (10) where M denoes he number of raining samples. Specificaion of he Projecion Error Regarding he compuaion of e x (x i ), we use he efficien mehod proposed in [1] o compue direcly he projecion of a 3D poin under he projecion deermined by a riple 321
4 are collinear so ha he final soluion of x canno be exacly collinear. By doing so, he erm e x (x i ) is well-defined for all he cases. For he sake of clariy, hereafer, we assume ha he definiion of e x (x i ) in Eq. 5 implicily includes he definiion in he degenerae case. of corresponding 3D-2D poins wihou calculaing he projecion mapping. We refer readers o [1] for more deails. Collinear riples of poins lead o degenerae configuraions from which we canno obain a soluion for he projecion mapping. In his case, he corresponding error erm e x (x i ) in Eq. 5 is no well-defined. To deal wih his, we consider wo differen scenarios: (i) When we have a prior knowledge ha he 3D posiions of a riple of poins have o be collinear, we simply ignore he corresponding error measure by defining e x (x i ) = 0 (his is consisen wih he confidence weigh defined in Eq. 10, i.e., w = 0 leads o zero conribuion o f(x)); (ii) Oherwise, we define e x (x i ) = + if x 3.3. Visibiliy Prior We inroduce he visibiliy variable O o achieve a more precise modeling of he 3D-2D esimaion, due o he fac ha a landmark can be invisible. The noion of invisibiliy encodes occlusions and self-occlusions in he 3D space, as well as misdeecion due o insufficien image suppor or classificaion failure. The inference process is performed by considering, for each landmark i, a number of 2D posiions which lead o he highes probabiliies p(i x (2) i ) owards composing he se of plausible soluions for X (2) i, expecing ha a leas one candidae is (or close o) he rue posiion. However, because of erroneous deecion or occlusions, i is possible ha all he candidaes are far from he ground ruh. In such a conex, we define he noion of visibiliy as wheher he rue 2D correspondence of he landmark is capured by he candidae se. More specifically, O i = 1 means ha a leas one candidae in X (2) i is close o he ground ruh, and O i = 0 sands for he opposie case. The prior probabiliy p(o) is defined as follows: p(o) = i V p(o i ) (11) where p(o i ) denoes he prior probabiliy of he visibiliy of each individual landmark i and is modeled as a Bernoulli disribuion Bern(o i µ i ) wih parameer µ i = Pr(O i = 1). In pracice, i is usually reasonable o assume he same parameer µ > 0.5 for all he landmarks [20] D Model Prior The raining daa are used o learn a 3D shape model. No assumpion on regisraion beween surfaces is being made. However, we assume ha correspondences have been deermined for he landmarks among he samples of he raining se. The key concern of shape modeling is how o capure he inheren variabiliy of he class of objecs from a reasonable small raining se using a compac represenaion ha can be easily adoped owards an efficien inference. We adop he pose-invarian prior in [22] which is based on he relaive lenghs of a riple of poins, and exend i in a more general formulaion where he cliques can be of any higher order. Such a prior does no require he esimaion of he global pose in he raining and esing sages and eliminaes he bias caused by such esimaions. Le us consider a clique 2 c (c V and c 3) of landmarks, we enumerae all he pairs P c = {(i,j) i,j c and i < j} of poins. Le d ij = x i x j denoe he Euclidean disance beween poins i and j ((i,j) P c ). We obain he relaive disance ˆd ij by normalizing he disance d ij over he sum of he disances beween he pairs of poins involved in clique c, i.e., ˆd ij = d ij / d ij (12) (i,j) P c Since for clique c, any relaive disance ˆd ij is a linear combinaion of he ohers (i.e., (i,j) P c ˆdij = 1), we sore all he relaive disances, excep one in a vecor ˆd c = ( ˆd ij ) (i,j) Pc, where P c conains he pairs ha are involved in he vecor ˆd c. Then, he saisics on ˆd c are learned from he raining daa. We can model is disribuion p c (ˆd c ) using sandard probabilisic models such as MoGs and Parzen-Windows. Finally, we define he prior probabiliy of he 3D configuraion as: p(x ) c C p c (ˆd c (x c )) (13) where C denoes he se of all cliques, and ˆd c (x 3 c) denoes he mapping from he 3D posiion x 3 c of he clique c o he relaive disance vecor ˆd c. 4. Higher-order MRF Formulaion The daa likelihood, he 3D-2D consisency, he visibiliy prior and he 3D shape model, presened in Sec. 3, can be naurally encoded wihin a higher-order MRF model where laen variables are o be inferred hrough an energy minimizaion. In his perspecive, he negaive logarihm of he poserior probabiliy (Eq. 2) is facorized ino he poenials of he MRF and consiues he MRF energy. To his end, we use a node o model a landmark i (i V) wih is laen 3D-2D posiion X i and is visibiliy O i. 2 As presened in Sec. 4, we use 4-order cliques (quadruples) in his work, i.e., c = 4. However, oher higher-order cliques c ( c 3) can also be used in his shape model. 322
5 Acually, we can use a single random variable 3 o encode X i and O i compacly by simply defining a special label occ wihin 2D posiion candidae se X (2) i such ha: { (x x i = i,x (2) i ) if O i = 1 (x (14) i, occ) if O i = 0 This compac represenaion is valid because he 2D posiion x (2) i is meaningless when he landmark i is occluded (i.e., when O i = 0, he image likelihood p(i x (2),o) and he projecion prior p(x (2) x,o) are consan wih respec o x (2) i.). In order o facorize he poenial funcions, we use a fourh-order clique o model a quadruple c of landmarks. Due o he bijecive mappings beween nodes and landmarks and beween fourh-order cliques and quadruples, we reuse V and C o denoe he node se and he clique se which deermine he opology of he MRF. The 3D and 2D posiions of he landmarks are esimaed hrough he minimizaion of he MRF energy E(x): x op = arg min E(x) (15) x X Here, he energy of he MRF is defined as he negaive logarihm of he poserior probabiliy in Eq. 2 (up o an addiive consan) and can be facorized ino he following form: E(x) = i V U i (x i ) + c C H c (x c ) (16) where x c denoes he configuraion (x i ) i c of clique c. Singleon poenial U i (x i ) (i V) encodes he daa likelihood (see secion 3.1) and he visibiliy prior (see secion 3.3). Afer aking he negaive logarihm, we obain is definiion as follows: U i (x i ) = { log p(i x (2) i ) if x (2) i occ λ 1 if x (2) i = occ where λ 1 is a consan coefficien. (17) Higher-order clique poenial H c (x c ) (c C) is defined as follows: H c (x c ) = λ 2 H (1) c (x c ) + λ 3 H (2) c (x c ) (18) where λ 2 > 0 and λ 3 > 0 are wo balancing consans, H c (1) (x c ) encodes he 3D saisic geomery consrains implied by he shape prior on he 3D configuraion of he landmarks, and H c (2) (x c ) encodes he 3D-2D projecion prior: { H (1) c (x c ) = log p c (ˆd c (x c )) H (2) c (x c ) = e(x c,o c (x c )) (19) 3 In order o reduce he number of symbols used, we reuse X i o denoe his new random variable. Accordingly, we reuse x i, X i and he oher relaed noaions. (a) Figure 1. (a) The disribuion of landmarks; (b) The hisogram presening he disribuion of he number of missing 2D correspondences in he firs experimen. where o c (x c ) denoes he binary visibiliy values ha are recovered from x c using Eq. 14, and he definiions of e(x c,o c ) and p c (α(x c )) have been presened in Sec. 3.2 and 3.4, respecively. Dual-Decomposiion MRF Inference: Regarding he inference of he proposed higher-order MRF, we adop he dual-decomposiion opimizaion framework [2, 16], which is considered o be he sae-of-he-ar owards MAP-MRF inference [16, 20] in paricular when handling higher-order MRFs [15, 22]. Based on his framework, we decompose he original problem which is difficul o solve direcly ino a se of sub-problems which can be solved very efficienly. The soluions of he sub-problems are combined using projeced subgradien mehod [16, 20] o achieve he soluion of he original problem. Regarding he decomposiion, like [22], we decompose he original graph ino a se of facor rees which can be solved wihin polynomial ime using max-produc belief propagaion algorihm [3]. 5. Experimenal Resuls 5.1. Experimenal Seings The performance of he proposed mehod was evaluaed on he publicly-available facial expression daases BU- 3DFE [24] and BU-4DFE [23]. The former consiss of 3D range daa of 6 prooypical facial expressions of 100 differen subjecs (56 female and 44 male), and he laer is composed of 3D dynamic facial expressions of 101 differen subjecs (58 female and 43 male). The subjecs included in boh daases are of various ehnic/racial origins. The considered model consiss of 13 landmarks (eyes, nose, mouh and eyebrows as shown in Fig. 1(a)). In he inference sage, is 3D iniializaion was done by randomly picking one raining example. Regarding he 3D posiions of he landmarks, he search was guided by a coarse-o-fine scheme and sparse sampling sraegy in a similar way as [13]. Upon convergence of he algorihm, we performed Procruses Analysis [10] o obain he similariy ransform (b) 323
6 (a) (b) 2 3 (c) Figure 2. Resuls of he firs experimen. (a) and (b): 3D model esimaion resuls. In each sub-figure, 3D face mesh is provided for measuring visually he error beween he resuling posiions (in red) of landmarks and he ground ruh (in blue). (c): Boxplos for he disribuions of dissimilariy measures for qualiaively evaluaing he 3D model esimaion. c.1: Resuls obained by he proposed mehod; c.2: Resuls obained by he version wihou visibiliy modeling; c.3: Iniializaion of he model. On each box, he cenral mark in red is he median, he edges of he box are he 25h and 75h perceniles, he whiskers exend o he mos exreme daa poins no considered ouliers, and ouliers are ploed individually. beween he esimaed 3D model and he ground ruh, hen ransformed he esimaed one ino he referenial frame of he ground ruh. In erms of quaniaive evaluaion, a common goodness-of-fi crierion is he squared error sandardized by he scale of he objec. Thus, Procruses disance [10] was used as he dissimilariy measure E d o evaluae our mehod quaniaively, which can be compued as follows: X 2 2 X x i C (20) x i x i / Ed = i V i V he landmarks (randomly sampled), he rue correspondence was removed and replaced wih a random posiion in he image plane, which produced beween 0 and 5 missing 2D correspondences for each es (see Fig. 1(b)). Figs. 2(a) and (b) presen 3D model esimaion resuls. Fig. 2(c).3 and Fig. 2(c).1 (i.e., he boxes 3 and 1 in Fig. 2(c)) depic he saisics of he dissimilariy measure E d (Eq. 20) for he iniializaion and he resuling 3D model obained by he proposed mehod, respecively. The qualiaive and quaniaive evaluaions demonsrae ha our mehod leads o well-esimaed 3D models even when correspondences are parially missing. Furhermore, in order o demonsrae he impac of he visibiliy modeling, we have also evaluaed an alernaive version (wihou visibiliy modeling) of he proposed mehod where he occ label was removed from he 2D candidae se of each node, and show he obained saisics of E d in Fig. 2(c).2. Based on he comparison of Fig. 2(c).1 and Fig. 2(c).2, we can conclude ha he visibiliy modeling indeed leads o significanly beer performance. Second, we employed he facial feaure poin deecor proposed in [21] o obain he 2D posiion candidaes for 101 samples of BU-4DFE, also one from each subjec. Such a deecor is based on Gabor feaures and boosing classifiers, and can well localize he considered landmarks from observed 2D images (Figs. 3(a)-(f)), hough errors may sill be presen in some ess. We also performed a leaveone-ou cross-validaion in his experimen. Figs. 3(a )-(f ) show six 3D model esimaion resuls of differen qualiies and Fig. 3(g) presens he saisics of E d for he proposed mehod and he version wihou visibiliy modeling. These resuls furher demonsrae he poenial of he proposed mehod o infer he 3D configuraion of he model from 2D observed images wih misdeecions/occlusion handling. Las bu no leas, we compared our mehod wih an al- where x i and x i denoe he resuling and ground ruh 3D posiions of landmark i, respecively, C = P 1 is he cener of he ground ruh model. The i V x i V d smaller E is, he closer he resuling model is o he ground ruh. In all he experimens, he concep of leave-one-ou cross-validaion was adoped owards he evaluaion of he mehod. In his conex, we do he validaion on a sample while using he remaining samples as raining daa, and such a validaion is done for all he samples conained in a daase using he same parameer seings. Regarding he 3D model prior (Eq. 13), we modeled he probabiliy disri buion pc (d c (xc )) beween a quadruple c of poins using a wo-componen Gaussian Mixure Qualiaive Resuls and Quaniaive Analysis Firs, we considered 100 samples of he neural expression from BU-3DFE, one from each subjec. The 2D landmark correspondence space was associaed wih 5 labels, four corresponding o he 2D posiion candidaes and he las o he occlusion label occ. On op of he ground ruh correspondence, noise was added o generae erroneous 2D candidaes as well. Furhermore, for 10% of 324
7 (a) (b) (c) (d) (e) (f) 2 3 (g) (a ) (b ) (c ) (d ) (e ) (f ) Figure 3. Resuls of he second experimen. (a)-(f): 2D landmark deecion resuls [21]; (a )-(f ): The corresponding 3D model esimaion resuls. (g): Boxplos for he disribuions of dissimilariy measures for qualiaively evaluaing he 3D model esimaion. g.1: Resuls obained by he proposed mehod; g.2: Resuls obained by he version wihou visibiliy modeling; g.3: Iniializaion of he model. ernaive mehod (ASM+RANSAC) wih a relaxed condiion where we assumed ha he ground ruh 2D correspondences were known. For each es, we firs learned an ASM [8] from he raining daa. Then, we used RANSAC [11] o esimae he camera projecion funcion based on he iniializaion of he shape model and he given ground ruh 2D correspondences. Once he projecion funcion was esimaed, we searched for he bes shape configuraion by minimizing he errors beween he projecions of he 3D poins and heir 2D correspondences. Furhermore, we evaluaed boh mehods using wo differen iniializaions: besides he random sample iniializaion used hroughou he experimens, we also esed he mean-shape iniializaion where we chose one example as he reference, regisered all he oher raining examples o i and compued he mean shape as iniializaion. We performed leave-one-ou crossvalidaion on all he 2500 samples of BU-3DFE daase and he quaniaive evaluaion is shown in Fig. 4. Figs. 4.1 and 4.4 show ha our mehod performed equally well wih he wo differen iniializaions, which demonsraes robusness wih respec o he choice of iniializaion. The evalua- ion of ASM+RANSAC is presened in Figs. 4.2 and 4.5. We observe from Fig. 4 ha he dissimilariy measure of our mehod is approximaely 3 o 5 imes lower compared o ASM+RANSAC, which demonsraes ha our mehod performs significanly beer han ASM+RANSAC and is highly robus wih respec o he iniializaion. In conclusion, he resuls of all he experimens demonsrae ha our mehod, despie he imporan variabiliy of pose and facial geomery, has well esimaed he 3D configuraion of he model even wih he exisence of misdeecions, and ouperforms significanly he alernaive mehods. 6. Conclusion In his paper, we have inroduced a novel approach for 3D landmark model inference from a monocular 2D view ha combines he esimaion of he 3D pose, he visibiliy saes and he 2D correspondences. The main innovaions of he mehod are he absence of camera parameers esimaion, he abiliy o model geomeric consisency hrough local priors, he explici modeling of visibiliy and 325
8 Figure 4. Comparison wih ASM+RANSAC in erms of dissimilariy measures. 1. Our mehod wih random-sample iniializaion; 2. ASM+RANSAC wih random-sample iniializaion; 3. The random-sample iniializaion; 4. Our mehod wih mean-shape iniializaion; 5. ASM+RANSAC wih mean-shape iniializaion; 6. The mean-shape iniializaion. he one-sho opimizaion o joinly infer all he variables. We have evaluaed our mehod on sandard facial daases wih promising resuls. Fuure work concerns firs he achievemen of a beer model decomposiion owards recovering he smalles subse of higher-order ineracions ha can express he 3D geomeric manifold, which could drasically decrease he compuaional complexiy of he mehod. The use of more advanced parameerizaions of he manifold which go beyond simple 3D landmark posiions (e.g., he enire surface hrough some kind of local inerpolaion) would open new applicaion domains of our mehod like body pose esimaion or medical image analysis where 2D parial acquisiion of 3D objecs is frequen. Las bu no leas, faser opimizaion algorihms of higher-order MRFs, including poenial implemenaions of exising opimizers on GPUs, could be beneficial o our approach boh in erms of he considered applicaion as well as in erms of modulariy wih respec o oher 3D pose esimaion problems from 2D images. Acknowledgmens This work was parially suppored by he European Research Council Saring Gran DIOCLES (ERC-STG ) and he MEDICEN Compeiive Cluser sereos+ gran. References [1] T. D. Aler. 3-D pose from 3 poins using weak-perspecive. IEEE TPAMI, 16(8): , [2] D. P. Bersekas. Nonlinear Programming. Ahena Scienific, [3] C. M. Bishop. Paern Recogniion and Machine Learning. Springer, [4] V. Blanz and T. Veer. A morphable model for he synhesis of 3D faces. In SIGGRAPH, [5] Y. Boykov and G. Funka-Lea. Graph cus and efficien N-D image segmenaion. IJCV, 70(2): , [6] L. Breiman. Random foress. Machine Learning, 45(1):5 32, [7] T. F. Cooes, G. J. Edwards, and C. J. Taylor. Acive appearance models. IEEE TPAMI, 23(6): , [8] T. F. Cooes, C. J. Taylor, D. H. Cooper, and J. Graham. Acive shape models - heir raining and applicaion. CVIU, 61(1):38 59, [9] M. de La Gorce, N. Paragios, and D. J. Flee. Model-based hand racking wih exure, shading and self-occlusions. In CVPR, [10] I. L. Dryden and K. V. Mardia. Saisical shape analysis. John Wiley & Sons Inc., [11] M. A. Fischler and R. C. Bolles. Random sample consensus: A paradigm for model fiing wih applicaions o image analysis and auomaed carography. Communicaions of he ACM, 24(6): , [12] B. Glocker, H. Heibel, N. Navab, P. Kohli, and C. Roher. TriangleFlow: Opical flow wih riangulaion-based higherorder likelihoods. In ECCV, [13] B. Glocker, N. Komodakis, G. Tzirias, N. Navab, and N. Paragios. Dense image regisraion hrough MRFs and efficien linear programming. Medical Image Analysis, 12(6): , [14] L. Gu and T. Kanade. 3D alignmen of face in a single image. In CVPR, [15] N. Komodakis and N. Paragios. Beyond pairwise energies: Efficien opimizaion for higher-order MRFs. In CVPR, [16] N. Komodakis, N. Paragios, and G. Tzirias. MRF opimizaion via dual decomposiion: Message-passing revisied. In ICCV, [17] R. Kurazume, K. Nakamura, T. Okada, Y. Sao, N. Sugano, T. Koyama, Y. Iwashia, and T. Hasegawa. 3D reconsrucion of a femoral shape using a parameric model and wo 2D fluoroscopic images. CVIU, 113(2): , [18] D. Roller, K. Daniilidis, and H.-H. Nagel. Model-based objec racking in monocular image sequences of road raffic scenes. IJCV, 10: , [19] E. B. Sudderh, M. I. Mandel, W. T. Freeman, and A. S. Willsky. Disribued occlusion reasoning for racking wih nonparameric belief propagaion. In NIPS, [20] L. Torresani, V. Kolmogorov, and C. Roher. Feaure correspondence via graph maching: Models and global opimizaion. In ECCV, [21] D. Vukadinovic and M. Panic. Fully auomaic facial feaure poin deecion using gabor feaure based boosed classifiers. In IEEE SMC, [22] C. Wang, O. Teboul, F. Michel, S. Essafi, and N. Paragios. 3D knowledge-based segmenaion using poseinvarian higher-order graphs. In MICCAI, [23] L. Yin, X. Chen, Y. Sun, T. Worm, and M. Reale. A highresoluion 3D dynamic facial expression daabase. In In. Conf. on Auomaic Face and Gesure Recogniion, [24] L. Yin, X. Wei, Y. Sun, J. Wang, and M. Rosao. A 3D facial expression daabase for facial behavior research. In In. Conf. on Auomaic Face and Gesure Recogniion,
STEREO PLANE MATCHING TECHNIQUE
STEREO PLANE MATCHING TECHNIQUE Commission III KEY WORDS: Sereo Maching, Surface Modeling, Projecive Transformaion, Homography ABSTRACT: This paper presens a new ype of sereo maching algorihm called Sereo
More informationCAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL
CAMERA CALIBRATION BY REGISTRATION STEREO RECONSTRUCTION TO 3D MODEL Klečka Jan Docoral Degree Programme (1), FEEC BUT E-mail: xkleck01@sud.feec.vubr.cz Supervised by: Horák Karel E-mail: horak@feec.vubr.cz
More informationImage segmentation. Motivation. Objective. Definitions. A classification of segmentation techniques. Assumptions for thresholding
Moivaion Image segmenaion Which pixels belong o he same objec in an image/video sequence? (spaial segmenaion) Which frames belong o he same video sho? (emporal segmenaion) Which frames belong o he same
More informationLearning in Games via Opponent Strategy Estimation and Policy Search
Learning in Games via Opponen Sraegy Esimaion and Policy Search Yavar Naddaf Deparmen of Compuer Science Universiy of Briish Columbia Vancouver, BC yavar@naddaf.name Nando de Freias (Supervisor) Deparmen
More informationVisual Perception as Bayesian Inference. David J Fleet. University of Toronto
Visual Percepion as Bayesian Inference David J Flee Universiy of Torono Basic rules of probabiliy sum rule (for muually exclusive a ): produc rule (condiioning): independence (def n ): Bayes rule: marginalizaion:
More informationFACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS. Soumya Hamlaoui & Franck Davoine
FACIAL ACTION TRACKING USING PARTICLE FILTERS AND ACTIVE APPEARANCE MODELS Soumya Hamlaoui & Franck Davoine HEUDIASYC Mixed Research Uni, CNRS / Compiègne Universiy of Technology BP 20529, 60205 Compiègne
More informationImplementing Ray Casting in Tetrahedral Meshes with Programmable Graphics Hardware (Technical Report)
Implemening Ray Casing in Terahedral Meshes wih Programmable Graphics Hardware (Technical Repor) Marin Kraus, Thomas Erl March 28, 2002 1 Inroducion Alhough cell-projecion, e.g., [3, 2], and resampling,
More informationVideo-Based Face Recognition Using Probabilistic Appearance Manifolds
Video-Based Face Recogniion Using Probabilisic Appearance Manifolds Kuang-Chih Lee Jeffrey Ho Ming-Hsuan Yang David Kriegman klee10@uiuc.edu jho@cs.ucsd.edu myang@honda-ri.com kriegman@cs.ucsd.edu Compuer
More informationEECS 487: Interactive Computer Graphics
EECS 487: Ineracive Compuer Graphics Lecure 7: B-splines curves Raional Bézier and NURBS Cubic Splines A represenaion of cubic spline consiss of: four conrol poins (why four?) hese are compleely user specified
More informationProbabilistic Detection and Tracking of Motion Discontinuities
Probabilisic Deecion and Tracking of Moion Disconinuiies Michael J. Black David J. Flee Xerox Palo Alo Research Cener 3333 Coyoe Hill Road Palo Alo, CA 94304 fblack,fleeg@parc.xerox.com hp://www.parc.xerox.com/fblack,fleeg/
More informationAudio Engineering Society. Convention Paper. Presented at the 119th Convention 2005 October 7 10 New York, New York USA
Audio Engineering Sociey Convenion Paper Presened a he 119h Convenion 2005 Ocober 7 10 New Yor, New Yor USA This convenion paper has been reproduced from he auhor's advance manuscrip, wihou ediing, correcions,
More informationIntentSearch:Capturing User Intention for One-Click Internet Image Search
JOURNAL OF L A T E X CLASS FILES, VOL. 6, NO. 1, JANUARY 2010 1 InenSearch:Capuring User Inenion for One-Click Inerne Image Search Xiaoou Tang, Fellow, IEEE, Ke Liu, Jingyu Cui, Suden Member, IEEE, Fang
More information4. Minimax and planning problems
CS/ECE/ISyE 524 Inroducion o Opimizaion Spring 2017 18 4. Minima and planning problems ˆ Opimizing piecewise linear funcions ˆ Minima problems ˆ Eample: Chebyshev cener ˆ Muli-period planning problems
More informationA Matching Algorithm for Content-Based Image Retrieval
A Maching Algorihm for Conen-Based Image Rerieval Sue J. Cho Deparmen of Compuer Science Seoul Naional Universiy Seoul, Korea Absrac Conen-based image rerieval sysem rerieves an image from a daabase using
More informationMultiple View Discriminative Appearance Modeling with IMCMC for Distributed Tracking
Muliple View Discriminaive ing wih IMCMC for Disribued Tracking Sanhoshkumar Sunderrajan, B.S. Manjunah Deparmen of Elecrical and Compuer Engineering Universiy of California, Sana Barbara {sanhosh,manj}@ece.ucsb.edu
More informationMORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES
MORPHOLOGICAL SEGMENTATION OF IMAGE SEQUENCES B. MARCOTEGUI and F. MEYER Ecole des Mines de Paris, Cenre de Morphologie Mahémaique, 35, rue Sain-Honoré, F 77305 Fonainebleau Cedex, France Absrac. In image
More informationNonparametric CUSUM Charts for Process Variability
Journal of Academia and Indusrial Research (JAIR) Volume 3, Issue June 4 53 REEARCH ARTICLE IN: 78-53 Nonparameric CUUM Chars for Process Variabiliy D.M. Zombade and V.B. Ghue * Dep. of aisics, Walchand
More informationRobust Visual Tracking for Multiple Targets
Robus Visual Tracking for Muliple Targes Yizheng Cai, Nando de Freias, and James J. Lile Universiy of Briish Columbia, Vancouver, B.C., Canada, V6T 1Z4 {yizhengc, nando, lile}@cs.ubc.ca Absrac. We address
More informationWe are IntechOpen, the world s leading publisher of Open Access books Built by scientists, for scientists. International authors and editors
We are InechOpen, he world s leading publisher of Open Access books Buil by scieniss, for scieniss 4,000 116,000 120M Open access books available Inernaional auhors and ediors Downloads Our auhors are
More informationA Fast Stereo-Based Multi-Person Tracking using an Approximated Likelihood Map for Overlapping Silhouette Templates
A Fas Sereo-Based Muli-Person Tracking using an Approximaed Likelihood Map for Overlapping Silhouee Templaes Junji Saake Jun Miura Deparmen of Compuer Science and Engineering Toyohashi Universiy of Technology
More informationReal-time 2D Video/3D LiDAR Registration
Real-ime 2D Video/3D LiDAR Regisraion C. Bodenseiner Fraunhofer IOSB chrisoph.bodenseiner@iosb.fraunhofer.de M. Arens Fraunhofer IOSB michael.arens@iosb.fraunhofer.de Absrac Progress in LiDAR scanning
More informationIEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1
TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART A: SYSTEMS AND HUMANS 1 Adapive Appearance Model and Condensaion Algorihm for Robus Face Tracking Yui Man Lui, Suden Member,, J. Ross Beveridge, Member,,
More informationReal-Time Non-Rigid Multi-Frame Depth Video Super-Resolution
Real-Time Non-Rigid Muli-Frame Deph Video Super-Resoluion Kassem Al Ismaeil 1, Djamila Aouada 1, Thomas Solignac 2, Bruno Mirbach 2, Björn Oersen 1 1 Inerdisciplinary Cenre for Securiy, Reliabiliy, and
More informationCoded Caching with Multiple File Requests
Coded Caching wih Muliple File Requess Yi-Peng Wei Sennur Ulukus Deparmen of Elecrical and Compuer Engineering Universiy of Maryland College Park, MD 20742 ypwei@umd.edu ulukus@umd.edu Absrac We sudy a
More informationAUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION
Chaper 3 AUTOMATIC 3D FACE REGISTRATION WITHOUT INITIALIZATION A. Koschan, V. R. Ayyagari, F. Boughorbel, and M. A. Abidi Imaging, Roboics, and Inelligen Sysems Laboraory, The Universiy of Tennessee, 334
More informationGender Classification of Faces Using Adaboost*
Gender Classificaion of Faces Using Adaboos* Rodrigo Verschae 1,2,3, Javier Ruiz-del-Solar 1,2, and Mauricio Correa 1,2 1 Deparmen of Elecrical Engineering, Universidad de Chile 2 Cener for Web Research,
More informationEvaluation and Improvement of Region-based Motion Segmentation
Evaluaion and Improvemen of Region-based Moion Segmenaion Mark Ross Universiy Koblenz-Landau, Insiue of Compuaional Visualisics, Universiässraße 1, 56070 Koblenz, Germany Email: ross@uni-koblenz.de Absrac
More informationRecovering Joint and Individual Components in Facial Data
JOURNAL OF L A E X CLASS FILES, VOL. 14, NO. 8, AUGUS 2015 1 Recovering Join and Individual Componens in Facial Daa Chrisos Sagonas, Evangelos Ververas, Yannis Panagakis, and Sefanos Zafeiriou, Member,
More informationGauss-Jordan Algorithm
Gauss-Jordan Algorihm The Gauss-Jordan algorihm is a sep by sep procedure for solving a sysem of linear equaions which may conain any number of variables and any number of equaions. The algorihm is carried
More informationDetection and segmentation of moving objects in highly dynamic scenes
Deecion and segmenaion of moving objecs in highly dynamic scenes Aurélie Bugeau Parick Pérez INRIA, Cenre Rennes - Breagne Alanique Universié de Rennes, Campus de Beaulieu, 35 042 Rennes Cedex, France
More informationIn fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magnetic Field Maps
In fmri a Dual Echo Time EPI Pulse Sequence Can Induce Sources of Error in Dynamic Magneic Field Maps A. D. Hahn 1, A. S. Nencka 1 and D. B. Rowe 2,1 1 Medical College of Wisconsin, Milwaukee, WI, Unied
More informationVisual Indoor Localization with a Floor-Plan Map
Visual Indoor Localizaion wih a Floor-Plan Map Hang Chu Dep. of ECE Cornell Universiy Ihaca, NY 14850 hc772@cornell.edu Absrac In his repor, a indoor localizaion mehod is presened. The mehod akes firsperson
More informationAn Iterative Scheme for Motion-Based Scene Segmentation
An Ieraive Scheme for Moion-Based Scene Segmenaion Alexander Bachmann and Hildegard Kuehne Deparmen for Measuremen and Conrol Insiue for Anhropomaics Universiy of Karlsruhe (H), 76 131 Karlsruhe, Germany
More informationA Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions
A Hierarchical Objec Recogniion Sysem Based on Muli-scale Principal Curvaure Regions Wei Zhang, Hongli Deng, Thomas G Dieerich and Eric N Morensen School of Elecrical Engineering and Compuer Science Oregon
More informationStereoscopic Neural Style Transfer
Sereoscopic Neural Syle Transfer Dongdong Chen 1 Lu Yuan 2, Jing Liao 2, Nenghai Yu 1, Gang Hua 2 1 Universiy of Science and Technology of China 2 Microsof Research cd722522@mail.usc.edu.cn, {luyuan,jliao}@microsof.com,
More informationThe Impact of Product Development on the Lifecycle of Defects
The Impac of Produc Developmen on he Lifecycle of Rudolf Ramler Sofware Compeence Cener Hagenberg Sofware Park 21 A-4232 Hagenberg, Ausria +43 7236 3343 872 rudolf.ramler@scch.a ABSTRACT This paper invesigaes
More informationRobust Multi-view Face Detection Using Error Correcting Output Codes
Robus Muli-view Face Deecion Using Error Correcing Oupu Codes Hongming Zhang,2, Wen GaoP P, Xilin Chen 2, Shiguang Shan 2, and Debin Zhao Deparmen of Compuer Science and Engineering, Harbin Insiue of Technolog
More informationQuantitative macro models feature an infinite number of periods A more realistic (?) view of time
INFINIE-HORIZON CONSUMPION-SAVINGS MODEL SEPEMBER, Inroducion BASICS Quaniaive macro models feaure an infinie number of periods A more realisic (?) view of ime Infinie number of periods A meaphor for many
More informationDeformable Parts Correlation Filters for Robust Visual Tracking
PAPER UNDER REVISION Deformable Pars Correlaion Filers for Robus Visual Tracking Alan Lukežič, Luka Čehovin, Member, IEEE, and Maej Krisan, Member, IEEE ha par-based models should be considered in a layered
More informationRao-Blackwellized Particle Filtering for Probing-Based 6-DOF Localization in Robotic Assembly
MITSUBISHI ELECTRIC RESEARCH LABORATORIES hp://www.merl.com Rao-Blackwellized Paricle Filering for Probing-Based 6-DOF Localizaion in Roboic Assembly Yuichi Taguchi, Tim Marks, Haruhisa Okuda TR1-8 June
More informationRobust LSTM-Autoencoders for Face De-Occlusion in the Wild
IEEE TRANSACTIONS ON IMAGE PROCESSING, DRAFT 1 Robus LSTM-Auoencoders for Face De-Occlusion in he Wild Fang Zhao, Jiashi Feng, Jian Zhao, Wenhan Yang, Shuicheng Yan arxiv:1612.08534v1 [cs.cv] 27 Dec 2016
More informationImproved TLD Algorithm for Face Tracking
Absrac Improved TLD Algorihm for Face Tracking Huimin Li a, Chaojing Yu b and Jing Chen c Chongqing Universiy of Poss and Telecommunicaions, Chongqing 400065, China a li.huimin666@163.com, b 15023299065@163.com,
More informationVideo Content Description Using Fuzzy Spatio-Temporal Relations
Proceedings of he 4s Hawaii Inernaional Conference on Sysem Sciences - 008 Video Conen Descripion Using Fuzzy Spaio-Temporal Relaions rchana M. Rajurkar *, R.C. Joshi and Sananu Chaudhary 3 Dep of Compuer
More information4.1 3D GEOMETRIC TRANSFORMATIONS
MODULE IV MCA - 3 COMPUTER GRAPHICS ADMN 29- Dep. of Compuer Science And Applicaions, SJCET, Palai 94 4. 3D GEOMETRIC TRANSFORMATIONS Mehods for geomeric ransformaions and objec modeling in hree dimensions
More informationCENG 477 Introduction to Computer Graphics. Modeling Transformations
CENG 477 Inroducion o Compuer Graphics Modeling Transformaions Modeling Transformaions Model coordinaes o World coordinaes: Model coordinaes: All shapes wih heir local coordinaes and sies. world World
More informationMODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES
MODEL BASED TECHNIQUE FOR VEHICLE TRACKING IN TRAFFIC VIDEO USING SPATIAL LOCAL FEATURES Arun Kumar H. D. 1 and Prabhakar C. J. 2 1 Deparmen of Compuer Science, Kuvempu Universiy, Shimoga, India ABSTRACT
More informationReal Time Integral-Based Structural Health Monitoring
Real Time Inegral-Based Srucural Healh Monioring The nd Inernaional Conference on Sensing Technology ICST 7 J. G. Chase, I. Singh-Leve, C. E. Hann, X. Chen Deparmen of Mechanical Engineering, Universiy
More informationIn Proceedings of CVPR '96. Structure and Motion of Curved 3D Objects from. using these methods [12].
In Proceedings of CVPR '96 Srucure and Moion of Curved 3D Objecs from Monocular Silhouees B Vijayakumar David J Kriegman Dep of Elecrical Engineering Yale Universiy New Haven, CT 652-8267 Jean Ponce Compuer
More informationLAMP: 3D Layered, Adaptive-resolution and Multiperspective Panorama - a New Scene Representation
Submission o Special Issue of CVIU on Model-based and Image-based 3D Scene Represenaion for Ineracive Visualizaion LAMP: 3D Layered, Adapive-resoluion and Muliperspecive Panorama - a New Scene Represenaion
More informationReal time 3D face and facial feature tracking
J Real-Time Image Proc (2007) 2:35 44 DOI 10.1007/s11554-007-0032-2 ORIGINAL RESEARCH PAPER Real ime 3D face and facial feaure racking Fadi Dornaika Æ Javier Orozco Received: 23 November 2006 / Acceped:
More informationMichiel Helder and Marielle C.T.A Geurts. Hoofdkantoor PTT Post / Dutch Postal Services Headquarters
SHORT TERM PREDICTIONS A MONITORING SYSTEM by Michiel Helder and Marielle C.T.A Geurs Hoofdkanoor PTT Pos / Duch Posal Services Headquarers Keywords macro ime series shor erm predicions ARIMA-models faciliy
More informationResearch Article Auto Coloring with Enhanced Character Registration
Compuer Games Technology Volume 2008, Aricle ID 35398, 7 pages doi:0.55/2008/35398 Research Aricle Auo Coloring wih Enhanced Characer Regisraion Jie Qiu, Hock Soon Seah, Feng Tian, Quan Chen, Zhongke Wu,
More informationarxiv: v2 [cs.cv] 20 May 2018
Sereoscopic Neural Syle Transfer Dongdong Chen 1 Lu Yuan 2, Jing Liao 2, Nenghai Yu 1, Gang Hua 2 1 Universiy of Science and Technology of China 2 Microsof Research cd722522@mail.usc.edu.cn, {jliao, luyuan,
More informationWeighted Voting in 3D Random Forest Segmentation
Weighed Voing in 3D Random Fores Segmenaion M. Yaqub,, P. Mahon 3, M. K. Javaid, C. Cooper, J. A. Noble NDORMS, Universiy of Oxford, IBME, Deparmen of Engineering Science, Universiy of Oxford, 3 MRC Epidemiology
More informationA Bayesian Approach to Video Object Segmentation via Merging 3D Watershed Volumes
A Bayesian Approach o Video Objec Segmenaion via Merging 3D Waershed Volumes Yu-Pao Tsai 1,3, Chih-Chuan Lai 1,2, Yi-Ping Hung 1,2, and Zen-Chung Shih 3 1 Insiue of Informaion Science, Academia Sinica,
More informationDAGM 2011 Tutorial on Convex Optimization for Computer Vision
DAGM 2011 Tuorial on Convex Opimizaion for Compuer Vision Par 3: Convex Soluions for Sereo and Opical Flow Daniel Cremers Compuer Vision Group Technical Universiy of Munich Graz Universiy of Technology
More informationTrack and Cut: simultaneous tracking and segmentation of multiple objects with graph cuts
INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET EN AUTOMATIQUE Track and Cu: simulaneous racking and segmenaion of muliple objecs wih graph cus Aurélie Bugeau Parick Pérez N 6337 Ocober 2007 Thèmes COM
More informationAn Adaptive Spatial Depth Filter for 3D Rendering IP
JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.3, NO. 4, DECEMBER, 23 175 An Adapive Spaial Deph Filer for 3D Rendering IP Chang-Hyo Yu and Lee-Sup Kim Absrac In his paper, we presen a new mehod
More informationLandmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases
Lmarks: A New Model for Similariy-Based Paern Querying in Time Series Daabases Chang-Shing Perng Haixun Wang Sylvia R. Zhang D. So Parker perng@cs.ucla.edu hxwang@cs.ucla.edu Sylvia Zhang@cle.com so@cs.ucla.edu
More informationOpen Access Research on an Improved Medical Image Enhancement Algorithm Based on P-M Model. Luo Aijing 1 and Yin Jin 2,* u = div( c u ) u
Send Orders for Reprins o reprins@benhamscience.ae The Open Biomedical Engineering Journal, 5, 9, 9-3 9 Open Access Research on an Improved Medical Image Enhancemen Algorihm Based on P-M Model Luo Aijing
More informationUpper Body Tracking for Human-Machine Interaction with a Moving Camera
The 2009 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems Ocober -5, 2009 S. Louis, USA Upper Body Tracking for Human-Machine Ineracion wih a Moving Camera Yi-Ru Chen, Cheng-Ming Huang, and
More informationMulti-Target Detection and Tracking from a Single Camera in Unmanned Aerial Vehicles (UAVs)
2016 IEEE/RSJ Inernaional Conference on Inelligen Robos and Sysems (IROS) Daejeon Convenion Cener Ocober 9-14, 2016, Daejeon, Korea Muli-Targe Deecion and Tracking from a Single Camera in Unmanned Aerial
More informationACQUIRING high-quality and well-defined depth data. Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure
SUBMITTED TO TRANSACTION ON IMAGE PROCESSING 1 Online Temporally Consisen Indoor Deph Video Enhancemen via Saic Srucure Lu Sheng, Suden Member, IEEE, King Ngi Ngan, Fellow, IEEE, Chern-Loon Lim and Songnan
More informationDesign Alternatives for a Thin Lens Spatial Integrator Array
Egyp. J. Solids, Vol. (7), No. (), (004) 75 Design Alernaives for a Thin Lens Spaial Inegraor Array Hala Kamal *, Daniel V azquez and Javier Alda and E. Bernabeu Opics Deparmen. Universiy Compluense of
More informationMATH Differential Equations September 15, 2008 Project 1, Fall 2008 Due: September 24, 2008
MATH 5 - Differenial Equaions Sepember 15, 8 Projec 1, Fall 8 Due: Sepember 4, 8 Lab 1.3 - Logisics Populaion Models wih Harvesing For his projec we consider lab 1.3 of Differenial Equaions pages 146 o
More informationTracking Appearances with Occlusions
Tracking ppearances wih Occlusions Ying Wu, Ting Yu, Gang Hua Deparmen of Elecrical & Compuer Engineering Norhwesern Universiy 2145 Sheridan oad, Evanson, IL 60208 {yingwu,ingyu,ganghua}@ece.nwu.edu bsrac
More informationWheelchair-user Detection Combined with Parts-based Tracking
Wheelchair-user Deecion Combined wih Pars-based Tracking Ukyo Tanikawa 1, Yasuomo Kawanishi 1, Daisuke Deguchi 2,IchiroIde 1, Hiroshi Murase 1 and Ryo Kawai 3 1 Graduae School of Informaion Science, Nagoya
More informationEffects needed for Realism. Ray Tracing. Ray Tracing: History. Outline. Foundations of Computer Graphics (Fall 2012)
Foundaions of ompuer Graphics (Fall 2012) S 184, Lecure 16: Ray Tracing hp://ins.eecs.berkeley.edu/~cs184 Effecs needed for Realism (Sof) Shadows Reflecions (Mirrors and Glossy) Transparency (Waer, Glass)
More informationRepresenting Non-Manifold Shapes in Arbitrary Dimensions
Represening Non-Manifold Shapes in Arbirary Dimensions Leila De Floriani,2 and Annie Hui 2 DISI, Universiy of Genova, Via Dodecaneso, 35-646 Genova (Ialy). 2 Deparmen of Compuer Science, Universiy of Maryland,
More informationRobot localization under perceptual aliasing conditions based on laser reflectivity using particle filter
Robo localizaion under percepual aliasing condiions based on laser refleciviy using paricle filer DongXiang Zhang, Ryo Kurazume, Yumi Iwashia, Tsuomu Hasegawa Absrac Global localizaion, which deermines
More informationScale Recovery for Monocular Visual Odometry Using Depth Estimated with Deep Convolutional Neural Fields
Scale Recovery for Monocular Visual Odomery Using Deph Esimaed wih Deep Convoluional Neural Fields Xiaochuan Yin, Xiangwei Wang, Xiaoguo Du, Qijun Chen Tongji Universiy yinxiaochuan@homail.com,wangxiangwei.cpp@gmail.com,
More informationRobust parameterized component analysis: theory and applications to 2D facial appearance models
Compuer Vision and Image Undersanding 91 (2003) 53 71 www.elsevier.com/locae/cviu Robus parameerized componen analysis: heory and applicaions o 2D facial appearance models Fernando De la Torre a, * and
More informationRobust 3D Visual Tracking Using Particle Filtering on the SE(3) Group
Robus 3D Visual Tracking Using Paricle Filering on he SE(3) Group Changhyun Choi and Henrik I. Chrisensen Roboics & Inelligen Machines, College of Compuing Georgia Insiue of Technology Alana, GA 3332,
More informationDetection of salient objects with focused attention based on spatial and temporal coherence
ricle Informaion Processing Technology pril 2011 Vol.56 No.10: 1055 1062 doi: 10.1007/s11434-010-4387-1 SPECIL TOPICS: Deecion of salien objecs wih focused aenion based on spaial and emporal coherence
More informationJoint Feature Learning With Robust Local Ternary Pattern for Face Recognition
Join Feaure Learning Wih Robus Local Ternary Paern for Face Recogniion Yuvaraju.M 1, Shalini.S 1 Assisan Professor, Deparmen of Elecrical and Elecronics Engineering, Anna Universiy Regional Campus, Coimbaore,
More informationReinforcement Learning by Policy Improvement. Making Use of Experiences of The Other Tasks. Hajime Kimura and Shigenobu Kobayashi
Reinforcemen Learning by Policy Improvemen Making Use of Experiences of The Oher Tasks Hajime Kimura and Shigenobu Kobayashi Tokyo Insiue of Technology, JAPAN genfe.dis.iech.ac.jp, kobayasidis.iech.ac.jp
More informationJ. Vis. Commun. Image R.
J. Vis. Commun. Image R. 20 (2009) 9 27 Conens liss available a ScienceDirec J. Vis. Commun. Image R. journal homepage: www.elsevier.com/locae/jvci Face deecion and racking using a Boosed Adapive Paricle
More informationMoving Object Detection Using MRF Model and Entropy based Adaptive Thresholding
Moving Objec Deecion Using MRF Model and Enropy based Adapive Thresholding Badri Narayan Subudhi, Pradipa Kumar Nanda and Ashish Ghosh Machine Inelligence Uni, Indian Saisical Insiue, Kolkaa, 700108, India,
More informationTHE goal of this work is to develop statistical models for
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 32, NO. 4, APRIL 2010 579 Nonsaionary Shape Aciviies: Dynamic Models for Landmark Shape Change and Applicaions Samarji Das, Suden Member,
More informationIROS 2015 Workshop on On-line decision-making in multi-robot coordination (DEMUR 15)
IROS 2015 Workshop on On-line decision-making in muli-robo coordinaion () OPTIMIZATION-BASED COOPERATIVE MULTI-ROBOT TARGET TRACKING WITH REASONING ABOUT OCCLUSIONS KAROL HAUSMAN a,, GREGORY KAHN b, SACHIN
More informationLearning nonlinear appearance manifolds for robot localization
Learning nonlinear appearance manifolds for robo localizaion Jihun Hamm, Yuanqing Lin, and Daniel. D. Lee GRAS Lab, Deparmen of Elecrical and Sysems Engineering Universiy of ennsylvania, hiladelphia, A
More informationImage Content Representation
Image Conen Represenaion Represenaion for curves and shapes regions relaionships beween regions E.G.M. Perakis Image Represenaion & Recogniion 1 Reliable Represenaion Uniqueness: mus uniquely specify an
More informationWho thinks who knows who? Socio-Cognitive Analysis of an Network
Who hinks who knows who? Socio-Cogniive Analysis of an Email Nework Nishih Pahak Deparmen of Compuer Science Universiy of Minnesoa Minneapolis, MN, USA npahak@cs.umn.edu Sandeep Mane Deparmen of Compuer
More informationImproving the Efficiency of Dynamic Service Provisioning in Transport Networks with Scheduled Services
Improving he Efficiency of Dynamic Service Provisioning in Transpor Neworks wih Scheduled Services Ralf Hülsermann, Monika Jäger and Andreas Gladisch Technologiezenrum, T-Sysems, Goslarer Ufer 35, D-1585
More informationMulti-camera multi-object voxel-based Monte Carlo 3D tracking strategies
RESEARCH Open Access Muli-camera muli-objec voxel-based Mone Carlo 3D racking sraegies Crisian Canon-Ferrer *, Josep R Casas, Monse Pardàs and Enric Mone Absrac This aricle presens a new approach o he
More informationChapter 3 MEDIA ACCESS CONTROL
Chaper 3 MEDIA ACCESS CONTROL Overview Moivaion SDMA, FDMA, TDMA Aloha Adapive Aloha Backoff proocols Reservaion schemes Polling Disribued Compuing Group Mobile Compuing Summer 2003 Disribued Compuing
More informationPART 1 REFERENCE INFORMATION CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONITOR
. ~ PART 1 c 0 \,).,,.,, REFERENCE NFORMATON CONTROL DATA 6400 SYSTEMS CENTRAL PROCESSOR MONTOR n CONTROL DATA 6400 Compuer Sysems, sysem funcions are normally handled by he Monior locaed in a Peripheral
More informationRule-Based Multi-Query Optimization
Rule-Based Muli-Query Opimizaion Mingsheng Hong Dep. of Compuer cience Cornell Universiy mshong@cs.cornell.edu Johannes Gehrke Dep. of Compuer cience Cornell Universiy johannes@cs.cornell.edu Mirek Riedewald
More informationOcclusion-Free Hand Motion Tracking by Multiple Cameras and Particle Filtering with Prediction
58 IJCSNS Inernaional Journal of Compuer Science and Nework Securiy, VOL.6 No.10, Ocober 006 Occlusion-Free Hand Moion Tracking by Muliple Cameras and Paricle Filering wih Predicion Makoo Kao, and Gang
More informationSENSING using 3D technologies, structured light cameras
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 39, NO. 10, OCTOBER 2017 2045 Real-Time Enhancemen of Dynamic Deph Videos wih Non-Rigid Deformaions Kassem Al Ismaeil, Suden Member,
More informationA Fast Non-Uniform Knots Placement Method for B-Spline Fitting
2015 IEEE Inernaional Conference on Advanced Inelligen Mecharonics (AIM) July 7-11, 2015. Busan, Korea A Fas Non-Uniform Knos Placemen Mehod for B-Spline Fiing T. Tjahjowidodo, VT. Dung, and ML. Han Absrac
More informationSegmentation by Level Sets and Symmetry
Segmenaion by Level Ses and Symmery Tammy Riklin-Raviv Nahum Kiryai Nir Sochen Tel Aviv Universiy, Tel Aviv 69978, Israel ammy@eng.au.ac.il nk@eng.au.ac.il sochen@pos.au.ac.il Absrac Shape symmery is an
More informationDynamic Depth Recovery from Multiple Synchronized Video Streams 1
Dynamic Deph Recoery from Muliple ynchronized Video reams Hai ao, Harpree. awhney, and Rakesh Kumar Deparmen of Compuer Engineering arnoff Corporaion Uniersiy of California a ana Cruz Washingon Road ana
More informationA CONDITIONAL RANDOM FIELD MODEL FOR TRACKING IN DENSELY PACKED CELL STRUCTURES. Anirban Chakraborty, Amit Roy-Chowdhury
A CONDITIONAL RANDOM FIELD MODEL FOR TRACKING IN DENSELY PACKED CELL STRUCTURES Anirban Chakrabory, Ami Roy-Chowdhury Deparmen of Elecrical Engineering, Universiy of California, Riverside, USA ABSTRACT
More informationReal-Time Avatar Animation Steered by Live Body Motion
Real-Time Avaar Animaion Seered by Live Body Moion Oliver Schreer, Ralf Tanger, Peer Eiser, Peer Kauff, Bernhard Kaspar, and Roman Engler 3 Fraunhofer Insiue for Telecommunicaions/Heinrich-Herz-Insiu,
More informationImproving Ranking of Search Engines Results Based on Power Links
IPASJ Inernaional Journal of Informaion Technology (IIJIT) Web Sie: hp://www.ipasj.org/iijit/iijit.hm A Publisher for Research Moivaion... Email: edioriiji@ipasj.org Volume 2, Issue 9, Sepember 2014 ISSN
More informationHandling uncertainty in semantic information retrieval process
Handling uncerainy in semanic informaion rerieval process Chkiwa Mounira 1, Jedidi Anis 1 and Faiez Gargouri 1 1 Mulimedia, InfoRmaion sysems and Advanced Compuing Laboraory Sfax Universiy, Tunisia m.chkiwa@gmail.com,
More informationShortest Path Algorithms. Lecture I: Shortest Path Algorithms. Example. Graphs and Matrices. Setting: Dr Kieran T. Herley.
Shores Pah Algorihms Background Seing: Lecure I: Shores Pah Algorihms Dr Kieran T. Herle Deparmen of Compuer Science Universi College Cork Ocober 201 direced graph, real edge weighs Le he lengh of a pah
More informationA non-stationary uniform tension controlled interpolating 4-point scheme reproducing conics
A non-saionary uniform ension conrolled inerpolaing 4-poin scheme reproducing conics C. Beccari a, G. Casciola b, L. Romani b, a Deparmen of Pure and Applied Mahemaics, Universiy of Padova, Via G. Belzoni
More informationA Tool for Multi-Hour ATM Network Design considering Mixed Peer-to-Peer and Client-Server based Services
A Tool for Muli-Hour ATM Nework Design considering Mied Peer-o-Peer and Clien-Server based Services Conac Auhor Name: Luis Cardoso Company / Organizaion: Porugal Telecom Inovação Complee Mailing Address:
More information