Discrete-Continuous Depth Estimation from a Single Image

Size: px
Start display at page:

Download "Discrete-Continuous Depth Estimation from a Single Image"

Transcription

1 Dscrete-Contnuous Depth Estmaton from a Sngle Image Maomao Lu, Matheu Salzmann, Xumng He NICTA and CECS, ANU, Canberra {maomao.lu, matheu.salzmann, xumng.he}@ncta.com.au Abstract In ths paper, we tackle the problem of estmatng the depth of a scene from a sngle mage. Ths s a challengng task, snce a sngle mage on ts own does not provde any depth cue. To address ths, we explot the avalablty of a pool of mages for whch the depth s known. More specfcally, we formulate monocular depth estmaton as a dscrete-contnuous optmzaton problem, where the contnuous varables encode the depth of the superpxels n the nput mage, and the dscrete ones represent relatonshps between neghborng superpxels. The soluton to ths dscrete-contnuous optmzaton problem s then obtaned by performng nference n a graphcal model usng partcle belef propagaton. The unary potentals n ths graphcal model are computed by makng use of the mages wth known depth. We demonstrate the effectveness of our model n both the ndoor and outdoor scenaros. Our expermental evaluaton shows that our depth estmates are more accurate than exstng methods on standard datasets.. Introducton In ths paper, we address the problem of scene depth estmaton from a sngle mage. Estmatng the depth of a general scene from a monocular, statc vewpont s a very challengng task, snce no relable cues, such as stereo correspondences, or moton, can be exploted. In recent years, much progress has been made towards accurate 3D scene reconstructon from sngle mages. For nstance, smple geometrc assumptons (.e., box models) have proven effectve to estmate the layout of a room [9, 7, 27]. Smlarly, for outdoor scenes, the Manhattan, or blocks world, assumpton has been utlzed to perform 3D scene layout estmaton [7]. These box models, however, are lmted to represent smple structures, and are therefore ll-suted to obtan detaled 3D reconstructons. NICTA s funded by the Australan Government as represented by the Department of Broadband, Communcatons and the Dgtal Economy and the ARC through the ICT Centre of Excellence program. Fgure. Depth estmaton from a sngle mage: Input mages and depth maps estmated by our method. In contrast, several methods have been proposed to drectly estmate the depth of mage (super)pxels [24, 25]. In ths context, t was shown that explotng addtonal sources of nformaton, such as user annotatons [22], semantc labels [8], or the presence of repettve structures [30], could help mprovng reconstructon accuracy. Unfortunately, such addtonal nformaton s not avalable n general. Recently, nonparametrc approaches were therefore ntroduced to handle ths case [3, 5, 6]. Gven an nput mage, these approaches proceed by retrevng smlar mages n a pool of mages for whch the depth s known. The depths of the retreved canddates are then employed n conjuncton wth smoothness constrants to estmate a depth map. Whle ths has acheved some success, as suggested n [3] n the context of stereo, the gradent-aware smoothng strategy often poorly reflects the real 3D scene observed n the mage. In ths paper, we ntroduce a method that addresses ths ssue by modelng depth estmaton as a dscrete-contnuous optmzaton problem. In partcular, n addton to the standard contnuous varables that encode the depth of the superpxels n the nput mage, we make use of dscrete varables that allow us to model complex relatonshps between

2 neghborng superpxels. Depth estmaton can then be expressed as nference n a hgher-order, dscrete-contnuous graphcal model. More specfcally, gven an nput mage, we make use of a nonparametrc approach to retreve smlar mages n a dataset of mages wth known depth. We explot the depths of these mages to construct data terms for the contnuous varables n our model. Furthermore, we employ dscrete varables to encode the occluson relatonshps between neghborng superpxels. The nteractons of several dscrete varables can then be expressed wth juncton potentals, whch defne nvald confguratons. These dscrete occluson varables also let us defne smoothness constrants that better reflect realstc scenes. We make use of partcle belef propagaton [2, 20] to perform nference n the resultng hgher-order, dscrete-contnuous graphcal model. In contrast to most exstng methods whch typcally consder ether ndoor scenes, or outdoor ones, we demonstrate the effectveness of our model n these two scenaros. Our experments show the benefts of our dscrete-contnuous formulaton, whch yelds state-of-the-art accuraces on the NYU v2 ndoor scenes dataset [28] and on Make3D [25]. 2. Related Work Estmatng the depth of a scene from mages s one of the major goals of computer vson. Therefore, t has attracted a lot of attenton over the years. Here, we focus on the advances that have been made n the recent years. A classcal approach to reconstructng 3D scenes conssts n explotng vdeo. In ths context, 3D scene flow s one of the most popular and mature approaches to depth estmaton [3, 4, 29]. Smlarly, structure-from-moton [5] and SLAM [9] have now reached the stage where exstng systems can effcently handle huge amounts of mages. Therefore, they are now beng ntegrated nto 3D scene understandng methods that jontly detect, or segment objects whle performng 3D reconstructon [6, 8, 23]. Whle here we tackle the monocular, statc case, t was shown that the depth maps obtaned from sngle mages could also have a benefcal mpact on vdeo-based 3D reconstructon [3]. When t comes to sngle mages, 3D reconstructon methods have not yet attaned the same degree of maturty as vdeo-based technques. Nonetheless, much progress has been made n recent years. In partcular, for ndoor scenes, effectve technques have been proposed to estmate the layout of rooms. These methods typcally rely on box-shaped models, and try to ft the box edges to those observed n the mage [9, 7, 27]. The same smple geometrc pror, blocks world, was exploted n outdoor scenes [7]. In [0], a more accurate geometrc model was employed, but the results reman only a rough estmate of surface normals. The smple geometrc models descrbed above do not allow us to obtan a detaled 3D descrpton of the scene. In contrast, several methods have proposed to drectly estmate the depth of mage (super)pxels. Snce a sngle mage does typcally not provde enough nformaton to estmate depth, other sources of nformaton have been exploted. In partcular, n [22], depth was predcted from user annotatons. In [8], ths was acheved by makng use of semantc class labels. Alternatvely, the presence of repettve structures n the scene was also employed for 3D reconstructon [30]. Wth the recent popularty of depth sensors, sparse depths have also been used to estmate denser depth maps [2]. In ths work, however, we focus on the scenaro where no such sources of nformaton are avalable. In ths settng, supervsed learnng technques were the frst to provde realstc results by learnng the parameters of a Markov random feld [24, 25]. More recently, several nonparametrc approaches were ntroduced [3, 5, 6]. These methods explot the avalablty of a set of mages for whch the depths are known. Depth n the nput mage s then estmated by frst retrevng smlar mages n ths set, and optonally warpng ther depth usng SIFT flow. These (warped) depth maps are then utlzed n the objectve functon of a nonlnear optmzaton problem that encourages the resultng depth to be smooth. Our work s close n sprt to that of [3, 5, 6] n the sense that we also make use of a nonparametrc approach to retreve canddate depth maps. However, we avod the warpng process of [3, 5], whch s computatonally expensve and does not necessarly mprove the qualty of the canddates. More mportantly, we ntroduce the use of dscrete varables that allow us to model more complex relatonshps between neghborng superpxels, and formulate depth estmaton as nference n a dscrete-contnuous graphcal model. As evdenced by our results, ths formulaton s benefcal n terms of accuracy of the estmated depth, and proved effectve for both the ndoor and outdoor scenaros. 3. Dscrete-Contnuous Depth Estmaton We now descrbe our approach to depth estmaton from a sngle mage. To ths end, we frst derve the Condtonal Random Feld (CRF) that defnes our problem, and dscuss the nference method that we use. We then defne the dfferent potentals utlzed n our model. 3.. Dscrete-Contnuous CRF Our goal s to estmate the depth of the pxels observed n a sngle mage depctng a general scene. We formulate ths problem n terms of superpxels, makng the common assumpton that each superpxel s planar. The pose of a superpxel s then expressed n terms of the depth of ts centrod and ts plane normal. Furthermore, we make use of addtonal dscrete varables that encode the relatonshp of two neghborng superpxels. In partcular, here, we con-

3 sder 4 types of relatonshps encodng the fact that the two superpxels () belong to the same object; () belong to two dfferent but connected objects; () belong to two objects that form a left occluson; and (v) belong to two objects that form a rght occluson. Here, the noton of left and rght occlusons follows the formalsm of [] based on edge drectons. Gven these varables, we express depth estmaton as an nference problem n a dscrete-contnuous CRF. More specfcally, let Y = {y,..., y S } be the set of contnuous varables, where each y R 4 concatenates the centrod depth and plane normal of superpxel, and where S s the total number of superpxels n the nput mage. Furthermore, let E = {e p } p E be the set of dscrete varables, where each e p {so, co, lo, ro}, whch ndcates same object (so), connected but dfferent objects (co), left occluson (lo) and rght occluson (ro), respectvely. E s the set of pars of superpxels that share a common boundary. Gven these varables, we then form a CRF, where the jont dstrbuton over the random varables factorzes nto a product of non-negatve potentals. Ths jont dstrbuton can be wrtten as p(y, E) = Ψ (y ) Ψ α (y α, e α ) Ψ β (e β ), Z α β where Z s a normalzaton constant,.e., the partton functon, Ψ s a unary potental functon over the contnuous varables that defnes the data term for depth, and Ψ α and Ψ β are potentals over mxed varables and dscrete varables, respectvely, whch encode the smoothness and consstency between depth and edge types. Inference n the graphcal model s then performed by computng a MAP estmate. By workng wth negatve log potental functons, e.g., φ (y ) = ln (Ψ (y )), ths can be expressed as the optmzaton problem (Y, E ) () = argmn φ (y ) + φ α (y α, e α ) + φ β (e β ). Y,E α β The potentals that we use here are dscussed n Secton 3.2. To handle mxed dscrete and contnuous varables, we make use of partcle (convex) belef propagaton (PCBP) [20], whch lets us obtan an approxmate soluton to the optmzaton problem (). More specfcally, PCBP proceeds by teratvely solvng the followng steps:. Draw N s random samples y j, j N s around the prevous MAP soluton for each varable y. 2. Compute the (approxmate) MAP soluton of the dscrete CRF formed by the dscrete varables {e p } and by utlzng the random samples {y j } as dscrete states for the varables {y }. In practce, we draw samples for the plane normal of the superpxels accordng to a Fsher-Bngham dstrbuton, whch forces them to have unt norm. Samples for the depth of the centrod of each superpxel are drawn accordng to a Gaussan dstrbuton. At each teraton, we tghten the samplng around the prevous MAP soluton. The approxmate MAP of the dscrete CRF s obtaned by dstrbuted convex belef propagaton [26]. In ths work, we make use of a nonparametrc approach to obtan a reasonable ntalzaton for the algorthm. In partcular, we retreve the K mages most smlar to the nput mage from a set of mages for whch the depth s known. To ths end, we perform a nearest-neghbor search based on concatenated GIST, PHOG and Object Bank features and drectly make use the depth of the retreved mages,.e., n contrast to [3, 5], we do not warp the depth of the retreved mages. The retreved K depth maps then drectly act as states n the frst round of PCBP,.e., no random samples are used n ths round. In the next secton, we descrbe the specfc potentals used n the optmzaton problem () Depth and Occluson Potentals The objectve functon n () contans three dfferent types of potentals nvolvng, respectvely, contnuous varables only, dscrete and contnuous varables, and dscrete varables only. Below, we dscuss the functons used n these three dfferent types of potentals. Potentals for contnuous varables: The potentals nvolvng purely contnuous varables are unary potentals, and are of two dfferent knds. For the frst one, we explot the K canddates retreved by the magebased nearest-neghbor strategy mentoned n the prevous secton. The frst potental encodes the fact that the fnal depth should reman close to at least one canddate. To ths end, we make use of the squared depth dfference. More specfcally, assumng a calbrated camera, the depth d uj of pxel u j = (u j, v j ) n superpxel can be obtaned by ntersectng the vsual ray passng through u j wth the plane defned by y. Ths lets us wrte the potental φ c (y ) = K mn k= N p N p (d uj j= (y ) d uj k, )2, (2) where N p s the number of pxels n superpxel, and d uj k, denotes the depth of the k th canddate for superpxel at pxel u j. In practce, nstead of drectly usng the canddate depth, we ft a plane to the canddate superpxels and use the ntersecton of ths plane wth the vsual rays. Ths provdes some robustness to nose n the canddates. As a second unary potental for the contnuous varables, we also make use of the canddate depths, but n a less d-

4 rect manner. More specfcally, we tran 4 dfferent Gaussan Process (GP) regressors, each correspondng to one dmenson of the varable y. The nput to each regressor s composed of the correspondng measurement of the canddates for superpxel. We found these nputs to be more relable than mage features. For each GP, we used an RBF kernel wth wdth set to the medan squared dstance computed over all the tranng samples. For more detals on GP regresson, we refer the reader to [2]. Gven the regressed value y r for superpxel, we compute the depth duj r, at each pxel u j n the same manner as before, and wrte the potental φ r (y ) = w r N p N p (d uj j= (y ) d uj r, )2, (3) where w r s the weght of ths potental relatve to φ c (y ). In practce, we also use the regressed value y r as a state for superpxel n the frst round of PCBP where no samplng s performed. Potental for mxed varables: Our model also explots a potental that nvolves both contnuous and dscrete varables. In partcular, we defne a potental that encodes the compatblty of two superpxels that share a common boundary and the correspondng dscrete varable. Ths potental can be expressed as φ m,j(y, y j, e,j ) = w m g,j n n j 2 + N b,j N b,j N b,j (d um m= N b,j m= (dum φ o,j (y, y j, e,j ) (y ) d um j (y j )) 2 f e,j = so (y ) d um j (y j )) 2 f e,j = co otherwse, where w m s the weght of ths potental, n s the plane normal of superpxel,.e., 3 components of y, N,j b s the number of pxels shared along the boundary between superpxel and superpxel j, and g,j s a weght based on the mage gradent at the boundary between superpxel and j,.e., g,j = exp( µ,j /σ), wth µ,j the mean gradent along the boundary between the two superpxels. To handle the occluson cases, the functon φ o,j (y, y j, e,j ) assgns a cost 0 f the two superpxels are n a confguraton that agrees wth the state of e,j,.e., left occluson or rght occluson, and a cost θ max otherwse. Whle ths potental depends on three varables, t remans fast to compute, snce e,j can only take four states. Potentals for dscrete varables: Fnally, we use two dfferent potentals that only nvolve dscrete varables. The frst one s a unary potental that makes use of a classfer traned to dscrmnate between occluson (.e., lo ro) and non-occluson (.e., so co) cases. To ths end, we utlze the mage-based occluson cues ntroduced n [] and employ a bnary boosted decson tree classfer. Gven the predcton of the classfer ê p, our potental functon takes the form { φ t θt f e p(e p ) = p agrees wth ê p (4) θ t otherwse, where θ t s a parameter of our model. Note that dstngushng between all four types of edge varables proved too unrelable, whch motvated our decson to only consder occluson vs. non-occluson. The second purely dscrete potental s smlar to the juncton feasblty potental used n [3] for stereo. More specfcally, t encodes nformaton about whether the juncton between three edge varables s physcally possble, or not. Therefore, ths potental takes the form { φ t θmax f mpossble case p,q,r(e p, e q, e r ) = (5) 0 otherwse. Here, we employed the same mpossble cases as n [3] for our 4 states, assumng that co typcally form a hnge, whle so are mostly coplanar. Note that, here, we only consdered junctons of three superpxels, snce junctons of four occur very rarely. However, 4-junctons could easly be ntroduced n our framework. 4. Expermental Evaluaton We now present our expermental results on depth estmaton n outdoor and ndoor scenes. In partcular, we evaluated our method on two publcly avalable datasets: the Make3D range mage dataset [25] and the NYU v2 Knect dataset [28]. For both datasets, we compare our results wth those of the depth transfer method of [3], whch represents the current state-of-the-art for depth estmaton from a sngle mage. In addton to the baselne [3], we also evaluate the results of our unary terms only and of our GP depth regressors on ther own, as well as the results of our model wthout dscrete varables and wth the same parwse term as the e,j = so case and of the frst approxmate MAP n our model obtaned before samplng partcles n PCBP. For our quanttatve evaluaton, we report errors obtaned wth the three followng commonly-used metrcs: g average relatve error (rel): u d u N u g u, average log 0 error: N u log 0g u log 0 d u, root mean squared error (rms): N u (g u d u ) 2, where g u s the ground-truth depth at pxel u, d u s the correspondng estmated depth, and N denotes the total number of pxels n all the mages.

5 Our method Depth transfer [3] Ground-truth Image Fgure 2. Qualtatve comparson of the depths estmated wth depth transfer [3] and wth our method on the Make3D dataset. Color ndcates depth (red s far, blue s close). Method rel log 0 rms Depth transfer [3] C C Our method C C Table. Depth reconstructon errors on the Make3D dataset for depth transfer [3] and for our method evaluated on two crtera (C and C 2, see text for detals.) In both experments, we used SLIC [] to compute the superpxels. For each test mage, we retreved K = 7 canddates from the tranng mages. The parameters of our model were selected usng a small valdaton set of 0 mages from the NYU v2 dataset and kept the same n both experments. The specfc values were w r =, w m = 0, Method rel log 0 rms Unary GP regresson No dscrete varables No samplng Full model Table 2. Make3D: Comparson of our fnal results wth those obtaned wth unary terms only, wth our GP depth regressors only, usng a model wthout dscrete edge type varables, and after the frst round of PCBP where no samplng s nvolved. θ t = 0, and θ max = 20. Note that these parameters could, n prncple be learned. However, our approach proved robust enough for ths to be unnecessary. We performed two teratons of PCBP wth N s = 20 partcles at each teraton.

6 Ground-truth Regresson No samplng Samplng round Fnal depth Fgure 3. Make3D: Depth maps at dfferent stages of our approach. Method rel log 0 rms Depth transfer [3] Our method Table 3. Depth reconstructon errors on the NYU v2 dataset for depth transfer [3] and for our method usng the tranng/test partton provded wth the dataset. Method rel log 0 rms Depth fuson (no warp) [6] Depth fuson [5] Depth transfer Our method Table 4. Comparson of the depth estmaton errors on the NYU v2 dataset usng a leave-one-out strategy. 4.. Outdoor Scene Reconstructon: Make3D The Make3D dataset contans 534 mages wth correspondng depth maps, parttoned nto 400 tranng mages and 34 test mages. All the mages were reszed to pxels n order to preserve the aspect rato of the orgnal mages. Snce the true focal length of the camera s unknown, we assume a reasonable value of 500 for the reszed mages. Due to the lmted range and resoluton of the sensor used to collect the ground-truth, far away pxels, were arbtrarly set to depth 80 n the orgnal dataset. To take ths, as well as the effect of nterpolaton when reszng the mages, nto account n our evaluaton, we report errors based on two dfferent crtera: (C ) Errors are computed n the regons wth ground-truth depth less than 70; (C 2 ) Errors are computed n the entre mage. In ths second scenaro, to reduce the effect of meanngless canddates n sky regons, we used a classfer to label sky pxels and for the depth of the correspondng superpxels to take the value (0, 0,, 80). Note that the same two crtera (C and C 2 ) were used to evaluate the baselne. In Table, we compare the results of our approach wth those obtaned by depth transfer [3]. Note that, usng crtera C, we outperform the baselne n terms of relatve error and perform slghtly worse for the other metrcs. Usng crtera C 2, we outperform the baselne for all metrcs. Method rel log 0 rms Unary GP regresson No dscrete varables No samplng Full model Table 5. NYU v2: Comparson of our fnal results wth those obtaned wth unary terms only, wth our GP depth regressors only, usng a model wthout dscrete edge type varables, and after the frst round of PCBP where no samplng s nvolved. Fg. 2 provdes a qualtatve comparson of our depth maps wth those estmated by depth transfer [3] for some mages of the dataset. Note that depth transfer tends to oversmooth the depth maps and, e.g., merge foreground objects wth the background. Thanks to our dscrete varables, our approach better respects the dscontnutes n the scene. In Table 2, we show the results obtaned wth some of the parts of our model. Note that, even though the samplng n PCBP does not seem to have a great mpact on the errors, t helps smoothng the depth maps and thus makes them look more realstc. Ths s evdenced by Fg. 3, where we show the depth maps at dfferent stages of our approach. Note that the nfluence of each stage s more easly seen wth NYU v2 (see Fg. 5) for whch the overall depth range s smaller Indoor Scene Reconstructon: NYU v2 The NYU v2 dataset contans 449 mages, parttoned nto 795 tranng mages and 654 test mages. All the mages were reszed to pxels, whle smultaneously respectng the masks provded wth the dataset. In ths case, the ntrnsc camera parameters are gven wth the dataset. We evaluated the depth transfer code provded by [3] to obtan baselne results on the tranng/test provded wth the dataset and compare these results wth those obtaned wth our approach n Table 3. To be able to compare our results wth those reported n [4], we also appled our method n a leave-one-out manner on the full dataset. The results are reported n Table 4. Note that, n both cases, we outperform the baselnes for all metrcs. These error metrcs were computed over the vald pxels (non-zero depth) n the ground-

7 Our method Depth transfer [3] Ground-truth Image Fgure 4. Qualtatve comparson of the depths estmated wth depth transfer [3] and wth our method on the NYU v2 dataset. Ground-truth Regresson No samplng Samplng round Fnal depth Fgure 5. NYU v2: Depth maps at dfferent stages of our approach. truth depth maps. In Fg. 4, we provde a qualtatve comparson of our results wth those of [3] for some mages. Note that the oversmoothng of the depth maps generated by depth transfer s even more obvous n the short depth range scenaro. In contrast, our approach stll yelds a realstc representaton of the scene. In Table 5, we show the nfluence of the dfferent parts of our model. Note that all the components contrbute to our fnal results. Fg. 5 depcts the depth maps at dfferent stages of our approach. Whle samplng smoothes the depth map, t stll respects the mage dscontnutes. In addton to the estmated depth, our model can also predct the boundary type of the superpxel edges. In partcular, the occluson boundares are useful cues for spatal reasonng. We qualtatvely evaluate the occluson boundary predcton by showng typcal results n Fg. 6 for both ndoor and outdoor scenaros. Note that our model captures most of the occluson edges. 5. Concluson In ths paper, we have presented an approach to estmatng the depth of a scene from a sngle mage. To ths end, we have employed contnuous varables to represent the depth of mage superpxels, and dscrete ones to encode relatonshps between neghborng superpxels. As a result, we have formulated depth estmaton as nference n a hgher-order, dscrete-contnuous graphcal model, whch we have performed usng partcle belef propagaton. Our experments

8 Fgure 6. Estmated boundary occluson map. The top row shows the nput mage and the bottom row shows the estmated boundary occluson map. The superpxel boundares are drawn n blue. Pxels n magenta denote the estmated occluson boundares. have shown that ths model let us effectvely reconstruct general scenes from stll mages n both the ndoor and outdoor scenaros. In the future, we ntend to study how ths model can be exploted n 3D scene understandng by, e.g., jontly performng semantc labelng and depth estmaton. References [] R. Achanta, A. Shaj, K. Smth, A. Lucch, P. Fua, and S. Suesstrunk. Slc superpxels compared to state-of-the-art superpxel methods. PAMI, [2] O. M. Aodha, N. D. Campbell, A. Nar, and G. Brostow. Patch based synthess for sngle depth mage superresoluton. In ECCV, [3] N. Brkbeck, D. Cobzas, and M. Jaegersand. Depth and scene flow from a sngle movng camera. In 3DPVT, [4] N. Brkbeck, D. Cobzas, and M. Jaegersand. Bass constraned 3d scene flow on a dynamc proxy. In ICCV, [5] D. Crandall, A. Owens, N. Snavely, and D. Huttenlocher. Dscrete-contnuous optmzaton for large-scale structure from moton. In CVPR, [6] A. Geger, C. Wojek, and R. Urtasun. Jont 3d estmaton of objects and scene layout. In NIPS, [7] A. Gupta, A. Efros, and M. Hebert. Blocks world revsted: Image understandng usng qualtatve geometry and mechancs. In ECCV, 200., 2 [8] C. Haene, C. Zach, A. Cohen, R. Angst, and M. Pollefeys. Jont 3d scene reconstructon and class segmentaton. In CVPR, [9] V. Hedau, D. Hoem, and D. Forsyth. Thnkng nsde the box: Usng appearance models and context based on room geometry. In ECCV, 200., 2 [0] D. Hoem, A. Efros, and M. Hebert. Geometrc context from a sngle mage. ICCV, [] D. Hoem, A. Sten, A. Efros, and M. Hebert. Recoverng occluson boundares from a sngle mage. In ICCV, , 4 [2] A. Ihler and D. McAllester. Partcle belef propagaton. In AISTATS, [3] K. Karsch, C. Lu, and S. B. Kang. Depth extracton from vdeo usng non-parametrc samplng. In ECCV, 202., 2, 3, 4, 5, 6, 7 [4] K. Karsch, C. Lu, and S. B. Kang. Depthtransfer: Depth extracton from vdeo usng non-parametrc samplng. PAMI, [5] J. Konrad, G.Brown, M. Wang, P. Ishwar, C. Wu, and D. Mukherjee. Automatc 2d-to-3d mage converson usng 3d examples from the nternet. In SPIE Stereoscopc Dsplays and Applcatons, 202., 2, 3, 6 [6] J. Konrad, M. Wang, and P. Ishwar. 2d-to-3d mage converson by learnng depth from examples. In 3DCINE, 202., 2, 6 [7] D. C. Lee, A. Gupta, M. Hebert, and T. Kanade. Estmatng spatal layout of rooms usng volumetrc reasonng about object and surfaces. In NIPS, 200., 2 [8] B. Lu, S. Gould, and D. Koller. Sngle mage septh estmaton from predcted semantc labels. In CVPR, 200., 2 [9] R. A. Newcombe, S. Lovegrove, and A. J. Davson. Dtam: Dense trackng and mappng n real-tme. In ICCV, [20] J. Peng, T. Hazan, D. McAllester, and R. Urtasun. Convex max-product algorthms for contnuous mrfs wth applcatons to proten foldng. In ICML, 20. 2, 3 [2] C. E. Rasmussen and C. K. Wllams. Gaussan Process for Machne Learnng. MIT Press, [22] B. Russell and A. Torralba. Buldng a database of 3d scenes from user annotatons. In CVPR, 2009., 2 [23] R. F. Salas-Moreno, R. A. Newcombe, H. Strasdat, P. H. J. Kelly, and A. J. Davson. Slam++: Smultaneous localsaton and mappng at the level of objects. In CVPR, [24] A. Saxena, S. H. Chung, and A. Y. Ng. 3-d depth reconstructon from a sngle stll mage. IJCV, 2007., 2 [25] A. Saxena, M. Sun, and A. Y. Ng. Make3d: Learnng 3d scene structure from a sngle stll mage. PAMI, 2009., 2, 4 [26] A. G. Schwng, T. Hazan, M. Pollefeys, and R. Urtasun. Dstrbuted message passng for large scale graphcal models. In CVPR, [27] A. G. Schwng and R. Urtasun. Effcent exact nference for 3d ndoor scene understandng. In ECCV, 202., 2 [28] N. Slberman, D. Hoem, P. Kohl, and R. Fergus. Indoor segmentaton and support nference from rgbd mages. In ECCV, , 4 [29] C. Vogel, K. Schndler, and S. Roth. 3d scene flow estmaton wth a rgd moton pror. In ICCV, [30] C. Wu, J. M. Frahm, and M. Pollefeys. Repetton-based dense sngle-vew reconstructon. In CVPR, 20., 2 [3] K. Yamaguch, T. Hazan, D. McAllester, and R. Urtasun. Contnuous markov random felds for robust stereo estmaton. In ECCV, 202., 4

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

arxiv: v1 [cs.cv] 4 Jul 2016

arxiv: v1 [cs.cv] 4 Jul 2016 JUN et al.: LEARNING FINE-SCALED DEPTH MAPS FROM SINGLE RGB IMAGES 1 arxv:1607.00730v1 [cs.cv] 4 Jul 2016 Learnng Fne-Scaled Depth Maps from Sngle RGB Images Jun L http://cg.cs.un-bonn.de/ Renhard Klen

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Efficient Inference for Fully-Connected CRFs with Stationarity

Efficient Inference for Fully-Connected CRFs with Stationarity Effcent Inference for Fully-onnected RFs wth Statonarty Ymeng Zhang Tsuhan hen School of Electrcal and omputer Engneerng, ornell Unversty {yz457,tsuhan}@cornell.edu Abstract The ondtonal Random Feld (RF)

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation GeoNet: Geometrc Neural Network for Jont Depth and Surface Normal Estmaton Xaojuan Q Renje Lao, Zhengzhe Lu Raquel Urtasun, Jaya Ja, The Chnese Unversty of Hong Kong Unversty of Toronto Uber Advanced Technologes

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Multiple Frame Motion Inference Using Belief Propagation

Multiple Frame Motion Inference Using Belief Propagation Multple Frame Moton Inference Usng Belef Propagaton Jang Gao Janbo Sh The Robotcs Insttute Department of Computer and Informaton Scence Carnege Mellon Unversty Unversty of Pennsylvana Pttsburgh, PA 53

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Adaptive Transfer Learning

Adaptive Transfer Learning Adaptve Transfer Learnng Bn Cao, Snno Jaln Pan, Yu Zhang, Dt-Yan Yeung, Qang Yang Hong Kong Unversty of Scence and Technology Clear Water Bay, Kowloon, Hong Kong {caobn,snnopan,zhangyu,dyyeung,qyang}@cse.ust.hk

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION?

S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION? S.P.H. : A SOLUTION TO AVOID USING EROSION CRITERION? Célne GALLET ENSICA 1 place Emle Bloun 31056 TOULOUSE CEDEX e-mal :cgallet@ensca.fr Jean Luc LACOME DYNALIS Immeuble AEROPOLE - Bat 1 5, Avenue Albert

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

Learning to Co-Generate Object Proposals with a Deep Structured Network

Learning to Co-Generate Object Proposals with a Deep Structured Network Learnng to Co-enerate Object Proposals wth a Deep Structured Network Zeeshan Hayder,2, Xumng He 2, Australan Natonal Unversty & 2 NICTA {zeeshan.hayder, xumng.he}@anu.edu.au atheu Salzmann,3 3 CVLab, EPFL,

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

MOTION BLUR ESTIMATION AT CORNERS

MOTION BLUR ESTIMATION AT CORNERS Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Fusion Performance Model for Distributed Tracking and Classification

Fusion Performance Model for Distributed Tracking and Classification Fuson Performance Model for Dstrbuted rackng and Classfcaton K.C. Chang and Yng Song Dept. of SEOR, School of I&E George Mason Unversty FAIRFAX, VA kchang@gmu.edu Martn Lggns Verdan Systems Dvson, Inc.

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

A Graphical Model Framework for Coupling MRFs and Deformable Models

A Graphical Model Framework for Coupling MRFs and Deformable Models A Graphcal Model Framework for Couplng MRFs and Deformable Models Ru Huang, Vladmr Pavlovc, and Dmtrs N. Metaxas Dvson of Computer and Informaton Scences, Rutgers Unversty {ruhuang, vladmr, dnm}@cs.rutgers.edu

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Active 3D scene segmentation and detection of unknown objects

Active 3D scene segmentation and detection of unknown objects Actve 3D scene segmentaton and detecton of unknown objects Mårten Björkman and Danca Kragc Abstract We present an actve vson system for segmentaton of vsual scenes based on ntegraton of several cues. The

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm

Resolving Ambiguity in Depth Extraction for Motion Capture using Genetic Algorithm Resolvng Ambguty n Depth Extracton for Moton Capture usng Genetc Algorthm Yn Yee Wa, Ch Kn Chow, Tong Lee Computer Vson and Image Processng Laboratory Dept. of Electronc Engneerng The Chnese Unversty of

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

PRÉSENTATIONS DE PROJETS

PRÉSENTATIONS DE PROJETS PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents Supplementary Materal DESIRE: Dstant Future Predcton n Dynamc Scenes wth Interactng Agents Namhoon Lee 1, Wongun Cho 2, Paul Vernaza 2, Chrstopher B. Choy 3, Phlp H. S. Torr 1, Manmohan Chandraker 2,4

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

Relevance Assignment and Fusion of Multiple Learning Methods Applied to Remote Sensing Image Analysis

Relevance Assignment and Fusion of Multiple Learning Methods Applied to Remote Sensing Image Analysis Assgnment and Fuson of Multple Learnng Methods Appled to Remote Sensng Image Analyss Peter Bajcsy, We-Wen Feng and Praveen Kumar Natonal Center for Supercomputng Applcaton (NCSA), Unversty of Illnos at

More information

SVM-based Learning for Multiple Model Estimation

SVM-based Learning for Multiple Model Estimation SVM-based Learnng for Multple Model Estmaton Vladmr Cherkassky and Yunqan Ma Department of Electrcal and Computer Engneerng Unversty of Mnnesota Mnneapols, MN 55455 {cherkass,myq}@ece.umn.edu Abstract:

More information

Real-Time Coarse-to-fine Topologically Preserving Segmentation

Real-Time Coarse-to-fine Topologically Preserving Segmentation Real-Tme Coarse-to-fne Topologcally Preservng Segmentaton Jan Yao, Marko Boben, Sanja Fdler, Raquel Urtasun Unversty of Toronto, Unversty of Ljubljana Abstract In ths paper, we tackle the problem of unsupervsed

More information

Long-Term Moving Object Segmentation and Tracking Using Spatio-Temporal Consistency

Long-Term Moving Object Segmentation and Tracking Using Spatio-Temporal Consistency Long-Term Movng Obect Segmentaton Trackng Usng Spato-Temporal Consstency D Zhong Shh-Fu Chang {dzhong, sfchang}@ee.columba.edu Department of Electrcal Engneerng, Columba Unversty, NY, USA Abstract The

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction Encodng Crowd Interacton wth Deep Neural Network for Pedestran Trajectory Predcton Yanyu Xu ShanghaTech Unversty xuyy2@shanghatech.edu.cn Zhxn Pao ShanghaTech Unversty paozhx@shanghatech.edu.cn Shenghua

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

Scale Selective Extended Local Binary Pattern For Texture Classification

Scale Selective Extended Local Binary Pattern For Texture Classification Scale Selectve Extended Local Bnary Pattern For Texture Classfcaton Yutng Hu, Zhlng Long, and Ghassan AlRegb Multmeda & Sensors Lab (MSL) Georga Insttute of Technology 03/09/017 Outlne Texture Representaton

More information

Intelligent Information Acquisition for Improved Clustering

Intelligent Information Acquisition for Improved Clustering Intellgent Informaton Acquston for Improved Clusterng Duy Vu Unversty of Texas at Austn duyvu@cs.utexas.edu Mkhal Blenko Mcrosoft Research mblenko@mcrosoft.com Prem Melvlle IBM T.J. Watson Research Center

More information

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping.

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping. SIGGRAPH 004 Interactve Image Cutout Lazy Snappng Yn L Jan Sun Ch-Keung Tang Heung-Yeung Shum Mcrosoft Research Asa Hong Kong Unversty Separate an object from ts background Compose the object on another

More information

Joint Example-based Depth Map Super-Resolution

Joint Example-based Depth Map Super-Resolution Jont Example-based Depth Map Super-Resoluton Yanje L 1, Tanfan Xue,3, Lfeng Sun 1, Janzhuang Lu,3,4 1 Informaton Scence and Technology Department, Tsnghua Unversty, Bejng, Chna Department of Informaton

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,

Vanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay, Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

Lecture #15 Lecture Notes

Lecture #15 Lecture Notes Lecture #15 Lecture Notes The ocean water column s very much a 3-D spatal entt and we need to represent that structure n an economcal way to deal wth t n calculatons. We wll dscuss one way to do so, emprcal

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS Po-Lun La and Alper Ylmaz Photogrammetrc Computer Vson Lab Oho State Unversty, Columbus, Oho, USA -la.138@osu.edu,

More information

A Semi-parametric Regression Model to Estimate Variability of NO 2

A Semi-parametric Regression Model to Estimate Variability of NO 2 Envronment and Polluton; Vol. 2, No. 1; 2013 ISSN 1927-0909 E-ISSN 1927-0917 Publshed by Canadan Center of Scence and Educaton A Sem-parametrc Regresson Model to Estmate Varablty of NO 2 Meczysław Szyszkowcz

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

(1) The control processes are too complex to analyze by conventional quantitative techniques.

(1) The control processes are too complex to analyze by conventional quantitative techniques. Chapter 0 Fuzzy Control and Fuzzy Expert Systems The fuzzy logc controller (FLC) s ntroduced n ths chapter. After ntroducng the archtecture of the FLC, we study ts components step by step and suggest a

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

Three supervised learning methods on pen digits character recognition dataset

Three supervised learning methods on pen digits character recognition dataset Three supervsed learnng methods on pen dgts character recognton dataset Chrs Flezach Department of Computer Scence and Engneerng Unversty of Calforna, San Dego San Dego, CA 92093 cflezac@cs.ucsd.edu Satoru

More information

Learning physical Models of Robots

Learning physical Models of Robots Learnng physcal Models of Robots Jochen Mück Technsche Unverstät Darmstadt jochen.mueck@googlemal.com Abstract In robotcs good physcal models are needed to provde approprate moton control for dfferent

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Review of approximation techniques

Review of approximation techniques CHAPTER 2 Revew of appromaton technques 2. Introducton Optmzaton problems n engneerng desgn are characterzed by the followng assocated features: the objectve functon and constrants are mplct functons evaluated

More information

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features

A Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

Efficient Video Coding with R-D Constrained Quadtree Segmentation

Efficient Video Coding with R-D Constrained Quadtree Segmentation Publshed on Pcture Codng Symposum 1999, March 1999 Effcent Vdeo Codng wth R-D Constraned Quadtree Segmentaton Cha-Wen Ln Computer and Communcaton Research Labs Industral Technology Research Insttute Hsnchu,

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Title: A Novel Protocol for Accuracy Assessment in Classification of Very High Resolution Images

Title: A Novel Protocol for Accuracy Assessment in Classification of Very High Resolution Images 2009 IEEE. Personal use of ths materal s permtted. Permsson from IEEE must be obtaned for all other uses, n any current or future meda, ncludng reprntng/republshng ths materal for advertsng or promotonal

More information