3D Scene Mesh From CNN Depth Predictions And Sparse Monocular SLAM

Size: px
Start display at page:

Download "3D Scene Mesh From CNN Depth Predictions And Sparse Monocular SLAM"

Transcription

1 3D Scene Mesh From CNN Depth Predctons And Sparse Monocular SLAM Tomoyuk Mukasa Ju Xu Bjorn Stenger Rakuten Insttute of Technology, Tokyo, Japan Abstract In ths paper, we propose a novel framework for ntegratng geometrcal measurements of monocular vsual smultaneous localzaton and mappng (SLAM) and depth predcton usng a convolutonal neural network (CNN). In our framework, SLAM-measured sparse features and CNNpredcted dense depth maps are fused to obtan a more accurate dense 3D reconstructon ncludng scale. We contnuously update an ntal 3D mesh by ntegratng accurately tracked sparse features ponts. Compared to pror work on ntegratng SLAM and CNN estmates [26], there are two man dfferences: Usng a 3D mesh representaton allows as-rgd-as-possble update transformatons. We further propose a system archtecture sutable for moble devces, where feature trackng and CNN-based depth predcton modules are separated, and only the former s run on the devce. We evaluate the framework by comparng the 3D reconstructon result wth 3D measurements obtaned usng an RGBD sensor, showng a reducton n the mean resdual error of 38% compared to CNN-based depth map predcton alone. 1. Introducton Computer vson has long been successfully employed to track camera moton and reconstruct 3D structure from mage sequences. These methods have been appled e.g. n the vsual effects ndustry [6], the robot vson communty [4], or varous types of 3D reconstructon from a large scale [1] to a small scale on a moble devce [14]. In the past decade, moble applcatons for augmented realty (AR) and mxed realty (MR) have become ubqutous. Snce most current moble devces come wth a sngle (back-facng) camera, these applcatons rely on monocular vsual SLAM to recover camera pose [5, 20]. Vsual SLAM estmates depth from small-baselne stereo matchng over pars of nearby frames. Ths assumes that the camera translates n space over tme, so that pars of consecutve frames are equvalent Fgure 1. 3D Scene Mesh. Result of our method, reconstructed from a CNN-predcted mesh, deformed usng 3D ponts obtaned by SLAM, whch are ndcated as lght green dots. to the pars of frames captured usng a stereo rg. Tradtonally, vsual SLAM has reled on 2D feature matchng due to ts effcency and robustness n scenes wth suffcent texture [5, 20]. Recent methods that employ contour nformaton or the whole mage have been shown to add robustness, but come wth hgher computatonal cost [9, 11, 22]. A known lmtaton of monocular SLAM s that t cannot estmate the scale of the scene. Ths can be estmated by ncludng addtonal nformaton, such as data from an nertal measurement unt (IMU) [19], or pror knowledge about object szes [29]. Another shortcomng s that monocular vsual SLAM s ll-condtoned for certan camera motons lke rotaton wthout translaton. Recently, neural networks have been shown to provde good predctons of geometry,.e. depth and normals, from a gven nput mage [7, 8, 15]. An end-to-end traned CNN s able to predct geometry densely, even for less textured areas. Unlke the geometry from monocular SLAM, the depth 1921

2 Image capturng & Vsualzaton thread CLIENT-SIDE Key-frame t t+1 t+2 t+3 t+4 Monocular vsual SLAM 3D reconstructon 2D trackng thread 3D Mappng thread Depth predcton thread Depth predcton by CNN Depth fuson by surface mesh deformaton Depth fuson thread SERVER-SIDE Fgure 2. System Overvew. Monocular SLAM s run on the moble devce. Images and 3D ponts are sent to the server sde, where CNN depth predcton and surface mesh deformaton s carred out and s sent back to the clent for vsualzaton. map ncludes an absolute scale, as learned from the tranng examples. One drawback of current methods s that occludng boundary regons tend to be overly smooth and shape detals are lost. Recent work on combnng 3D SLAM measurements and depth predctons from a CNN has shown that both sources can complement each other [26], see Table 1. In ths paper we propose a framework to fuse monocular SLAM wth CNN-based depth predctons n a new way: Feature pont-based ORB-SLAM s run on the moble devce, yeldng sparse but accurate 3D ponts. Depth maps are asynchronously predcted on a server and converted to a mesh representaton. The mesh s then deformed, n an as-rgdas-possble manner, usng the sparse, but accurate feature ponts, and the updated mesh sent to the devce. Ths approach corrects both coarse global geometrc errors and rentroduces some shape detals, see Fgure 1. We evaluate the method on challengng offce scenes, comparng the result wth depth-sensor ground truth. 2. Related work We revew three areas of related work, monocular vsual SLAM, CNN-based depth predcton, and surface mesh deformaton. Monocular vsual SLAM can be classfed nto two categores [30], feature-based [13, 20] and drect approaches [9, 22]. One of the state-of-the-art methods n the featurebased category s ORB-SLAM[20]. The method extracts sparse ORB features and reconstructs them n 3D usng bundle adjustment and pose graph optmzaton. In contrast, drect methods carry out pose estmaton usng all mage pxels. The frst real-tme method n ths category was Dense Trackng and Mappng (DTAM) [22]. Snce processng every pxel s computatonally more expensve, DTAM acheved real-tme performance usng a GPU. Engel et al. [9] proposed Large-Scale Drect Monocular SLAM (LSD- SLAM) whch runs n real-tme on a CPU. The method estmates depth at pxels near mage boundares and recovers a sem-dense map. Apart from hgher computatonal complexty, drect matchng tends to work better for short baselnes even wth moton blur, whle the nvarance property of feature-based approaches allows large vewpont changes. Engel et al. [10] proposed Sem-dense Vsual Odometry (SVO), a hybrd between feature-based and drect SLAM methods, usng a combnaton of drect methods to establsh feature correspondences and feature-based methods to refne the camera pose estmates. Depth predcton from sngle mages has been a longstandng research problem, and deep learnng methods have been shown to exceed methods usng hand-crafted features n terms of the accuracy [7, 8, 15, 17, 18, 27]. Recent methods combne CNN-based depth predctons wth vsual SLAM. Lana et al. [15] proposed a fully convolutonal archtecture and resdual learnng to predct depth maps from mages. In ther evaluaton, the predcted depth maps were nput to Kellers Pont-Based Fuson RGB-D SLAM algorthm [12]. The estmated 3D geometry lacks some shape detal because of blurred regons n the predcted depth maps. Recently, Tateno et al. [26] proposed CNN-SLAM n whch predcted depth and normal maps are fused wth drect monocular SLAM nspred by LSD-SLAM. Depth fuson s an mportant process for reconstructng accurate and complete 3D shape from depth maps. Cur- 922

3 Method 3D Reconstructon Computatonal complexty Accuracy Scale Monocular vsual SLAM (feature based) CNN-based depth predcton Sparse (scene complexty dependent) Dense (estmated for each pxel) Proposed framework Dense (estmated for each pxel) Low (runs on moble devce) Hgh None Hgh (a few seconds for each frame) Hgh (but only vsual SLAM runs on moble devce) Medum (tranng-data dependent) Hgh Avalable Avalable Table 1. Propertes of ndvdual reconstructon methods and of ther combnaton, whch retans desrable propertes of each. less et al. proposed to use averagng truncated sgned dstance functons (TSDF) for depth suson [3] whch s smple yet effectve and used n a large number of reconstructon ppelnes ncludng KnectFuson [21]. Mesh deformaton technques are wdely used n graphcs and vson. Especally, lnear varatonal mesh deformaton technques were developed for edtng detaled hghresoluton meshes, lke those produced by scannng realworld objects [2]. For local detal preservaton mesh deformatons that are locally as-rgd-as-possble (ARAP) have been proposed. The ARAP method by Sorkne et al. [25] optmzes rgd transformatons n 1-rng neghborhoods ( cells ), mantanng consstency between adjacent pars of rgd transformatons by sngle overlappng edges. Lev et al. [16] ntroduced SR-ARAP energy formulaton n whch rotaton of local neghborhood on mesh are constraned to be smlar to neghbors, and enhance the smoothness of the ARAP method. In ths work we convert depth maps to surface meshes and employ ARAP deformatons usng geometrc constrants. 3. Depth fuson by geometrc constrants We desgned our framework consstng of three parts, monocular vsual SLAM, CNN-based depth predcton, and surface mesh deformaton for fusng depth maps constraned by geometrc constrants generated by the SLAM process. Fgure 2 shows the ppelne of our framework. For the mplementaton we use a clent-server desgn, where feature-based monocular SLAM runs on the moble devce and dstnctve key-frames together wth camera poses and 2D and 3D feature coordnates are sent to the server. On the server sde, a depth map s predcted for each key frame and converted to a surface mesh. Fnally, the surface meshes are deformed usng 3D features as geometrc constrants, and fused to a 3D reconstructon. Updates of the 3D reconstructon are returned to the clent and where t s vsualzed for the current camera poston. In the followng subsectons, we wll detal each stage of the framework, SLAM process n Secton3.1, depth predcton n Secton 3.2, and surface mesh deformaton n Monocular vsual SLAM Although our framework s compatble wth any type of feature-based monocular vsual SLAM methods, we employ ORB-SLAM [20] because of ts robustness and accuracy. ORB-SLAM ncorporates three parallel threads: trackng, mappng and loop closng. The trackng s n charge of localzng the camera n every frame and decdng when to nsert a new key-frame. The mappng processes new key-frames and performs local bundle adjustment for reconstructon. The loop closng searches for loops wth every new key-frame. Each key-framek t s assocated wth camera poset kt at tme t, locatons of ORB features p 2D (t) and correspondng 3D map ponts p 3D (t). Note that T kt and p 3D (t) are defned n the map coordnates, whch lacks absolute scale CNN based depth predcton For depth predcton, we use the state-of-the-art archtecture proposed n [7]. When a new key-frame s created and sent to the server-sde, a depth map s predcted by the CNN. The CNN of [7] s a three-step mult-scale network that predcts the structure of the scene takng context nto account by ncludng poolng and convoluton layers wth dfferent strde and kernel szes. The network s traned usng an element-wse L2 loss that explctly accounts for depth relatons between pxel locatons, n addton to the pont-wse error. The loss s defned as: ( ) 2 L depth (D,D ) = 1 d 2 1 n 2n 2 d + 1 [( x d 2 n )+( y d 2 )], where D and D are predcted depth and ground truth depth, respectvely, the loss equalsd D. After computng the depth mapd t of key-framek t, we convert t to a pont cloud n whch ponts correspond to pxels n the map, and a surface meshst cam defned n camera coordnates at tme t. Surface mesh St cam s fused wth other meshes to form a unque 3D reconstructon n the next step. In ths deformaton process, the mesh needs to be wa- (1) 923

4 camera cotangent weght: w j = 1 2 (cotα j +cotβ j ), (4) deformed mesh map ponts! " #$ mesh where α j, β j are the angles opposte of the mesh edge (,j). We defne the energy functon of the whole mesh by summng over the devatons from rgdty per local neghborhood as follows: E(S t) = v S t w E(C ), (5) Fgure 3. Correspondences between 3D map ponts and mesh vertces. tertght to avod mesh corrupton because of the local dspartes of deformaton force. We smply defne edges between vertces based on the pxel connectvtes on the map Mesh deformaton for depth fuson We fuse surface meshst cam nto a unque 3D reconstructon n world coordnates based on map ponts p 3D (t) defned n map coordnates. We frst convert St cam defned n map coordnates by reprojectng each vertex us- to S map t ng the assocated camera poset kt recorded n map coordnates. Secondly, we scalest cam by mnmzng the dstance between map ponts p 3D (t) and correspondng vertces v nst cam as follows: s map t = argmn sv p 3D (t) 2, (2) s s the scale factor for St cam. The correspon- (t) can be easly found by projectng where s map t dences f : v p 3D a ray from the camera center to map ponts and fnd the vertex nearest to ts ntersecton wth the mesh (see Fgure 3). Our mesh deformaton s nspred by as-rgd-as-possble (ARAP) transformatons proposed n [25]. We use the set of map ponts p 3D (t) as the geometrc constrant of the deformaton and defne the one-rng neghborhood of each vertex. Ideally the deformaton seeks to keep the transformaton for the surface n each local neghborhood as rgd as possble. Overlap of local neghborhoods s necessary to avod surface stretchng or shearng at the boundary of the local neghborhoods. By usng the local neghborhood concept, we can defne the followng energy functon for the local neghborhood C, correspondng to vertex v, and ts deformed versonc : E(C ) = w j (v v j) R (v v j ) 2, (3) v j C where R s a 3 3 rotaton matrx and w j denotes the wherew s a weght for the local neghborhoodc. We expand the ARAP method by defnng w based on the normal vector correspondng to the local neghborhood of each vertex. As we defne S t as a watertght surface mesh, and ts correspondng depth map s generated from a sngle camera vewpont, there are areas n whch ther normals are nearly perpendcular to the ray from the camera center,.e., the observablty from the camera s low. These areas tend to be boundary areas between objects n the scene, and do not correspond to objects n the actual scene. To mantan shape detals of the objects n the scene, we selectvely deform these areas as much as possble by defnng the weghtw by a sgmod functon as follows: 1 w = 1+e a(x+bπ), (6) where x denotes the angle between the normal of the local neghborhood and the ray from the camera center to the vertex, a and b are emprcally defned parameters. Fgure 4 shows the dstrbuton ofw on a surface mesh. We further ntroduce a bendng factor B j, as suggested n [16], as follows: B j = αa R R j, (7) whereαs a weghtng coeffcent,as the surface area for scalng nvarance, and fnally update Equaton 5 as follows: E(S t) = v S t w E(C )+B j. (8) After ths deformaton, we scale the deformed mesh by the absolute scales world t estmated for tmetas follows: s world t = t t scam t. (9) The scaled 3D mesh s sent to the clent, and rendered from the current camera poston or any other vewponts specfed by the user. 924

5 Fgure 4. (Top) Dstrbuton of weghtsw for the deformaton and (bottom) the correspondng textured mesh. Larger ntensty values n the top fgure ndcate the hgher weghts. 4. Experments We evaluate our framework quanttatvely by comparng the 3D reconstructon result wth ground truth data. For the acquston of the ground truth data and the nput mages, we used a tablet equpped wth an RGB-D StructureSensor [23]. The sensor captures RGB mages at VGA resoluton and depth mages at QVGA resoluton, and s able to capture wthn a range of 40 to 350 cm dstance to the camera (see Fgure 5). Our CNN archtecture s traned on the NYU Depth Dataset v2 [24], and thus performs well on typcal ndoor scenes. Our framework s desgned for the case n whch the CNN-predcted depth map s naccurate. We captured a new challengng offce dataset wth many poorly textured surfaces. We fed the mage sequences nto the proposed framework. ORB-SLAM runs n real-tme (5 to 10 fps) on the clent, specfcally, an Phone 6 wth an A8 processor and a 8 mega-pxel camera. The other components, CNN depth estmaton and mesh adaptaton s carred out on the server, a PC wth Intel Xeon dual core CPU, 2.4GHz, 96GB of RAM and an Nvda GeForce GTX Ttan X GPU wth 12GB of VRAM. To adjust to the orgnal mplementaton of the CNN and ncrease speed, both nput RGB mages and estmated depth mages are reszed to The average processng tme of the depth predcton for each key-frame s 2.6 seconds. Ths s longer than the duraton between keyframes detected by ORB-SLAM because these are selected based on vsual changes. We flter out those key-frames usng a spato-temporal dstance crteron smlar to the other feature-based approaches, e.g., PTAM, and send them to the server. The key-frames are processed on the server and the depth mage for each frame s estmated by the CNN archtecture. In the fuson process, we convert the depth mages to a refned mesh sequence as shown at the bottom of Fgure 5.We also make the ground truth mesh sequence correspond to the refned one from the raw depth maps captured by the depth sensor on the other hand. We compute resdual errors between the refned mesh and the ground truth as shown n Table 2 and Fgure 6. We can observe that our framework effcently reduces the resdual errors for all sequences. Both the average and the medan of the resdual errors fall wthn the range from about two thrds to a half. We also evaluate the absolute scale estmated from depth predcton as shown n the rghtmost column n the Table 2. The average error of the estmated scales for our sx offce scenes s20% of the ground truth scale. 5. Concluson In ths paper, we proposed a framework fusng the result of geometrc measurement,.e., feature-based monocular vsual SLAM and CNN-based depth predcton. We have shown ts effcency andpotental for applcatons whch can run on standard moble devces only equpped wth a sngle camera. Thanks to the capablty of CNN for depth predcton, some of the man lmtatons of feature-based monocular vsual SLAM, such as lack the absolute scale, sparse 3D reconstructon, were overcome. The 3D map acqured by monocular vsual SLAM also compensate the lmtaton of CNN-based depth maps by refnng t wth surface mesh deformaton to mantan shape detals. There are several possble drectons of future work. The frst s global mesh refnement and ntegraton based on the photometrc and geometrc consstency between meshes to obtan a unfed reconstructon result smlar to [28]. As the current deformaton s constraned only by sparse 3D features, refned meshes are not fully regstered wth each other as show at the bottom of Fgure 5. Second drecton s IMU-based scale estmaton. As the absolute scale n the current framework s predcted by CNN and hghly depend on the tranng data, we expect the ts accuracy can be enhanced f we fuse t wth IMU measurement. Another drecton s full use of CNN-based predcton, e.g., semantc labelng. By utlzng semantc labelng, we can selectvely manpulate the 3D reconstructon result. For nstance, we can recognze real furnture n the reconstructon and replace t wth a vrtual one. 925

6 Sofa area 1 Sofa area 2 Sofa area 3 Desk area 1 Desk area 2 Meetng room Fgure 5. Input data for our depth fuson and the reconstructed scenes. From top to bottom row: color mages, feature trackng result of SLAM, correspondng ground truth depth mages, depth mages estmated by DNN, and 3D reconstructon results on sx offce scenes, respectvely. Scene Mesh from CNN depth map Refned mesh by our method Mean Medan Std dev Mean Medan Std dev Scale Sofa area Sofa area Sofa area Desk area Desk area Meetng room Table 2. Accuracy mprovement results. Comparson of resdual errors [cm] from the ground truth obtaned usng a depth sensor, and the absolute scale estmated based on depth predcton. References [1] S. Agarwal, Y. Furukawa, N. Snavely, I. Smon, B. Curless, S. M. Setz, and R. Szelsk. Buldng rome n a day. Commun. ACM, 54(10): , Oct [2] M. Botsch and O. Sorkne. On lnear varatonal surface deformaton methods. IEEE Transactons on Vsualzaton and Computer Graphcs, 14(1): , Jan [3] B. Curless and M. Levoy. A volumetrc method for buldng complex models from range mages. In Proceedngs of the 23rd Annual Conference on Computer Graphcs and Interactve Technques, SIGGRAPH 96, pages , New York, NY, USA, ACM. 3 [4] A. J. Davson and D. W. Murray. Moble robot localsaton usng actve vson, pages Sprnger Berln Hedelberg, Berln, Hedelberg, [5] A. J. Davson, I. D. Red, N. D. Molton, and O. Stasse. MonoSLAM: Real-tme sngle camera SLAM. 29(6): , [6] T. Dobbert. Matchmovng: The Invsble Art of Camera Trackng. John Wley and Sons, [7] D. Egen and R. Fergus. Predctng depth, surface normals and semantc labels wth a common mult-scale convolu- 926

7 Ground truth CNN-predcted mesh Deformed mesh Ground truth mesh wth SLAM features Resdual error of CNN-predcted mesh Resdual error of deformed mesh Fgure 6. Surface mesh deformaton result. (Top left) ground truth surface mesh obtaned from depth-sensor measurements, (top center) surface mesh converted from the CNN-predcted depth map, (top rght) deformaton result of center mesh usng 3D SLAM features. (Bottom left) 3D SLAM features together wth the ground truth mesh, (bottom center and rght) resdual errors of CNN-predcton and deformed mesh. Warm colors ndcates hgher resdual error. Black regons ndcate areas beyond the maxmum depth sensor range. tonal archtecture. CoRR, abs/ , , 2, 3 [8] D. Egen, C. Puhrsch, and R. Fergus. Depth map predcton from a sngle mage usng a mult-scale deep network. CoRR, abs/ , , 2 [9] J. Engel, T. Schöps, and D. Cremers. LSD-SLAM: Largescale drect monocular SLAM. In ECCV, September , 2 [10] J. Engel, J. Sturm, and D. Cremers. Sem-dense vsual odometry for a monocular camera. In ICCV, Sydney, Australa, December [11] M. Hofer, M. Maurer, and H. Bschof. Effcent 3D scene abstracton usng lne segments. CVIU, [12] M. Keller, D. Lefloch, M. Lambers, S. Izad, T. Weyrch, and A. Kolb. Real-tme 3D reconstructon n dynamc scenes usng pont-based fuson. In 3DV, pages 1 8, Washngton, DC, USA, IEEE Computer Socety. 2 [13] G. Klen and D. Murray. Parallel trackng and mappng for small AR workspaces. In ISMAR, pages 1 10, Washngton, DC, USA, IEEE Computer Socety. 2 [14] K. Kolev, P. Tanskanen, P. Specale, and M. Pollefeys. Turnng moble phones nto 3d scanner. In CVPR, [15] I. Lana, C. Rupprecht, V. Belaganns, F. Tombar, and N. Navab. Deeper depth predcton wth fully convolutonal resdual networks. CoRR, abs/ , , 2 [16] Z. Lev and C. Gotsman. Smooth rotaton enhanced as-rgdas-possble mesh anmaton. IEEE Transactons on Vsualzaton and Computer Graphcs, 21: , , 4 [17] B. L, C. Shen, Y. Da, A. van den Hengel, and M. He. Depth and surface normal estmaton from monocular mages usng regresson on deep features and herarchcal crfs. In 2015 IEEE Conference on Computer Vson and Pattern Recognton (CVPR), pages , June [18] F. Lu, C. Shen, and G. Ln. Deep convolutonal neural felds for depth estmaton from a sngle mage. CoRR, abs/ , [19] S. Lynen, M. Achtelk, S. Wess, M. Chl, and R. Segwart. A robust and modular mult-sensor fuson approach appled to mav navgaton. In Proc. of the IEEE/RSJ Conference on Intellgent Robots and Systems (IROS), [20] R. Mur-Artal, J. M. M. Montel, and J. D. Tards. Orb-slam: A versatle and accurate monocular slam system. IEEE Transactons on Robotcs, 31(5): , Oct , 2, 3 [21] R. A. Newcombe, S. Izad, O. Hllges, D. Molyneaux, D. Km, A. J. Davson, P. Koh, J. Shotton, S. Hodges, and A. Ftzgbbon. Knectfuson: Real-tme dense surface mappng and trackng. In th IEEE Internatonal Symposum on Mxed and Augmented Realty, pages , Oct

8 [22] R. A. Newcombe, S. J. Lovegrove, and A. J. Davson. DTAM: Dense trackng and mappng n real-tme. In ICCV, pages , Nov , 2 [23] Occptal. v0.6.2, [24] N. Slberman, D. Hoem, P. Kohl, and R. Fergus. Indoor segmentaton and support nference from rgbd mages. In ECCV, [25] O. Sorkne and M. Alexa. As-rgd-as-possble surface modelng. In Proc. Ffth Eurographcs Symp. Geometry Processng, SGP 07, pages , , 4 [26] K. Tateno, F. Tombar, I. Lana, and N. Navab. CNN-SLAM: real-tme dense monocular SLAM wth learned depth predcton. CoRR, abs/ , , 2 [27] P. Wang, X. Shen, Z. Ln, S. Cohen, B. Prce, and A. Yulle. Towards unfed depth and semantc predcton from a sngle mage. In 2015 IEEE Conference on Computer Vson and Pattern Recognton (CVPR), pages , June [28] T. Whelan, R. F. Salas-Moreno, B. Glocker, A. J. Davson, and S. Leutenegger. Elastcfuson: Real-tme dense slam and lght source estmaton. Int. J. Robotcs Research, 35(14): , [29] Z. Wu, S. Song, A. Khosla, F. Yu, L. Zhang, X. Tang, and J. Xao. 3d shapenets: A deep representaton for volumetrc shapes. In CVPR, [30] G. Younes, D. C. Asmar, and E. A. Shammas. A survey on non-flter-based monocular vsual SLAM systems. CoRR, abs/ ,

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning

Computer Animation and Visualisation. Lecture 4. Rigging / Skinning Computer Anmaton and Vsualsaton Lecture 4. Rggng / Sknnng Taku Komura Overvew Sknnng / Rggng Background knowledge Lnear Blendng How to decde weghts? Example-based Method Anatomcal models Sknnng Assume

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

Line-based Camera Movement Estimation by Using Parallel Lines in Omnidirectional Video

Line-based Camera Movement Estimation by Using Parallel Lines in Omnidirectional Video 01 IEEE Internatonal Conference on Robotcs and Automaton RverCentre, Sant Paul, Mnnesota, USA May 14-18, 01 Lne-based Camera Movement Estmaton by Usng Parallel Lnes n Omndrectonal Vdeo Ryosuke kawansh,

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation GeoNet: Geometrc Neural Network for Jont Depth and Surface Normal Estmaton Xaojuan Q Renje Lao, Zhengzhe Lu Raquel Urtasun, Jaya Ja, The Chnese Unversty of Hong Kong Unversty of Toronto Uber Advanced Technologes

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

METRIC ALIGNMENT OF LASER RANGE SCANS AND CALIBRATED IMAGES USING LINEAR STRUCTURES

METRIC ALIGNMENT OF LASER RANGE SCANS AND CALIBRATED IMAGES USING LINEAR STRUCTURES METRIC ALIGNMENT OF LASER RANGE SCANS AND CALIBRATED IMAGES USING LINEAR STRUCTURES Lorenzo Sorg CIRA the Italan Aerospace Research Centre Computer Vson and Vrtual Realty Lab. Outlne Work goal Work motvaton

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

An efficient method to build panoramic image mosaics

An efficient method to build panoramic image mosaics An effcent method to buld panoramc mage mosacs Pattern Recognton Letters vol. 4 003 Dae-Hyun Km Yong-In Yoon Jong-Soo Cho School of Electrcal Engneerng and Computer Scence Kyungpook Natonal Unv. Abstract

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga Angle-Independent 3D Reconstructon J Zhang Mrelle Boutn Danel Alaga Goal: Structure from Moton To reconstruct the 3D geometry of a scene from a set of pctures (e.g. a move of the scene pont reconstructon

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

arxiv: v1 [cs.cv] 4 Jul 2016

arxiv: v1 [cs.cv] 4 Jul 2016 JUN et al.: LEARNING FINE-SCALED DEPTH MAPS FROM SINGLE RGB IMAGES 1 arxv:1607.00730v1 [cs.cv] 4 Jul 2016 Learnng Fne-Scaled Depth Maps from Sngle RGB Images Jun L http://cg.cs.un-bonn.de/ Renhard Klen

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Interactive Rendering of Translucent Objects

Interactive Rendering of Translucent Objects Interactve Renderng of Translucent Objects Hendrk Lensch Mchael Goesele Phlppe Bekaert Jan Kautz Marcus Magnor Jochen Lang Hans-Peter Sedel 2003 Presented By: Mark Rubelmann Outlne Motvaton Background

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

A Volumetric Approach for Interactive 3D Modeling

A Volumetric Approach for Interactive 3D Modeling A Volumetrc Approach for Interactve 3D Modelng Dragan Tubć Patrck Hébert Computer Vson and Systems Laboratory Laval Unversty, Ste-Foy, Québec, Canada, G1K 7P4 Dens Laurendeau E-mal: (tdragan, hebert, laurendeau)@gel.ulaval.ca

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 1. SSDH: Semi-supervised Deep Hashing for Large Scale Image Retrieval IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY SSDH: Sem-supervsed Deep Hashng for Large Scale Image Retreval Jan Zhang, and Yuxn Peng arxv:607.08477v2 [cs.cv] 8 Jun 207 Abstract Hashng

More information

Articulated Motion Capture from Visual Hulls in High Dimensional Configuration Spaces

Articulated Motion Capture from Visual Hulls in High Dimensional Configuration Spaces Semnar Presentaton June 18th, 010 Artculated Moton Capture from Vsual Hulls n Hgh Dmensonal Confguraton Spaces Azawa Yamasak Lab D, 48-09741, 羅衛蘭 ABSTRACT In ths paper, we propose a novel approach for

More information

A Range Image Refinement Technique for Multi-view 3D Model Reconstruction

A Range Image Refinement Technique for Multi-view 3D Model Reconstruction A Range Image Refnement Technque for Mult-vew 3D Model Reconstructon Soon-Yong Park and Mural Subbarao Electrcal and Computer Engneerng State Unversty of New York at Stony Brook, USA E-mal: parksy@ece.sunysb.edu

More information

Geometric Primitive Refinement for Structured Light Cameras

Geometric Primitive Refinement for Structured Light Cameras Self Archve Verson Cte ths artcle as: Fuersattel, P., Placht, S., Maer, A. Ress, C - Geometrc Prmtve Refnement for Structured Lght Cameras. Machne Vson and Applcatons 2018) 29: 313. Geometrc Prmtve Refnement

More information

High resolution 3D Tau-p transform by matching pursuit Weiping Cao* and Warren S. Ross, Shearwater GeoServices

High resolution 3D Tau-p transform by matching pursuit Weiping Cao* and Warren S. Ross, Shearwater GeoServices Hgh resoluton 3D Tau-p transform by matchng pursut Wepng Cao* and Warren S. Ross, Shearwater GeoServces Summary The 3D Tau-p transform s of vtal sgnfcance for processng sesmc data acqured wth modern wde

More information

Improving Initial Estimations for Structure from Motion Methods

Improving Initial Estimations for Structure from Motion Methods Improvng Intal Estmatons for Structure from Moton Methods Chrstopher Schwartz Renhard Klen Insttute for Computer Scence II, Unversty of Bonn Abstract In Computer Graphcs as well as n Computer Vson and

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris

Learning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris Learnng Ensemble of Local PDM-based Regressons Yen Le Computatonal Bomedcne Lab Advsor: Prof. Ioanns A. Kakadars 1 Problem statement Fttng a statstcal shape model (PDM) for mage segmentaton Callosum segmentaton

More information

SDF Tracker: A Parallel Algorithm for On-line Pose Estimation and Scene Reconstruction From Depth Images

SDF Tracker: A Parallel Algorithm for On-line Pose Estimation and Scene Reconstruction From Depth Images SDF Tracker: A Parallel Algorthm for On-lne Pose Estmaton and Scene Reconstructon From Depth Images Danel R. Canelhas, Todor Stoyanov, Achm J. Llenthal Center of Appled Autonomous Sensor Systems (AASS),

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Structure from Motion

Structure from Motion Structure from Moton Structure from Moton For now, statc scene and movng camera Equvalentl, rgdl movng scene and statc camera Lmtng case of stereo wth man cameras Lmtng case of multvew camera calbraton

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

MOTION BLUR ESTIMATION AT CORNERS

MOTION BLUR ESTIMATION AT CORNERS Gacomo Boracch and Vncenzo Caglot Dpartmento d Elettronca e Informazone, Poltecnco d Mlano, Va Ponzo, 34/5-20133 MILANO boracch@elet.polm.t, caglot@elet.polm.t Keywords: Abstract: Pont Spread Functon Parameter

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

arxiv: v2 [cs.cv] 9 Apr 2018

arxiv: v2 [cs.cv] 9 Apr 2018 Boundary-senstve Network for Portrat Segmentaton Xanzh Du 1, Xaolong Wang 2, Dawe L 2, Jngwen Zhu 2, Serafettn Tasc 2, Cameron Uprght 2, Stephen Walsh 2, Larry Davs 1 1 Computer Vson Lab, UMIACS, Unversty

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images A B-Snake Model Usng Statstcal and Geometrc Informaton - Applcatons to Medcal Images Yue Wang, Eam Khwang Teoh and Dnggang Shen 2 School of Electrcal and Electronc Engneerng, Nanyang Technologcal Unversty

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

Research and Application of Fingerprint Recognition Based on MATLAB

Research and Application of Fingerprint Recognition Based on MATLAB Send Orders for Reprnts to reprnts@benthamscence.ae The Open Automaton and Control Systems Journal, 205, 7, 07-07 Open Access Research and Applcaton of Fngerprnt Recognton Based on MATLAB Nng Lu* Department

More information

Scale Selective Extended Local Binary Pattern For Texture Classification

Scale Selective Extended Local Binary Pattern For Texture Classification Scale Selectve Extended Local Bnary Pattern For Texture Classfcaton Yutng Hu, Zhlng Long, and Ghassan AlRegb Multmeda & Sensors Lab (MSL) Georga Insttute of Technology 03/09/017 Outlne Texture Representaton

More information

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Supplementary Material DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents Supplementary Materal DESIRE: Dstant Future Predcton n Dynamc Scenes wth Interactng Agents Namhoon Lee 1, Wongun Cho 2, Paul Vernaza 2, Chrstopher B. Choy 3, Phlp H. S. Torr 1, Manmohan Chandraker 2,4

More information

Mesh Editing in ROI with Dual Laplacian

Mesh Editing in ROI with Dual Laplacian Mesh Edtng n ROI wth Dual Laplacan Luo Qong, Lu Bo, Ma Zhan-guo, Zhang Hong-bn College of Computer Scence, Beng Unversty of Technology, Chna lqngng@sohu.com, lubo@but.edu.cn,mzgsy@63.com,zhb@publc.bta.net.cn

More information

Multi-Resolution Geometric Fusion

Multi-Resolution Geometric Fusion Internatonal Conference on Recent Advances n 3-D Dgtal Imagng and Modellng, Ottawa, Canada May 12 15, 1997 Mult-Resoluton Geometrc Fuson Adran Hlton and John Illngworth Centre for Vson, Speech and Sgnal

More information

Modeling, Manipulating, and Visualizing Continuous Volumetric Data: A Novel Spline-based Approach

Modeling, Manipulating, and Visualizing Continuous Volumetric Data: A Novel Spline-based Approach Modelng, Manpulatng, and Vsualzng Contnuous Volumetrc Data: A Novel Splne-based Approach Jng Hua Center for Vsual Computng, Department of Computer Scence SUNY at Stony Brook Talk Outlne Introducton and

More information

Joint Example-based Depth Map Super-Resolution

Joint Example-based Depth Map Super-Resolution Jont Example-based Depth Map Super-Resoluton Yanje L 1, Tanfan Xue,3, Lfeng Sun 1, Janzhuang Lu,3,4 1 Informaton Scence and Technology Department, Tsnghua Unversty, Bejng, Chna Department of Informaton

More information

3D Rigid Facial Motion Estimation from Disparity Maps

3D Rigid Facial Motion Estimation from Disparity Maps 3D Rgd Facal Moton Estmaton from Dsparty Maps N. Pérez de la Blanca 1, J.M. Fuertes 2, and M. Lucena 2 1 Department of Computer Scence and Artfcal Intellgence ETSII. Unversty of Granada, 1871 Granada,

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Simplification of 3D Meshes

Simplification of 3D Meshes Smplfcaton of 3D Meshes Addy Ngan /4/00 Outlne Motvaton Taxonomy of smplfcaton methods Hoppe et al, Mesh optmzaton Hoppe, Progressve meshes Smplfcaton of 3D Meshes 1 Motvaton Hgh detaled meshes becomng

More information

Mobile Robot Localization and Mapping by Scan Matching using Laser Reflection Intensity of the SOKUIKI Sensor

Mobile Robot Localization and Mapping by Scan Matching using Laser Reflection Intensity of the SOKUIKI Sensor Moble Robot Localzaton and Mappng by Scan Matchng usng Reflecton Intensty of the SOKUIKI Sensor *HARA Yoshtaka, KAWATA Hrohko, OHYA Akhsa, YUTA Shn ch Intellgent Robot Laboratory Unversty of Tsukuba 1-1-1,

More information

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints

TPL-Aware Displacement-driven Detailed Placement Refinement with Coloring Constraints TPL-ware Dsplacement-drven Detaled Placement Refnement wth Colorng Constrants Tao Ln Iowa State Unversty tln@astate.edu Chrs Chu Iowa State Unversty cnchu@astate.edu BSTRCT To mnmze the effect of process

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects Clemson Unversty TgerPrnts All Theses Theses 12-2011 A Comparson and Evaluaton of Three Dfferent Pose Estmaton Algorthms In Detectng Low Texture Manufactured Objects Robert Krener Clemson Unversty, rkrene@clemson.edu

More information

Alignment of Non-Overlapping Sequences

Alignment of Non-Overlapping Sequences Algnment of Non-Overlappng Sequences Yaron Casp Mchal ran Dept. of Computer Scence and Appled Math The Wezmann nsttute of Scence 76100 Rehovot, srael Ths paper shows how two mage sequences that have no

More information

Feature-Preserving Mesh Denoising via Bilateral Normal Filtering

Feature-Preserving Mesh Denoising via Bilateral Normal Filtering Feature-Preservng Mesh Denosng va Blateral Normal Flterng Ka-Wah Lee, Wen-Png Wang Computer Graphcs Group Department of Computer Scence, The Unversty of Hong Kong kwlee@cs.hku.hk, wenpng@cs.hku.hk Abstract

More information

Video Object Tracking Based On Extended Active Shape Models With Color Information

Video Object Tracking Based On Extended Active Shape Models With Color Information CGIV'2002: he Frst Frst European Conference Colour on Colour n Graphcs, Imagng, and Vson Vdeo Object rackng Based On Extended Actve Shape Models Wth Color Informaton A. Koschan, S.K. Kang, J.K. Pak, B.

More information

Dense Visual SLAM for RGB-D Cameras

Dense Visual SLAM for RGB-D Cameras Dense Vsual SLAM for RGB-D Cameras Chrstan Kerl, Jürgen Sturm, and Danel Cremers Abstract In ths paper, we propose a dense vsual SLAM method for RGB-D cameras that mnmzes both the photometrc and the depth

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

Transformation Networks for Target-Oriented Sentiment Classification ACL / 25

Transformation Networks for Target-Oriented Sentiment Classification ACL / 25 Transformaton Networks for Target-Orented Sentment Classfcaton 1 Xn L 1, Ldong Bng 2, Wa Lam 1, Be Sh 1 1 The Chnese Unversty of Hong Kong 2 Tencent AI Lab ACL 2018 1 Jont work wth Tencent AI Lab Transformaton

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

Recognizing Faces. Outline

Recognizing Faces. Outline Recognzng Faces Drk Colbry Outlne Introducton and Motvaton Defnng a feature vector Prncpal Component Analyss Lnear Dscrmnate Analyss !"" #$""% http://www.nfotech.oulu.f/annual/2004 + &'()*) '+)* 2 ! &

More information

Semi-Direct Visual Odometry for Monocular, Wide-angle, and Multi-Camera Systems

Semi-Direct Visual Odometry for Monocular, Wide-angle, and Multi-Camera Systems 1 Sem-Drect Vsual Odometry for Monocular, Wde-angle, and Mult-Camera Systems Chrstan Forster, Zchao Zhang, Mchael Gassner, Manuel Werlberger, Davde Scaramuzza Abstract Drect methods for Vsual Odometry

More information

Computer Vision I. Xbox Kinnect: Rectification. The Fundamental matrix. Stereo III. CSE252A Lecture 16. Example: forward motion

Computer Vision I. Xbox Kinnect: Rectification. The Fundamental matrix. Stereo III. CSE252A Lecture 16. Example: forward motion Xbox Knnect: Stereo III Depth map http://www.youtube.com/watch?v=7qrnwoo-8a CSE5A Lecture 6 Projected pattern http://www.youtube.com/watch?v=ceep7x-z4wy The Fundamental matrx Rectfcaton The eppolar constrant

More information

Registration of Expressions Data using a 3D Morphable Model

Registration of Expressions Data using a 3D Morphable Model Regstraton of Expressons Data usng a 3D Morphable Model Curzo Basso DISI, Unverstà d Genova, Italy Emal: curzo.basso@ds.unge.t Thomas Vetter Departement Informatk, Unverstät Basel, Swtzerland Emal: thomas.vetter@unbas.ch

More information

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature Detecton of hand graspng an object from complex background based on machne learnng co-occurrence of local mage feature Shnya Moroka, Yasuhro Hramoto, Nobutaka Shmada, Tadash Matsuo, Yoshak Shra Rtsumekan

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Human Skeleton Reconstruction for Optical Motion Capture

Human Skeleton Reconstruction for Optical Motion Capture Journal of Computatonal Informaton Systems 9: 0 (013) 8073 8080 Avalable at http://www.jofcs.com Human Skeleton Reconstructon for Optcal Moton Capture Guanghua TAN, Melan ZHOU, Chunmng GAO College of Informaton

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Distance Calculation from Single Optical Image

Distance Calculation from Single Optical Image 17 Internatonal Conference on Mathematcs, Modellng and Smulaton Technologes and Applcatons (MMSTA 17) ISBN: 978-1-6595-53-8 Dstance Calculaton from Sngle Optcal Image Xao-yng DUAN 1,, Yang-je WEI 1,,*

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

Some Tutorial about the Project. Computer Graphics

Some Tutorial about the Project. Computer Graphics Some Tutoral about the Project Lecture 6 Rastersaton, Antalasng, Texture Mappng, I have already covered all the topcs needed to fnsh the 1 st practcal Today, I wll brefly explan how to start workng on

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH

IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH IMAGE MATCHING WITH SIFT FEATURES A PROBABILISTIC APPROACH Jyot Joglekar a, *, Shrsh S. Gedam b a CSRE, IIT Bombay, Doctoral Student, Mumba, Inda jyotj@tb.ac.n b Centre of Studes n Resources Engneerng,

More information

A high precision collaborative vision measurement of gear chamfering profile

A high precision collaborative vision measurement of gear chamfering profile Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) A hgh precson collaboratve vson measurement of gear chamferng profle Conglng Zhou, a, Zengpu Xu, b, Chunmng

More information

SVO: Semi-Direct Visual Odometry for Monocular and Multi-Camera Systems

SVO: Semi-Direct Visual Odometry for Monocular and Multi-Camera Systems 1 : Sem-Drect Vsual Odometry for Monocular and Mult-Camera Systems Chrstan Forster, Zchao Zhang, Mchael Gassner, Manuel Werlberger, Davde Scaramuzza Abstract Drect methods for Vsual Odometry (VO) have

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information