Title. Issue Date DOI. Doc URL. Type. File Information /doctoral.k

Size: px
Start display at page:

Download "Title. Issue Date DOI. Doc URL. Type. File Information /doctoral.k"

Transcription

1 Ttle Study of a Vew-based 3-D Object Retreval Method fo Authors 張, 永生 Issue Date DOI /doctoral.k12406 Doc URL Type theses doctoral Fle Informaton Zhang_Yongsheng.pdf Instructons for use Hokkado Unversty Collecton of Scholarly and Aca

2 Study of a Vew-based 3-D Object Retreval Method for 3-D Object Reconstructon 3 次元再構成のためのビューベース 3-D オブジェクト復元に関する研究 Yongsheng Zhang Graduate School of Informaton Scence and Technology Hokkado Unversty Sapporo, Hokkado, Japan

3 Table of Contents Abstract... 3 Lst of Fgures Introducton Background Image-based 3-D ModelngIBM and Vew-based 3-D Object RetrevalVBOR Thess Overvew Related Work From Images to 3-D models A General IBM Method Super-pxel based IBM D Object Retreval Model-based Methods Vew-based Methods Mult-Scale Object Retreval va Learnng on Graph Shape Feature Extracton Mult-vew Object Dstance Mult-scale Object Graph Constructon Graph Learnng for Object Retreval Computatonal Cost Experment Evaluaton Datasets Natonal Tawan Unversty 3-D Model DatabaseNTU The Edgenösssche Technsche Hochschule Zürch DatabaseETH Compared Methods Elevaton DescrptorED

4 4.2.2 Adaptve Vews ClusterngAVC Query Vew Selecton MethodQVS Evaluaton Crtera The Accuracy of The Nearest NeghborNN F-MeasureF Dscounted Cumulatve GanDCG Average Normalzed Modfed Retreval RankANMRR Expermental Results Analyss On the Mult-scale Hypergraph Connectons On Parameter Concluson and Future Work Concluson Future Work Bblography Acknowledgements Research Achevements

5 Abstract Wth rapd advances n computer technques and the popularty of the camera, a large number of photographs can be obtaned. How to obtan a novel 3-D object or 3-D scene from 2-D mages s a challengng task. To create a 3-D object, two methods can be appled. One s to create 3-D object by usng some 3-D modelng methods. The other s to create 3-D object by combnng or revsng exstng desgns. Accordng to a research report, 20% of desgns should be started from the very begnnng reconstructon, and 80% of desgns can be obtaned by combnng or revsng exstng 3-D models. Many research about 3-D modelng has been nvestgated, but t s stll a hgh-cost and laborous task to model hghly realstc 3-D models. We have also proposed a super-pxel based method to get a 3-D model from mage sequence. Wth the development of 3-D technology and 3-D applcatons, people can easly get a lot of exstng 3-D model. We can use these exstng model nstead of a drect 3-D modelng from mages. So we focus on the vew-based 3-D object retreval. Object retreval has attracted much research attenton n recent years. Confrontng object retreval, how to estmate the relevance among objects s a challengng task. We focus on vew-based object retreval and propose a mult-scale object retreval algorthm va learnng on graph from multmodal data. In our work, shape features are extracted from each vew of objects. The relevance among objects s formulated n a hypergraph structure, where the dstance of dfferent vews n the feature space s employed to generate the connecton n the hypergraph. To acheve better representaton performance, we propose a mult-scale hypergraph structure to model object correlatons. The learnng on graph s conducted to estmate the optmal relevance among these objects, whch are used for object retreval. To evaluate the performance of the proposed method, 3

6 we conduct experments on the Natonal Tawan Unversty dataset and the ETH dataset. To evaluate the 3-D object retreval performance of our method, we employ the state-of-the-art methods of ED, AVC and QVS for comparson. In order to measure the 3-D object retreval performance, the crtera of NN, F, DCG and ANMRR are employed to compare dfferent methods n our experments. Expermental results and comparsons wth the state-of-the-art methods demonstrate the effectveness of the proposed method. 4

7 Lst of Fgures Fgure 1.1 The structure of 3-D object rebuld... 9 Fgure 1.2 Example vews of 3-D objects Fgure 1.3 The framework of the proposed method Fgure 2.1 The baselne of the mage-based 3-D model reconstructon algorthms Fgure 2.2 Outlne of the super-pxel based 3-D modelng approach Fgure 2.3 An example of resultng super-pxels Fgure 2.4 Patch defnton Fgure 2.5 The procedure of back projecton, durng ths procedure3-d surface patch nodes are acqured Fgure 2.6 Patches expanded on curved surface. Normal vector correct s necessary Fgure 2.7 The purpose of normal vector correcton s to fnd the angles a and b, t s a nonlnear mnmzaton problem Fgure 2.8 Sample mages from the nput dataset Fgure 2.9 Fnal polygon model smulated from 3-D surface patches usng meshlab software Fgure 2.10 The general framework of vew-based 3-D object retreval Fgure 3.1 Hyperedge generaton usng dfferent K values Fgure D object examples n the NTU database Fgure 4.2 Structure of Buckmnsterfullerene Fgure D object examples n the ETH database Fgure 4.4 Expermental results on the NTU dataset Fgure 4.5 Expermental results on the ETH dataset Fgure 4.6 Expermental results wth respect to dfferent connecton numbers on the NTU dataset Fgure 4.7 Expermental results wth respect to dfferent connecton numbers on 5

8 the ETH dataset Fgure 4.8 Expermental results wth respect to dfferent values on the NTU dataset Fgure 4.9 Expermental results wth respect to dfferent values on the ETH dataset

9 Chapter 1 1. Introducton In ths chapter, we provde the background for understandng mage-based 3-D object reconstructon and 3-D object retreval. We then brefly defne the motvaton and goal of our research. The last secton of ths chapter provdes an overvew of contents of the rest of ths book. 1.1 Background Wth rapd advances n computer technques and the popularty of the camera, a large number of photographs are obtaned. How to obtan a novel 3-D object or 3-D scene from 2-D mages s a challengng task. In the past several decades, 3-D object reconstructon has been nvestgated extensvely. In recent years, graphcs hardware and mage processng technques have made remarkable progress and 3-D technques have been appled n varous felds, such as computer-aded desgncad, medcal dagnoss, onlne shoppng, vrtual realtyvr, and entertanment. For example, 3-D navgaton of cty and museum have become much more popular nowadays. The long hstory of 3-D technology can be drawn the way back to the start of photography. Stereoscopc photography, or the technque of creatng a "thrd dmenson", was frst nvented n 1838 by the Englsh scentst Sr Charles Wheatstone. Stereoscopc photography s a specal moton pcture camera system that records mages from two dfferent perspectves. Eyewear s then used to combne these perspectves and create the lluson of depth. In 1965, a team led by Charles Lang from Cambrdge Unversty 7

10 started conductng 3-D CAD modelng research. And 3-D modelng became popular and has been appled to varous desgn works. In 1981, Hdeo Kodama of Nagoya Muncpal Industral Research Insttute publshed hs account of a functonal rapd prototypng system usng photopolymers. A sold, prnted model was bult up n layers, each of whch corresponded to a cross-sectonal slce n the model. On 31 January 2010, BSKYB became the frst broadcaster n the world to show a lve sports event n 3-D when Sky Sports screened a football match between Manchester Unted and Arsenal to a publc audence n several selected pubs. Wth the development and wde applcaton of 3-D technology, varous methods have been proposed to produce a lot of 3-D data. Smultaneously, 3-D data are ncreased n both local data storage and onlne data storage. To create a 3-D object or 3-D scene, two methods can be appled as Fgure 1.1 shows. One s to create 3-D object by usng some 3-D modelng methods ncludng actve methods and passve methods. The other s to create 3-D object by combnng or revsng exstng desgns. Accordng to a report from Gunn[1], 20% of desgns should be started from the very begnnng reconstructon, and 80% of desgns can be obtaned by combnng or revsng exstng desgns. Although lots of methods have been nvestgated, 3-D modelng s stll a hgh cost and laborous task. The combnaton and revson of exstng 3-D models can mprove model desgn performance. 8

11 3-D modelng drectly How to get a 3-D object combnng or revsng exstng 3-D object 3-D object retreval model-based 3DOR vew-based 3DOR Fgure 1.1 The structure of 3-D object rebuld 1.2 Image-based 3-D ModelngIBM and Vew-based 3-D Object RetrevalVBOR The motvaton of ths paper s to explore solutons to get 3-D object from mage vews. Image-based modelng methods rely on a set of 2-D mages of a scene to generate a 3-D model. In recent years, great progress has been made wth applcatons n the feld, and varous methodologes have been proposed to deal wth related problems. Some of these methods have produced outstandng results, and can be roughly dvded nto four categores [2]. The frst nvolves computaton of a 3-D volume and extracton of an optmal surface from t [3]. The accuracy of ths approach s lmted by the resoluton of the voxel grd. The second nvolves the use of voxels, level sets or surface meshes, and teratve evoluton of a surface to reduce or mnmze the cost functon. Although ths approach s wdely used n medcal mage 3-D reconstructon, ts applcablty s lmted. The thrd [4, 5, 6] s based on a set of depth maps. The solutons are flexble, but requre a set of such maps to be merged nto a 3-D scene wth consstency constrants. The fourth nvolves extractng feature ponts and matchng followed by the fttng of a surface to the reconstructed ponts. Wth ths method, wde-baselne stereo matchng s appled to an MVS model [7] to recover salent 3-D 9

12 features, and a vsual hull model can be shrunk so that the recovered ponts le on the surface. The results are then refned usng cost functon mnmzaton. Although 3-D model reconstructon has been nvestgated for decades, t s stll a challengng task and costs much computaton load. In recent years, 3-D object retreval[12] has attracted much research attenton from both research and ndustral felds. Extensve research attenton [13,14,15] has been dedcated n such an emergng feld[16-19], from ether model-based [20,21,22]or vew-based drectons [23-26], based on the representaton methods of 3-D objects. Most of early objects are model-based, where each object s represented by a vrtual 3-D model, such as trangle mesh. Model-based methods have shown advantages when descrbng the global spatal nformaton. However, one man lmtaton of such methods le n stuaton of lack of model data. Model-based methods hghly depend on the vrtual model, where such model nformaton may be not avalable n many practcal applcatons. Dfferent from model-based methods, vew-based methods [23,30,24,31] have become more useful n recent years. In vew-based methods, each object s represented by a set of vews from dfferent drectons. Such methods are much more flexble than model-based methods, as the model nformaton s not mandatorly requred. Also, vew-based methods can be benefced from exstng mage processng achevements, such as mage feature extracton and comparson. Fgure 1.2 provdes examples of vews from objects. Daras et al. [32] ntroduced that vew-based methods [33, 34] can be more dscrmnatve than model-based methods, and vew-based methods have been nvestgated n recent decade. In vew-based methods, generally, object comparson s based on mult-vew matchng. It s noted that t s stll a challengng task to compare two objects va two groups of vews, whch s dfferent from tradtonal mage comparson. In ths paper, we focus on vew-based object retreval and propose a mult-scale object retreval algorthm 10

13 va learnng on graph. Fgure 1.3 shows the framework of our proposed method. In our work, shape features are extracted from each vew of objects. The relevance among objects s formulated n a hypergraph structure, where the dstance of dfferent vews n the feature space s employed to generate the connecton n the hypergraph. To acheve better representaton performance, we propose a mult-scale hypergraph structure to model object correlatons. The learnng on graph s conducted to estmate the optmal relevance among these objects, whch are used for object retreval. To evaluate the performance of the proposed method, we conduct experments on the Natonal Tawan Unversty dataset and the ETH dataset. Expermental results and comparsons wth the state-of-the-art methods demonstrate the effectveness of the proposed method. Fgure 1.2 Example vews of 3-D objects 11

14 Fgure 1.3 The framework of the proposed method. 1.3 Thess Overvew Ths thess s organzed as follows. Ths chapter presents the background, research motvaton and structure of ths thess. We descrbe the hstory and progress of the development of 3-D technology. We also ntroduce the IBM and 3DOR as well as the solutons and our contrbutons to get 3-D object from mage vews. In Chapter 2 we provde a bref overvew of prevous research that can be consdered background for the proposed method. We do some experment n mage-based modelng, then ntroduce the recent progress n 3-D object retreval. Chapter 3 presents the 3-D object retreval method that we use n our research. We focus on vew-based object retreval and propose a mult-scale object retreval algorthm va learnng on graph from multmodal data. In Chapter 4 we ntroduce the testng datasets, comparng methods, evaluaton crtera and expermental results. Experments are conducted on two publc datasets the Natonal Tawan Unversty dataset and the ETH dataset. Methods of ED, AVC and QVS are employed for comparson. The crtera of NN, F, DCG and ANMRR s employed to evaluate the performance of dfferent methods. 12

15 Chapter 5 s the concludng remarks and future work. 13

16 Chapter 2 2. Related Work From the 2-D mage sequence to get the 3-D model, the general approach s the method of 3-D modelng. Many research about 3-D modelng has been nvestgated, but t s stll a hgh-cost and laborous task to model hghly realstc 3-D models. We have also done some work n ths area. The technology and the analyss wll be ntroduced n secton 2.1. Wth the development of 3-D technology and 3-D applcatons, people can easly get a lot of exstng 3-D models. We can use these exstng models nstead of a drect 3-D modelng from mages. So we focus on the vew-based 3-D object retreval. From Secton 2.2 we wll ntroduce recent progress on 3-D object retreval. 2.1 From Images to 3-D models A General IBM Method The acquston methods of the real object model can be roughly dvded nto two categores: actve method and passve method. The typcal representatve of the actve approaches s the method of usng scanner. It can get the 3-D model of objects accurately, whle ts cost s very hgh and t s dffcult to acqure enough data to reconstruct models n all applcatons. The passve method s to reconstruct 3-D model based on mages of the scene or objects one wshes to reconstruct, whch s economcal and can get features wth color drectly. The baselne of the mage-based method s shown as the fgure 2.1. Provded wth a sequence of mages of a scene or an object, t 14

17 can generate a realstc 3-D model n fve steps. Frstly, for an mage s just a large collecton of pxels wth ther own ntenstes, features matchng should be appled. That s to say, to reconstruct the model of ths object based on mages, t should fnd the correspondng features, such as ponts n dfferent mages, whch may be completed by comparng ntenstes over a small local wndow centered wth a pont. Secondly, the moton and the 3-D structure should be recovered. If there s not a pror calbraton of the camera used to get the mage sequences, a projectve skew should be contaned n ths stage to recovery the calbraton of camera. Thrdly, wth the knowledge of the camera parameters that we have got n the second step or based on pror, we can get a depth estmate for almost each pxel of an mage to match all pxels of an mage wth pxels n ts neghborng mages. In ths way, all ponts of the object can be reconstructed. Fourthly, 3-D model s bult by fusng the results together. Fnally, texture mappng should be appled to acheve the fnal photo-realstc model. Fgure 2.1 The baselne of the mage-based 3-D model reconstructon algorthms Super-pxel based IBM Generally, IBM processng s consdered hgh cost and neffcent for texture-less mages. We propose a super-pxel based IBM method to solve the problem. The pxels of the reference mages are clustered nto super-pxels n preprocessng before reconstructon. Then the reconstructng s based on these super-pxels. Fgure 2.2 gves an outlne of the super-pxel based approach. Frst, an mage sequence s nput, camera parameters are derved for the subsequent processng based on a general structure and moton method. At the same tme, good mages are selected for reference and splt nto small segments super-pxels. Feature ponts are then extracted for each mage, but correspondent relatve matchng ponts are found only between mages relatng to the references. From the salent matchng ponts and camera parameters, 3-D ponts are 15

18 derved va trangulaton and used to construct salent-bass 3-D surface patches. However, the number of these patches s small. It s necessary to expand the patches to nearby voxels n lne wth the geometrc relatonshps lnkng the super-pxels. A photometrc dscrepancy functon s used to remove the ncorrect 3-D surface patches at the post-expanson stage. Fnally, the 3-D patches are used to reconstruct the polygonal surface model va an optmzaton procedure. Fgure 2.2 Outlne of the super-pxel based 3-D modelng approach Camera calbraton s a basc technque n mult-vew mage processng. Many methods have been developed and appled n related areas. The bundle method [61] s used to smulate the process of structure and moton analyss. After ths step, a sparse set of 3-D ponts and camera parameters descrbng the relatonshp between the camera and the scene are occuped. The concept of super-pxel s proposed by Ren and Malk[58], orgnally defned as a knd of over-segmentaton. Super-pxel n ths study s defned as a group of many adjonng pxels, where all pxels have smlar propertes. For the applcaton on 3-D reconstructon, super-pxel has four specfc propertes: 16

19 Most of feature ponts should be on the boundares of super-pxels. Surfaces wthout textures should also be splt nto small clusters no specfc rule s appled for such splttng. Brghtness and texture dfferences n super-pxels are homogeneous. mage. The number of super-pxels should be large enough to preserve more features n one In ths study, we arrange ntal seeds under a lattce grd, where one ntal seed s selected n each grd wth the most sgnfcant features. If there are no sgnfcant features n a grd, the center pont of the grd s selected. The feature s defned usng the Harrs and SIFT method. Followed by the seed ntalzaton, the next step s to generate the local optmal path connected neghborhood of each seed vertcally and horzontally. The searchng algorthm of shortest path s used to fnd the path. We defne an regon undrected graph accordng to the poston and gradent of pxels. In the undrected graph, the node denotes the pxel, and each node has a weght. The weght s defned as: w 1.0 /1.0 s * t 2.1 t d d d d 2.2 x x y y Where d x and d y s horzontal and vertcal dervatves. Senstvty of s s a tunng parameter. Djkstra algorthm s employed to generate the shortest path, and seed connecton can be accomplshed. We use several ponts sequence on the edge of seed connecton to defne t. In our work, no more than 5 Harrs or SIFT feature ponts are selected for an 17

20 edge. Therefore, every super-pxel s depcted by a feature ponts sequence. Fgure 2.3 shows an example of super-pxels. It s obvously that most of feature ponts n ths mage, such as object boundares, are located on the edges of super-pxels. Fgure 2.3 An example of resultng super-pxels. After the defnton of super-pxel, 3-D surface patch s defned as followng. A 3-D surface patch model p s essentally a local tangent plane approxmaton of a surface wth specfcally boundary correspondng to a super-pxel n the reference mage. The projecton of a 3-D surface patch onto the reference mage forms a super-pxel n the mage. It has four propertes: poston pos p ; unt normal vector n p ; vertexes node p correspondng to the super-pxel polygon vertexes; and a reference mages r p n whch p s vsble. In contrast to the usual defnton of a patch [7], the patch referred here may be polygonal rather than rectangular Fgure

21 3D surface patch node p n p node p node p pos p node p node p Fgure 2.4 Patch defnton 3-D surface patch s correspondng to 2-D super-pxel. Frst, as a normal dense matchng method, feature ponts n each mage are detected usng SIFT and Harrs operators. In ths study, super-pxel s appled to reduce the processng tme and ncrease computng effcency by control the threshold number of features n super-pxels. Only 4 feature ponts are computed for a super-pxel wth the sze of 256 pxels n ths study. For each feature pont n the super-pxel of the reference mage I r, the best matchng pont f f ' n the other mage s searched, and 3-D ponts c assocated wth the pars of matchng ponts are trangulated as shown n Eq.2.3. Multple 3-D ponts may be derved because multple feature ponts the most s 4 n a sngle super-pxel may be derved. The mean coordnates of these ponts are set for the ntal poston of ths 3-D surface patch correspondng to the super-pxel see Eq c { trangulaton from f and f '} 2.3 pos p c 2.4 The drecton vector of the patch s defned as a unt vector from a poston c orentng toward the camera O I of the reference mage as r 19

22 n p pos p O I / pos p O I 2.5 r r It s clearly that a pont and a normal drecton can defne a plane. The nodes of 3-D surface patches are defned by the ntersecton ponts va back projecton of the nodes of super-pxel polygon wth the patch plane see Fgure 2.5. For the ntalzaton of reference mage r p, t s frst ntalzed from the outcome of camera calbraton, and optmzed by removng some mages wth bgger photometrc dscrepancy score compared to others. Indeed some patches are also removed when all the dscrepancy scores are bg. Nodes of super-pxel Nodes of 3D patches pos p 3D surface patch Back projecton I 1 Super-pxel f I 2 O I r Fgure 2.5 The procedure of back projecton, durng ths procedure3-d surface patch nodes are acqured. As some mages are texture-less mages, and some patches are removed for bg dscrepancy score. The ntal 3-D surface patches are sparse, not all super-pxels have relatve 3-D surface patches. Ths study uses an expanson procedure to create more patches. The goal of the expanson s to reconstruct correspondng patch for each super-pxel. In the study, the super-pxel s called fxed super-pxel, when t has a relatve 3-D patch. Ths spreads fxed super-pxel to nearby super-pxels and makes as many super-pxels assocated wth a correspondng determnate 3-D patches as possble. For a 3-D surface patch, all the neghborng super-pxels of ts fxed super-pxel are 20

23 dentfed n the reference mage. If the neghbor s already assocated wth a patch, expanson n ths orentaton s termnated. For a known patch p and ts correspondng fxed super-pxel s, a new patch p ' correspondng to the neghborng super-pxel s ' s generated as followng. Unt normal vector n p' and reference mages r p' are ntalzed by replcatng the values of the known patch p. Poston pos p' s ntalzed based on the ntersectng pont where a vewng ray passng through the center of the super-pxel s ' ntersects the plane of patch p. Depth testng s also performed to prevent expanson from the depth wth dramatc changng. Untl now, the expanded patches are some coarse ones because ther normal vectors may be wth bg mstakes. As Fgure 2.6 shows, when the expandng surface plane s curved surface, the normal vector of patch must be corrected as Fgure 2.7 shows. The problem can be thought as a nonlnear mnmzaton problem. Quas-Newton method wth a relevance evaluaton s appled to get the rght normal vectors. 3D surface patch Patch poston Surface s plane Patch orentaton Surface s curved Super-pxel Fgure 2.6 Patches expanded on curved surface. Normal vector correct s necessary. 21

24 Fgure 2.7 The purpose of normal vector correcton s to fnd the angles a and b, t s a nonlnear mnmzaton problem. The proposed algorthm was tested wth a number of datasets, and was found to be vald as Fgure 2.8 and Fgure 2.9 shows. Its applcablty to the processng of complcated surface objects s currently lmted e.g. leaf, grass etc. Fgure 2.8 Sample mages from the nput dataset 22

25 Fgure 2.9 Fnal polygon model smulated from 3-D surface patches usng meshlab software. 23

26 2.2 3-D Object Retreval In ths secton, we wll ntroduce recent progress on 3-D object retreval. 3-D object retreval can be dvded nto two types of methods,.e., model-based methods and vew-based methods Model-based Methods Most early 3-D object retreval methods are model-based method. For model-based methods, exstng vrtual 3-D model nformaton s requred. For these methods, low level features, such as volumetrc descrptor [27], surface dstrbuton [21] and geometry[20, 28, 29] can be extracted for object descrpton. Other methods [63, 64] extract hgh level feature for object structure descrpton. To extract model-based feature, Papadaks et al.[35] ntroduced a panoramc vew,.e., panoramc object representaton for accurate model attrbutng PANORAMA, where were generated by projectng the model to a lateral surface of a cylnder. To compare two 3-D models, the dstance was measured by matchng between two PANORAMA mages. In [30], Gao et al. ntroduced a spatal structure crcular descrptor SSCD, whch employed the projected model nformaton n a crcular regon to represent the 3-D model. In SSCD, the projected mage s able to preserve the global spatal nformaton, and the comparson between two 3-D models s acheved by the hstogram dstance for each SSCD vew. Vranc et al. [37] ntroduced an Extenson Ray-based Descrptor ERD method, where the concentrc spheres were used to extract the surface nformaton. In ths method, each samplng surface pont had a value on the correspondng sphere surface. The dsadvantage of model-based 3-D object retreval s that the 3-D model nformaton s requred for the 3-D object retreval. In the case where no 3-D model s avalable, the 3-D model reconstructon s needed to generate a model frst Vew-based Methods 24

27 Vew-based method s much more flexble compared wth model-based methods, because t does not need the vrtual model nformaton. For vew-based methods, each 3-D model s represented by clusters of multple vews and the feature s extracted from vews. The general process s composed of four steps: vew capture, vew selecton, feature extracton and object matchng as shown n Fgure D object data The query set Vew Vew Feature Object capture selecton extracton matchng Retreved objects Fgure 2.10 The general framework of vew-based 3-D object retreval Chen et al. [38] proposed the frst vew-based 3-D object retreval method,.e., Lghtng Feld Descrptor LFD. In LFD, several groups of 10 vews are used to represent each 3-D object. For these vews, the Zernke moments and Fourer descrptors were employed as the features. To compare two 3-D objects, the mnmal dstance between two groups of vews were used n [38]. Shh et al.[39] ntroduced an Elevaton Descrptor ED, whch employed sx range vews from dfferent drectons to represent 3-D objects. In ED, the depth hstogram was extracted as the ED feature and matchng between two ED hstograms was measured as the dstance between two 3-D objects. To represent 3-D objects va a set of vews, fve sets of mages, from four vertcal and one horzontal loop drectons, were employed n [65]. In ths method, each group of vews were formulated as a Markov Chan MC. The comparson between two 3-D models can be dvded nto the comparson n the vew set level and the comparson n the model level, and the objectve of 3-D model retreval was to fnd the maxmal a posteror MAP gven the query model. Daras et al. [32] ntroduced the Compact 25

28 Mult-Vew Descrptor CMVD, whch contaned 18 vews from the vertces of a 32-hedron. Mahmoud et al. [40] proposed a model to employ the curvature scale space as the vew descrptor. The curvature scale space was combned wth Zernke moments to compare 3-D models. Adan et al. [41] proposed a depth gradent mage DGI model, whch employed both the surface and the contour nformaton to avod restrctons concernng the layout and vsblty of 3-D models. To select representatve vews from the large vew pool, a query vew selectonqvs method was ntroduced n [45]. In QVS, a small set of canddate vews were frst selected va vew clusterng. Then the user relevance feedback was nvolved to nteractvely select representatve vews. In ths way, the vews were ncrementally selected, whch can be more dscrmtve to the query nformaton. Ansary et al. [23] proposed the Adaptve Vews Clusterng AVC, where 320 ntal vews were frstly captured and about 20 to 40 representatve vews were selected. The objectve for 3-D model retreval was formulated as a probablstc approach to measure the posteror probablty for one object gven the query object, and the hgher the posteror probablty of one 3-D model, the hgher relevance between t and the query. In [34], a probablstc matchng method was ntroduced, where a postve matchng model and a negatve matchng model were generated ndvdually. For each target object, the postve matchng rato and the negatve matchng rato were measured and then combned for retreval. To measure the dstance between two groups of vews, Gao et al. [31] proposed a Hausdorff dstance learnng method, where a vew-level Mahalanobs dstance metrc for Hausdorf dstanc was learnt through relevance feedback. Hypergraph learnng has also been nvestgated n 3-D object retreval [30], where the relatonshp of 3-D objects were formulated n a hypergraph structure. In recent years, the bag-of-words methods have been nvestgated n 3-D object retreval [46, 47, 48]. Ohbuch et al. [24] ntroduced a bag-of-vsualfeature BoVF method, where the local SIFT features [49] were extracted from the mages and vsual 26

29 words were generated usng a clusterng-based vsual vocabulary. A hstogram of vsual words was bult as the feature for the 3-D object, and the Kullback-Lebler dvergence KLD was used to measure the dstance between 3-D objects. 27

30 Chapter 3 3. Mult-Scale Object Retreval va Learnng on Graph In ths Chapter, we ntroduce the proposed mult-scale object retreval va learnng on graph. The proposed method s composed of three components,.e., mult-vew object dstance, mult-scale object graph constructon and learnng on the graph, as shown n Fgure Shape Feature Extracton Feature extracton s an ntegral part of multmeda nformaton retreval. For vew feature extracton, several effectve features can be used, such as Fourer descrptors[8] and Zernke moments[50,51,23,62]. In our work, the Zernke moments are employed as the vsual descrptors. Moments are consdered as popular pattern representaton methods. Zernke moments are a class of orthogonal moments and have been shown effectve n terms of mage representaton. Zernke moments are rotaton nvarant and can be easly constructed to an arbtrary order. Zernke moments are constructed usng a set of complex polynomals whch form a complete orthogonal bass set defned on the unt crcle. Zernke polynomals are expressed n polar coordnates by { P st x, y} as jt P x, y P, R e, 3.1 st st st 28

31 Where s 0,1,2,..., and defnes the order, t s an postve or negatve nteger depctng the angular dependence, or rotaton, subject to the condtons: s t even, t s 3.2 And, are defned over the unt crcle, j 1 and R s the orthogonal radal polynomal, whch s defned as st R st s t z 1 F s, t, z, z0 where F s, t, z, s z! s t s t z! z! z! 2 2 s2z 3.4 Zernke moments are the projectons of the mage functon onto the orthogonal basc functons. The s t th Zernke moment for an mage functon f x, y s defned as s 1 ZM st f x, y P st, 3.5 x y k The number of moments for the k th order s k 1 k Mult-vew Object Dstance Dstance metrc plays an mportant role n multmeda nformaton retreval. For the mage retreval task, whch s based on the matchng of two sngle mages. 3-D object retreval s more complex as the hgh-order nformaton contaned n the multple vews of 3-D object. Mult-vew 3-D object matchng s a many to many matchng problem. 29

32 Here, we let O { v 1, v 2,... vn} denote one object O wth n vews, and let O j { v j1, v j 2,... v jn } denote another object O j. For each vew v a, a shape feature s extracted for vew representaton. The Zernke moments are employed as the vsual descrptors. To measure the dstance between O and O j based on multple vews, the followng dstance measure s employed n our work: n 1 d O, O j d mn va, O j 3.6 n a1 where d mn v a, O j s the mnmal dstance between v a and all the vews n O j : d mn v a, O j mn{ d v, v b [1, n]} 3.7 a jb Here, d v a, v jb s the Eucldean dstance between v a and v jb based on the Zermke moments feature. 3.3 Mult-scale Object Graph Constructon In our work, the relatonshp among objects are formulated n a object hypergraph. Here, each object s regarded as a vertex n the object hypergraph V, E,, and how to construct the vertex connecton s mportant. Here, the star expanson method n [52] s employed to generate the hyperedges. Each tme, one vertex s selected as the centrod vertex and a hyperedge s generated by connectng ts nearest neghbors. A parameter K s used to select the number of nearest neghbors for each hyperedge. Fgure 3.1 llustrates the constructon of hyperedge va star expanson. We note that dfferent selectons of K ndcates dfferent representaton scales for object formulaton. A large K value ndcates a large amount of objects can be connected by one edge and a small K value wll lead to strct constrant on the smlartes of 30

33 connected objects by each edge. As t s dffcult to dentfy the optmal K value for hyperedge constructon, multple K values are used n our work to construct edges on the hypergraph, whch leads to a mult-scale object graph. As shown n Fgure 3.1, multple K values can construct multple hyperedges. The mult-scale object hypergraph V, E, s composed by a vertex set V, a hyperedge set E, and the weghts of the edges. Here, each vertex v V denotes one object, the each edge e E s constructed va star expanson, and the weght s assgned wth the equal wght 1. Fgure 3.1 Hyperedge generaton usng dfferent K values. The structure of the object hypergraph s presented by an ncdence matrx H, where the entry of H s calculated by: f v e f v e 1 H v, e For each vertex, the vertex degree d v s defned as: d v e E ehv, e

34 For each hyperedge, the edge degree e s defned as: e v V h v, e 3.10 For the vertex degree and the edge degree, two matrces D v and D e are used to denote the dagonal matrces of the vertex degrees and the edge degrees respectvely. 3.4 Graph Learnng for Object Retreval To estmate the relevance among objects, t s mportant to explore the relatonshp of the vertces n the hypergraph structure. In recent years, learnng methods [53, 54] have been performed on hypergraphs, such as data clusterng, classfcaton and rankng. In our work, we formulate the object relevance exploraton task as a bnary classfcaton task,.e., whether one object belongs to the query type or not. here, The followng normalzed Laplacan hypergraph learnng framework s employed arg mn{ R f f } 3.11 f where f s the relevance from each object to the query, R f s the emprcal loss of hypergraph learnng on the labeled data,.e., the query, f s a regularzer on the hypergraph structure, and s a parameter to balance dfferent components on the objectve functon. The emprcal loss R f s defned as: 2 f f y uv R f u y u

35 33 where y s the labeled vector. In y, all the entres are zeros expect the query, whch s one. The regularzer on the hypergraph f s defned as: f I f e v d u d v f e v h e e u h u f e e v h u d e u h e u f v d u d v f u f u d u f e e v h e u h e f T E e V v u V v E e V u E e V v u,,,,,,, 2, where v T e v D H HWD D. Here we let I, equaton can be rewrtten as f f f T 3.14 Now, the cost functon on the learnng task can be rewrtten as: 2 y f f f f T 3.15 The relevance among objects can be learned by the mnmzng the objectve functon, and the optmal f can be solved va: y f To further reduce the computatonal cost, Eq.3.16 can be solved usng an teratve process as shown n the follows. We frst ntalze f wth 0 t. Then, we update f

36 by f t 1 1 t 1 f 1 y We let t t 1and then go back to update f. Ths process s repeated untl the cost functon does not further sgnfcantly reduced. Wth the learned relevance vector f, all the objects can be ranked n a descendng order, whch generates the object retreval results. 3.5 Computatonal Cost In ths part, we analyze the computatonal cost of our proposed method. The man computatonal load les n the hypergraph constructon and the learnng part. It can be obtaned that the hypergraph constructon process costs O n n K O, where n K s the number of employed K n the hyperedge constructon part, and no s the number of objects n the dataset. The computatonal cost for the learnng part s O n n O, where nt s the teraton number for the alternatng optmzaton process. 2 t 34

37 Chapter 4 4. Experment In ths secton, we ntroduce the testng datasets, compared methods, evaluaton crtera and expermental results. 4.1 Evaluaton Datasets To evaluate the performance of the proposed method, we have conducted 3-D object retreval experments on two publc datasets,.e., Natonal Tawan Unversty 3-D Model database NTU [38], and Edgenösssche Technsche Hochschule Zürch 3-D object dataset ETH [55]. Fgure 4.1 and Fgure 4.2 shows examples n the NTU and the ETH datasets Natonal Tawan Unversty 3-D Model DatabaseNTU The NTU 3-D model database contans two parts, one s the NTU 3-D model benchmark and the other s the NTU 3-D model database. NTU provdes 3-D models for research purpose n 3-D model retreval, matchng, recognton, classfcaton, clusterng and analyss. The benchmark contans a database of 1,833 3-D models, whch are free downloaded from 3-DCafe n Dec. 2001, but removes several models wth faled formats n decodng. The benchmark was clustered nto 47 classes ncludng D models manly for vehcle and household tems, and 35

38 all the other 1,284 models classfed as "mscellaneous". 3-D models n the mscellaneous class are not the same functon but nose for correct retreval. The database contans a database of 10,911 3-D models, whch are free downloaded from the Internet n July All 3-D models are converted nto Wavefront fle format.obj n the database. Thumbnal mages of each 3-D model are also ncluded n the database. The frst dataset used n our work s the NTU benchmark. In the NTU dataset, each object contans a correspondng 3-D model. Here, the vrtual cameras are set to capture multple vews for each object. In our work, a vrtual camera array wth 60 cameras are used, whch locate on the vertces of a polyhedron wth the smlar structure of Buckmnsterfullerene as Fgure 4.2 shows. Usng these vrtual cameras, 60 vews can be obtaned for each 3-D object. Fgure D object examples n the NTU database 36

39 Fgure 4.2 Structure of Buckmnsterfullerene The Edgenösssche Technsche Hochschule Zürch DatabaseETH The ETH dataset s a real world 3-D object dataset wth multple vews. There are 80 objects from 8 categores n the ETH dataset, such as apple, pear, tomato, dog, cow, cup, car, and horse. In ths dataset, each object contans 41 vews, whch are captured spaced evenly over the upper vewng hemsphere. All the cameras are located on the postons by subdvdng the faces of an octahedron to the thrd recurson level. All mages have been taken wth a Sony DFW-X700 progressve scan dgtal camera wth pxel resoluton and a Tamron 6-12mm varfocal lens F1.4. For every mage, a hgh-qualty segmentaton mask s provded. Fgure D object examples n the ETH database 37

40 4.2 Compared Methods To evaluate the 3-D object retreval performance of our method, we employ the followng state-of-the-art methods for comparson Elevaton DescrptorED Elevaton descrptor ED [39] s a vew-based 3-D object retreval method, whch proposed a new feature for 3-D model retreval, for 2-D slhouettes represented by bnary mages do not descrbe the alttude nformaton of the 3-D model from dfferent vews well. Based on the new feature, a new content-based multmeda retreval system of 3-D models retreval s desgned. One 3-D model should be represented wth sx elevatons to descrbe ts alttude nformaton from sx dfferent vews. Each evaluaton s represented by a 2-D gray-level mage decomposed nto several concentrc crcles and obtaned by takng the dfference between the alttude sums of two successve concentrc crcles. For each 3-D model, a tghtest boundng box s constructed and decomposed nto a 2L 2L 2L voxel grd. voxel m, n, h represents a voxel located at m, n, h. Based on whether there s a polygonal surface located wthn the voxel, voxel m, n, h 1 or voxel m, n, h 0. As a result, each voxel wth a polygonal surface s weghted equally. Let the model s center move to the locaton L, L, L, the average dstance from all voxels wth voxel m, n, h 1 to the center s lnearly. In ths way, ED s robust for rotaton and nvarant to translaton and scalng of 3-D models. In addton, an effectve way for extractng features from each gray-level mage s employed n order to make them less senstve to rotatons. Each elevaton s decomposed nto L concentrc crcles C j, j 1,2..., L around the center pont to extract the ED from sx elevatons. For the k th elevaton, g k j s 38

41 the sum of gray values of pxels n the j th crcle and calculated by gk j fk r, c, j 1,2,..., L r, c C j 4.1 The dfference between two successve concentrc crcles s defned as: dk j gk j gk j and the sum of all d k j values for the k th elevaton s D k d k j 4.3 L j1 The ED X of a 3-D model s defned as: X ] T T T T T T T [ x1, x2, x3, x4, x5, x6 4.4 where x ] T k [ xk 1, xk 2,..., xk L. For a 3-D model, sx elevatons are obtaned to descrbe the alttude nformaton of 2-D projectons from sx dfferent vews: front, top, rght, rear, bottom and left whch are notated successvely as, k 1,2..., 6. Takng the relatve postons of the E k elevatons nto account, t can dvde the sx elevatons nto three pars E6 E, E, E, E, E, and reduce the matchng tme between two models to 3! matchng operatons. For the th permutaton, 1 48 p matchng operaton, the dstance between X ] T T T T T T T [ x1, x2, x3, x4, x5, x6 and Y ] T T T T T T T [ y1, y2, y3, y4, y5, y6 s defned as 39

42 6 6 L X, Y xk y p k xk r y p k r k1 k1 r1 Ds 4.5 and the dstance between these two models s Ds mn X Y Ds, X, Y The smlarty measure between X and Y s defned as Sm X, Y Ds X, Y The larger the smlarty value s, the more smlar a model s. Take the model wth the largest smlarty value as the retreved model Adaptve Vews ClusterngAVC Adaptve vews clusterng AVC [23] provdes a probablstc Bayesan method for 3-D model retreval from these vews. A set of characterstc vews V V, V,..., V } represent a model n model collecton { 1 2 c D b, wth C the number of characterstc vews. Correspondng to a 3-D request model Q, the target retreval model M s the closest one n hghest probablty of P M Q. P M Q can be wrtten as D b, wth the P M K k 1 k k Q P M V P V Q 4.8 Q Q Where K s the number of characterstc vews of the model Q. Let H be the set of all the possble hypotheses of correspondence between the request vewv and a k Q 40

43 model k k k M, H h h... h }. A hypothess { 1 2 N k h p means that the vew p of the model s the vew request k V. The sgn represents logc or operator. If an cq hypothess k h p s true, all the other hypotheses are false. P M V cq can be expressed k k by P M H. N k j k P M H j P M, VM h 1 j 4.9 The sum j N j k P M, VM h 1 j can be reduced to the only true hypothess P M, V j c M H k j. In fact, a characterstc vew from the request model Q can match only one characterstc vew from the model the maxmum probablty. M. The characterstc vew s chose wth P M K k 1 Q j k k Max P M, V h P V Q 4.10 j M j Q Usng the Bayes theorem: P M k j j K P h j VM, M P V M P M M k Q Max P V Q k 1 j Q 4.11 N K k j j P h V, M P V M P M 1 k 1 j M M Wth P M the probablty to observe the model M. 1. M / M N P M e. Where M s the number of characterstc vews of the model M. s a parameter to hold the effect of the probablty P M. The algorthm concepton makes that, the complex s the geometry of the 3-D model, the greater s the number of ts characterstc vews. 41

44 On the other hand P V j M j. N Vr / 320 M e j V r M M 1 Where N s the number of vews represented by the characterstc vew j of the model M. The greater s the j V r M number of represented vews N, the more the characterstc vew V j c M s mportant and the best t represents the 3-D model. The coeffcent s ntroduced to reduce the effect of the vew probablty. k j The value P h V, M s the probablty that, knowng that we observe the j M characterstc vew j of the model M, ths vew s the k vew of the 3-D query model Q : P h V k j j M, M 1 D Wth D h q, h the Eucldean dstance between k Q, h j v M j v M the 2-D Zernke descrptors of Q and of the V j c M characterstc vew of the 3-D model M. In AVC, the representatve vews are selected ether from the 60 vews n the NTU dataset or the 41 vews n the ETH dataset. Then, the probablstc matchng s conducted to measure the relevance between each object to the query Query Vew Selecton MethodQVS For query vew selecton QVS [45], the query vews are nteractvely selected and ncrementally ncreased. In ths method a group of vews are provded to descrbe the query object Q q, q,..., q } that contans m vews. Vew clusterng s frst { 1 2 m conducted to group vews nto clusters usng a herarchcal agglomeratve method[66]. The central vew s selected from each cluster to generate a canddate vew set Q ~ wth r canddate query vews. After generatng the canddates, a vew graph s constructed based on the relatonshp among the canddate query vews, and a random walk process s employed to select the ntal query vew. In the vew graph, each node denotes a 42

45 canddate and the edge between two canddates smlarty: v and v j s defned as ther vsual s j d v, v j exp 4.12 The transton probablty between the th and the j th canddates s defned as sj p j 4.13 s k k and the random walk process s actually repeatng t 1 t 0 k pk 1 k 4.14 untl convergence. The canddate q ~ K wth the hghest score s selected as the ntal query vew. If users are not satsfed wth the ntal retreval results, a relevance feedback process can be conducted to refne the search results by mnmzng ts dstance to the postve samples and maxmzng ts dstance to the negatve samples. A dstance metrc can be learned for the selected query vew wth the crteron that mnmzes the dstance between the selected query vew and the relevant objects and maxmzes ts dstance to the rrelevant objects. All the selected query vews are combned usng the learned weghts for next search. The dstance estmaton of two mult-vew 3-D objects n QVS plays the central role n the retreval process. Gven two objects Q and O, the dstance d Q, O s defned as d Q, O d q ~, O 4.15 K j1 j j 43

46 where K s the number of the vews of the query object, and d q ~ j, O s defned by ~ mn ~ T d q, O q v W q ~ v 4.16 j j, p j j, p P Here q ~ j s the j th selected query vew of Q, and p v, s the p th vew of the th object n the database. And Mahalanobs dstance metrc for W j s the correspondng weght metrc of the q ~ j. In our experments, we only compare our proposed method wth the QVS method wthout relevance feedback. In our experments, we mplemented ED, AVC and QVS followng the ntroductons n [39, 23, 45]. For our method, s set as 100, and the number K of nearest neghbors s selected as {5; 10; 15; 20}. In 3-D object retreval experments, each tme one object s selected as the query, and ths process s repeated untl all the objects are severed as the query once. The average retreval performance for all the objects n each dataset wll be used for comparson. 44

47 4.3 Evaluaton Crtera For measurng the 3-D object retreval performance, evaluaton crtera s mportant to evaluate the dfferent methods. Gven a vew of the query object, 3-D object retreval method can be appled to calculate the smlarty wth other objects n object database. After gotten the fnal rankng lst of retreved 3-D objects, the outcome should be justfed by some evaluaton crtera. We employ the followng crtera to compare dfferent methods n our experments. The crtera ncludes nearest neghbor precson, F-measure, dscounted cumulatve gandcg, and average normalzed modfed retreval rankanmrr The Accuracy of The Nearest NeghborNN The accuracy of the nearest neghbor NN evaluates the retreval accuracy of the frst returned result. NN ranges from 0 to 1, a hgher value ndcates better performance. The NN s a smple but effectve crtera whch s defned as 4.17 shows F-MeasureF F measure[67] s a composte measure of both the precson and the recall for a fxed number of returned results. The precson of retreved objects can be defned as follows: {relevant objects} {retreved objects} precson 4.17 {retreved objects} Where {retreved objects} are the retreved objects gven the query, {relevant objects} are the relevant objects for the querythe groundtruth. X s the number of objects n X, and X Y s the ntersecton whch specfy the objects n both sets X and Y. The precson value ranges from 0 to 1. Precson of 1 ndcates 45

48 all the retreved results are correct. The recall evaluates the recall of retreved objects compared wth the groundtruth. The recall s defned as follows: {relevant objects} {retreved objects} recall 4.18 {relevant objects} The recall value ranges from 0 to 1. Recall of 1 ndcates that all the relevant results have been correctly retreved. A commonly used performance measure that combnes Precson and Recall s the F-measure, also known as the balanced F-score: P R 2 P R K K F 4.19 K K Where K s the number of selected top returned results, the top K results, and P K s the precson for R K s the recall for the top K results. F ranges from 0 to 1, a hgher value ndcates better performance. In our experment K s set as 20. F s defned as F 2 P R P R where P 20 and R 20 are the precson and the recall of the top 20 retreval results, respectvely Dscounted Cumulatve GanDCG DCG[56] measures the rankng performance of the retreved result lst, whch s a statstc that gves relevant object hgh score. DCG works under the assumpton that a 46

49 user s less lke to consder lower results. The DCG value s then defned as G[1] f 1 DCG [ ] G[ ] 4.21 DCG[ 1] otherwse log 2 where 1 f the th result s correct G [ ] otherwse We explore the behavor of DCG as the relatve weght gven to hghly relevant objects vares. By manpulatng ths weght we can closely approxmate ether evaluaton by all relevant objects or evaluaton by hghly relevant objects only. DCG ranges from 0 to 1, a hgher value ndcates better performance. Assumng the number of all relevant objects s, and the number of all objects s n, the maxmal DCG s computed as DCGn DCG max log Average Normalzed Modfed Retreval RankANMRR ANMRR[57] measures the rank performance gven a rankng lst, whch consders the rankng nformaton of relevant objects among the top-retreved objects. ANMRR ranges from 0 to 1, and the smaller the value of ths measure the better the matchng qualty of the query s. ANMRR s defned as follows: To calculate ANMRR, the average retreval rank AVR Q for a gven k th query Q k s depcted as: k NR Q k RANK AVR Q 4.24 k 1 NR Qk 47

50 where NR Qk s the number of relevant objects for the query Q k. If the th result s relevant to the query then RANK s the rankng poston; otherwse RANK S k. k S s the top-ranked returned retrevals, where: S k mn{ 4 NR Q,2GMT} 4.25 k and GMT s the maxmal number of relevant objects for all queres. The modfed retreval rank s: MRR 1 NR Q AVR Qk k Qk Then the modfed retreval rank can be normalzed to compute the normalzed MRR as follows: NMRR Q k MRR Qk NR Qk 1.25 Sk 2 Fnally, the average NMRRANMRR can be calculated by averagng the NMRR values over all queres: n NMRR Qk k ANMRR 1 n 4.28 Where n s the number of queres. 48

51 4.4 Expermental Results Expermental results on the NTU dataset and comparson of dfferent methods are shown n Fgure 4.4. As shown n these results, our proposed method,.e., mult-scale object graph learnng MSOGL, acheves the best performance compared wth other methods. Based on NN, the proposed method acheves an mprovement of %, 53.72%, and 12.66%, compared wth ED, AVC, and QVS, respectvely. In terms of F, the mprovement from the proposed methods s 99.40%, 27.80%, and 7.77%, respectvely. For DCG, the gan s 55.32%, 8.06%, and 6.01%, respectvely. Regardng the rankng performance, the proposed method acheves an mprovement of 9.96%, 4.03%, and 3.31% n terms of ANMRR compared to ED, AVC, and QVS, respectvely. We can also observe smlar results on other crtera. Fgure 4.4 Expermental results on the NTU dataset. 49

52 Expermental results on the ETH dataset and comparson of dfferent methods are shown n Fgure 4.5. As shown n these results, our proposed method acheves the best performance compared wth other methods. Based on NN, the proposed method acheves an mprovement of 23.8%, 23.8%, and 11.8%, compared wth AVC and QVS, respectvely. In terms of F, the mprovement from the proposed methods s 19.69% and 11.94%, respectvely. For DCG, the gan s 10.08% and 4.12%, respectvely. Regardng the rankng performance, the proposed method acheves an mprovement of 25.1% and 14.3% n terms of ANMRR compared to ED, AVC, and QVS, respectvely. Smlar results can be observed from other crtera. Fgure 4.5 Expermental results on the ETH dataset. As shown n the results, we can have the followng observatons. 50

53 The proposed acheves the best performance compared wth all other compared methods. Ths satsfactory result can be dedcated to the better formulaton of our method on object relatonshp. The proposed method s able to formulate the connectons among objects from multple scales. Compared wth exstng formulaton methods, our proposed method can be much more robust to object varatons. In ths way, dfferent connectons among objects can be modeled n the hypergraph structure and the learnng on hypergraph can explore the optmal relevance among these objects. The gan on the ETH dataset s smaller than that on the NTU dataset. Ths can be dedcated to the hgh performance of all methods on the ETH dataset. The better performance can be dedcated to the flexble structure of the proposed method. In the mult-scale relatonshp formulaton,.e., mult-scale hypergraph constructon, the proposed method s able to explore all possble vew matchng results, whch s hard to fnd the best one n tradtonal methods. In ths way, the proposed method can acheve better performance through the optmal data dstrbuton modelng. 51

54 4.5 Analyss On the Mult-scale Hypergraph Connectons In ths subsecton, we evaluate the nfluence of the mult-scale hypergraph connectons. The parameter K ndcates how many nearest neghbors can be connected by one hyperedge. When K s too large, the connected objects wll be much dssmlar. When K s too small, the constructed hyperedge wll lose the dscrmnatve performance. Here we evaluate dfferent K values and the combnaton of multple K values, whch s used n our work. More specfcally, we vary K from 5, 10, 15, to 20, and the expermental results on the two datasets are demonstrated n Fgure 4.6 and Fgure 4.7, respectvely. As shown n these two fgures, the performance wth sngle K value vares a lot wth respect to the selecton of K, and dfferent K values have dfferent performance n the two datasets. When multple K values are employed, as shown n our method, the overall performance becomes steady and better than that of each sngle one. These results can demonstrate the effectveness of our proposed mult-scale hypergraph connecton method. 52

55 Fgure 4.6 Expermental results wth respect to dfferent connecton numbers on the NTU dataset. 53

56 Fgure 4.7 Expermental results wth respect to dfferent connecton numbers on the ETH dataset On Parameter In ths secton, we evaluate the nfluence of the parameter on the 3-D object retreval performance. s a parameter to balance dfferent components n the learnng formulaton. In ths part, we vary from to 10000, and evaluate the performance of our method. Expermental results on the two datasets are demonstrated n Fgure 4.8 and Fgure 4.9, respectvely. As shown n these results, when s too small, such as 0.001, the performance s not very satsfactory. When vares n a large range, such as from 1 to 10000, the performance s steady and satsfactory. These results can demonstrate that the proposed method s robust to the selecton of the parameter. 54

57 Fgure 4.8 Expermental results wth respect to dfferent values on the NTU dataset. 55

58 Fgure 4.9 Expermental results wth respect to dfferent values on the ETH dataset. 56

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

An efficient method to build panoramic image mosaics

An efficient method to build panoramic image mosaics An effcent method to buld panoramc mage mosacs Pattern Recognton Letters vol. 4 003 Dae-Hyun Km Yong-In Yoon Jong-Soo Cho School of Electrcal Engneerng and Computer Scence Kyungpook Natonal Unv. Abstract

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

3D vector computer graphics

3D vector computer graphics 3D vector computer graphcs Paolo Varagnolo: freelance engneer Padova Aprl 2016 Prvate Practce ----------------------------------- 1. Introducton Vector 3D model representaton n computer graphcs requres

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes

R s s f. m y s. SPH3UW Unit 7.3 Spherical Concave Mirrors Page 1 of 12. Notes SPH3UW Unt 7.3 Sphercal Concave Mrrors Page 1 of 1 Notes Physcs Tool box Concave Mrror If the reflectng surface takes place on the nner surface of the sphercal shape so that the centre of the mrror bulges

More information

Structure from Motion

Structure from Motion Structure from Moton Structure from Moton For now, statc scene and movng camera Equvalentl, rgdl movng scene and statc camera Lmtng case of stereo wth man cameras Lmtng case of multvew camera calbraton

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures A Novel Adaptve Descrptor Algorthm for Ternary Pattern Textures Fahuan Hu 1,2, Guopng Lu 1 *, Zengwen Dong 1 1.School of Mechancal & Electrcal Engneerng, Nanchang Unversty, Nanchang, 330031, Chna; 2. School

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Simplification of 3D Meshes

Simplification of 3D Meshes Smplfcaton of 3D Meshes Addy Ngan /4/00 Outlne Motvaton Taxonomy of smplfcaton methods Hoppe et al, Mesh optmzaton Hoppe, Progressve meshes Smplfcaton of 3D Meshes 1 Motvaton Hgh detaled meshes becomng

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects

A Comparison and Evaluation of Three Different Pose Estimation Algorithms In Detecting Low Texture Manufactured Objects Clemson Unversty TgerPrnts All Theses Theses 12-2011 A Comparson and Evaluaton of Three Dfferent Pose Estmaton Algorthms In Detectng Low Texture Manufactured Objects Robert Krener Clemson Unversty, rkrene@clemson.edu

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

PRÉSENTATIONS DE PROJETS

PRÉSENTATIONS DE PROJETS PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Image Fusion With a Dental Panoramic X-ray Image and Face Image Acquired With a KINECT

Image Fusion With a Dental Panoramic X-ray Image and Face Image Acquired With a KINECT Image Fuson Wth a Dental Panoramc X-ray Image and Face Image Acqured Wth a KINECT Kohe Kawa* 1, Koch Ogawa* 1, Aktosh Katumata* 2 * 1 Graduate School of Engneerng, Hose Unversty * 2 School of Dentstry,

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping.

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping. SIGGRAPH 004 Interactve Image Cutout Lazy Snappng Yn L Jan Sun Ch-Keung Tang Heung-Yeung Shum Mcrosoft Research Asa Hong Kong Unversty Separate an object from ts background Compose the object on another

More information

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images

Using Fuzzy Logic to Enhance the Large Size Remote Sensing Images Internatonal Journal of Informaton and Electroncs Engneerng Vol. 5 No. 6 November 015 Usng Fuzzy Logc to Enhance the Large Sze Remote Sensng Images Trung Nguyen Tu Huy Ngo Hoang and Thoa Vu Van Abstract

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros.

Fitting & Matching. Lecture 4 Prof. Bregler. Slides from: S. Lazebnik, S. Seitz, M. Pollefeys, A. Effros. Fttng & Matchng Lecture 4 Prof. Bregler Sldes from: S. Lazebnk, S. Setz, M. Pollefeys, A. Effros. How do we buld panorama? We need to match (algn) mages Matchng wth Features Detect feature ponts n both

More information

Review of approximation techniques

Review of approximation techniques CHAPTER 2 Revew of appromaton technques 2. Introducton Optmzaton problems n engneerng desgn are characterzed by the followng assocated features: the objectve functon and constrants are mplct functons evaluated

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga

Angle-Independent 3D Reconstruction. Ji Zhang Mireille Boutin Daniel Aliaga Angle-Independent 3D Reconstructon J Zhang Mrelle Boutn Danel Alaga Goal: Structure from Moton To reconstruct the 3D geometry of a scene from a set of pctures (e.g. a move of the scene pont reconstructon

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

Multi-view 3D Position Estimation of Sports Players

Multi-view 3D Position Estimation of Sports Players Mult-vew 3D Poston Estmaton of Sports Players Robbe Vos and Wlle Brnk Appled Mathematcs Department of Mathematcal Scences Unversty of Stellenbosch, South Afrca Emal: vosrobbe@gmal.com Abstract The problem

More information

Face Recognition using 3D Directional Corner Points

Face Recognition using 3D Directional Corner Points 2014 22nd Internatonal Conference on Pattern Recognton Face Recognton usng 3D Drectonal Corner Ponts Xun Yu, Yongsheng Gao School of Engneerng Grffth Unversty Nathan, QLD, Australa xun.yu@grffthun.edu.au,

More information

A new segmentation algorithm for medical volume image based on K-means clustering

A new segmentation algorithm for medical volume image based on K-means clustering Avalable onlne www.jocpr.com Journal of Chemcal and harmaceutcal Research, 2013, 5(12):113-117 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCRC5 A new segmentaton algorthm for medcal volume mage based

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Gender Classification using Interlaced Derivative Patterns

Gender Classification using Interlaced Derivative Patterns Gender Classfcaton usng Interlaced Dervatve Patterns Author Shobernejad, Ameneh, Gao, Yongsheng Publshed 2 Conference Ttle Proceedngs of the 2th Internatonal Conference on Pattern Recognton (ICPR 2) DOI

More information

Accounting for the Use of Different Length Scale Factors in x, y and z Directions

Accounting for the Use of Different Length Scale Factors in x, y and z Directions 1 Accountng for the Use of Dfferent Length Scale Factors n x, y and z Drectons Taha Soch (taha.soch@kcl.ac.uk) Imagng Scences & Bomedcal Engneerng, Kng s College London, The Rayne Insttute, St Thomas Hosptal,

More information

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation

Quality Improvement Algorithm for Tetrahedral Mesh Based on Optimal Delaunay Triangulation Intellgent Informaton Management, 013, 5, 191-195 Publshed Onlne November 013 (http://www.scrp.org/journal/m) http://dx.do.org/10.36/m.013.5601 Qualty Improvement Algorthm for Tetrahedral Mesh Based on

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS

PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS PROJECTIVE RECONSTRUCTION OF BUILDING SHAPE FROM SILHOUETTE IMAGES ACQUIRED FROM UNCALIBRATED CAMERAS Po-Lun La and Alper Ylmaz Photogrammetrc Computer Vson Lab Oho State Unversty, Columbus, Oho, USA -la.138@osu.edu,

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

KIDS Lab at ImageCLEF 2012 Personal Photo Retrieval

KIDS Lab at ImageCLEF 2012 Personal Photo Retrieval KD Lab at mageclef 2012 Personal Photo Retreval Cha-We Ku, Been-Chan Chen, Guan-Bn Chen, L-J Gaou, Rong-ng Huang, and ao-en Wang Knowledge, nformaton, and Database ystem Laboratory Department of Computer

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

Brushlet Features for Texture Image Retrieval

Brushlet Features for Texture Image Retrieval DICTA00: Dgtal Image Computng Technques and Applcatons, 1 January 00, Melbourne, Australa 1 Brushlet Features for Texture Image Retreval Chbao Chen and Kap Luk Chan Informaton System Research Lab, School

More information

Scan Conversion & Shading

Scan Conversion & Shading Scan Converson & Shadng Thomas Funkhouser Prnceton Unversty C0S 426, Fall 1999 3D Renderng Ppelne (for drect llumnaton) 3D Prmtves 3D Modelng Coordnates Modelng Transformaton 3D World Coordnates Lghtng

More information