3D Point Cloud Video Segmentation Based on Interaction Analysis

Size: px
Start display at page:

Download "3D Point Cloud Video Segmentation Based on Interaction Analysis"

Transcription

1 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss Xao Ln, Josep R.Casas and Montse Pardàs Image Processng Group, Techncal Unversty of Catalona (UPC) bstract. Gven the wdespread avalablty of pont cloud data from consumer depth sensors, 3D segmentaton becomes a promsng buldng block for hgh level applcatons such as scene understandng and nteracton analyss. It benefts from the rcher nformaton contaned n actual world 3D data compared to apparent (projected) data n 2D mages. Ths also mples that the classcal color segmentaton challenges have recently shfted to RGBD data, whereas new emergng challenges are added as the depth nformaton s usually nosy, sparse and unorganzed. In ths paper, we present a novel segmentaton approach for 3D pont cloud vdeo based on low level features and orented to the analyss of object nteractons. herarchcal representaton of the nput pont cloud s proposed to effcently segment pont clouds at the fner level, and to temporally establsh the correspondence between segments whle dynamcally managng the object splt and merge at the coarser level. Experments llustrate promsng results for our approach and ts potental applcaton n object nteracton analyss.... Keywords: object segmentaton, 3D pont clouds, dynamc splt and merge management, object nteractons 1 Introducton Segmentaton s an essental task n computer vson. It usually serves as the foundaton for solvng hgher level problems such as object recognton, nteracton analyss and scene understandng. Tradtonally, segmentaton s defned as a process of groupng homogeneous pxels nto multple segments on a sngle mage, whch s also known as low level segmentaton. The obtaned segments are somehow more homogeneous and more perceptually meanngful than raw pxels. Based on that, the concept of semantc segmentaton/labelng s proposed. It s devoted to segment an mage nto regons whch deally correspond to meanngful objects n the scene. To acheve ths goal, hgh level knowledge s usually ncorporated nto the segmentaton process, such as object models [2] exploted n constraned scenes, accurate object annotatons requred n the ntalzaton [12, 16] and large databases contanng fully annotated data n, for nstance, label transfer approaches such as [9]. These approaches yeld outstandng segmentaton results; however, most computer vson applcatons nvolve large amounts

2 2 X. Ln, J.R. Casas and M. Pardàs of data wth dfferent types of scenes contanng several objects, whch dffcult the adaptaton to generc scenes of those methods based on manual ntalzaton or predefned/learned object models. To generalze the methodology from constraned stuatons, larger attenton has been drawn on nvestgatng the spatal relaton between segments and ther temporal correspondences when temporal vdeo (stream) data s avalable. bunch of methods focusng on the spato-temporal relaton between segments are proposed [4, 15, 3, 5, 6, 1]. These methods manly focus on tacklng two problems: a hgher level representaton, whch abstracts the raw data from scratch, and a method to establsh the spato-temporal correspondences. Several methods employ a generc model to represent the objects n the scene. Husan et al. [6] mantans a quadratc surface model to generally represent the object segments n the scene. The model s then updated along the sequence to obtan the fnal segmentaton result. But t s dffcult to handle objects wth large dsplacement n successve frames. Smlarly, a Gaussan Mxture Model (GMM) s used n [8] to represent the objects, whle the model s ncrementally updated for new frames n the sequence. However, t establshes the correspondence between the object model and the pont cloud n the new frame by usng the Iteratve Closest Pont (ICP) technque, whch may lead to the accumulaton of regstraton errors n the object model due to the deformaton of the objects. More generally n scene representaton, Rchtsfeld et al. [13] propose to represent the 3D pont cloud wth a graph of surface patches detected n the scene, such as planes and non-unform ratonal B-splnes (NURBs). SVM based learnng process s then employed to decde the relaton between surface patches for a sub-sequent graph cut segmentaton. Grundmann et al. [3] use a graph-based model to herarchcally construct a consstent vdeo segmentaton from over-segmented frames. Smlarly, Hckson et al. [5] extend the method to RGBD stream data. But the over-segmentaton for each frame n these two approaches s stll calculated ndependently, wthout the temporal coherence constrant, whch may lead to a temporal nconsstency problem due to changes of correspondng over-segments n dfferent frames. bramov et al. [1] perform label transfer n the pxel level between frames by usng optcal flow. Then, they mnmze the label dstrbuton energy n the Potts model to generate labels for objects n the scene. Ths establshes the temporal correspondences n the pxel level, whch makes the approach hghly rely on the performance of optcal flow estmaton. Motvated by the problems mentoned above, we propose a segmentaton algorthm based on the defnton of objects as compact pont clouds n the 3D-space plus tme doman. However, pont clouds correspondng to an object can break nto dfferent compact sub-clouds due to occlusons, or can merge wth compact pont clouds correspondng to other objects, producng a sngle compact pont cloud, when they become spatally close (object nteracton). Our system ams to produce a robust spato-temporal segmentaton of the pont clouds by analyzng ther connectvty to defne the objects accordng to the evdence observed up to a gven temporal pont. Our prmary contrbutons are:

3 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss 3 We propose a novel tree structure representaton for the pont cloud of the scene whch allows us to temporally update the smlartes between nodes n the tree We propose to approach the temporal correspondences establshment task by a labellng assgnment problem regardng the tree structure. dynamc management of object splts and merges s exploted n our approach for generatng a better segmentaton result based on all low-level features avalable n over-segmentaton method based on the compactness of the connecton between neghborng super voxels n the graph s proposed. The rest of the paper s organzed as follow. In secton 2 we explan how the 3D pont cloud segmentaton problem s modeled. Sectons 3 and 4, present the framework of the proposed segmentaton approach and show expermental results, respectvely. Fnally, secton 5 dscusses the results and yelds conclusons. 2 Problem modelng and defnton In ths secton, we explan how the 3D pont cloud segmentaton task s modeled by the proposed tree structure. 2.1 Tree structure representaton of the pont cloud Gven a stream of RGBD data, our goal s to segment the foreground pont cloud n each frame nto meanngful sub-clouds and assocate these sub-clouds n consecutve frames to mantan the trajectores for them wthout explct object models or accurate ntalzaton. More precsely, we represent the nput pont cloud as a graph G (shown n Fg.1) wth a super-voxel approach [1]. Nodes n the graph are homogeneous sub-cloud patches and edges defne the spatal connectons among patches. In ths manner, the connectvty of a pont cloud s nterpreted as the connectvty n the correspondng graph representaton. The set of connected nodes n the graph corresponds to the compact parts of the nput pont cloud, whch we call blobs (shown n Fg.1(c) and marked n dfferent colors). Then a tree structure wth 4 levels varyng from coarse to fne s exploted to represent the nput pont cloud at dfferent scales of objectconnectvty. Fg.1 shows the constructed tree structure for the pont cloud data n the second row. The root of the tree represents the scene. The second level of the tree s the object level, n whch each node stands for an object proposal. The next level, named component level, s employed to handle potental splts and merges of pont clouds representng these objects. n object s represented by more than one component f t splts n dfferent blobs n the graph. Components from dfferent objects can be part of the same blob, because of the nteractons between objects. Splts and merges of components are managed by mantanng the smlartes among object components along tme, whch provdes a temporally coherent way to obtan object proposals based on pont cloud

4 4 X. Ln, J.R. Casas and M. Parda s root object smlarty Scene Level Object Level component smlarty B Component Level B 1 B1 B Over-Segmentaton Level Blob B2 B1 (c) (d) (e) Fg. 1. n example of tree structure representaton. tree structure representaton of the nput pont cloud data. The graph bult on the nput pont cloud. (c) Blobs n the nput pont cloud. (d) Components for each object. (e) Objects segmentaton obtaned from the tree structure. connectvty. The fnal level of the tree s the over-segmentaton level. We oversegment components usng normalsed cut n ther graphs n order to correctly establsh correspondences between trees along tme and update ts structure, that s, the temporal coherent assgnment of labels to the segmented objects. Note that three knds of labels are used n Fg.1 to dfferentate the nodes n the tree whle showng ther relatonshps, whch are object label (color), component label (alphabet) and segment label (number). We algned the color used n Fg.1 wth the real pont cloud data plots (Fg.1(c) and Fg.1(e)). In Fg.1(e), we present the object segmentaton result obtaned n ths tree structure, whle Fg.1(d) shows the components of each object from the pont cloud vew. Fg.1(c) presents the blobs n the nput pont cloud whch s related to the ellpses marked wth the same color n Fg Tree structure creaton Takng the pont cloud n frame t as nput data, we abstract t wth super voxels, usng the method proposed n [1]. The graph representaton smplfes the nput data by groupng homogeneous ponts on the pont cloud nto super voxels whle preservng the boundary nformaton. Then, a graph G s constructed regardng the spatal connectvty between super voxels. We group the pont cloud nto blobs by detectng the connected components n the graph. The tree n the frst

5 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss 5? B B1 B2 frame t-1 frame t frame t-1 frame t Fg. 2. n example of temporal nconsstency problem. The problem when establshng the correspondence between components n the prevous frame and blobs n the current frame. Usng the segments nstead of components solves ths problem. frame s created by smply takng the detected blobs as the objects, as no pror nformaton about the objects s provded. ccordngly, we create one component for each object and over-segment each component nto segments. part from the frst frame, the tree s bult n a bottom-up way, startng at the component level. Frst, a correspondence s made between the connected components of the graph (blobs) n the current frame and the segments at the over-segmentaton level of the tree structure n the prevous frame. Ths over-segmentaton level s employed to avod temporal nconsstency problem. Fg.2 shows an example of t, where the component B of the blue object n frame t 1 splts nto two blobs n frame t. In ths case, no correct assocaton s found between components and blobs. The problem may be tackled by over-segmentng the component B of the blue object nto segments B1 and B2 (shown n Fg.2) and assocatng the segments n frame t 1 wth blobs n frame t. Establshng the correspondence between the blob labels and the segments s a problem of assgnng M b blob labels to M s segments. Ths s a nonlnear nteger programmng problem whch s solved usng a Genetc lgorthm to mnmze an energy functon whch s composed of three terms: one for representng the appearance changes E a, one for the dsplacements E d and the other one E o for the penalty when objects move out of the scene. further segmentaton s needed when segments that correspond to dfferent objects n the prevous frame are assgned to the same blob. restrcted graph cut method s employed to segment the graph of the blob by mnmzng a segmentaton energy functon, n whch we consder the degree that a graph cut fts the current data whle beng coherent wth the mnmum cut n the prevous frame. Once the current segmentaton s done, the components and objects n the current tree are created ntally from t regardng the prevous tree structure. To dynamcally manage object splts and merges along tme, we mantan smlartes between nodes at the component and object level respectvely and update the tree based on t. The component smlartes are measured among components whch belong to the same object whle the object smlartes are measured among objects. These smlartes are computed consderng spatal dstance and apprearance dfference, whch reveal the lkelhood of object splts and merges.

6 6 X. Ln, J.R. Casas and M. Pardàs We accumulate them along tme by averagng the current smlarty and the prevous accumulated smlarty regardng the establshed correspondences. Then object splts and merges are confrmed by thresholdng the accumulated smlartes. Fnally, an over-segmentaton s performed at the component level to generate segments for correctly establshng the correspondence to the next frame. Specfcally, a normalzed cut s performed n the graph representng the component teratvely untl the cut cost s larger than a threshold. 3 Graph based dynamc 3D pont cloud segmentaton In ths secton, we present the frame work of the proposed approach ncludng data acquston and ntalzaton, temporal correspondences establshment and segmentaton, the proposed dynamc management of object splts and merges mechansm and the over-segmentaton method. 3.1 Data acquston and ntalzaton We can transform the per-pxel dstances provded n an RGBD mage nto a 3D pont cloud C I R 3 usng camera parameters. We focus on the nterest area of the foreground cloud C fg R 3 n 3D space. Takng the foreground pont cloud at frame t as nput data, a graph representaton s constructed f (C fg ) G (v, e) va a graph buldng method f, where v s the set of vertces or nodes and e the edges of the graph. The super voxel method ntroduced n [1] s employed as the graph buldng method f n our approach. It aggregates ponts on the pont cloud nto homogeneous sub-cloud patches (super voxels) wth respect to the pont proxmty and appearance smlarty whle preservng the boundary nformaton. Then a graph G s bult for the super voxels regardng ther adjacency. The connectvty of C fg s nterpreted as the connectvty on G. Our system s ntalzed by buldng the tree structure for the frst frame. Nodes n the tree are denoted as Nlevel, where we specfy whch level the node belongs to and ts node number n ths level. Each node s descrbed by ts related pont cloud and graph Nlevel ( Clevel, level) G, where C level C fg and G level G. node for the tree root s created as Nsc 1 (C fg, G). s mentoned n Secton 2.2, we base the constructon of the tree for the frst frame only on the connectvty of the nput pont cloud. Thus, we extract blobs from C fg by detectng connected components on graph G. Each blob s treated as one object proposal whle accordngly we create one object node No n the tree. For each object node, one component node Nc s created. Components are over-segmented nto M s segments va an over segmentaton method OSeg(Nc) { } Ns 1 Ns Ms. 3.2 Correspondences establshment and segmentaton part from the frst frame, we create the current tree structure wth respect to the tree n the prevous frame T r. Smlarly, a graph G s obtaned for the current pont cloud and blobs are detected on G. Then, the tree buldng process

7 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss 7 s started by establshng the correspondences between the current data and T r. ssocatng a set of labels to another set of labels s treated as an assgnment task, n whch we optmze dfferent assgnment proposals wth respect to an energy functon. Snce the problem scale ncreases exponentally wth the number of labels, t s crtcal to lmt the number of the labels. Therefore, we propose to assgn the blob labels n the current frame to the segments n the prevous tree T r. The blobs represent the few compact sub-clouds of the nput pont cloud. ssgnng blob labels to segments reduces the problem scale whle t respects the spatal dsconnecton between the compact sub-clouds, whch may concde wth the object boundares. The problem now becomes a task of assgnng M b blob labels n the current frame to M s segments n the prevous frame. To cope wth the stuaton when objects go out of the scene, we employ a vrtual blob B out whch stands for the space out of the nterest area and does not represent any pont cloud. Then, the assgnment energy functon s defned as: E ass () = E a + E d + E o (1) Where s an assgnment proposal whch maps a segment label n the prevous frame l s to a blob label l b n the current frame, ( l ) s l b. E a stands for the summaton of the energy of the appearance dfference between the set of the pont clouds Cs related to the segment N s and the pont cloud C b related the blob B, where the segment label ls j s assocated to the blob label l b regardng the assgnment. The appearance dfference s measured by comparng the number of ponts NoP ( ) on the pont cloud. M b E a () = NoP ( C ) b =1 (ls )==lb NoP ( ) Cs (2) E d s the summaton of the energy of the dsplacement whch s calculated by measurng the Hausdorff dstance dst h ( ) between the pont cloud Cs and the pont cloud Cb (l ), where s j == lb. E d () = M b =1 (ls )==lb ) dst h (C s, C b (3) E o s the summaton of the dstance between the pont cloud of the segment Ns, whch s assocated wth blob B out, to the closest boundary bd. The boundares are predefned planes whch are also used n the data acquston step n Sec.3.1. The dstance s calculated as the Eucldean dstance (dst e ( )) from the centrod of the pont cloud of the segment to the closest boundary plane. ( )) E o () = mn (dst e Cs, bd (4) (ls bd Boundary )==lb out

8 8 X. Ln, J.R. Casas and M. Pardàs Optmzng ths energy functon s a nonlnear nteger programmng problem. We employed the Genetc lgorthm to solve t. fter the best assgnment s obtaned, the blobs n the current frame are assocated wth segments n the prevous frame. further segmentaton s needed when segments that correspond to dfferent objects n the prevous frame (accordng to the prvous tree T r ) are assgned to the same blob. Gven a blob B ( Cb, b) G and the M unque object labels related to the assocated segment labels { ls j ( } ls) j == l b, our goal s to segment the graph G b nto M parts. To ths end, we employ the restrcted graph cut approach proposed n [17] to seek for the mnmal cut on graph G b, whch further segments ths blob consderng both spatal/feature homogenety and the temporal consstency. The mnmal cut s obtaned by mnmzng a segmentaton energy functon wth the way ntroduced n [7]. The energy functon for graph cut s usually defned as the summaton of the data energy and the smoothness energy (E (L) = E data + E smooth ), where L stands for the label proposal for the graph. The data energy s an unary energy term representng the degree that the label proposal fts the current data. The smoothness energy s a par-wse energy whch manages the label smoothness between nodes n the graph. The method proposed n [17] ntroduces a novel smoothness energy term consderng the label smoothness regardng not only the current data but also the mnmal cut obtaned n the prevous frame. The further segmentaton performed on the blob wth segment labels related to multple object labels yelds the object parttons for ths blob whle establshng ther correspondences to the objects n the prevous tree. s mentoned n Secton 2.1, an object s represented by more than one component f t splts n dfferent blobs, n the current or prevous frames. Thus, for each object partton n the blobs, we create one component for the related object. ccordngly, the correspondences between the current components and the components n the prevous tree are made. Note that there s no correspondence establshed for the newly generated components n the current tree. In ths step, the frst three levels of the current tree structure are ntally bult regardng the establshed correspondences to the prevous tree at each of the three levels. 3.3 Dynamc management of merge and splt Gven the current tree obtaned n the last step, an object proposal s mplct at the object level. Ths object proposal for the current nput data s temporally coherent wth the object proposal n the prevous frame. However, t may not be a correct object proposal, snce no accurate ntalzaton s guaranteed at the begnnng of ths process n our approach. That s to say, ths object proposal needs to be further analyzed, n order to cope wth the errors n the prevous nformaton. In ths case, we explot the establshed correspondences and analyze the behavors of related nodes n trees along tme. More specfcally, we compute the smlartes among the components correspondng to the same object, whch forms a smlarty matrx for each object n the tree. The object smlartes are computed among objects n the tree producng an object smlarty matrx. In

9 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss 9 accumulated splt... Object Level accumulated smlarty matrx... merge... Object Level smlarty matrx blob blob Component Level blob blob Component Level Fg. 3. Example of dynamc management of splt and merge our approach, the smlarty between node N and N j s defned as the Eucldean dstance between ther related pont clouds n Eq.(5). Sm ( { ( N, N j) dst e C, C j) > ψ = 1 dste(c,c j ) (5) ψ otherwse Note that here the Eucldean dstance between two pont clouds s calculated as the dstance between the closest pont par from them. ψ s a normalzng factor. fterwards, the smlartes are accumulated along tme by averagng them wth the correspondng accumulated smlarty n the prevous frame, n order to dynamcally manage the merges and splts of objects n the scene on the fly. The accumulated smlarty reveals the lkelhood of object splts and merges regardng the evdence observed up to the current frame. Thus, the decsons for splt and merge are made by thresholdng the accumulated smlarty regardng two thresholds, T h splt and T h merge. Fg.3 shows an example how the object merge and splt are dynamcally managed. Specfcally, a splt for an object node s confrmed when a set of ts chld component nodes have all the accumulated smlartes smaller than T h splt wth respect to the rest of the chld component nodes. Then a new object node s created as the parent node of the splt component nodes. In Fg.3, the red component node splts from ts parent object node and a new object node marked n blue s created as ts new parent. merge between object nodes s confrmed when they are physcally connected whle the accumulated smlartes between them are larger than T h merge. In Fg.3, a merge s confrmed between the blue object node and the green object node whch s physcally connected wth each other. The nodes are merged to the one wth the larger number of the ponts on the related pont cloud (the green node). Ther chld component nodes are all connected to the green object node n the tree. Then the blue object node s removed from the tree. 3.4 Over segmentaton The frst three levels n the current tree structure are bult and updated n the last two steps. In ths secton, we ntroduce an over segmentaton process n order to buld the forth level of the tree. In the over-segmentaton level, we generate segments for each component n the current tree. Ths s treated as the preparaton for establshng the correspondence between segments n the

10 Frame Frame Frame Frame 1 X. Ln, J.R. Casas and M. Pardàs current tree to the blobs detected n the next frame. The more segments are generated n the current frame, the better they wll respect the topology of the nput pont cloud n the next frame, whch avods the temporal nconsstency problem. However, the number of the segments wll affect the problem scale of the assgnment task n the correspondence establshment and segmentaton process n the next frame. Therefore, nstead of usng a technque such as super voxel [1] whch fnely over-segments the pont cloud, we propose a relatvely coarser over-segmentaton method to tackle ths problem. Pratcally, n the related graph G c (v, e) of a component node Nc, we defne the touchng ponts for connected nodes n G c. The touchng ponts T Pj from the node v to the node v j s computed as the number of the ponts n v whch have the closest Eucldean dstance lower than a threshold T h tp to v j. The touchng ponts T P j from v j to v s defned n the same manner. The number of touchng ponts reveals the compactness CC between connected nodes n the graph, whch s defned as: CC ( v, v j) = NoP ( ( ) ) T Pj NoP T P j NoP (v + ) NoP (v j (6) ) where NoP ( ) stands for the number of the ponts n a graph node. We beleve that any splt of the pont cloud wll gradually lead the decrease n the number of touchng ponts at the splttng poston on the graph. Then the edges n the graph are weghted by CC and a normalzed cut approach [14] s performed on the graph teratvely, whch yelds one segment node n each teraton untl the cut cost s larger than a threshold T h mc. In ths manner, the component s teratvely over-segmented nto segments at the postons whch are less compact n the graph, whch may concde wth the splts n the next frame. 4 Experments 4.1 Segmentaton result evaluaton To evaluate our approach from the 3D pont cloud segmentaton perspectve, we select 4 sequences wth 3D pont cloud ground truth labelng n the human manpulaton data set [11]. Each of them contans 2 frames. These 4 sequences Error (%) Error (%) Error (%) Error (%) (c) (d) Fg. 4. -(d) present the segmentaton results for sequence 1-4, shown as percentage of error ponts (vertcal axs) per frame (horzontal). Red: GDS, Blue: GS.

11 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss 11 vary from sngle attachment to mult-attachments, low moton to hgher moton, double attached objects to multple attached objects. The task of ths experment s to segment objects from the scene. The evaluaton metrcs s 3D segmentaton accuracy (3D CCU) proposed n [18] whch computes the fracton of a ground truth segment that s correctly classfed n our approach. Snce the super voxel based graph representaton method organses the nput pont cloud wth voxels n 3D producng a down sampled pont cloud, whle the ground truth s labeled n the orgnal cloud, we fnd K nearest neghbors for a pont on the down sampled pont cloud from the ground truth labelng and use majorty voted label among K nearest neghbors as the ground truth labelng for ths pont. Fg.4 shows the segmentaton results of these 4 sequences. In each sub-fgure, we plot the percentage of segmentaton error aganst the frame number. We compare the segmentaton performance of our graph based dynamc segmentaton approach (GDS) wth the graph based segmentaton method (GS) proposed n [17] whch provdes a temporally coherent segmentaton for RGBD stream data. The red lnes n Fg.4 stand for the result of GDS whle the blue lnes stand for the result of GS. s mentoned n Secton 3.2, GS s also ntegrated n our approach producng temporally coherent segmentaton for blobs wth more than one unque object labels. Thus, the comparson between them shows the mportance of ntroducng the dynamc management of object splt and merge mechansm. Partcularly, GDS outperforms GS n all the 4 sequences shown n Fg.4, whch proves that the dynamc management mechansm contrbutes n the low level to the better segmentaton of actual objects n the scene. GDS acheves an overall foreground 3D pont cloud segmentaton 3.92% mean segmentaton error. In Fg.4(c), we observe a dramatc ncrease (from 2.5% to 15%) n the segmentaton error for GDS (red lne). Ths s caused by the error n dynamc management of object merge and splt. Fg.5(d)-5(f) show 3 key frames n ths process, where objects are marked n dfferent colors. The left arm of the human body n blue s confrmed to splt from the torso n frame 81 due to the persstent self-occluson. The self-occluson breaks the human body nto two compact pont clouds (the left arm and the rest), whch gradually decreases the accumulated smlarty tll the splt s confrmed n frame 81. However, these two pont clouds reattach to each other n frame 13, whch contnuously ncreases the accumulated smlarty between them, tll they are confrmed to merge n frame 136. Fg.5(j)-5(l) n the second row present another example, n whch our system s ncorrectly ntalzed n frame 1 (part of the torso marked n red s treated as one object because of the spatal dsconnecton caused by occluson). They reattach to each other n frame 8 and get merged n frame 14 due to the dynamc management mechansm. These two examples llustrate the robustness of our system regardng the errors n prevous frames whle showng that our approach does not rely on an accurate ntalzaton. For comparson, we employ 3 more sequences proposed n [6] and perform our approach aganst the daptve Surface Models based 3D Segmentaton method (SMS) n [6]. SMS mantans a quadratc surface model to generally represent the object segments n the scene. The model s updated along the sequence by

12 12 X. Ln, J.R. Casas and M. Parda s (c) (d) (e) (f) (g) (h) () (j) (k) (l) Fg. 5. Examples of dynamc management of merge and splt. -(c) present the color mages of frame 81,13 and 136 n sequence 4, (d)-(f) show the segmentaton results n these frames. (g)-() present the color mages of frame 1,8 and 14 n sequence 2, (j)-(l) show the segmentaton results n these frames. fndng and growng the overlappng area to obtan the fnal segmentaton result. To adapt our approach to the scenes n these 3 sequences, we remove the background pont cloud by usng a plane fttng technque to extract foreground pont clouds as nput data. Fg. 6 shows a quanttatve comparson between GDS and SMS n these 3 sequences. Sequence 1 contans a scenaro of a human hand rollng a green ball forward and then backward wth the fngers. Sequence 2 nvolves a robot arm graspng a paper roll and movng t to a new poston. Sequence 3 descrbes a scenaro n whch the human hand enters and leaves the scene, dsplacng the objects rapdly. The comparson results show that the proposed GDS approach outperforms SMS n all the 3 sequences. Specfcally, Fg.6(c) shows an example of the drawback n SMS. The spkes n the blue curve are caused by rapd object movement, whch leaves lttle or no overlap of correspondng segments for SMS. However, our method has the robustness to rapd movements, snce the correspondences establshment problem s treated as the optmzaton of an assgnment energy. part from the sequences used n the quanttatve evaluaton experments, we employ 5 more sequences wthout ground truth labelng, recorded by ourselves wth scenes dsplayng nteractons n human manpulaton scenes. Fg.7 shows Error(%) Error(%) Error(%) Frame Frame Frame (c) Fg. 6. Quanttatve result of GDS (n red) and the SMS (n blue) for the 3 sequences provded n [6]. From left to rght, sequence

13 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss (c) (d) 13 Fg. 7. Qualtatve results of the proposed method: - from human manpulaton dataset n [11], (c) from data recorded by ourselves and (d) from data n [6]. some qualtatve results of our approach from 4 sequences. For each sequence, we unformly sample 4 frames and show the segmentaton result from our approach. More vsual results are avalable on Interacton detecton Our approach s also capable to obtan the nteractons between objects, whch s mplct n the tree structure. s mentoned n Secton 1, an nteracton between objects s defned as a state when they become spatally close. In our method, an nteracton s detected when a blob s related to more than one unque object label. n example of an object nteracton s shown n Fg.8, where an nteracton between a human body and a box s detected and marked as the black lne connectng them. Based on ths defnton, we manually label the object nteracton occurrng n the 4 sequences used n the frst experment and calculate the tmes of nteractons between objects n the scene detected by the proposed method. Fg.8(c) shows the nteracton detecton result n each sequence, where the top of each bar shows the number of nteractons n the ground truth, the red lne stands for the true postve detectons n our approach and the blue lne stands for the false postve detectons. Our approach detected 742 truth postve nteractons over 98 labeled nteractons n the ground truth. We notce that the number of false postve detectons n sequence 3 s relatvely hgh. Ths s 4 35 Red lne: True Postve Blue lne: False Postve number of nteractons Seq.1 Seq.2 Seq.3 Seq.4 (c) Fg. 8. n example of object nteracton. n example of false postve detectons (c) Interacton detecton evaluaton for 4 sequences from [11]

14 14 X. Ln, J.R. Casas and M. Pardàs manly caused by the errors n dynamc management of object splt and merge process. Fg.8 shows the segmented pont cloud n a frame of sequence 3, where an nteracton s detected between the arm n blue and the juce box n green. However, the arm and the torso are not correctly merged as one object n ths frame, whch makes the detected nteracton a false detecton. 4.3 Computatonal cost analyss In our approach, there are two man parts where the computatonal power s spent: the optmzaton for the mult-labels assgnment for temporal correspondences establshment and the graph cut technque used n ether the further segmentaton or the over-segmentaton process. The man problem of approachng the temporal correspondences assocaton by a mult-labels assgnment problem s the computaton complexty. The problem scale ncreases exponentally when the number of labels grows. However, the number of the labels s controlled n our approach by fndng a sutable over-segmentaton level so that we can acheve the assgnment task n a small problem scale whle not leadng to the temporal nconsstency problem. In the experments, generally 2 segments and 5 blobs are nvolved n the assgnment task n each frame. The graph cut technque used n our approach has the reported computaton complexty O ( v 2 sqrt (e) ) where v stands for the number of vertces and e the number of edges on the graph. 5 Concluson In ths paper, we have ntroduced a graph based dynamc 3D pont cloud segmentaton method, whch works at low level wth a tree structure representaton for segmentng generc objects n RGBD steam data. We have evaluated the performance of the proposed approach wth a human manpulaton data set and also compared t wth the method proposed n [6]. Our approach acheves an overall 3.92% 3D pont cloud segmentaton error whle outperformng n the comparson experment. Our contrbuton can be summarzed n 3 ponts: frstly, we proposed a novel over-segmentaton method based on the compactness of the connecton between neghborng super voxels on the graph then a novel tree structure representaton for the scene s proposed, whch allows to temporally update the smlartes between nodes n the tree the temporal correspondences establshment task s approached by a labellng assgnment problem that takes nto account the appearance and dsplacement of the components. Based on the tree structure, a dynamc management of object splts and merges mechansm s proposed Our approach generates a better segmentaton result based on all low-level features avalable. Ths guarantees t to be generc, as no explct or learnt model of the objects or the scene are ntroduced n the proposed method. cknowledgement: Ths work has been developed n the framework of the project TEC R, fnanced by the Spansh Mnstero de Economa y Compettvdad and the European Regonal Development Fund (ERDF).

15 References 3D Pont Cloud Vdeo Segmentaton Based on Interacton nalyss bramov,., Pauwels, K., Papon, J., Wörgötter, F., Dellen, B.: Depth-supported real-tme vdeo segmentaton wth the knect. In: pplcatons of Computer Vson (WCV), 212 IEEE Workshop on. pp IEEE (212) 2. Felzenszwalb, P.F., Grshck, R.B., Mcllester, D., Ramanan, D.: Object detecton wth dscrmnatvely traned part-based models. Pattern nalyss and Machne Intellgence, IEEE Transactons on 32(9), (21) 3. Grundmann, M., Kwatra, V., Han, M., Essa, I.: Effcent herarchcal graph-based vdeo segmentaton. In: Computer Vson and Pattern Recognton (CVPR), 21 IEEE Conference on. pp IEEE (21) 4. He, X., Zemel, R.S., Carrera-Perpñán, M.Á.: Multscale condtonal random felds for mage labelng. In: In CVPR 24. vol. 2, pp. II 695. IEEE (24) 5. Hckson, S., Brchfeld, S., Essa, I., Chrstensen, H.: Effcent herarchcal graphbased segmentaton of rgbd vdeos. In: CVPR214. IEEE Computer Socety (214) 6. Husan, F., Dellen, B., Torras, C.: Consstent depth vdeo segmentaton usng adaptve surface models. Cybernetcs, IEEE Transactons on 45(2), (215) 7. Kolmogorov, V., Zabn, R.: What energy functons can be mnmzed va graph cuts? Pattern nalyss and Machne Intellgence, IEEE Transactons on 26(2), (24) 8. Koo, S., Lee, D., Kwon, D.S.: Incremental object learnng and robust trackng of multple objects from rgb-d pont set data. Journal of Vsual Communcaton and Image Representaton 25(1), (214) 9. Lu, C., Yuen, J., Torralba,.: Nonparametrc scene parsng va label transfer. Pattern nalyss and Machne Intellgence 33(12), (211) 1. Papon, J., bramov,., Schoeler, M., Worgotter, F.: Voxel cloud connectvty segmentaton-supervoxels for pont clouds. In: Computer Vson and Pattern Recognton (CVPR), 213 IEEE Conference on. pp IEEE (213) 11. Peropan,., Salv, G., Pauwels, K., Kjellstrom, H.: udo-vsual classfcaton and detecton of human manpulaton actons. In: Intellgent Robots and Systems (IROS 214), 214 IEEE/RSJ Internatonal Conference on. pp IEEE (214) 12. Ren, X., Malk, J.: Trackng as repeated fgure/ground segmentaton. In: Computer Vson and Pattern Recognton, 27. pp IEEE (27) 13. Rchtsfeld,., Mörwald, T., Prankl, J., Zllch, M., Vncze, M.: Segmentaton of unknown objects n ndoor envronments. In: Intellgent Robots and Systems (IROS), 212 IEEE/RSJ Internatonal Conference on. pp IEEE (212) 14. Sh, J., Malk, J.: Normalzed cuts and mage segmentaton. Pattern nalyss and Machne Intellgence, IEEE Transactons on 22(8), (2) 15. Shotton, J., Wnn, J., Rother, C., Crmns,.: Textonboost: Jont appearance, shape and context modelng for mult-class object recognton and segmentaton. In: Computer Vson ECCV 26, pp Sprnger (26) 16. Tsa, D., Flagg, M., Nakazawa,., Rehg, J.M.: Moton coherent trackng usng mult-label mrf optmzaton. IJCV 1(2), (212) 17. Xao, L., Josep, C., Montse, P.: 3d pont cloud segmentaton orented to the analyss of nteractons. In: 24th European Sgnal Processng Conference (EUSIPCO),216. p. (ccepted and to be publshed). IEEE (216) 18. Xu, C., Corso, J.J.: Evaluaton of super-voxel methods for early vdeo processng. In: Computer Vson and Pattern Recognton (CVPR), 212 IEEE Conference on. pp IEEE (212)

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input

Real-time Joint Tracking of a Hand Manipulating an Object from RGB-D Input Real-tme Jont Tracng of a Hand Manpulatng an Object from RGB-D Input Srnath Srdhar 1 Franzsa Mueller 1 Mchael Zollhöfer 1 Dan Casas 1 Antt Oulasvrta 2 Chrstan Theobalt 1 1 Max Planc Insttute for Informatcs

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Fitting: Deformable contours April 26 th, 2018

Fitting: Deformable contours April 26 th, 2018 4/6/08 Fttng: Deformable contours Aprl 6 th, 08 Yong Jae Lee UC Davs Recap so far: Groupng and Fttng Goal: move from array of pxel values (or flter outputs) to a collecton of regons, objects, and shapes.

More information

Active Contours/Snakes

Active Contours/Snakes Actve Contours/Snakes Erkut Erdem Acknowledgement: The sldes are adapted from the sldes prepared by K. Grauman of Unversty of Texas at Austn Fttng: Edges vs. boundares Edges useful sgnal to ndcate occludng

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping.

SIGGRAPH Interactive Image Cutout. Interactive Graph Cut. Interactive Graph Cut. Interactive Graph Cut. Hard Constraints. Lazy Snapping. SIGGRAPH 004 Interactve Image Cutout Lazy Snappng Yn L Jan Sun Ch-Keung Tang Heung-Yeung Shum Mcrosoft Research Asa Hong Kong Unversty Separate an object from ts background Compose the object on another

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Local Quaternary Patterns and Feature Local Quaternary Patterns

Local Quaternary Patterns and Feature Local Quaternary Patterns Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents

More information

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution

Real-time Motion Capture System Using One Video Camera Based on Color and Edge Distribution Real-tme Moton Capture System Usng One Vdeo Camera Based on Color and Edge Dstrbuton YOSHIAKI AKAZAWA, YOSHIHIRO OKADA, AND KOICHI NIIJIMA Graduate School of Informaton Scence and Electrcal Engneerng,

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature

Detection of hand grasping an object from complex background based on machine learning co-occurrence of local image feature Detecton of hand graspng an object from complex background based on machne learnng co-occurrence of local mage feature Shnya Moroka, Yasuhro Hramoto, Nobutaka Shmada, Tadash Matsuo, Yoshak Shra Rtsumekan

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Discriminative Dictionary Learning with Pairwise Constraints

Discriminative Dictionary Learning with Pairwise Constraints Dscrmnatve Dctonary Learnng wth Parwse Constrants Humn Guo Zhuoln Jang LARRY S. DAVIS UNIVERSITY OF MARYLAND Nov. 6 th, Outlne Introducton/motvaton Dctonary Learnng Dscrmnatve Dctonary Learnng wth Parwse

More information

Image Alignment CSC 767

Image Alignment CSC 767 Image Algnment CSC 767 Image algnment Image from http://graphcs.cs.cmu.edu/courses/15-463/2010_fall/ Image algnment: Applcatons Panorama sttchng Image algnment: Applcatons Recognton of object nstances

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Online codebook modeling based background subtraction with a moving camera

Online codebook modeling based background subtraction with a moving camera Onlne codebook modelng based background subtracton wth a movng camera Lyun Gong School of Computer Scence Unversty of Lncoln, UK Emal: lgong@lncoln.ac.uk Mao Yu School of Computer Scence Unversty of Lncoln,

More information

Face Detection with Deep Learning

Face Detection with Deep Learning Face Detecton wth Deep Learnng Yu Shen Yus122@ucsd.edu A13227146 Kuan-We Chen kuc010@ucsd.edu A99045121 Yzhou Hao y3hao@ucsd.edu A98017773 Mn Hsuan Wu mhwu@ucsd.edu A92424998 Abstract The project here

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

Multiple Frame Motion Inference Using Belief Propagation

Multiple Frame Motion Inference Using Belief Propagation Multple Frame Moton Inference Usng Belef Propagaton Jang Gao Janbo Sh The Robotcs Insttute Department of Computer and Informaton Scence Carnege Mellon Unversty Unversty of Pennsylvana Pttsburgh, PA 53

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng, Natonal Unversty of Sngapore {shva, phanquyt, tancl }@comp.nus.edu.sg

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field

3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field 3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field Xiao Lin Image Processing Group Technical University of Catalonia (UPC) Barcelona, Spain Josep R.Casas Image Processing Group

More information

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem

Ecient Computation of the Most Probable Motion from Fuzzy. Moshe Ben-Ezra Shmuel Peleg Michael Werman. The Hebrew University of Jerusalem Ecent Computaton of the Most Probable Moton from Fuzzy Correspondences Moshe Ben-Ezra Shmuel Peleg Mchael Werman Insttute of Computer Scence The Hebrew Unversty of Jerusalem 91904 Jerusalem, Israel Emal:

More information

An efficient method to build panoramic image mosaics

An efficient method to build panoramic image mosaics An effcent method to buld panoramc mage mosacs Pattern Recognton Letters vol. 4 003 Dae-Hyun Km Yong-In Yoon Jong-Soo Cho School of Electrcal Engneerng and Computer Scence Kyungpook Natonal Unv. Abstract

More information

A Gradient Difference based Technique for Video Text Detection

A Gradient Difference based Technique for Video Text Detection 2009 10th Internatonal Conference on Document Analyss and Recognton A Gradent Dfference based Technque for Vdeo Text Detecton Palaahnakote Shvakumara, Trung Quy Phan and Chew Lm Tan School of Computng,

More information

Computer Vision I. Xbox Kinnect: Rectification. The Fundamental matrix. Stereo III. CSE252A Lecture 16. Example: forward motion

Computer Vision I. Xbox Kinnect: Rectification. The Fundamental matrix. Stereo III. CSE252A Lecture 16. Example: forward motion Xbox Knnect: Stereo III Depth map http://www.youtube.com/watch?v=7qrnwoo-8a CSE5A Lecture 6 Projected pattern http://www.youtube.com/watch?v=ceep7x-z4wy The Fundamental matrx Rectfcaton The eppolar constrant

More information

Face Recognition using 3D Directional Corner Points

Face Recognition using 3D Directional Corner Points 2014 22nd Internatonal Conference on Pattern Recognton Face Recognton usng 3D Drectonal Corner Ponts Xun Yu, Yongsheng Gao School of Engneerng Grffth Unversty Nathan, QLD, Australa xun.yu@grffthun.edu.au,

More information

3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field

3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field 07 5th European Signal Processing Conference (EUSIPCO) 3D Point Cloud Segmentation Using a Fully Connected Conditional Random Field Xiao Lin Josep R.Casas Montse Pardás Abstract Traditional image segmentation

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

Detection of Human Actions from a Single Example

Detection of Human Actions from a Single Example Detecton of Human Actons from a Sngle Example Hae Jong Seo and Peyman Mlanfar Electrcal Engneerng Department Unversty of Calforna at Santa Cruz 1156 Hgh Street, Santa Cruz, CA, 95064 {rokaf,mlanfar}@soe.ucsc.edu

More information

LECTURE : MANIFOLD LEARNING

LECTURE : MANIFOLD LEARNING LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

An Efficient Background Updating Scheme for Real-time Traffic Monitoring

An Efficient Background Updating Scheme for Real-time Traffic Monitoring 2004 IEEE Intellgent Transportaton Systems Conference Washngton, D.C., USA, October 3-6, 2004 WeA1.3 An Effcent Background Updatng Scheme for Real-tme Traffc Montorng Suchendra M. Bhandarkar and Xngzh

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images

A B-Snake Model Using Statistical and Geometric Information - Applications to Medical Images A B-Snake Model Usng Statstcal and Geometrc Informaton - Applcatons to Medcal Images Yue Wang, Eam Khwang Teoh and Dnggang Shen 2 School of Electrcal and Electronc Engneerng, Nanyang Technologcal Unversty

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

Robust visual tracking based on Informative random fern

Robust visual tracking based on Informative random fern 5th Internatonal Conference on Computer Scences and Automaton Engneerng (ICCSAE 205) Robust vsual trackng based on Informatve random fern Hao Dong, a, Ru Wang, b School of Instrumentaton Scence and Opto-electroncs

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction

2. Related Work Hand-crafted Features Based Trajectory Prediction Deep Neural Networks Based Trajectory Prediction Encodng Crowd Interacton wth Deep Neural Network for Pedestran Trajectory Predcton Yanyu Xu ShanghaTech Unversty xuyy2@shanghatech.edu.cn Zhxn Pao ShanghaTech Unversty paozhx@shanghatech.edu.cn Shenghua

More information

Understanding K-Means Non-hierarchical Clustering

Understanding K-Means Non-hierarchical Clustering SUNY Albany - Techncal Report 0- Understandng K-Means Non-herarchcal Clusterng Ian Davdson State Unversty of New York, 1400 Washngton Ave., Albany, 105. DAVIDSON@CS.ALBANY.EDU Abstract The K-means algorthm

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Fitting and Alignment

Fitting and Alignment Fttng and Algnment Computer Vson Ja-Bn Huang, Vrgna Tech Many sldes from S. Lazebnk and D. Hoem Admnstratve Stuffs HW 1 Competton: Edge Detecton Submsson lnk HW 2 wll be posted tonght Due Oct 09 (Mon)

More information

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline

Image Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and

More information

Dynamic Camera Assignment and Handoff

Dynamic Camera Assignment and Handoff 12 Dynamc Camera Assgnment and Handoff Br Bhanu and Ymng L 12.1 Introducton...338 12.2 Techncal Approach...339 12.2.1 Motvaton and Problem Formulaton...339 12.2.2 Game Theoretc Framework...339 12.2.2.1

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

Active 3D scene segmentation and detection of unknown objects

Active 3D scene segmentation and detection of unknown objects Actve 3D scene segmentaton and detecton of unknown objects Mårten Björkman and Danca Kragc Abstract We present an actve vson system for segmentaton of vsual scenes based on ntegraton of several cues. The

More information

A framework for the objective evaluation of segmentation algorithms using a ground-truth of human segmented 3D-models

A framework for the objective evaluation of segmentation algorithms using a ground-truth of human segmented 3D-models IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS (SMI) 2009 1 A framework for the objectve evaluaton of segmentaton algorthms usng a ground-truth of human segmented 3D-models H. Benhables

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Prof. Feng Liu. Spring /24/2017

Prof. Feng Liu. Spring /24/2017 Prof. Feng Lu Sprng 2017 ttp://www.cs.pd.edu/~flu/courses/cs510/ 05/24/2017 Last me Compostng and Mattng 2 oday Vdeo Stablzaton Vdeo stablzaton ppelne 3 Orson Welles, ouc of Evl, 1958 4 Images courtesy

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Multi-stable Perception. Necker Cube

Multi-stable Perception. Necker Cube Mult-stable Percepton Necker Cube Spnnng dancer lluson, Nobuuk Kaahara Fttng and Algnment Computer Vson Szelsk 6.1 James Has Acknowledgment: Man sldes from Derek Hoem, Lana Lazebnk, and Grauman&Lebe 2008

More information

A fast algorithm for color image segmentation

A fast algorithm for color image segmentation Unersty of Wollongong Research Onlne Faculty of Informatcs - Papers (Arche) Faculty of Engneerng and Informaton Scences 006 A fast algorthm for color mage segmentaton L. Dong Unersty of Wollongong, lju@uow.edu.au

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Efficient Video Coding with R-D Constrained Quadtree Segmentation

Efficient Video Coding with R-D Constrained Quadtree Segmentation Publshed on Pcture Codng Symposum 1999, March 1999 Effcent Vdeo Codng wth R-D Constraned Quadtree Segmentaton Cha-Wen Ln Computer and Communcaton Research Labs Industral Technology Research Insttute Hsnchu,

More information

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION

Overview. Basic Setup [9] Motivation and Tasks. Modularization 2008/2/20 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Overvew 2 IMPROVED COVERAGE CONTROL USING ONLY LOCAL INFORMATION Introducton Mult- Smulator MASIM Theoretcal Work and Smulaton Results Concluson Jay Wagenpfel, Adran Trachte Motvaton and Tasks Basc Setup

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information

A Background Subtraction for a Vision-based User Interface *

A Background Subtraction for a Vision-based User Interface * A Background Subtracton for a Vson-based User Interface * Dongpyo Hong and Woontack Woo KJIST U-VR Lab. {dhon wwoo}@kjst.ac.kr Abstract In ths paper, we propose a robust and effcent background subtracton

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking

Robust Inlier Feature Tracking Method for Multiple Pedestrian Tracking 2011 Internatonal Conference on Informaton and Intellgent Computng IPCSIT vol.18 (2011) (2011) IACSIT Press, Sngapore Robust Inler Feature Trackng Method for Multple Pedestran Trackng Young-Chul Lm a*

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Video Object Tracking Based On Extended Active Shape Models With Color Information

Video Object Tracking Based On Extended Active Shape Models With Color Information CGIV'2002: he Frst Frst European Conference Colour on Colour n Graphcs, Imagng, and Vson Vdeo Object rackng Based On Extended Actve Shape Models Wth Color Informaton A. Koschan, S.K. Kang, J.K. Pak, B.

More information

Fast Feature Value Searching for Face Detection

Fast Feature Value Searching for Face Detection Vol., No. 2 Computer and Informaton Scence Fast Feature Value Searchng for Face Detecton Yunyang Yan Department of Computer Engneerng Huayn Insttute of Technology Hua an 22300, Chna E-mal: areyyyke@63.com

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information