Recursive estimation of generative models of video

Size: px
Start display at page:

Download "Recursive estimation of generative models of video"

Transcription

1 Reursive estimation of generative models of video Nemanja Petrovi Google In. Sumit Basu Mirosoft Researh Aleksandar Ivanovi Universit of Illinois Urbana Thomas Huang Universit of Illinois Urbana Nebojsa Joji Mirosoft Researh Abstrat In this paper we present a generative model and learning proedure for unsupervised video lustering into senes. The work addresses two important problems: realisti modeling of the soures of variabilit in the video and fast transformation invariant frame lustering. We suggest a solution to the problem of omputationall intensive learning in this model b ombining the reursive model estimation, fast inferene, and on-line learning. Thus, we ahieve real time frame lustering performane. Novel aspets of this method inlude an algorithm for the lustering of Gaussian mixtures, and the fast omputation of the KL divergene between two mixtures of Gaussians. The effiien and the performane of lustering and KL approximation methods are demonstrated. We also present novel video browsing tool based on the visualization of the variables in the generative model. 1. Introdution The amount of video data available to an average onsumer has alread beome overwhelming. Still, there is a lak of effiient general-purpose tools for navigating this vast amount of information. We suggest that a suessful video browsing and summarization sstem has to aomplish two goals. First, it shall orretl model the soures of vast information ontent in the video. Seond, it shall provide the user with an intuitive and fast video navigation interfae that is ompatible, if not even jointl optimized with the analsis algorithm. As a solution of the first problem we propose the lustering of related but non-sequential frames into senes. Clustering is based on the generative model (Fig. 1) that builds on the model for translation invariant lustering [14]. Learning in the generative model with multiple disrete variables faes onsiderable omputational hallenges. We utilize the properties of video signal to develop provabl onvergent reursive lustering algorithm. To make the model more realisti, it assumes the Figure 1. Left: Sene generative model. Pair z is a Gaussian mixture. Observation x is obtained b saling b Z, rotating b R, transforming and ropping the latent image z b translation indexed b T. W is the fixed ropping window that models frame x as the small part of the video sene z. The effet of the omposition WT is illustrated as the shaded retangle that indiates the position of x in z. Right: Nine tpial frames from the video are initiall lustered into three lusters using onl translation invariant lustering. Three so obtained distributions are lustered into a single luster using translation, sale and rotation invariant distribution lustering (f. Se. 3). video frame to be a portion of the video sene, whih is reminisent of the panorami sene representations. As a solution of the seond problem we propose a video navigation tool based on the visualization of the variables in the generative model, whih ideall reflets all frames in the video. The navigation tool serves as a visuall meaningful index into the video. Video lustering and summarization is one of the most diffiult problems in the automati video understanding. It aims to produe short, et representative, snopsis of the video b extrating pertinent information or highlights that would enable the viewer to quikl grasp the general stor or navigate to the speifi segment. Two main approahes to video summarization inlude stati summarizations (inluding shots, mosais and storboards), and dnami summarizations (video skimming). Numerous shot and ke-frame detetion approahes are based on extrating and traking low level features over the time and deteting their abrupt

2 hanges. But, long lists of shots result in another information flood rather than abstration, while ke frames onl are not suffiient for a user to judge the relevane of the ontent. Dnami video summarization, often referred to as skimming, onsist of olleting representative or desirable sub-lips from the video. The navigation tool we introdue in this paper has the distintive feature that it inludes both stati and dnami summar of the video. Several shot-independent approahes for summarization have been proposed reentl, like the reursive ke-frame lustering [9, 10]. While promising, the lak robust similarit measure and with the number of lusters above the ertain levels, visuall different shots start to merge. There have been man interesting approahes to video browsing and summarization based on mosaiking. In this paper we introdue probabilisti mosai representation of video that inludes the invariane with respet to amera motion, hange in the sale and rotation. It assumes the video sene to be muh larger that the viewing field of the amera. We should emphasize that our goal is not the building of perfet mosais. Rather, it is the onstruting of robust similarit measure that ields to the high likelihood of the frame under the generative model. Similar mosaiking representations [4, 5, 6, 7] were used before, some of them based of generative models [8], but were overl onstrained with respet to the amera motions, requirements that target senes must be approximatel planar and should not ontain moving objets. For example, [5] studied mosais in the ontext of video browsing and a snopti view of a sene. Video summaries work [6] uses mosais for summarization, but ignores the foreground motion and relies on the bakground invariane for the sene lustering together with ad-ho sene similarit measure. Our work is similar to the mosaiking work [7] on mosaibased representations of video sequenes. There are a few important differenes. Our method, having video lustering and ompat video presentation as the ultimate goals, has a notion of variabilit of appearane (moving objets in the sene and blemishes in the bakground are allowed and treated as the noise), automatiall estimates the number of parameters (eg. number of different senes), explains the ause of variabilit of the data, and reognizes the sene that alread appeared. Also, other methods were used in the highl regimented ases (eg. aerial surveillane, sitoms ) where our is intended for the general lass of unonstrained home videos. Realisti graphial (generative) models ma sometimes fae serious omputational hallenges. Similarl, naive learning in this model is infeasible. Clustering of one hour of video does not allow visiting eah datum more than one. This onstraint suggests one pass over-lustering of the frames, followed b iterative luster grouping. Eah of this operations orrespond to the re-estimation of the parameters in the model. We derive the algorithm for reursive estimation of this model based on the EM algorithm, thus inheriting its onvergene and optimalit properties. Fast inferene methods and video navigation tool are the features of this work. 2. Model The video analsis algorithm we present is based on a generative model Figure 1 (left) that assumes the video senes are generated b a set of normalized senes that are subjeted to geometrial transforms and noise [3]. The appearane of the sene is modeled b a Gaussian appearane map. The probabilit densit of the vetor of pixel values z for the latent image orresponding to the luster is p(z ) =N (z; μ, Φ ), (1) where μ is the mean of the latent image z, and Φ is a diagonal ovariane matrix that speifies the variabilit of eah pixel in the latent image. The variabilit Φ is neessar to apture various auses not aptured b the variabilit in sene lass and transformation, eg. slight blemishes in appearane or hanges in lighting. We do not model the full ovariane matrix as there is never enough data to estimate it form the data. It is possible, however to use a subspae modeling tehnique to apture some orrelations in this matrix. The observable image is modeled as a ropped region of the latent sene image. Before ropping, the latent image undergoes a transformation, omposed of a zoom, rotation and translation. The motivation for this is that out of the global amera motion tpes, zoom and pan are the most frequent, while rotations are fairl rare. Leaving out more omplex motion, suh as the ones produed b perspetive effets, several dominant motion vetor fields, nonuniform motion, et., speeds up the algorithm (real time in our implementation), but makes the above defined variane maps a ruial part of the model, as the an apture the extra variabilit, although in a ruder manner. In addition, the nonuniform variane map has to apture some other auses of variabilit we left out, suh as small illumination hanges, variable ontrast, et. The probabilit densit of the observable vetor of pixel values x for the image orresponding to the zoom Z, translation T, rotation R, latent image z and fixed ropping transform W is p(x T, Z, R, z) =δ(x WTZRz) (2) where T, R and Z ome from a finite set of possible transformations. Similar affine generative model in onjuntion with Baesian inferene was proposed in [13]. We onsider onl a few different levels of zoom and rotation. The omputational burden of searhing over all integer translations

3 is relieved b the use of Fast Fourier Transform (FFT) for performing omputations in the Fourier domain (Se. 4). While we an view this model as the model where the omposition WTZR is treated as a novel transformation, it is an imperative to keep these transformations separate in order to derive an effiient inferene algorithm based on the FFTs, whih is several orders of magnitude faster than the algorithm based on testing all possible transformations jointl. The joint likelihood of a single video frame x and latent image z, given and T is p(x, z, T, Z, R) =δ(x WTZRz)N (z; μ, Φ ) (3) Note that the distribution over z an be integrated out in the losed form p(x,t,z,r)=n (x;wtzμ,wtzφ R Z T W ) (4) Under the assumption that eah frame is independentl generated in this fashion, the joint distribution over all variables is p({x, t, R t, Z t, T t } T t=1)= t p(x t t, R t, Z t, T t ) p( t )p(t t, Z t ) (5) The model is parameterized b sene means μ, pixel varianes stored on the diagonal of Φ and sene probabilities π = p( t = ), and as suh providing a summar of what is ommon in the video. The hidden variables t, R t, Z t, T t, desribe the main auses of variabilit in the video, and as suh var from frame to frame. The prior distribution over R, Z, T is assumed uniform. 3. Reursive model estimation It is possible to derive the EM algorithm in the losed form (Se. 4) for the proposed model. However, the number of senes (omponents in the mixture) is unknown. Also, the exhaustive omputation of the posterior probabilities over transformations and lasses is intratable. We use the variant of inremental EM algorithm [15, 16] to quikl luster the frames into the large number of lusters using, at this stage, translation and ropping-invariant model onl. We dnamiall update the number of lasses, b adding a new lass whenever the model annot explain the new data well. Given that a large number of frames x t have been lustered (summarized) in this manner using a mixture model p(x) = p(x )p() with C lusters (omponents), eah desribed b the prior p() =π, mean μ and a diagonal ovariane matrix Φ, we want to estimate another mixture model p 1 defined b a smaller number of lusters S with parameters π s, μ s, Φ s on the same data. We will formall derive the re-estimation algorithm using a Gaussian mixture model as an example, with the understanding that the same derivation is arried out for the more omplex models that inludes transformations. Assuming that p summarizes the data well, we an replae the real data {x i } with the similar ( virtual ) data { i },i=1...n randoml generated from the obtained mixture model, and estimate the parameters of the model p 1 using the virtual data { t }. When the number of virtual data (N) grows infinitel, the distribution onverges in probabilit to the original data distribution. We an fit the simpler distribution p 1 to { t } without atuall generating them, but rather b working onl with the expetations under the model p. We optimize the expetation i log p1 ( i ) for 1 of the likelihood of the generated data, N large N, where i is sampled from p() (in our example the mixture model with parameters {π, μ, Φ }). 1 N log p 1 ( i ) E[log p 1 ()]= N i=1 [ ] = p()logp 1 ()= p( )p() log p 1 () = p() p( )logp 1 () p() p( ) q (s)log p1 ( s)p 1 (s) = EF, (6) q s (s) where the inequalit follows b the same onvexit argument as in the ase of the standard free energ [15]. Suh reparametrized model p 1 an be reursivel reparametrized, giving the hierarh of models of the dereasing omplexit. B doing this we resort to the original data exatl one and avoid ostl re-proessing of hundreds of thousands of video frames. The new bound on the free energ EF would be tight if q (s) were exatl equal to the posterior, i.e., q (s) =p 1 (s ). However, we assume that the posterior is the same for all one the lass is hosen, and we emphasize this with the notation q (s). Under this assumption the bound further simplifies into EF = { [ ] p() q (s) p( )logp 1 ( s) + s } q (s)[log p 1 (s) log q (s)] s = p() [ q (s) 1 2 (μ s μ ) T Φ 1 s (μ s μ ) s 1 2 tr(φ 1 s Φ ) 1 ] 2 log 2πΦ s + p() s q (s)[log p 1 (s) log q (s)] (7)

4 Minimizing the free energ under the usual onstraints, e.g., s q (s) =1ields an iteration of an EM algorithm that reparameterizes the model, e.g., for the plain mixture model, q (s) p 1 (s)e 1 2 (μ s μ )T Φ 1 s (μ s μ ) 1 2 tr(φ 1 s Φ) 1 2 log 2πΦs (8) μ s = p()q (s)μ p()q (s) Φ s = p()q (s)[(μ s μ )(μ s μ ) T + Φ ] p()q (s) π s = p 1 (s) = p()q (s) p() = p()q (s) (9) Similar reparametrization algorithm was intuitivel proposed in [1] for data lustering in the presene of unertainties. The idea of reursive densit estimation is reminisent of [2]. The EM algorithm above will onverge to a loal maximum and the qualit of the results will depend on the validit of the assumption that the posterior q(s) is shared among all virtual data samples from the same lass. When model p aptures the original data with lots of narrow models p( ), and S C, the approximation is reasonable and redues the omputation b a fator of T/C in omparison with retraining diretl on the original data. The result of reursive model estimation is a hierarh of models whih an be elegantl presented to the user through an appropriate user interfae shown in the video submission. Figure 1 (right) illustrates reursive lustering of three distributions into a single hper-luster, using both translation and sale invariant lustering Computing the fast approximate KL divergene between two mixtures The optimization in Eq.(6) an be alternativel seen as the minimization of the KL divergene between distributions p and p 1. Thus, we an use the bound on the variational free energ for the re-estimation problem to obtain tight upper bound on the KL divergene between two mixture of Gaussians (MoGs) a problem not tratable in the losed form. Reentl, effiient and aurate omputation of the KL divergene between the mixtures has attrated a lot of attention [17, 18]. As the ground truth for the omputation of the KL divergene, we will use Monte Carlo simulation with large number of partiles as KL(f g) = f log f g 1 n n t=1 log f(x t) g(x t ) (10) While this method is asmptotiall exat, it is painfull slow. In [17] authors proposed a ouple of approximations on KL divergene based on ounting the influene onl of nearest omponents in two mixtures ( weak interations ). The demonstrated that their approximation is better than previous one published in [18]. The onlusion of their work is that KL divergene based on unsented transformation [19] (also known as the quadrati approximation ) gives exellent results, with the slight omputational overhead. This method is based on the approximate omputation of the expetation of some funtion h under d dimensional Gaussian f with the mean μ and ovariane matrix Σ as f(x)h(x)dx 1 2d 2d k=1 h(x k ) (11) where the set of 2d sigma points x k is defined as x k = μ +( dσ) k k =1,...,d x d+k = μ ( dσ) k k =1,...,d (12) We will use this method as the urrent art to ompare against the variational method. Given two mixtures p and p 1 the KL divergene an be separated into two terms KL(p, p 1 )=H(p) p()logp 1 () = (13) p()logp() p()logp 1 () We note that optimization we performed in Eq.(6) is the variational maximization of the lower bound of p()p1 (). B substituting the S C matrix q (readil omputed in Eq.(8)) into Eq.(7) the upper bound for p()p1 () follows. In the same manner, the lower bound on entrop H(p) of the Gaussian mixture p with parameters (π, μ, Φ ) an be approximated as {π ( 1 2 log det(2πφ )) + log(π )} <H(p) (14) The summation of the lower and upper bound of two terms in the KL divergene need not lead to the unambiguous onlusion on the nature of the approximation. Empiriall, we found that the entrop term negligibl ontributes to the KL divergene. 4. Inferene of lasses and transformations. Learning the senes in the model Inferene (posterior optimization). In this setion we will omit the rotation R b treating it as an identit transformation in order to keep the derivations simple. For our

5 model it is possible to derive exat EM algorithm that optimizes the free energ [15] of the form F = q( t, Z t, T t )log p(x t t, Z t, T t )π t q( t t, Z t, T t ) t,z t,t t (15) For given parameters, we an optimize the free energ with respet to the posterior q. We express the posterior as q( t, Z t )q(t t t, Z t ) and optimize F under the normalization onstraints t,z t q( t, Z t ) = 1 and T t q(t t t, Z t )=1, whih results in the same result as appling the Baes rule, q(t t t, Z t ) p(x t t, Z t, T t ),q( t, Z t ) p( t )e q(t Zt,Tt)logp(xt t,zt,tt) (16) Parameter optimization. Finding the derivatives of F with respet to the luster mean t = k we get T t=1 {T t, t} It an be shown that q({ t, T t })(WT t Z t ) (WT t Z t Φ k Z tt tw ) 1 (x t WT t Z t μ k )=0 (17) T W WTZΦ 1 Z T W x t = ZΦ 1 T W WTZΦ 1 Z diag(t X t ) Z T W WT = ZΦ 1 Z diag(t m) (18) where m diag(w W), is the binar mask that shows the position of observable image within latent image (upper left orner), and where X W x is frame x zero padded to the resolution of the latent image. Thus, assuming that the zoom has small effet on inverse varianes, i.e., (ZΦZ ) 1 (ZΦ 1 Z ) 1 we obtain simple update rules, e.g. for μ k T t=1 Z t q( t =k, Z t )Z 1 t T t q(t t t =k, Z t )(T X t ) T t=1 Z t q( t =k, Z t )Z 1 t T t q(t t t =k, Z t )(T m) (19) where Z 1 is the pseudoinverse of matrix Z, or the inverse zoom. In a similar fashion, we obtain the derivatives of F with respet to other two tpes of model parameters Φ k and π k, and derive the update equations. It ma seem at first that zero padding of the original frame to the size of the latent image onstitutes the unjustified manipulation of the data. But, taking into aount that zero is neutral element for the summation of the suffiient statistis in (19), it is atuall the mathematial onveniene to treat all variables as being of the same dimensionalit (resolution). The intuition 1 To avoid degenerate solutions, the likelihood is saled with the number of pixel inrease that the zoom auses. behind (19) is that the mean latent (panorami) image is the weighted sum of the properl shifted and saled frames, normalized with the ounter that keeps trak how man times eah pixel was visited. Speeding up inferene and parameter optimization using FFTs. Inferene and update equations (16) and (17) involve either testing all possible transformations or summations over all possible transformations T. If all possible integer shifts are onsidered (whih is desirable sine one an handle arbitrar interframe shifts), then these operations an be effiientl performed in the Fourier domain b identifing them as either onvolutions or orrelations. For example, (19) an be effiientl omputed using two dimensional FFT [11] as q(t, Z)(T X)= T IFFT2[onj(FFT2(q(T))) (FFT2(X))] (20) where denotes point wise multipliation, and onj denotes omplex onjugate. This is done for eah ombination of the lass and sale variables, and a similar onvolution of the transformation posterior is also applied to the mask m. Similarl, FFTs are used for inferene to ompute Mahalanobis distane in (4). This redues omputational omplexit of both inferene and parameter update from N 2 to NlogN (N number of pixels), allows us to analze video frames of higher resolution, and demonstrate the benefits of keeping translation variable T and separate from ropping W and zoom Z in the model. The omputation is still proportional to the number of lasses, as well as the number of zoom levels we searh and sum over in the E and M steps, but the number of these onfiguration is tpiall muh smaller than the number of possible shifts in the image. On-line learning. The bath EM learning suffers from two drawbaks: the need to preset the number of lasses C, and the need to iterate. The struture of realisti video allows development of more effiient algorithms. Frames in video tpiall ome in bursts of a single lass whih means that the algorithm does not need to test all lasses against all frames all the time. We use an on-line variant of the EM algorithm with the inremental estimation of suffiient statistis [15, 16]. The reestimation update equations (Eq. (9)) are reformulated in the same manner. In order to dnamiall learn the number of senes in the on-line EM algorithm, we introdue the threshold on the log-likelihood suh that whenever the log-likelihood falls under the threshold a new sene is introdued. The sensitivit due to the hoie of the threshold is overome b setting high threshold that guarantees the likelihood of the dataset remains high, but whih ma lead to over-lustering. The problem of merging large number of lusters still muh smaller than the number of frames is addressed in Setion

6 3. When the number of lusters is redued to the order of 100, we appl the full learning using the bath EM algorithm with number of lusters determined b the MDL riterion. Taking into aount that amera/objet shifts are b far the most ommon transformations in the video, another speed-up is to perform translation-onl invariant lustering in the first pass (b setting Z,R to identit matries). This approah redues most of the variabilit in the data with little omputational ost. The overall performane of our lustering is 35 frames per seond on 3GHz PC. 5. Experiments Computing the KL divergene between two mixtures. We tested the performane of omputing the upper bound on the KL divergene between two Gaussians in the setup similar to [17]. The mean of eah Gaussian in the five dimensional spae is randoml hosen aording to N (0, 1), whereas the ovariane matrix is sampled from Wishart distribution. In order to avoid almost singular ovariane matries that ma arise in the random sampling, ondition number is set to be at least 20. Covariane matrix is premultiplied with a small number ɛ that aounts for the width of the Gaussians. Higher values of ɛ orrespond to higher overlap between the blobs. We tested four methods: Monte Carlo simulation with 10,000 partiles (MC10000) assumed to be the golden standard; Monte Carlo simulation with 100 partiles (MC100); method based on unsented transform; and, our method (variational). We repeated eah simulation 100 times and averaged the results. We present the results in Table 1 and Fig. 2. The best results were obtained via the unsented approximation, followed b our method, and the MC100. The bottom row of Table 1 indiates the relative proessing time needed to ompute KL divergene for eah method. While approximate, our method is b far the fastest of the proposed methods. In Fig. 2 we illustrate the aura of the omputation of the KL divergene for different values of the parameter ɛ and the omputational omplexit of our method for different dimensionalit of the spae. As demonstrated, our method espeiall sales well in the the high dimensional spae. Video lustering and navigation. We tested our sstem on an 18 minutes long home video. In the first pass of on-line learning, the video is summarized in 290 lusters, man of them repeating. We reestimate this model until we end up with roughl three dozen of lasses. In all but the first pass we searh over all onfigurations of the zoom, rotation and lass variables. The omplexit of the learning drops as we go higher and higher in the hierarh (due to the smaller number of lusters), and so we do not need to be areful about the exat seletion of the sequene of thresholds or numbers of lasses - we simpl train a large number of models, as the user an hoose an one of them quikl at the browsing time. Figure 2. Computing KL divergene between two mixtures (12 and 8 omponents). Comparison of Monte-Carlo simulations, unsented transform-based method and the method in this work. Top: KL divergene is alulated between two MoGs in the 20-dimensional spae. Horizontal axis depits the regularization parameter epsilon. Our method, as antiipated, is upper bound to the true KL divergene. Middle: KL divergene as a funtion of the dimensionalit of mixtures. Regularization parameter is fixed to 0.4. Bottom: Computational time in seonds for our method and unsented-based method as a funtion of spae dimensionalit. Monte Carlo simulations are too slow to sale. Fig. 3 (top) illustrates the time line and sene partitioning using shot detetors and our method. On the left we hand-

7 ɛ unsented MC10000 variational MC Time Table 1. Values of the KL-divergene for four different methods and different values of regularization parameter ɛ. MC10000 is taken as the golden standard. Variational method is a reasonable approximation and it is b far the fastest. As expeted, for larger ɛ all methods make large errors. label ground truth (also indiating the repeating senes). In the middle we show shot detetion results using ommerial shot detetor largel on the olor histogram (from Mirosoft MovieMaker). On the right we show the lustering results using our method, and indiate the ases where our method over-segmented. The lustering fails in the ases when sene variabilit is not explained b the model. Some of the issues an be takled b using the higher number of sale levels in the model and inreasing the sene size with respet to the frame size. But, the real benefit of this approah is in the novel video navigation and browsing tool. Supplemental video material (Fig. 3 bottom and nemanja/video1.wmv) demonstrates the usefulness of this method. Cluster means are visualized as the thumbnail images that represent the index into the video. For eah pixel in eah frame in the video there is a mapping into the thumbnail image. The user browses the video b moving the mouse pointer over the ative panel. Instantl, frames within the luster that are loated in the proximit of the ursor are retrieved and marked in green on the time-line at the bottom of the interfae. The user an further double-lik at eah thumbnail images and it will deompose into the hild lusters that were merged together in the re-estimation proedure. The browsing ma then be ontinued seamlessl at the different level. Informall tested on 165 users, the sstem proved to be ver useful for the users to rapidl grasp the ontent of the video not seen before. 6. Conlusions In this work we presented a generative model for video that proved useful for unsupervised video lustering. We speifiall addressed the problem of intratabilit of naive learning for large sale problems b introduing the number of algorithmi tehniques for rapid learning and inferene. Our video analsis requires no hand-set parameters. The burden of seleting the optimal number of senes itself highl subjetive task is shifted to the user to hoose among the hierarh of the models. We believe that this model and the aompaning intuitive user-interfae will prove useful for quik and seamless video retrieval. Additionall, we will further explore the benefits of the proposed method for the rapid omputation of the KL divergene, espeiall in the high dimensional spae and for the massive data-sets. Aknowledgments This work was supported in part b Advaned Researh and Development Ativities (ARDA) under Contrat MDA C Referenes [1] T.F. Cootes, C.J.Talor. Statistial Models of Appearane for Computer Vision, Tehnial Report Universit of Manhester, [2] A. Gra, A. Moore. Rapid Evaluation of Multiple Densit Models, In Aritifiial Intelligene & Statistis, [3] D. Mumford. Pattern theor: a unifing perspetive, In Pereption as Baesian Inferene, Cambridge Universit Press, [4] R. Jones, D. DeMenthon, and D. Doermann, Building mosais from video using MPEG motion vetors In ACM Multimedia, [5] P. Anandan, M. Irani, M. Kumar, and J. Bergen. Video as an image data soure: Effiient representations and appliations, In Proeedings of IEEE ICIP, pp , [6] A. Aner, J. R. Kender. Video summaries through mosaibased shot and sene lustering, In ECCV, [7] M. Irani, P. Anandan, S. Hsu. Mosai based representations of video sequenes and their appliations, In ICCV, pages , June [8] M. Brown, D. Lowe, Reognising Panoramas, ICCV03. [9] M. Yeung, B. Yeo, B. Liu. Segmentation of video b lustering and graph analsis, In CVIU, 71:1, Jul [10] A. Girgensohn, J. Borezk. Time-onstrained keframe seletion tehnique, In Pro. IEEE Multimedia Computing and Sstems, [11] B.J. Fre and N. Joji. Fast, large-sale transformationinvariant lustering, In NIPS 14, Cambridge, MA: MIT Press, [12] C. M. Bishop. Variational learning in graphial models and neural networks In Proeedings 8th ICANN, [13] J. Winn, A. Blake. Generative Affine Loalisation and Traking In NIPS, 2003.

8 Figure 3. Top: Comparison of ground truth, shot detetion and our method. The time line with ground truth segmentation (left) illustrates positions and labels of 15 senes in the video. Some of the senes are repeating. Shot detetion algorithm based on olor histogram (enter) is sensitive to sudden swipes and hanges in light. In the same time, sene hanges are undeteted if there are due to slow swipes of the amera. Our approah (right) orretl detets senes in most ases. We label the situations when it over-segments. Bottom: A snapshot of video navigation and browsing tool. See the demo at nemanja/video1.wmv [14] B.J. Fre, N. Joji. Transformation-invariant lustering using the EM algorithm. PAMI, 25(1), Jan [15] R. M. Neal, G. E. Hinton. A new view of the EM algorithm that justifies inremental, sparse and other variants, In Learning in Graphial Models, page Norwell MA: Kluwer Aademi Publishers, [16] S. J. Nowlan. Soft Competitive Adaptation: Neural Network Learning Algorithms based on Fitting Statistial Mixtures, Ph.D. thesis, Carnegie Mellon Universit, [17] J. Goldberger, S. Gordon, H. Greenspan. Effiient Image Similarit Measure based on Approximations of KL- Divergene Between Two Gaussian Mixtures, ICCV03. [18] N. Vasonelos. On the omplexit of probabilisti image retrieval, In ICCV, [19] S. Julier, J. K. Uhlmann. A general method for approximating non-linear transformations of probabilit distributions, Tehnial report, RRG, Universit of Oxford, 1996.

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else 3rd International Conferene on Multimedia Tehnolog(ICMT 013) An Effiient Moving Target Traking Strateg Based on OpenCV and CAMShift Theor Dongu Li 1 Abstrat Image movement involved bakground movement and

More information

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and

More information

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks Unsupervised Stereosopi Video Objet Segmentation Based on Ative Contours and Retrainable Neural Networks KLIMIS NTALIANIS, ANASTASIOS DOULAMIS, and NIKOLAOS DOULAMIS National Tehnial University of Athens

More information

NOISE REMOVAL FOR OBJECT TRACKING BASED ON HSV COLOR SPACE PARAMETER USING CAMSHIFT

NOISE REMOVAL FOR OBJECT TRACKING BASED ON HSV COLOR SPACE PARAMETER USING CAMSHIFT International Journal of Computational Intelligene & Teleommuniation Sstems () 0 pp. 39-45 NOISE REOVAL FOR OBJECT TRACKING BASED ON HSV COLOR SPACE PARAETER USING CASHIFT P. Raavel G. Appasami and R.

More information

A Novel Validity Index for Determination of the Optimal Number of Clusters

A Novel Validity Index for Determination of the Optimal Number of Clusters IEICE TRANS. INF. & SYST., VOL.E84 D, NO.2 FEBRUARY 2001 281 LETTER A Novel Validity Index for Determination of the Optimal Number of Clusters Do-Jong KIM, Yong-Woon PARK, and Dong-Jo PARK, Nonmembers

More information

arxiv: v1 [cs.db] 13 Sep 2017

arxiv: v1 [cs.db] 13 Sep 2017 An effiient lustering algorithm from the measure of loal Gaussian distribution Yuan-Yen Tai (Dated: May 27, 2018) In this paper, I will introdue a fast and novel lustering algorithm based on Gaussian distribution

More information

WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION

WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION Lei Li 1, Y. X. ZOU 1* and Yi Li 1 ADSPLAB/ELIP, Shool of ECE, Peking Universit, Shenzhen 518055, China Shenzhen JiFu

More information

Extracting Partition Statistics from Semistructured Data

Extracting Partition Statistics from Semistructured Data Extrating Partition Statistis from Semistrutured Data John N. Wilson Rihard Gourlay Robert Japp Mathias Neumüller Department of Computer and Information Sienes University of Strathlyde, Glasgow, UK {jnw,rsg,rpj,mathias}@is.strath.a.uk

More information

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION Cuiui Kang 1, Shengai Liao, Shiming Xiang 1, Chunhong Pan 1 1 National Laboratory of Pattern Reognition, Institute of Automation, Chinese

More information

Detection and Recognition of Non-Occluded Objects using Signature Map

Detection and Recognition of Non-Occluded Objects using Signature Map 6th WSEAS International Conferene on CIRCUITS, SYSTEMS, ELECTRONICS,CONTROL & SIGNAL PROCESSING, Cairo, Egypt, De 9-31, 007 65 Detetion and Reognition of Non-Oluded Objets using Signature Map Sangbum Park,

More information

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1.

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1. Fuzzy Weighted Rank Ordered Mean (FWROM) Filters for Mixed Noise Suppression from Images S. Meher, G. Panda, B. Majhi 3, M.R. Meher 4,,4 Department of Eletronis and I.E., National Institute of Tehnology,

More information

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application World Aademy of Siene, Engineering and Tehnology 8 009 Performane of Histogram-Based Skin Colour Segmentation for Arms Detetion in Human Motion Analysis Appliation Rosalyn R. Porle, Ali Chekima, Farrah

More information

the data. Structured Principal Component Analysis (SPCA)

the data. Structured Principal Component Analysis (SPCA) Strutured Prinipal Component Analysis Kristin M. Branson and Sameer Agarwal Department of Computer Siene and Engineering University of California, San Diego La Jolla, CA 9193-114 Abstrat Many tasks involving

More information

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2 On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,

More information

Video Data and Sonar Data: Real World Data Fusion Example

Video Data and Sonar Data: Real World Data Fusion Example 14th International Conferene on Information Fusion Chiago, Illinois, USA, July 5-8, 2011 Video Data and Sonar Data: Real World Data Fusion Example David W. Krout Applied Physis Lab dkrout@apl.washington.edu

More information

Semi-Supervised Affinity Propagation with Instance-Level Constraints

Semi-Supervised Affinity Propagation with Instance-Level Constraints Semi-Supervised Affinity Propagation with Instane-Level Constraints Inmar E. Givoni, Brendan J. Frey Probabilisti and Statistial Inferene Group University of Toronto 10 King s College Road, Toronto, Ontario,

More information

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines The Minimum Redundany Maximum Relevane Approah to Building Sparse Support Vetor Mahines Xiaoxing Yang, Ke Tang, and Xin Yao, Nature Inspired Computation and Appliations Laboratory (NICAL), Shool of Computer

More information

Gradient based progressive probabilistic Hough transform

Gradient based progressive probabilistic Hough transform Gradient based progressive probabilisti Hough transform C.Galambos, J.Kittler and J.Matas Abstrat: The authors look at the benefits of exploiting gradient information to enhane the progressive probabilisti

More information

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index IJCSES International Journal of Computer Sienes and Engineering Systems, ol., No.4, Otober 2007 CSES International 2007 ISSN 0973-4406 253 An Optimized Approah on Applying Geneti Algorithm to Adaptive

More information

Weak Dependence on Initialization in Mixture of Linear Regressions

Weak Dependence on Initialization in Mixture of Linear Regressions Proeedings of the International MultiConferene of Engineers and Computer Sientists 8 Vol I IMECS 8, Marh -6, 8, Hong Kong Weak Dependene on Initialization in Mixture of Linear Regressions Ryohei Nakano

More information

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION Ken Sauer and Charles A. Bouman Department of Eletrial Engineering, University of Notre Dame Notre Dame, IN 46556, (219) 631-6999 Shool of

More information

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425) Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University

More information

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating Capturing Large Intra-lass Variations of Biometri Data by Template Co-updating Ajita Rattani University of Cagliari Piazza d'armi, Cagliari, Italy ajita.rattani@diee.unia.it Gian Lua Marialis University

More information

Stable Road Lane Model Based on Clothoids

Stable Road Lane Model Based on Clothoids Stable Road Lane Model Based on Clothoids C Gakstatter*, S Thomas**, Dr P Heinemann*, Prof Gudrun Klinker*** *Audi Eletronis Venture GmbH, **Leibniz Universität Hannover, ***Tehnishe Universität Münhen

More information

Evolutionary Feature Synthesis for Image Databases

Evolutionary Feature Synthesis for Image Databases Evolutionary Feature Synthesis for Image Databases Anlei Dong, Bir Bhanu, Yingqiang Lin Center for Researh in Intelligent Systems University of California, Riverside, California 92521, USA {adong, bhanu,

More information

A scheme for racquet sports video analysis with the combination of audio-visual information

A scheme for racquet sports video analysis with the combination of audio-visual information A sheme for raquet sports video analysis with the ombination of audio-visual information Liyuan Xing a*, Qixiang Ye b, Weigang Zhang, Qingming Huang a and Hua Yu a a Graduate Shool of the Chinese Aadamy

More information

8 : Learning Fully Observed Undirected Graphical Models

8 : Learning Fully Observed Undirected Graphical Models 10-708: Probabilisti Graphial Models 10-708, Spring 2018 8 : Learning Fully Observed Undireted Graphial Models Leturer: Kayhan Batmanghelih Sribes: Chenghui Zhou, Cheng Ran (Harvey) Zhang When learning

More information

Electrical Engineering Department, University of Texas at Dallas, Richardson, TX, USA 3

Electrical Engineering Department, University of Texas at Dallas, Richardson, TX, USA 3 Effiient Performane odeling via Dual-Prior Baesian odel Fusion for Analog and ixed-signal Ciruits Qiheng Huang Chenlei Fang Fan Yang * Xuan Zeng * Dian Zhou and Xin i 3 State Ke ab of ASIC & Sstem iroeletronis

More information

An Alternative Approach to the Fuzzifier in Fuzzy Clustering to Obtain Better Clustering Results

An Alternative Approach to the Fuzzifier in Fuzzy Clustering to Obtain Better Clustering Results An Alternative Approah to the Fuzziier in Fuzzy Clustering to Obtain Better Clustering Results Frank Klawonn Department o Computer Siene University o Applied Sienes BS/WF Salzdahlumer Str. 46/48 D-38302

More information

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules Improved Vehile Classifiation in Long Traffi Video by Cooperating Traker and Classifier Modules Brendan Morris and Mohan Trivedi University of California, San Diego San Diego, CA 92093 {b1morris, trivedi}@usd.edu

More information

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition A Coarse-to-Fine Classifiation Sheme for Faial Expression Reognition Xiaoyi Feng 1,, Abdenour Hadid 1 and Matti Pietikäinen 1 1 Mahine Vision Group Infoteh Oulu and Dept. of Eletrial and Information Engineering

More information

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department

More information

Exploring the Commonality in Feature Modeling Notations

Exploring the Commonality in Feature Modeling Notations Exploring the Commonality in Feature Modeling Notations Miloslav ŠÍPKA Slovak University of Tehnology Faulty of Informatis and Information Tehnologies Ilkovičova 3, 842 16 Bratislava, Slovakia miloslav.sipka@gmail.om

More information

New Fuzzy Object Segmentation Algorithm for Video Sequences *

New Fuzzy Object Segmentation Algorithm for Video Sequences * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 521-537 (2008) New Fuzzy Obet Segmentation Algorithm for Video Sequenes * KUO-LIANG CHUNG, SHIH-WEI YU, HSUEH-JU YEH, YONG-HUAI HUANG AND TA-JEN YAO Department

More information

Superpixel Tracking. School of Information and Communication Engineering, Dalian University of Technology, China 2

Superpixel Tracking. School of Information and Communication Engineering, Dalian University of Technology, China 2 Superpixel Traking Shu Wang1, Huhuan Lu1, Fan Yang1, and Ming-Hsuan Yang2 1 Shool of Information and Communiation Engineering, Dalian University of Tehnology, China 2 Eletrial Engineering and Computer

More information

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study What are Cyle-Stealing Systems Good For? A Detailed Performane Model Case Study Wayne Kelly and Jiro Sumitomo Queensland University of Tehnology, Australia {w.kelly, j.sumitomo}@qut.edu.au Abstrat The

More information

Cluster-Based Cumulative Ensembles

Cluster-Based Cumulative Ensembles Cluster-Based Cumulative Ensembles Hanan G. Ayad and Mohamed S. Kamel Pattern Analysis and Mahine Intelligene Lab, Eletrial and Computer Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1,

More information

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization Self-Adaptive Parent to Mean-Centri Reombination for Real-Parameter Optimization Kalyanmoy Deb and Himanshu Jain Department of Mehanial Engineering Indian Institute of Tehnology Kanpur Kanpur, PIN 86 {deb,hjain}@iitk.a.in

More information

Boosted Random Forest

Boosted Random Forest Boosted Random Forest Yohei Mishina, Masamitsu suhiya and Hironobu Fujiyoshi Department of Computer Siene, Chubu University, 1200 Matsumoto-ho, Kasugai, Aihi, Japan {mishi, mtdoll}@vision.s.hubu.a.jp,

More information

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry Deteting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry D. M. Zasada, P. K. Sanyal The MITRE Corp., 6 Eletroni Parkway, Rome, NY 134 (dmzasada, psanyal)@mitre.org

More information

Chapter 2: Introduction to Maple V

Chapter 2: Introduction to Maple V Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard

More information

Using Augmented Measurements to Improve the Convergence of ICP

Using Augmented Measurements to Improve the Convergence of ICP Using Augmented Measurements to Improve the onvergene of IP Jaopo Serafin, Giorgio Grisetti Dept. of omputer, ontrol and Management Engineering, Sapienza University of Rome, Via Ariosto 25, I-0085, Rome,

More information

Face and Facial Feature Tracking for Natural Human-Computer Interface

Face and Facial Feature Tracking for Natural Human-Computer Interface Fae and Faial Feature Traking for Natural Human-Computer Interfae Vladimir Vezhnevets Graphis & Media Laboratory, Dept. of Applied Mathematis and Computer Siene of Mosow State University Mosow, Russia

More information

A {k, n}-secret Sharing Scheme for Color Images

A {k, n}-secret Sharing Scheme for Color Images A {k, n}-seret Sharing Sheme for Color Images Rastislav Luka, Konstantinos N. Plataniotis, and Anastasios N. Venetsanopoulos The Edward S. Rogers Sr. Dept. of Eletrial and Computer Engineering, University

More information

The Implementation of RRTs for a Remote-Controlled Mobile Robot

The Implementation of RRTs for a Remote-Controlled Mobile Robot ICCAS5 June -5, KINEX, Gyeonggi-Do, Korea he Implementation of RRs for a Remote-Controlled Mobile Robot Chi-Won Roh*, Woo-Sub Lee **, Sung-Chul Kang *** and Kwang-Won Lee **** * Intelligent Robotis Researh

More information

Simultaneous image orientation in GRASS

Simultaneous image orientation in GRASS Simultaneous image orientation in GRASS Alessandro BERGAMINI, Alfonso VITTI, Paolo ATELLI Dipartimento di Ingegneria Civile e Ambientale, Università degli Studi di Trento, via Mesiano 77, 38 Trento, tel.

More information

FUZZY WATERSHED FOR IMAGE SEGMENTATION

FUZZY WATERSHED FOR IMAGE SEGMENTATION FUZZY WATERSHED FOR IMAGE SEGMENTATION Ramón Moreno, Manuel Graña Computational Intelligene Group, Universidad del País Vaso, Spain http://www.ehu.es/winto; {ramon.moreno,manuel.grana}@ehu.es Abstrat The

More information

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking Algorithms for External Memory Leture 6 Graph Algorithms - Weighted List Ranking Leturer: Nodari Sithinava Sribe: Andi Hellmund, Simon Ohsenreither 1 Introdution & Motivation After talking about I/O-effiient

More information

Graph-Based vs Depth-Based Data Representation for Multiview Images

Graph-Based vs Depth-Based Data Representation for Multiview Images Graph-Based vs Depth-Based Data Representation for Multiview Images Thomas Maugey, Antonio Ortega, Pasal Frossard Signal Proessing Laboratory (LTS), Eole Polytehnique Fédérale de Lausanne (EPFL) Email:

More information

Interconnection Styles

Interconnection Styles Interonnetion tyles oftware Design Following the Export (erver) tyle 2 M1 M4 M5 4 M3 M6 1 3 oftware Design Following the Export (Client) tyle e 2 e M1 M4 M5 4 M3 M6 1 e 3 oftware Design Following the Export

More information

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors Eurographis Symposium on Geometry Proessing (003) L. Kobbelt, P. Shröder, H. Hoppe (Editors) Rotation Invariant Spherial Harmoni Representation of 3D Shape Desriptors Mihael Kazhdan, Thomas Funkhouser,

More information

Approximate logic synthesis for error tolerant applications

Approximate logic synthesis for error tolerant applications Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu

More information

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8 Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introdution... 1 1.1. Internet Information...2 1.2. Internet Information Retrieval...3 1.2.1. Doument Indexing...4 1.2.2. Doument Retrieval...4

More information

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering A Novel Bit Level Time Series Representation with Impliation of Similarity Searh and lustering hotirat Ratanamahatana, Eamonn Keogh, Anthony J. Bagnall 2, and Stefano Lonardi Dept. of omputer Siene & Engineering,

More information

Exploiting Enriched Contextual Information for Mobile App Classification

Exploiting Enriched Contextual Information for Mobile App Classification Exploiting Enrihed Contextual Information for Mobile App Classifiation Hengshu Zhu 1 Huanhuan Cao 2 Enhong Chen 1 Hui Xiong 3 Jilei Tian 2 1 University of Siene and Tehnology of China 2 Nokia Researh Center

More information

Shape Outlier Detection Using Pose Preserving Dynamic Shape Models

Shape Outlier Detection Using Pose Preserving Dynamic Shape Models Shape Outlier Detetion Using Pose Preserving Dynami Shape Models Chan-Su Lee Ahmed Elgammal Department of Computer Siene, Rutgers University, Pisataway, NJ 8854 USA CHANSU@CS.RUTGERS.EDU ELGAMMAL@CS.RUTGERS.EDU

More information

Chromaticity-matched Superimposition of Foreground Objects in Different Environments

Chromaticity-matched Superimposition of Foreground Objects in Different Environments FCV216, the 22nd Korea-Japan Joint Workshop on Frontiers of Computer Vision Chromatiity-mathed Superimposition of Foreground Objets in Different Environments Yohei Ogura Graduate Shool of Siene and Tehnology

More information

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays nalysis of input and output onfigurations for use in four-valued D programmable logi arrays J.T. utler H.G. Kerkhoff ndexing terms: Logi, iruit theory and design, harge-oupled devies bstrat: s in binary,

More information

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks Abouberine Ould Cheikhna Department of Computer Siene University of Piardie Jules Verne 80039 Amiens Frane Ould.heikhna.abouberine @u-piardie.fr

More information

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar Plot-to-trak orrelation in A-SMGCS using the target images from a Surfae Movement Radar G. Golino Radar & ehnology Division AMS, Italy ggolino@amsjv.it Abstrat he main topi of this paper is the formulation

More information

Fitting conics to paracatadioptric projections of lines

Fitting conics to paracatadioptric projections of lines Computer Vision and Image Understanding 11 (6) 11 16 www.elsevier.om/loate/viu Fitting onis to paraatadioptri projetions of lines João P. Barreto *, Helder Araujo Institute for Systems and Robotis, Department

More information

Outline: Software Design

Outline: Software Design Outline: Software Design. Goals History of software design ideas Design priniples Design methods Life belt or leg iron? (Budgen) Copyright Nany Leveson, Sept. 1999 A Little History... At first, struggling

More information

FOREGROUND OBJECT EXTRACTION USING FUZZY C MEANS WITH BIT-PLANE SLICING AND OPTICAL FLOW

FOREGROUND OBJECT EXTRACTION USING FUZZY C MEANS WITH BIT-PLANE SLICING AND OPTICAL FLOW FOREGROUND OBJECT EXTRACTION USING FUZZY C EANS WITH BIT-PLANE SLICING AND OPTICAL FLOW SIVAGAI., REVATHI.T, JEGANATHAN.L 3 APSG, SCSE, VIT University, Chennai, India JRF, DST, Dehi, India. 3 Professor,

More information

Transition Detection Using Hilbert Transform and Texture Features

Transition Detection Using Hilbert Transform and Texture Features Amerian Journal of Signal Proessing 1, (): 35-4 DOI: 1.593/.asp.1.6 Transition Detetion Using Hilbert Transform and Texture Features G. G. Lashmi Priya *, S. Domni Department of Computer Appliations, National

More information

Pipelined Multipliers for Reconfigurable Hardware

Pipelined Multipliers for Reconfigurable Hardware Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,

More information

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Malaysian Journal of Computer Siene, Vol 10 No 1, June 1997, pp 36-41 A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Md Rafiqul Islam, Harihodin Selamat and Mohd Noor Md Sap Faulty of Computer Siene and

More information

Gray Codes for Reflectable Languages

Gray Codes for Reflectable Languages Gray Codes for Refletable Languages Yue Li Joe Sawada Marh 8, 2008 Abstrat We lassify a type of language alled a refletable language. We then develop a generi algorithm that an be used to list all strings

More information

Segmentation of brain MR image using fuzzy local Gaussian mixture model with bias field correction

Segmentation of brain MR image using fuzzy local Gaussian mixture model with bias field correction IOSR Journal of VLSI and Signal Proessing (IOSR-JVSP) Volume 2, Issue 2 (Mar. Apr. 2013), PP 35-41 e-issn: 2319 4200, p-issn No. : 2319 4197 Segmentation of brain MR image using fuzzy loal Gaussian mixture

More information

CleanUp: Improving Quadrilateral Finite Element Meshes

CleanUp: Improving Quadrilateral Finite Element Meshes CleanUp: Improving Quadrilateral Finite Element Meshes Paul Kinney MD-10 ECC P.O. Box 203 Ford Motor Company Dearborn, MI. 8121 (313) 28-1228 pkinney@ford.om Abstrat: Unless an all quadrilateral (quad)

More information

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks International Journal of Advanes in Computer Networks and Its Seurity IJCNS A Load-Balaned Clustering Protool for Hierarhial Wireless Sensor Networks Mehdi Tarhani, Yousef S. Kavian, Saman Siavoshi, Ali

More information

Parametric Abstract Domains for Shape Analysis

Parametric Abstract Domains for Shape Analysis Parametri Abstrat Domains for Shape Analysis Xavier RIVAL (INRIA & Éole Normale Supérieure) Joint work with Bor-Yuh Evan CHANG (University of Maryland U University of Colorado) and George NECULA (University

More information

Simulation of Crystallographic Texture and Anisotropie of Polycrystals during Metal Forming with Respect to Scaling Aspects

Simulation of Crystallographic Texture and Anisotropie of Polycrystals during Metal Forming with Respect to Scaling Aspects Raabe, Roters, Wang Simulation of Crystallographi Texture and Anisotropie of Polyrystals during Metal Forming with Respet to Saling Aspets D. Raabe, F. Roters, Y. Wang Max-Plank-Institut für Eisenforshung,

More information

Performance Benchmarks for an Interactive Video-on-Demand System

Performance Benchmarks for an Interactive Video-on-Demand System Performane Benhmarks for an Interative Video-on-Demand System. Guo,P.G.Taylor,E.W.M.Wong,S.Chan,M.Zukerman andk.s.tang ARC Speial Researh Centre for Ultra-Broadband Information Networks (CUBIN) Department

More information

Accommodations of QoS DiffServ Over IP and MPLS Networks

Accommodations of QoS DiffServ Over IP and MPLS Networks Aommodations of QoS DiffServ Over IP and MPLS Networks Abdullah AlWehaibi, Anjali Agarwal, Mihael Kadoh and Ahmed ElHakeem Department of Eletrial and Computer Department de Genie Eletrique Engineering

More information

ASL Recognition Based on a Coupling Between HMMs and 3D Motion Analysis

ASL Recognition Based on a Coupling Between HMMs and 3D Motion Analysis An earlier version of this paper appeared in the proeedings of the International Conferene on Computer Vision, pp. 33 3, Mumbai, India, January 4 7, 18 ASL Reognition Based on a Coupling Between HMMs and

More information

Fast Rigid Motion Segmentation via Incrementally-Complex Local Models

Fast Rigid Motion Segmentation via Incrementally-Complex Local Models Fast Rigid Motion Segmentation via Inrementally-Complex Loal Models Fernando Flores-Mangas Allan D. Jepson Department of Computer Siene, University of Toronto {mangas,jepson}@s.toronto.edu Abstrat The

More information

TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM

TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM M. Murugeswari 1, M.Gayathri 2 1 Assoiate Professor, 2 PG Sholar 1,2 K.L.N College of Information

More information

Drawing lines. Naïve line drawing algorithm. drawpixel(x, round(y)); double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx; double y = y0;

Drawing lines. Naïve line drawing algorithm. drawpixel(x, round(y)); double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx; double y = y0; Naïve line drawing algorithm // Connet to grid points(x0,y0) and // (x1,y1) by a line. void drawline(int x0, int y0, int x1, int y1) { int x; double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx;

More information

Discrete sequential models and CRFs. 1 Case Study: Supervised Part-of-Speech Tagging

Discrete sequential models and CRFs. 1 Case Study: Supervised Part-of-Speech Tagging 0-708: Probabilisti Graphial Models 0-708, Spring 204 Disrete sequential models and CRFs Leturer: Eri P. Xing Sribes: Pankesh Bamotra, Xuanhong Li Case Study: Supervised Part-of-Speeh Tagging The supervised

More information

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT?

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? 3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? Bernd Girod, Peter Eisert, Marus Magnor, Ekehard Steinbah, Thomas Wiegand Te {girod eommuniations Laboratory, University of Erlangen-Nuremberg

More information

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing デンソーテクニカルレビュー Vol. 15 2010 特集 Road Border Reognition Using FIR Images and LIDAR Signal Proessing 高木聖和 バーゼル ファルディ Kiyokazu TAKAGI Basel Fardi ヘンドリック ヴァイゲル Hendrik Weigel ゲルド ヴァニーリック Gerd Wanielik This paper

More information

Contour Box: Rejecting Object Proposals Without Explicit Closed Contours

Contour Box: Rejecting Object Proposals Without Explicit Closed Contours Contour Box: Rejeting Objet Proposals Without Expliit Closed Contours Cewu Lu, Shu Liu Jiaya Jia Chi-Keung Tang The Hong Kong University of Siene and Tehnology Stanford University The Chinese University

More information

1. Introduction. 2. The Probable Stope Algorithm

1. Introduction. 2. The Probable Stope Algorithm 1. Introdution Optimization in underground mine design has reeived less attention than that in open pit mines. This is mostly due to the diversity o underground mining methods and omplexity o underground

More information

arxiv: v1 [cs.gr] 10 Apr 2015

arxiv: v1 [cs.gr] 10 Apr 2015 REAL-TIME TOOL FOR AFFINE TRANSFORMATIONS OF TWO DIMENSIONAL IFS FRACTALS ELENA HADZIEVA AND MARIJA SHUMINOSKA arxiv:1504.02744v1 s.gr 10 Apr 2015 Abstrat. This work introdues a novel tool for interative,

More information

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs?

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs? One Against One or One Against All : Whih One is Better for Handwriting Reognition with SVMs? Jonathan Milgram, Mohamed Cheriet, Robert Sabourin To ite this version: Jonathan Milgram, Mohamed Cheriet,

More information

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality INTERNATIONAL CONFERENCE ON MANUFACTURING AUTOMATION (ICMA200) Multi-Piee Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality Stephen Stoyan, Yong Chen* Epstein Department of

More information

Comparing Images Under Variable Illumination

Comparing Images Under Variable Illumination ( This paper appeared in CVPR 8. IEEE ) Comparing Images Under Variable Illumination David W. Jaobs Peter N. Belhumeur Ronen Basri NEC Researh Institute Center for Computational Vision and Control The

More information

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications System-Level Parallelism and hroughput Optimization in Designing Reonfigurable Computing Appliations Esam El-Araby 1, Mohamed aher 1, Kris Gaj 2, arek El-Ghazawi 1, David Caliga 3, and Nikitas Alexandridis

More information

NOISE is inevitable in any image-capturing process. Even

NOISE is inevitable in any image-capturing process. Even IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 34, NO. 7, JULY 2012 1329 Differene-Based Image Noise Modeling Using Skellam Distribution Youngbae Hwang, Member, IEEE, Jun-Sik Kim,

More information

HEXA: Compact Data Structures for Faster Packet Processing

HEXA: Compact Data Structures for Faster Packet Processing Washington University in St. Louis Washington University Open Sholarship All Computer Siene and Engineering Researh Computer Siene and Engineering Report Number: 27-26 27 HEXA: Compat Data Strutures for

More information

Video stabilization based on a 3D perspective camera model

Video stabilization based on a 3D perspective camera model Vis Comput DOI 10.1007/s00371-009-0310-z ORIGINAL ARTICLE Video stabilization based on a 3D perspetive amera model Guofeng Zhang Wei Hua Xueying Qin Yuanlong Shao Hujun Bao Springer-Verlag 2009 Abstrat

More information

Probabilistic Classification of Image Regions using an Observation-Constrained Generative Approach

Probabilistic Classification of Image Regions using an Observation-Constrained Generative Approach 9 Probabilisti Classifiation of mage Regions using an Observation-Constrained Generative Approah Sanjiv Kumar, Alexander C. Loui 2, and Martial Hebert The Robotis nstitute, Carnegie Mellon University,

More information

An Approach to Physics Based Surrogate Model Development for Application with IDPSA

An Approach to Physics Based Surrogate Model Development for Application with IDPSA An Approah to Physis Based Surrogate Model Development for Appliation with IDPSA Ignas Mikus a*, Kaspar Kööp a, Marti Jeltsov a, Yuri Vorobyev b, Walter Villanueva a, and Pavel Kudinov a a Royal Institute

More information

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction University of Wollongong Researh Online Faulty of Informatis - apers (Arhive) Faulty of Engineering and Information Sienes 7 Time delay estimation of reverberant meeting speeh: on the use of multihannel

More information

Cluster Centric Fuzzy Modeling

Cluster Centric Fuzzy Modeling 10.1109/TFUZZ.014.300134, IEEE Transations on Fuzzy Systems TFS-013-0379.R1 1 Cluster Centri Fuzzy Modeling Witold Pedryz, Fellow, IEEE, and Hesam Izakian, Student Member, IEEE Abstrat In this study, we

More information

A Fast Kernel-based Multilevel Algorithm for Graph Clustering

A Fast Kernel-based Multilevel Algorithm for Graph Clustering A Fast Kernel-based Multilevel Algorithm for Graph Clustering Inderjit Dhillon Dept. of Computer Sienes University of Texas at Austin Austin, TX 78712 inderjit@s.utexas.edu Yuqiang Guan Dept. of Computer

More information

AUTOMATIC TRACKING OF MOVING OBJECT WITH EYE-IN- HAND ROBOT MANIPULATOR

AUTOMATIC TRACKING OF MOVING OBJECT WITH EYE-IN- HAND ROBOT MANIPULATOR AUTOATIC TRACKING OF OVING OBJECT WITH EYE-IN- HAND ROBOT ANIPULATOR 1 JIN-SIANG SHAW, WEN-LIN CHI Institute of ehatroni Engineering, Taipei Teh, Taipei, Taiwan Email: 1 jshaw@ntut.edu.tw, ton455448@gmail.om

More information

Improved Real-Time Shadow Mapping for CAD Models

Improved Real-Time Shadow Mapping for CAD Models Improved Real-Time Shado Mapping for CAD Models Vitor Barata R. B. Barroso TeGraf, Computer Siene Dept., PUC-Rio vbarata@tegraf.pu-rio.br Waldemar Celes TeGraf, Computer Siene Dept., PUC-Rio eles@tegraf.pu-rio.br

More information

Z Combinatorial Filters: Sensor Beams, Obstacles, and Possible Paths

Z Combinatorial Filters: Sensor Beams, Obstacles, and Possible Paths Z Combinatorial Filters: Sensor Beams, Obstales, and Possible Paths BENJAMIN TOVAR, Northwestern University FRED COHEN, University of Rohester LEONARDO BOBADILLA, University of Illinois JUSTIN CZARNOWSKI,

More information

3D Model Based Pose Estimation For Omnidirectional Stereovision

3D Model Based Pose Estimation For Omnidirectional Stereovision 3D Model Based Pose Estimation For Omnidiretional Stereovision Guillaume Caron, Eri Marhand and El Mustapha Mouaddib Abstrat Robot vision has a lot to win as well with wide field of view indued by atadioptri

More information