Multi-modal Clustering for Multimedia Collections

Size: px
Start display at page:

Download "Multi-modal Clustering for Multimedia Collections"

Transcription

1 Multi-modal Clustering for Multimedia Colletions Ron Bekkerman and Jiwoon Jeon Center for Intelligent Information Retrieval University of Massahusetts at Amherst, USA {ronb Abstrat Most of the online multimedia olletions, suh as piture galleries or video arhives, are ategorized in a fully manual proess, whih is very expensive and may soon be infeasible with the rapid growth of multimedia repositories. In this paper, we present an effetive method for automating this proess within the unsupervised learning framework. We exploit the truly multi-modal nature of multimedia olletions they have multiple views, or modalities, eah of whih ontributes its own perspetive to the olletion s organization. For example, in piture galleries, image aptions are often provided that form a separate view on the olletion. Color histograms (or any other set of global features) form another view. Additional views are blobs, interest points and other sets of loal features. Our model, alled Comraf* (pronouned Comraf-Star), effiiently inorporates various views in multi-modal lustering, by whih it allows great modeling flexibility. Comraf* is a light-weight version of the reently introdued ombinatorial Markov random field (Comraf). We show how to translate an arbitrary Comraf into a series of Comraf* models, and give an empirial evidene for omparable effetiveness of the two. Comraf* demonstrates exellent results on two real-world image galleries: it obtains times higher auray ompared with a uni-modal k-means. 1. Introdution Clustering is the ore omponent in many data management systems. Reent explosion of multimedia information on the WWW demands new lustering tehniques that an seamlessly handle omplex strutures of multimedia data. Multimedia information is intrinsially multi-modal. The word modality an be interpreted in many different ways in this paper, modality means the type of input. For example, image aptions and olor histograms are different types of input to an image proessing system and therefore we onsider them as two separate modalities. Eah modality gives a different aspet of the data, and eah modality has its own dependeny relationship with other modalities. For example, image aptions tend to desribe events aptured on the image, while annotations usually list salient objets in the image. Color histograms and texture features onvey visual information to the system. In this paper, we develop an effetive multimedia information lustering system based on the reently introdued ombinatorial Markov random field (Comraf) [4]. Comraf is a speial type of a Markov random field (MRF), designed to be a onvenient framework for multi-modal learning in general, and for multi-modal lustering in partiular. In Comrafs, eah data modality is represented by a single node, orresponding to a random variable of rih struture (alled a ombinatorial random variable); undireted edges are drawn between modalities that stay in a statistial interation with eah other. We fous on multi-modal lustering of image olletions, when multiple views on the olletion are available. Image lustering an be a useful omponent in a retrieval system [7], it an also be a stand-alone appliation, for example, for onstruting semanti groups of image retrieval results [19], or for browsing image olletions [1]. Being fully unsupervised, existing lustering methods often demonstrate poor performane. We show that by employing the multi-modal learning paradigm we an signifiantly improve lustering results. Multiple modalities are a heap form of supervision: while it is expensive to reate a large dataset, eah element of whih is labeled with its semanti ategory, it is usually straightforward to obtain another, orthogonal type of labels, over the data modalities. The idea of lustering images using both low-level image features and surrounding text (i.e. grouping together visually similar and semantially related images) has attrated the lose attention of researh ommunity. Barnard et al. [1] propose a generative hierarhial model for image lustering, in whih every node generates words and blobs based on the given probability distributions for that node. Higher level nodes generate more general terms and lower level nodes generate more speifi terms. The EM algorithm is used to fit the model. This approah an handle /07/$ IEEE

2 only two feature types (words, blobs); to handle more types, the model and the learning proedure must be revised. Cai et al. [6] luster Web image searh results using visual, textual and link analysis. They extrat text relevant to the image using a vision-based page segmentation algorithm. First, only text and hyperlink data is used to luster images. The resulting lusters are lustered again using low-level image features. Loeff et al. [16] apply a similar approah: they alulate a histogram of gradient magnitude of the pixel values from every interest point and then luster images using these loal features with global olor histograms and surrounding text. Both Cai et al. and Loeff et al. use spetral lustering methods, where the affinity sores for every pair of images and every modality must be alulated, resulting in an unrealistially heavy representation. Bipartite spetral graph partitioning [8] is useful for olustering two modalities suh as douments and words. Gao et al. [11] extend this method to handle one more modality. In their tripartite graph model, nodes are arranged in three layers: words, images and image features. To handle more modalities, Gao et al.[12] propose another method that is most losely related to our work: they organize modalities in a star struture of interrelationships, where a entral modality is onneted to all the others. They treat this problem as fusion of multiple pairwise o-lustering problems. Eah sub-problem is solved using the bipartite graph partitioning method. Our approah has a few advantages over the others. First, our method has no pratial limitation in the number of modalities as long as the pairwise interation data is available addition of a modality inreases the omputational omplexity only linearly. Seond, our model an luster multiple modalities while taking into aount other modalities, whih do not have to be lustered. Third, our information-theoreti lustering method does not rely on hard-to-obtain affinity matries of individual modalities. Instead, easily omputable ontingeny tables of interating modalities are used. Overall, our paper proposes a general framework for lustering multimedia olletions, whih an be straightforwardly applied to video data, sound traks, hypertext et. as well as to any of their ombinations. 2. Combinatorial Markov Random Fields Definition 1 A ombinatorial random variable X is a disrete random variable defined over a ombinatorial set (i.e. a set of all subsets, partitionings, partial orderings et. of a given finite set). In this paper, we fous on the task of hard lustering, or partitioning, of a given set. For this task, we define a ombinatorial random variable over all the possible partitionings of the set. Note that a ombinatorial random variable, while being an ordinary disrete random variable with a finite domain, has a unique property: in most real-world ases, the event spae of X is so large that the distribution P (X ) annot be expliitly speified. For example, onsider a disrete random variable X with three values {red, green, blue} seleted aording to a probability mass funtion { 1 2, 1 3, 1 6 }. A ombinatorial random variable X an then take five values: {{red, green, blue}}, {{red, green}, {blue}}, {{red, blue}, {green}}, {{green, blue}, {red}}, and {{red}, {green}, {blue}}. Assume that the underlying probability mass funtion of X is { 1 5, , 1 5, 1 5, }, whih is unknown to the pratitioner. If the task is to find the most probable partitioning, a long sampling proess should be applied. This proess will be infeasibly long if the event spae of X onsists of, say, one hundred values. For eah partitioning x, we find it useful (see Setion 2.2) to define a disrete random variable X over lusters in x. Continuing our example above, for x = {{red, blue}, {green}} the variable X takes two values: {red, blue} and {green}. Note that the probability of a luster is a sum of probabilities of its elements, so that the probability mass funtion of X here is { , 1 3 }. Definition 2 A ombinatorial Markov random field (Comraf) is an MRF, at least one node of whih is a ombinatorial random variable. In this paper we onsider only Comraf models, eah node of whih is a ombinatorial random variable. Beause of the uniqueness of ombinatorial random variables, it is impossible to apply existing MRF inferene tehniques to Comrafs, therefore a speially designed inferene method is used, whih is based on ombinatorial optimization. One of the plausible harateristis of Comraf models is ompatness: a useful Comraf model an onsist of just a handful of nodes. Compatness makes the model analysis muh easier; also, hoosing the best Comraf model (for a partiular problem) is a manageable task, as the number of possible variations is relatively small. In our future work, we will apply model learning, whih is feasible in Comrafs Comraf* model for multi-modal lustering Multi-modal (hard) lustering is a problem of simultaneously onstruting m partitionings of m data modalities, e.g. of images, their olors, their interest points, words in their aptions et. By lustering modalities simultaneously, one would overome statistial sparseness of the data representation, leading to a dense, smooth joint distribution of the modalities, whih would result in potentially more aurate lusterings than the ones obtained separately. Note that in most real-world ases, a pratitioner is interested in lustering only one modality (images, in our ase), whih we all a target modality. This implies that not every

3 modality has to be lustered: if a representation of a modality is dense enough, lustering it may ause an underestimation of the joint (an effet known as oversmoothing), whih may hurt lustering results of the target modality. For example, if images are distributed over 256 olors, it makes no sense to simultaneously luster images and olors beause the distributions are already dense enough. In this paper, we present a speial ase of Comraf models, in whih only the target modality is lustered, while the representations of all the other modalities are assumed to be dense enough. Eah unlustered modality is represented by an observed ombinatorial random variable. An observed random variable is a variable whose value is preset and fixed (traditionally, suh a variable is shaded on an MRF graph). Reall that a ombinatorial random variable is defined over all the possible lusterings of a given set. In ase of unlustered modalities, the observed value of a orresponding ombinatorial random variable is a lustering of all singleton lusters. For example, given a set {red, green, blue}, the observed value of a orresponding ombinatorial random variable is {{red}, {green}, {blue}}. Eah observed ombinatorial random variable of an unlustered modality is onneted by an edge with a hidden ombinatorial random variable of the target modality. Observed nodes are not onneted to eah other beause they are statistially independent by definition. Hene, the resulting topology of the Comraf model is an asterisk with the target modality in the enter. We all suh a model Comraf*. Examples of Comraf* graphs are given in Figure 1. Despite that only one modality is lustered in Comraf*, it is still a model for multi-modal lustering, as multiple modalities are involved in the lustering proess. The general Comraf model, however, takes are of any number of dense and sparse random variables. In Setion 3.2 we present a Comraf model for simultaneously lustering images and their loal features, while inorporating other (unlustered) modalities. Sine the simultaneous lustering an be omputationally hard, we show how to redue the omputational burden by translating suh a Comraf model into a number of Comraf* models, eah of whih is then optimized sequentially Inferene in Comraf and Comraf* Given a Comraf model over m ombinatorial random variables X = {X0,X 1,...,X m 1}, where X0 orresponds to the target modality, the task is to find the most probable instantiation of X (this task is ommonly referred to as the Most Probable Explanation, ormpe):x MPE = arg max x P (x ). Aording to the Hammersley-Clifford theorem [5], the joint distribution P (x ) is a Gibbs distribution: P (x )= 1 Z f exp j f j(x ), where f j (x ) are arbitrary potential funtions defined over liques in the Comraf graph, and Z f is a normalization fator alled a partition funtion. Let us onsider only the smallest liques, i.e. edges E. If the potential funtions are preset and fixed for eah edge, then the partition funtion beomes a onstant and thus the MPE problem is solved with: x MPE = arg max x e ij E f ij (x i,x j). (1) The simpliity of this model allows to hoose omplex, theoretially justified potential funtions. Here we adopt the idea presented in [3] in a similar setting: as a potential funtion f j, we hoose (weighted) mutual information between variables X i and X j, whih are defined over x i and x j respetively (see the beginning of Setion 2). Thus, our objetive funtion is: (x 0) MPE = arg max x 0 e ij E w ij I( X i ; X j ), (2) subjet to X i = k i and X j = k j. Mutual information between a lustering and another, interating random variable has a long history of being used in various unsupervised settings, starting with the Information Bottlenek [18], and inluding image lustering [13]. For a short review, see [4]. Weights w ij are by default set to 1; non-unity weights an be used to bring widely ranging mutual information terms to the same sale (see an example in Setion 3.2). In Comraf*, where all the edges are attahed to X0 and all the leaves are observed ombinatorial random variables, Equation (1) is transformed into: (x 0) MPE = m 1 m 1 = arg max f j (x 0,x j) = arg max w j I( X 0 ; X j ), x 0 x 0 j=1 j=1 (3) sine X j = X j for the unlustered modalities Comraf* optimization proedure To ompute the weighted sum of pairwise mutual information from Equation (3), the following proedure is used. The input of the proedure is an (empirial) joint distribution P (X 0,X j ) of the underlying data of eah interating pair (X0,X j ). For a given partitioning x 0, the distribution P ( X 0,X j ) is omputed using the umulative rule P ( x 0 ; x j ) = x 0 x 0 P (x 0,x j ). Marginals P ( X 0 ) and P (X j ) are obtained through the marginalization P ( x 0 )= x j P ( x 0,x j ) and P (x j ) = x 0 P ( x 0,x j ). Now we have all the ingredients to alulate the mutual information: I( X 0 ; X j )= x P ( x 0,x j ) log P ( x 0,x j ) P ( x 0 )P (x j ). 0,x j In most real-world ases, it is infeasible to find the global maximum in Equation (3) beause the number of possible

4 G W C G W B C G W C B G? W (a) (b) () (d) Figure 1. Comraf* models: (a) for images G and words W from their aptions; (b) for images, words and olors C ; () for images, words, olors and blobs B ; (d) straightforward generalization to any number of modalities. partitionings x 0 is exponentially large in the size of the X 0 s domain. We apply a loal optimization proedure, in whih we start with some lustering and greedily optimize the objetive while exploring a loal neighborhood of the initial onfiguration. The loal searh is performed by iteratively transferring eah value of X 0 from its urrent luster into a luster suh that the objetive is maximized. Note that we an apply Equation (3) only subjet to X 0 = k 0, i.e. only when the number of lusters is fixed, otherwise suh an optimization an result in a degenerative lustering of all singleton lusters. This does not imply, however, that hierarhial lustering is not appliable to our ase. Clusters an be arbitrarily merged or split, after whih the number of lusters should be fixed and then it is safe to optimize Equation (3). When the greedy optimization onverges to a loal maximum, we may again merge or split lusters, after whih we proeed with the next iteration of the optimization routine et. Thus, we an employ both top-down and bottom-up lustering proedures. In the top-down proedure, we start with one luster that ontains all the values of X 0 and split it until the required number of lusters is obtained (while interleaving with the optimization routine). In bottom-up lustering, we start with all singleton lusters and merge them until, again, reahing the required number of lusters. Computational omplexity of the top-down algorithm is O(l X 0 m 1 j=1 X j ), and of the bottom-up algorithm O(l X 0 2 m 1 j=1 X j ), where l is a (fixed) number of lustering iterations. Note that an arbitrary number of leaves (unlustered modalities) an be inorporated into the Comraf* model, while adding new modalities inreases the omplexity only linearly. 3. Modalities In this work, along with images, we onsider three other modalities. The first one is words from image aptions. We remove stopwords and apply simple s -stemming (removal of plural suffixes). A joint probability of an image g and awordw is P (g, w) = Nw g W, where N w g is the number of ourrenes of w in g s aption, W is the total number of words. Another modality is olors appearing in images. The joint probability distribution of olors and images is obtained from olor histograms, as a number of pixels of olor in image g divided by the total number of pixels in all images. The third modality is blobs, as desribed below Retangular blobs Blobs (or visual terms) are a speial type of image ontent representation based on a fixed voabulary. To generate blobs, images are first segmented into regions, whih are then lustered aross all images. Blobs are the resulting region lusters. Eah image is mapped onto the set of blobs whih leads to in a representation analogous to the bag-of-words (BOW) in text proessing. Barnard and Forsyth [2] and Duygulu et al. [9] segment images into semantially oherent regions using Blobworld and Normalized-Cuts algorithms. Unfortunately, these algorithms do not always produe segmentations aurate enough for further use. Jeon and Manmatha [15] and Feng et al. [10] use a retangular grid to segment images and report better results on an image retrieval task. We apply the same set of blobs as in [10], built using the following proedure. Images are first segmented to regions using a 6 by 4 grid. Then, for eah region, a feature vetor is onstruted that ontains texture and olor information: Gabor texture filters with 4 orientations and 3 sales are used to onstrut 12 dimensional texture features; the mean, standard deviation and skewness of RGB and LAB omponents are omputed to build 18 dimensional olor features. The resulting 30 dimensional feature vetors are lustered using k-means Blobs onstruted by Comraf models As disussed in Setion 3.1 above, a lustering proess is involved in onstruting blobs from retangular regions, represented by olor and texture features. Naturally, sine Comrafs are models for multi-modal lustering, an intrinsi Comraf model an be used for simultaneously lustering images and their regions. Co-lustering of images and features has been reently desribed in literature [17], however, Comrafs have an additional power over o-lustering methods: Comrafs an inorporate multiple modalities, both sparse (that are to be lustered) and dense (that are not). Figure 2 (left) shows a Comraf model for lustering images G simultaneously with their regions R, taking into aount olor C and texture T information of the regions, as well as the olors and aption words W of the images. Obviously, more edges and nodes an be added to the model, depending on the data availability. In Setion 2.3 we mentioned that the input of a Comraf inferene proedure is a set of pairwise probability tables P (X i,x j ) for eah edge in the Comraf graph. An interesting ase is the (G,R ) edge between image and region

5 R T C G W 1. R T C 2. R C G W Figure 2. (left) A Comraf model for simultaneously lustering images G and their retangular regions R, while taking into aount words W from image aptions, olors C and texture data T ; (right) a translation of this model into a two-step Comraf*: the first Comraf* is for lustering regions into blobs, whereas the seond Comraf* is for lustering images based on these blobs. ombinatorial random variables in Figure 2 (left). Unlike olors and aption words, eah region is unique, so for eah region r and eah image g, their joint probability is: P (r, g) = { 1 R, if r g 0, otherwise, where R is the total number of regions in the dataset. Suh a probability mass funtion is useless for lustering regions, beause only regions that belong to the same image an be lustered together. A possible way to resolve this problem would be to estimate this probability by giving a portion of its mass to P (r, g) even if r/ g. Suh an estimation an be made based on omputing similarities between regions of various images, whih is omputationally hard: O( R 2 r ), where r is the size of any region. Comrafs offer an elegant solution to this problem: sine regions are lustered not only based on images, but also based on olors and texture, neither of whih has this problem, we still an use Equation (4). As long as images are lustered in parallel with regions, Equation (4) allows grouping together regions that belong to the same image luster, as desired. Therefore, we apply the Comraf model from Figure 2 (left) as it is. We hoose to luster images bottom-up and regions top-down. In our objetive funtion (2), we ope with the fat that I( R; G) is two orders of magnitude larger than I( R; C) and I( R; T ), by setting the weights of the latter two terms to 100. The simultaneous lustering of images and regions is a time onsuming proess: its omplexity is O( R G ( C + T + W )). We propose a light-weight version of this model, in whih inferene is done in two steps: first, regions are lustered based on their olor and texture features, and then images are lustered based on olors, aption words and region lusters. Suh a model is equivalent to a set of two Comraf* models presented in Figure 2 (right). This model s omplexity is plausible: O( R ( C + T )+ G ( C + R + W )), where R is the number of region lusters. Generalizing this setting, it is easy to see that any Comraf an be translated into a series of Comraf* models. (4) Category # of images Category # of images Birds 152 Christianity 191 Desert 172 Islam 96 Flowers 165 Judaism 187 Trees 190 Personalities 188 Food 187 Symbols 130 Housing 165 OVERALL: 1823 Table 1. Categories (and their sizes) of the IsraelImages dataset. 4. Experimentation We experiment with a variety of partiular Comraf* models (see examples in Figure 1), as well as with the general Comraf models from Figure 2. The experiments are onduted using our open-soure Comraf lustering tool. 1 In all our models, images are lustered agglomeratively. All our results are averaged over 10 independent runs, with the standard error reported. As a baseline, we use the k-means algorithm (SimpleKMeans implementation of WEKA 2 ), where images are represented as BOW of their aptions. Also, our 2-node Comraf* model is equivalent to the hardlustering version of Information Bottlenek (IB) [18] (see [4] for disussion), hene we use it as our baseline as well. For evaluation of our lustering results, we use image lustering auray. LetC be the set of ground truth ategories. For eah image luster g, letµ C ( g) be the maximal number of elements of g that belong to one ategory. Then, the preision of g with respet to C, is defined as Pre( g, C) = µ C ( g)/ g. The miro-averaged preision of the entire lustering g is: Pre(g, C) = g µ C( g)/ g g, whih is the portion of douments that belong to the dominant ategories. For all our experiments, we fix the number of lusters to be equal to the number of ategories, thus Pre(g, C) equals lustering auray Datasets We demonstrate the performane of our lustering methods on two datasets: a subset of the benhmark Corel dataset and a new multimedia dataset, whih we refer to as IsraelImages, olleted by us espeially for this work. The Corel subset 3 has already been used in various previous researh projets [9, 14, 10]. The dataset onsists of 5,000 images from 50 Corel Stok Photo CDs. Eah CD ontains 100 images on the same topi, suh as Sunrises and Sunsets, Mountains of Ameria and Wild Animals. Every image has a aption and an annotation. The aption is a brief desription of the sene and the annotation is a list of objets that appear in the image. An example of an image aption is Man And Boy Fishing Mountain 1 omraf.soureforge.net 2 s.waikato.a.nz/ml/weka 3 kobus.a/researh/data/ev_2002

6 Method Auray k-means: images over aption words 22.0% IB: images/aption words 44.2 ± 1.0% IB: images/olors 24.4 ± 0.2% Comraf*: images/words/olors 54.2 ± 0.9% Two-step Comraf*: Figure 2 (right) 69.0 ± 0.6% General Comraf: Figure 2 (left) 68.6 ± 1.0% Table 2. Clustering results on the IsraelImages dataset. All IB/Comraf results are averaged over 10 independent runs with the standard error of the mean reported after the ± sign. Lake, while Tree People Mountain Water is an annotation for this image. Overall 371 words are used to annotate the olletion. The original dataset has 4,500 training images and 500 test images. Sine our model does not require training, we use 4,500 training images for our experiments and save the remaining 500 images for future use. The seond dataset onsists of 1823 images downloaded from IsraelImages.om. The images reflet main aspets of Israel senery/soiety and are grouped into 11 ategories (see Table 1). Eah image is 375 by 250 pixels and has a 1 to 18 words long aption. This dataset is available to the researh ommunity Results and disussion Our results on the IsraelImages dataset are reported in Table 2. Adding the olor modality to the aption BOW improves the lustering result by 10% (on an absolute sale), whereas adding the regions (in a 2-step Comraf* sheme) leads to an additional 15% improvement. These findings demonstrate the value of multi-modal setting in image lustering. The general Comraf model from Figure 2 (left) is not able to outperform the 2-step Comraf*. This is probably due to the fat that olor and texture information is more important for lustering regions than the orrespondene between regions and image lusters. We also experiment with various levels of olor granularity in a 3-node Comraf* setting (from Figure 1b) the results are presented in Figure 3 (left). As an be seen, if the olor information is detailed enough (above 216 olors), the differene in the results is statistially insignifiant. Figure 3 (enter) shows the results of the 2-step Comraf* over various numbers of olors for lustering regions. Generally, less olors are needed for lustering regions than for lustering images: 216 olors appear to be too many. A summary of our results on the Corel dataset is presented in Table 3. It shows surprisingly similar trends as for IsraelImages. On a 3-node setup with aption words and blobs we obtain 59.4% auray, whih is espeially impressive given that a random assignment of images into ronb/image_lustering.html Method Auray k-means: images over aption words 22.0% IB: images/aption words 46.6 ± 0.5% IB: images/olors 22.5 ± 0.2% IB: images/blobs (see Setion 3.1) 24.7 ± 0.3% Comraf*: images/words/olors 55.3 ± 0.5% Comraf*: images/words/blobs 59.4 ± 0.5% Comraf*: images/words/olors/blobs 60.1 ± 0.3% Two-step Comraf*: Figure 2 (right) 61.2 ± 0.4% IB: images/annotation words 58.6 ± 0.3% Table 3. Clustering results on the Corel dataset. All IB/Comraf results are averaged over 10 independent runs with the standard error of the mean reported after the ± sign. lusters would lead to 2% auray (our result is 30 times above random). Adding the olor modality improves this result only insignifiantly (as expeted, sine blobs already inorporate the olor information, among with texture). The suess of 3-node and 4-node Comraf* lustering models is also supported by the fat that they outperform a 2-node supervised lustering model, in whih images are lustered with respet to their annotations assigned by human experts. The 2-step Comraf* shows some further (minor) improvement over the 1-step Comraf* models. Here, in ontrast to IsraelImages, 8 olors are enough for lustering regions, and adding more olors auses a signifiant drop in the performane. We suspet that the Corel dataset is too simple : it ontains many images that are almost idential to eah other, therefore more advaned lustering models lead to no (or minor) gain over the simpler ones. Analogously to our IsraelImages experiment with various sizes of olor sets, we test various numbers of blobs on Corel. In previous work [9, 14], the number of blobs is set to 500, to (roughly) orrespond to the number of annotation keywords. Here we show that 500 blobs are not enough for lustering: when moving from 1000 to 2000 blobs, a signifiant boost in the system s performane an be seen. Figures 4 and 5 are illustrations of the quality of multimodal setup: unrelated groups of images are mixed together when the lustering is based only on aption words, whereas they are niely separated when a visual modality is added. 5. Conlusion In this paper, we have introdued the powerful Comraf framework to lustering multimedia olletions. We have also proposed a family of lightweight Comraf models alled Comraf*, whih demonstrate exellent performane on lustering two real-world data olletions. To further improve the results, a semi-supervised Comraf setting [4]may be used, in whih a few labeled examples are taken into aount in the lustering proess. We plan to experiment with this setting in our future work.

7 Clustering auray on IsraelImages images/words images/words/olors Clustering auray on IsraelImages Clustering auray on Corel images/words images/words/blobs auray auray images/words 2 step Comraf* auray number of olors for image lustering number of olors for region lustering number of blobs Figure 3. Experimentation with various numbers of: (left) olors on IsraelImages in a 3-node images/words/olors Comraf*; (enter) olors for lustering regions in the 2-step Comraf* on IsraelImages; (right) blobs on Corel in a 3-node images/words/blobs Comraf*. Our baseline is the 2-node images/words lustering result. Left and right graphs show the same trend: after reahing a ertain number of olors (256) or blobs (2000), the results vary only insignifiantly. The entral graph, however, shows that too many olors for lustering regions an hurt. Designing general Comraf models for image lustering (in flavor of the model shown in Figure 2 left) is an ongoing proess. Various setups should still be tested, various modalities should be inorporated. For example, interest points of images is an important modality not to be ignored. Extensive experimentation will lead to disovering the optimal Comraf setting for lustering multimedia olletions. Aknowledgements This work was supported in part by the Center for Intelligent Information Retrieval and in part by the Defense Advaned Researh Projets Ageny (DARPA) under ontrat number HR C Referenes [1] K. Barnard, P. Duygulu, and D. A. Forsyth. Clustering art. In Proeedings of CVPR, pages , [2] K. Barnard and D. A. Forsyth. Learning the semantis of words and pitures. In Proeedings of ICCV-8, pages , [3] R. Bekkerman, R. El-Yaniv, and A. MCallum. Multi-way distributional lustering via pairwise interations. In Proeedings of ICML-22, pages 41 48, [4] R. Bekkerman, M. Sahami, and E. Learned-Miller. Combinatorial Markov Random Fields. In Proeedings of ECML- 17, , 3, 5, 6 [5] J. Besag. Spatial interation and statistial analysis of lattie systems. Journal of the Royal Statistial Soiety, 36(2): , [6] D. Cai, X. He, Z. Li, W.-Y. Ma, and J.-R. Wen. Hierarhial lustering of www image searh results using visual, textual and link information. In Proeedings of the 12th international onferene on Multimedia, pages , [7] Y. Chen, J. Z. Wang, and R. Krovetz. Clue: Cluster-based retrieval of images by unsupervised learning. IEEE Transations on Image Proessing, 14(8): , [8] I. S. Dhillon. Co-lustering douments and words using bipartite spetral graph partitioning. In Proeedings of SIGKDD-7, pages , [9] P. Duygulu, K. Barnard, N. de Freitas, and D. Forsyth. Objet reognition as mahine translation: Learning a lexion for a fixed image voabulary. In Proeedings of ECCV-7, , 5, 6 [10] S. L. Feng, R. Manmatha, and V. Lavrenko. Multiple bernoulli relevane models for image and video annotation. In Proeedings of CVPR, pages , , 5 [11] B. Gao, T.-Y. Liu, T. Qin, X. Zheng, Q.-S. Cheng, and W.-Y. Ma. Web image lustering by onsistent utilization of visual features and surrounding texts. In Proeedings of the 13th international onferene on Multimedia, [12] B. Gao, T.-Y. Liu, X. Zheng, Q.-S. Cheng, and W.-Y. Ma. Consistent bipartite graph o-partitioning for star-strutured high-order heterogeneous data o-lustering. In Proeeding of SIGKDD-11, pages 41 50, [13] J. Goldberger, S. Gordon, and H. Greenspan. Unsupervised image-set lustering using an information theoreti framework. Transations on Image Proessing, 15, [14] J. Jeon, V. Lavrenko, and R. Manmatha. Automati image annotation and retrieval using ross-media relevane models. In Proeedings of SIGIR-26, pages , , 6 [15] J. Jeon and R. Manmatha. Using maximum entropy for automati image annotation. In Proeedings of the 5th International Conferene on Image and Video Retrieval, pages 24 32, [16] N. Loeff, C. O. Alm, and D. A. Forsyth. Disriminating image senses by lustering with multimodal features. In Proeedings of COLING/ACL, pages , [17] G. Qiu. Image and feature o-lustering. In Proeedings of ICPR-17, pages , [18] N. Tishby, F. Pereira, and W. Bialek. The information bottlenek method, Invited paper to the 37th Allerton Conferene on Communiation, Control, and Computing. 3, 5 [19] Y. Uematsu, R. Kataoka, and H. Takeno. Clustering presentation of web image retrieval results using textual information and image features. In Proeedings of the 24th IASTED international onferene on Internet and multimedia systems and appliations, pages ,

8 (a) Clustering results using only aption words, Corel dataset (b) Clustering results using words and blobs, Corel dataset Figure 4. Corel dataset. The first row shows lustering results using only words. Swimmers and swimming tigers are lustered together beause they share ommon terms like water and swim. The seond and the third rows show lustering results using both words and blobs. The swimmers and the swimming tigers are now in two different lusters with other similar images. (a) Clustering results using only aption words, IsraelImages dataset (b) Clustering results using words and olor histograms, IsraelImages dataset Figure 5. IsraelImages dataset. People portraits and pitures of the menorah monument are lustered together using aption words beause they have a word Knesset (the Israeli parliament) in ommon: the individuals are Knesset members, while the menorah monument is plaed in front of the Knesset building. The problem is resolved after the olor modality is added.

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and

More information

Cluster-Based Cumulative Ensembles

Cluster-Based Cumulative Ensembles Cluster-Based Cumulative Ensembles Hanan G. Ayad and Mohamed S. Kamel Pattern Analysis and Mahine Intelligene Lab, Eletrial and Computer Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1,

More information

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION Ken Sauer and Charles A. Bouman Department of Eletrial Engineering, University of Notre Dame Notre Dame, IN 46556, (219) 631-6999 Shool of

More information

Exploiting Enriched Contextual Information for Mobile App Classification

Exploiting Enriched Contextual Information for Mobile App Classification Exploiting Enrihed Contextual Information for Mobile App Classifiation Hengshu Zhu 1 Huanhuan Cao 2 Enhong Chen 1 Hui Xiong 3 Jilei Tian 2 1 University of Siene and Tehnology of China 2 Nokia Researh Center

More information

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines The Minimum Redundany Maximum Relevane Approah to Building Sparse Support Vetor Mahines Xiaoxing Yang, Ke Tang, and Xin Yao, Nature Inspired Computation and Appliations Laboratory (NICAL), Shool of Computer

More information

Discrete sequential models and CRFs. 1 Case Study: Supervised Part-of-Speech Tagging

Discrete sequential models and CRFs. 1 Case Study: Supervised Part-of-Speech Tagging 0-708: Probabilisti Graphial Models 0-708, Spring 204 Disrete sequential models and CRFs Leturer: Eri P. Xing Sribes: Pankesh Bamotra, Xuanhong Li Case Study: Supervised Part-of-Speeh Tagging The supervised

More information

Extracting Partition Statistics from Semistructured Data

Extracting Partition Statistics from Semistructured Data Extrating Partition Statistis from Semistrutured Data John N. Wilson Rihard Gourlay Robert Japp Mathias Neumüller Department of Computer and Information Sienes University of Strathlyde, Glasgow, UK {jnw,rsg,rpj,mathias}@is.strath.a.uk

More information

the data. Structured Principal Component Analysis (SPCA)

the data. Structured Principal Component Analysis (SPCA) Strutured Prinipal Component Analysis Kristin M. Branson and Sameer Agarwal Department of Computer Siene and Engineering University of California, San Diego La Jolla, CA 9193-114 Abstrat Many tasks involving

More information

Exploring the Commonality in Feature Modeling Notations

Exploring the Commonality in Feature Modeling Notations Exploring the Commonality in Feature Modeling Notations Miloslav ŠÍPKA Slovak University of Tehnology Faulty of Informatis and Information Tehnologies Ilkovičova 3, 842 16 Bratislava, Slovakia miloslav.sipka@gmail.om

More information

Boosted Random Forest

Boosted Random Forest Boosted Random Forest Yohei Mishina, Masamitsu suhiya and Hironobu Fujiyoshi Department of Computer Siene, Chubu University, 1200 Matsumoto-ho, Kasugai, Aihi, Japan {mishi, mtdoll}@vision.s.hubu.a.jp,

More information

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application

Performance of Histogram-Based Skin Colour Segmentation for Arms Detection in Human Motion Analysis Application World Aademy of Siene, Engineering and Tehnology 8 009 Performane of Histogram-Based Skin Colour Segmentation for Arms Detetion in Human Motion Analysis Appliation Rosalyn R. Porle, Ali Chekima, Farrah

More information

Evolutionary Feature Synthesis for Image Databases

Evolutionary Feature Synthesis for Image Databases Evolutionary Feature Synthesis for Image Databases Anlei Dong, Bir Bhanu, Yingqiang Lin Center for Researh in Intelligent Systems University of California, Riverside, California 92521, USA {adong, bhanu,

More information

A Novel Validity Index for Determination of the Optimal Number of Clusters

A Novel Validity Index for Determination of the Optimal Number of Clusters IEICE TRANS. INF. & SYST., VOL.E84 D, NO.2 FEBRUARY 2001 281 LETTER A Novel Validity Index for Determination of the Optimal Number of Clusters Do-Jong KIM, Yong-Woon PARK, and Dong-Jo PARK, Nonmembers

More information

Using Augmented Measurements to Improve the Convergence of ICP

Using Augmented Measurements to Improve the Convergence of ICP Using Augmented Measurements to Improve the onvergene of IP Jaopo Serafin, Giorgio Grisetti Dept. of omputer, ontrol and Management Engineering, Sapienza University of Rome, Via Ariosto 25, I-0085, Rome,

More information

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425) Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University

More information

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating Capturing Large Intra-lass Variations of Biometri Data by Template Co-updating Ajita Rattani University of Cagliari Piazza d'armi, Cagliari, Italy ajita.rattani@diee.unia.it Gian Lua Marialis University

More information

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8

Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introduction Information Retrieval... 8 Contents Contents...I List of Tables...VIII List of Figures...IX 1. Introdution... 1 1.1. Internet Information...2 1.2. Internet Information Retrieval...3 1.2.1. Doument Indexing...4 1.2.2. Doument Retrieval...4

More information

A scheme for racquet sports video analysis with the combination of audio-visual information

A scheme for racquet sports video analysis with the combination of audio-visual information A sheme for raquet sports video analysis with the ombination of audio-visual information Liyuan Xing a*, Qixiang Ye b, Weigang Zhang, Qingming Huang a and Hua Yu a a Graduate Shool of the Chinese Aadamy

More information

Graph-Based vs Depth-Based Data Representation for Multiview Images

Graph-Based vs Depth-Based Data Representation for Multiview Images Graph-Based vs Depth-Based Data Representation for Multiview Images Thomas Maugey, Antonio Ortega, Pasal Frossard Signal Proessing Laboratory (LTS), Eole Polytehnique Fédérale de Lausanne (EPFL) Email:

More information

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT?

3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? 3-D IMAGE MODELS AND COMPRESSION - SYNTHETIC HYBRID OR NATURAL FIT? Bernd Girod, Peter Eisert, Marus Magnor, Ekehard Steinbah, Thomas Wiegand Te {girod eommuniations Laboratory, University of Erlangen-Nuremberg

More information

Semi-Supervised Affinity Propagation with Instance-Level Constraints

Semi-Supervised Affinity Propagation with Instance-Level Constraints Semi-Supervised Affinity Propagation with Instane-Level Constraints Inmar E. Givoni, Brendan J. Frey Probabilisti and Statistial Inferene Group University of Toronto 10 King s College Road, Toronto, Ontario,

More information

Video Data and Sonar Data: Real World Data Fusion Example

Video Data and Sonar Data: Real World Data Fusion Example 14th International Conferene on Information Fusion Chiago, Illinois, USA, July 5-8, 2011 Video Data and Sonar Data: Real World Data Fusion Example David W. Krout Applied Physis Lab dkrout@apl.washington.edu

More information

8 : Learning Fully Observed Undirected Graphical Models

8 : Learning Fully Observed Undirected Graphical Models 10-708: Probabilisti Graphial Models 10-708, Spring 2018 8 : Learning Fully Observed Undireted Graphial Models Leturer: Kayhan Batmanghelih Sribes: Chenghui Zhou, Cheng Ran (Harvey) Zhang When learning

More information

Pipelined Multipliers for Reconfigurable Hardware

Pipelined Multipliers for Reconfigurable Hardware Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,

More information

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2 On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,

More information

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering A Novel Bit Level Time Series Representation with Impliation of Similarity Searh and lustering hotirat Ratanamahatana, Eamonn Keogh, Anthony J. Bagnall 2, and Stefano Lonardi Dept. of omputer Siene & Engineering,

More information

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1.

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1. Fuzzy Weighted Rank Ordered Mean (FWROM) Filters for Mixed Noise Suppression from Images S. Meher, G. Panda, B. Majhi 3, M.R. Meher 4,,4 Department of Eletronis and I.E., National Institute of Tehnology,

More information

CleanUp: Improving Quadrilateral Finite Element Meshes

CleanUp: Improving Quadrilateral Finite Element Meshes CleanUp: Improving Quadrilateral Finite Element Meshes Paul Kinney MD-10 ECC P.O. Box 203 Ford Motor Company Dearborn, MI. 8121 (313) 28-1228 pkinney@ford.om Abstrat: Unless an all quadrilateral (quad)

More information

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking Algorithms for External Memory Leture 6 Graph Algorithms - Weighted List Ranking Leturer: Nodari Sithinava Sribe: Andi Hellmund, Simon Ohsenreither 1 Introdution & Motivation After talking about I/O-effiient

More information

Contour Box: Rejecting Object Proposals Without Explicit Closed Contours

Contour Box: Rejecting Object Proposals Without Explicit Closed Contours Contour Box: Rejeting Objet Proposals Without Expliit Closed Contours Cewu Lu, Shu Liu Jiaya Jia Chi-Keung Tang The Hong Kong University of Siene and Tehnology Stanford University The Chinese University

More information

arxiv: v1 [cs.db] 13 Sep 2017

arxiv: v1 [cs.db] 13 Sep 2017 An effiient lustering algorithm from the measure of loal Gaussian distribution Yuan-Yen Tai (Dated: May 27, 2018) In this paper, I will introdue a fast and novel lustering algorithm based on Gaussian distribution

More information

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department

More information

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction University of Wollongong Researh Online Faulty of Informatis - apers (Arhive) Faulty of Engineering and Information Sienes 7 Time delay estimation of reverberant meeting speeh: on the use of multihannel

More information

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study

What are Cycle-Stealing Systems Good For? A Detailed Performance Model Case Study What are Cyle-Stealing Systems Good For? A Detailed Performane Model Case Study Wayne Kelly and Jiro Sumitomo Queensland University of Tehnology, Australia {w.kelly, j.sumitomo}@qut.edu.au Abstrat The

More information

Detecting Outliers in High-Dimensional Datasets with Mixed Attributes

Detecting Outliers in High-Dimensional Datasets with Mixed Attributes Deteting Outliers in High-Dimensional Datasets with Mixed Attributes A. Koufakou, M. Georgiopoulos, and G.C. Anagnostopoulos 2 Shool of EECS, University of Central Florida, Orlando, FL, USA 2 Dept. of

More information

Recommendation Subgraphs for Web Discovery

Recommendation Subgraphs for Web Discovery Reommation Subgraphs for Web Disovery Arda Antikaioglu Department of Mathematis Carnegie Mellon University aantika@andrew.mu.edu R. Ravi Tepper Shool of Business Carnegie Mellon University ravi@mu.edu

More information

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition A Coarse-to-Fine Classifiation Sheme for Faial Expression Reognition Xiaoyi Feng 1,, Abdenour Hadid 1 and Matti Pietikäinen 1 1 Mahine Vision Group Infoteh Oulu and Dept. of Eletrial and Information Engineering

More information

A {k, n}-secret Sharing Scheme for Color Images

A {k, n}-secret Sharing Scheme for Color Images A {k, n}-seret Sharing Sheme for Color Images Rastislav Luka, Konstantinos N. Plataniotis, and Anastasios N. Venetsanopoulos The Edward S. Rogers Sr. Dept. of Eletrial and Computer Engineering, University

More information

A Comprehensive Review of Overlapping Community Detection Algorithms for Social Networks

A Comprehensive Review of Overlapping Community Detection Algorithms for Social Networks International Journal of Engineering Researh and Appliations (IJERA) ISSN: 2248-9622 National Conferene on Advanes in Engineering and Tehnology (AET- 29th Marh 2014) RESEARCH ARTICLE OPEN ACCESS A Comprehensive

More information

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs?

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs? One Against One or One Against All : Whih One is Better for Handwriting Reognition with SVMs? Jonathan Milgram, Mohamed Cheriet, Robert Sabourin To ite this version: Jonathan Milgram, Mohamed Cheriet,

More information

Sparse Certificates for 2-Connectivity in Directed Graphs

Sparse Certificates for 2-Connectivity in Directed Graphs Sparse Certifiates for 2-Connetivity in Direted Graphs Loukas Georgiadis Giuseppe F. Italiano Aikaterini Karanasiou Charis Papadopoulos Nikos Parotsidis Abstrat Motivated by the emergene of large-sale

More information

Transition Detection Using Hilbert Transform and Texture Features

Transition Detection Using Hilbert Transform and Texture Features Amerian Journal of Signal Proessing 1, (): 35-4 DOI: 1.593/.asp.1.6 Transition Detetion Using Hilbert Transform and Texture Features G. G. Lashmi Priya *, S. Domni Department of Computer Appliations, National

More information

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advaned Researh in Computer Siene and Software Engineering Researh Paper Available online at: www.ijarsse.om A New-Fangled Algorithm

More information

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules Improved Vehile Classifiation in Long Traffi Video by Cooperating Traker and Classifier Modules Brendan Morris and Mohan Trivedi University of California, San Diego San Diego, CA 92093 {b1morris, trivedi}@usd.edu

More information

TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM

TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM TUMOR DETECTION IN MRI BRAIN IMAGE SEGMENTATION USING PHASE CONGRUENCY MODIFIED FUZZY C MEAN ALGORITHM M. Murugeswari 1, M.Gayathri 2 1 Assoiate Professor, 2 PG Sholar 1,2 K.L.N College of Information

More information

FOREGROUND OBJECT EXTRACTION USING FUZZY C MEANS WITH BIT-PLANE SLICING AND OPTICAL FLOW

FOREGROUND OBJECT EXTRACTION USING FUZZY C MEANS WITH BIT-PLANE SLICING AND OPTICAL FLOW FOREGROUND OBJECT EXTRACTION USING FUZZY C EANS WITH BIT-PLANE SLICING AND OPTICAL FLOW SIVAGAI., REVATHI.T, JEGANATHAN.L 3 APSG, SCSE, VIT University, Chennai, India JRF, DST, Dehi, India. 3 Professor,

More information

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index IJCSES International Journal of Computer Sienes and Engineering Systems, ol., No.4, Otober 2007 CSES International 2007 ISSN 0973-4406 253 An Optimized Approah on Applying Geneti Algorithm to Adaptive

More information

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Malaysian Journal of Computer Siene, Vol 10 No 1, June 1997, pp 36-41 A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Md Rafiqul Islam, Harihodin Selamat and Mohd Noor Md Sap Faulty of Computer Siene and

More information

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else 3rd International Conferene on Multimedia Tehnolog(ICMT 013) An Effiient Moving Target Traking Strateg Based on OpenCV and CAMShift Theor Dongu Li 1 Abstrat Image movement involved bakground movement and

More information

New Fuzzy Object Segmentation Algorithm for Video Sequences *

New Fuzzy Object Segmentation Algorithm for Video Sequences * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 521-537 (2008) New Fuzzy Obet Segmentation Algorithm for Video Sequenes * KUO-LIANG CHUNG, SHIH-WEI YU, HSUEH-JU YEH, YONG-HUAI HUANG AND TA-JEN YAO Department

More information

Approximate logic synthesis for error tolerant applications

Approximate logic synthesis for error tolerant applications Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu

More information

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks International Journal of Advanes in Computer Networks and Its Seurity IJCNS A Load-Balaned Clustering Protool for Hierarhial Wireless Sensor Networks Mehdi Tarhani, Yousef S. Kavian, Saman Siavoshi, Ali

More information

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks Unsupervised Stereosopi Video Objet Segmentation Based on Ative Contours and Retrainable Neural Networks KLIMIS NTALIANIS, ANASTASIOS DOULAMIS, and NIKOLAOS DOULAMIS National Tehnial University of Athens

More information

Gray Codes for Reflectable Languages

Gray Codes for Reflectable Languages Gray Codes for Refletable Languages Yue Li Joe Sawada Marh 8, 2008 Abstrat We lassify a type of language alled a refletable language. We then develop a generi algorithm that an be used to list all strings

More information

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION Cuiui Kang 1, Shengai Liao, Shiming Xiang 1, Chunhong Pan 1 1 National Laboratory of Pattern Reognition, Institute of Automation, Chinese

More information

Detection and Recognition of Non-Occluded Objects using Signature Map

Detection and Recognition of Non-Occluded Objects using Signature Map 6th WSEAS International Conferene on CIRCUITS, SYSTEMS, ELECTRONICS,CONTROL & SIGNAL PROCESSING, Cairo, Egypt, De 9-31, 007 65 Detetion and Reognition of Non-Oluded Objets using Signature Map Sangbum Park,

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. Improvement of low illumination image enhancement algorithm based on physical mode

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. Improvement of low illumination image enhancement algorithm based on physical mode [Type text] [Type text] [Type text] ISSN : 0974-7435 Volume 10 Issue 22 BioTehnology 2014 An Indian Journal FULL PAPER BTAIJ, 10(22), 2014 [13995-14001] Improvement of low illumination image enhanement

More information

FUZZY WATERSHED FOR IMAGE SEGMENTATION

FUZZY WATERSHED FOR IMAGE SEGMENTATION FUZZY WATERSHED FOR IMAGE SEGMENTATION Ramón Moreno, Manuel Graña Computational Intelligene Group, Universidad del País Vaso, Spain http://www.ehu.es/winto; {ramon.moreno,manuel.grana}@ehu.es Abstrat The

More information

Model Based Approach for Content Based Image Retrievals Based on Fusion and Relevancy Methodology

Model Based Approach for Content Based Image Retrievals Based on Fusion and Relevancy Methodology The International Arab Journal of Information Tehnology, Vol. 12, No. 6, November 15 519 Model Based Approah for Content Based Image Retrievals Based on Fusion and Relevany Methodology Telu Venkata Madhusudhanarao

More information

A Fast Kernel-based Multilevel Algorithm for Graph Clustering

A Fast Kernel-based Multilevel Algorithm for Graph Clustering A Fast Kernel-based Multilevel Algorithm for Graph Clustering Inderjit Dhillon Dept. of Computer Sienes University of Texas at Austin Austin, TX 78712 inderjit@s.utexas.edu Yuqiang Guan Dept. of Computer

More information

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality INTERNATIONAL CONFERENCE ON MANUFACTURING AUTOMATION (ICMA200) Multi-Piee Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality Stephen Stoyan, Yong Chen* Epstein Department of

More information

Hello neighbor: accurate object retrieval with k-reciprocal nearest neighbors

Hello neighbor: accurate object retrieval with k-reciprocal nearest neighbors Hello neighbor: aurate objet retrieval with -reiproal nearest neighbors Danfeng Qin Stephan Gammeter Luas Bossard Till Qua,2 Lu van Gool,3 ETH Zürih 2 Kooaba AG 3 K.U. Leuven {ind,gammeter,bossard,tua,vangool}@vision.ee.ethz.h

More information

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem Calulation of typial running time of a branh-and-bound algorithm for the vertex-over problem Joni Pajarinen, Joni.Pajarinen@iki.fi Otober 21, 2007 1 Introdution The vertex-over problem is one of a olletion

More information

mahines. HBSP enhanes the appliability of the BSP model by inorporating parameters that reet the relative speeds of the heterogeneous omputing omponen

mahines. HBSP enhanes the appliability of the BSP model by inorporating parameters that reet the relative speeds of the heterogeneous omputing omponen The Heterogeneous Bulk Synhronous Parallel Model Tiani L. Williams and Rebea J. Parsons Shool of Computer Siene University of Central Florida Orlando, FL 32816-2362 fwilliams,rebeag@s.uf.edu Abstrat. Trends

More information

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks Abouberine Ould Cheikhna Department of Computer Siene University of Piardie Jules Verne 80039 Amiens Frane Ould.heikhna.abouberine @u-piardie.fr

More information

HEXA: Compact Data Structures for Faster Packet Processing

HEXA: Compact Data Structures for Faster Packet Processing Washington University in St. Louis Washington University Open Sholarship All Computer Siene and Engineering Researh Computer Siene and Engineering Report Number: 27-26 27 HEXA: Compat Data Strutures for

More information

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks Flow Demands Oriented Node Plaement in Multi-Hop Wireless Networks Zimu Yuan Institute of Computing Tehnology, CAS, China {zimu.yuan}@gmail.om arxiv:153.8396v1 [s.ni] 29 Mar 215 Abstrat In multi-hop wireless

More information

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization Self-Adaptive Parent to Mean-Centri Reombination for Real-Parameter Optimization Kalyanmoy Deb and Himanshu Jain Department of Mehanial Engineering Indian Institute of Tehnology Kanpur Kanpur, PIN 86 {deb,hjain}@iitk.a.in

More information

Chromaticity-matched Superimposition of Foreground Objects in Different Environments

Chromaticity-matched Superimposition of Foreground Objects in Different Environments FCV216, the 22nd Korea-Japan Joint Workshop on Frontiers of Computer Vision Chromatiity-mathed Superimposition of Foreground Objets in Different Environments Yohei Ogura Graduate Shool of Siene and Tehnology

More information

Colouring contact graphs of squares and rectilinear polygons de Berg, M.T.; Markovic, A.; Woeginger, G.

Colouring contact graphs of squares and rectilinear polygons de Berg, M.T.; Markovic, A.; Woeginger, G. Colouring ontat graphs of squares and retilinear polygons de Berg, M.T.; Markovi, A.; Woeginger, G. Published in: nd European Workshop on Computational Geometry (EuroCG 06), 0 Marh - April, Lugano, Switzerland

More information

Visualization of patent analysis for emerging technology

Visualization of patent analysis for emerging technology Available online at www.sienediret.om Expert Systems with Appliations Expert Systems with Appliations 34 (28) 84 82 www.elsevier.om/loate/eswa Visualization of patent analysis for emerging tehnology Young

More information

A New RBFNDDA-KNN Network and Its Application to Medical Pattern Classification

A New RBFNDDA-KNN Network and Its Application to Medical Pattern Classification A New RBFNDDA-KNN Network and Its Appliation to Medial Pattern Classifiation Shing Chiang Tan 1*, Chee Peng Lim 2, Robert F. Harrison 3, R. Lee Kennedy 4 1 Faulty of Information Siene and Tehnology, Multimedia

More information

Cluster Centric Fuzzy Modeling

Cluster Centric Fuzzy Modeling 10.1109/TFUZZ.014.300134, IEEE Transations on Fuzzy Systems TFS-013-0379.R1 1 Cluster Centri Fuzzy Modeling Witold Pedryz, Fellow, IEEE, and Hesam Izakian, Student Member, IEEE Abstrat In this study, we

More information

An Efficient and Scalable Approach to CNN Queries in a Road Network

An Efficient and Scalable Approach to CNN Queries in a Road Network An Effiient and Salable Approah to CNN Queries in a Road Network Hyung-Ju Cho Chin-Wan Chung Dept. of Eletrial Engineering & Computer Siene Korea Advaned Institute of Siene and Tehnology 373- Kusong-dong,

More information

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications System-Level Parallelism and hroughput Optimization in Designing Reonfigurable Computing Appliations Esam El-Araby 1, Mohamed aher 1, Kris Gaj 2, arek El-Ghazawi 1, David Caliga 3, and Nikitas Alexandridis

More information

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry Deteting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry D. M. Zasada, P. K. Sanyal The MITRE Corp., 6 Eletroni Parkway, Rome, NY 134 (dmzasada, psanyal)@mitre.org

More information

An Edge-based Clustering Algorithm to Detect Social Circles in Ego Networks

An Edge-based Clustering Algorithm to Detect Social Circles in Ego Networks JOURNAL OF COMPUTERS, VOL. 8, NO., OCTOBER 23 2575 An Edge-based Clustering Algorithm to Detet Soial Cirles in Ego Networks Yu Wang Shool of Computer Siene and Tehnology, Xidian University Xi an,77, China

More information

Multivariate Texture-based Segmentation of Remotely Sensed. Imagery for Extraction of Objects and Their Uncertainty

Multivariate Texture-based Segmentation of Remotely Sensed. Imagery for Extraction of Objects and Their Uncertainty Multivariate Texture-based Segmentation of Remotely Sensed Imagery for Extration of Objets and Their Unertainty Arko Luieer*, Alfred Stein* & Peter Fisher** * International Institute for Geo-Information

More information

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks A Dual-Hamiltonian-Path-Based Multiasting Strategy for Wormhole-Routed Star Graph Interonnetion Networks Nen-Chung Wang Department of Information and Communiation Engineering Chaoyang University of Tehnology,

More information

Chapter 2: Introduction to Maple V

Chapter 2: Introduction to Maple V Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard

More information

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors Eurographis Symposium on Geometry Proessing (003) L. Kobbelt, P. Shröder, H. Hoppe (Editors) Rotation Invariant Spherial Harmoni Representation of 3D Shape Desriptors Mihael Kazhdan, Thomas Funkhouser,

More information

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints Smooth Trajetory Planning Along Bezier Curve for Mobile Robots with Veloity Constraints Gil Jin Yang and Byoung Wook Choi Department of Eletrial and Information Engineering Seoul National University of

More information

HIGHER ORDER full-wave three-dimensional (3-D) large-domain techniques in

HIGHER ORDER full-wave three-dimensional (3-D) large-domain techniques in FACTA UNIVERSITATIS (NIŠ) SER.: ELEC. ENERG. vol. 21, no. 2, August 2008, 209-220 Comparison of Higher Order FEM and MoM/SIE Approahes in Analyses of Closed- and Open-Region Eletromagneti Problems Milan

More information

Creating Adaptive Web Sites Through Usage-Based Clustering of URLs

Creating Adaptive Web Sites Through Usage-Based Clustering of URLs Creating Adaptive Web Sites Through Usage-Based Clustering of URLs Bamshad Mobasher Dept. of Computer Siene, DePaul University, Chiago, IL mobasher@s.depaul.edu Robert Cooley, Jaideep Srivastava Dept.

More information

Gradient based progressive probabilistic Hough transform

Gradient based progressive probabilistic Hough transform Gradient based progressive probabilisti Hough transform C.Galambos, J.Kittler and J.Matas Abstrat: The authors look at the benefits of exploiting gradient information to enhane the progressive probabilisti

More information

A RAY TRACING SIMULATION OF SOUND DIFFRACTION BASED ON ANALYTIC SECONDARY SOURCE MODEL

A RAY TRACING SIMULATION OF SOUND DIFFRACTION BASED ON ANALYTIC SECONDARY SOURCE MODEL 19th European Signal Proessing Conferene (EUSIPCO 211) Barelona, Spain, August 29 - September 2, 211 A RAY TRACING SIMULATION OF SOUND DIFFRACTION BASED ON ANALYTIC SECONDARY SOURCE MODEL Masashi Okada,

More information

Chemical, Biological and Radiological Hazard Assessment: A New Model of a Plume in a Complex Urban Environment

Chemical, Biological and Radiological Hazard Assessment: A New Model of a Plume in a Complex Urban Environment hemial, Biologial and Radiologial Haard Assessment: A New Model of a Plume in a omplex Urban Environment Skvortsov, A.T., P.D. Dawson, M.D. Roberts and R.M. Gailis HPP Division, Defene Siene and Tehnology

More information

Face and Facial Feature Tracking for Natural Human-Computer Interface

Face and Facial Feature Tracking for Natural Human-Computer Interface Fae and Faial Feature Traking for Natural Human-Computer Interfae Vladimir Vezhnevets Graphis & Media Laboratory, Dept. of Applied Mathematis and Computer Siene of Mosow State University Mosow, Russia

More information

Semantic Concept Detection Using Weighted Discretization Multiple Correspondence Analysis for Disaster Information Management

Semantic Concept Detection Using Weighted Discretization Multiple Correspondence Analysis for Disaster Information Management Semanti Conept Detetion Using Weighted Disretization Multiple Correspondene Analysis for Disaster Information Management Samira Pouyanfar and Shu-Ching Chen Shool of Computing and Information Sienes Florida

More information

ASSESSMENT OF TWO CHEAP CLOSE-RANGE FEATURE EXTRACTION SYSTEMS

ASSESSMENT OF TWO CHEAP CLOSE-RANGE FEATURE EXTRACTION SYSTEMS ASSESSMENT OF TWO CHEAP CLOSE-RANGE FEATURE EXTRACTION SYSTEMS Ahmed Elaksher a, Mohammed Elghazali b, Ashraf Sayed b, and Yasser Elmanadilli b a Shool of Civil Engineering, Purdue University, West Lafayette,

More information

Wide-baseline Multiple-view Correspondences

Wide-baseline Multiple-view Correspondences Wide-baseline Multiple-view Correspondenes Vittorio Ferrari, Tinne Tuytelaars, Lu Van Gool, Computer Vision Group (BIWI), ETH Zuerih, Switzerland ESAT-PSI, University of Leuven, Belgium {ferrari,vangool}@vision.ee.ethz.h,

More information

arxiv: v1 [cs.gr] 10 Apr 2015

arxiv: v1 [cs.gr] 10 Apr 2015 REAL-TIME TOOL FOR AFFINE TRANSFORMATIONS OF TWO DIMENSIONAL IFS FRACTALS ELENA HADZIEVA AND MARIJA SHUMINOSKA arxiv:1504.02744v1 s.gr 10 Apr 2015 Abstrat. This work introdues a novel tool for interative,

More information

Learning Discriminative and Shareable Features. Scene Classificsion

Learning Discriminative and Shareable Features. Scene Classificsion Learning Disriminative and Shareable Features for Sene Classifiation Zhen Zuo, Gang Wang, Bing Shuai, Lifan Zhao, Qingxiong Yang, and Xudong Jiang Nanyang Tehnologial University, Singapore, Advaned Digital

More information

Tackling IPv6 Address Scalability from the Root

Tackling IPv6 Address Scalability from the Root Takling IPv6 Address Salability from the Root Mei Wang Ashish Goel Balaji Prabhakar Stanford University {wmei, ashishg, balaji}@stanford.edu ABSTRACT Internet address alloation shemes have a huge impat

More information

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar Plot-to-trak orrelation in A-SMGCS using the target images from a Surfae Movement Radar G. Golino Radar & ehnology Division AMS, Italy ggolino@amsjv.it Abstrat he main topi of this paper is the formulation

More information

We P9 16 Eigenray Tracing in 3D Heterogeneous Media

We P9 16 Eigenray Tracing in 3D Heterogeneous Media We P9 Eigenray Traing in 3D Heterogeneous Media Z. Koren* (Emerson), I. Ravve (Emerson) Summary Conventional two-point ray traing in a general 3D heterogeneous medium is normally performed by a shooting

More information

WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION

WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION WIRELESS CAPSULE ENDOSCOPY IMAGES ENHANCEMENT BASED ON ADAPTIVE ANISOTROPIC DIFFUSION Lei Li 1, Y. X. ZOU 1* and Yi Li 1 ADSPLAB/ELIP, Shool of ECE, Peking Universit, Shenzhen 518055, China Shenzhen JiFu

More information

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS Proeedings of the COST G-6 Conferene on Digital Audio Effets (DAFX-), Verona, Italy, Deember 7-9, INTERPOLATED AND WARPED -D DIGITAL WAVEGUIDE MESH ALGORITHMS Vesa Välimäki Lab. of Aoustis and Audio Signal

More information

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT

DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT DETECTION METHOD FOR NETWORK PENETRATING BEHAVIOR BASED ON COMMUNICATION FINGERPRINT 1 ZHANGGUO TANG, 2 HUANZHOU LI, 3 MINGQUAN ZHONG, 4 JIAN ZHANG 1 Institute of Computer Network and Communiation Tehnology,

More information

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems

Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Methods for Multi-Dimensional Robustness Optimization in Complex Embedded Systems Arne Hamann, Razvan Rau, Rolf Ernst Institute of Computer and Communiation Network Engineering Tehnial University of Braunshweig,

More information