Adapting K-Medians to Generate Normalized Cluster Centers

Size: px
Start display at page:

Download "Adapting K-Medians to Generate Normalized Cluster Centers"

Transcription

1 Adapting -Medians to Generate Normalized Cluster Centers Benamin J. Anderson, Deborah S. Gross, David R. Musiant Anna M. Ritz, Thomas G. Smith, Leah E. Steinberg Carleton College {dgross, dmusian, ritza, smitht, Abstrat Many appliations of lustering require the use of normalized data, suh as text or mass spetra mining. The spherial -means algorithm [6], an adaptation of the traditional -means algorithm, is highly useful for data of this kind beause it produes normalized luster enters. The -medians lustering algorithm is also an important lustering tool beause of its wellknown resistane to outliers. -medians, however, is not trivially adapted to produe normalized luster enters. We introdue a new algorithm (alled MN), inspired by spherial -means, that integrates with - medians lustering to produe loally optimal normalized luster enters. We then show theoretially and experimentally that MN produes lusters of signifiantly higher quality than one would obtain via a simple saling of the luster enters produed from traditional -medians. eywords: -medians, lustering, normalized data. Introdution Clustering is one of the key tehniques used in data mining. In this paper we fous on lustering normalized datasets. Data is often normalized before lustering in order to remove unimportant distintions of sale. In lustering a orpus of text, for example, eah doument is often represented as a point where eah dimension represents a word s frequeny. When lustering data in this format, two douments of idential subet matter with different lengths should not be onsidered different. It therefore often makes sense to normalize text data before lustering. Single-partile mass spetrometry [] is another area where it is appropriate to normalize data. The hemial makeup of an aerosol partile is represented as two mass spetra, whih are plots of signal intensity versus the mass-to-harge ratio (m/z) for positive and negative ions produed by fragmenting the omponents of the aerosol partile. Thus, the presene of a peak indiates the presene of one or more ions ontaining the m/z value indiated (see Figure 5 for examples). As with text, we are generally not interested in the absolute magnitude of these peaks, but in the relative size of one peak when ompared with another. Cluster enters are often used to represent prototypial points, so it is usually desirable to normalize luster enters that arise from normalized data [5, 6]. The spherial -means algorithm [6] is well-suited to this task. For some appliations it is more desirable to use - norm distane (also known as Manhattan distane, denoted here as ) to measure the distane between points. This is beause the luster enter that minimizes -norm distane to all points within that luster is the median of that luster, and using the median instead of the mean tends to be more robust to outliers. The -medians algorithm [4, 9] is thus a powerful alternative. When attempting to use -medians on normalized data, however, unique hallenges arise in finding normalized loally optimal luster enters. We present the Manhattan Normalization (MN) algorithm, whih when integrated with -medians addresses these hallenges. There are two seemingly straightforward solutions to the problem of finding normalized luster enters with -medians. One ould apply traditional - medians and sale the luster enters at the end, or alternatively one ould use traditional -medians and sale the luster enters after eah iteration. Both of these ideas are heuristially based and lak theoretial guarantees of onvergene. We will show that MN integrated with -medians has guaranteed onvergene properties and performs better experimentally than either of these other approahes. We borrow many of our ideas from spherial -means, but we address the additional diffiulties introdued by the use of medians.

2 Setion 2 of this paper desribes relevant known lustering algorithms. Setion 3 desribes the hallenges that arise in normalized -medians lustering and our solutions to those hallenges. Experimental results and analysis are in Setion Clustering Review 2. Traditional -means The well-known -means algorithm [8] is used when one wishes to find luster enters that minimize the total of the squared 2-norm distane (also known as Eulidean distane, denoted here as 2 ) from eah point to its losest luster enter. Sine finding globally optimal luster enters is an NP-hard problem, -means may be used to find a loal solution. In order to run -means, one hooses the number of lusters to find and an initial set of luster enters. There are many different approahes for hoosing initial luster enters [3], but all of our work here holds regardless of whih tehnique is used. The -means framework an then be desribed as follows: -means Algorithm. Let x,x 2,...,x n be a set of points. We wish to partition them into disoint * * * lusters π, π2,..., π that minimize the obetive funtion: 2 Q({ π } = ) = x () = x π For eah luster, is the enter of eah luster, 2 whih is defined as the point for whih x is 2 x π minimized. This has been proven to be aomplished by hoosing to be the entroid of luster, defined as = x (2) n x π where n is the number of points in luster [8]. We therefore wish to find lusters that solve the following minimization problem: * { π } = arg min Q ({ π } ) (3) = = { π } = To do this, start with an arbitrary partitioning of the () π. Define the luster enters assoiated data { } = () with this partitioning as { } =. Define the index of iteration t =. Then follow these steps:. For eah point, find the luster enter with losest Eulidean distane. This yields a new partitioning { x x x i 2 2 } π = : i,, (4) =, where ties between lusters are resolved by random assignment to one of the optimal enters. Note that this is guaranteed not to inrease the obetive Q, sine eah point is assigned to its losest enter. by 2. Compute the new set of luster enters { } = omputing the mean (entroid) of eah luster. Sine the entroid is the point that minimizes the total distanes from all points to it, this step is also guaranteed not to inrease the obetive Q. 3. If a stopping riterion is met, report { π } final partitioning and { } = = as the as the final luster enters. Otherwise, inrement t by, and go to step above. A variety of stopping onditions are available. One ommon ondition is to stop when the differene between suessive values of the obetive Q is less than a small tolerane. -means is ultimately a loal optimization algorithm for minimizing lustering error, where lustering error is defined as the total squared Eulidean distane from eah point to its losest enter. The obetive Q never inreases from one iteration to the next. Sine -means is applied to a finite number of points, the algorithm must therefore terminate. 2.2 Spherial -means Many appliations for lustering normalized data require normalized luster enters. The spherial - means algorithm by Dhillon and Modha [6] addresses this need. The spherial -means literature uses the osine similarity metri. We find it more onvenient for this paper to use the squared Eulidean distane metri for spherial -means, but it is easily shown that for normalized data both of these metris yield preisely the same results. Spherial -means produes luster enters of magnitude by normalizing the luster enters after eah iteration. In other words, an extra step is added to the traditional -means algorithm as follows: Spherial -means algorithm. Start with a partitioning of the data as in the traditional -means algorithm. Initialize t =.. For eah point, find the losest luster enter as measured via squared Eulidean distane.

3 2. Compute the new set of luster enters { } = omputing the mean (entroid) of eah luster. 3. Normalize eah luster enter by saling it so that it has a magnitude of. In other words, redefine : =, =,, (5) 2 4. Terminate if the stopping ondition is met. Inrement t and go to step otherwise. Note that step 3 may atually inrease the lustering error following step 2, sine step 2 finds the luster enters that optimize luster error regardless of luster enter magnitude. Step 3 modifies the optimal luster enters so that they have a magnitude of at the expense of inreasing lustering error. However, Dhillon and Modha show [6] that if step 2 and step 3 are onsidered together as one operation, this proedure finds the optimal enter of magnitude for eah luster. The property that luster enters have a magnitude of is thus preserved from one iteration to the next, while the algorithm ensures that luster error does not inrease. Spherial -means is therefore guaranteed to onverge to a solution of loally optimal luster enters, eah with magnitude of medians We now review the -medians algorithm [4, 9], whih is used when one wishes to minimize the total - norm distane from eah point to its nearest luster enter. -medians is quite similar to -means, and its differenes from -means are defined as follows: -medians algorithm. Sine we now work with - norm distane instead of squared Eulidean distane, our obetive is stated as: π = = = x π by Q({ } ) x (6) We start with a partitioning of the data as in - means. Initialize t =.. For eah point, find the losest luster enter as measured via -norm distane. 2. Compute the new set of luster enters { } = omputing the median of the luster. In other words, for eah dimension ompute the median value for that dimension over all points in the luster. We use the median beause the median is the point that minimizes the total -norm distane from all points to it [4]. 3. Terminate if the stopping ondition is met. Inrement t and go to step otherwise. by In a similar fashion to -means, steps and 2 of - medians are guaranteed not to inrease the obetive Q. 3. Normalized -medians luster enters There are a variety of tradeoffs in hoosing between -medians and -means. We onsider the disussion of the hoie of -medians vs. -means (-medians is slower but more robust to outliers, et.) outside the sope of this paper. Our purpose is to enable the appropriate use of -medians on normalized data. We mentioned in Setion some simple hanges to -medians that might seem to appropriately adapt it for obtaining normalized luster enters from normalized data. We disuss these ideas, point out their flaws, and then move on to disuss our solution. 3. Simple Approahes Simple approah #: normalize by saling at eah iteration. A straightforward adaptation of spherial -means is problemati. The onept seems to be easy enough: during eah iteration, after new luster enters have been determined, normalize them via saling using the appropriate distane metri. In other words, redefine luster enters as follows: : =, =, (7) There is a key problem with this approah. Unlike spherial -means, saled luster enters are not the points of magnitude that minimize total -norm distane from eah point to its luster enter. Example: Consider the -norm normalized points x = (/5, /5, 3/5), x 2 = (/5, /5, 3/5), x 3 = (/5, 3/5, /5), x 4 = (/5, 3/5, /5), and x 5 = (3/5,/5,/5). The median of these points is (/5, /5, /5). If we simply proportionately sale this median to have a magnitude of, we end up with a luster enter of (/3, /3, /3). If we measure -norm distane from eah point to this enter, eah point is a distane of 8/5 from this enter, resulting in a total distane of 4/5. Suppose that we instead hoose the normalized point (/5, 3/5, /5) to be our luster enter. This enter yields a total -norm distane of only 36/5. The saled median is therefore learly not the enter that minimizes total -norm distane from all points to it. Simple approah #2: normalize by (method of hoie) when -medians stabilizes. This approah is heuristi in nature, though it may yield positive results. If the normalization tehnique to be used is simple saling, however, problems arise as disussed above. Like Simple Approah #, Simple Approah #2

4 to show that an in fat be less than or greater than (as was done in Setion 3.). We also assume through the remainder of this algorithm and in the theorem that follows that, =,...,d. This is true for many appliations of normalized data. If < for a partiular, transform Figure : Graphial representation of our MN algorithm. Vertial lines are dimensions, horizontal lines are values (thik horizontal lines are the same value ourring twie), and irles are urrent luster enter loations. Sliding the seond irle upward inurs less error than sliding any other irle would sine more values are above it. does not provide desirable theoretial guarantees that spherial -means does. By performing regular - medians at eah iteration, the obetive does not inrease but the enters found at eah iteration do not satisfy desired onstraints (magnitude of ). After - medians stabilizes, when the luster enters are normalized, the obetive is likely to inrease steeply. 3.2 Our Solution: The MN Algorithm The struture of the spherial -means algorithm is sound. In order to obtain luster enters of magnitude, one should produe normalized luster enters iteratively throughout the algorithm suh that the obetive error metri ontinues to derease. Simple approah # tried to address this, but was flawed. The missing link needed here is an algorithm to solve the following task: given a set of points assigned to a luster, find a enter of magnitude (in a -norm sense) that minimizes the total -norm distane from all points to this enter. Here is our algorithm for doing so. Manhattan Normalization (MN) algorithm. Let x,x 2,...,x n be a set of points where x i =, i=,...,n. We wish to find a point, where =, that minimizes: n n d x = x (8) i i i= i= = where d is the number of dimensions, indiates the th omponent of luster enter, and x i represents the th omponent of point x i.. Initialize to be the median of x,x 2,...,x n. minimizes the obetive above, but it is not neessarily true that =. (If =, we terminate the algorithm.) Note that examples an be easily generated this dimension by negating both and all values x i (i=,...,n). On ompletion of the algorithm, negate again. This makes the rest of the algorithm easier to state (no speial ases for negatives), yet has no effet on its orretness. The algorithm is symmetri with respet to whether < or >, so we desribe the < ase and indiate the > ase in brakets. 2. For eah dimension, ount the number of values x i that are stritly greater than [less than]. Denote these ounts as z, =,...,d. 3. Let m be the dimension for whih z is maximal. If more than one dimension z has the same maximal value, hoose one arbitrarily. 4. Redefine m to be the smallest [largest] value x im (i=,...,n) greater than [less than] m. If m <, set m =. 5. If =, terminate the algorithm. If < m >, go to step 2. Otherwise, redefine m as ( ) ( ) m + and terminate. The idea behind the algorithm an be larified via Figure. At eah iteration, the dimension that we hange is the one that has a maximal number of values in the diretion that we want the enter to move. To see this, onsider sliding any of the irles in Figure upward a small distane ε, whih would have the effet of inreasing by ε. Furthermore, the obetive error metri (sum of -norm distanes from all points to the luster enter) is inreased by ε for eah tik on or below the irle moved, and dereased by ε for eah tik above the irle. Therefore, we slide the irle with the most tiks above it in order to inrease while inurring as little error as possible. We repeat this entire proess until =. With this intuition in mind, we prove the following theorem. Theorem: Given a set of points x,x 2,...,x n where x i =, i=,...,n, the MN algorithm finds a point ( =) that minimizes the total -norm error from all points to it. Formally, is the solution to the following optimization problem:

5 n n d arg min xi = i= i= = st.. = xi (9) Proof: We know that the initial value of as hosen in Step of the algorithm minimizes the following alternative problem [4]: arg min n xi () i= If =, then the theorem is trivially true. Suppose that <. Steps 2 and 3 hoose a dimension m of to modify. Suppose that m. Step 4 redefines the mth omponent of. We denote here the new value of as ĉ. We then observe that ĉ must be a solution for the following optimization problem: n n d arg min xi = xi i= i= = () st.. = ˆ To see this, reall that ĉ was obtained by adding ˆm m to the m th omponent of. This adds to the obetive an additional error of ˆm m times the number of values x im, i=,...,n less than or equal to m. Likewise, it subtrats from the obetive an error of ˆm m times the number of values x im, i=,...,n greater than m. Sine m was hosen to be the dimension with maximal values greater than m, and sine the total number of values in eah dimension is the same, any inrease at all in any other dimension of value less than or equal to ˆm m annot result in a lesser inrease of the obetive. This argument is easily adapted for the > and m < ases through appropriate reversals (positive / negative, less than / greater than, et.). We omit these other ases for spae saving purposes. Now that we have established the MN algorithm, we an integrate it with -medians: MN iterative -medians algorithm. We start with a partitioning of the data. Initialize t =.. For eah point, find the luster enter with losest -norm distane. 2. Compute the new set of luster enters { } = omputing the median of the luster. 3. Normalize eah luster so that it has a -norm magnitude of by using the MN algorithm. 4. Terminate if the stopping ondition is met. Inrement t and go to step otherwise. by If one wishes to use -medians on normalized data and yield normalized luster enters, MN iterative - medians will find luster enters at eah iteration with error no greater than that from the previous iteration. 3.3 MN Algorithm Performane Issues -medians is learly not as fast as -means due to its median omputations. Sine our MN algorithm ours after a median is alulated, any pre-existing algorithm for alulating medians quikly an be used. The MN algorithm itself, in its worst ase, ould require an iteration for eah value above the median in eah dimension. This is yields an upper bound of nd iterations. At the beginning of the algorithm, there is an initial step where the number of values greater than or less than the initial enter must be ounted for eah dimension. This also requires nd alulations (d dimensions, n points for eah), but this is only neessary one. Eah suessive iteration then requires determination of whih dimension has the greatest number of values above the urrent enter. This ould be handled via a priority queue where the priority is the number of values above the urrent enter. With d entries in the priority queue, this would result in log2 d alulations per iteration to update it. This yields a omplexity of Ond ( log d ) eah time that MN is used, i.e. at eah maor -medians iteration. For a sizeable dataset, then, MN forms a relatively negligible portion of alulation time. While there are many optimized approahes for alulating medians under ertain irumstanes, a straightforward modified quiksort algorithm has a omplexity of On ( log n) on average. Sine eah maor iteration of the -medians algorithm requires that a median be alulated in eah dimension within eah luster, this has a omplexity of Ond ( log n ). (In the ase where the n points are distributed evenly among the lusters, we an refine this omplexity to n n O d log. Sine is the number of lusters and is not expeted to be dramatially large, this omplexity an again be simplified to Ond ( log n ).) Therefore, for a large dataset where n d, these Ond ( log n ) alulations required by the median algorithm will ompletely dominate the Ond ( log d) alulations required by the MN algorithm. It should also be pointed out that when the data is all non-negative, as our atmospheri data is, equation (8) an be formulated as a linear program (LP). An LP

6 solver ould therefore be used as an alternative to our MN algorithm. To test this, we used CLP [7], a high quality open-soure LP solver. Beause the MN algorithm is designed to attak this problem diretly and CLP is a generi LP solver, CLP took dramatially longer than MN. Finally, we point out that our algorithm does not laim to have ompetitive running time with -means. It is well known that -means is onsiderably faster than -medians. The purpose of this paper is to demonstrate that if one wishes to use -medians beause of its outlier-resistant properties, it an be so adapted by using the MN algorithm. 4. Experiments In order to test the effetiveness of the MN algorithm in improving traditional -medians, we ompare four different versions of -medians:. MN at eah iteration. This is our MN iterative - medians algorithm as desribed in Setion MN only at the end. This is Simple Approah #2 (Setion 3.) with MN used as the normalization algorithm at the end. 3. Saling at eah iteration. This is Simple Approah # (Setion 3.): at eah iteration of the - medians algorithm, -norm saling is used to normalize the median. 4. Saling only at the end. This is Simple Approah #2 with -norm saling (Setion 3.) used as the normalization algorithm at the end. We also ompare these four algorithms with - means as a sanity hek. For all five tehniques, we hoose initial enters via the heuristi tehnique of hoosing the first point as the first enter, then hoosing eah suessive enter to be the point in the dataset whose distane to its losest enter is greatest. We terminate iteration of the algorithm when the error metri hanges by no more than.. For the two approahes that normalize only at the end, note that the error undergoes a sudden inrease after the last iteration. We test our lustering algorithms against three different data sets. We first disuss results from text douments that are lustered by word frequeny. We then move on to two mass spetral datasets, both representing aerosol partile data. The first of these two datasets onsists of a syntheti dataset generated by adding noise to seven atual partile mass spetra. The seond dataset orresponds to atual aerosol data taken from St. Louis in February of 24. For eah of these three datasets, we report our experiments using a single value for (number of lusters) for brevity. Our goal is to examine differenes in lustering error and onvergene properties, and we point out that the theoretial disussion above on the merits of these algorithms is independent of number of lusters. We typially hoose to be our best estimate as to the number of lusters atually in the data, sine this enables an easy san of the resulting lusters for orretness. Clustering error in these experiments is defined as the average Manhattan distane from eah point to its losest enter. 4. Text Douments Our text dataset was generated from a set of 55 sonnets from 5 different authors (Rossetti, Spenser, Browning, Sidney, and Shakespeare), where eah sonnet was represented as a single point by ounting word frequenies within that text [2]. We preproessed our data with the Porter stemming algorithm []. We lustered with = 5 for all algorithms. We ompare the lustering error from our four - medians variants in Figure 2. Both of the iterative normalization methods have the same general urve, but MN iterative normalization has lower error than saled iterative normalization. The saled iterative normalization error inreases and dereases slightly as it stabilizes, while MN iterative always dereases as explained in Setion 3.2. For the two ases where normalization is performed only at the end, the errors are idential until the very end of the algorithm beause these two lustering methods are exatly the same until the last pass. Notie, however, that after normalization saling gives a higher error than MN. Figure 2 also shows that for this dataset, normalizing with either tehnique at the end yields only slightly worse errors than normalization at eah iteration. This suggests that one an use this shortut of normalizing only at the end if a slight degradation in luster auray and a rapid inrease in lustering error at the end are aeptable. Figures 5-9 (grouped together at the end of the paper) represent the breakdown of eah luster by author for the four -medians algorithms and the spherial -means algorithm. Sine we know the author of eah sonnet, we an then measure how homogenous eah luster is with respet to the five authors. Figures 7 and 8, whih represent the lusters generated from the two algorithms that use saled normalization, show fairly poor results. Cluster 2 in Figure 7 and luster in Figure 8 eah ontain the bulk of the sonnets by all authors. MN iterative (Figure 5), on the other hand, does a muh better ob lustering by

7 Error author: lusters, 3, 4, and 5 are mostly homogenous. These results are onsiderably better than saled normalization. Additionally, though MN at the end has worse lusters than iterative MN (Figure 6 only has three deent lusters; 3, 4, and 5), it still performs better than saled normalization. Finally, when omparing iterative MN with spherial -means (Figure 9), we see that it performs at least as well. Note that we are unable to diretly ompare the lustering error from spherial -means to the error from the -medians algorithms in Figure 2 beause the distane metris are different. 4.2 Syntheti Partiles MN at eah iteration Saled at eah iteration MN only at the end Saled only at the end Iteration Figure 2: Error per iteration for the four - medians algorithms on the text data. Note that the algorithms involving saling onsistently perform worse than the algorithms using MN. The sale for the vertial axis has been hosen to render the results as learly as possible. We generated a syntheti dataset of mass spetra by starting with spetra from seven atual aerosol partiles. Based on these seven partiles, we reated 2 artifiial partiles by adding noise to the spetra of the seven partiles []. Sine there are seven known partiles, we lustered using = 7. Looking at the error versus the number of passes, we see the same trends as in the text data (Figure 3). MN outperforms saled normalization when performed either iteratively or at the end. There is one important differene to note between Figures 2 and 3. The final errors for iterative MN and MN at the end are idential in Figure 3. (Tehnially, iterative MN is 2.46 x -4 higher than MN at the end, but this is negligible due to possible rounding errors.) In Figure 2, however, iterative MN has visibly lower error. This again shows that our MN algorithm is a onsiderably better normalization tehnique than simple saling, and the user an hoose between using Error Iteration it at eah iteration or at the end depending on needs. The improvement that MN shows over simple saling is not as dramati in Figure 3 as it is in Figure 2. Note that we have hosen a sale for the vertial axis to render the results as learly as possible. Although we have shown theoretially that MN will always perform better than (or equal to) simple saling, how muh better MN performs depends on the partiular dataset. It is apparently the ase that for this syntheti dataset, simple saling is less error-prone than it is for our sonnets dataset. Nonetheless, in both examples, MN performs better. This is to be expeted, sine MN has been proven to be loally optimal. Figures -3 (grouped together at the end of the paper) show the luster distributions from these four algorithms. All four graphs show that eah luster is mostly homogeneous. Figure 4 shows the results from spherial -means lustering on this data. Three of the lusters in this graph (lusters, 3, and 5) show onsiderably less homogeneity. 4.3 St. Louis Dataset MN at eah iteration Saled at eah iteration MN only at the end Saled only at the end Figure 3: Error per iteration for the four - medians algorithms on the syntheti partiles. We finally present results from lustering a dataset of mass spetra orresponding to 2966 partiles olleted during February 24 in East St. Louis, Illinois at the EPA SuperSite loation. We used =9 whih seemed to provide easily interpretable results. Figure 4 shows error versus the number of passes for the four -medians algorithms, indiating behavior onsistent with the previous datasets. We have no knowledge of what the orret lusters are for this dataset. We did, however, examine the luster enters that resulted from these four - medians algorithms (author Prof. Gross is an atmospheri sientist austomed to examining suh plots). We observed that the MN iterative -medians algorithm piked up more peaks due to negative ions

8 than any of the other three -medians algorithms did. This set of luster enters is therefore more likely to be orret than the other three sets. The MN iterative - medians algorithm also showed onsiderably more peaks at loations with high m/z values (mass-toharge ratios) than any of the other algorithms did, whih also added to its redibility. Interestingly, the only other one of these four algorithms to show a luster enter with peaks at high m/z values was the saled iterative algorithm. We present two sample luster enters to illustrate these findings (Figure 5). 5. Conlusions and Future Work We have presented the MN algorithm, whih allows suessful use of -medians on normalized data when normalized luster enters are desired. This algorithm an be used during eah iteration of -medians or simply at the end of the algorithm, depending on whether one wants to emphasize non-inreasing lustering error or speed. We provided theoretial and experimental evidene that our algorithm is orret. Finally, to demonstrate experimentally the viability of our tehnique, we provided some brief omparisons with spherial -means. On our datasets, -medians ombined with MN yields lusters that are omparable to or better than those produed by spherial -means. Most of the salability issues for our algorithm are limited by the state of the art in saling traditional - medians. There is nonetheless room for future work in onsidering how to sale our algorithm, partiularly when the number of dimensions is high. Aknowledgements. This researh is supported by NSF ITR grant IIS and by Carleton College. 6. Referenes [] B. J. Anderson, D. R. Musiant, A. M. Ritz, A. Ault, D. S. Gross, M. Yuen, and M. Gaelli, "User-Friendly Error MN at eah iteration Saled at eah iteration MN only at the end Saled only at the end Iteration Figure 4: Error per iteration for the four - medians algorithms on the St. Louis data. Clustering for Atmospheri Data Analysis," Carleton College, Northfield, MN, Tehnial Report 5-a, 25. [2] E. Blomquist, Sonnet Central, 23, [3] P. S. Bradley and U. M. Fayyad, "Refining Initial Points for -Means Clustering," presented at Pro. 5th International Conf. on Mahine Learning, 998. [4] P. S. Bradley, O. L. Mangasarian, and W. N. Street, "Clustering via Conave Minimization," in Advanes in Neural Information Proessing Systems, vol. 9, M. C. Mozer, M. I. Jordan, and T. Petshe, Eds. Cambridge, MA: MIT Press, 997, pp [5] I. S. Dhillon, Y. Guan, and J. Fan, "Effiient Clustering of Very Large Doument Colletions," in Data Mining for Sientifi and Engineering Appliations, 2, pp [6] I. S. Dhillon and D. S. Modha, "Conept Deompositions for Large Sparse Text Data using Clustering," Mahine Learning, vol. 42, pp , 2. [7] J. Forrest, D. d. l. Nuez, and R. Lougee-Heimer, "CLP: COIN Linear Program Code,".2.2 ed, 25. [8] T. Hastie, R. Tibshirani, and J. H. Friedman, The Elements of Statistial Learning: Springer, 2. [9] A.. Jain and R. C. Dubes, Algorithms for Clustering Data: Prentie-Hall, 98. [] M. F. Porter, "An Algorithm for Suffix Stripping," Program, vol. 4, pp. 3-37, 98. [] D. T. Suess and. A. Prather, "Mass Spetrometry of Aerosols," Chemial Reviews, vol. 99, pp , 999.

9 # of Douments Figure 5: -medians with MN at eah iteration # of Douments Figure 6: -medians with MN only at the end # of Douments Figure 7: -medians saled at eah iteration # of Douments Figure 8: -medians saled only at the end # of Douments Figure 9: Spherial -means Rossetti Spenser # of Douments Browning Sidney Shakespeare I. Text Douments

10 # of Partiles Figure : -medians with MN at eah iteration # of Partiles Figure : -medians with MN only at the end # of Partiles Figure 2: -medians saled at eah iteration # of Partiles Figure 3: -medians saled only at the end # of Partiles Figure 4: Spherial -means Ambient Metals Ambient Brake Dust 7-3 Ambient 4 General 7 Ambient Smoke Clust er s Elemental Carbon Organi Carbon PAH II. Syntheti Partiles

11 a 27 b m/z m/z Figure 5: St. Louis luster samples. Corresponding aerosol partile mass spetra luster enters from -medians with MN at eah iteration (a) and -medians saled at eah iteration (b). Of the four -medians algorithms, only these two produed luster enters suh as these with peaks in the high m/z range. -medians with MN at eah iteration shows more of these peaks. Additionally, -medians with MN at eah iteration produes more luster enters with peaks in negative spetra. These harateristis are more representative of this data. III. St. Louis Data

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract

Learning Convention Propagation in BeerAdvocate Reviews from a etwork Perspective. Abstract CS 9 Projet Final Report: Learning Convention Propagation in BeerAdvoate Reviews from a etwork Perspetive Abstrat We look at the way onventions propagate between reviews on the BeerAdvoate dataset, and

More information

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman

NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION. Ken Sauer and Charles A. Bouman NONLINEAR BACK PROJECTION FOR TOMOGRAPHIC IMAGE RECONSTRUCTION Ken Sauer and Charles A. Bouman Department of Eletrial Engineering, University of Notre Dame Notre Dame, IN 46556, (219) 631-6999 Shool of

More information

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking

Algorithms for External Memory Lecture 6 Graph Algorithms - Weighted List Ranking Algorithms for External Memory Leture 6 Graph Algorithms - Weighted List Ranking Leturer: Nodari Sithinava Sribe: Andi Hellmund, Simon Ohsenreither 1 Introdution & Motivation After talking about I/O-effiient

More information

Extracting Partition Statistics from Semistructured Data

Extracting Partition Statistics from Semistructured Data Extrating Partition Statistis from Semistrutured Data John N. Wilson Rihard Gourlay Robert Japp Mathias Neumüller Department of Computer and Information Sienes University of Strathlyde, Glasgow, UK {jnw,rsg,rpj,mathias}@is.strath.a.uk

More information

Chapter 2: Introduction to Maple V

Chapter 2: Introduction to Maple V Chapter 2: Introdution to Maple V 2-1 Working with Maple Worksheets Try It! (p. 15) Start a Maple session with an empty worksheet. The name of the worksheet should be Untitled (1). Use one of the standard

More information

Detecting Outliers in High-Dimensional Datasets with Mixed Attributes

Detecting Outliers in High-Dimensional Datasets with Mixed Attributes Deteting Outliers in High-Dimensional Datasets with Mixed Attributes A. Koufakou, M. Georgiopoulos, and G.C. Anagnostopoulos 2 Shool of EECS, University of Central Florida, Orlando, FL, USA 2 Dept. of

More information

the data. Structured Principal Component Analysis (SPCA)

the data. Structured Principal Component Analysis (SPCA) Strutured Prinipal Component Analysis Kristin M. Branson and Sameer Agarwal Department of Computer Siene and Engineering University of California, San Diego La Jolla, CA 9193-114 Abstrat Many tasks involving

More information

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1.

Abstract. Key Words: Image Filters, Fuzzy Filters, Order Statistics Filters, Rank Ordered Mean Filters, Channel Noise. 1. Fuzzy Weighted Rank Ordered Mean (FWROM) Filters for Mixed Noise Suppression from Images S. Meher, G. Panda, B. Majhi 3, M.R. Meher 4,,4 Department of Eletronis and I.E., National Institute of Tehnology,

More information

Pipelined Multipliers for Reconfigurable Hardware

Pipelined Multipliers for Reconfigurable Hardware Pipelined Multipliers for Reonfigurable Hardware Mithell J. Myjak and José G. Delgado-Frias Shool of Eletrial Engineering and Computer Siene, Washington State University Pullman, WA 99164-2752 USA {mmyjak,

More information

arxiv: v1 [cs.db] 13 Sep 2017

arxiv: v1 [cs.db] 13 Sep 2017 An effiient lustering algorithm from the measure of loal Gaussian distribution Yuan-Yen Tai (Dated: May 27, 2018) In this paper, I will introdue a fast and novel lustering algorithm based on Gaussian distribution

More information

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks

A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks A Partial Sorting Algorithm in Multi-Hop Wireless Sensor Networks Abouberine Ould Cheikhna Department of Computer Siene University of Piardie Jules Verne 80039 Amiens Frane Ould.heikhna.abouberine @u-piardie.fr

More information

A Novel Validity Index for Determination of the Optimal Number of Clusters

A Novel Validity Index for Determination of the Optimal Number of Clusters IEICE TRANS. INF. & SYST., VOL.E84 D, NO.2 FEBRUARY 2001 281 LETTER A Novel Validity Index for Determination of the Optimal Number of Clusters Do-Jong KIM, Yong-Woon PARK, and Dong-Jo PARK, Nonmembers

More information

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index

An Optimized Approach on Applying Genetic Algorithm to Adaptive Cluster Validity Index IJCSES International Journal of Computer Sienes and Engineering Systems, ol., No.4, Otober 2007 CSES International 2007 ISSN 0973-4406 253 An Optimized Approah on Applying Geneti Algorithm to Adaptive

More information

Approximate logic synthesis for error tolerant applications

Approximate logic synthesis for error tolerant applications Approximate logi synthesis for error tolerant appliations Doohul Shin and Sandeep K. Gupta Eletrial Engineering Department, University of Southern California, Los Angeles, CA 989 {doohuls, sandeep}@us.edu

More information

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem

Calculation of typical running time of a branch-and-bound algorithm for the vertex-cover problem Calulation of typial running time of a branh-and-bound algorithm for the vertex-over problem Joni Pajarinen, Joni.Pajarinen@iki.fi Otober 21, 2007 1 Introdution The vertex-over problem is one of a olletion

More information

CleanUp: Improving Quadrilateral Finite Element Meshes

CleanUp: Improving Quadrilateral Finite Element Meshes CleanUp: Improving Quadrilateral Finite Element Meshes Paul Kinney MD-10 ECC P.O. Box 203 Ford Motor Company Dearborn, MI. 8121 (313) 28-1228 pkinney@ford.om Abstrat: Unless an all quadrilateral (quad)

More information

Video Data and Sonar Data: Real World Data Fusion Example

Video Data and Sonar Data: Real World Data Fusion Example 14th International Conferene on Information Fusion Chiago, Illinois, USA, July 5-8, 2011 Video Data and Sonar Data: Real World Data Fusion Example David W. Krout Applied Physis Lab dkrout@apl.washington.edu

More information

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization

Self-Adaptive Parent to Mean-Centric Recombination for Real-Parameter Optimization Self-Adaptive Parent to Mean-Centri Reombination for Real-Parameter Optimization Kalyanmoy Deb and Himanshu Jain Department of Mehanial Engineering Indian Institute of Tehnology Kanpur Kanpur, PIN 86 {deb,hjain}@iitk.a.in

More information

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering

A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering A Novel Bit Level Time Series Representation with Impliation of Similarity Searh and lustering hotirat Ratanamahatana, Eamonn Keogh, Anthony J. Bagnall 2, and Stefano Lonardi Dept. of omputer Siene & Engineering,

More information

And, the (low-pass) Butterworth filter of order m is given in the frequency domain by

And, the (low-pass) Butterworth filter of order m is given in the frequency domain by Problem Set no.3.a) The ideal low-pass filter is given in the frequeny domain by B ideal ( f ), f f; =, f > f. () And, the (low-pass) Butterworth filter of order m is given in the frequeny domain by B

More information

Outline: Software Design

Outline: Software Design Outline: Software Design. Goals History of software design ideas Design priniples Design methods Life belt or leg iron? (Budgen) Copyright Nany Leveson, Sept. 1999 A Little History... At first, struggling

More information

Drawing lines. Naïve line drawing algorithm. drawpixel(x, round(y)); double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx; double y = y0;

Drawing lines. Naïve line drawing algorithm. drawpixel(x, round(y)); double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx; double y = y0; Naïve line drawing algorithm // Connet to grid points(x0,y0) and // (x1,y1) by a line. void drawline(int x0, int y0, int x1, int y1) { int x; double dy = y1 - y0; double dx = x1 - x0; double m = dy / dx;

More information

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays

Analysis of input and output configurations for use in four-valued CCD programmable logic arrays nalysis of input and output onfigurations for use in four-valued D programmable logi arrays J.T. utler H.G. Kerkhoff ndexing terms: Logi, iruit theory and design, harge-oupled devies bstrat: s in binary,

More information

Using Augmented Measurements to Improve the Convergence of ICP

Using Augmented Measurements to Improve the Convergence of ICP Using Augmented Measurements to Improve the onvergene of IP Jaopo Serafin, Giorgio Grisetti Dept. of omputer, ontrol and Management Engineering, Sapienza University of Rome, Via Ariosto 25, I-0085, Rome,

More information

Introduction to Seismology Spring 2008

Introduction to Seismology Spring 2008 MIT OpenCourseWare http://ow.mit.edu 1.510 Introdution to Seismology Spring 008 For information about iting these materials or our Terms of Use, visit: http://ow.mit.edu/terms. 1.510 Leture Notes 3.3.007

More information

1. The collection of the vowels in the word probability. 2. The collection of real numbers that satisfy the equation x 9 = 0.

1. The collection of the vowels in the word probability. 2. The collection of real numbers that satisfy the equation x 9 = 0. C HPTER 1 SETS I. DEFINITION OF SET We begin our study of probability with the disussion of the basi onept of set. We assume that there is a ommon understanding of what is meant by the notion of a olletion

More information

Sequential Incremental-Value Auctions

Sequential Incremental-Value Auctions Sequential Inremental-Value Autions Xiaoming Zheng and Sven Koenig Department of Computer Siene University of Southern California Los Angeles, CA 90089-0781 {xiaominz,skoenig}@us.edu Abstrat We study the

More information

Gray Codes for Reflectable Languages

Gray Codes for Reflectable Languages Gray Codes for Refletable Languages Yue Li Joe Sawada Marh 8, 2008 Abstrat We lassify a type of language alled a refletable language. We then develop a generi algorithm that an be used to list all strings

More information

Cluster-Based Cumulative Ensembles

Cluster-Based Cumulative Ensembles Cluster-Based Cumulative Ensembles Hanan G. Ayad and Mohamed S. Kamel Pattern Analysis and Mahine Intelligene Lab, Eletrial and Computer Engineering, University of Waterloo, Waterloo, Ontario N2L 3G1,

More information

The Happy Ending Problem

The Happy Ending Problem The Happy Ending Problem Neeldhara Misra STATUTORY WARNING This doument is a draft version 1 Introdution The Happy Ending problem first manifested itself on a typial wintery evening in 1933 These evenings

More information

Gradient based progressive probabilistic Hough transform

Gradient based progressive probabilistic Hough transform Gradient based progressive probabilisti Hough transform C.Galambos, J.Kittler and J.Matas Abstrat: The authors look at the benefits of exploiting gradient information to enhane the progressive probabilisti

More information

MATH STUDENT BOOK. 12th Grade Unit 6

MATH STUDENT BOOK. 12th Grade Unit 6 MATH STUDENT BOOK 12th Grade Unit 6 Unit 6 TRIGONOMETRIC APPLICATIONS MATH 1206 TRIGONOMETRIC APPLICATIONS INTRODUCTION 3 1. TRIGONOMETRY OF OBLIQUE TRIANGLES 5 LAW OF SINES 5 AMBIGUITY AND AREA OF A TRIANGLE

More information

HEXA: Compact Data Structures for Faster Packet Processing

HEXA: Compact Data Structures for Faster Packet Processing Washington University in St. Louis Washington University Open Sholarship All Computer Siene and Engineering Researh Computer Siene and Engineering Report Number: 27-26 27 HEXA: Compat Data Strutures for

More information

with respect to the normal in each medium, respectively. The question is: How are θ

with respect to the normal in each medium, respectively. The question is: How are θ Prof. Raghuveer Parthasarathy University of Oregon Physis 35 Winter 8 3 R EFRACTION When light travels from one medium to another, it may hange diretion. This phenomenon familiar whenever we see the bent

More information

1 The Knuth-Morris-Pratt Algorithm

1 The Knuth-Morris-Pratt Algorithm 5-45/65: Design & Analysis of Algorithms September 26, 26 Leture #9: String Mathing last hanged: September 26, 27 There s an entire field dediated to solving problems on strings. The book Algorithms on

More information

A Fast Kernel-based Multilevel Algorithm for Graph Clustering

A Fast Kernel-based Multilevel Algorithm for Graph Clustering A Fast Kernel-based Multilevel Algorithm for Graph Clustering Inderjit Dhillon Dept. of Computer Sienes University of Texas at Austin Austin, TX 78712 inderjit@s.utexas.edu Yuqiang Guan Dept. of Computer

More information

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering

Volume 3, Issue 9, September 2013 International Journal of Advanced Research in Computer Science and Software Engineering Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advaned Researh in Computer Siene and Software Engineering Researh Paper Available online at: www.ijarsse.om A New-Fangled Algorithm

More information

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425)

Automatic Physical Design Tuning: Workload as a Sequence Sanjay Agrawal Microsoft Research One Microsoft Way Redmond, WA, USA +1-(425) Automati Physial Design Tuning: Workload as a Sequene Sanjay Agrawal Mirosoft Researh One Mirosoft Way Redmond, WA, USA +1-(425) 75-357 sagrawal@mirosoft.om Eri Chu * Computer Sienes Department University

More information

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines

The Minimum Redundancy Maximum Relevance Approach to Building Sparse Support Vector Machines The Minimum Redundany Maximum Relevane Approah to Building Sparse Support Vetor Mahines Xiaoxing Yang, Ke Tang, and Xin Yao, Nature Inspired Computation and Appliations Laboratory (NICAL), Shool of Computer

More information

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks

A Load-Balanced Clustering Protocol for Hierarchical Wireless Sensor Networks International Journal of Advanes in Computer Networks and Its Seurity IJCNS A Load-Balaned Clustering Protool for Hierarhial Wireless Sensor Networks Mehdi Tarhani, Yousef S. Kavian, Saman Siavoshi, Ali

More information

Semi-Supervised Affinity Propagation with Instance-Level Constraints

Semi-Supervised Affinity Propagation with Instance-Level Constraints Semi-Supervised Affinity Propagation with Instane-Level Constraints Inmar E. Givoni, Brendan J. Frey Probabilisti and Statistial Inferene Group University of Toronto 10 King s College Road, Toronto, Ontario,

More information

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR

A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Malaysian Journal of Computer Siene, Vol 10 No 1, June 1997, pp 36-41 A DYNAMIC ACCESS CONTROL WITH BINARY KEY-PAIR Md Rafiqul Islam, Harihodin Selamat and Mohd Noor Md Sap Faulty of Computer Siene and

More information

An Alternative Approach to the Fuzzifier in Fuzzy Clustering to Obtain Better Clustering Results

An Alternative Approach to the Fuzzifier in Fuzzy Clustering to Obtain Better Clustering Results An Alternative Approah to the Fuzziier in Fuzzy Clustering to Obtain Better Clustering Results Frank Klawonn Department o Computer Siene University o Applied Sienes BS/WF Salzdahlumer Str. 46/48 D-38302

More information

Finding the Equation of a Straight Line

Finding the Equation of a Straight Line Finding the Equation of a Straight Line You should have, before now, ome aross the equation of a straight line, perhaps at shool. Engineers use this equation to help determine how one quantity is related

More information

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs?

One Against One or One Against All : Which One is Better for Handwriting Recognition with SVMs? One Against One or One Against All : Whih One is Better for Handwriting Reognition with SVMs? Jonathan Milgram, Mohamed Cheriet, Robert Sabourin To ite this version: Jonathan Milgram, Mohamed Cheriet,

More information

We P9 16 Eigenray Tracing in 3D Heterogeneous Media

We P9 16 Eigenray Tracing in 3D Heterogeneous Media We P9 Eigenray Traing in 3D Heterogeneous Media Z. Koren* (Emerson), I. Ravve (Emerson) Summary Conventional two-point ray traing in a general 3D heterogeneous medium is normally performed by a shooting

More information

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications

System-Level Parallelism and Throughput Optimization in Designing Reconfigurable Computing Applications System-Level Parallelism and hroughput Optimization in Designing Reonfigurable Computing Appliations Esam El-Araby 1, Mohamed aher 1, Kris Gaj 2, arek El-Ghazawi 1, David Caliga 3, and Nikitas Alexandridis

More information

Divide-and-conquer algorithms 1

Divide-and-conquer algorithms 1 * 1 Multipliation Divide-and-onquer algorithms 1 The mathematiian Gauss one notied that although the produt of two omplex numbers seems to! involve four real-number multipliations it an in fat be done

More information

Introductory Programming, IMM, DTU Systematic Software Test. Software test (afprøvning) Motivation. Structural test and functional test

Introductory Programming, IMM, DTU Systematic Software Test. Software test (afprøvning) Motivation. Structural test and functional test Introdutory Programming, IMM, DTU Systemati Software Test Peter Sestoft a Programs often ontain unintended errors how do you find them? Strutural test Funtional test Notes: Systemati Software Test, http://www.dina.kvl.dk/

More information

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar

Plot-to-track correlation in A-SMGCS using the target images from a Surface Movement Radar Plot-to-trak orrelation in A-SMGCS using the target images from a Surfae Movement Radar G. Golino Radar & ehnology Division AMS, Italy ggolino@amsjv.it Abstrat he main topi of this paper is the formulation

More information

Incremental Mining of Partial Periodic Patterns in Time-series Databases

Incremental Mining of Partial Periodic Patterns in Time-series Databases CERIAS Teh Report 2000-03 Inremental Mining of Partial Periodi Patterns in Time-series Dataases Mohamed G. Elfeky Center for Eduation and Researh in Information Assurane and Seurity Purdue University,

More information

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules

Improved Vehicle Classification in Long Traffic Video by Cooperating Tracker and Classifier Modules Improved Vehile Classifiation in Long Traffi Video by Cooperating Traker and Classifier Modules Brendan Morris and Mohan Trivedi University of California, San Diego San Diego, CA 92093 {b1morris, trivedi}@usd.edu

More information

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS

INTERPOLATED AND WARPED 2-D DIGITAL WAVEGUIDE MESH ALGORITHMS Proeedings of the COST G-6 Conferene on Digital Audio Effets (DAFX-), Verona, Italy, Deember 7-9, INTERPOLATED AND WARPED -D DIGITAL WAVEGUIDE MESH ALGORITHMS Vesa Välimäki Lab. of Aoustis and Audio Signal

More information

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing

特集 Road Border Recognition Using FIR Images and LIDAR Signal Processing デンソーテクニカルレビュー Vol. 15 2010 特集 Road Border Reognition Using FIR Images and LIDAR Signal Proessing 高木聖和 バーゼル ファルディ Kiyokazu TAKAGI Basel Fardi ヘンドリック ヴァイゲル Hendrik Weigel ゲルド ヴァニーリック Gerd Wanielik This paper

More information

Performance Benchmarks for an Interactive Video-on-Demand System

Performance Benchmarks for an Interactive Video-on-Demand System Performane Benhmarks for an Interative Video-on-Demand System. Guo,P.G.Taylor,E.W.M.Wong,S.Chan,M.Zukerman andk.s.tang ARC Speial Researh Centre for Ultra-Broadband Information Networks (CUBIN) Department

More information

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction

Time delay estimation of reverberant meeting speech: on the use of multichannel linear prediction University of Wollongong Researh Online Faulty of Informatis - apers (Arhive) Faulty of Engineering and Information Sienes 7 Time delay estimation of reverberant meeting speeh: on the use of multihannel

More information

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks

Flow Demands Oriented Node Placement in Multi-Hop Wireless Networks Flow Demands Oriented Node Plaement in Multi-Hop Wireless Networks Zimu Yuan Institute of Computing Tehnology, CAS, China {zimu.yuan}@gmail.om arxiv:153.8396v1 [s.ni] 29 Mar 215 Abstrat In multi-hop wireless

More information

A {k, n}-secret Sharing Scheme for Color Images

A {k, n}-secret Sharing Scheme for Color Images A {k, n}-seret Sharing Sheme for Color Images Rastislav Luka, Konstantinos N. Plataniotis, and Anastasios N. Venetsanopoulos The Edward S. Rogers Sr. Dept. of Eletrial and Computer Engineering, University

More information

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality

Multi-Piece Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality INTERNATIONAL CONFERENCE ON MANUFACTURING AUTOMATION (ICMA200) Multi-Piee Mold Design Based on Linear Mixed-Integer Program Toward Guaranteed Optimality Stephen Stoyan, Yong Chen* Epstein Department of

More information

1. Introduction. 2. The Probable Stope Algorithm

1. Introduction. 2. The Probable Stope Algorithm 1. Introdution Optimization in underground mine design has reeived less attention than that in open pit mines. This is mostly due to the diversity o underground mining methods and omplexity o underground

More information

Facility Location: Distributed Approximation

Facility Location: Distributed Approximation Faility Loation: Distributed Approximation Thomas Mosibroda Roger Wattenhofer Distributed Computing Group PODC 2005 Where to plae ahes in the Internet? A distributed appliation that has to dynamially plae

More information

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks

Unsupervised Stereoscopic Video Object Segmentation Based on Active Contours and Retrainable Neural Networks Unsupervised Stereosopi Video Objet Segmentation Based on Ative Contours and Retrainable Neural Networks KLIMIS NTALIANIS, ANASTASIOS DOULAMIS, and NIKOLAOS DOULAMIS National Tehnial University of Athens

More information

Weak Dependence on Initialization in Mixture of Linear Regressions

Weak Dependence on Initialization in Mixture of Linear Regressions Proeedings of the International MultiConferene of Engineers and Computer Sientists 8 Vol I IMECS 8, Marh -6, 8, Hong Kong Weak Dependene on Initialization in Mixture of Linear Regressions Ryohei Nakano

More information

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2

On - Line Path Delay Fault Testing of Omega MINs M. Bellos 1, E. Kalligeros 1, D. Nikolos 1,2 & H. T. Vergos 1,2 On - Line Path Delay Fault Testing of Omega MINs M. Bellos, E. Kalligeros, D. Nikolos,2 & H. T. Vergos,2 Dept. of Computer Engineering and Informatis 2 Computer Tehnology Institute University of Patras,

More information

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints

Smooth Trajectory Planning Along Bezier Curve for Mobile Robots with Velocity Constraints Smooth Trajetory Planning Along Bezier Curve for Mobile Robots with Veloity Constraints Gil Jin Yang and Byoung Wook Choi Department of Eletrial and Information Engineering Seoul National University of

More information

Acoustic Links. Maximizing Channel Utilization for Underwater

Acoustic Links. Maximizing Channel Utilization for Underwater Maximizing Channel Utilization for Underwater Aousti Links Albert F Hairris III Davide G. B. Meneghetti Adihele Zorzi Department of Information Engineering University of Padova, Italy Email: {harris,davide.meneghetti,zorzi}@dei.unipd.it

More information

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION

KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION KERNEL SPARSE REPRESENTATION WITH LOCAL PATTERNS FOR FACE RECOGNITION Cuiui Kang 1, Shengai Liao, Shiming Xiang 1, Chunhong Pan 1 1 National Laboratory of Pattern Reognition, Institute of Automation, Chinese

More information

Chemical, Biological and Radiological Hazard Assessment: A New Model of a Plume in a Complex Urban Environment

Chemical, Biological and Radiological Hazard Assessment: A New Model of a Plume in a Complex Urban Environment hemial, Biologial and Radiologial Haard Assessment: A New Model of a Plume in a omplex Urban Environment Skvortsov, A.T., P.D. Dawson, M.D. Roberts and R.M. Gailis HPP Division, Defene Siene and Tehnology

More information

An Efficient and Scalable Approach to CNN Queries in a Road Network

An Efficient and Scalable Approach to CNN Queries in a Road Network An Effiient and Salable Approah to CNN Queries in a Road Network Hyung-Ju Cho Chin-Wan Chung Dept. of Eletrial Engineering & Computer Siene Korea Advaned Institute of Siene and Tehnology 373- Kusong-dong,

More information

Detection and Recognition of Non-Occluded Objects using Signature Map

Detection and Recognition of Non-Occluded Objects using Signature Map 6th WSEAS International Conferene on CIRCUITS, SYSTEMS, ELECTRONICS,CONTROL & SIGNAL PROCESSING, Cairo, Egypt, De 9-31, 007 65 Detetion and Reognition of Non-Oluded Objets using Signature Map Sangbum Park,

More information

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else

timestamp, if silhouette(x, y) 0 0 if silhouette(x, y) = 0, mhi(x, y) = and mhi(x, y) < timestamp - duration mhi(x, y), else 3rd International Conferene on Multimedia Tehnolog(ICMT 013) An Effiient Moving Target Traking Strateg Based on OpenCV and CAMShift Theor Dongu Li 1 Abstrat Image movement involved bakground movement and

More information

Path Sharing and Predicate Evaluation for High-Performance XML Filtering*

Path Sharing and Predicate Evaluation for High-Performance XML Filtering* Path Sharing and Prediate Evaluation for High-Performane XML Filtering Yanlei Diao, Mihael J. Franklin, Hao Zhang, Peter Fisher EECS, University of California, Berkeley {diaoyl, franklin, nhz, fisherp}@s.erkeley.edu

More information

Fast Rigid Motion Segmentation via Incrementally-Complex Local Models

Fast Rigid Motion Segmentation via Incrementally-Complex Local Models Fast Rigid Motion Segmentation via Inrementally-Complex Loal Models Fernando Flores-Mangas Allan D. Jepson Department of Computer Siene, University of Toronto {mangas,jepson}@s.toronto.edu Abstrat The

More information

Exploiting Enriched Contextual Information for Mobile App Classification

Exploiting Enriched Contextual Information for Mobile App Classification Exploiting Enrihed Contextual Information for Mobile App Classifiation Hengshu Zhu 1 Huanhuan Cao 2 Enhong Chen 1 Hui Xiong 3 Jilei Tian 2 1 University of Siene and Tehnology of China 2 Nokia Researh Center

More information

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition

A Coarse-to-Fine Classification Scheme for Facial Expression Recognition A Coarse-to-Fine Classifiation Sheme for Faial Expression Reognition Xiaoyi Feng 1,, Abdenour Hadid 1 and Matti Pietikäinen 1 1 Mahine Vision Group Infoteh Oulu and Dept. of Eletrial and Information Engineering

More information

Partial Character Decoding for Improved Regular Expression Matching in FPGAs

Partial Character Decoding for Improved Regular Expression Matching in FPGAs Partial Charater Deoding for Improved Regular Expression Mathing in FPGAs Peter Sutton Shool of Information Tehnology and Eletrial Engineering The University of Queensland Brisbane, Queensland, 4072, Australia

More information

Relevance for Computer Vision

Relevance for Computer Vision The Geometry of ROC Spae: Understanding Mahine Learning Metris through ROC Isometris, by Peter A. Flah International Conferene on Mahine Learning (ICML-23) http://www.s.bris.a.uk/publiations/papers/74.pdf

More information

Colouring contact graphs of squares and rectilinear polygons de Berg, M.T.; Markovic, A.; Woeginger, G.

Colouring contact graphs of squares and rectilinear polygons de Berg, M.T.; Markovic, A.; Woeginger, G. Colouring ontat graphs of squares and retilinear polygons de Berg, M.T.; Markovi, A.; Woeginger, G. Published in: nd European Workshop on Computational Geometry (EuroCG 06), 0 Marh - April, Lugano, Switzerland

More information

HIGHER ORDER full-wave three-dimensional (3-D) large-domain techniques in

HIGHER ORDER full-wave three-dimensional (3-D) large-domain techniques in FACTA UNIVERSITATIS (NIŠ) SER.: ELEC. ENERG. vol. 21, no. 2, August 2008, 209-220 Comparison of Higher Order FEM and MoM/SIE Approahes in Analyses of Closed- and Open-Region Eletromagneti Problems Milan

More information

Comparing Images Under Variable Illumination

Comparing Images Under Variable Illumination ( This paper appeared in CVPR 8. IEEE ) Comparing Images Under Variable Illumination David W. Jaobs Peter N. Belhumeur Ronen Basri NEC Researh Institute Center for Computational Vision and Control The

More information

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks

A Dual-Hamiltonian-Path-Based Multicasting Strategy for Wormhole-Routed Star Graph Interconnection Networks A Dual-Hamiltonian-Path-Based Multiasting Strategy for Wormhole-Routed Star Graph Interonnetion Networks Nen-Chung Wang Department of Information and Communiation Engineering Chaoyang University of Tehnology,

More information

UCSB Math TI-85 Tutorials: Basics

UCSB Math TI-85 Tutorials: Basics 3 UCSB Math TI-85 Tutorials: Basis If your alulator sreen doesn t show anything, try adjusting the ontrast aording to the instrutions on page 3, or page I-3, of the alulator manual You should read the

More information

Cluster Centric Fuzzy Modeling

Cluster Centric Fuzzy Modeling 10.1109/TFUZZ.014.300134, IEEE Transations on Fuzzy Systems TFS-013-0379.R1 1 Cluster Centri Fuzzy Modeling Witold Pedryz, Fellow, IEEE, and Hesam Izakian, Student Member, IEEE Abstrat In this study, we

More information

represent = as a finite deimal" either in base 0 or in base. We an imagine that the omputer first omputes the mathematial = then rounds the result to

represent = as a finite deimal either in base 0 or in base. We an imagine that the omputer first omputes the mathematial = then rounds the result to Sientifi Computing Chapter I Computer Arithmeti Jonathan Goodman Courant Institute of Mathemaial Sienes Last revised January, 00 Introdution One of the many soures of error in sientifi omputing is inexat

More information

The Implementation of RRTs for a Remote-Controlled Mobile Robot

The Implementation of RRTs for a Remote-Controlled Mobile Robot ICCAS5 June -5, KINEX, Gyeonggi-Do, Korea he Implementation of RRs for a Remote-Controlled Mobile Robot Chi-Won Roh*, Woo-Sub Lee **, Sung-Chul Kang *** and Kwang-Won Lee **** * Intelligent Robotis Researh

More information

Layout Compliance for Triple Patterning Lithography: An Iterative Approach

Layout Compliance for Triple Patterning Lithography: An Iterative Approach Layout Compliane for Triple Patterning Lithography: An Iterative Approah Bei Yu, Gilda Garreton, David Z. Pan ECE Dept. University of Texas at Austin, Austin, TX, USA Orale Las, Orale Corporation, Redwood

More information

Graph-Based vs Depth-Based Data Representation for Multiview Images

Graph-Based vs Depth-Based Data Representation for Multiview Images Graph-Based vs Depth-Based Data Representation for Multiview Images Thomas Maugey, Antonio Ortega, Pasal Frossard Signal Proessing Laboratory (LTS), Eole Polytehnique Fédérale de Lausanne (EPFL) Email:

More information

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System

Algorithms, Mechanisms and Procedures for the Computer-aided Project Generation System Algorithms, Mehanisms and Proedures for the Computer-aided Projet Generation System Anton O. Butko 1*, Aleksandr P. Briukhovetskii 2, Dmitry E. Grigoriev 2# and Konstantin S. Kalashnikov 3 1 Department

More information

Stable Road Lane Model Based on Clothoids

Stable Road Lane Model Based on Clothoids Stable Road Lane Model Based on Clothoids C Gakstatter*, S Thomas**, Dr P Heinemann*, Prof Gudrun Klinker*** *Audi Eletronis Venture GmbH, **Leibniz Universität Hannover, ***Tehnishe Universität Münhen

More information

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors

Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors Eurographis Symposium on Geometry Proessing (003) L. Kobbelt, P. Shröder, H. Hoppe (Editors) Rotation Invariant Spherial Harmoni Representation of 3D Shape Desriptors Mihael Kazhdan, Thomas Funkhouser,

More information

A Unified Subdivision Scheme for Polygonal Modeling

A Unified Subdivision Scheme for Polygonal Modeling EUROGRAPHICS 2 / A. Chalmers and T.-M. Rhyne (Guest Editors) Volume 2 (2), Number 3 A Unified Subdivision Sheme for Polygonal Modeling Jérôme Maillot Jos Stam Alias Wavefront Alias Wavefront 2 King St.

More information

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry

Detecting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry Deteting Moving Targets in Clutter in Airborne SAR via Keystoning and Multiple Phase Center Interferometry D. M. Zasada, P. K. Sanyal The MITRE Corp., 6 Eletroni Parkway, Rome, NY 134 (dmzasada, psanyal)@mitre.org

More information

C 2 C 3 C 1 M S. f e. e f (3,0) (0,1) (2,0) (-1,1) (1,0) (-1,0) (1,-1) (0,-1) (-2,0) (-3,0) (0,-2)

C 2 C 3 C 1 M S. f e. e f (3,0) (0,1) (2,0) (-1,1) (1,0) (-1,0) (1,-1) (0,-1) (-2,0) (-3,0) (0,-2) SPECIAL ISSUE OF IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION: MULTI-ROBOT SSTEMS, 00 Distributed reonfiguration of hexagonal metamorphi robots Jennifer E. Walter, Jennifer L. Welh, and Nany M. Amato Abstrat

More information

Simulation of Crystallographic Texture and Anisotropie of Polycrystals during Metal Forming with Respect to Scaling Aspects

Simulation of Crystallographic Texture and Anisotropie of Polycrystals during Metal Forming with Respect to Scaling Aspects Raabe, Roters, Wang Simulation of Crystallographi Texture and Anisotropie of Polyrystals during Metal Forming with Respet to Saling Aspets D. Raabe, F. Roters, Y. Wang Max-Plank-Institut für Eisenforshung,

More information

Exploring the Commonality in Feature Modeling Notations

Exploring the Commonality in Feature Modeling Notations Exploring the Commonality in Feature Modeling Notations Miloslav ŠÍPKA Slovak University of Tehnology Faulty of Informatis and Information Tehnologies Ilkovičova 3, 842 16 Bratislava, Slovakia miloslav.sipka@gmail.om

More information

Sparse Certificates for 2-Connectivity in Directed Graphs

Sparse Certificates for 2-Connectivity in Directed Graphs Sparse Certifiates for 2-Connetivity in Direted Graphs Loukas Georgiadis Giuseppe F. Italiano Aikaterini Karanasiou Charis Papadopoulos Nikos Parotsidis Abstrat Motivated by the emergene of large-sale

More information

An Interactive-Voting Based Map Matching Algorithm

An Interactive-Voting Based Map Matching Algorithm Eleventh International Conferene on Mobile Data Management An Interative-Voting Based Map Mathing Algorithm Jing Yuan* University of Siene and Tehnology of China Hefei, China yuanjing@mail.ust.edu.n Yu

More information

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating

Capturing Large Intra-class Variations of Biometric Data by Template Co-updating Capturing Large Intra-lass Variations of Biometri Data by Template Co-updating Ajita Rattani University of Cagliari Piazza d'armi, Cagliari, Italy ajita.rattani@diee.unia.it Gian Lua Marialis University

More information

Boosted Random Forest

Boosted Random Forest Boosted Random Forest Yohei Mishina, Masamitsu suhiya and Hironobu Fujiyoshi Department of Computer Siene, Chubu University, 1200 Matsumoto-ho, Kasugai, Aihi, Japan {mishi, mtdoll}@vision.s.hubu.a.jp,

More information

Fitting conics to paracatadioptric projections of lines

Fitting conics to paracatadioptric projections of lines Computer Vision and Image Understanding 11 (6) 11 16 www.elsevier.om/loate/viu Fitting onis to paraatadioptri projetions of lines João P. Barreto *, Helder Araujo Institute for Systems and Robotis, Department

More information