A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment

Size: px
Start display at page:

Download "A Webpage Similarity Measure for Web Sessions Clustering Using Sequence Alignment"

Transcription

1 A Webpage Smlarty Measure for Web Sessons Clusterng Usng Sequence Algnment Mozhgan Azmpour-Kv School of Engneerng and Scence Sharf Unversty of Technology, Internatonal Campus Ksh Island, Iran Reza Azm Department of Computer Engneerng Alzahra Unversty Tehran, Iran Abstract Web sessons clusterng s a process of web usage mnng task that ams to group web sessons wth smlar trends and usage patterns nto clusters. Ths process s crucal for effectve webste management, web personalzaton and developng web recommender systems. Accurate clusterng of web sessons s hghly dependent to smlarty measure defned to compare web sessons. In ths paper, we propose a smlarty measure for comparng web sessons. The sequental order of web navgatons n sessons s consdered usng sequence algnment method. Furrmore, we propose to consder usage smlarty of web sessons based on tme a user spends on a webpage, and also frequency of vst of each page wthn sesson. The proposed method s valdated by clusterng a collecton of web sessons usng an agglomeratve clusterng technque and comparng results wth avalable methods. The expermental results show effectveness of proposed method to capture propertes of web sesson data. Keywords-nterestngness of webpage; webpage smalrty measure; sequence algnment; web sessons clusterng I. INTRODUCTION The World Wde Web (WWW s consdered as largest dstrbuted collecton of nformaton. By rapd growth of ths nformaton resource, t has become a dffcult task for users to acqure r desred nformaton even n a partcular webste. Hence, a need for developng technques that can facltate ths ssue has been hghlghted. Knowng users and makng profles of m can be helpful for webstes to present relevant nformaton to partcular vstors. To address ths ssue, web usage mnng has recently attracted many attentons [1]. Web usage mnng s an applcaton of data mnng whch tres to extract useful patterns from data that are obtaned from nteracton of users wth web. Deployng web mnng technques are crucal for any applcaton that ams to ease use of web such as creatng adaptve webstes, web personalzaton, web recommender systems, etc. Web usage mnng from web access log fles has three steps [1]: (1 data pre-processng; (2 pattern dscovery by applyng varous technques such as clusterng, classfcaton, assocaton dscovery, and sequental pattern dscovery to data; and (3 pattern analyss whch ams at elmnatng rrelevant patterns from dscovered patterns n prevous step. In our study, we are nterested n web sessons clusterng whch s problem of groupng web sessons wth smlar usage patterns nto groups. Any clusterng process tends to maxmze ntra-group smlarty and mnmze nter-group smlarty of cluster obects [2]. One of most challengng ssues n clusterng web sesson s how to measure smlarty of web sessons. A more precse smlarty measure can defntely be more helpful for nvestgatng nature of data. The most popular measures that are used for web sessons clusterng are Eucldean dstance, Cosne smlarty measure, and Jaccard coeffcent. We should keep n mnd that a web sesson contans a sequence of URLs accessed by a user. Therefore, a good smlarty measure should be defned so that t does not gnore sequental nature of web navgatons n sessons. On or hand, not all of URLs vsted n a sesson are equally mportant to user. In ths paper, we borrow dea of sequence algnment [3] from bonformatcs n order to fnd best match of sesson sequences. Sequence algnment s one of fundamental operatons n bonformatcs n order to capture relatonshps between DNA sequences. Smlar to each sesson whch conssts of a sequence of web pages, each DNA contans a sequence of amno acds. Consequently, technques used n DNA sequences algnment can be appled to measure smlarty of web sessons. Smlar to DNA sequence algnment, problem of computng smlarty between web sessons can be facltated by usng dynamc programmng technques [3]. In addton to sequental nature of web sessons, we consder tme a user spends on a webpage and also frequency of vstaton from a partcular webpage n a web sesson n order to estmate mportance of that page to user. Usng Slhouette coeffcent [2] of obtaned clusters as evaluaton measure, we compare our method wth avalable methods. The results show that our method s more effectve n nvestgatng smlarty of web sessons for web sessons clusterng task. The remander of ths paper s organzed as follows. In Secton 2, a revew on some avalable methods for clusterng web sessons s presented. In Secton 3, we ntroduce a new method for estmatng smlarty of sessons usng sequence algnment. The expermental results for evaluatng proposed method are presented n Secton 4. Fnally, Secton 5 concludes our study. Also, future works are presented n ths secton /11/$26.00 c 2011 IEEE

2 II. RELATED WORK Dfferent smlarty measures have been proposed for capturng web sessons smlartes. Also, varous clusterng algorthms have been ntroduced n lteraturee for groupngg web users wth common practces. Most of see works represent a sesson usng a vector defnedd over space of web pages wthn a partcularr webste: a vector dmenson corresponds to specfc URL wthn webste. Dependng on values assgned to se dmensons, dfferent user behavor analyss can be performed. The most common method s to assocate bnary values to a dmenson,.e. value 1 for pages whch user has vsted m n sesson and 0 for ors. Nasraou et al. [4] used ths representaton for web sessonss and deployed normalzed cosne of angle between vectors as smlarty of m for web sessons clusterng. Some or methods have been proposed to use feature weghts based on tme a user spends on a partcular webpage (perhaps normalzed by sze of webpage or frequency of occurrence of a URL wthn user sesson nstead of bnary weghts [5], [6]. Yan et al. [5] appled Eucldean dstance n web user clusterng task and n suggested some lnks to web users accordng to r correspondng cluster. None of mentoned methods captures sequental nature of web navgatons wthn sessons. In a newer attempt, Baneree and Ghosh [7] used relatve tme spent on longest common sub-sequence between sessons, found through dynamc programmng, as smlarty between sessons. The authors n bult an abstract smlarty graph for set of sessons and appled graph parttonng methods n order to cut abstract graph nto clusters. Wang and Zaane [8] ntroduced a new method to measure smlartes between web sessons based on sequence algnment n computatonal bology. In ths method, y frst defnedd a smlarty between web pages usng herarchcal structure of URLs n webste. Then, y utlzed dynamc programmng to fnd best match between sesson sequences. In method presented n [9], accurate vewng tme of accessed pages are consdered n addton to URL of pages for defnng smlarty of web pages. Smlar to [8], a dynamc programmng process s n appled to fnd best mach for sessons. In [10], an algorthm for Web Sesson Clusterng Based on Increase of Smlartes (WSCBIS s presented usng method proposed n [9]. Ths algorthm decreasess tme and space complexty of clusterng compared to k-means and Robust Clusterng usng lnks (ROCK [11]. Hay et al. [12] have clustered web users usng dfferent smlarty measures: Sequence Algnment Method (SAM and Assocaton measure (Eucldean dstance- s based measure. In SAM, sequental order of requests taken nto consderaton and not poston of m. The results proved that SAM retreves sequences not only wth smlar pages, but order of pages s also consdered compared to assocatve measure. The method proposed n ths paper has smlar basc dea to methods presented n [9]. Compared to ths work, we consder not only tme a user spends on a webpage, but also frequency of vstaton from a partcular webpage n order to estmate nterestngness of that page to user n a sesson. Then, we defne smlarty of web pages based on conuncton of smlarty of web pages URLs and smlarty of r nterestngness to users. Fnally, we employ sequence algnment n order to fnd best match of sessons and estmate smlarty of sessons. A. Web Page Smlarty Based on URLs Wang and Zaane [8] proposed a method to measure smlarty of dfferent web pages. Ths method does not consder content of web pages but smply paths leadng to a webpage n herarchcal structure of URLs of webste. The detal nformatonn of ths method s dscussed n [8]. Brefly, n order to measure smlarty of web pages, we frst represent each level of r URLs by a token. As a result, token strng of full path of a URL s concatenaton of all representatve tokens for each level. The token for each level s assgned based on herarchcal structure of webste. Markng tree structure of a nomnal webste s llustrated n Fg.1. In order to compute smlarty of web pages, we frst determne length of longest token strng among. Then, we gve a weght to each level of tokens from last to frst. The last level of longest token strng s gven a weght equal to 1, second to last s gven weght 2, and so on. Fnally, smlarty between token strngss (Token Sm s defned as sum of weghts of those matchng tokens dvded by sum of total weghts. The obtaned smlarty for par of web pages ranges from 0, for pages wthout any dentcal token n dentcal place, to 1, for pages that are exactly same. An example of ths process for nomnal URLs (example-webste/a/c.html and example- webste/a/b.html s presented n Fg. 2. For ths example, smlarty between token strngs s (3+2 / (3+2+1 = Fgure 1. Markng URL tree of a nomnal webste. Fgure 2. An example of token strng comparson. 21

3 B. Web Page Smlarty Based on Importance to User As mentoned, we use smlarty of web pages wth conuncton of smlarty between nterests of r users n vstng those pages. In a partcular sesson, normalzed frequency of vst of th webpage (P and tme spent on ths page can be represented usng (1 and (2 respectvely. In (1, Frequency(P s smply number of tmes webpage P has been vsted n sesson. In (2, Tme Spent on(p s dfference between exact tme of request of page P and tme of request for next webpage n sesson from access log fle. Consder that, we cannot compute ths value for last webpage requested n sesson. Here, we defne tme spent on last webpage of a sesson as average tme spent on or web pages of sesson. Consderng fact that length of a webpage (n bytes can have mpact on tme that s needed to vst that page, we have normalzed spent tme on pages by dvdng ths value by length of correspondng webpage as shown n (2. Freq (P = SpentTme( P = Frequency( P Frequncey( P P All pages n sesson Tme Spent on(p Length(P Tme Spent on(p Length(P P All pages n sesson In (2 and (3, frequency of vstaton of a webpage and spent tme on that page are normalzed by r denomnator whch s sum of se values for whole requests n sesson. These measures should be combned to descrbe nterestngness of a webpage to a user. In mamatcs, harmonc mean s one of several knds of average. In our case, we use harmonc mean of Freq and SpentTme for page P as measure of nterestngness of ths page to a user n one sesson. Ths value can be descrbed usng (3. Interest(P = 1 Freq(P SpentTme( P Fnally, we defne a measure to estmate smlarty of nterestngness of vstors from th page (P and th (P page n a sesson. Ths value can be estmated usng (4. Consder that P and P can also belong to dfferent sessons. Interest Sm(P, P = mn { Interest(P max { Interest(P, Interest(P, Interest(P } } Consderng smlarty between par of pages based on (1 (2 (3 (4 r URLs (Token Sm and r nterestngness to user (Interest Sm, we can defne smlarty of web pages as shown n (5. In ths equaton, parameter s a scale factor whch should be assocate wth a value between 0 and 1. Smlarty (P, P = Token Sm(P, P + (1 Interest Sm(P, P C. Smlarty of Web Sessons As mentoned earler, we consder each sesson as a sequence of URLs that are requested by user. For estmatng smlarty of web sessons, we apply sequence algnment method n order to fnd best match between sequences. In algnng sequences, not only characters that match dentcally are consdered, but also spaces or gaps (or conversely, nsertons n or sequence and msmatches, both of whch can correspond to mutatons. In sequence algnment, we want to fnd an optmal algnment that, loosely speakng, maxmzes number of matches and mnmzes number of spaces and msmatches. For applyng sequence algnment method, we need to defne a scorng functon whch helps fnd optmal matchng between sesson sequences. Consder that, smlarty of web pages dscussed n prevous secton plays role of a page matchng goodness functon. The scorng functon deployed n method of ths paper s as follows. For each dentcal matchng,.e. a par of pages wth smlarty 1, score s 20; for each msmatchng,.e. a par of pages wth smlarty 0, or matchng a page wth a gap, score s 10; for a par of pages wth smlarty (0,1, score for r matchng s between -10 and 20. Hence, scorng functon for smlarty between pages P and P can be calculated usng (6. Consder that, parameter s actually smlarty between pages P and P whch was calculated n prevous secton. (5 Score (P, P = , 0 1 (6 As mentoned earler, estmaton of smlarty between web sessons s calculated usng sequence algnment method n order to fnd best match between sequences. The fnal smlarty between sequences s obtaned based on r optmal matchng and length of sequences. An optmal matchng s an algnment wth hghest possble score. As mentoned earler, problem of fndng optmal matchng of sequences can be facltated usng dynamc programmng,.e. smlarty of sessons sequences can be computed by consderng contrbuton of smlarty of pages n head of each sequence and maxmum smlarty n remanng subsequence. Ths process can be descrbed usng a matrx n whch one sequence (sesson s placed along top and or sequence (sesson s placed along left sde of matrx. An example of such matrx s llustrated n Fg. 3. In ths fgure, each webpage n a sesson s shown usng correspondng token to each level of webpage URL n structure of webste. Also, tme user has spent on each webpage s shown n parenses for each webpage. 22

4 Fgure 3. An example of sesson matchng matrx. In order to calculate optmal matchng usng sequence algnment matrx, a gap s added to start of each sequence whch ndcates startng pont of matchng. The goal s to fnd an optmal path from top left corner to bottom rght corner of matrx. In each step, we can only have a rght, down or dagonal move. A rght move corresponds to nsertng a gap to sequence n left and matchng sequence on top wth a gap, whle a down move corresponds to nsertng a gap to sequence on top and matchng sequence on left wth a gap. In each step, score for each three moves s calculated and maxmum of m s added to current score, whch had been obtaned from prevous moves. In or words, drecton whch provdes maxmumm score s chosen n each step. The optmal path s n acheved through back propagatng from bottom rght corner to startng pont. In gven example, optmal path found through back propagatng s shown by arrows n Fg. 3. The score thatt s put n lower rght corner s optmal sequence algnment score. In our scorng system, optmal score cannot be bgger than length of shorter sesson multpled by 20. Also, t cannot be smaller than length of longer sesson multpled by -10. Therefore, fnal smlarty measure can be calculated by normalzng optmal score wth respect to se maxm and mnma. In case of our example, smlarty of web sessons s [69.17 ( 10 6] / [(20 5 ( 10 6] = Usng ths defnton, smlarty value for web sesson wll always be between 0 and 1. III. EXPERIMENTAL EVALUATION Many attempts have been made to evaluate clusterng goodness and to fnd rules to quantfy qualty of a clusterng result. Cluster valdaton for large datasets of categorcal data such as web sesson data s a very hard task. Prevous works that proposed to use sequence algnment method to cluster web sesson data, valdates r expermental results manually rar than quanttatvely [8], [9]. Consder that, due to large number of web pages n a typcal webste, usually a large number of web sessons are needed to fully represent possble usage patterns over that webste. However, as sequence algnment method s a tme consumng process, dealng wth a large dataset ncreases processng tme for constructng clusters. The emprcal evaluaton reported n ths paper concerns queston wher proposed method, whch consdererss both smlarty of web pages based on r URLs and usage smlarty of m n order to defne a scorng functon for sequence algnment method, can properly reflect nature of sesson data. Web usage data that are used for ths experment are collected from Musc Machne webste by Perkowtz and Etzon [13]. The orgnal data, used n our experment, contans requests from log fles of web server for fve random days n After preprocessng task on orgnal data usng methods descrbedd n [14], 2664 web sessons were extracted. We have compared our method to one presented n [9], whch we refer to as extended Sequence Algnment (SA method. As descrbed earler, n extended SA method, only accurate vewng tme of accessed pages are taken nto consderatonn for defnng usage smlarty between web sessons. Ths method s clamed to perform better n revealng nature of data compared to normal SA method presented n [8]. Also, t has been proved that SA method can effectvely reflect sequental nature of web navgatons n web sessons clusterng compared to or smlarty measures such as Jaccard coeffcent [8]. To compare our method wth extended SA, dstance matrces holdng parwse sequence algnment dstance measures between web sessons are obtaned usng our proposed method and extended SA. Consderng fact that smlarty measures obtaned from both methods always range from 0 to 1, dstance between sessons can be calculated by subtractng smlarty of web sessons from 1. Fnally, obtaned dstance matrces can be used for clusterng web sessons. For our experment, we have appled agglomeratve clusterng usng average lnkage method [1] to construct a lnkage tree. The average lnkage method s not very susceptble to nose and outlers n nput data. After constructng lnkage tree, we cut off tree n order to generate desred number of clusters from sesson data. In ths experment parameter used n (5 s set to 0.7. For evaluatng effectveness of smlarty measure n ths paper, average Slhouette Coeffcent (SC [ 2] s calculated for web sessons data n r clusters. The Slhouette coeffcent consders both coheson and separaton of data ponts for evaluatng clusters. The Slhouette coeffcent value for th web sesson (s can be calculated usng (7. b a s = max(a, b In (7, a s average dstance of th sessons from or sessons n ts correspondng cluster. The parameterr b s mnmum of average dstances from th sesson to sessons n or clusters. The value of Slhouette coeffcent can vary from -1 to 1; closer value to 1, better clusterng result. The average Slhouette coeffcent values for dfferent number of clusters (range from 2 to 25 for dstance matrces are shown n Fg. 4. (7 23

5 Average Slhouette Coeffcent Number of Clusters Proposed Method Extended SA Method Fgure 4. The comparson of average SC values for dfferent number of clusters, generated by usng parwse dstance measure matrces of proposed method and extended SA method. As we can see n Fg. 4, average SC values for clusters generated by dstance measure of proposed method has hgher values compared to SC values of clusters generated by dstance measure of extended SA method. Ths result can show our approach can properly reflect nature of sesson data snce, not only we have consdered sequental nature of web sessons, but also we have taken nto account smlarty of usage patterns for comparng web sessons. As we can see n Fg. 4, average SC value reduces by ncreasng number of clusters. The reason s that, by ncreasng number of clusters we cut off lnkage tree n lower values of dstance between clusterng obects. The result would be smaller clusters whch may have hgher nter-cluster smlartes. IV. CONCLUSIONS AND FUTURE WORK As descrbed n ths paper, web sesson clusterng s an mportant task to group web sessons wth smlar trends. Ths s an essental process for effectve webste management, web personalzaton, and web recommender systems. Accurate clusterng of web sessons s hghly dependent to smlarty (or dssmlarty measure defned to compare web sessons. In ths paper, we proposed a new smlarty measure for web sessons clusterng. We consdered tme a user spends on a webpage and also frequency of vstaton from a partcular webpage wthn a sesson n order to estmate nterestngness of that page to user. Then, we defned smlarty of web pages wthn sessons based on conuncton of smlarty of web pages and smlarty of r nterestngness to users. Fnally, we employed sequence algnment method n order to fnd best match of sesson sequences and estmate smlarty of m. We compared our method wth a case n whch only spent tme on a webpage was consdered for defnng usage pattern smlarty. The evaluaton was performed by measurng average Slhouette coeffcent of sessons whch were clustered usng agglomeratve clusterng wth average lnkage method. Expermental results verfy effectveness of our method. However, tme complexty of SA methods s stll hgh. As descrbed, we estmated smlarty of web pages based on smlarty of r herarchcal structure of URLs whle gnorng content of web pages. To have a better estmaton of smlarty of web pages, we can use or methods proposed for web content mnng such as Informaton Retreval or semantc web approaches. Furrmore, for havng a more general evaluaton, we can use a larger collecton of web sessons data and apply dfferent clusterng algorthms on se data. REFERENCES [1] T. Hussan, S. Asghar, and S. Fong, A herarchcal cluster based preprocessng methodology for Web Usage Mnng, n 6th Internatonal Conference on Advanced Informaton Management and Servce (IMS, 2010, pp [2] P. J. Rousseeuw, Slhouettes: A graphcal ad to nterpretaton and valdaton of cluster analyss, Journal of Computatonal and Appled Mamatcs, vol. 20, pp , Nov [3] K. Charter, J. Schaeffer, and D. Szafron, Sequence algnment usng FastLSA, n Internatonal Conference on Mamatcs and Engneerng Technques n Medcne and Bologcal Scences (METMBS, 2000, p [4] O. Nasraou, C. C. Urbe, C. R. Coronel, and F. Gonzalez, TECNO- STREAMS: trackng evolvng clusters n nosy data streams wth a scalable mmune system learnng model, n Thrd IEEE Internatonal Conference on Data Mnng, ICDM, 2003, pp [5] T. W. Yan, M. Jacobsen, H. Garca-Molna, and U. Dayal, From user access patterns to dynamc hypertext lnkng, Computer Nerks and ISDN Systems, vol. 28, no. 7-11, pp , May [6] R. Forsat, M. R. Meybod, and A. Rahbar, An effcent algorthm for web recommendaton systems, n IEEE/ACS Internatonal Conference on Computer Systems and Applcatons, Los Alamtos, CA, USA, 2009, vol. 0, pp [7] A. Baneree and J. Ghosh, Clckstream clusterng usng weghted longest common subsequences, n Proceedngs of Web Mnng Workshop at 1st SIAM Conference on Data Mnng, 2001, p [8] Wenan Wang and O. R. Zaane, Clusterng Web sessons by sequence algnment, n In Proceedngs 13th Internatonal Workshop on Database and Expert Systems Applcatons, 2002, pp [9] Chaofeng L and Yansheng Lu, Smlarty Measurement of Web Sessons by Sequence Algnment, n IFIP Internatonal Conference on Nerk and Parallel Computng Workshops, NPC Workshops, 2007, pp [10] C. L, Research on Web Sesson Clusterng, Journal of Software, vol. 4, no. 5, Jul [11] S. Guha, R. Rastog, and K. Shm, Rock: A robust clusterng algorthm for categorcal attrbutes, Informaton Systems, vol. 25, no. 5, p , [12] B. Hay, G. Wets, and K. Vanhoof, Segmentaton of vstng patterns on web stes usng a sequence algnment method, Journal of Retalng and Consumer Servces, vol. 10, no. 3, pp , May [13] M. Perkowtz and O. Etzon, Adaptve stes: Automatcally learnng from user access patterns, n Proceedngs of 6th Internatonal World Wde Web Conference, Santa Clara, Calforna, [14] V. Sathyamoorth and V. M. Bhaskaran, Data Preparaton Technques for Web Usage Mnng n World Wde Web-An Approach, Internatonal Journal of Recent Trends n Engneerng, vol. 2, no. 4,

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15 CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

An Internal Clustering Validation Index for Boolean Data

An Internal Clustering Validation Index for Boolean Data BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 16, No 6 Specal ssue wth selecton of extended papers from 6th Internatonal Conference on Logstc, Informatcs and Servce Scence

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Supervsed vs. Unsupervsed Learnng Up to now we consdered supervsed learnng scenaro, where we are gven 1. samples 1,, n 2. class labels for all samples 1,, n Ths s also

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Hierarchical clustering for gene expression data analysis

Hierarchical clustering for gene expression data analysis Herarchcal clusterng for gene expresson data analyss Gorgo Valentn e-mal: valentn@ds.unm.t Clusterng of Mcroarray Data. Clusterng of gene expresson profles (rows) => dscovery of co-regulated and functonally

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals

LinkSelector: A Web Mining Approach to. Hyperlink Selection for Web Portals nkselector: A Web Mnng Approach to Hyperlnk Selecton for Web Portals Xao Fang and Olva R. u Sheng Department of Management Informaton Systems Unversty of Arzona, AZ 8572 {xfang,sheng}@bpa.arzona.edu Submtted

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Effective Page Recommendation Algorithms Based on. Distributed Learning Automata and Weighted Association. Rules

Effective Page Recommendation Algorithms Based on. Distributed Learning Automata and Weighted Association. Rules Effectve Page Recommendaton Algorthms Based on Dstrbuted Learnng Automata and Weghted Assocaton Rules R. Forsat 1*, M. R. Meybod 2 1 Department of Computer Engneerng, Islamc Azad Unversty, Karaj Branch,

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique

The Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique //00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy

More information

A Compressing Method for Genome Sequence Cluster using Sequence Alignment

A Compressing Method for Genome Sequence Cluster using Sequence Alignment A Compressng Method for Genome Sequence Cluster usng Sequence Algnment Kwang Su Jung 1, Nam Hee Yu 1, Seung Jung Shn 2, Keun Ho Ryu 1 1 Database/Bonformatcs Laboratory, Chungbuk Natonal Unversty, Korea

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Graph-based Clustering

Graph-based Clustering Graphbased Clusterng Transform the data nto a graph representaton ertces are the data ponts to be clustered Edges are eghted based on smlarty beteen data ponts Graph parttonng Þ Each connected component

More information

Module Management Tool in Software Development Organizations

Module Management Tool in Software Development Organizations Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,

More information

Parallel matrix-vector multiplication

Parallel matrix-vector multiplication Appendx A Parallel matrx-vector multplcaton The reduced transton matrx of the three-dmensonal cage model for gel electrophoress, descrbed n secton 3.2, becomes excessvely large for polymer lengths more

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems

Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems Determnng Fuzzy Sets for Quanttatve Attrbutes n Data Mnng Problems ATTILA GYENESEI Turku Centre for Computer Scence (TUCS) Unversty of Turku, Department of Computer Scence Lemmnkäsenkatu 4A, FIN-5 Turku

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem

An Efficient Genetic Algorithm with Fuzzy c-means Clustering for Traveling Salesman Problem An Effcent Genetc Algorthm wth Fuzzy c-means Clusterng for Travelng Salesman Problem Jong-Won Yoon and Sung-Bae Cho Dept. of Computer Scence Yonse Unversty Seoul, Korea jwyoon@sclab.yonse.ac.r, sbcho@cs.yonse.ac.r

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search

Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search Can We Beat the Prefx Flterng? An Adaptve Framework for Smlarty Jon and Search Jannan Wang Guolang L Janhua Feng Department of Computer Scence and Technology, Tsnghua Natonal Laboratory for Informaton

More information

Related-Mode Attacks on CTR Encryption Mode

Related-Mode Attacks on CTR Encryption Mode Internatonal Journal of Network Securty, Vol.4, No.3, PP.282 287, May 2007 282 Related-Mode Attacks on CTR Encrypton Mode Dayn Wang, Dongda Ln, and Wenlng Wu (Correspondng author: Dayn Wang) Key Laboratory

More information

Document Representation and Clustering with WordNet Based Similarity Rough Set Model

Document Representation and Clustering with WordNet Based Similarity Rough Set Model IJCSI Internatonal Journal of Computer Scence Issues, Vol. 8, Issue 5, No 3, September 20 ISSN (Onlne): 694-084 www.ijcsi.org Document Representaton and Clusterng wth WordNet Based Smlarty Rough Set Model

More information

From Comparing Clusterings to Combining Clusterings

From Comparing Clusterings to Combining Clusterings Proceedngs of the Twenty-Thrd AAAI Conference on Artfcal Intellgence (008 From Comparng Clusterngs to Combnng Clusterngs Zhwu Lu and Yuxn Peng and Janguo Xao Insttute of Computer Scence and Technology,

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.

Complex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following. Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal

More information

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines

Keywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines (IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak

More information

A new selection strategy for selective cluster ensemble based on Diversity and Independency

A new selection strategy for selective cluster ensemble based on Diversity and Independency A new selecton strategy for selectve cluster ensemble based on Dversty and Independency Muhammad Yousefnezhad a, Al Rehanan b, Daoqang Zhang a and Behrouz Mnae-Bdgol c a Department of Computer Scence,

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION

A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION A CALCULATION METHOD OF DEEP WEB ENTITIES RECOGNITION 1 FENG YONG, DANG XIAO-WAN, 3 XU HONG-YAN School of Informaton, Laonng Unversty, Shenyang Laonng E-mal: 1 fyxuhy@163.com, dangxaowan@163.com, 3 xuhongyan_lndx@163.com

More information

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity

Corner-Based Image Alignment using Pyramid Structure with Gradient Vector Similarity Journal of Sgnal and Informaton Processng, 013, 4, 114-119 do:10.436/jsp.013.43b00 Publshed Onlne August 013 (http://www.scrp.org/journal/jsp) Corner-Based Image Algnment usng Pyramd Structure wth Gradent

More information

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram

Shape Representation Robust to the Sketching Order Using Distance Map and Direction Histogram Shape Representaton Robust to the Sketchng Order Usng Dstance Map and Drecton Hstogram Department of Computer Scence Yonse Unversty Kwon Yun CONTENTS Revew Topc Proposed Method System Overvew Sketch Normalzaton

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES

A SYSTOLIC APPROACH TO LOOP PARTITIONING AND MAPPING INTO FIXED SIZE DISTRIBUTED MEMORY ARCHITECTURES A SYSOLIC APPROACH O LOOP PARIIONING AND MAPPING INO FIXED SIZE DISRIBUED MEMORY ARCHIECURES Ioanns Drosts, Nektaros Kozrs, George Papakonstantnou and Panayots sanakas Natonal echncal Unversty of Athens

More information

On Some Entertaining Applications of the Concept of Set in Computer Science Course

On Some Entertaining Applications of the Concept of Set in Computer Science Course On Some Entertanng Applcatons of the Concept of Set n Computer Scence Course Krasmr Yordzhev *, Hrstna Kostadnova ** * Assocate Professor Krasmr Yordzhev, Ph.D., Faculty of Mathematcs and Natural Scences,

More information

Study of Data Stream Clustering Based on Bio-inspired Model

Study of Data Stream Clustering Based on Bio-inspired Model , pp.412-418 http://dx.do.org/10.14257/astl.2014.53.86 Study of Data Stream lusterng Based on Bo-nspred Model Yngme L, Mn L, Jngbo Shao, Gaoyang Wang ollege of omputer Scence and Informaton Engneerng,

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming

Kent State University CS 4/ Design and Analysis of Algorithms. Dept. of Math & Computer Science LECT-16. Dynamic Programming CS 4/560 Desgn and Analyss of Algorthms Kent State Unversty Dept. of Math & Computer Scence LECT-6 Dynamc Programmng 2 Dynamc Programmng Dynamc Programmng, lke the dvde-and-conquer method, solves problems

More information

Face Recognition using 3D Directional Corner Points

Face Recognition using 3D Directional Corner Points 2014 22nd Internatonal Conference on Pattern Recognton Face Recognton usng 3D Drectonal Corner Ponts Xun Yu, Yongsheng Gao School of Engneerng Grffth Unversty Nathan, QLD, Australa xun.yu@grffthun.edu.au,

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Recommendations of Personal Web Pages Based on User Navigational Patterns

Recommendations of Personal Web Pages Based on User Navigational Patterns nternatonal Journal of Machne Learnng and Computng, Vol. 4, No. 4, August 2014 Recommendatons of Personal Web Pages Based on User Navgatonal Patterns Yn-Fu Huang and Ja-ang Jhang Abstract n ths paper,

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Combining Multiple Resources, Evidence and Criteria for Genomic Information Retrieval

Combining Multiple Resources, Evidence and Criteria for Genomic Information Retrieval Combnng Multple Resources, Evdence and Crtera for Genomc Informaton Retreval Luo S 1, Je Lu 2 and Jame Callan 2 1 Department of Computer Scence, Purdue Unversty, West Lafayette, IN 47907, USA ls@cs.purdue.edu

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

A Robust Method for Estimating the Fundamental Matrix

A Robust Method for Estimating the Fundamental Matrix Proc. VIIth Dgtal Image Computng: Technques and Applcatons, Sun C., Talbot H., Ourseln S. and Adraansen T. (Eds.), 0- Dec. 003, Sydney A Robust Method for Estmatng the Fundamental Matrx C.L. Feng and Y.S.

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introducton to Bonformatcs Sequence Algnment Luke Huan Electrcal Engneerng and Computer Scence http://people.eecs.ku.edu/~huan/ HMM Π s a set of states Transton Probabltes a kl Pr( l 1 k Probablty

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STRENGTH MATRIX

HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STRENGTH MATRIX HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STRENGTH MATRIX P.Shanmugavadvu 1, P.Sumathy 2, A.Vadvel 3 12 Department of Computer Scence and Applcatons, Gandhgram Rural Insttute,

More information

Clustering is a discovery process in data mining.

Clustering is a discovery process in data mining. Cover Feature Chameleon: Herarchcal Clusterng Usng Dynamc Modelng Many advanced algorthms have dffculty dealng wth hghly varable clusters that do not follow a preconceved model. By basng ts selectons on

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Semantic Image Retrieval Using Region Based Inverted File

Semantic Image Retrieval Using Region Based Inverted File Semantc Image Retreval Usng Regon Based Inverted Fle Dengsheng Zhang, Md Monrul Islam, Guoun Lu and Jn Hou 2 Gppsland School of Informaton Technology, Monash Unversty Churchll, VIC 3842, Australa E-mal:

More information

The Effect of Similarity Measures on The Quality of Query Clusters

The Effect of Similarity Measures on The Quality of Query Clusters The effect of smlarty measures on the qualty of query clusters. Fu. L., Goh, D.H., Foo, S., & Na, J.C. (2004). Journal of Informaton Scence, 30(5) 396-407 The Effect of Smlarty Measures on The Qualty of

More information

Relevance Feedback for Image Retrieval

Relevance Feedback for Image Retrieval Vashal D Dhale et al, / (IJCSIT Internatonal Journal of Computer Scence and Informaton Technologes, Vol 4 (2, 203, 39-323 Relevance Feedback for Image Retreval Vashal D Dhale, Dr A R Mahaan, Prof Uma Thakur

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

Optimal Workload-based Weighted Wavelet Synopses

Optimal Workload-based Weighted Wavelet Synopses Optmal Workload-based Weghted Wavelet Synopses Yoss Matas School of Computer Scence Tel Avv Unversty Tel Avv 69978, Israel matas@tau.ac.l Danel Urel School of Computer Scence Tel Avv Unversty Tel Avv 69978,

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE

ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Інформаційні технології в освіті ON SOME ENTERTAINING APPLICATIONS OF THE CONCEPT OF SET IN COMPUTER SCIENCE COURSE Yordzhev K., Kostadnova H. Some aspects of programmng educaton

More information

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 5) Maxmum Varance Combned wth Adaptve Genetc Algorthm for Infrared Image Segmentaton Huxuan Fu College of Automaton Harbn

More information

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search

Sequential search. Building Java Programs Chapter 13. Sequential search. Sequential search Sequental search Buldng Java Programs Chapter 13 Searchng and Sortng sequental search: Locates a target value n an array/lst by examnng each element from start to fnsh. How many elements wll t need to

More information

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN

MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS XUNYU PAN MOTION PANORAMA CONSTRUCTION FROM STREAMING VIDEO FOR POWER- CONSTRAINED MOBILE MULTIMEDIA ENVIRONMENTS by XUNYU PAN (Under the Drecton of Suchendra M. Bhandarkar) ABSTRACT In modern tmes, more and more

More information

Analyzing Popular Clustering Algorithms from Different Viewpoints

Analyzing Popular Clustering Algorithms from Different Viewpoints 1000-9825/2002/13(08)1382-13 2002 Journal of Software Vol.13, No.8 Analyzng Popular Clusterng Algorthms from Dfferent Vewponts QIAN We-nng, ZHOU Ao-yng (Department of Computer Scence, Fudan Unversty, Shangha

More information

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering

BRDPHHC: A Balance RDF Data Partitioning Algorithm based on Hybrid Hierarchical Clustering 015 IEEE 17th Internatonal Conference on Hgh Performance Computng and Communcatons (HPCC), 015 IEEE 7th Internatonal Symposum on Cyberspace Safety and Securty (CSS), and 015 IEEE 1th Internatonal Conf

More information

Personalized Concept-Based Clustering of Search Engine Queries

Personalized Concept-Based Clustering of Search Engine Queries IEEE TRANSACTIONS ON JOURNAL NAME, MANUSCRIPT ID 1 Personalzed Concept-Based Clusterng of Search Engne Queres Kenneth Wa-Tng Leung, Wlfred Ng, and Dk Lun Lee Abstract The exponental growth of nformaton

More information