A Web Site Classification Approach Based On Its Topological Structure


International Journal on Asian Language Processing 20 (2)

Ji-bin Zhang, Zhi-ming Xu, Kun-li Xu, Qi-shu Pan
School of Computer Science and Technology, Harbin Institute of Technology
No.92, West Da-Zhi Street, Nangang District, Harbin, China
zjbxgg@hit.edu.cn, xuzm@hit.edu.cn

Abstract: Automatic web site classification has wide application prospects; however, there has been little research on it. Unlike pure texts, web sites are combinations of a large number of web pages connected via hyperlinks, so text classification methods are not suitable for classifying them directly. This paper proposes a web site classification approach based on the site's topological structure. Given a web site, firstly we represent its topological structure as a directed graph, from which we extract a strongly connected sub-graph that includes the site's home page. Secondly, we use an improved PageRank algorithm on the sub-graph to select some topic-relevant resources, and represent them as a topic vector of the site. Finally, we use an SVM classifier to classify the site in terms of its topic vector. Experimental results show that our approach achieves better performance than the traditional super page-based web site classification approach.

Key words: web site classification, topological structure of web site, hyperlink analysis, topic vector of web site

We thank all students of the Natural Language Processing Research Center at Harbin Institute of Technology for discussing some issues about this paper. Research for this article was supported by the National Natural Science Foundation of China.

1 Introduction

With the rapid development of the Internet, the amount of information on the network is growing explosively. According to statistics released by Google, Google has indexed over one trillion web pages, and this figure is still increasing rapidly every day. The Internet has already become the most important source of information and knowledge in scientific research, education and other fields. Because this information is massive, variable and non-semantic, it is not easy for people to find what they want quickly and accurately. How to find the information we need in such a huge source has become an important research problem.

At present, two kinds of services help us retrieve information on the Internet: search engines like Google, and directory services like Yahoo! and DMOZ. Search engines usually return web pages matched with queries. However, people sometimes need to find web sites related to a certain subject. For example, when people want to buy something, they will try to find retailers' web sites instead of web pages which only contain descriptions of commodities. Directory services supply a navigation mechanism for web sites by collecting a number of web sites and manually classifying them into different directories, but maintaining them takes a lot of manual editorial work. Web information navigation technology, especially automatic classification of web information, is therefore becoming a research focus.

Considering that automatic web site classification is significant for maintaining directory services, this paper mainly studies automatic web site classification approaches. Because a web site is a combination of a large number of web pages connected via hyperlinks, and so has richer structure information than a single web page, text classification approaches are not suitable for classifying it directly. This paper proposes a web site classification approach based on the site's topological structure. Given a web site, firstly we represent its topological structure as a directed graph, from which we extract a strongly connected sub-graph that includes the site's home page; secondly, we use an improved PageRank algorithm on the sub-graph to select some topic-relevant resources, and represent them as a topic vector of the site; finally, we use an SVM classifier to classify the site in terms of its topic vector.

The rest of this paper is organized as follows. Section 2 summarizes previous research on web page and web site classification. Section 3 describes our web site classification approach. In Section 4 we conduct some experiments to test the approach, and the conclusion is given in the last section.

2 Related work

Automatic classification of web pages has been studied for a long time, and text classification algorithms like Naïve Bayes (McCallum 1998; Mitchell 1996), KNN (Lam 1998; Masand 1992) and SVM (Joachims 1998; Kwok 1998) have been applied successfully. Apart from the content of web pages, Chakrabarti (Chakrabarti 1998) and Craven (Craven 1999) improved the accuracy of web page classification by introducing hyperlink analysis.

However, there is little research on web site classification. The difficulty is that a web site consists of many pages and each page has its own topic, so a site's topic cannot be reflected by a single web page. A well-known web site classification method is the super page-based method (Ester 2002), which represents a web site as a single virtual web page formed by combining all its pages; Pierre (Pierre 2001) improved it by introducing web pages' meta data, such as title and keywords. Terveen (Terveen 1999) represented a web site as a directed graph and combined content and hyperlink analysis to classify it. Ester (Ester 2002; Ester 2004) gave an empirical study of web site classification and proposed several solutions; on the basis of Ester's research, Kriegel (Kriegel 2004) introduced a method that represents a web site as a topic-frequency vector. In addition, YongHong Tian (Tian 2004) used a multi-scale tree model to represent a web site, De-yu Fu proposed a key resource-based web site classification method (Fu 2006), and Bao-li Dong employed a hybrid vector space model to recognize the subject of web sites (Dong 2005).

3 Web site classification approach based on its topological structure

In this section, we discuss our web site classification approach based on the site's topological structure. The approach consists of several phases: represent the web site's topological structure as a directed graph, extract topic-relevant resources from the site's topological graph, represent the extracted topic-relevant resources as a topic vector, and use the topic vector to classify the site.

3.1 Representing a web site's topological structure as a directed graph

In this section, we represent the topological structure of a web site as a directed graph. Some definitions about the directed graph are given as follows:

Definition 1. Directed graph: A directed graph is an ordered pair $D = \langle V, E \rangle$, where D represents the topological structure of a web site. V is a set of vertices, and each vertex of V is a page; E is a set of directed edges, which is a subset of $V \times V$, and each directed edge $e = (u, v)$ is a hyperlink from page u to page v.

Definition 2. Degree: Let $D = \langle V, E \rangle$ be a directed graph. For each vertex $v \in V$, the number of edges leaving v is defined as the out-degree of v, and the number of edges entering v is defined as the in-degree of v. The sum of the in-degree and out-degree of v is defined as the degree of v.

Definition 3. Path: A path in a directed graph $D = \langle V, E \rangle$ from $v_m$ to $v_n$ is defined as a sequence of vertices $\{v_m, v_{m1}, v_{m2}, \ldots, v_n\}$ in which every consecutive pair is an edge: $(v_m, v_{m1}), (v_{m1}, v_{m2}), \ldots, (v_{mk}, v_n)$.

Definition 4. Sub-graph: Given two directed graphs $D = \langle V, E \rangle$ and $D' = \langle V', E' \rangle$, $D'$ is called a sub-graph of D if $V' \subseteq V$ and $E' \subseteq E$.

Definition 5. Strongly connected graph: A directed graph $D = \langle V, E \rangle$ is called a strongly connected graph if there is a path from every vertex to every other vertex.

Fig.1 is an example of a web site represented as a directed graph. If page A can reach page B via inner hyperlinks, then there is a path from A to B. If every pair of pages in the graph is connected by paths in both directions, we call the graph strongly connected.

[Fig.1 A topological graph of a web site]

3.2 Extracting topic-related resources from a web site's topological graph

After the topological graph of a web site is built, we focus on extracting topic-related resources from it. According to the literature, a web site's home page may be the page most relevant to the site's topic (Dong 2005), and pages of a site with the same topic usually have a compact link structure (Lu 2006). In addition, web site designers generally intend outgoing-linked pages to be topic-relevant to the current page, so we can assume that a pair of pages in a site is topic-relevant if there is a hyperlink between them (Ester 2004). From this assumption, we can infer that a pair of pages in a site should be topic-relevant if there is a path between them.
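The paper does not say how the strongly connected sub-graph around the home page is computed. As a minimal sketch, assuming Definition 5 is read as mutual reachability, the sub-graph's vertex set can be obtained by intersecting the pages reachable from the home page with the pages that can reach it. The function names and the toy edge list below are hypothetical.

```python
from collections import deque


def reachable(adj, start):
    """Return every vertex reachable from `start` by following edges in `adj`."""
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen


def home_component(edges, home):
    """Vertices of the strongly connected sub-graph that contains `home`."""
    forward, backward = {}, {}
    for u, v in edges:
        forward.setdefault(u, set()).add(v)   # hyperlink u -> v
        backward.setdefault(v, set()).add(u)  # same link, reversed
    # A page belongs to the component iff home reaches it AND it reaches home.
    return reachable(forward, home) & reachable(backward, home)


# Hypothetical site: "c" has no path back to home, "d" is never linked from home.
edges = [("home", "a"), ("a", "b"), ("b", "home"), ("a", "c"), ("d", "home")]
component = home_component(edges, "home")
print(sorted(component))  # ['a', 'b', 'home']
```

Two breadth-first searches are enough here because only the component containing one known vertex is needed; a full strongly-connected-components algorithm would also work but computes more than this step requires.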

Considering all of the above, we think that a site's topic-relevant resources should be located on a strongly connected sub-graph that includes the site's home page, on which we can use hyperlink analysis to select important topic-relevant resources. The PageRank algorithm (Page 1999) is often used to compute the importance of web pages; it regards the entire web as a directed graph and ranks pages through hyperlink analysis. This paper uses an improved PageRank algorithm on a site's sub-graph to rank pages and select important topic-relevant resources from it.

3.2.1 Improved PageRank Algorithm

PageRank is the earliest and the most successful hyperlink analysis algorithm applied in commercial search engines. It interprets a hyperlink from page A to page B as a vote, by page A, for page B. If the pages that cast votes are important, they make the pages they vote for important. A simplified version of PageRank defined by Larry Page is as follows:

$$PR(s) = \sum_{i=1}^{N} \frac{PR(P_i)}{C(P_i)} \qquad (1)$$

where s is a page, $PR(x)$ is the rank score of page x, N is the in-degree of s, $P_i$ is a page linked to s, and $C(P_i)$ is the out-degree of page $P_i$. In formula (1), the rank score of page $P_i$ is divided by its out-degree, so each page linked from $P_i$ receives the same share of its rank score.

There is a small problem with formula (1). Assume that two or more pages link to each other but to no other pages, and that some hyperlink points to one of them; after some iterations, rank scores accumulate in these pages but are never distributed out of them. This scenario is called rank sinking. To solve the rank sinking problem, Page modified the original PageRank formula as follows:

$$PR(s) = (1 - d) + d \sum_{i=1}^{N} \frac{PR(P_i)}{C(P_i)} \qquad (2)$$

where d is usually set to 0.85; d is the probability that users continue to view pages linked from the current page s, and $(1 - d)$ is the probability that users leave the current page s and skip to other web pages. In the PageRank algorithm, each page s distributes its rank score evenly among the pages linked from s.
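To make formula (2) concrete, the following is a minimal sketch of an iterative computation. The function name, the fixed iteration count, the initial score of 1.0 per page and the three-page graph are assumptions for illustration; the paper does not specify a convergence criterion.

```python
def pagerank(out_links, d=0.85, iterations=50):
    """Iterate formula (2): PR(s) = (1 - d) + d * sum(PR(P_i) / C(P_i)).

    out_links maps each page to the list of pages it links to.
    """
    pages = set(out_links)
    for targets in out_links.values():
        pages.update(targets)
    pr = {p: 1.0 for p in pages}  # assumed starting score
    for _ in range(iterations):
        new = {p: 1.0 - d for p in pages}
        for p, targets in out_links.items():
            if not targets:
                continue  # a page without out-links distributes nothing
            share = d * pr[p] / len(targets)  # even split over C(p) out-links
            for s in targets:
                new[s] += share
        pr = new
    return pr


# Hypothetical strongly connected graph: A -> B, A -> C, B -> C, C -> A.
ranks = pagerank({"A": ["B", "C"], "B": ["C"], "C": ["A"]})
for page in sorted(ranks):
    print(page, round(ranks[page], 4))
```

On a strongly connected graph such as the one extracted around the home page, every page has at least one out-link, so the `continue` branch is never taken and the scores keep a constant total equal to the number of pages.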
However, this even distribution of rank scores among pages does not suit web site classification. For a web site, we aim to select topic-relevant resources from its sub-graph, so we consider that rank scores should be distributed according to page similarity. If a page A is more similar to the pages that link to A, A gets a larger share of their rank scores; otherwise it gets a smaller share. Here, we use an improved PageRank formula to distribute rank scores among pages, which is shown as follows (Yuan 2007):

$$PR(s) = (1 - d) + d \sum_{i=1}^{N} PR(P_i) \, \frac{sim(P_i, s)}{\sum_{j=1}^{M} sim(P_i, Q_j)} \qquad (3)$$

where $sim(P_i, s)$ is the page similarity, $Q_j$ is a page linked from $P_i$, and M is the number of pages linked from $P_i$. Fig.2 is an example of the improved PageRank formula. Assume that page A has two hyperlinks, to page B and page C respectively, that PR(A) is 1, sim(A, B) is 0.8, and sim(A, C) is 0.4. The rank score distributed from A to B is 1*0.8/(0.8+0.4) = 0.6666, and C gets 1*0.4/(0.8+0.4) = 0.3333.

[Fig.2 The improved PageRank formula: page A links to page B with sim(A,B)=0.8 and to page C with sim(A,C)=0.4]

3.2.2 Computation of link-based page similarity

The improved PageRank formula uses page similarity to distribute rank scores among pages. In general, the similarity between pages can be computed from their contents (Wang 2003). Considering the computation cost of content-based page similarity, we instead use link-based page similarity, which analyzes only the hyperlinks among pages and not their contents. In Fig.2, page A has two hyperlinks, to page B and page C respectively. According to Literature (Ester 2004), if B and C are both linked to or from the same page, they may have the same topic. The more pages link to or from both B and C, the more topic-relevant, in other words the more similar, they are.

Following Literature (Wang 2003), we number all the pages in a site as $\{1, 2, 3, \ldots, n\}$; for each page s, we construct two vectors, $V_s^{out}$ and $V_s^{in}$. If the i-th page has a hyperlink to s, then the i-th item of $V_s^{in}$ is 1, otherwise it is 0. Similarly, if s has a hyperlink to the i-th page, the i-th item of $V_s^{out}$ is 1, otherwise it is 0. Based on these vectors, Literature (Wang 2003) defines the in-link similarity, out-link similarity and link-based similarity between page A and page B, given as formulas (4), (5) and (6).
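The arithmetic of the Fig.2 example and the update in formula (3) can be sketched as follows. This is an illustrative reading, not the authors' implementation: the function names, the even-split fallback when all similarities are zero, the fixed iteration count and the similarity values in the usage example are assumptions.

```python
def distribute(rank, sims):
    """Split `rank` among out-linked pages in proportion to similarity.

    sims maps each out-linked page Q_j to sim(P, Q_j).
    """
    if not sims:
        return {}
    total = sum(sims.values())
    if total == 0:
        # Assumed fallback (not covered by the text): split evenly.
        return {q: rank / len(sims) for q in sims}
    return {q: rank * s / total for q, s in sims.items()}


def improved_pagerank(out_links, sim, d=0.85, iterations=50):
    """Iterate formula (3); `sim(p, q)` returns the similarity of pages p and q."""
    pages = set(out_links)
    for targets in out_links.values():
        pages.update(targets)
    pr = {p: 1.0 for p in pages}
    for _ in range(iterations):
        new = {p: 1.0 - d for p in pages}
        for p, targets in out_links.items():
            shares = distribute(pr[p], {q: sim(p, q) for q in targets})
            for q, share in shares.items():
                new[q] += d * share
        pr = new
    return pr


# The Fig.2 example: PR(A) = 1, sim(A, B) = 0.8, sim(A, C) = 0.4.
shares = distribute(1.0, {"B": 0.8, "C": 0.4})
print({q: round(v, 4) for q, v in shares.items()})  # {'B': 0.6667, 'C': 0.3333}
```

With a constant similarity function the update reduces to the even split of formula (2), which is a quick sanity check on any implementation of formula (3).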

$$Similarity^{in}(A, B) = \frac{V_A^{in} \cdot V_B^{in}}{\lVert V_A^{in} \rVert \, \lVert V_B^{in} \rVert} \qquad (4)$$

$$Similarity^{out}(A, B) = \frac{V_A^{out} \cdot V_B^{out}}{\lVert V_A^{out} \rVert \, \lVert V_B^{out} \rVert} \qquad (5)$$

$$Similarity(A, B) = Similarity^{in}(A, B) + Similarity^{out}(A, B) \qquad (6)$$

In the above similarity formulas, the more common pages link to A and B, the bigger $Similarity^{in}(A, B)$ is; the more common pages are linked from A and B, the bigger $Similarity^{out}(A, B)$ is.

For a given web site, we first extract the strongly connected sub-graph that includes the site's home page, then use formulas (3) and (6) to compute the rank score of each page, rank the pages by their rank scores, and finally select some high-scored pages as the topic-relevant resources of the site.

3.3 Represent the Topic Vector

After ranking the pages in the sub-graph, some topic-relevant resources on the sub-graph are selected. Now we consider how to represent the extracted pages and their hyperlinks. According to the literature (Hodgson 2001), 61% of the anchor texts of hyperlinks reflect the topic of the pages they link to. So we view the anchor text of hyperlinks as the structure feature of a site, and the content text of pages as the content feature of a site. Under the vector space model, we combine the content feature and structure feature of a site into a mixed vector, called a topic vector, which is shown as follows:

$$v = (w'_1, w'_2, \ldots, w'_m, w_1, w_2, \ldots, w_n), \quad l = m + n \qquad (7)$$

where v is an l-dimension mixed vector, $w'_i$ is the weight of the structure feature term $t'_i$, and $w_j$ is the weight of the content feature term $t_j$. Here, we use the Information Gain (IG) method to select content and structure feature items, and use the traditional entropy weighting method to weight structure feature items (anchor text items):

$$a_{ik} = \log(TF_{ik} + 1) \times \left( 1 + \frac{1}{\log N} \sum_{j=1}^{N} \left[ \frac{TF_{ij}}{n_i} \log\left( \frac{TF_{ij}}{n_i} \right) \right] \right) \qquad (8)$$

where $a_{ik}$ is the weight of term i in site k, $TF_{ik}$ is the frequency of term i in site k, N is the number of all training sites, and $n_i$ is the number of sites which include term i.

When we weight content feature items, however, we should consider not only the frequency of terms but also their location on pages. Some HTML tags are important for reflecting the topics of pages, such as title, keywords and description, which generally summarize the content of pages. In addition, headings and bold or italic text in the body of pages are also important for reflecting the topic of pages. So we focus on the impact on page topics of a tag set S = {title, keywords, descriptions, H1, H2, H3, B, U, I}, and enlarge the weights of terms which appear in the tags of S. Here we give an improved entropy weighting formula to weight them, which is shown as follows:

$$a_{ik} = \log\left( \sum_{\beta \in S} w_\beta \, TF_{ik\beta} + 1 \right) \times \left( 1 + \frac{1}{\log N} \sum_{j=1}^{N} \left[ \frac{\sum_{\beta \in S} w_\beta \, TF_{ij\beta}}{n_i} \log\left( \frac{\sum_{\beta \in S} w_\beta \, TF_{ij\beta}}{n_i} \right) \right] \right) \qquad (9)$$

where $TF_{ik\beta}$ is the frequency of term i appearing in site k inside the tag β, and $w_\beta$ is the weighting coefficient for the tag β, with one coefficient for each tag: $W_{title}$, $W_{keyword}$, $W_{description}$, $W_{H1}$, $W_{H2}$, $W_{H3}$, $W_B$, $W_U$, $W_I$.

4 Experiments

In our web site classification experiments, we use Google's navigation site as our data source, from which we download data for 1127 web sites in 16 categories; 760 web sites are used as training samples and 367 web sites as testing samples. We use an SVM model as our web site classifier, and the Information Gain method for feature selection; in addition, we use the traditional entropy weighting method and the improved entropy weighting method to weight structure terms and content terms respectively. All the experiments were implemented in C++ and tested on a PC equipped with an AMD Athlon processor and 1 GB of main memory.
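The traditional entropy weighting of formula (8), used above for structure terms, can be sketched as below. This is one reading of the formula, in which the local factor is log(TF + 1) and the global factor is one plus a normalised entropy-style sum; the function name and the convention that a zero frequency contributes nothing to the sum are assumptions. The normaliser n_i is passed in by the caller, following the text's definition (the number of sites that include the term).

```python
import math


def entropy_weight(tf_row, k, n_i):
    """Weight of one term in site k under a reading of formula (8).

    tf_row[j] is the term's frequency in training site j, so len(tf_row) is N.
    n_i is the normaliser defined in the text.
    """
    n_sites = len(tf_row)
    if n_sites < 2 or n_i <= 0:
        raise ValueError("need at least two sites and a positive n_i")
    entropy_sum = 0.0
    for tf in tf_row:
        if tf > 0:  # assumed convention: a zero frequency adds nothing
            ratio = tf / n_i
            entropy_sum += ratio * math.log(ratio)
    global_part = 1.0 + entropy_sum / math.log(n_sites)
    local_part = math.log(tf_row[k] + 1)
    return local_part * global_part


# Hypothetical term that occurs twice in each of two training sites.
print(entropy_weight([2, 2], k=0, n_i=2))
```

The improved weighting of formula (9) would follow the same shape, with each raw frequency replaced by the tag-weighted sum over S before the logarithms are taken.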
We use the super page-based web site classification method as the baseline system, in which we limit the number of web pages taken from each site to a maximum of 50; for our web site classification method based on topological structure, we select only the top 20 pages as each site's topic-relevant data. Fig.3, Fig.4 and Fig.5 compare our method with the super page-based method on precision, recall and F1 value. Table 1 compares the two methods on macro-averaging and micro-averaging values. Experimental results show that our method achieves much better performance than the super page-based method: macro-averaging and micro-averaging values increase by nearly 20% with our method compared with the super page-based method.

[Fig.3 Comparison on precision. Series: superpage; based on topology structure. Categories: military, agriculture, metallurgy, medicine, print, building, finery, mechanism, auto, law, logistics, environment, electronic, grain, spinning, energy.]

[Fig.4 Comparison on recall. Same series and categories as Fig.3.]

[Fig.5 Comparison on F1. Same series and categories as Fig.3.]
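For readers unfamiliar with the two averaging schemes compared in Table 1, the sketch below computes both from per-category counts. The paper does not state its exact definitions, so this assumes the common ones: macro values average the per-category scores, and micro values are computed from counts pooled over all categories. The category names come from the figures; the counts are hypothetical.

```python
def prf(tp, fp, fn):
    """Precision, recall and F1 from true-positive, false-positive, false-negative counts."""
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f


def macro_micro(counts):
    """counts maps category -> (tp, fp, fn); returns (macro, micro) triples of (P, R, F1)."""
    per_category = [prf(*c) for c in counts.values()]
    n = len(per_category)
    # Macro: every category counts equally, regardless of its size.
    macro = tuple(sum(scores[i] for scores in per_category) / n for i in range(3))
    # Micro: pool the raw counts first, so large categories dominate.
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    micro = prf(tp, fp, fn)
    return macro, micro


# Hypothetical counts for two of the sixteen categories.
macro, micro = macro_micro({"law": (8, 2, 2), "auto": (1, 1, 3)})
print("macro P/R/F1:", [round(x, 4) for x in macro])
print("micro P/R/F1:", [round(x, 4) for x in micro])
```

In this toy example the weak "auto" category pulls macro precision down to 0.65 while micro precision stays at 0.75, which is why the two averages are usually reported side by side.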

[Table 1 Comparison on macro-averaging and micro-averaging. Rows: based on topological structure, superpage. Columns: MacroP, MacroR, MacroF1, MicroP.]

To investigate how the improved PageRank algorithm and the traditional PageRank algorithm affect web site classification, we conducted a comparison experiment. The experimental results are shown in Table 2. Although the improved PageRank algorithm gives a lower MacroP than the traditional PageRank algorithm, it clearly increases the MacroR, MacroF1 and MicroP values.

[Table 2 The effect of Improved PageRank. Rows: Improved PageRank, PageRank. Columns: MacroP, MacroR, MacroF1, MicroP.]

5 Conclusions

In this paper, we propose a web site classification approach based on the site's topological structure. The topological structure of a web site can be represented as a directed graph. Assuming that the topic-relevant resources of a web site are located on the strongly connected sub-graph that includes the site's home page, we use an improved PageRank algorithm based on link-based page similarity, which can efficiently rank pages in the sub-graph. To represent the content feature and structure feature of a site efficiently, we mix them into a topic vector, and use an improved entropy weighting method to weight content terms according to their frequency and location on pages. The experimental results show that our web site classification approach achieves better performance than the traditional super page-based approach.

6 References

Google: Search Engine.
Yahoo: Directory Service.
DMOZ: Open Directory Project.

McCallum, A. and Nigam, K., 1998, A Comparison of Event Models for Naïve Bayes Text Classification, Proceedings of AAAI-98 Workshop on Learning for Text Categorization.
Mitchell, T. M., 1996, Machine Learning, New York: McGraw Hill.
Lam, W. and Ho, C.Y., 1998, Using a Generalized Instance Set for Automatic Text Categorization, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, AU.
Masand, B., Linoff, G. and Waltz, D., 1992, Classifying News Stories Using Memory Based Reasoning, Proceedings of the 15th Annual ACM SIGIR Conference, Copenhagen, Denmark.
Joachims, T., 1998, Text Categorization with Support Vector Machines: Learning with Many Relevant Features, European Conference on Machine Learning (ECML), Chemnitz, Germany.
Kwok, J.T.Y., 1998, Automatic Text Categorization Using Support Vector Machine, Proceedings of International Conference on Neural Information Processing.
Chakrabarti, S., Dom, B. and Indyk, P., 1998, Enhanced Hypertext Categorization Using Hyperlinks, Proceedings of the ACM SIGMOD Conference on Management of Data, Seattle, Washington.
Craven, M., DiPasquo, D. and Freitag, D., 1999, Learning to Construct Knowledge Bases from the World Wide Web, Artificial Intelligence.
Ester, M., Kriegel, H.P. and Schubert, M., 2002, Web Site Mining: A new way to spot Competitors, Customers and Suppliers in the World Wide Web, Proceedings of 8th International Conference on Knowledge Discovery and Data Mining.
Pierre, J. M., 2001, On the Automated Classification of Web Sites, Linköping Electronic Articles in Computer and Information Science, vol. 6.
Terveen, L., Hill, W. and Amento, B., 1999, Constructing, Organizing, and Visualizing Collections of Topically Related Web Resources, ACM Trans. on Computer-Human Interaction, vol. 6, no. 1.
Ester, M., Kriegel, H.P. and Schubert, M., 2004, Accurate and Efficient Crawling for Relevant Websites, Proceedings of the Thirtieth International Conference on Very Large Data Bases, Aug.
Kriegel, H.P. and Schubert, M., 2004, Classification of Websites as Sets of Feature Vectors, Proceedings of the IASTED International Conference Databases and Applications, Feb.
Tian, Y.H., Huang, T.J. and Gao, W., 2004, A Web Site Representation and Mining Algorithm using a Multiscale Tree Model, Journal of Software, vol. 15, no. 9.
Fu, D.Y., Da, C.Q. and Zhong, W., 2006, A Web Site Categorization System Based on Key Resources, Journal of Harbin Institute of Technology, vol. 38, no. 1.
Dong, B.L., Qi, G.N. and Gu, X.J., 2005, Specific website subject recognition based on the hybrid vector space model, Journal of Tsinghua University (Sci & Tech), vol. 45.
Lu, Y., Wang, B., Yang, Z.F. and Zhang, X., 2006, Link Analysis in Web Key Resources Discovery, Proceedings of CNCCL.
Page, L., Brin, S. and Motwani, R., 1999, The PageRank Citation Ranking: Bringing Order to the Web, Technical report, Stanford Digital Libraries SIDL-WP.
Hodgson, J., 2001, Do HTML Tags Flag Semantic Content? IEEE Internet Computing, vol. 5, no. 1.
Wang, X.Y., Xiong, F., Ling, B. and Zhou, A.Y., 2003, A Similarity-Based Algorithm for Topic Exploration and Distillation, Journal of Software, vol. 14, no. 09.
Yuan, F.Y. and Zhang, Y.Y., 2007, The research and improvement of relevance ranking method based on link analysis, Computer Engineering and Design, vol. 28, no. 7.


More information

Query classification using topic models and support vector machine

Query classification using topic models and support vector machine Query classfcaton usng topc models and support vector machne Deu-Thu Le Unversty of Trento, Italy deuthu.le@ds.untn.t Raffaella Bernard Unversty of Trento, Italy bernard@ds.untn.t Abstract Ths paper descrbes

More information

A Method of Hot Topic Detection in Blogs Using N-gram Model

A Method of Hot Topic Detection in Blogs Using N-gram Model 84 JOURNAL OF SOFTWARE, VOL. 8, NO., JANUARY 203 A Method of Hot Topc Detecton n Blogs Usng N-gram Model Xaodong Wang College of Computer and Informaton Technology, Henan Normal Unversty, Xnxang, Chna

More information

Querying by sketch geographical databases. Yu Han 1, a *

Querying by sketch geographical databases. Yu Han 1, a * 4th Internatonal Conference on Sensors, Measurement and Intellgent Materals (ICSMIM 2015) Queryng by sketch geographcal databases Yu Han 1, a * 1 Department of Basc Courses, Shenyang Insttute of Artllery,

More information

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval Proceedngs of the Thrd NTCIR Workshop Descrpton of NTU Approach to NTCIR3 Multlngual Informaton Retreval Wen-Cheng Ln and Hsn-Hs Chen Department of Computer Scence and Informaton Engneerng Natonal Tawan

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks

FAHP and Modified GRA Based Network Selection in Heterogeneous Wireless Networks 2017 2nd Internatonal Semnar on Appled Physcs, Optoelectroncs and Photoncs (APOP 2017) ISBN: 978-1-60595-522-3 FAHP and Modfed GRA Based Network Selecton n Heterogeneous Wreless Networks Xaohan DU, Zhqng

More information

Virtual Machine Migration based on Trust Measurement of Computer Node

Virtual Machine Migration based on Trust Measurement of Computer Node Appled Mechancs and Materals Onlne: 2014-04-04 ISSN: 1662-7482, Vols. 536-537, pp 678-682 do:10.4028/www.scentfc.net/amm.536-537.678 2014 Trans Tech Publcatons, Swtzerland Vrtual Machne Mgraton based on

More information

Document Representation and Clustering with WordNet Based Similarity Rough Set Model

Document Representation and Clustering with WordNet Based Similarity Rough Set Model IJCSI Internatonal Journal of Computer Scence Issues, Vol. 8, Issue 5, No 3, September 20 ISSN (Onlne): 694-084 www.ijcsi.org Document Representaton and Clusterng wth WordNet Based Smlarty Rough Set Model

More information

A Robust Webpage Information Hiding Method Based on the Slash of Tag

A Robust Webpage Information Hiding Method Based on the Slash of Tag Advanced Engneerng Forum Onlne: 2012-09-26 ISSN: 2234-991X, Vols. 6-7, pp 361-366 do:10.4028/www.scentfc.net/aef.6-7.361 2012 Trans Tech Publcatons, Swtzerland A Robust Webpage Informaton Hdng Method Based

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

A Novel Optimization Technique for Translation Retrieval in Networks Search Engines

A Novel Optimization Technique for Translation Retrieval in Networks Search Engines A Novel Optmzaton Technque for Translaton Retreval n Networks Search Engnes Yanyan Zhang Zhengzhou Unversty of Industral Technology, Henan, Chna Abstract - Ths paper studes models of Translaton Retreval.e.

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research

Scheduling Remote Access to Scientific Instruments in Cyberinfrastructure for Education and Research Schedulng Remote Access to Scentfc Instruments n Cybernfrastructure for Educaton and Research Je Yn 1, Junwe Cao 2,3,*, Yuexuan Wang 4, Lanchen Lu 1,3 and Cheng Wu 1,3 1 Natonal CIMS Engneerng and Research

More information

Relevance Feedback for Image Retrieval

Relevance Feedback for Image Retrieval Vashal D Dhale et al, / (IJCSIT Internatonal Journal of Computer Scence and Informaton Technologes, Vol 4 (2, 203, 39-323 Relevance Feedback for Image Retreval Vashal D Dhale, Dr A R Mahaan, Prof Uma Thakur

More information

IMPACT OF RADIO MAP SIMULATION ON POSITIONING IN INDOOR ENVIRONTMENT USING FINGER PRINTING ALGORITHMS

IMPACT OF RADIO MAP SIMULATION ON POSITIONING IN INDOOR ENVIRONTMENT USING FINGER PRINTING ALGORITHMS IMPACT OF RADIO MAP SIMULATION ON POSITIONING IN INDOOR ENVIRONTMENT USING FINGER PRINTING ALGORITHMS Jura Macha and Peter Brda Unversty of Zlna, Faculty of Electrcal Engneerng, Department of Telecommuncatons

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Audio Content Classification Method Research Based on Two-step Strategy

Audio Content Classification Method Research Based on Two-step Strategy (IJACSA) Internatonal Journal of Advanced Computer Scence and Applcatons, Audo Content Classfcaton Method Research Based on Two-step Strategy Sume Lang Department of Computer Scence and Technology Chongqng

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

A Method of Query Expansion Based on Event Ontology

A Method of Query Expansion Based on Event Ontology A Method of Query Expanson Based on Event Ontology Zhaoman Zhong, Cunhua L, Yan Guan, Zongtan Lu A Method of Query Expanson Based on Event Ontology 1 Zhaoman Zhong, 1 Cunhua L, 1 Yan Guan, 2 Zongtan Lu,

More information

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc.

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 97-735 Volume Issue 9 BoTechnology An Indan Journal FULL PAPER BTAIJ, (9), [333-3] Matlab mult-dmensonal model-based - 3 Chnese football assocaton super league

More information

Classic Term Weighting Technique for Mining Web Content Outliers

Classic Term Weighting Technique for Mining Web Content Outliers Internatonal Conference on Computatonal Technques and Artfcal Intellgence (ICCTAI'2012) Penang, Malaysa Classc Term Weghtng Technque for Mnng Web Content Outlers W.R. Wan Zulkfel, N. Mustapha, and A. Mustapha

More information

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University CAN COMPUTERS LEARN FASTER? Seyda Ertekn Computer Scence & Engneerng The Pennsylvana State Unversty sertekn@cse.psu.edu ABSTRACT Ever snce computers were nvented, manknd wondered whether they mght be made

More information

Remote Sensing Image Retrieval Algorithm based on MapReduce and Characteristic Information

Remote Sensing Image Retrieval Algorithm based on MapReduce and Characteristic Information Remote Sensng Image Retreval Algorthm based on MapReduce and Characterstc Informaton Zhang Meng 1, 1 Computer School, Wuhan Unversty Hube, Wuhan430097 Informaton Center, Wuhan Unversty Hube, Wuhan430097

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY

THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY Proceedngs of the 20 Internatonal Conference on Machne Learnng and Cybernetcs, Guln, 0-3 July, 20 THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY JUN-HAI ZHAI, NA LI, MENG-YAO

More information

Efficient Text Classification by Weighted Proximal SVM *

Efficient Text Classification by Weighted Proximal SVM * Effcent ext Classfcaton by Weghted Proxmal SVM * Dong Zhuang 1, Benyu Zhang, Qang Yang 3, Jun Yan 4, Zheng Chen, Yng Chen 1 1 Computer Scence and Engneerng, Bejng Insttute of echnology, Bejng 100081, Chna

More information

An Improved Image Segmentation Algorithm Based on the Otsu Method

An Improved Image Segmentation Algorithm Based on the Otsu Method 3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Experiments in Text Categorization Using Term Selection by Distance to Transition Point

Experiments in Text Categorization Using Term Selection by Distance to Transition Point Experments n Text Categorzaton Usng Term Selecton by Dstance to Transton Pont Edgar Moyotl-Hernández, Héctor Jménez-Salazar Facultad de Cencas de la Computacón, B. Unversdad Autónoma de Puebla, 14 Sur

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

The Shortest Path of Touring Lines given in the Plane

The Shortest Path of Touring Lines given in the Plane Send Orders for Reprnts to reprnts@benthamscence.ae 262 The Open Cybernetcs & Systemcs Journal, 2015, 9, 262-267 The Shortest Path of Tourng Lnes gven n the Plane Open Access Ljuan Wang 1,2, Dandan He

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2860-2866 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A selectve ensemble classfcaton method on mcroarray

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

Domain Thesaurus Construction from Wikipedia *

Domain Thesaurus Construction from Wikipedia * Internatonal Conference on Computer, Networks and Communcaton Engneerng (ICCNCE 2013) Doman Thesaurus Constructon from Wkpeda * WenKe Yn 1, Mng Zhu 2, TanHao Chen 2 1 Department of Electronc Engneerng

More information

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1 A New Feature of Unformty of Image Texture Drectons Concdng wth the Human Eyes Percepton Xng-Jan He, De-Shuang Huang, Yue Zhang, Tat-Mng Lo 2, and Mchael R. Lyu 3 Intellgent Computng Lab, Insttute of Intellgent

More information

Improved Mutual Information Based on Relative Frequency. Factor and Degree of Difference among Classes

Improved Mutual Information Based on Relative Frequency. Factor and Degree of Difference among Classes 2nd Informaon Technology and Mechatroncs Engneerng Conference (ITOEC 2016 Improved Mutual Informaon Based on Relave Frequency Factor and Degree of Dfference among Classes Janwen Gao a*, X Yangb,Wen Wenc

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b

Clustering Algorithm Combining CPSO with K-Means Chunqin Gu 1, a, Qian Tao 2, b Internatonal Conference on Advances n Mechancal Engneerng and Industral Informatcs (AMEII 05) Clusterng Algorthm Combnng CPSO wth K-Means Chunqn Gu, a, Qan Tao, b Department of Informaton Scence, Zhongka

More information

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB V. Hotař, A. Hotař Techncal Unversty of Lberec, Department of Glass Producng Machnes and Robotcs, Department of Materal

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

On-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering

On-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering JOURNAL OF COMPUTERS, VOL. 5, NO. 4, APRIL 2010 549 On-lne Hot Topc Recommendaton Usng Tolerance Rough Set Based Topc Clusterng Yonghu Wu, Yuxn Dng, Xaolong Wang, Jun Xu Intellgence Computng Research Center

More information

Impact of Contextual Information for Hypertext Documents Retrieval

Impact of Contextual Information for Hypertext Documents Retrieval Impact of Contextual Informaton for Hypertext ocuments Retreval Idr Chbane and Bch-Lên oan SUPELEC Computer Scence dpt. Plateau de Moulon 3 rue Jolot Cure 9 92 Gf/Yvette France {Idr.Chbane Bch-Len.oan}@supelec.fr

More information

Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study

Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study Arabc Text Classfcaton Usng N-Gram Frequency Statstcs A Comparatve Study Lala Khresat Dept. of Computer Scence, Math and Physcs Farlegh Dcknson Unversty 285 Madson Ave, Madson NJ 07940 Khresat@fdu.edu

More information

Load-Balanced Anycast Routing

Load-Balanced Anycast Routing Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance

More information

News. Recap: While Loop Example. Reading. Recap: Do Loop Example. Recap: For Loop Example

News. Recap: While Loop Example. Reading. Recap: Do Loop Example. Recap: For Loop Example Unversty of Brtsh Columba CPSC, Intro to Computaton Jan-Apr Tamara Munzner News Assgnment correctons to ASCIIArtste.java posted defntely read WebCT bboards Arrays Lecture, Tue Feb based on sldes by Kurt

More information

A Clustering Algorithm for Key Frame Extraction Based on Density Peak

A Clustering Algorithm for Key Frame Extraction Based on Density Peak Journal of Computer and Communcatons, 2018, 6, 118-128 http://www.scrp.org/ournal/cc ISSN Onlne: 2327-5227 ISSN Prnt: 2327-5219 A Clusterng Algorthm for Key Frame Extracton Based on Densty Peak Hong Zhao

More information

High-Boost Mesh Filtering for 3-D Shape Enhancement

High-Boost Mesh Filtering for 3-D Shape Enhancement Hgh-Boost Mesh Flterng for 3-D Shape Enhancement Hrokazu Yagou Λ Alexander Belyaev y Damng We z Λ y z ; ; Shape Modelng Laboratory, Unversty of Azu, Azu-Wakamatsu 965-8580 Japan y Computer Graphcs Group,

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity

Efficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs

More information

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques Enhancement of Infrequent Purchased Product Recommendaton Usng Data Mnng Technques Noraswalza Abdullah, Yue Xu, Shlomo Geva, and Mark Loo Dscplne of Computer Scence Faculty of Scence and Technology Queensland

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

A Novel Term_Class Relevance Measure for Text Categorization

A Novel Term_Class Relevance Measure for Text Categorization A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure

More information

PERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM

PERFORMANCE EVALUATION FOR SCENE MATCHING ALGORITHMS BY SVM PERFORMACE EVALUAIO FOR SCEE MACHIG ALGORIHMS BY SVM Zhaohu Yang a, b, *, Yngyng Chen a, Shaomng Zhang a a he Research Center of Remote Sensng and Geomatc, ongj Unversty, Shangha 200092, Chna - yzhac@63.com

More information

Object-Based Techniques for Image Retrieval

Object-Based Techniques for Image Retrieval 54 Zhang, Gao, & Luo Chapter VII Object-Based Technques for Image Retreval Y. J. Zhang, Tsnghua Unversty, Chna Y. Y. Gao, Tsnghua Unversty, Chna Y. Luo, Tsnghua Unversty, Chna ABSTRACT To overcome the

More information

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems

Real-time Fault-tolerant Scheduling Algorithm for Distributed Computing Systems Real-tme Fault-tolerant Schedulng Algorthm for Dstrbuted Computng Systems Yun Lng, Y Ouyang College of Computer Scence and Informaton Engneerng Zheang Gongshang Unversty Postal code: 310018 P.R.CHINA {ylng,

More information

A Novel Video Retrieval Method Based on Web Community Extraction Using Features of Video Materials

A Novel Video Retrieval Method Based on Web Community Extraction Using Features of Video Materials IEICE TRANS. FUNDAMENTALS, VOL.E92 A, NO.8 AUGUST 2009 1961 PAPER Specal Secton on Sgnal Processng A Novel Vdeo Retreval Method Based on Web Communty Extracton Usng Features of Vdeo Materals Yasutaka HATAKEYAMA

More information

Modular PCA Face Recognition Based on Weighted Average

Modular PCA Face Recognition Based on Weighted Average odern Appled Scence odular PCA Face Recognton Based on Weghted Average Chengmao Han (Correspondng author) Department of athematcs, Lny Normal Unversty Lny 76005, Chna E-mal: hanchengmao@163.com Abstract

More information

A Clustering Algorithm Solution to the Collaborative Filtering

A Clustering Algorithm Solution to the Collaborative Filtering Internatonal Journal of Scence Vol.4 No.8 017 ISSN: 1813-4890 A Clusterng Algorthm Soluton to the Collaboratve Flterng Yongl Yang 1, a, Fe Xue, b, Yongquan Ca 1, c Zhenhu Nng 1, d,* Hafeng Lu 3, e 1 Faculty

More information

Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks

Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks Federated Search of Text-Based Dgtal Lbrares n Herarchcal Peer-to-Peer Networks Je Lu School of Computer Scence Carnege Mellon Unversty Pttsburgh, PA 15213 jelu@cs.cmu.edu Jame Callan School of Computer

More information

Keyword-based Document Clustering

Keyword-based Document Clustering Keyword-based ocument lusterng Seung-Shk Kang School of omputer Scence Kookmn Unversty & AIrc hungnung-dong Songbuk-gu Seoul 36-72 Korea sskang@kookmn.ac.kr Abstract ocument clusterng s an aggregaton of

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information