Impact of a New Attribute Extraction Algorithm on Web Page Classification
|
|
- Ashley Cain
- 5 years ago
- Views:
Transcription
1 Impact of a New Attrbute Extracton Algorthm on Web Page Classfcaton Gösel Brc, Banu Dr, Yldz Techncal Unversty, Computer Engneerng Department Abstract Ths paper ntroduces a new algorthm for dmensonalty reducton and ts applcaton on web page classfcaton. A heterogeneous collecton of web pages s used as the dataset. Selected attrbutes for classfcaton are the textual content of pages. Usng the offered algorthm, hgh dmenson of attrbutes- words extracted from the pages- are proected onto a new hyper plane havng dmensons equal to the number of classes. Results show that processng tmes of classfcaton algorthms dramatcally decrease wth the offered reducton algorthm. Ths mostly reles on the number of attrbutes gven to classfers fall off. Accuraces of the classfcaton algorthms also ncrease compared to tests run wthout usng the proposed reducton algorthm. I. INTRODUCTION HE rapd and exponental growth of web pages on the T nternet maes dffcult to organze relevant pages together. People tend to organze data sources nto categores dependng on the content because t s easer to trac between categores and reach the desred data. Dewey and Lbrary of Congress Headngs are the examples of classfcaton systems used n lbrares to acheve categorzaton. Web pages are not logcally organzed n the nternet whch maes the data access and retreval dffcult. When we loo at the nternet, we can fnd web page classfcaton solutons n forms of web drectores le Dmoz or Yahoo!. The problem s that these drectores are formed by human effort- wth the hands and mnds of the edtors. The web s n fact several tmes bgger than the coverage of these drectores plus search engnes. That s why automaton of ths categorzaton process s mportant. Classfcaton algorthms may help edtors to classfy new comng web pages nto exstng categores. Ths s also nown as web page classfcaton. There are many solutons proposed to help solve ths problem. Dumas and Chen [1] offers herarchcal classfcaton usng SVM s on LooSmart web drectory. Cho and Peng [2] offers a dynamc and herarchcal classfcaton scheme. Mladenc uses Yahoo web drectory wth a Naïve Bayes classfer [3]. Holden and Fretas classfy ther web dataset usng ant-colony algorthm [4]. Many other researches can be found n lterature about web page classfcaton problem. One of the models wdely used n text classfcaton s the G.Brc s wth the Computer Engneerng Department, Yldz Techncal Unversty, Istanbul, Turey (e-mal: gosel@ce.yldz.edu.tr). B.Dr s wth the Computer Engneerng Department, Yldz Techncal Unversty, Istanbul, Turey (e-mal: banu@ce.yldz.edu.tr). vector space model n whch the documents are represented as vectors descrbed by a set of dentfers, for example, words as terms. Ths model s also nown as bag-of-words model, every document s ust a contaner for the words t contans. Thnng n the vector space, each term s a dmenson for the document vectors. The nature of ths representaton causes a very hgh-dmensonal and sparse feature space, whch s a common problem to deal wth when usng bag-of-words model. There are two effectve ways to overcome ths dmensonalty problem: Attrbute selecton and attrbute extracton. Attrbute selecton algorthms output a subset of the nput attrbutes, results n a lower dmensonal space. Instead of usng all words as attrbutes, attrbute selecton algorthms evaluate attrbutes on a specfc classfer to fnd the best subset of terms [5]. Ths results n reduced cost for classfcaton and better classfcaton accuracy. The most popular attrbute selecton algorthms nclude document frequency, ch statstc, nformaton gan, term strength and mutual nformaton [6]. Ch-square and correlaton coeffcent methods have been shown to produce better results than document frequency [7]. The lac of attrbute selecton algorthms s that the selecton procedure s evaluated on a certan classfer. The produced subset may not be sutable for another classfer to mprove ts performance. Attrbute extracton algorthms smplfes the amount of resource requred to descrbe a large set of data. The hghdmensonal and sparse structure of vector space model requres large amount of memory and computaton power. The am of attrbute extracton s to combne terms to form a new descrpton for the data wth suffcent accuracy. Attrbute extracton can be seen as proectng the hghdmensonal data nto a new, lower-dmensonal hyperspace. Mostly used technques are prncpal components analyss (PCA), Isomap and Latent Semantc Analyss (LSA). Latent Semantc Indexng s based on LSA and t s the mostly used algorthm n text mnng tass nowadays. Our algorthm also extracts attrbutes from the orgnal vector space, proects nto a new hyperspace wth dmensons equal to number of classes. In secton 2, we ntroduce our evaluaton dataset and the preprocessng steps. Secton 3 gves bref descrpton about related attrbute extracton algorthms and ntroduces the proposed method. In secton 4 we dscuss our expermental results. Secton 5 addresses conclusons and feature wor.
2 II. EVALUATION DATASET AND PREPROCESSING A. Evaluaton Dataset Evaluaton dataset s prepared from web pages under Dmoz [8] category World/Turce. We wor to develop an automated classfer for the Dmoz Tursh drectory. We used the dataset that we prepared for our study because the attrbute extracton algorthm we ntroduce n ths paper s a part of our system. World/Turce category ncluded unque web pages n Tursh at the tme we crawled. There are thrteen man categores n ths set: 1. Shoppng 2. News 3. Computers 4. Scence 5. Regonal 6. Recreaton 7. Busness 8. Reference 9. Arts 10. Games 11. Health 12. Sports 13. Socety Each category has several sub categores (total 758), whch are not consdered at ths tme. B. Preprocessng The dataset s crawled n raw HTML format. It s then parsed wth HTML Parser 1 to fetch the content of web pages. Fg.1. Dstrbuton of terms and documents n the dataset. We can see that most of the dataset elements contan less than 200 eywords. Term statstcs are calculated to see the outlyng documents. Fg.1 shows the dstrbuton of terms and documents, n other words, the number of words documents have and the number of documents wth a certan amount of terms. Ths plot shows us that some documents contan lots of eywords, whle some have very few. A flter smlar to Box-Plot s used to fnd the outlers n ths dstrbuton. The mean and standard devatons of the axes are calculated and a box s drawn on the dstrbuton wth the center (mean x, mean y ) and boundares (σ x, σ y ). The area wthn the box s used as the nput dataset; the samples outsde ths box are consdered as outlers and cleared. The center of the box and the boundares are also shown n Fg.1. The dataset fetched out of Dmoz s World/Turce category s n Tursh. Because Tursh s a suffx language, meanngs of the words change wth suffxes. To get rd of ths effect, all of the terms are stemmed nto ther roots by usng a Tursh stemmer tool named Zembere 2. Number of web pages Shoppng News Computers Scence Regonal Recreaton Busness Reference Classes Arts Games Health Sports Socety Fg.2. Dstrbuton of web pages among classes after preprocessng step. The fnal preprocess step s cleanng up the Tursh stop words. After the preprocessng step, we have documents (web pages) and unque eywords. Loong from the bag-of-words vew, we have a termdocument matrx wth rows and columns. Because any of the documents wll contan only a tny subset of our terms, our matrx s very sparse. The cells of the matrx contan tf-df values of terms calculated by (1), (2) and (3), where n s the number of occurrences of the consdered term n document d, D s the total number of documents, {d :t d } s the number of documents where term t appears.. n, tf, = (1) n df, D = log (2) { d : t d } tfdf, = tf, df (3) Our matrx leads us to a dmensonal vector space, 1 Avalable at 2 Avalable at
3 whch s hard to deal wth because runnng classfcaton algorthms on such a multdmensonal space s very tme and memory consumng. The dstrbuton of web pages among 13 classes s also uneven, whch causes another problem for classfcaton algorthms. Number of web pages per class after preprocessng step s gven n Fg.2. III. ATTRIBUTE EXTRACTION Attrbute extracton algorthms smplfes the amount of resource requred to descrbe a large set of data. The am of attrbute extracton s to combne terms to form a new descrpton for the data wth suffcent accuracy. In ths secton, commonly used extracton algorthms -prncpal component analyss, PCA and latent semantc analyss, LSA- and the proposed method are ntroduced. Our algorthm proects attrbutes nto a new hyperspace wth dmensons equal to number of classes. A. Related Research 1) Prncpal Component Analyss: PCA transforms correlate varables nto a smaller number of correlated varables- prncpal components. Invented by Pearson n 1901 [9], t s mostly used for exploratory data analyss. PCA s used for attrbute extracton by retanng the characterstcs of the dataset that contrbute most to ts varance, by eepng lower order prncpals, whch tend to have the most mportant aspects of data. Ths s done by a proecton nto a new hyper plane usng Egen values and Egen vectors. PCA s a popular technque n pattern recognton, but ts applcatons are not very common because t s not optmzed for class separablty [10]. It s wdely used n mage processng, 2) Latent Semantc Analyss: Patented n 1988, LSA s a technque that analyzes relatonshps between a document set and the terms that they contan by producng a set of concepts related to them [11]. As the name suggests, sngular value decomposton breas our matrx down nto a set of smaller components. LSA uses sngular value decomposton (SVD) method to fnd the relatonshps between documents. Sngular value decomposton breas term-document matrx down nto a set of smaller components. The algorthm alters one of these components (reduces the number of dmensons), and then recombnes them nto a matrx of the same shape as the orgnal, so we can agan use t as a looup grd. The matrx we get bac s an approxmaton of the term-document matrx we provded as nput, and loos much dfferent from the orgnal. LSI s mostly used for web page retreval, document clusterng purposes. It s also used for document classfcaton va votng, or nformaton flterng. Many algorthms are combned wth LSI to mprove performance by worng n a less complex hyperspace. B. Proposed Attrbute Extracton Method Our attrbute extracton algorthm nether uses Egen values, Egen vectors, nor sngular value decomposton. Weghts of terms and ther probablstc dstrbuton over the classes are taen nto account. We proect term probabltes to classes, and sum up those probabltes to get the mpact of each term to each class [15]. Assume we have I terms, J documents and K classes. Let n, be the number of occurrences of term t n document d and N be the total number of documents that contan t. We calculate the total number of occurrences of a term t n class c wth (4). We calculate the weght of term t (n a document d ) that affects class c wth (5). nc = n, d c,, (4) N w, = log, (5) ( nc ) + 1 log N We calculate the total effect of terms n a document over a class c wth (6). At the end, I terms are proected onto number of classes; we have K new terms n hand for a document. Repeatng ths procedure for all documents gves us a reduced matrx wth J rows (one row per document) and K columns (number of extracted features equals the number of classes). Fnally, we normalze new terms by usng (7). Y = w, w d, (6) Y newterm = (7) Y We gve a bref example to explan the algorthm comprehensbly. Assume we have 8 terms, 6 documents and 2 classes gven n Fg.3. The correspondng term-document matrx s n Table 1. The cells of Table 1 correspond to term weghts calculated wth (4). Fg.3. Sample dataset to explan the proposed extracton algorthm comprehensbly.
4 TABLE I TERM-DOCUMENT MATRIX OF THE SAMPLE DATASET D1 D2 D3 D4 D5 D6 t t t t t t t t We calculate term weghts over classes wth (5). Ths produces I by K matrx, gven n Table 2. After applyng (6) and (7), we have the reduced J by K matrx wth the extracted features of the processed documents. Result matrx s gven n Table 3. Ths matrx s vsualzed n Fg.4. We can also read the extracted values as the membershp probabltes of the documents to classes, as seen on Fg.4. For example, the content of D4 tells us that t s 84% Class2 and 16% Class1. Thus, the extracton method we propose may also be used as a stand-alone classfer. When we feed a new document, the system decdes whch class ths document belongs to by calculatng (6) and (7) wth the new document s content. TABLE II TERM WEIGHTS OVER CLASSES FOR THE SAMPLE DATASET C1 C2 t t t t t t t t TABLE III TERM WEIGHTS OVER CLASSES FOR THE SAMPLE DATASET C1 C2 D D D D D D Membershp probabltes 1 0,9 0,8 0,7 0,6 0,5 0,4 0,3 0,2 0,1 0 D1 D2 D3 Documents D4 D5 D6 Class2 Class1 Fg.4. Vsualzaton of Table III. Extracted values can be taen as the membershp probabltes of the documents to the classes. IV. EXPERIMENTAL RESULTS We prepare our evaluaton dataset and test our attrbute extracton algorthm s performance usng 5 dfferent classfers. We use 10-fold cross-valdaton for testng. We use Naïve Bayes as a smple probablstc classfer, whch s based on applyng Bayes' theorem wth strong ndependence assumptons [12]. We choose J48 Tree, an mplementaton of Qunlan s C4.5 decson tree algorthm [13] for a basc tree based classfer, and a random forest wth 100 trees to construct a collecton of decson trees wth controlled varatons [14]. We choose radal bass functons networ for a functon based classfer and K-nearest neghbor algorthm wth K=10 to test nstance-based classfers. We evaluate the classfers performance wth ther classfcaton accuracy whch s calculated by (8) and F- Measure by usng (9).The classfcaton performance results are gven n Table 4. We can see that K-nearest neghbor classfer fals because our documents are unevenly dstrbuted between classes. Regonal and Busness classes have much more nstances than the other classes; ths s the reason why an nstance-based learner fals. RBF networ also fals because the dstrbuton of the attrbutes can not be easly modeled wth Gaussan local models. TP + TN accuracy = (8) TP + FP + FN + TN 2 TP F = (9) 2 ( TP + FP + FN ) TABLE IV RESULTS FOR THE CLASSIFIERS EVALUATED ON OUR DATASET WITHOUT USING ATTRIBUTE EXTRACTION Classfcaton F-Measure Accuracy Naïve Bayes 55.7% J48 Tree (C4.5) 57.7% Functons Networ 32.4% K-Nearest Neghbor 31.8% Random Forest (wth 100 trees) 61.4% TABLE IV RESULTS FOR THE CLASSIFIERS EVALUATED ON OUR DATASET USING THE PROPOSED ATTRIBUTE EXTRACTION METHOD Classfcaton F-Measure Accuracy Naïve Bayes 67.5% J48 Tree (C4.5) 58.7% Functons Networ 70.6% K-Nearest Neghbor 74.3% Random Forest (wth 100 trees) 75.5% 0.753
5 We gve the classfcaton performance results of the classfers usng our proposed attrbute extracton algorthm n Table 5. We can see that our attrbute extracton algorthm ncreases the classfcaton accuracy by 11.8% when usng Naïve Bayes, 1.0% usng J48, 38.2% usng RBF networ, 42.5% usng K-nearest neghbor, and 14.1% usng a random forest. Another outcome of our method s that t decreases classfcaton tmes on both classfers dramatcally. Fg.5 compares the classfcaton accuraces and Fg.6 compares F-measures wth and wthout usng our extracton method. We can see that both classfcaton accuraces and F-measures ncrease. Our method hghly mproves the performance of K-nearest neghbor classfer on unevenly dstrbuted dataset. The performance of RBF networ s also boosted because the fnal attrbutes tend to have a Gaussan le dstrbuton. Classfcaton Accuracy wthout attrbute extracton wth attrbute extracton appled 74,3% 75,5% 70,6% 67,5% 58,7% 61,4% 55,7% 57,7% Naïve Bayes J48 Tree (C4.5) 32,4% 31,8% Functons Networ K-Nearest Neghbor Random Forest (wth 100 trees) Fg.5. Comparson of classfcaton accuraces wth and wthout usng the proposed attrbute extracton algorthm. F-Measure 0,675 0,591 0,589 0,572 Naïve Bayes wthout attrbute extracton J48 Tree (C4.5) wth attrbute extracton appled 0,740 0,753 0,705 0,221 0,245 Functons Networ K-Nearest Neghbor 0,583 Random Forest (wth 100 trees) Fg.6. Comparson of F-measures of the classfers wth and wthout usng the proposed attrbute extracton algorthm. V. CONCLUSION AND FUTURE WORK We ntroduce a new algorthm for attrbute extracton and ts applcaton on web page classfcaton. We use a heterogeneous collecton of web pages crawled from Dmoz s World/Turce as the dataset. Selected attrbutes for classfcaton are the textual content of the web pages, n other words, preprocessed terms accordng to vector-space model. Usng the offered algorthm, hgh dmenson of attrbutes- terms extracted from the pages- are proected onto a new hyper plane havng dmensons equal to the number of classes. Results show that processng tmes of classfcaton algorthms dramatcally decrease wth our reducton algorthm. Ths mostly reles on the number of attrbutes gven to classfers fall off. Accuraces of the classfcaton algorthms also ncrease compared to tests run wthout usng the proposed reducton algorthm, especally on the classfers that fal on unevenly dstrbuted datasets. We plan to compare our attrbute extracton method wth other algorthms le LSA and PCA as a future wor. As we see that our method can also act as a classfer, we program to test our method as a classfer wth other classfcaton algorthms. Furthermore, we have n vew to mprove our method to ft herarchcal classes. REFERENCES [1] S.Dumas and H.Chen, Herarchcal Classfcaton of web Content, In Proc. of SIGIR-00, 23rd ACM Internatonal Conference on Research and Development n Informaton Retreval, Athens, Greece, 2000, pp [2] B.Cho, X.Peng, Dynamc and Herarchcal Classfcaton of Web Pages, Onlne Informaton Revew, vol.28, no.2, pp , [3] D.Mladenc, Turnng Yahoo nto an Automatc Web-Page Classfer, 13 th European Conference on Artfcal Intellgence Young Researcher Paper, [4] N.Holden, A.A.Fretas, Web Page Classfcaton wth an Ant Colony Algorthm, n Parallel Problem Solvng from Nature PPSN VIII, vol.3242, Sprnger Berln-Hedelberg, 2004, pp [5] Y.Ymng, J.O.Pedersen, A comparatve study on feature selecton n text categorzaton, n 14 th Internatonal Conference on Machne Learnng, 1997, pp [6] J.Zhu, H. Wang and X.Zhang, Dscrmnaton-Based Feature Selecton for Multnomal Naïve Bayes Text Classfcaton, n 2006 Proc. ICCPOL2006, LNAI 4285, pp [7] R.Jensen, Q.Shen, Computatonal Intellgence and Feature Selecton Rough and Fuzzy Approaches, IEEE-Wley, New Jersey, 2008, pp [8] Drectory Mozlla Proect, DMOZ, avalable at: [9] K.Pearson, On Lnes and Planes of Closest Ft to Systems of Ponts n Space, Phlosophcal Magazne2(6):pp , [10] K.Fuunaga, Introducton to Statstcal Pattern Recognton, Academc Press, London,1990. [11] T.K. Landauer, S.T. Dumas, Soluton to Plato's Problem: The Latent Semantc Analyss Theory of Acquston, Inducton and Representaton of Knowledge. Psychologcal Revew, 1997, vol:104, no:2, pp [12] A. McCallum, K. Ngam, "A Comparson of Event Models for Nave Bayes Text Classfcaton". In Proc. AAAI/ICML-98 Worshop on Learnng for Text Categorzaton, 1998, pp [13] J.R. Qunlan, C4.5: Programs for Machne Learnng. Morgan Kaufmann Publshers, [14] L.Breman, "Random Forests". Machne Learnng, vol:45, no:1, pp.5-32, [15] Z.Kaban, B.Dr, Recognzng Type and Author n Tursh Documents Usng Artfcal Immune Systems, 16 th Sgnal Processng and ts Applcatons Congress, Turey, 2008.
Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationFeature Reduction and Selection
Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components
More informationLecture 4: Principal components
/3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness
More informationThe Research of Support Vector Machine in Agricultural Data Classification
The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou
More informationTerm Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task
Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationLearning the Kernel Parameters in Kernel Minimum Distance Classifier
Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department
More informationParallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationDetection of an Object by using Principal Component Analysis
Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,
More informationSubspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;
Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features
More informationTN348: Openlab Module - Colocalization
TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages
More informationSupport Vector Machines
/9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.
More informationWeb Document Classification Based on Fuzzy Association
Web Document Classfcaton Based on Fuzzy Assocaton Choochart Haruechayasa, Me-Lng Shyu Department of Electrcal and Computer Engneerng Unversty of Mam Coral Gables, FL 33124, USA charuech@mam.edu, shyu@mam.edu
More informationFace Recognition University at Buffalo CSE666 Lecture Slides Resources:
Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural
More informationType-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data
Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES
More informationSmoothing Spline ANOVA for variable screening
Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory
More informationCSCI 5417 Information Retrieval Systems Jim Martin!
CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne
More informationFEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur
FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents
More informationOutline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:
Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A
More informationBOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET
1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School
More informationMULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION
MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and
More informationAn Anti-Noise Text Categorization Method based on Support Vector Machines *
An Ant-Nose Text ategorzaton Method based on Support Vector Machnes * hen Ln, Huang Je and Gong Zheng-Hu School of omputer Scence, Natonal Unversty of Defense Technology, hangsha, 410073, hna chenln@nudt.edu.cn,
More informationAn Improvement to Naive Bayes for Text Classification
Avalable onlne at www.scencedrect.com Proceda Engneerng 15 (2011) 2160 2164 Advancen Control Engneerngand Informaton Scence An Improvement to Nave Bayes for Text Classfcaton We Zhang a, Feng Gao a, a*
More informationA Binarization Algorithm specialized on Document Images and Photos
A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a
More informationOutline. Type of Machine Learning. Examples of Application. Unsupervised Learning
Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationDeep Classification in Large-scale Text Hierarchies
Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong
More informationOutline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1
4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:
More informationClassification / Regression Support Vector Machines
Classfcaton / Regresson Support Vector Machnes Jeff Howbert Introducton to Machne Learnng Wnter 04 Topcs SVM classfers for lnearly separable classes SVM classfers for non-lnearly separable classes SVM
More informationSupport Vector Machines
Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned
More informationKeywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines
(IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak
More informationImplementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status
Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status
More informationExperiments in Text Categorization Using Term Selection by Distance to Transition Point
Experments n Text Categorzaton Usng Term Selecton by Dstance to Transton Pont Edgar Moyotl-Hernández, Héctor Jménez-Salazar Facultad de Cencas de la Computacón, B. Unversdad Autónoma de Puebla, 14 Sur
More informationMachine Learning 9. week
Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below
More informationFace Recognition Based on SVM and 2DPCA
Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty
More informationA Novel Term_Class Relevance Measure for Text Categorization
A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure
More informationS1 Note. Basis functions.
S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type
More informationMachine Learning: Algorithms and Applications
14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of
More informationA Fast Visual Tracking Algorithm Based on Circle Pixels Matching
A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng
More informationA Fast Content-Based Multimedia Retrieval Technique Using Compressed Data
A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,
More informationAvailable online at Available online at Advanced in Control Engineering and Information Science
Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced
More informationInvestigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers
Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,
More informationDeep Classifier: Automatically Categorizing Search Results into Large-Scale Hierarchies
Deep Classfer: Automatcally Categorzng Search Results nto Large-Scale Herarches Dkan Xng 1, Gu-Rong Xue 1, Qang Yang 2, Yong Yu 1 1 Shangha Jao Tong Unversty, Shangha, Chna {xaobao,grxue,yyu}@apex.sjtu.edu.cn
More informationHuman Face Recognition Using Generalized. Kernel Fisher Discriminant
Human Face Recognton Usng Generalzed Kernel Fsher Dscrmnant ng-yu Sun,2 De-Shuang Huang Ln Guo. Insttute of Intellgent Machnes, Chnese Academy of Scences, P.O.ox 30, Hefe, Anhu, Chna. 2. Department of
More informationAn Image Fusion Approach Based on Segmentation Region
Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua
More informationUnsupervised Learning
Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and
More informationUser Authentication Based On Behavioral Mouse Dynamics Biometrics
User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA
More informationNon-Negative Matrix Factorization and Support Vector Data Description Based One Class Classification
IJCSI Internatonal Journal of Computer Scence Issues, Vol. 9, Issue 5, No, September 01 ISSN (Onlne): 1694-0814 www.ijcsi.org 36 Non-Negatve Matrx Factorzaton and Support Vector Data Descrpton Based One
More informationDetermining the Optimal Bandwidth Based on Multi-criterion Fusion
Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn
More informationAn Efficient Algorithm for PC Purchase Decision System
Proceedngs of the 6th WSAS Internatonal Conference on Instrumentaton, Measurement, Crcuts & s, Hangzhou, Chna, Aprl 15-17, 2007 216 An ffcent Algorthm for PC Purchase Decson Huay Chang Department of Informaton
More informationSpam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection
E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton We-Chh Hsu, Tsan-Yng Yu E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton
More informationX- Chart Using ANOM Approach
ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are
More informationA PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION
1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute
More informationA New Approach For the Ranking of Fuzzy Sets With Different Heights
New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays
More informationBAYESIAN MULTI-SOURCE DOMAIN ADAPTATION
BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,
More informationAutomatic Control of a Digital Reverberation Effect using Hybrid Models
Automatc Control of a Dgtal Reverberaton Effect usng Hybrd Models Emmanoul Theofans Chourdaks 1 and Joshua D. Ress 1 1 Queen Mary Unversty of London, Mle End Road, London E14NS, Unted Kngdom Correspondence
More informationSum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints
Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan
More informationData Mining: Model Evaluation
Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct
More informationA Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems
A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty
More informationClassification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM
Classfcaton of Face Images Based on Gender usng Dmensonalty Reducton Technques and SVM Fahm Mannan 260 266 294 School of Computer Scence McGll Unversty Abstract Ths report presents gender classfcaton based
More informationCLASSIFICATION OF ULTRASONIC SIGNALS
The 8 th Internatonal Conference of the Slovenan Socety for Non-Destructve Testng»Applcaton of Contemporary Non-Destructve Testng n Engneerng«September -3, 5, Portorož, Slovena, pp. 7-33 CLASSIFICATION
More informationWishing you all a Total Quality New Year!
Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma
More informationHelsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)
Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute
More informationSimulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010
Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement
More informationResearch and Application of Fingerprint Recognition Based on MATLAB
Send Orders for Reprnts to reprnts@benthamscence.ae The Open Automaton and Control Systems Journal, 205, 7, 07-07 Open Access Research and Applcaton of Fngerprnt Recognton Based on MATLAB Nng Lu* Department
More informationEdge Detection in Noisy Images Using the Support Vector Machines
Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona
More informationEfficient Segmentation and Classification of Remote Sensing Image Using Local Self Similarity
ISSN(Onlne): 2320-9801 ISSN (Prnt): 2320-9798 Internatonal Journal of Innovatve Research n Computer and Communcaton Engneerng (An ISO 3297: 2007 Certfed Organzaton) Vol.2, Specal Issue 1, March 2014 Proceedngs
More informationCombining The Global and Partial Information for Distance-Based Time Series Classification and Clustering
Paper: Combnng The Global and Partal Informaton for Dstance-Based Tme Seres Hu Zhang, Tu Bao Ho, Mao-Song Ln, and We Huang School of Knowledge Scence, Japan Advanced Insttute of Scence and Technology,
More informationTsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for
More informationSkew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach
Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research
More informationMeta-heuristics for Multidimensional Knapsack Problems
2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,
More informationAn Entropy-Based Approach to Integrated Information Needs Assessment
Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology
More informationA New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1
A New Feature of Unformty of Image Texture Drectons Concdng wth the Human Eyes Percepton Xng-Jan He, De-Shuang Huang, Yue Zhang, Tat-Mng Lo 2, and Mchael R. Lyu 3 Intellgent Computng Lab, Insttute of Intellgent
More informationPRÉSENTATIONS DE PROJETS
PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng
More informationSupervised Nonlinear Dimensionality Reduction for Visualization and Classification
IEEE Transactons on Systems, Man, and Cybernetcs Part B: Cybernetcs 1 Supervsed Nonlnear Dmensonalty Reducton for Vsualzaton and Classfcaton Xn Geng, De-Chuan Zhan, and Zh-Hua Zhou, Member, IEEE Abstract
More information12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification
Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero
More informationLRD: Latent Relation Discovery for Vector Space Expansion and Information Retrieval
LRD: Latent Relaton Dscovery for Vector Space Expanson and Informaton Retreval Techncal Report KMI-06-09 March, 006 Alexandre Gonçalves, Janhan Zhu, Dawe Song, Vctora Uren, Roberto Pacheco In Proc. of
More informationAn Optimal Algorithm for Prufer Codes *
J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,
More informationRecognizing Faces. Outline
Recognzng Faces Drk Colbry Outlne Introducton and Motvaton Defnng a feature vector Prncpal Component Analyss Lnear Dscrmnate Analyss !"" #$""% http://www.nfotech.oulu.f/annual/2004 + &'()*) '+)* 2 ! &
More informationTHE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY
Proceedngs of the 20 Internatonal Conference on Machne Learnng and Cybernetcs, Guln, 0-3 July, 20 THE CONDENSED FUZZY K-NEAREST NEIGHBOR RULE BASED ON SAMPLE FUZZY ENTROPY JUN-HAI ZHAI, NA LI, MENG-YAO
More informationA Multivariate Analysis of Static Code Attributes for Defect Prediction
Research Paper) A Multvarate Analyss of Statc Code Attrbutes for Defect Predcton Burak Turhan, Ayşe Bener Department of Computer Engneerng, Bogazc Unversty 3434, Bebek, Istanbul, Turkey {turhanb, bener}@boun.edu.tr
More informationLearning-Based Top-N Selection Query Evaluation over Relational Databases
Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **
More informationA Probabilistic Approach to Detect Urban Regions from Remotely Sensed Images Based on Combination of Local Features
A Probablstc Approach to Detect Urban Regons from Remotely Sensed Images Based on Combnaton of Local Features Berl Sırmaçek German Aerospace Center (DLR) Remote Sensng Technology Insttute Weßlng, 82234,
More informationBiostatistics 615/815
The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts
More informationNovel Pattern-based Fingerprint Recognition Technique Using 2D Wavelet Decomposition
Mathematcal Methods for Informaton Scence and Economcs Novel Pattern-based Fngerprnt Recognton Technque Usng D Wavelet Decomposton TUDOR BARBU Insttute of Computer Scence of the Romanan Academy T. Codrescu,,
More informationThe Discriminate Analysis and Dimension Reduction Methods of High Dimension
Open Journal of Socal Scences, 015, 3, 7-13 Publshed Onlne March 015 n ScRes. http://www.scrp.org/journal/jss http://dx.do.org/10.436/jss.015.3300 The Dscrmnate Analyss and Dmenson Reducton Methods of
More informationDecision Strategies for Rating Objects in Knowledge-Shared Research Networks
Decson Strateges for Ratng Objects n Knowledge-Shared Research etwors ALEXADRA GRACHAROVA *, HAS-JOACHM ER **, HASSA OUR ELD ** OM SUUROE ***, HARR ARAKSE *** * nsttute of Control and System Research,
More informationUnder-Sampling Approaches for Improving Prediction of the Minority Class in an Imbalanced Dataset
Under-Samplng Approaches for Improvng Predcton of the Mnorty Class n an Imbalanced Dataset Show-Jane Yen and Yue-Sh Lee Department of Computer Scence and Informaton Engneerng, Mng Chuan Unversty 5 The-Mng
More informationCS 534: Computer Vision Model Fitting
CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust
More informationData Mining For Multi-Criteria Energy Predictions
Data Mnng For Mult-Crtera Energy Predctons Kashf Gll and Denns Moon Abstract We present a data mnng technque for mult-crtera predctons of wnd energy. A mult-crtera (MC) evolutonary computng method has
More informationIMAGE FUSION BASED ON EXTENSIONS OF INDEPENDENT COMPONENT ANALYSIS
IMAGE FUSION BASED ON EXTENSIONS OF INDEPENDENT COMPONENT ANALYSIS M Chen a, *, Yngchun Fu b, Deren L c, Qanqng Qn c a College of Educaton Technology, Captal Normal Unversty, Bejng 00037,Chna - (merc@hotmal.com)
More informationExperimental Analysis on Character Recognition using Singular Value Decomposition and Random Projection
Expermental Analyss on Character Recognton usng Sngular Value Decomposton and Random Projecton Manjusha K. 1, Anand Kumar M. 2, Soman K. P. 3 Centre for Excellence n Computatonal Engneerng and Networkng,
More informationUB at GeoCLEF Department of Geography Abstract
UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department
More informationImage Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline
mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and
More informationIncremental Learning with Support Vector Machines and Fuzzy Set Theory
The 25th Workshop on Combnatoral Mathematcs and Computaton Theory Incremental Learnng wth Support Vector Machnes and Fuzzy Set Theory Yu-Mng Chuang 1 and Cha-Hwa Ln 2* 1 Department of Computer Scence and
More informationPCA Based Gait Segmentation
Honggu L, Cupng Sh & Xngguo L PCA Based Gat Segmentaton PCA Based Gat Segmentaton Honggu L, Cupng Sh, and Xngguo L 2 Electronc Department, Physcs College, Yangzhou Unversty, 225002 Yangzhou, Chna 2 Department
More informationClassifying Acoustic Transient Signals Using Artificial Intelligence
Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)
More informationCollaboratively Regularized Nearest Points for Set Based Recognition
Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,
More informationYan et al. / J Zhejiang Univ-Sci C (Comput & Electron) in press 1. Improving Naive Bayes classifier by dividing its decision regions *
Yan et al. / J Zhejang Unv-Sc C (Comput & Electron) n press 1 Journal of Zhejang Unversty-SCIENCE C (Computers & Electroncs) ISSN 1869-1951 (Prnt); ISSN 1869-196X (Onlne) www.zju.edu.cn/jzus; www.sprngerlnk.com
More information