A Taxonomy Fuzzy Filtering Approach
|
|
- Hollie Moore
- 6 years ago
- Views:
Transcription
1 JOURNAL OF AUTOMATIC CONTROL, UNIVERSITY OF BELGRADE, VOL. 13(1):25-29, 2003 A Taxonomy Fuzzy Flterng Approach S. Vrettos and A. Stafylopats Abstract - Our work proposes the use of topc taxonomes as part of a flterng language. Gven a taxonomy, a classfer s traned for each one of ts topcs. The user s able to formulate logcal rules combnng the avalable topcs, e.g. (Topc1 AND Topc2) OR Topc3, n order to flter related documents n a stream. Usng the traned classfers, every document n the stream s assgned a belef value of belongng to the topcs of the flter. These belef values are then aggregated usng logcal operators to yeld the belef to the flter. In our study, Support Vector Machnes and Naïve Bayes classfers were used to provde topc probabltes. Aggregaton of topc probabltes based on fuzzy logc operators was found to mprove flterng performance on the Reuters text corpus, as compared to the use of ther Boolean counterparts. Fnally, we deployed a flterng system on the web usng a sample taxonomy of the Open Drectory Proect. Index Terms- Fuzzy Aggregaton, Taxonomy Flterng, Web Flterng Systems. I. INTRODUCTION The prmary way of nteractvely fndng nformaton on the web s by makng a query n a search engne and then browsng a ranked lst of possbly related web pages. Alternatvely, we can browse a manually organzed topc taxonomy to fnd pages related to the query that we have n mnd. Although web taxonomes may be very large, they cover a small porton of the web relatve to search engnes, prmarly because they rely on human effort. Content-based flterng s the task of analysng a stream of nformaton based on a semantc structure, whch can be automatcally derved or pre-specfed [1],[2]. Text/Hypertext categorzaton promses not only to help mantan updated and large web taxonomes, but to be used n the context of content-based flterng [3]-[8]. The dea s to use topc classfers that have been traned usng the porton correspondng to the well-structured web taxonomy n order to organze the results of a query addressed to the much larger but unclassfed web porton ndexed by a search engne. Bascally, as regards the nterface used to nclude topc nformaton n the query results, t can be topc or lst-orented. In topc-orented nterfaces, results are organzed n a flat or herarchcal taxonomy, whle, n lstorented nterfaces, the orgnal query lst s enrched wth topc meta-data. Our work proposes the use of topc taxonomes as part of a flterng language. The user s able to formulate logcal rules (flters) combnng the avalable topcs, e.g. ( Topc1 AND Topc2) OR Topc3, n order to flter related documents or to provde relevance feedback as well [9]-[10]. School of Electrcal and Computer Engneerng Natonal Techncal Unversty of Athens Zographou, Athens, Greece vrettos@cslab.ntua.gr, andreas@cs.ntua.gr Typcally, classfcaton s a YES/NO assgnment, so the Boolean model s a good canddate for the flterng task. Nevertheless, Boolean flterng provdes no orderng, whch s a drawback to both retreval effectveness and man-machne nteracton. If perfect classfers were avalable, Boolean flterng would be enough. That s because all the true postve documents of the stream and only them would be retreved. In that case, Boolean flterng would yeld recall and precson equal to 1. Unfortunately, no perfect classfers are avalable yet and even the best performng classfers n laboratory text corpora mght have poor results n real, nosy envronments, lke the web. In such cases, rankng accordng to some sutable measure of classfcaton accuracy s able to mprove retreval performance, ether by mprovng recall through the retreval of false negatve documents that were not ncluded n the answer set, or by mprovng precson through the orderng of true postve documents hgher n the rank, above false postve ones. In ths work, Support Vector Machnes (SVM) and Naïve Bayes (NB) classfers are used to provde topc probabltes. Aggregaton of topc probabltes based on fuzzy logc s found to mprove flterng performance on the Reuters text corpus wth respect to Boolean flterng. Fnally, fuzzy aggregaton s embedded n a web flterng system based on a sample taxonomy of the Open Drectory Proect. Support Vector Machnes II. PRODUCING TOPIC PROBABILITIES The SVM model was proposed by Vapnk n 1979 and ganed much popularty n the recent years due to ts strong theoretcal and emprcal ustfcaton [11]. In the smplest lnear form, a SVM s a hyperplane that separates a set of postve examples from a set of negatve examples wth maxmum margn (the mnmal dstance τ from the separatng hyperlane to the closest data pont).a separatng hyperplane s a lnear functon capable of separatng the tranng data n the classfcaton problem wthout error [12]. Suppose that the tranng data consst of m samples (documents) that belong or not to a gven category : (d 1,y 1 ),, (d m,y m ), d R d, y {+1,-1}, whch can be separated by a hyperplane decson functon Dd ( ) = ( wd ) + w (1) wth approprate coeffcents w and w 0. A separatng hyperplane satsfes the constrants that defne the separaton of data samples: y[( w d ) + w ] 1, = 1, K, m (2) It s ntutvely clear that a larger margn corresponds to better generalzaton. maxmzng the margn τ s equvalent to mnmzng the norm of w. An optmal hyperplane s one that satsfes condton (2) above and addtonally mnmzes 0 0
2 26 VRETTOS, S., STAFYLOPATIS, A., A TAXONOMY FUZZY FILTERING APPROACH wth respect to both w and w 0. 2 η ( w) = w (3) The data ponts that exst at the margn, or equvalentlythe data ponts for whch (2) s an equalty, defne the locaton of the decson surface and are called the support vectors. In the case of tranng data that cannot be separated wthout error, t would be desrable to separate the data wth a mnmal number of errors. To do so, we penalze examples that fall on the wrong sde of the decson boundary ntroducng postve slack varables ξ, =1,,m, to quantfy the nonseperable data n the defnng condton of the hyperplane. For a tranng sample d the slack varable ξ s the devaton from the margn border correspondng to the gven category. Slack varables greater than zero correspond to nonseparable ponts, whle slack varables greater than one correspond to msclassfed samples.to releve the problem of nonlnear separablty, a nonlnear mappng of the tranng data nto a hgh dmensonal feature space s usually performed accordng to Cover s theorem on the separablty of patterns [13]. The constraned optmzaton problem s solved usng the method of Lagrange multplers. In ths work, we used the OSU SVM Classfer Matlab Toolbox to tran classfers [14]. For each test example d, an SVM classfer outputs a score that s the dstance of d from the hyperplane learned for separatng postve from negatve examples. The sgn of the score ndcates whether the example s classfed as postve or negatve. In our approach, we want to have a measure of confdence (belef) n the predcton To provde an accurate measure of confdence, a parametrc approach was proposed by Platt for SVM, whch conssts of fndng the parameters of a sgmod functon, mappng the scores nto probablty estmates [15]. Naïve Bayes classfers The decson of whether an unseen document d belongs or not to a category c s based on the estmated belef of d belongng to class c. Bayes theorem can be used to estmate the probablty Pr(c d ), that a document d s n class c. Pr ( ) ( d c ) Pr( c ) Pr c d = (4) c Pr d c Pr c l= 1 ( ) ( ) where Pr(c ) s the pror probablty that a document s n class c and Pr(d c ) s the lkelhood of observng Prˆ c, the estmate of Pr(c ), can be document d n class c. ( ) calculated from the fracton of the tranng documents that s assgned to ths class. The probablty of observng a document lke d n class c s based on the nave assumpton that a word's occurrence n class c s ndependent of the occurrences of the other words. Therefore Pr(d c ) s: n Pr d c = Pr t c, t d (5) ( ) ( ) k = 1 k k where t k represents the k th term of the collecton document. The estmaton of Pr(d c ) s now reduced to the estmaton of Pr(t k c ) (Laplace estmator), whch s the lkelhood of observng t k n class c : l l ( t c ) Pr = n k n 1 + tf ( tk, c ) + tf ( tl, c) where tf(t k, c ) s the number of occurrences of the word t k n category c and n s the number of the terms of the corpus. III. l = 1 FUZZY LOGIC In ths secton, we brefly revew some basc concepts n fuzzy sets and fuzzy logc, that wll be used n the proposed web flterng approach. Let X be a space of obects and x be an element of X. A classcal set A s defned as a collecton of elements x A, such that each x can ether belong or not belong to the set A. Therefore, we can represent a classcal set A by a set of ordered pars (x,0) or (x,1), whch ndcates that x A or x A, respectvely. Extendng the defnton of the classcal set, a fuzzy set s defned as a set of elements that may belong to the set by a membershp degree value between 0 and 1. More formally, a fuzzy set A n X s {,, } defned as a set of ordered pars ( µ ( )) A = x x x A, where µ ( x) s the membershp functon (MF) for the fuzzy set A. Usually, X s referred to as the unverse of dscourse and may consst of dscrete (ordered or unordered) or contnuous spaces. Smlar to classcal set operatons of unon, ntersecton and complement, can be defned for fuzzy sets accordngly. The unon of two fuzzy sets A and B s a fuzzy set C, denoted C = A B or C OR B, whose MF s related to those of A and B by µ x = max µ ( x), µ x = µ x µ x (7) ( ) ( ( )) ( ) ( ) C A B A B The ntersecton of two fuzzy sets A and B s a fuzzy set C, denoted C = A Bor C AND B, whose MF s related to those of A and B by ( x) = mn ( ( x), ( x) ) = ( x) ( x) µ µ µ µ µ (8) C A B A B The complement of fuzzy set A, denoted by A, s defned as µ A( x) = 1 µ A( x) (9) IV. EVALUATION AND RESULTS To evaluate fuzzy aganst Boolean flterng, we used the Reuters corpus [16]. We consder the flat topc taxonomy that conssts of the 10 most frequently assgned topc categores. Usng the labels of the topcs we are able to formulate flters of the form (Earn) AND (Trade), (Acq) OR (Money-Fx), and use them to fnd relevant documents n a stream of data. To specfy the exact flters to use and measure ther effectveness as regards the logcal operators, we consdered the ''ModApte'' splt, a standard commonly used parttonng of the Reuters corpus nto tranng and test sets. A search n the test set of the ''ModApte'' splt yelded 213 documents that belong to 2 or more of the specfed topcs. These documents consttute the set F to be fltered and are mapped to 19 mult-category vectors n the table CM, where cm = 1 mples that category c exsts n multcategory vector m (Table I). (6)
3 JOURNAL OF AUTOMATIC CONTROL, UNIVERSITY OF BELGRADE 27 Table I The category multcategory matrx cm m 1 m m 19 c 1 cm 11 cm 1 cm 1,19 c cm 1 cm cm,19 c 10 cm 10,1 cm 10, cm 10,19 For every mult-category vector m, =1 19, of the flterng set, the logc operator AND s used to combne traned classfers of all categores havng cm = 1, n order to fnd documents n F that belong to all of them. In the same way, the logc operator OR s used to combne traned classfers of all categores havng cm = 1, n order to fnd documents n F that belong to at least one of them. Fnally, the logc operator NOT s used n conuncton wth OR to combne traned classfers of all categores havng cm = 1, n order to fnd documents n F that do not belong to any of them. As a result, we form 19 flters and obtan ther relevant documents n F for every logcal operator. To tran NB classfers, a term-document matrx was created after removng the nfrequent and the most commonly used Englsh words [17]. We reduced dmensonalty, by applyng the Document Frequency (DF) method on the tranng set leavng the 300 most nformatve features [18]. We traned and valdated one two class NB classfer for every topc usng the ''ModApte'' splt, takng as postve examples all the samples that belong to the topc and as negatve examples all the samples that do not. In that way, a document s assgned to a class when ts probablty of belongng to that class s larger than the probablty of belongng to all other classes. In ths work, however, we consdered that d s categorzed under category c when Pr(c d )>h where h s a threshold selected to optmze classfcaton accuracy on the flterng set F. In Table II, the classfcaton accuracy of NB on the flterng/test set s dsplayed. For the tranng of SVM, a term-document matrx was created after removng the nfrequent and the most commonly used Englsh words [17]. To reduce dmensonalty, we appled Prncpal Component Analyss (PCA) on the tranng set leavng the 300 most nformatve features. PCA lead to better classfcaton results than DF on the test set. We traned and valdated one two class SVM classfer for every topc usng the ''ModApte'' splt, takng as postve examples all the samples that belong to the topc and as negatve examples all the samples that do not. To obtan the best possble classfcaton accuracy we optmzed the hyperparameters of non-lnear SVM on the flterng set F. Table II shows the classfcaton accuracy of SVM on the flterng/test set. The output of each SVM was transformed to probablty usng maxmum lkelhood, as descrbed n Secton II. Operators are used to create flters that decde of the membershp of a document to a conuncton, a dsuncton or a negaton of topcs. Table II Classfcaton accuracy on the test/flterng set Topc SVM NB Samples n F Earn Acq Money-fx Crude Gran Trade Interest Shp Wheat Corn Average For every logc operator, we summarze and compare the performance of Boolean and fuzzy flterng over all flters. In the case of Boolean flterng, every document d n F was assgned a crsp value {0,1} dependng on whether t belongs to class c or not. For every flter, the related decsons were aggregated usng Boolean logc. The classc notons of precson (Pr) and recall (Re) are used to measure classfcaton effectveness. For categorzaton nto category c, let tp, fp, tn, fn be the true postve, false postve, true negatve and false negatve documents respectvely. Precson s defned as the estmated probablty that, f a random document d s categorzed under category c, ths decson s correct, that s: tp Pr = (10) tp + fp Analogously, recall s defned as the estmated probablty that, f a random document d should be categorzed under category c, ths decson s actually taken, that s: tp Re = (11) tp + fn Precson and recall are related n an nverse manner. Hgh levels of precson can be acheved by keepng recall low and vce versa. Because no orderng s avalable, Boolean flterng was evaluated by averagng recall and precson of a logcal operator over all flters (Table III). On the contrary, fuzzy flterng provdes orderng, so a standard recall-precson dagram of a logcal operator can be constructed [19]. In all cases, fuzzy aggregaton succeeded n mprovng retreval performance (Fg. 1 and 2, Table III). In the case of OR and NOT operators the mprovement was due to hgher precson. Ths means that true postve documents are placed hgh n the ranked answer set. In the case of AND operators fuzzy aggregaton managed to mprove both recall and precson. Table III Average recall and precson for all flters and operators SVM NB Operator Precson Recall Precson Recall AND OR NOT
4 28 VRETTOS, S., STAFYLOPATIS, A., A TAXONOMY FUZZY FILTERING APPROACH pages. Queres are forwarded to ODP and the results are parsed and gven topc probabltes accordng to the taxonomy. Fnally, these probabltes are aggregated accordng to the flters and the results are fed back to the clent. Clent Sded Flterng Server Sded Flterng Clent User Clent User Server Classfers Interface Classfers Interface Indexng Server Indexng Fg. 1. Results for NB classfers Taxonomy: Bookmarks Search Engne-(s) Taxonomy: Drectory Search Engne-(s) Fg. 3. Clent vs server sded flterng systems CLIENTS (Html Java Applcatons Applets) SERVLETS RMI HTML WRAPPER TAXONOMY BUILDER QUERY- FILTER TCP/IP SEARCH ENGINES Fg. 4. The archtecture of the flterng system Fg. 2. Results for SVM classfer V. AN EXAMPLE FILTERING SYSTEM Generally, an Informaton Flterng system can be on the server or on the clent sde (Fg. 3). The proposed fuzzy flterng approach can be used both ways. In clent sded flterng, the taxonomy may be n the form of user s bookmarks, for example. The system creates the topc classfers that the user uses to flter the results of a search engne accordng to the descrbed method. In server sded flterng, the taxonomy may be n the form of a web drectory. In order to provde an example applcaton, we have developed a server sded flterng system on the web usng the Open Drectory Proect (ODP) [20]. The system bascally conssts of an Html Wrapper ( and a Taxonomy Bulder located on a server (Fg. 4). In that way we can take advantage of powerful servers for the tranng of classfers and the parsng of large quanttes of html pages. Clents are n the form of html pages, applets and ava applcatons that communcate wth the server usng servlets or RMI (Remote Method Invocaton). We created NB classfers for 10 topcs related to the Computers/Artfcal Intellgence drectory of the ODP, usng about 40 web pages as tranng examples for each topc. Through the nterface (html clent), the user s able to create queres and flters n order to retreve and flter web VI. CONCLUSIONS We have descrbed and evaluated a framework that can take advantage of a topc taxonomy as part of a flterng language. We used SVM and NB classfers on the Reuters corpus to create flters that decde of the membershp of a document to a conuncton, dsuncton or negaton of topcs. Fuzzy aggregaton of the estmated topc probabltes proved to exhbt superor performance than Boolean aggregaton for all knds of flters. Fnally, we deployed a flterng system based on ths framework usng a sample taxonomy of the Open Drectory Proect. REFERENCES [1] 1. B.U. Oztekn, G. Karyps and V. Kumar, Expert Agreement and Content Based Rerankng n Meta Search Envronment usng Mearf, Proc. WWW2002, Honolulu, May [2] S. Vrettos and A. Stafylopats, A Fuzzy Rule-Based Agent for Web Retreval Flterng, Proc. of Frst Asa-Pasfc Conference on Web Intellgence (WI 2001), Maebash Cty, Japan, October 2001, pp [3] Y. Yang, An evaluaton of statstcal approaches to text categorzaton, Journal of Informaton retreval, vol. 1,1999, pp [4] Y. Yang and X. Lu, A re-examnaton of text categorzaton methods, Proc. of SIGIR-99, 22nd ACM Internatonal Conference on Research and Development n Informaton Retreval, [5] H. Chen and S.T. Dumas, Brngng order to the Web: Automatcally categorzng search results, Proc. of ACM SIGCHI Conference on Human Factors In Computng Systems(CHI), 2000, pp
5 JOURNAL OF AUTOMATIC CONTROL, UNIVERSITY OF BELGRADE 29 [6] S. Dumas, E. Cutrell and H. Chen, Optmzng Search by Showng Results In Context,, Proc. of SIGCHI,Seattle, WA, USA,March 31,2001, pp [7] C. Chekur, M. Goldwasser, P. Raghavan and E. Upfal, Web search usng automated classfcaton, Proc. of Sxth Internatonal World Wde Web Conference, Poster POS725,Santa Clara, Calforna,Aprl [8] S. Chakrabart, B. Dom, R. Agrawal and P. Raghavan, Scalable feature selecton, classfcaton and sgnature generaton for organzng large text databases nto herarchcal topc taxonomes, The VLDB Journal, vol. 7, 1998, pp [9] Vrettos, S. and Stafylopats A, A Fuzzy logc Framework for Web Page Flterng, Proc. 6th Semnar on Neural Networks Applcatons n Electrcal Engneerng NEUREL 2002, September 2002, pp [10] M.A. Hearst, Improvng Full-Text Precson on Short Queres usng Smple Constrants, Proc. of SDAIR,Las Vegas, NV, Aprl,1996. [11] Susan Dumas, John Platt, Davd Heckerman, and Mehran Saham, Inductve learnng algorthms and representatons for text categorzaton, Proc. of ACM Conference on Informaton and Knowledge Management, 1998, pp [12] Vladmr Cherkassky and Flp Muler, Learnng from Data, (Jown Wley & Sons, 1998). [13] Smon Haykn, Neural Networks a Comprehensve Foundaton (Prentce Hall, 1999). [14] OSU SVM Classfer Matlab Toolbox (ver 3.00) [15] John C. Platt, Probabltes for SV Machnes, Advances n Large Margn Classfers, (The MIT Press, Cambrdge, 2000). [16] The Reuters collecton: [17] 17. T. Joachms, A probablstc Analyss of the Roccho Algorthm wth TFIDF for Text Categorzaton, Proc. of the 14th Internatonal Conference on Machne Learnng ICML97, 1997, pp [18] Ymng Yang and Jan O. Pedersen, A Comparatve Study on Feature Selecton n Text Categorzaton, Proc. of ICML-97, 14th Internatonal Conference on Machne Learnng, [19] Rcardo Baeza-Yates and Berther Rbero Neto, Modern Informaton Retreval, (ACM Press Books, 1999). [20] Open Drectory Proect. [
The Research of Support Vector Machine in Agricultural Data Classification
The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou
More informationClassification / Regression Support Vector Machines
Classfcaton / Regresson Support Vector Machnes Jeff Howbert Introducton to Machne Learnng Wnter 04 Topcs SVM classfers for lnearly separable classes SVM classfers for non-lnearly separable classes SVM
More informationSupport Vector Machines
/9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.
More informationFeature Reduction and Selection
Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components
More informationMachine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)
Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes
More informationSupport Vector Machines
Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned
More informationLearning the Kernel Parameters in Kernel Minimum Distance Classifier
Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department
More informationOutline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1
4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:
More informationTerm Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task
Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto
More informationSum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints
Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan
More informationExperiments in Text Categorization Using Term Selection by Distance to Transition Point
Experments n Text Categorzaton Usng Term Selecton by Dstance to Transton Pont Edgar Moyotl-Hernández, Héctor Jménez-Salazar Facultad de Cencas de la Computacón, B. Unversdad Autónoma de Puebla, 14 Sur
More informationCLASSIFICATION OF ULTRASONIC SIGNALS
The 8 th Internatonal Conference of the Slovenan Socety for Non-Destructve Testng»Applcaton of Contemporary Non-Destructve Testng n Engneerng«September -3, 5, Portorož, Slovena, pp. 7-33 CLASSIFICATION
More informationPerformance Evaluation of Information Retrieval Systems
Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence
More informationAn Entropy-Based Approach to Integrated Information Needs Assessment
Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology
More informationTN348: Openlab Module - Colocalization
TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages
More informationEdge Detection in Noisy Images Using the Support Vector Machines
Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona
More informationKeywords - Wep page classification; bag of words model; topic model; hierarchical classification; Support Vector Machines
(IJCSIS) Internatonal Journal of Computer Scence and Informaton Securty, Herarchcal Web Page Classfcaton Based on a Topc Model and Neghborng Pages Integraton Wongkot Srura Phayung Meesad Choochart Haruechayasak
More informationAnnouncements. Supervised Learning
Announcements See Chapter 5 of Duda, Hart, and Stork. Tutoral by Burge lnked to on web page. Supervsed Learnng Classfcaton wth labeled eamples. Images vectors n hgh-d space. Supervsed Learnng Labeled eamples
More informationSupport Vector Machines. CS534 - Machine Learning
Support Vector Machnes CS534 - Machne Learnng Perceptron Revsted: Lnear Separators Bnar classfcaton can be veed as the task of separatng classes n feature space: b > 0 b 0 b < 0 f() sgn( b) Lnear Separators
More informationA New Approach For the Ranking of Fuzzy Sets With Different Heights
New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays
More informationClassifying Acoustic Transient Signals Using Artificial Intelligence
Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)
More informationIncremental Learning with Support Vector Machines and Fuzzy Set Theory
The 25th Workshop on Combnatoral Mathematcs and Computaton Theory Incremental Learnng wth Support Vector Machnes and Fuzzy Set Theory Yu-Mng Chuang 1 and Cha-Hwa Ln 2* 1 Department of Computer Scence and
More informationBOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET
1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd
More informationOptimizing Document Scoring for Query Retrieval
Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng
More informationFace Recognition University at Buffalo CSE666 Lecture Slides Resources:
Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationAn Anti-Noise Text Categorization Method based on Support Vector Machines *
An Ant-Nose Text ategorzaton Method based on Support Vector Machnes * hen Ln, Huang Je and Gong Zheng-Hu School of omputer Scence, Natonal Unversty of Defense Technology, hangsha, 410073, hna chenln@nudt.edu.cn,
More informationInvestigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers
Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationContent Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationCAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University
CAN COMPUTERS LEARN FASTER? Seyda Ertekn Computer Scence & Engneerng The Pennsylvana State Unversty sertekn@cse.psu.edu ABSTRACT Ever snce computers were nvented, manknd wondered whether they mght be made
More information12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification
Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero
More informationRelevance Feedback Document Retrieval using Non-Relevant Documents
Relevance Feedback Document Retreval usng Non-Relevant Documents TAKASHI ONODA, HIROSHI MURATA and SEIJI YAMADA Ths paper reports a new document retreval method usng non-relevant documents. From a large
More informationParallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationSVM-based Learning for Multiple Model Estimation
SVM-based Learnng for Multple Model Estmaton Vladmr Cherkassky and Yunqan Ma Department of Electrcal and Computer Engneerng Unversty of Mnnesota Mnneapols, MN 55455 {cherkass,myq}@ece.umn.edu Abstract:
More information(1) The control processes are too complex to analyze by conventional quantitative techniques.
Chapter 0 Fuzzy Control and Fuzzy Expert Systems The fuzzy logc controller (FLC) s ntroduced n ths chapter. After ntroducng the archtecture of the FLC, we study ts components step by step and suggest a
More informationCSCI 5417 Information Retrieval Systems Jim Martin!
CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne
More informationDeep Classification in Large-scale Text Hierarchies
Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong
More informationOutline. Type of Machine Learning. Examples of Application. Unsupervised Learning
Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton
More informationFor instance, ; the five basic number-sets are increasingly more n A B & B A A = B (1)
Secton 1.2 Subsets and the Boolean operatons on sets If every element of the set A s an element of the set B, we say that A s a subset of B, or that A s contaned n B, or that B contans A, and we wrte A
More informationA NOTE ON FUZZY CLOSURE OF A FUZZY SET
(JPMNT) Journal of Process Management New Technologes, Internatonal A NOTE ON FUZZY CLOSURE OF A FUZZY SET Bhmraj Basumatary Department of Mathematcal Scences, Bodoland Unversty, Kokrajhar, Assam, Inda,
More informationMachine Learning. Topic 6: Clustering
Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess
More informationSmoothing Spline ANOVA for variable screening
Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory
More informationBAYESIAN MULTI-SOURCE DOMAIN ADAPTATION
BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,
More informationMachine Learning: Algorithms and Applications
14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of
More informationA Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems
A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty
More informationBiostatistics 615/815
The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts
More informationThe Greedy Method. Outline and Reading. Change Money Problem. Greedy Algorithms. Applications of the Greedy Strategy. The Greedy Method Technique
//00 :0 AM Outlne and Readng The Greedy Method The Greedy Method Technque (secton.) Fractonal Knapsack Problem (secton..) Task Schedulng (secton..) Mnmum Spannng Trees (secton.) Change Money Problem Greedy
More informationSubspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;
Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features
More informationFace Recognition Method Based on Within-class Clustering SVM
Face Recognton Method Based on Wthn-class Clusterng SVM Yan Wu, Xao Yao and Yng Xa Department of Computer Scence and Engneerng Tong Unversty Shangha, Chna Abstract - A face recognton method based on Wthn-class
More informationReliable Negative Extracting Based on knn for Learning from Positive and Unlabeled Examples
94 JOURNAL OF COMPUTERS, VOL. 4, NO. 1, JANUARY 2009 Relable Negatve Extractng Based on knn for Learnng from Postve and Unlabeled Examples Bangzuo Zhang College of Computer Scence and Technology, Jln Unversty,
More informationUsing Neural Networks and Support Vector Machines in Data Mining
Usng eural etworks and Support Vector Machnes n Data Mnng RICHARD A. WASIOWSKI Computer Scence Department Calforna State Unversty Domnguez Hlls Carson, CA 90747 USA Abstract: - Multvarate data analyss
More informationUser Authentication Based On Behavioral Mouse Dynamics Biometrics
User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA
More informationCS47300: Web Information Search and Management
CS47300: Web Informaton Search and Management Prof. Chrs Clfton 15 September 2017 Materal adapted from course created by Dr. Luo S, now leadng Albaba research group Retreval Models Informaton Need Representaton
More informationUB at GeoCLEF Department of Geography Abstract
UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department
More informationSelecting Query Term Alterations for Web Search by Exploiting Query Contexts
Selectng Query Term Alteratons for Web Search by Explotng Query Contexts Guhong Cao Stephen Robertson Jan-Yun Ne Dept. of Computer Scence and Operatons Research Mcrosoft Research at Cambrdge Dept. of Computer
More informationSpam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection
E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton We-Chh Hsu, Tsan-Yng Yu E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton
More informationA Binarization Algorithm specialized on Document Images and Photos
A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a
More informationNUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS
ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana
More informationCS 534: Computer Vision Model Fitting
CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust
More informationTsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance
Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for
More informationImpact of a New Attribute Extraction Algorithm on Web Page Classification
Impact of a New Attrbute Extracton Algorthm on Web Page Classfcaton Gösel Brc, Banu Dr, Yldz Techncal Unversty, Computer Engneerng Department Abstract Ths paper ntroduces a new algorthm for dmensonalty
More informationSolving two-person zero-sum game by Matlab
Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by
More informationCS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 15
CS434a/541a: Pattern Recognton Prof. Olga Veksler Lecture 15 Today New Topc: Unsupervsed Learnng Supervsed vs. unsupervsed learnng Unsupervsed learnng Net Tme: parametrc unsupervsed learnng Today: nonparametrc
More informationUnsupervised Learning
Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and
More informationWeb Document Classification Based on Fuzzy Association
Web Document Classfcaton Based on Fuzzy Assocaton Choochart Haruechayasa, Me-Lng Shyu Department of Electrcal and Computer Engneerng Unversty of Mam Coral Gables, FL 33124, USA charuech@mam.edu, shyu@mam.edu
More informationGSLM Operations Research II Fall 13/14
GSLM 58 Operatons Research II Fall /4 6. Separable Programmng Consder a general NLP mn f(x) s.t. g j (x) b j j =. m. Defnton 6.. The NLP s a separable program f ts objectve functon and all constrants are
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationCHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION
48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue
More informationDiscriminative classifiers for object classification. Last time
Dscrmnatve classfers for object classfcaton Thursday, Nov 12 Krsten Grauman UT Austn Last tme Supervsed classfcaton Loss and rsk, kbayes rule Skn color detecton example Sldng ndo detecton Classfers, boostng
More informationTaxonomy of Large Margin Principle Algorithms for Ordinal Regression Problems
Taxonomy of Large Margn Prncple Algorthms for Ordnal Regresson Problems Amnon Shashua Computer Scence Department Stanford Unversty Stanford, CA 94305 emal: shashua@cs.stanford.edu Anat Levn School of Computer
More informationFuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches
Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of
More informationDeep Classifier: Automatically Categorizing Search Results into Large-Scale Hierarchies
Deep Classfer: Automatcally Categorzng Search Results nto Large-Scale Herarches Dkan Xng 1, Gu-Rong Xue 1, Qang Yang 2, Yong Yu 1 1 Shangha Jao Tong Unversty, Shangha, Chna {xaobao,grxue,yyu}@apex.sjtu.edu.cn
More informationUsing Ambiguity Measure Feature Selection Algorithm for Support Vector Machine Classifier
Usng Ambguty Measure Feature Selecton Algorthm for Support Vector Machne Classfer Saet S.R. Mengle Informaton Retreval Lab Computer Scence Department Illnos Insttute of Technology Chcago, Illnos, U.S.A
More informationHermite Splines in Lie Groups as Products of Geodesics
Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the
More informationA Novel Term_Class Relevance Measure for Text Categorization
A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure
More informationTraining of Kernel Fuzzy Classifiers by Dynamic Cluster Generation
Tranng of Kernel Fuzzy Classfers by Dynamc Cluster Generaton Shgeo Abe Graduate School of Scence and Technology Kobe Unversty Nada, Kobe, Japan abe@eedept.kobe-u.ac.jp Abstract We dscuss kernel fuzzy classfers
More informationImage Representation & Visualization Basic Imaging Algorithms Shape Representation and Analysis. outline
mage Vsualzaton mage Vsualzaton mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and Analyss outlne mage Representaton & Vsualzaton Basc magng Algorthms Shape Representaton and
More informationLoad-Balanced Anycast Routing
Load-Balanced Anycast Routng Chng-Yu Ln, Jung-Hua Lo, and Sy-Yen Kuo Department of Electrcal Engneerng atonal Tawan Unversty, Tape, Tawan sykuo@cc.ee.ntu.edu.tw Abstract For fault-tolerance and load-balance
More informationIntra-Parametric Analysis of a Fuzzy MOLP
Intra-Parametrc Analyss of a Fuzzy MOLP a MIAO-LING WANG a Department of Industral Engneerng and Management a Mnghsn Insttute of Technology and Hsnchu Tawan, ROC b HSIAO-FAN WANG b Insttute of Industral
More informationImplementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status
Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status
More informationAn Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices
Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal
More informationClassification and clustering using SVM
Lucan Blaga Unversty of Sbu Hermann Oberth Engneerng Faculty Computer Scence Department Classfcaton and clusterng usng SVM nd PhD Report Thess Ttle: Data Mnng for Unstructured Data Author: Danel MORARIU,
More informationAutomatic Control of a Digital Reverberation Effect using Hybrid Models
Automatc Control of a Dgtal Reverberaton Effect usng Hybrd Models Emmanoul Theofans Chourdaks 1 and Joshua D. Ress 1 1 Queen Mary Unversty of London, Mle End Road, London E14NS, Unted Kngdom Correspondence
More informationFuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System
Fuzzy Modelng of the Complexty vs. Accuracy Trade-off n a Sequental Two-Stage Mult-Classfer System MARK LAST 1 Department of Informaton Systems Engneerng Ben-Guron Unversty of the Negev Beer-Sheva 84105
More informationImproving Web Image Search using Meta Re-rankers
VOLUME-1, ISSUE-V (Aug-Sep 2013) IS NOW AVAILABLE AT: www.dcst.com Improvng Web Image Search usng Meta Re-rankers B.Kavtha 1, N. Suata 2 1 Department of Computer Scence and Engneerng, Chtanya Bharath Insttute
More informationA Modified Median Filter for the Removal of Impulse Noise Based on the Support Vector Machines
A Modfed Medan Flter for the Removal of Impulse Nose Based on the Support Vector Machnes H. GOMEZ-MORENO, S. MALDONADO-BASCON, F. LOPEZ-FERRERAS, M. UTRILLA- MANSO AND P. GIL-JIMENEZ Departamento de Teoría
More informationData Mining: Model Evaluation
Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct
More informationType-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data
Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES
More information2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements
Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.
More informationArabic Text Classification Using N-Gram Frequency Statistics A Comparative Study
Arabc Text Classfcaton Usng N-Gram Frequency Statstcs A Comparatve Study Lala Khresat Dept. of Computer Scence, Math and Physcs Farlegh Dcknson Unversty 285 Madson Ave, Madson NJ 07940 Khresat@fdu.edu
More informationPruning Training Corpus to Speedup Text Classification 1
Prunng Tranng Corpus to Speedup Text Classfcaton Jhong Guan and Shugeng Zhou School of Computer Scence, Wuhan Unversty, Wuhan, 430079, Chna hguan@wtusm.edu.cn State Key Lab of Software Engneerng, Wuhan
More informationEfficient Text Classification by Weighted Proximal SVM *
Effcent ext Classfcaton by Weghted Proxmal SVM * Dong Zhuang 1, Benyu Zhang, Qang Yang 3, Jun Yan 4, Zheng Chen, Yng Chen 1 1 Computer Scence and Engneerng, Bejng Insttute of echnology, Bejng 100081, Chna
More informationCompiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz
Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster
More informationModule Management Tool in Software Development Organizations
Journal of Computer Scence (5): 8-, 7 ISSN 59-66 7 Scence Publcatons Management Tool n Software Development Organzatons Ahmad A. Al-Rababah and Mohammad A. Al-Rababah Faculty of IT, Al-Ahlyyah Amman Unversty,
More informationLocal Quaternary Patterns and Feature Local Quaternary Patterns
Local Quaternary Patterns and Feature Local Quaternary Patterns Jayu Gu and Chengjun Lu The Department of Computer Scence, New Jersey Insttute of Technology, Newark, NJ 0102, USA Abstract - Ths paper presents
More informationLECTURE : MANIFOLD LEARNING
LECTURE : MANIFOLD LEARNING Rta Osadchy Some sldes are due to L.Saul, V. C. Raykar, N. Verma Topcs PCA MDS IsoMap LLE EgenMaps Done! Dmensonalty Reducton Data representaton Inputs are real-valued vectors
More informationIssues and Empirical Results for Improving Text Classification
Issues and Emprcal Results for Improvng Text Classfcaton Youngoong Ko 1 and Jungyun Seo 2 1 Dept. of Computer Engneerng, Dong-A Unversty, 840 Hadan 2-dong, Saha-gu, Busan, 604-714, Korea yko@dau.ac.kr
More informationAn Evaluation of Divide-and-Combine Strategies for Image Categorization by Multi-Class Support Vector Machines
An Evaluaton of Dvde-and-Combne Strateges for Image Categorzaton by Mult-Class Support Vector Machnes C. Demrkesen¹ and H. Cherf¹, ² 1: Insttue of Scence and Engneerng 2: Faculté des Scences Mrande Galatasaray
More information