SUPPORT VECTOR MACHINES FOR CLASSIFICATION OF MALIGNANT AND BENIGN LESIONS. Anatoli Nachev, Mairead Hogan

Size: px
Start display at page:

Download "SUPPORT VECTOR MACHINES FOR CLASSIFICATION OF MALIGNANT AND BENIGN LESIONS. Anatoli Nachev, Mairead Hogan"

Transcription

1 311 ITHEA SUPPORT VECTOR MACHINES FOR CLASSIFICATION OF MALIGNANT AND BENIGN LESIONS Anatol Nachev, Maread Hogan Abstract: Ths paper presents an exploratory study of the effectveness of support vector machnes used as a tool for computer-aded breast cancer dagnoss. We explore the dscrmnatory power of heterogeneous mammographc and sonographc descrptors n solvng the classfcaton task. Varous feature selecton technques were tested to fnd a set of descrptors that outperforms those from smlar studes. We also explored how choce of the SVM kernel functon and model parameters affect ts predctve abltes. The kernels explored were lnear, radal bass functon, polynomal, and sgmod. The model performance was estmated by ROC analyss and metrcs, such as true and false postve rates, maxmum accuracy, area under the ROC curve, partal area under the ROC curve wth senstvty above 90%, and specfcty at 98% senstvty. Partcular attenton was pad to the latter two as lack of specfcty causes unnecessary surgcal bopses. Experments regstered that an approprate reducton of varables can greatly mprove the predctve power of the model, as long as the choce of the kernel affects the model performance margnally. We also found that the SVM s superor to the common classfcaton technque used n the feld - MLP neural networks. Keywords: data mnng, support vector machnes, heterogeneous data; breast cancer dagnoss, computer aded dagnoss. ACM Classfcaton Keywords: I.5.2- Computng Methodologes - Pattern Recognton Desgn Methodology - Classfer desgn and evaluaton. Introducton A proper treatment of breast cancer dsease requres tmely, relable, and accurate dagnoss, whch allows radologsts and physcans to dfferentate between bengn and malgnant lesons. Many computer-aded detecton/dagnoss (CAD) tools currently support medcal practces by capturng knowledge from prevous cases and applyng that knowledge to the new cases. CAD s a typcal machne-learnng problem, whch has been dealt wth by varous data mnng technques and tools such as lnear dscrmnant analyss (LDA), logstc regresson analyss (LRA), multlayer perceptons (MLP), etc. [Chen et al., 2009].

2 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 312 Most of the current mplementatons tend to use only one nformaton source, usually mammographc data n the form of data descrptors defned by the Breast Imagng Reportng and Data System (BI-RADS) lexcon, developed by the Amercan College of Radology (ACR) n order to standardze the mammographc language and nterpretatons, and to facltate communcaton between clncans [BI-RADS, 2003], [Kopans, 1992]. Jesneck et al. [2007] have used a novel combnaton of BI-RADS mammographc and sonographc descrptors and some proposed by Stavros et al. [1995] n order to buld a predctve model based on MLP, whch shows superor characterstcs to those that use one data source. Our study takes that approach, but nvestgate another predctve technque - support vector machnes (SVM). We also address the problem of hgh false postve rate of ndcaton for bopsy (specfcty rate), whch causes unnecessary surgcal bopses, lowers the effcency of the dagnoss, exposes patents to dscomfort, and creates fnancal burden as procedures cost thousands of euros each [Lacey et al., 2002]. Further to the study of Jesneck et al. [2007] who used a set of fourteen descrptors to tran and test a MLP neural network, we explore the dscrmnatory power of all descrptors n order to seek alternatve sets that when appled to SVM can ensure even hgher senstvty and specfcty. The paper s organzed as follows: Secton 2 provdes a bref overvew of the support vector machnes used as data mnng tools; Secton 3 ntroduces the dataset used n ths study and dscusses varable selecton as part of the data preprocessng; Secton 4 presents and dscuses results obtaned from experments; and Secton 5 gves the conclusons. Support Vector Machnes Support vector machnes are common machne learnng technques. They belong to the famly of generalzed lnear models, whch acheve a classfcaton or regresson decson based on the value of the lnear combnaton of nput features. Usng hstorcal data along wth supervsed learnng algorthms, SVM generate mathematcal functons to map nput varables to desred outputs for classfcaton or regresson predcton problems. SVM, orgnally ntroduced by Vapnk [1995], provde a new approach to the problem of pattern recognton wth clear connectons to the underlyng statstcal learnng theory. They dffer radcally from comparable approaches such as neural networks because SVM tranng always fnds a global mnmum n contrast to the neural networks. SVM can be formalzed as follows. Tranng data s a set of ponts of the form, (1)

3 313 ITHEA where the c s ether 1 or -1, ndcatng the class to whch the pont x belongs. Each data pont x s a p-dmensonal real vector. Durng tranng a lnear SVM constructs a p-1- dmensonal hyperplane that separates the ponts nto two classes (see Fgure 1). Any hyperplane can be represented by: w x b = 0 where w s a normal vector and denotes dot product. Among all possble hyperplanes that mght classfy the data, SVM selects one wth maxmal dstance (margn) to the nearest data ponts (support vectors). When the classes are not lnearly separable (there s no hyperplane that can splt the two classes), a varant of SVM, called soft-margn SVM, chooses a hyperplane that splts the ponts as cleanly as possble, whle stll maxmzng the dstance to the nearest cleanly splt examples. The method ntroduces slack varables,, whch measure the degree of msclassfcaton of the datum x. Soft-margn SVM penalzes msclassfcaton errors and employs a parameter (the soft-margn constant C) to control the cost of msclassfcaton. Tranng a lnear SVM classer solves the constraned optmzaton problem (2). mn w, b, k 1 2 w s. t. 2 C n 1 w x b 1 In dual form the optmzaton problem can be represented by (3) (2) mn 1 2 n n y y x x j j j 1 j1 1 n (3) s. t. 0 C, c n 1 0 The resultng decson functon f ( x) w x b has a weght vector n w k k yk x 1 k. Data ponts x for whch 0 are called support vectors, snce they unquely defne the maxmum margn hyperplane. Maxmzng the margn allows one to mnmze bounds on generalzaton error. If every dot product s replaced by a non-lnear kernel functon, t transforms the feature space nto a hgher-dmensonal one, thus though the classfer s a hyperplane n the hgh-dmensonal feature space (see Fgure 2). The resultng classfer fts the maxmummargn hyperplane n the transformed feature space. The kernel functon can be defned as (4)

4 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 314 where (x) maps the vector x to some other Eucldean space. The dot product x x j n the formulae above s replaced by k(x, x j ) so that the SVM optmzaton problem n ts dual form can be redefned as: maxmze (n ) 1 L ~ ( ) j y y jk( x, x j ), s. t. y 0 ; 0 for all 2 j 1 N (5) A non-lnear SVM s largely characterzed by the choce of ts kernel, and SVMs thus lnk the problems they are desgned for wth a large body of exstng work on kernel-based methods. Some common kernels functons nclude: Lnear kernel: k(x,x ) = (xx ) d Polynomal kernel: k ( x, x ) ( sx x c) 2 RBF kernel: k( x, x) exp( ( x x) ) Sgmod kernel: k( x, x ) tanh( s( x x) c) Once the kernel s fxed, SVM classfers have few user-chosen parameters. The best choce of kernel for a gven problem s stll a research ssue. Because the sze of the margn does not depend on the data dmenson, SVM are robust wth respect to data wth hgh nput dmenson. However, SVM are senstve to the presence of outlers, due to the regularzaton term for penalzng msclassfcaton (whch depends on the choce of C). The SVM algorthm requres O(n 2 ) storage and O(n 3 ) to learn. The SVM method can also be appled to the case of regresson. A verson of SVM for regresson, called support vector regresson (SVR), was proposed by Drucker et al. [1997]. The basc dea of SVR s that a non-lnear functon learns by a lnear learnng method n a kernel-nduced hgher dmensonal space. Smlarly to how SVM classfcaton gnores data ponts that are not support vectors, the SVR depend on a small subset of tranng data ponts. The SVM s major advantage les wth ther ablty to map varables onto an extremely hgh feature space. Ths, n essence facltates a means for the exploraton of nonlnear kernelbased classfers [Oladunn and Snghal, 2009; Burges, 1998], however, t has been dscovered they do not favour large datasets, due to the demands mposed on vrtual memory, and the tranng complexty resultant from the use of such a scaled collecton of data [Horng et al., 2010]. Work from Fe et al. [2008] hghlghted three crucal problems n the use of support vector machnes. These are attanng the optmal nput subset, correct kernel functon, and the optmal parameters of the selected kernel, all of whch are prme consderatons wthn ths study.

5 315 ITHEA Fg. 1. Maxmum-margn hyperplane for a SVM traned wth samples from two classes. Samples on the margn are support vectors Fg. 2. Kernel functon: a lnearly nseparable nput space can be mapped to a lnearly separable hgher-dmmensonal space Dataset and Preprocessng Ths study uses a dataset that contans data from physcal examnaton of patents, ncludng mammographc and sonographc examnatons, famly hstory of breast cancer, and personal hstory of breast malgnancy, all collected at Duke Unversty Medcal Centre [Jesneck et al., 2007]. Samples ncluded n the dataset are those selected for bopsy only f the lesons corresponded to sold masses on sonograms and f both mammographc and sonographc mages taken before the bopsy were avalable for revew. Data contan 803 samples, 296 of whch are malgnant and 507 bengn. Out of 39 descrptors, 13 are mammographc BI-RADS, 13 sonographc BI-RADS, 6 sonographc suggested by Stavros et al. [1995], 4 sonographc mass descrptors, and 3 patent hstory features. There s also a class label, -1 and 1, that ndcates f a sample s malgnant or bengn. The data features are as follows: mass sze, parenchyma densty, mass margn, mass shape, mass densty, calcfcaton number of partcles, calcfcaton dstrbuton, calcfcaton descrpton, archtectural dstorton, assocated fndngs, specal cases (as defned by the BI-RADS lexcon [BI-RADS, 2003]: asymmetrc tubular structure, ntramammary lymph node, global asymmetry, and focal asymmetry), comparson wth fndngs at pror examnaton, and change n mass sze. The sonographc features are radal dameter, antradal dameter, anteroposteror dameter, background tssue echo texture, mass shape, mass orentaton, mass margn, leson boundary, echo pattern, posteror acoustc features, calcfcatons wthn mass, specal cases (as defned by the BI- RADS lexcon: clustered mcrocysts, complcated cysts, mass n or on skn, foregn body,

6 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 316 ntramammary lymph node, and axllary lymph node), and vascularty. The sx features suggested by Stavros [Stavros et al., 1995] are mass shape, mass margn, acoustc transmsson, thn echo pseudocapsule, mass echogencty, and calcfcatons. The four other sonographc mass descrptors are edge shadow, cystc component, and two mammographc BI-RADS descrptors appled to sonography mass shape (oval and lobulated are separate descrptors) and mass margn (replaces sonographc descrptor angular wth obscured). The three patent hstory features were famly hstory, patent age, and ndcaton for sonography [Jesneck et al., 2007]. Usng the dataset n ts orgnal format for classfcaton wth SVM would be problematc due to the large ampltude of feature values caused by the dfferent nature of the data varables and dfferent unts of measurements used. For example, the mass sze values range from 0 to 75, as long as calcfcaton ranges from 0 to 3. Such an nconsstency could affect the predctve abltes of a SVM classfer as some varables can be vewed as more nfluental than others. The approach we used to address that problem was to process each data varable (data column) separately by transformaton (6). It scales down the varables wthn the unt hypercube. We also explored how presence or absence of varables presented to the model for tranng and testng affects the classfer performance. Removng most rrelevant and redundant features from the data helps to allevate the effect of the curse of dmensonalty and to enhance the generalzaton capablty of the model, and to speed up the learnng process and to mprove the model nterpretablty. The feature selecton also helps to acqure better understandng about data and how they are related wth each other. The exhaustve search approach that consders all possble subsets of varables s best for datasets wth small cardnalty, but mpractcal for large number of features as n our case. Jesneck et al. [2007] proposed a feature subset of 14 descrptors (s14) for ther experments wth neural networks. They were derved by the stepwse feature selecton technque. There s no guarantee, however, that an optmal varable selecton for one classfcaton technque wll be optmal for another. In order to fnd the alternatve selectons for the SVM model we consdered several feature selecton algorthms, whch generally fall nto two categores: feature rankng and subset selecton. The latter s more advanced and wdely used n practce, whch made us focus on t. We consdered best frst, subset sze forward selecton, race search, scatter search and genetc search combned wth a set evaluaton technque that consders ndvdual predctve ablty of each feature along wth the degree of redundancy between them [Goldberg, 1989; (6)

7 317 ITHEA Hall, 1998]. We propose a set of 17 varables (s17) derved by the lnear forward selecton technque, proposed by Guetlen et al. [2009]. The feature set we obtaned conssts of the followng varables: patent age, ndcaton for sonography, mass margn, calcfcaton number of partcles, archtectural dstorton, anteroposteror dameter, mass shape, mass orentaton, leson boundary, specal cases, mass shape, mass margn, thn echo pseudocapsule, mass echogencty, edge shadow, cystc component, and mass margn. Two of these are general descrptors; three - mammographc BI-RADS; fve - sonographc BI-RADS; four - Stavros ; and three - sonographc mass descrptors. The feature set s relatvely balanced n representng dfferent categores of data. In our experments we also used the set of 14 varables (s14) mentoned above and the orgnal full set of 39 varables (s39). Emprcal Results and Dscusson Usng the tranng and testng datasets descrbed above, we bult a classfcaton model based on SVM. For the purposes of the ROC we used the support vector regresson technque, whch outputs predctons as real numbers between -1 and 1, whch mapped to the class labels (ether -1 or 1). In order to mnmze the bas n results assocated wth the random samplng of the tranng and testng data samples, we appled fve-fold crossvaldaton, a.k.a. rotaton estmaton. The dataset was randomly spt nto fve mutually exclusve subsets (folds) of equal sze. The model was traned and tested fve tmes so that each tme t was traned on one combnaton of four folds and tested on the remanng one. The cross-valdaton estmate of the overall model accuracy was calculated by (7). C V A 1 k k 1 A, (7) where the number of folds k=5, CVA s the cross-valdaton accuracy, and A s the accuracy measure of the -th fold (e.g. ht-rate, senstvty, specfcty). The prmary source for estmatng the model accuracy s the confuson matrx (a.k.a. contngency table), llustrated n Fgure 3. Results from experments were summarzed n four categores: true postves (TP), true negatves (TN), false postves (FP), and false negatves (FN). The numbers along the prmary dagonal n the matrx represent correct predctons, as long as those outsde the dagonal represent the errors.

8 Predcted Class Negatve Postve Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 318 Postve True Postve Count (TP) True Class Negatve False Postve Count (FP) False Negatve Count (FN) True Negatve Count (TN) Fg. 3. Confuson matrx for tabulaton of classfcaton results In order to estmate the model performance we used the followng dervatons from the confuson matrx: True Postve Rate (TPR), a.k.a. senstvty, ht rate, or recall s the rato of correctly classfed postves dvded by the total postve count. TPR= TP TP+ FN False postve rate (FPR), a.k.a. fall-out, or (1-specfcty) s the rato of ncorrectly classfed postves dvded by the total negatve count. FPR= FP FP+TN Accuracy (ACC) s the rato of correctly classfed nstances (both postves and negatves) dvded by the total number of nstances. ACC = TP+TN TP+TN + FP+ FN (8) (9) (10) Estmatng the accuracy of the bult SVM model s mportant for the followng two reasons: frst, t can be used to estmate the future predcton accuracy, whch could mply the level of confdence, the potental users may have; secondly, t can be used for choosng a partcular nstance of the SVM model among avalable optons, e.g. selecton of kernel functon and parameter settngs. Accuracy s a common performance estmator n machne learnng and data mnng, but n many cases and problem domans t s not suffcent metrc. Sometmes, accuracy can be

9 319 ITHEA msleadng, for example where mportant classes are underrepresented n the datasets and class dstrbuton s skewed. In that case accuracy s helpless n countng dfferent costs and consequences from msclassfcatons. Ths s the case of the doman we consder, as msdagnosed malgnant and bengn samples have dfferent consequences and even may cost lfe. Another drawback of the accuracy s that t depends on the classfer s operatng threshold. When SVM runs as a regresson functon that outputs real numbers between -1 and 1, mappng outputs to class labels requres defnng a threshold between -1 and 1, so that the output can fall below or above t,.e. mapped to one or another class label. Applyng dfferent thresholds produces dfferent nstances of the classfcaton model, each of whch features a specfc accuracy. In order to address those accuracy defcences, we dd Recever Operatng Characterstcs (ROC) analyss [Fawcett, 2006]. Ths s a graphcal assessment technque where the true postve rate s plotted on the Y-axs and false postve rate s plotted on the X-axs (Fgure 4). In the ROC space, a classfcaton model s a step curve plotted by connectng all model nstances made by varyng the threshold value. The lne that lnks (0,0) and (1,1) s the no-dscrmnaton lne. It represents the worst possble model, whch predcts by a completely random guess. Any other classfer should appear above that lne. If t pops below the lne, a negaton of ts predctons would move t above the no-dscrmnaton lne. On the other hand, the 'deal' classfer would be represented by the pont (0,1), the top-left corner, whch shows that all true postves are found and no false postves are found. Any model performance can be measured by ts proxmty to the 'deal' classfer. The closer the ROC curve s as a whole to the north-west corner, the better. That s also the most dstant from the no-dscrmnaton lne, the better. Gven a curve, the most northwest pont of the curve represents the model nstance wth maxmal accuracy. The ROC analyss also provdes means for quantfcaton of a model performance, these are Area Under the ROC Curve (AUC) and partal Area Under the ROC curve (pauc) where senstvty s above a certan value (p). The AUC / pauc are scalars that measure the overall model performance, regardless of the operatonal threshold. The bgger the values, the better the model s. As long as AUC provdes an overall estmaton of the model, the pauc s more relevant to the applcaton area, as the potental users of CAD tools are partcularly nterested n workng wth hgh levels of senstvty. It s beleved that senstvty above 90% ( 0.90AUC) s relevant to the applcaton feld. Another clncally relevant metrc that we estmated s specfcty at gven senstvty. In order to be consstent wth prevous studes [Jesneck et al., 2007], we consdered specfcty at 98% senstvty.

10 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 320 Table 1 summarzes results from numerous experments where the SVM model was traned and tested usng four kernels: lnear, polynomal, RBF, and sgmod. For each of those kernels we expermented wth three dfferent sets of varables: s39 that contans all of them n the orgnal dataset and the selectons s14 and s17 dscussed above. Table 1. Performance of SVM wth lnear, polynomal, RBF, and sgmod kernels. Metrcs for comparson nclude: area under the ROC curve (AUC), partal AUC at senstvty above 90% (0.90AUC), specfcty at 98% senstvty, and maxmal accuracy (ACCmax). Models have been tested wth three varable selectons: s39, s17, and s14. Typcal radologst assessment values are also ncluded. Fgures n bold show best values. SVM lnear s39 s17 s14 Radologst SVM polynomal s39 s17 s14 Radologst AUC AUC AUC AUC Spec /98% sens Spec /98% sens ACCmax n/a ACCmax n/a SVM FBF s39 s17 s14 Radologst SVM sgmod s39 s17 s14 Radologst AUC AUC AUC AUC Spec /98% sens Spec /98% sens ACCmax n/a ACCmax n/a The table fgures show that the selecton of descrptors for tranng and testng plays a sgnfcant role n the SVM performance. Accordng to all metrcs and no matter whch kernel s selected, t s evdent that the varable set s14, proposed by Jesneck et al. [2007] for classfcaton wth MLP s outperformed by both s39 and s17. As mentoned before, that s not surprsng as an optmal varable selecton for one classfcaton model would not be optmal for another. We also show that the alternatve selecton of varables, s17, can outperform both s14 and s39. That selecton sgnfcantly mproves the 0.90AUC of s14 from 12% to 17%, dependng on whch kernel s used, and also outperforms the radologst value by 23%. The other clncally relevant metrc, specfcty at 89% senstvty, s also mproved by s17 n comparson wth s14 - from 6% to 16%. In some cases s39 performs as well as s17, but t never gets better. The varable set s17 shows tself as the best performer regardng AUC and ACC max wth only few exceptons.

11 321 ITHEA a) b) c) d) e) f) g) h) Fg. 4. Performance of SVM wth lnear, polynmal, RBF, and sgmod kernels wth three varable sets: all attrbutes (s39); selecton of 17 attrbutes based on the subset sze forward selecton method (s17) Guetln et al. [2009]; and selecton of 14 attrbutes proposed by Jesneck et al. [2007]

12 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 322 We also explored how choce of the kernel functon nfluences the SVM predctve abltes. Ths s partcularly mportant when data belong to classes, whch are not lnearly separable. In those cases we can expect that the SVM model wth lnear kernel wouldn't perform well n contrast to the non-lnear ones. Table 1 shows the results for each kernel functon. The SVM works well wth all of them. Consderng s17 only, AUC for all four kernels s 91% and 0.90AUC s from 74% to 75%. Specfcty at hgh senstvty, however, vares wth dfferent kernels. Regardng ths metrc, the sgmod kernel outperforms the others, followed by the RBF, lnear, and polynomal. Our fndngs also could be compared wth those from studes that use models based on the most common neural networks - MLPs, gven that all methods use the same dataset and varable sets (Nachev & Stoyanov, 2010). SVM shows the same AUC as MLP, but mproves 0.90AUC by 7% (68% vs. 75%) and max accuracy by 2% (83% vs. 85%). Fg. 4 gves further detals on the SVM ROC analyss, The left-hand fgures llustrate the ROC curves and AUC for each kernel and varable set; the rght-hand fgures llustrate the area of senstvty above 90% and lst 0.90AUC. Concluson Ths study explores support vector machnes utlzed as predctors of malgnant breast masses, traned and tested wth data from mammographc and sonographc examnatons. We used data collected from Duke Unversty Medcal Centre, whch contans 39 descrptors. Our study was focused on two ssues: how reducton of dmensonalty of the tranng and testng data affect the dscrmnatory power of the model; and how choce of the SVM kernel functon and model parameters affect ts predctve abltes. In order to quantfy the model performance we dd ROC analyss and utlzed metrcs, such as true postve rate, false postve rate, area under the ROC curve, partal area under the ROC curve, and specfcty at hgh senstvty. Our results show that the reducton of dmensonalty plays a sgnfcant role n the model performance. We propose a set of 17 varables, whch outperforms the 14 varables set of Jesneck et al. [2007]. The choce kernel functon among lnear, polynomal, RBF, and sgmod, however, does not nfluences the model performance, wth excepton of one metrc - specfcty at hgh senstvty. In that case the sgmod kernel s the best performer. The fact that the lnear kernel shows smlar performance to that of the non-lnear kernels s an ndcaton that the feature space s lnearly separable and data ponts are dstrbuted n a way that makes the classfcaton task lnear n terms of complexty.

13 323 ITHEA We also found expermentally that the SVM outperform a common classfcaton technque used n the feld - MLP neural networks. SVM shows the same AUC, but mproves 0.90AUC by 7% (68% vs. 75%) and max accuracy by 2% (83% vs. 85%). In concluson, we beleve that SVM s a promsng technque for breast cancer dagnoss, but when used, t requres a careful reducton of dmensonalty and well-selected model parameters. Bblography [BI-RADS, 2003] Amercan College of Radology. BI-RADS: ultrasound, 1st ed. In: Breast magng reportng and data system: BI-RADS atlas, 4th ed. Reston, VA: Amercan College of Radology, 2003 [Burges, 1998] Burges, C. A Tutoral on Support Vector Machnes for Pattern Recognton. Data Mnng and Knowledge Dscovery, 2, , [Chen et al., 2009] Chen, S., Hsao, Y., Huang, Y., Kuo, S., Tseng, H., Wu, H., Chen, D.: Comparatve Analyss of Logstc Regresson, Support Vector Machne and Artfcal Neural Network for the Dfferental Dagnoss of Bengn and Malgnant Sold Breast Tumors by the Use of Three-Dmensonal Power Doppler Imagng. Korean J Radol vol. 10, [Drucker et al., 1997] Drucker, H. Burges, C., Kaufman, L., Smola, A., and Vapnk, V., Support vector regresson machnes, Advances n Neural Informaton Processng Systems 9, pages , Cambrdge, MA, MIT Press, [Fawcett, 2006] Fawcett, T. An ntroducton to ROC analyss ; Pattern Recognton Letters, Vol. 27 Issue 8, pp , [Fe et al., 2008] Fe, L., L, W. & Yong, H. Applcaton of least squares support vector machnes for dscrmnaton of red wne usng vsble and near nfrared spectroscopy. Intellgent System and Knowledge Engneerng, ISKE' 08, [Goldberg, 1989] Goldberg, D.: Genetc Algorthms n Search, Optmzaton, and Machne Learnng, Addson-Wesley: Readng, MA, [Guetlen et al., 2009] Guetlen, M., Frank, E., Hall, M., Karwath, A. Large Scale Attrbute Selecton Usng Wrappers ; In Proc. IEEE Symposum on CIDM, pp , [Hall et al., 2009] Hall, M., Frank, E., Holmes, G., Pfahrnger, B., Reutemann, P., and Wtten. I., The WEKA data mnng software: an update. SIGKDD Explor. Newsl. 11, 1, 10-18, [Horng et al., 2010] Horng, S., Su, M., Chen, Y., Kao, T., Chen, R., La, J. and Perkasa, C. A novel ntruson detecton system based on herarchcal clusterng and support vector machnes. Expert Systems wth Applcatons, 38, , [Jesneck et al., 2007] Jesneck, J., Lo, J., Baker, J. Breast Mass Lesons: Computer-Aded Dagnoss Models wth Mamographc and Sonographc Descrptors ; Radology, vol.244, Issue 2, pp , [Kopans, 1992] Kopans D. Standardzed mammographc reportng ; Radol Cln North Am, Vol. 30, pp , 1992 [Lacey et al., 2002] Lacey, J., Devesa, S., Brnton, L. Recent Trends n Breast Cancer Incdence and Mortalty. ; Envronmental and Molecular Mutageness, Vol. 39, pp , [Nachev & Stoyanov, 2010] Nachev, A. and Stoyanov, B., An Approach to Computer Aded Dagnoss by Mult-Layer Preceptrons, In Proceedngs of Internatonal Conference Artfcal Intellgence (IC-AI 10), Las Vegas, [Oladunn and Snghal, 2009] Oladunn, O. O. & Snghal, G Pecewse mult-classfcaton support vector machnes. Internatonal Jont Conference on Neural Networks, IJCNN'09, 2009.

14 Artfcal Intellgence Methods and Technques for Busness and Engneerng Applcatons 324 [Stavros et al., 1995] Stavros, A., Thckman, D., Rapp, C., Denns, M., Parker, S., Ssney, G. Sold Breast Modules: Use of Sonography to Destngush between Bengn and Malgnant Lesons ; Radology, Vol. 196, pp , [Vapnk, 1995] Vapnk, V., The Nature of Statstcal Learnng Theory. Sprnger, New York (1995) Authors' Informaton Anatol Nachev Busness Informaton Systems, Carnes Busness School, Natonal Unversty of Ireland, Galway, Ireland; e-mal: anatol.nachev@nugalway.e Major Felds of Scentfc Research: data mnng, neural networks, support vector machnes, adaptve resonance theory. Maread Hogan Busness Informaton Systems, Carnes Busness School, Natonal Unversty of Ireland, Galway, Ireland; e-mal: maread.hogan@nugalway.e Major Felds of Scentfc Research: HCI, usablty and accessblty n nformaton systems, data mnng.

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Classification / Regression Support Vector Machines

Classification / Regression Support Vector Machines Classfcaton / Regresson Support Vector Machnes Jeff Howbert Introducton to Machne Learnng Wnter 04 Topcs SVM classfers for lnearly separable classes SVM classfers for non-lnearly separable classes SVM

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION

CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 48 CHAPTER 3 SEQUENTIAL MINIMAL OPTIMIZATION TRAINED SUPPORT VECTOR CLASSIFIER FOR CANCER PREDICTION 3.1 INTRODUCTION The raw mcroarray data s bascally an mage wth dfferent colors ndcatng hybrdzaton (Xue

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification

12/2/2009. Announcements. Parametric / Non-parametric. Case-Based Reasoning. Nearest-Neighbor on Images. Nearest-Neighbor Classification Introducton to Artfcal Intellgence V22.0472-001 Fall 2009 Lecture 24: Nearest-Neghbors & Support Vector Machnes Rob Fergus Dept of Computer Scence, Courant Insttute, NYU Sldes from Danel Yeung, John DeNero

More information

Lecture 5: Multilayer Perceptrons

Lecture 5: Multilayer Perceptrons Lecture 5: Multlayer Perceptrons Roger Grosse 1 Introducton So far, we ve only talked about lnear models: lnear regresson and lnear bnary classfers. We noted that there are functons that can t be represented

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law)

Machine Learning. Support Vector Machines. (contains material adapted from talks by Constantin F. Aliferis & Ioannis Tsamardinos, and Martin Law) Machne Learnng Support Vector Machnes (contans materal adapted from talks by Constantn F. Alfers & Ioanns Tsamardnos, and Martn Law) Bryan Pardo, Machne Learnng: EECS 349 Fall 2014 Support Vector Machnes

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Using Neural Networks and Support Vector Machines in Data Mining

Using Neural Networks and Support Vector Machines in Data Mining Usng eural etworks and Support Vector Machnes n Data Mnng RICHARD A. WASIOWSKI Computer Scence Department Calforna State Unversty Domnguez Hlls Carson, CA 90747 USA Abstract: - Multvarate data analyss

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

A Modified Median Filter for the Removal of Impulse Noise Based on the Support Vector Machines

A Modified Median Filter for the Removal of Impulse Noise Based on the Support Vector Machines A Modfed Medan Flter for the Removal of Impulse Nose Based on the Support Vector Machnes H. GOMEZ-MORENO, S. MALDONADO-BASCON, F. LOPEZ-FERRERAS, M. UTRILLA- MANSO AND P. GIL-JIMENEZ Departamento de Teoría

More information

Incremental Learning with Support Vector Machines and Fuzzy Set Theory

Incremental Learning with Support Vector Machines and Fuzzy Set Theory The 25th Workshop on Combnatoral Mathematcs and Computaton Theory Incremental Learnng wth Support Vector Machnes and Fuzzy Set Theory Yu-Mng Chuang 1 and Cha-Hwa Ln 2* 1 Department of Computer Scence and

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Face Recognition Method Based on Within-class Clustering SVM

Face Recognition Method Based on Within-class Clustering SVM Face Recognton Method Based on Wthn-class Clusterng SVM Yan Wu, Xao Yao and Yng Xa Department of Computer Scence and Engneerng Tong Unversty Shangha, Chna Abstract - A face recognton method based on Wthn-class

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

A classification scheme for applications with ambiguous data

A classification scheme for applications with ambiguous data A classfcaton scheme for applcatons wth ambguous data Thomas P. Trappenberg Centre for Cogntve Neuroscence Department of Psychology Unversty of Oxford Oxford OX1 3UD, England Thomas.Trappenberg@psy.ox.ac.uk

More information

Announcements. Supervised Learning

Announcements. Supervised Learning Announcements See Chapter 5 of Duda, Hart, and Stork. Tutoral by Burge lnked to on web page. Supervsed Learnng Classfcaton wth labeled eamples. Images vectors n hgh-d space. Supervsed Learnng Labeled eamples

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT 3. - 5. 5., Brno, Czech Republc, EU APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT Abstract Josef TOŠENOVSKÝ ) Lenka MONSPORTOVÁ ) Flp TOŠENOVSKÝ

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Discriminative classifiers for object classification. Last time

Discriminative classifiers for object classification. Last time Dscrmnatve classfers for object classfcaton Thursday, Nov 12 Krsten Grauman UT Austn Last tme Supervsed classfcaton Loss and rsk, kbayes rule Skn color detecton example Sldng ndo detecton Classfers, boostng

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Computer-Aided Diagnosis Applied to US of Solid Breast Nodules by Using Principal Component Analysis and Image Retrieval*

Computer-Aided Diagnosis Applied to US of Solid Breast Nodules by Using Principal Component Analysis and Image Retrieval* Computer-Aded Dagnoss Appled to US of Sold Breast Nodules by Usng Prncpal Component Analyss and Image Retreval* Yu-Len Huang Dar-Ren Chen and Sheng-Hsung Ln Department of Computer Scence and Informaton

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

CLASSIFICATION OF ULTRASONIC SIGNALS

CLASSIFICATION OF ULTRASONIC SIGNALS The 8 th Internatonal Conference of the Slovenan Socety for Non-Destructve Testng»Applcaton of Contemporary Non-Destructve Testng n Engneerng«September -3, 5, Portorož, Slovena, pp. 7-33 CLASSIFICATION

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS46: Mnng Massve Datasets Jure Leskovec, Stanford Unversty http://cs46.stanford.edu /19/013 Jure Leskovec, Stanford CS46: Mnng Massve Datasets, http://cs46.stanford.edu Perceptron: y = sgn( x Ho to fnd

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Comparison Study of Textural Descriptors for Training Neural Network Classifiers

Comparison Study of Textural Descriptors for Training Neural Network Classifiers Comparson Study of Textural Descrptors for Tranng Neural Network Classfers G.D. MAGOULAS (1) S.A. KARKANIS (1) D.A. KARRAS () and M.N. VRAHATIS (3) (1) Department of Informatcs Unversty of Athens GR-157.84

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Research of Neural Network Classifier Based on FCM and PSO for Breast Cancer Classification

Research of Neural Network Classifier Based on FCM and PSO for Breast Cancer Classification Research of Neural Network Classfer Based on FCM and PSO for Breast Cancer Classfcaton Le Zhang 1, Ln Wang 1, Xujewen Wang 2, Keke Lu 2, and Ajth Abraham 3 1 Shandong Provncal Key Laboratory of Network

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

Journal of Process Control

Journal of Process Control Journal of Process Control (0) 738 750 Contents lsts avalable at ScVerse ScenceDrect Journal of Process Control j ourna l ho me pag e: wwwelsevercom/locate/jprocont Decentralzed fault detecton and dagnoss

More information

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University

CAN COMPUTERS LEARN FASTER? Seyda Ertekin Computer Science & Engineering The Pennsylvania State University CAN COMPUTERS LEARN FASTER? Seyda Ertekn Computer Scence & Engneerng The Pennsylvana State Unversty sertekn@cse.psu.edu ABSTRACT Ever snce computers were nvented, manknd wondered whether they mght be made

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces

Range images. Range image registration. Examples of sampling patterns. Range images and range surfaces Range mages For many structured lght scanners, the range data forms a hghly regular pattern known as a range mage. he samplng pattern s determned by the specfc scanner. Range mage regstraton 1 Examples

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

A MODIFIED K-NEAREST NEIGHBOR CLASSIFIER TO DEAL WITH UNBALANCED CLASSES

A MODIFIED K-NEAREST NEIGHBOR CLASSIFIER TO DEAL WITH UNBALANCED CLASSES A MODIFIED K-NEAREST NEIGHBOR CLASSIFIER TO DEAL WITH UNBALANCED CLASSES Aram AlSuer, Ahmed Al-An and Amr Atya 2 Faculty of Engneerng and Informaton Technology, Unversty of Technology, Sydney, Australa

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article. A selective ensemble classification method on microarray data Avalable onlne www.ocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2860-2866 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 A selectve ensemble classfcaton method on mcroarray

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

An Evolvable Clustering Based Algorithm to Learn Distance Function for Supervised Environment

An Evolvable Clustering Based Algorithm to Learn Distance Function for Supervised Environment IJCSI Internatonal Journal of Computer Scence Issues, Vol. 7, Issue 5, September 2010 ISSN (Onlne): 1694-0814 www.ijcsi.org 374 An Evolvable Clusterng Based Algorthm to Learn Dstance Functon for Supervsed

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

A Novel Term_Class Relevance Measure for Text Categorization

A Novel Term_Class Relevance Measure for Text Categorization A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure

More information

General Vector Machine. Hong Zhao Department of Physics, Xiamen University

General Vector Machine. Hong Zhao Department of Physics, Xiamen University General Vector Machne Hong Zhao (zhaoh@xmu.edu.cn) Department of Physcs, Xamen Unversty The support vector machne (SVM) s an mportant class of learnng machnes for functon approach, pattern recognton, and

More information

A Statistical Model Selection Strategy Applied to Neural Networks

A Statistical Model Selection Strategy Applied to Neural Networks A Statstcal Model Selecton Strategy Appled to Neural Networks Joaquín Pzarro Elsa Guerrero Pedro L. Galndo joaqun.pzarro@uca.es elsa.guerrero@uca.es pedro.galndo@uca.es Dpto Lenguajes y Sstemas Informátcos

More information

A Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems

A Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems 2008 INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT A Smlarty-Based Prognostcs Approach for Remanng Useful Lfe Estmaton of Engneered Systems Tany Wang, Janbo Yu, Davd Segel, and Jay Lee

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

Spam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection

Spam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton We-Chh Hsu, Tsan-Yng Yu E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton

More information

Relevance Assignment and Fusion of Multiple Learning Methods Applied to Remote Sensing Image Analysis

Relevance Assignment and Fusion of Multiple Learning Methods Applied to Remote Sensing Image Analysis Assgnment and Fuson of Multple Learnng Methods Appled to Remote Sensng Image Analyss Peter Bajcsy, We-Wen Feng and Praveen Kumar Natonal Center for Supercomputng Applcaton (NCSA), Unversty of Illnos at

More information

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System

Fuzzy Modeling of the Complexity vs. Accuracy Trade-off in a Sequential Two-Stage Multi-Classifier System Fuzzy Modelng of the Complexty vs. Accuracy Trade-off n a Sequental Two-Stage Mult-Classfer System MARK LAST 1 Department of Informaton Systems Engneerng Ben-Guron Unversty of the Negev Beer-Sheva 84105

More information

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM

Classification of Face Images Based on Gender using Dimensionality Reduction Techniques and SVM Classfcaton of Face Images Based on Gender usng Dmensonalty Reducton Technques and SVM Fahm Mannan 260 266 294 School of Computer Scence McGll Unversty Abstract Ths report presents gender classfcaton based

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

ÇUKUROVA UNIVERSITY INSTITUTE OF NATURAL AND APPLIED SCIENCES. Dissertation.com

ÇUKUROVA UNIVERSITY INSTITUTE OF NATURAL AND APPLIED SCIENCES. Dissertation.com Predctng the Admsson Decson of a Partcpant to the School of Physcal Educaton and Sports at Çukurova Unversty by Usng Dfferent Machne Learnng Methods Combned wth Feature Selecton Gözde Özsert Yğt Mehmet

More information

Associative Based Classification Algorithm For Diabetes Disease Prediction

Associative Based Classification Algorithm For Diabetes Disease Prediction Internatonal Journal of Engneerng Trends and Technology (IJETT) Volume-41 Number-3 - November 016 Assocatve Based Classfcaton Algorthm For Dabetes Dsease Predcton 1 N. Gnana Deepka, Y.surekha, 3 G.Laltha

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status

More information

RECOGNIZING GENDER THROUGH FACIAL IMAGE USING SUPPORT VECTOR MACHINE

RECOGNIZING GENDER THROUGH FACIAL IMAGE USING SUPPORT VECTOR MACHINE Journal of Theoretcal and Appled Informaton Technology 30 th June 06. Vol.88. No.3 005-06 JATIT & LLS. All rghts reserved. ISSN: 99-8645 www.jatt.org E-ISSN: 87-395 RECOGNIZING GENDER THROUGH FACIAL IMAGE

More information

ISSN: International Journal of Engineering and Innovative Technology (IJEIT) Volume 1, Issue 4, April 2012

ISSN: International Journal of Engineering and Innovative Technology (IJEIT) Volume 1, Issue 4, April 2012 Performance Evoluton of Dfferent Codng Methods wth β - densty Decodng Usng Error Correctng Output Code Based on Multclass Classfcaton Devangn Dave, M. Samvatsar, P. K. Bhanoda Abstract A common way to

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

GSLM Operations Research II Fall 13/14

GSLM Operations Research II Fall 13/14 GSLM 58 Operatons Research II Fall /4 6. Separable Programmng Consder a general NLP mn f(x) s.t. g j (x) b j j =. m. Defnton 6.. The NLP s a separable program f ts objectve functon and all constrants are

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

An Anti-Noise Text Categorization Method based on Support Vector Machines *

An Anti-Noise Text Categorization Method based on Support Vector Machines * An Ant-Nose Text ategorzaton Method based on Support Vector Machnes * hen Ln, Huang Je and Gong Zheng-Hu School of omputer Scence, Natonal Unversty of Defense Technology, hangsha, 410073, hna chenln@nudt.edu.cn,

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS

EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS P.G. Demdov Yaroslavl State Unversty Anatoly Ntn, Vladmr Khryashchev, Olga Stepanova, Igor Kostern EYE CENTER LOCALIZATION ON A FACIAL IMAGE BASED ON MULTI-BLOCK LOCAL BINARY PATTERNS Yaroslavl, 2015 Eye

More information

Human Face Recognition Using Generalized. Kernel Fisher Discriminant

Human Face Recognition Using Generalized. Kernel Fisher Discriminant Human Face Recognton Usng Generalzed Kernel Fsher Dscrmnant ng-yu Sun,2 De-Shuang Huang Ln Guo. Insttute of Intellgent Machnes, Chnese Academy of Scences, P.O.ox 30, Hefe, Anhu, Chna. 2. Department of

More information

PRÉSENTATIONS DE PROJETS

PRÉSENTATIONS DE PROJETS PRÉSENTATIONS DE PROJETS Rex Onlne (V. Atanasu) What s Rex? Rex s an onlne browser for collectons of wrtten documents [1]. Asde ths core functon t has however many other applcatons that make t nterestng

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,

More information