Feature Selection for Natural Language Call Routing Based on Self-Adaptive Genetic Algorithm

Size: px
Start display at page:

Download "Feature Selection for Natural Language Call Routing Based on Self-Adaptive Genetic Algorithm"

Transcription

1 IOP Conference Seres: Materals Scence and Engneerng PAPER OPEN ACCESS Feature Selecton for Natural Language Call Routng Based on Self-Adaptve Genetc Algorthm To cte ths artcle: A Koromyslova et al 017 IOP Conf. Ser.: Mater. Sc. Eng Vew the artcle onlne for updates and enhancements. Related content - Dscrmnatve feature selecton for vsual trackng Junka Ma, Habo Luo, We Zhou et al. - Sentment analyss of feature rankng methods for classfcaton accuracy Shashank Joseph, Calvn Mugaur and S Sumathy - A new Self-Adaptve dspatchng System for local clusters Bowen Kan, Jngyan Sh and Xaofeng Le Recent ctatons - Context Analyss of Customer Requests usng a Hybrd Adaptve Neuro Fuzzy Inference System and Hdden Markov Models n the Natural Language Call Routng Problem Samr Rustamov et al Ths content was downloaded from IP address on 30/06/018 at 1:1

2 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 Internatonal Conference on Recent Trends n Physcs 016 (ICRTP016) Journal of Physcs: Conference Seres 755 (016) IOP Publshng do: / /755/1/ Feature Selecton for Natural Language Call Routng Based on Self-Adaptve Genetc Algorthm A Koromyslova 1, M Semenkna 1, R Sergenko 1Reshetnev Sberan State Aerospace Unversty, Krasnoyarsky rabochy avenue 31, Krasnoyarsk, Russa, Ulm Unversty, Lse-Metner-Str. 9, Ulm, Germany, E-mal: akoromyslova@mal.ru, semenkna88@mal.ru, roman.sergenko@unulm.de Abstract: The text classfcaton problem for natural language call routng was consdered n the paper. Seven dfferent term weghtng methods were appled. As dmensonalty reducton methods, the feature selecton based on self-adaptve GA s consdered. k-nn, lnear SVM and ANN were used as classfcaton algorthms. The tasks of the research are the followng: perform research of text classfcaton for natural language call routng wth dfferent term weghtng methods and classfcaton algorthms and nvestgate the feature selecton method based on self-adaptve GA. The numercal results showed that the most effectve term weghtng s TRR. The most effectve classfcaton algorthm s ANN. Feature selecton wth self-adaptve GA provdes mprovement of classfcaton effectveness and sgnfcant dmensonalty reducton wth all term weghtng methods and wth all classfcaton algorthms. 1. Introducton Natural language call routng s an mportant problem n the desgn of modern automatc call servces and the solvng of ths problem could lead to mprovement of the call servce [1]. Generally natural language call routng can be consdered as two dfferent problems. The frst one s speech recognton of calls and the second one s topc categorzaton of users utterances for further routng. Topc categorzaton of users utterances can be also useful for multdoman spoken dalogue system desgn [1]. In ths work we treat call routng as an example of a text classfcaton applcaton. In the vector space model [16] text classfcaton s consdered as a machne learnng problem. The complexty of text categorzaton wth a vector space model s compounded by the need to extract the numercal data from text nformaton before applyng machne learnng algorthms. Therefore, text classfcaton conssts of two parts: text preprocessng and classfcaton algorthm applcaton usng the obtaned numercal data. Text preprocessng comprses three stages: - Textual feature extracton. - Term weghtng - Dmensonalty reducton. The frst one s the textual feature extracton based on raw preprocessng of the documents. Ths process ncludes deletng punctuaton, transformng captal letters to lowercase, and addtonal procedures such as stop-words flterng [4] and stemmng [14]. Stop-words lst contans pronouns, prepostons, artcles and other words that usually have no mportance for the classfcaton. Usng stemmng t s possble to on dfferent forms of the same word nto one textual feature. The second stage s the numercal feature extracton based on term weghtng. For term weghtng we use bag-of-words model, n whch the word order s gnored. There exst dfferent unsupervsed and supervsed term weghtng methods. The most well-known unsupervsed term weghtng method s TFIDF [15]. The followng supervsed term weghtng methods are also consdered n the paper: Content from ths work may be used under the terms of the Creatve Commons Attrbuton 3.0 lcence. Any further dstrbuton of ths work must mantan attrbuton to the author(s) and the ttle of the work, ournal ctaton and DOI. Publshed under lcence by IOP Publshng Ltd 1

3 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 Gan Rato (GR) [3], Confdent Weghts (CW) [10], Term Second Moment (TM) [], Relevance Frequency (RF) [11], Term Relevance Rato (TRR) [9], and Novel Term Weghtng (NTW) [18]; these methods nvolve nformaton about the classes of the documents. As a rule, the dmensonalty for text classfcaton problems s hgh even after stop-words flterng and stemmng. Due to the hgh dmensonalty, the classfcaton may be napproprate tmeconsumng, especally for real-tme systems such as natural language call routng. Therefore, the next stage of preprocessng s the dmensonalty reducton based on numercal features; t s possble wth feature selecton or feature transformaton. In our research we use a feature selecton method based on genetc algorthm or GA-based wrapper. One of the most complcated problems wth GA applcatons s settng algorthm parameters. A conventonal genetc algorthm has at least three methods of selecton (proportonal, tournament, and rank), three methods of recombnaton (one-pont, two-pont, and unform). Mutaton probablty requres tunng as well. The amount of varous combnatons can be estmated at tens. Exhaustve search of combnatons requres a lot of tme and computatonal power, especally for such tmeconsumng problems as GA applcatons for machne learnng. Parameters combnaton selecton at random can be also nsuffcent as algorthm effcency on same problem can dffer very much for varous parameters settng. Ths problem can be solved wth self-confgurng GA [17] or coevolutonary GA [19]. Therefore, we propose a use of self- adaptve GA for the feature selecton n the feld of natural language call routng. As classfcaton algorthms we use the k-nn algorthm, the lnear SVM, and artfcal neural networks (ANN). Some comparatve studes of machne learnng algorthms n the feld of text classfcaton showed hgh classfcaton effectveness of k-nn, SVM-based algorthms, and ANN [, 7, 8, 10, 13]. The tasks of our research are the followng: - Perform research of text classfcaton for natural language call routng wth dfferent term weghtng methods and classfcaton algorthms. - Investgate the feature selecton method based on self-adaptve GA. The paper s organzed as follows: In Secton, we descrbe the consdered corpus for natural language call routng. Secton 3 descrbes the consdered term weghtng methods. The GA-based feature selecton s descrbed n Secton 4. The self- adaptve GA are descrbed n Secton 5. Secton 6 contans short descrpton of classfcaton algorthms. The results of numercal experments are presented n Secton 7. Fnally, we provde concludng remarks n Secton 8.. Corpus descrpton The data for testng and evaluaton conssts of 9,156 user utterances recorded n Englsh language from caller nteractons wth commercal automated agents. Utterances are short and contan only one phrase for further routng. The database contans calls n textual format after speech recognton. The database s provded by the company Speech Cycle (New York, USA). Utterances from ths database are manually labelled by experts and dvded nto 0 classes (such as appontments, operator, bll, nternet, phone and techncal support). One of them s a specal class TE-NOMATCH whch ncludes utterances that cannot be put nto another class or can be put nto more than one class. The database contans 45 unclassfed calls and they were removed. The database contans also 3,561 empty calls wthout any words. These calls were placed n the class TE-NOMATCH automatcally and they were also removed from the database. As a rule, the calls are short n the database; many of them contan only one or two words. The average length of an utterance s 4.66 words, the maxmal length s 19 words. There are a lot of dentcal utterances n the database; the corpus contans only 4,458 unque non-empty classfed calls. The corpus s unbalanced. The largest class contans 7.05% and the smallest one contans 0.04% of the unque calls. Due to the very hgh frequency of a small number of utterances n the corpus, we formulate two dfferent problem defntons.

4 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 Problem defnton 1. The whole database wth 68,550 classfed non-empty calls s used for tranng and test sets formng. Numbers of repettons of the utterances n tranng and test sets are used as weghts for classfcaton. Ths problem defnton s the closest to the real stuaton but frequently repeated utterances decrease dfference between preprocessng and classfcaton methods. Addtonally, there are some dentcal utterances n tranng and test sets smultaneously. In ths case the over-fttng problem of classfcaton may be hdden. Therefore, ths problem defnton s not very approprate for the comparatve study. Problem defnton. Before tranng and test samples formng, all utterance duplcates were removed from the database. It means that there s no ntersecton between tranng and test sets and frequency of utterances s gnored. Therefore, the problem defnton 1 s sutable for the qualty estmaton of the real natural language call routng system; the problem defnton s the most approprate for the comparatve study of dfferent preprocessng and classfcaton methods. For statstcal analyss we performed 0 dfferent dvsons of the database nto tranng and test samples randomly. Ths procedure was performed for two problem defntons separately. The tran samples contan 90% of the calls and the test samples contan 10% of the calls. For each tranng sample we have desgned a dctonary of unque words whch appear n the tranng sample after deletng punctuaton and transformng captal letters to lowercase. The sze of the dctonary vares from 3,75 to 3,39 words for problem defnton 1 and from 3,77 to 3,311 for problem defnton. 3. Term weghtng methods As a rule, term weghtng s a multplcaton of two parts: the part based on the term frequency n a document (TF) and the part based on the term frequency n the whole tranng database. The TF-part s fxed for all consdered term weghtng methods and s calculated as followng: TF log n tf 1 ; tf, where n s the number of tmes the th word occurs n the th document, N s the document sze (number of words n the document). The second part of the term weghtng s calculated once for each word from the dctonary and does not depend on an utterance for classfcaton. We consder seven dfferent methods for the calculaton of the second part of term weghtng Inverse Document Frequency (IDF) IDF s a well-known unsupervsed term weghtng method whch was proposed n [15]. There are some modfcatons of IDF and we use the most popular one: N d f D log, n where D s the number of documents n the tranng set and n s the number of documents that have the th word. 3.. Gan Rato (GR) Gan Rato (GR) s manly used n term selecton [4], but n [3] t was shown that t could also be used for weghtng terms. The defnton of GR s as follows: 3

5 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 GR t, M t c c c c t cc,, c Pt, c c t c,, t P M t, c logp P t, c log P c t Pc, where P(t, c) s the relatve frequency that a document contans the term t and belongs to the category c; P(t)s the relatve frequency that a document contans the term t and P(c) s the relatve frequency that a document belongs to category c. Then, the weght of the term t s the max value between all categores as follows: where C s a set of all classes. GR t maxgrt, c, c C 3.3. Confdent Weghts (CW) Ths supervsed term weghtng approach has been proposed n [0]. Frstly, the proporton of documents contanng term t s defned as the Wlson proporton estmate p(x, n) by the followng equaton: p x, n, x 0.5Z n Z where x s the number of documents contanng the term t n the gven corpus, n s the number of documents n the corpus and Z, where the t-dstrbuton (Students law) when n < 30 and the normal dstrbuton when n 30. In ths work α=0.95 and 0.5Z (as recommended by the authors of the method). For each term t and each class c two functons p pos (x, n) and p neg (x, n) are calculated. For p pos (x, n) x s the number of documents whch belong to the class c and have term t; n s the number of documents whch belong to the class c. For p neg (x, n) x s the number of documents whch have the term t but do not belong to the class c; n s the number of documents whch do not belong to the class c. The confdence nterval (p -, p + ) at 0.95 s calculated usng the followng equaton: р M 0.5Z, р 1- р n Z р М; р р М. The strength of the term t n the category c s defned as the follows:, str t, _ ppos _ log, f p с _ ppos pneg 0, otherwse. pos > p neg, 4

6 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 The maxmum strength (Maxstr) of the term t s calculated as follows: Maxstr (t ) max str (t, c c C 3.4. Term Second Moment (TM) Ths supervsed term weghtng method was proposed n []. Let P(c t) be the emprcal estmaton of the probablty that a document belongs to the category c wth the condton that the document contans the term t; P(c ) s the emprcal estmaton of the probablty that a document belongs to the category c wthout any condtons. The dea s the followng: the more P(c t) s dfferent from P(c ), the more mportant the term t s. Therefore, we can calculate the term weght as the followng: where C s a set of all classes. TM(t ) = С 1 ) (P(c t) - P(c )), 3.5. Relevance Frequency (RF) The RF term weghtng method was proposed n [11] and s calculated as the followng: rf(t ) = max rf (t, c ), с С rf (t, c ) log а max1,. а, where a s the number of documents of the category c whch contan the term t and of documents of all the other categores whch also contan ths term Term Relevance Rato (TRR) The TRR method [9] uses tf weghts and t s calculated as the followng: TRR t log P t P t, c P TRR t k k 1 t c, V Tc tf Tc tf lk l1k 1 c c, maxtrrt, c, c C а s the number where c s a class of the document, c s all of the other classes of c, V s the vocabulary of the tranng data and T c s the document set of the class c. 5

7 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/ Novel Term Weghtng (NTW) Ths method was proposed n [1, 18]. The detals of the procedure are the followng. Let L be the number of classes; n s the number of documents whch belong to the th class; N s the number of occurrences of the th word n all documents from the th class. T = N /n s the relatve frequency of occurrences of the th word n the th class; R = max T ; S = arg max T s the class whch we assgn to the th word. The term relevance C s calculated by the followng: С L 1 T 1 R 1 L 1 L T. 1, S 4. GA-based feature selecton The popular knd of wrappers s feature selecton based on genetc algorthms (GA). GA s used as an optmzaton algorthm for fndng the optmal or the sub-optmal subset of the orgnal feature set wth the predefned classfcaton algorthm [3]. The detals of the GA-based wrapper are the followng: 1. Randomly ntalze the populaton of the bnary strngs wth the length equals to dmensonalty of the consdered classfcaton problem. Set generaton counter g = 0.. Apply the classfcaton algorthm on the valdaton set for all ndvduals form the populaton (wth ncludng only features wth 1 value n the chromosomes). Set the classfcaton effectveness measure (.e. ) on the valdaton set as a ftness functon value for all ndvduals. 3. Save the ndvdual wth the best value of ftness functon. 4. Check f the generaton counter s greater than some predefned value G (maxmal number of generatons): f g > G than go to the step 7; otherwse go to step Usng GA operators: selecton, crossover and mutaton, form the next populaton. 6. Increment the generaton counter g = g + 1. Go to step. 7. Put the best ndvdual as a soluton of the feature selecton procedure; apply the chosen feature subset for the test set. 5. Self-adaptve GA For solvng the problem of GA settng parameters we use the self-confgurng algorthm that was proposed n [17]. The scheme of the self-confgurng GA s presented n Fgure 1. In the self-confgurng GA (SCGA), dfferent types of selecton, recombnaton, and dfferent levels of mutaton are performed smultaneously. In the begnnng of the algorthm, all types of GA operators have the same probablty to be use for a new off-sprng generaton. After that, the dynamc adaptaton of probabltes s performed accordng usefulness of GA operator types n terms of ftness functon. 6

8 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 Fgure 1. Self-confgurng GA (SCGA). The detals of the self-adustng GA are the followng: 1. Put the probabltes of all used types of GA operators: p =1/N, where = 1,,N, N s the number of types of the th operator, = 1,,N, N s the number of GA operators. In our case N=3: selecton, recombnaton, and mutaton. p Set threshold probabltes for all used types of GA operators: N 3. Generate new populaton. For each off-sprng we randomly choose types of selecton, recombnaton, and mutaton accordng probabltes p. 4. Recalculate the probabltes wth the followng: 4.1. For each th operator do: Set S For each th type of the th operator do: If p p 1 T N AND p p, where T s the number of generatons, then: S S p p ; p p If p p 1 T N then: S S 1 T N ; p p T N Calculate average ftness functon F of all off-sprngs of the current generaton that were generated wth the th type of the th operator Fnd the best type d of the th operator wth the maxmal ftness functon on the current generaton and recalculate ts probablty: p d = p d +S 5. Check stop crteron. If TRUE then: END; else: go to the step 3. Moreover two co-evolutonary algorthms (CEA) [19] have been realzed. Both use sland model wth four slands and cooperatve-compettve strategy for resources allocatng. Frst of them (CEA) used four best genetc algorthms (for dfferent types of problems), second one use four ndependent (on the start and on the probabltes recalculaton stage) SCGA (CESCGA). 6. Classfcaton Algorthms As classfcaton algorthms we use the k-nn algorthm, the lnear SVM algorthm, and ANN. The classfcaton crteron s the macro [6] whch s approprate for classfcaton problems wth 7

9 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 unbalanced classes. For k-nn we performed valdaton of k from 1 to 15 on the valdaton sample. We used 80% of the tran sample for the frst level of learnng and 0% for the valdaton. The same scheme of valdaton s used for feature selecton based on self-adustng GA Genetc Algorthms for Automated Artfcal Neural Network Desgn Artfcal neural network (ANN) s a set of nterconnected neurons. Typcally, the transfer functons of all the neurons n the neural network are fxed, and the weghts are the parameters of the neural network. In case of GA usng for ANN weghts tunng: weghts are recorded sequentally n the chromosome as a bnary code. The number of bts that are used to encode a sngle weghtng coeffcent depends on the accuracy and the spread of possble values of weghts. In case of GA usng for ANN structure desgn (GA-ANN): 1. Hdden layers are coded sequentally;. Each neuron s encoded n four bts; 3. Each neuron wll exst wth probablty equal to If n the network a neuron s not presented, ts place n the chromosome s marked wth zeros. Otherwse, t s randomly selected as one of the ffteen actvaton functons, whose number s wrtten n bnary code Snce the effcency of the genetc algorthm for ANN desgn depends on the dmenson of the problem n hand, t s reasonable to avod the use of unnformatve features. The modfcaton of the genetc algorthm for the choce of the most nformatve features durng the automated desgn of ANN assumes the use of addtonal bts n the GA chromosomes. These bts determne whether an nput s ncluded n the nput layer or not. 7. Results of numercal experments Tables 1-10 show the results of the numercal experments for problem defntons 1 and wthout dmensonalty reducton and after feature selecton wth the self-adaptve GA-based wrapper. The best results n terms of are bold (for each case ndependently). The column contans number of features after GA-based wrapper performng. The numercal results showed that the most effectve term weghtng s TRR. The most effectve classfcaton algorthm s ANN. The most effectve optmzaton technque s SCGA. Feature selecton wth self-confgurng GA provdes mprovement of classfcaton effectveness and sgnfcant dmensonalty reducton wth all term weghtng methods and wth all classfcaton algorthms. Table 1. Results for problem defnton 1 wth k-nn Term weghtng method All terms Feature selecton IDF GR CW RF TM TRR NTW Table. Results for problem defnton wth k-nn Term weghtng method All terms Feature selecton IDF GR

10 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 CW RF TM TRR NTW Table 3. Results for problem defnton 1 wth SVM Term weghtng method All terms Feature selecton IDF GR CW RF TM TRR NTW Table 4. Results for problem defnton wth SVM Term weghtng method All terms Feature selecton IDF GR CW RF TM TRR NTW Table 5. Results for problem defnton 1 wth SCGA-ANN Term weghtng method All terms Feature selecton IDF GR CW RF TM TRR NTW Table 6. Results for problem defnton wth SCGA-ANN Term weghtng method All terms Feature selecton IDF GR CW RF TM TRR NTW

11 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 Table 7. Results for problem defnton 1 wth CEA-ANN Term weghtng method All terms Feature selecton IDF 0,673 0, GR 0,711 0, CW 0,739 0, RF 0,8 0,88 1 TM 0,69 0, TRR 0,85 0, NTW 0,813 0, Table 8. Results for problem defnton wth CEA-ANN Term weghtng method All terms Feature selecton IDF 0,655 0, GR 0,647 0, CW 0,634 0,67 91 RF 0,655 0, TM 0,599 0, TRR 0,68 0, NTW 0,675 0, Table 9. Results for problem defnton 1 wth CESCGA-ANN Term weghtng method All terms Feature selecton IDF 0,644 0, GR 0,639 0, CW 0,61 0,66 19 RF 0,64 0, TM 0,586 0, TRR 0,674 0, NTW 0,667 0, Table 10. Results for problem defnton wth CESCGA-ANN Term weghtng method All terms Feature selecton IDF 0,67 0, GR 0,697 0,76 65 CW 0,738 0, RF 0,817 0, TM 0,685 0, TRR 0,8 0,87 45 NTW 0,804 0, Conclusons The text classfcaton problem for natural language call routng was consdered n the paper. Seven dfferent term weghtng methods were appled. As dmensonalty reducton methods, the feature 10

12 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 selecton based on self-confgurng GA s consdered. k-nn, lnear SVM and ANN were used as classfcaton algorthms. The numercal results showed that the most effectve term weghtng s TRR. The most effectve classfcaton algorthm s ANN. Feature selecton wth self-confgurng GA provdes mprovement of classfcaton effectveness and sgnfcant dmensonalty reducton wth all term weghtng methods and wth all classfcaton algorthms. Acknowledgements The reported study was funded by Russan Foundaton for Basc Research, Government of Krasnoyarsk Terrtory, Krasnoyarsk Regon Scence and Technology Support Fund to the research proect References [1] Akhmedova S, Semenkn E and Sergenko R 014 Automatcally generated classfers for opnon mnng wth dfferent term weghtng schemes Informatcs n Control, Automaton and Robotcs (ICINCO 014) pp [] Baharudn B, Lee L H and Khan K 010 A revew of machne learnng algorthms for textdocuments classfcaton Journal of advances n nformaton technology 1(1) pp 4 0 [3] Debole F and Sebastan F 004 Supervsed term weghtng for automated text categorzaton In Text mnng and ts applcatons pp [4] Fox C 1989 A stop lst for general text In ACM SIGIR Forum 4 pp 19 1 [5] Gasanova T, Sergenko R, Akhmedova S, Semenkn E and Mnker W 014 Opnon mnng and topc categorzaton wth novel term weghtng Proceedngs of the 5th Workshop on Computatonal Approaches to Subectvty, Sentment and Socal Meda Analyss, ACL 014 pp [6] Goutte C and Gausser E 005 A probablstc nterpretaton of precson, recall and f-score, wth mplcaton for evaluaton Advances n nformaton retreval pp [7] Han E-H S, Karyps G, and Kumar V 001 Text Categorzaton Usng Weght Adusted k-nearest Neghbor Classfcaton [8] Joachms T 00 Learnng to Classfy Text Usng Support Vector Machnes: Methods, Theory and Algorthms [9] Ko Y 01 A study of term weghtng schemes usng class nformaton for text classfcaton Proceedngs of the 35th nternatonal ACM SIGIR conference on Research and development n nformaton retreval pp [10] Kwon O-W and Lee J-H 003 Text categorzaton based on k-nearest neghbor approach for web ste classfcaton Informaton Processng & Management 39(1) pp 5 44 [11] Lan M, Tan C L, Su J and Lu Y 009 Supervsed and tradtonal term weghtng methods for automatc text categorzaton IEEE Transactons on Pattern Analyss and Machne Intellgence 31(4) pp [1] Lee C, Jung S, Km S and Lee G 009 Example based dalog modelng for practcal mult-doman dalog system Speech Communcaton 51(5) pp [13] Moraru D I, Vntan L N and Tresp V 005 Metaclassfcaton usng SVM classfers for text documents Intl. Jrnl. of Appled Mathematcs and Computer Scences 1(1) [14] Porter M F 001 Snowball: A language for stemmng algorthms [15] Salton G and Buckley C 1988 Term-weghtng approaches n automatc text retreval Informaton processng & management 4(5) pp [16] Sebastan F 00 Machne learnng n automated text categorzaton ACM computng surveys (CSUR) 34(1) pp

13 V Internatonal Workshop on Mathematcal Models and ther Applcatons 016 IOP Publshng IOP Conf. Seres: Materals Scence and Engneerng 173 (017) do: / x/173/1/01008 [17] Semenkn E and Semenkna M 01 Self-confgurng genetc programmng algorthm wth modfed unform crossover 01 IEEE Congress on Evolutonary Computaton [18] Sergenko R, Gasanova T, Semenkn E and Mnker W 014 Text categorzaton methods applcaton for natural language call routng 11 th Internatonal Conference on Informatcs n Control, Automaton and Robotcs (ICINCO 014) pp [19] Sergenko R and Semenkn E 010 Compettve cooperaton for strategy adaptaton n coevolutonary genetc algorthm for constraned optmzaton IEEE Congress on Evolutonary Computaton [0] Soucy P and Mneau G W 005 Beyond tfdf weghtng for text categorzaton n the vector space model IJCAI 5 pp [1] Suhm B, Bers J, McCarthy D, Freeman B, Getty D, Godfrey K and Peterson P 00 A comparatve study of speech n the call center: Natural language call routng vs. touch-tone menus Proceedngs of the SIGCHI conference on Human Factors n Computng Systems pp [] Xu H and L C 007 A novel term weghtng scheme for automated text categorzaton Seventh Internatonal Conference on Intellgent Systems Desgn and Applcatons (ISDA 007) pp [3] Yang J and Honavar V 1998 Feature subset selecton usng a genetc algorthm Feature extracton, constructon and selecton pp [4] Yang Y and Pedersen J O 1997 A comparatve study on feature selecton n text categorzaton ICML 97 pp

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A Novel Term_Class Relevance Measure for Text Categorization

A Novel Term_Class Relevance Measure for Text Categorization A Novel Term_Class Relevance Measure for Text Categorzaton D S Guru, Mahamad Suhl Department of Studes n Computer Scence, Unversty of Mysore, Mysore, Inda Abstract: In ths paper, we ntroduce a new measure

More information

Experiments in Text Categorization Using Term Selection by Distance to Transition Point

Experiments in Text Categorization Using Term Selection by Distance to Transition Point Experments n Text Categorzaton Usng Term Selecton by Dstance to Transton Pont Edgar Moyotl-Hernández, Héctor Jménez-Salazar Facultad de Cencas de la Computacón, B. Unversdad Autónoma de Puebla, 14 Sur

More information

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION

BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION BAYESIAN MULTI-SOURCE DOMAIN ADAPTATION SHI-LIANG SUN, HONG-LEI SHI Department of Computer Scence and Technology, East Chna Normal Unversty 500 Dongchuan Road, Shangha 200241, P. R. Chna E-MAIL: slsun@cs.ecnu.edu.cn,

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints

Sum of Linear and Fractional Multiobjective Programming Problem under Fuzzy Rules Constraints Australan Journal of Basc and Appled Scences, 2(4): 1204-1208, 2008 ISSN 1991-8178 Sum of Lnear and Fractonal Multobjectve Programmng Problem under Fuzzy Rules Constrants 1 2 Sanjay Jan and Kalash Lachhwan

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Efficient Text Classification by Weighted Proximal SVM *

Efficient Text Classification by Weighted Proximal SVM * Effcent ext Classfcaton by Weghted Proxmal SVM * Dong Zhuang 1, Benyu Zhang, Qang Yang 3, Jun Yan 4, Zheng Chen, Yng Chen 1 1 Computer Scence and Engneerng, Bejng Insttute of echnology, Bejng 100081, Chna

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Extraction of Fuzzy Rules from Trained Neural Network Using Evolutionary Algorithm *

Extraction of Fuzzy Rules from Trained Neural Network Using Evolutionary Algorithm * Extracton of Fuzzy Rules from Traned Neural Network Usng Evolutonary Algorthm * Urszula Markowska-Kaczmar, Wojcech Trelak Wrocław Unversty of Technology, Poland kaczmar@c.pwr.wroc.pl, trelak@c.pwr.wroc.pl

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like:

Outline. Self-Organizing Maps (SOM) US Hebbian Learning, Cntd. The learning rule is Hebbian like: Self-Organzng Maps (SOM) Turgay İBRİKÇİ, PhD. Outlne Introducton Structures of SOM SOM Archtecture Neghborhoods SOM Algorthm Examples Summary 1 2 Unsupervsed Hebban Learnng US Hebban Learnng, Cntd 3 A

More information

Available online at Available online at Advanced in Control Engineering and Information Science

Available online at   Available online at   Advanced in Control Engineering and Information Science Avalable onlne at wwwscencedrectcom Avalable onlne at wwwscencedrectcom Proceda Proceda Engneerng Engneerng 00 (2011) 15000 000 (2011) 1642 1646 Proceda Engneerng wwwelsevercom/locate/proceda Advanced

More information

Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study

Arabic Text Classification Using N-Gram Frequency Statistics A Comparative Study Arabc Text Classfcaton Usng N-Gram Frequency Statstcs A Comparatve Study Lala Khresat Dept. of Computer Scence, Math and Physcs Farlegh Dcknson Unversty 285 Madson Ave, Madson NJ 07940 Khresat@fdu.edu

More information

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks

GA-Based Learning Algorithms to Identify Fuzzy Rules for Fuzzy Neural Networks Seventh Internatonal Conference on Intellgent Systems Desgn and Applcatons GA-Based Learnng Algorthms to Identfy Fuzzy Rules for Fuzzy Neural Networks K Almejall, K Dahal, Member IEEE, and A Hossan, Member

More information

CSCI 5417 Information Retrieval Systems Jim Martin!

CSCI 5417 Information Retrieval Systems Jim Martin! CSCI 5417 Informaton Retreval Systems Jm Martn! Lecture 11 9/29/2011 Today 9/29 Classfcaton Naïve Bayes classfcaton Ungram LM 1 Where we are... Bascs of ad hoc retreval Indexng Term weghtng/scorng Cosne

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE Dorna Purcaru Faculty of Automaton, Computers and Electroncs Unersty of Craoa 13 Al. I. Cuza Street, Craoa RO-1100 ROMANIA E-mal: dpurcaru@electroncs.uc.ro

More information

Correlative features for the classification of textural images

Correlative features for the classification of textural images Correlatve features for the classfcaton of textural mages M A Turkova 1 and A V Gadel 1, 1 Samara Natonal Research Unversty, Moskovskoe Shosse 34, Samara, Russa, 443086 Image Processng Systems Insttute

More information

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation

Maximum Variance Combined with Adaptive Genetic Algorithm for Infrared Image Segmentation Internatonal Conference on Logstcs Engneerng, Management and Computer Scence (LEMCS 5) Maxmum Varance Combned wth Adaptve Genetc Algorthm for Infrared Image Segmentaton Huxuan Fu College of Automaton Harbn

More information

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network

A New Token Allocation Algorithm for TCP Traffic in Diffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network A New Token Allocaton Algorthm for TCP Traffc n Dffserv Network S. Sudha and N. Ammasagounden Natonal Insttute of Technology, Truchrappall,

More information

Classic Term Weighting Technique for Mining Web Content Outliers

Classic Term Weighting Technique for Mining Web Content Outliers Internatonal Conference on Computatonal Technques and Artfcal Intellgence (ICCTAI'2012) Penang, Malaysa Classc Term Weghtng Technque for Mnng Web Content Outlers W.R. Wan Zulkfel, N. Mustapha, and A. Mustapha

More information

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET

BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET 1 BOOSTING CLASSIFICATION ACCURACY WITH SAMPLES CHOSEN FROM A VALIDATION SET TZU-CHENG CHUANG School of Electrcal and Computer Engneerng, Purdue Unversty, West Lafayette, Indana 47907 SAUL B. GELFAND School

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

An Evolvable Clustering Based Algorithm to Learn Distance Function for Supervised Environment

An Evolvable Clustering Based Algorithm to Learn Distance Function for Supervised Environment IJCSI Internatonal Journal of Computer Scence Issues, Vol. 7, Issue 5, September 2010 ISSN (Onlne): 1694-0814 www.ijcsi.org 374 An Evolvable Clusterng Based Algorthm to Learn Dstance Functon for Supervsed

More information

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB V. Hotař, A. Hotař Techncal Unversty of Lberec, Department of Glass Producng Machnes and Robotcs, Department of Materal

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Network Intrusion Detection Based on PSO-SVM

Network Intrusion Detection Based on PSO-SVM TELKOMNIKA Indonesan Journal of Electrcal Engneerng Vol.1, No., February 014, pp. 150 ~ 1508 DOI: http://dx.do.org/10.11591/telkomnka.v1.386 150 Network Intruson Detecton Based on PSO-SVM Changsheng Xang*

More information

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm

Recommended Items Rating Prediction based on RBF Neural Network Optimized by PSO Algorithm Recommended Items Ratng Predcton based on RBF Neural Network Optmzed by PSO Algorthm Chengfang Tan, Cayn Wang, Yuln L and Xx Q Abstract In order to mtgate the data sparsty and cold-start problems of recommendaton

More information

Classification / Regression Support Vector Machines

Classification / Regression Support Vector Machines Classfcaton / Regresson Support Vector Machnes Jeff Howbert Introducton to Machne Learnng Wnter 04 Topcs SVM classfers for lnearly separable classes SVM classfers for non-lnearly separable classes SVM

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

A Misclassification Reduction Approach for Automatic Call Routing

A Misclassification Reduction Approach for Automatic Call Routing A Msclassfcaton Reducton Approach for Automatc Call Routng Fernando Uceda-Ponga 1, Lus Vllaseñor-Pneda 1, Manuel Montes-y-Gómez 1, Alejandro Barbosa 2 1 Laboratoro de Tecnologías del Lenguaje, INAOE, Méxco.

More information

Multiclass Object Recognition based on Texture Linear Genetic Programming

Multiclass Object Recognition based on Texture Linear Genetic Programming Multclass Object Recognton based on Texture Lnear Genetc Programmng Gustavo Olague 1, Eva Romero 1 Leonardo Trujllo 1, and Br Bhanu 2 1 CICESE, Km. 107 carretera Tjuana-Ensenada, Mexco, olague@ccese.mx,

More information

Collaboratively Regularized Nearest Points for Set Based Recognition

Collaboratively Regularized Nearest Points for Set Based Recognition Academc Center for Computng and Meda Studes, Kyoto Unversty Collaboratvely Regularzed Nearest Ponts for Set Based Recognton Yang Wu, Mchhko Mnoh, Masayuk Mukunok Kyoto Unversty 9/1/013 BMVC 013 @ Brstol,

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Backpropagation: In Search of Performance Parameters

Backpropagation: In Search of Performance Parameters Bacpropagaton: In Search of Performance Parameters ANIL KUMAR ENUMULAPALLY, LINGGUO BU, and KHOSROW KAIKHAH, Ph.D. Computer Scence Department Texas State Unversty-San Marcos San Marcos, TX-78666 USA ae049@txstate.edu,

More information

Review of approximation techniques

Review of approximation techniques CHAPTER 2 Revew of appromaton technques 2. Introducton Optmzaton problems n engneerng desgn are characterzed by the followng assocated features: the objectve functon and constrants are mplct functons evaluated

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

Face Recognition Based on SVM and 2DPCA

Face Recognition Based on SVM and 2DPCA Vol. 4, o. 3, September, 2011 Face Recognton Based on SVM and 2DPCA Tha Hoang Le, Len Bu Faculty of Informaton Technology, HCMC Unversty of Scence Faculty of Informaton Scences and Engneerng, Unversty

More information

A Method of Hot Topic Detection in Blogs Using N-gram Model

A Method of Hot Topic Detection in Blogs Using N-gram Model 84 JOURNAL OF SOFTWARE, VOL. 8, NO., JANUARY 203 A Method of Hot Topc Detecton n Blogs Usng N-gram Model Xaodong Wang College of Computer and Informaton Technology, Henan Normal Unversty, Xnxang, Chna

More information

A Serial and Parallel Genetic Based Learning Algorithm for Bayesian Classifier to Predict Metabolic Syndrome

A Serial and Parallel Genetic Based Learning Algorithm for Bayesian Classifier to Predict Metabolic Syndrome A Seral and Parallel Genetc Based Learnng Algorthm for Bayesan Classfer to Predct Metabolc Syndrome S. Dehur Department of Informaton and Communcaton Technology Fakr Mohan Unversty, Vyasa Vhar Balasore-756019,

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1

A New Feature of Uniformity of Image Texture Directions Coinciding with the Human Eyes Perception 1 A New Feature of Unformty of Image Texture Drectons Concdng wth the Human Eyes Percepton Xng-Jan He, De-Shuang Huang, Yue Zhang, Tat-Mng Lo 2, and Mchael R. Lyu 3 Intellgent Computng Lab, Insttute of Intellgent

More information

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Positive Semi-definite Programming Localization in Wireless Sensor Networks Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer

More information

Using Neural Networks and Support Vector Machines in Data Mining

Using Neural Networks and Support Vector Machines in Data Mining Usng eural etworks and Support Vector Machnes n Data Mnng RICHARD A. WASIOWSKI Computer Scence Department Calforna State Unversty Domnguez Hlls Carson, CA 90747 USA Abstract: - Multvarate data analyss

More information

Deep Classification in Large-scale Text Hierarchies

Deep Classification in Large-scale Text Hierarchies Deep Classfcaton n Large-scale Text Herarches Gu-Rong Xue Dkan Xng Qang Yang 2 Yong Yu Dept. of Computer Scence and Engneerng Shangha Jao-Tong Unversty {grxue, dkxng, yyu}@apex.sjtu.edu.cn 2 Hong Kong

More information

Using an Automatic Weighted Keywords Dictionary for Intelligent Web Content Filtering

Using an Automatic Weighted Keywords Dictionary for Intelligent Web Content Filtering Journal of Advances n Computer Research Quarterly pissn: 2345-606x eissn: 2345-6078 Sar Branch, Islamc Azad Unversty, Sar, I.R.Iran (Vol. 6, No. 1, February 2015), Pages: 101-114 www.jacr.ausar.ac.r Usng

More information

A Notable Swarm Approach to Evolve Neural Network for Classification in Data Mining

A Notable Swarm Approach to Evolve Neural Network for Classification in Data Mining A Notable Swarm Approach to Evolve Neural Network for Classfcaton n Data Mnng Satchdananda Dehur 1, Bjan Bhar Mshra 2 and Sung-Bae Cho 1 1 Soft Computng Laboratory, Department of Computer Scence, Yonse

More information

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation

An Iterative Solution Approach to Process Plant Layout using Mixed Integer Optimisation 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 An Iteratve Soluton Approach to Process Plant Layout usng Mxed

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION

A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION 1 THE PUBLISHING HOUSE PROCEEDINGS OF THE ROMANIAN ACADEMY, Seres A, OF THE ROMANIAN ACADEMY Volume 4, Number 2/2003, pp.000-000 A PATTERN RECOGNITION APPROACH TO IMAGE SEGMENTATION Tudor BARBU Insttute

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Intra-Parametric Analysis of a Fuzzy MOLP

Intra-Parametric Analysis of a Fuzzy MOLP Intra-Parametrc Analyss of a Fuzzy MOLP a MIAO-LING WANG a Department of Industral Engneerng and Management a Mnghsn Insttute of Technology and Hsnchu Tawan, ROC b HSIAO-FAN WANG b Insttute of Industral

More information

Image Feature Selection Based on Ant Colony Optimization

Image Feature Selection Based on Ant Colony Optimization Image Feature Selecton Based on Ant Colony Optmzaton Lng Chen,2, Bolun Chen, Yxn Chen 3, Department of Computer Scence, Yangzhou Unversty,Yangzhou, Chna 2 State Key Lab of Novel Software Tech, Nanng Unversty,

More information

Spam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection

Spam Filtering Based on Support Vector Machines with Taguchi Method for Parameter Selection E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton We-Chh Hsu, Tsan-Yng Yu E-mal Spam Flterng Based on Support Vector Machnes wth Taguch Method for Parameter Selecton

More information

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures

A Novel Adaptive Descriptor Algorithm for Ternary Pattern Textures A Novel Adaptve Descrptor Algorthm for Ternary Pattern Textures Fahuan Hu 1,2, Guopng Lu 1 *, Zengwen Dong 1 1.School of Mechancal & Electrcal Engneerng, Nanchang Unversty, Nanchang, 330031, Chna; 2. School

More information

Feature Selection for Target Detection in SAR Images

Feature Selection for Target Detection in SAR Images Feature Selecton for Detecton n SAR Images Br Bhanu, Yngqang Ln and Shqn Wang Center for Research n Intellgent Systems Unversty of Calforna, Rversde, CA 95, USA Abstract A genetc algorthm (GA) approach

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*

More information

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval

Description of NTU Approach to NTCIR3 Multilingual Information Retrieval Proceedngs of the Thrd NTCIR Workshop Descrpton of NTU Approach to NTCIR3 Multlngual Informaton Retreval Wen-Cheng Ln and Hsn-Hs Chen Department of Computer Scence and Informaton Engneerng Natonal Tawan

More information

Incremental Learning with Support Vector Machines and Fuzzy Set Theory

Incremental Learning with Support Vector Machines and Fuzzy Set Theory The 25th Workshop on Combnatoral Mathematcs and Computaton Theory Incremental Learnng wth Support Vector Machnes and Fuzzy Set Theory Yu-Mng Chuang 1 and Cha-Hwa Ln 2* 1 Department of Computer Scence and

More information

Feature Selection as an Improving Step for Decision Tree Construction

Feature Selection as an Improving Step for Decision Tree Construction 2009 Internatonal Conference on Machne Learnng and Computng IPCSIT vol.3 (2011) (2011) IACSIT Press, Sngapore Feature Selecton as an Improvng Step for Decson Tree Constructon Mahd Esmael 1, Fazekas Gabor

More information

EVALUATION OF THE PERFORMANCES OF ARTIFICIAL BEE COLONY AND INVASIVE WEED OPTIMIZATION ALGORITHMS ON THE MODIFIED BENCHMARK FUNCTIONS

EVALUATION OF THE PERFORMANCES OF ARTIFICIAL BEE COLONY AND INVASIVE WEED OPTIMIZATION ALGORITHMS ON THE MODIFIED BENCHMARK FUNCTIONS Academc Research Internatonal ISS-L: 3-9553, ISS: 3-9944 Vol., o. 3, May 0 EVALUATIO OF THE PERFORMACES OF ARTIFICIAL BEE COLOY AD IVASIVE WEED OPTIMIZATIO ALGORITHMS O THE MODIFIED BECHMARK FUCTIOS Dlay

More information

Face Recognition University at Buffalo CSE666 Lecture Slides Resources:

Face Recognition University at Buffalo CSE666 Lecture Slides Resources: Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Impact of a New Attribute Extraction Algorithm on Web Page Classification

Impact of a New Attribute Extraction Algorithm on Web Page Classification Impact of a New Attrbute Extracton Algorthm on Web Page Classfcaton Gösel Brc, Banu Dr, Yldz Techncal Unversty, Computer Engneerng Department Abstract Ths paper ntroduces a new algorthm for dmensonalty

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

An Improved Image Segmentation Algorithm Based on the Otsu Method

An Improved Image Segmentation Algorithm Based on the Otsu Method 3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques Enhancement of Infrequent Purchased Product Recommendaton Usng Data Mnng Technques Noraswalza Abdullah, Yue Xu, Shlomo Geva, and Mark Loo Dscplne of Computer Scence Faculty of Scence and Technology Queensland

More information