Fuzzy Modeling for Multi-Label Text Classification Supported by Classification Algorithms

Size: px
Start display at page:

Download "Fuzzy Modeling for Multi-Label Text Classification Supported by Classification Algorithms"

Transcription

1 Journal of Computer Senes Orgnal Researh Paper Fuzzy Modelng for Mult-Label Text Classfaton Supported by Classfaton Algorthms 1 Beatrz Wlges, 2 Gustavo Mateus, 2 Slva Nassar, 2 Renato Cslagh and 3 Rogéro Cd Bastos 1 Department of Informaton Systems, Federal Unversty of Santa Catarna, Floranópols, Brazl 2 Department of Informats and Statst, Federal Unversty of Santa Catarna, Floranópols, Brazl 3 Department of Engneerng and Knowledge Management, Federal Unversty of Santa Catarna, Floranópols, Brazl Artle hstory Reeved: Revsed: Aepted: Correspondng Author: Beatrz Wlges Department of Informaton Systems, Federal Unversty of Santa Catarna, Floranópols, Brazl Emal: beatrz.wlges@ufs.br Abstrat: The ever-nreasng amount of nformaton on the Web s organzed n strutured, sem-strutured and unstrutured data. Text lassfaton systems, apable of handlng suh dfferent strutures, may faltate the work of mportant tasks suh as ndexaton and nformaton retreval n searh engnes. The objetve of ths researh s to develop a method for the lassfaton of douments nto multple ategores wth fuzzy log. Ths method was bult from a proess of pattern reognton and, also, two varables alled smlarty and auray were used. The proposed fuzzy lassfaton method uses varables that express the ablty to analyze the smlarty and auray of a doument through a database of terms. The database of terms s generated by a olleton of pre-lassfed douments n ategores of nterest. The douments proessed aordng to the smlarty and auray n the database of terms omposes a tranng set also alled knowledge base. From ths database, t s possble to dentfy a pattern that spefes a set of rules through a knowledge dsovery proess. Ths proess nvolves the data mnng of the knowledge base. Thus, t was possble to defne a general model that s used n the reaton of rules and membershp funtons of the fuzzy model for the lassfaton of douments nto multple ategores. The general model of the rules dentfed n the data mnng proess and mplemented n fuzzy model onsders the most sgnfant varables and also ontrbutes to the spefaton of the membershp funtons, suh as the defnton of lngust terms of fuzzy sets. Thus, t was possble to mplement a more determnst approah regardng the nput, membershp funtons and nferene rules of the fuzzy model. The results of the proposed method for lassfaton of douments are relevant beause they have a satsfatory auray rate. Keywords: Text Categorzaton, Deson Tree, Fuzzy Modelng Introduton The volume of nformaton avalable s nreasng sgnfantly over the years, whh means that people have more aess to knowledge from any eletron deve over the Internet. Thus, automat ategorzaton systems play an mportant role n the text lassfaton proess, by assstng n nformaton retreval proesses. Aordng to Yang and Pedersen (1997) the proess of retrevng douments n properly lassfed databases s more effent and the searh sope s redued even f a large volume of nformaton s avalable. L et al. (2011) further state that the goal of text lassfaton s to label textual douments wth themat lasses from a predefned set. Also aordng to these authors, many dfferent methods have been appled to text lassfaton tasks nludng the K-Nearest Neghbor (KNN) approah (Bang et al., 2006; Tan, 2006), Naïve Bayesan approahes (Baker and MCallum, 1998; Lews, 1998; Yang and Lu, 1999), support vetor mahne (Dumas and Chen, 2000) and deson trees (Lews and Rnguette, 1994; Qunlan, 1993). However, most mahne learnng algorthms were desgned for sngle-label lassfaton n whh a doument an only belong to one ategory (Jang et al., 2012). In mult-label text lassfaton, a doument an belong to more than one ategory. Therefore, the fous of 2016 Beatrz Wlges, Gustavo Mateus, Slva Nassar, Renato Cslagh and Rogéro Cd Bastos. Ths open aess artle s dstrbuted under a Creatve Commons Attrbuton (CC-BY) 3.0 lense.

2 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp ths researh s on mult-label text lassfaton assoated wth fuzzy models. The goal of ths researh s to analyze a mult-label text lassfaton proess, whh generated a fuzzy model. In the researh (Peters and Koster, 2003) was used a new rteron for Term Seleton, whh s based on the unertanty n Term Frequeny aross ategores. Ths proposal nvolved onstrutng a database of terms organzed by ategory, where lassfaton algorthms were appled to generate Deson Trees (DT) from several text lassfaton tests. Wth these DT, t was possble to generate a lned up fuzzy model apable of dsplayng the relevane degrees of the analyzed text for eah ategory. Thus, a database was bult for eah ategory usng unstrutured text extrated from the Web, mostly onlne magaznes and newspapers. Ths database was organzed nto four ategores nvolvng Eduaton, Tehnology, Sports and Eonomy. In ths researh, the most frequent terms n eah ategory were onsdered. Ths artle s organzed as follows: Intally, the onstruton of the base of terms from dfferent soures wll be presented, wth an estmate of approxmately 3000 words have been used for eah ategory. In sequene, the methodology of the lassfaton proess was detaled as well as ts varables and ndators. Followng, the lassfaton proess and ther results usng Deson Tree algorthms (DT) are presented. The fuzzy model bult from the lassfaton proess s presented next and fnally, onsderatons and further analyss on fuzzy modelng are suggested. General Organzaton Model The proposed model wll allow a olleton of douments from the OM to be lassfed aordng to ategores of nterest to the organzaton. For ths, a method of doument lassfaton that onsders multple ategores s appled. To perform the mplementaton of ths organzatonal model, we developed a method of doument lassfaton. Ths method onsders the hgh dmensonalty nvolved n the lassfaton of douments,.e., as a doument ontans lots of nformaton that an be represented n several lasses, t s ategorzed n multple ategores. Fgure 1 shows the steps of the development of the doument organzaton model based on ths method. Also at ths stage the varables are defned and the data to a knowledge base s generated wth these varables. Ths database s bult to allow the knowledge dsovery proess fnd assoatons between varables n order to desgn rules for the fuzzy modelng. So part of these atvtes nvolve a proess of Knowledge Dsovery n Databases (KDD), spefally n a learnng set dvded nto tranng and testng. The proessng on ths tranng set s done by rankng algorthms. The evaluaton of the model generated n ths phase s done by alulatng the auray and error. Both auray and error of the model onsder the results of the mathes between the real lass and the expeted lass. The proposed method onsders a proess of Knowledge Dsovery (KDD) on the rules that are mplemented n a fuzzy modelng. The use of fuzzy log deals wth overlappng ategores n a doument handlng the vagueness of the nput varables. The struture of the doument organzaton model s shown n Fg. 2. The applaton of the model s made from a preproessed doument, whose smlarty and onfdene are alulated aordng to the defnton of a database of terms. These two varables are the nputs of the fuzzy modelng. Thus, a olleton of douments of an organzaton an be ategorzed by assgnng ts relevane to eah ategory dentfed as of nterest to the organzaton. Fg. 1. Model development 342

3 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp Fg. 2. Model applaton Buldng the Term Database After seletng a olleton of douments belongng to the same ategory, the stage of text preparaton ommened. Ths step nvolves ertan tehnques that faltate the proess of seletng textual features. The proposal n order to buld the database (DB) was to dentfy all words that best express the haratersts of a gven ategory. Ths DB s onstruted based on the most ommon terms n all douments of the same ategory. In ths ase, the doument database should be large enough to over most terms that defne the ategory. From that DB t s possble to defne f any text belongs to a ategory n the DB. Pre-proessng s a very mportant stage n the text lassfaton, nludng several steps to transform the set of douments, n natural language, nto a lst of useful terms and n a format whh s ompatble for knowledge extraton. In ths study, a tool alled RapdMner (2016) was used to perform both preproessng and DT generaton. Doument proessng frstly nvolved the Extrat Content operator, whh extrats the textual ontent of an HTML doument and yelds extrated text bloks from the doument. Later, the tokenze operator dvdes the text of a doument n a sequene of tokens. Further, the Transform Cases operator transforms all haraters of a doument n lowerase or upperase. Fnally, the Flter Stop words (Dtonary) operator removes all the words that belong to a lst of stop-words, whh s loaded from a fle on the operator tself. The stop-words lst ontans words that are not relevant to ountng the most ommon terms n the text doument; therefore, the lst ontans grammatal words that do not affet the defnton of the ategory of the text. Fgure 3 shows a part of the DB onstruted from doument proessng. In Fg. 3 t s observed that, for every text doument, all terms n respet to ther total ourrene n the text (total ourenes) and whh doument the term was extrated from (doument ourenes) were analyzed. At ths stage of the text doument proessng, studes and heks have been onduted to dentfy the best way to mplement ths proess. It was onsdered a aseomparson study utlzng both seral and parallel paradgms n the text proessng (Wlges et al., 2014). In the results, t was observed the hgher effeny of seralzed proessng. Ths s beause the text fles used n proessng dd not have a sze on the order of petabytes. In ths researh t s lear that beng able to defne whether a text belongs to a ategory s not enough. It s mportant to know the auray of the analyss durng the deson makng proess. Hene, a fuzzy model that ould onsder the membershp degree of a text doument to a partular ategory was developed. Therefore, all terms of eah ategory from the DB were ordered and normalzed to the hghest and lowest frequeny for eah ategory. Thus, an ndex alled j relevane degree ( w ) j The relevane degree ( w ) RD was obtaned for eah word. RD of eah word (w) must be alulated for all ategores (), whh are omposed by many text douments. Thus, the weght of the terms n (DB) for eah ategory, known as the relevane degree j of the term( RD w ), s alulated as follows: RD j w Where: j RD w f ( w,j) = (1) max f ( w z,j) f ( w,j) max( f ( w )) z,j ( ) = Relevane degree of the term w n the ategory j = Term frequeny w n the ategory j = Hghest frequeny represented by the term w z n the ategory j 343

4 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp relaton to ther relevane degree for eah ategory n DB. For eah evaluated word n the analyzed text the respetve RD has been observed: w V = f * RD (2) w w w where, V w represents a value for eah word alulated as a funton of frequeny RD f the word. From the w of V w fw and the relevane degree V of eah word the average w from the analyzed text has been extrated: AV nw 1 = Vw nw Σ (3) =1 Fg. 3. DB wth terms of a ategory For eah of the analyzed ategores n the doument organzaton model, the database should store keys to dentfy the doument, to dentfy the ategory, the representaton of the term, the frequeny of the term n the ategory and the relevane degree n the ategory. The purpose of the terms database s to dentfy all the words that best express the haratersts of a gven ategory. Ths database s bult aordng to of the most ommon terms between the varous douments of the same ategory. Thus, Defnton of varables f a partular ategory has at least 100 thousand words, the desgned database should have a mnmum sample sze of 380 words per ategory, onsderng a samplng error margn of 5% wth a 95% onfdene level. Defnton of Varables Many studes related to Informaton Retreval (IR) work wth models of doument ndexng, doument representaton and smlarty measures from retreved douments. In ths researh, the onepts of smlarty and auray were used. That s, smlarty S between the partes nvolved: The Parsed Text () and the texts from the (DB). In turn, auray A measured the relablty of results presented by smlarty. The Parsed Text () went through the same text doument preparaton proess presented prevously: Extraton, leanup, stop-words removal and harater ase transformaton. From the frequeny of terms n, multplaton by the relevane degree of the term w n eah ategory from DB w was performed. Equaton 2 expresses the relatonshp between the terms of () and the terms of (DB), where fw s frequeny and RD w s the relevane degree of the term for a spef ategory of the DB. TheV stores the frequeny values of n w C Also, the average has been extrated from all RD of eah word of a gven relevane degrees ( w ) ategory from the database: ARD = DB nw 1 RDw nw Σ (4) =1 The smlarty S of the Parsed Text () regardng to the DB Texts s alulated aordng to the average AV T of, alulated by Equaton 3. values ( P ) The AVs dvded by the Average Degree ( DB) terms for eah ategory of the DB. Thus, through Equaton 5: DB ARD of S s obtaned AV S = (5) ARD Confdene s the number of terms of the analyzed text () orrespondng to the database (DB) for eah ategory. Thus the alulaton s represented by Equaton 6: A ( nw nw ) = 1 (6) nw C DB DB Where: nw DB = Total terms for eah ategory of DB nwdb = Total terms whh are ommon between the DB and the analyzed text () = Total terms from the analyzed text () nw Classfyng Tranng Douments A set of test douments, wthn the ategores spefed n the terms database, s seleted to buld a knowledge base on the results of the alulaton of smlarty varables and auray. Thus, the whole set of ntal test douments 344

5 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp passes through the preproessng step, just as the douments that generated the database of terms. After ths step, the vetor representng the test doument s proessed and values of smlarty and auray for eah ategory under onsderaton are obtaned. Furthermore, the pror ndex of the ategory n eah test doument s onsdered,.e., the ategory from whh the doument has been extrated, s ntally dentfed. Therefore, eah analyzed text has the smlarty( S ) C and auray( A ) taken regardng to all ategores from the DB. Table 1 presents the results of ths proess of extraton from S and A onduted on two texts. The C attrbute belongs used n the table orresponds to the atual text lassfaton based on ther ategory. For example, Text 1 belongs to the ategory Eonomy, thus smlarty and auray were evaluated and assgned to eah of the four ategores stored n the database, aordng to the aforementoned equatons. In all results, the Smlarty equaton ponted exatly whh ategory the text belonged to, wthout error. Auray, as shown by the results, has supported Smlarty when t was not the hghest value, beng at least the seond hghest resultng value. These results were arred for the lassfaton of 352 texts and were onsdered satsfatory to desgn a fuzzy model. To onstrut the fuzzy model all results of smlarty and auray were used from the analyzed texts, aordng to the data presented n Table 1. Data Classfaton Proess From the results shown n Table 1, there was a proess of lassfaton for the 352 texts. The proess of data lassfaton uses Deson Tree (DT) algorthms and ams to provde a more determnst approah to the onstruton of the fuzzy model. The DT s bult onsderng the onept of entropy, whh measures the nformaton level of an attrbute. The smaller the entropy value, the lower the unertanty and the most useful the attrbute s for lassfaton. Fgure 4 shows the tree generated under the tested dataset by applyng the ID3 lassfaton algorthm (Qunlan, 1986) n the RapdMner tool. On ths DT the values of the results from auray and smlarty were normalzed to perentages. Fgure 4 shows the frst analyss of the mnng proess, wth lowest entropy n the smlarty attrbute already ategorzed. The results of varables smlarty and auray were lustered nto three groups: Hgh, medum and low. Due to ths onstruton, the lngust varables nlude hgh, medum and low for both the smlarty and auray attrbutes. That s two thresholds were establshed for eah varable (smlarty and auray). Thus, the lmts that lassfy the results nto hgh, medum and low were obtaned. In Fg. 5 we present the results of the proessng from the seond DT algorthm, C4.5 (Qunlan, 1993), to buld the tree. Fg. 4. Result of the ID3 proess Fg. 5. Result of the C4.5 algorthm Table 1. Results of text analyss Smlarty Auray Belong Text Category (S) (A) (B) 2 Eonomy 1 1 yes 2 Eduaton 0,04 0,09 no 2 Tehnology 0 0 no 2 Sport 0,1 0,09 no 3 Eonomy 1 1 yes 3 Eduaton 0,08 0,8 no 3 Tehnology 0 0 no 3 Sport 0,34 0,85 no The proposal of lassfyng data was performed to adjust, n the best possble way, the membershp funtons and ths was only possble through the DT n Fg. 5. A fuzzy model was generated aordng to ths tree. It s observed that, n ths tree, when the Smlarty s Hgh (SH) and the Auray s Hgh (AH) the text Belongs (BY) to a ategory wth, at least, 85% of ertanty. In ases where the Smlarty s Hgh (SH) and the Auray s Medum (AM) the text belongs (BY) wth about 50% of ertanty to the ategory and onsderng the medum (SM) and low (SL) smlartes, the text does not belong (BN) to the ategory wth ertanty lose to 100%. Thus, the membershp funtons and fuzzy model rules have been adjusted to meet ths purpose. 345

6 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp Table 2. Performane evaluaton matrx of the DT model Real lass Correspondene matrx Yes No Expeted lass Yes 26 6 No 8 88 The model verfaton was onduted wth a set of 32 test douments (Fg. 5). Ths model s represented by the DT that was generated by proessng the tranng set. The results presented by the matrx n Table 2 evaluate the performane of the DT model. The auray and the error of ths model onsder the results of the mathes between the real lass and the expeted lass observed n Table 2 and followed the defntons of the funtons as presented n the methodology of ths study. Thus, the alulaton of the auray from the model resulted n 89%. Ths result was onsdered adequate for the mplementaton of the rules n the fuzzy model based on the DT n Fg. 5. Buldng the Fuzzy Model The proposal of the fuzzy model for text lassfaton n multple ategores s relevant beause t an handle the mpreson of knowledge omng from gaps and blanks present n the dataset. Moreover, fuzzy log s able to summarze the results of a lassfaton onsderng two nputs (smlarty and auray) nto an output wth level of relevane. In ths study, MATLAB (2016) was used to buld and smulate the Fuzzy Model. The MATLAB platform s optmzed for solvng engneerng and sentf problems, moreover t has a Fuzzy Log Toolbox that provdes funtons, apps and a Smulnk blok for analyzng, desgnng and smulatng systems based on fuzzy log. The fuzzy modelng s omposed of a fuzzfaton proess, a knowledge base and a defuzzfaton proess. The knowledge base s represented by the rules, nferene engnes and aggregaton funtons. The struture of the desrbed fuzzy modelng s shown n Fg. 6. The fuzzy model was bult based on the results of the DT that allowed the reaton of a more prese approah for both the nput varables as n the rule base. From that, the fuzzy model was able to treat the unertanty n the lassfaton proess of texts. The nput funton alled smlarty was developed wth the lngust varables hgh, medum and low and the nput funton auray was bult wth only the hgh and medum lngust varables as presented n the lassfaton proess of the DT. The output funton alled belong was developed wth the lngust varables yes, maybe and no. All membershp funtons have been mplemented wth Gaussan dstrbuton funtons, whh have a smple urve. In partular gaussmf funton was used, whh has a bell urve wth maxmum of 1 and mnmum at 0. Fgure 7 shows the overvew of the fuzzy model. Fg. 6. The struture of the fuzzy modelng Fg. 7. Fuzzy modelng n text lassfaton The ontroller of the nferene engne used was Mamdan, mplemented wth the mnmum operator and the defuzzfaton method was the smallest of maxmum. Aordng to Wang (1997), defuzzfaton s the reverse proedure of fuzzfaton. In ths phase, we desgn a mappng from a fuzzy set to a rsp value. There are several defuzzfers, suh as Center of Gravty (Centrod), Center Average, Smallest of Maxma (SoM), Mean of Maxma (MoM) and Largest of Maxma (LoM). In ths study we have used the Smallest of Maxma (SoM) as the defuzzfaton method. Aordng (Yagunuma et al., 2013), when usng fuzzy DL wth onept defntons, the defuzzfaton queres avalable are SoM, MoM, LoM, whh onsder the extremes of maxmum degree. Dependng on the stuaton, they may lose nformaton ompared wth other defuzzfaton methods that are based on the shape of the fuzzy set. In the experments of ths study the results wth the SoM method were more aurate. 346

7 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp The Smallest of Maxma (SoM) represents the hoe of smallest pont of the unverse wth the hghest degree of relevane. We alulate the So Musng the followng equaton: µ = mn( x ), suh thatϕ ( x ) = ϕ SoM A where, ϕ A s the membershp funton of set A. Four nferene rules that over all lngust varables were defned, eah rule onssts of the operator and assoated wth the method mnmum. The aggregaton of the rules s made by the method maxmum. Fgure 8 presents the rule base. The rules shown n Fg. 8 were bult aordng to the defntons presented n the C4.5 deson tree algorthm (Fg. 5). Thus, the membershp funtons of the fuzzy model were adjusted so that the output ould orrespond to the values presented by the DT, aordng to Fg. 9. max, Fg. 8. Fuzzy modelng rules n text lassfaton Fg. 9. Fuzzy model rules adjusted by the deson tree The results n fuzzy model are relevant beause they show the atvaton of the membershp funton n at least two ponts for eah of the nput funtons. Fgure 10 provdes the fuzzy ategorzaton when the smlarty s hgh and auray s hgh, showng smlar results to that of the DT. Also, when the smlarty nput s hgh and auray s medum. Results It s known that there s, usually, a predomnant ategory when performng the lassfaton proess of the doument. Yet, there are other relevant values assoated wth the doument for eah of the ategores onsdered n the doman. These values vary n a range of [0,1]. The results of the model, through the fuzzy lassfer, wth ther respetve nputs for auray and smlarty to eah ategory are shown n Table 3. These results show the synthess of the defuzzfed output from the fuzzy modelng for eah entry. The overall assessment was performed n relaton to the alloaton of the defuzzfed output value for the ategory n the orgnal doument ndexng. In some ases, t s lear that even f the smlarty s relatvely hgh, f the auray does not have a sgnfant value to the ategory, the result of fuzzy output s also not favorable. The fuzzy model prortzes the ombnaton of the values of both of the most sgnfant ndators (smlarty and auray) n the result set. Table 4 shows the summary of the overall performane, onsderng a set of 97 douments n relaton to the pror ndexng of the doument to a partular ategory and the fuzzy output generated by the model. Aordng to Table 4 the ht rate was about 78%, that s, from the 97 douments of the tranng set, 75 had the defuzzfed output value orrespondng to the orgnal ndexng. Table 3. Results of text analyss wth fuzzy output Smlarty Auray Belong Fuzzy Text Category (S) (A) (B) out 2 Eonomy 1 1 yes 0,877 2 Eduaton 0,04 0,09 no 0,121 2 Tehnology 0 0 no 0,265 2 Sport 0,1 0,09 no 0,123 3 Eonomy 1 1 yes 0,877 3 Eduaton 0,08 0,8 no 0,467 3 Tehnology 0 0 no 0,117 3 Sport 0,34 0,85 no 0,490 Fg. 10. Results after applyng the frst and seond rules n the fuzzy model Table 4. General results Indexed ategory Sore Yes 76 No 21 Total

8 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp Conluson In ths researh, a database of terms wth four dfferent ategores of texts was onstruted. Eah term was normalzed wthn ther ategory n order to evaluate ther degree of mportane wthn t. A mult-label text lassfaton proess was bult based on two onepts, smlarty and auray. In the prelmnary results, the model has worked onsstently as expeted whle keepng orrespondene between the text and ts ategory. The most experments n text lassfaton have been made from statstal tehnques. Consderng the advantages of text lassfaton, the deal would be that more experments should be desgned to treat other languages, whle takng nto aount the peulartes of eah, espeally regardng the semant analyss of texts. Aordng to (Mannng et al., 2008) the statstal text lassfaton requre a number of good example douments for eah lass. Furthermore, the need for manual lassfaton sn't elmnated beause the tranng douments ome from a person who has labeled them. In order to nrease the effetveness of the presented results by the statstal analyss of ths researh, a way to mplement a fuzzy model was studed, enablng t to nform the relevane degree of the text to eah ategory. Deson tree algorthms were used to buld a fuzzy model that ould math the settngs of results n a database. Ths approah has served ts purpose, beause the onstruted model represents losely the results aheved wth the analyss. Moreover, t was possble to generate output ndatng the relevane n whh the analyzed text belonged to a gven ategory. Text lassfaton problems are well represented when solutons nvolve fuzzy models, sne lassfaton meets the requred expetatons and also ndate what s the. Aknowledgement We aknowledge the Federal Unversty of Santa Catarna and the Engneerng and Knowledge Management Program for provdng the the opportunty to develop ths researh. Fundng Informaton Authors would lke to thank the Mnstry of Eduaton for provdng the fnanal support dretly to the Engneerng and Knowledge Management Program to whh the researh s lnked. Author s Contrbutons Beatrz Wlges: Man researher n the projet. Desgned the researh and developed the proposed solutons. Responsble for the wrtng of the majorty of the paper. Gustavo Mateus: Prepared the workflow for the experments, drew llustratons and also organzed data of the tables. Slva Nassar: Organzed the wrtng and struture of the manusrpt. Renato Cslagh: Responsble for edtng some parts of the paper and ommentng researh deas. Rogéro Cd Bastos: Researh advsor. Supervson and montorng of the researh. Eths Ths artle s orgnal and ontans unpublshed materals. The orrespondng author onfrms that all other authors have read and approved the manusrpt and there are no ethal ssues nvolved. Referenes Baker, L.D. and A.K. MCallum, Dstrbutonal lusterng of words for text lassfaton. Proeedngs of the 21st Annual Internatonal ACM SIGIR Conferene on Researh and Development n Informaton Retreval, Aug , Melbourne, Australa, pp: DOI: / Bang, S.L., J.D. Yang and H.J. Yang, Herarhal doument ategorzaton wth k-nn and oneptbased thesaur. Inform. Proess. Manage., 42: DOI: /j.pm Dumas, S. and H. Chen, Herarhal lassfaton of Web ontent. Proeedngs of the 23rd Annual Internatonal Conferene on Researh and Development n Informaton Retreval, Jul , Athens, Greee, pp: DOI: / Lews, D.D. and M. Rnguette, Comparson of two learnng algorthms for text ategorzaton. Proeedngs of the 3rd Annual Symposum on Doument Analyss and Informaton Retreval, (AIR 94), pp: Lews, D. D, Nave (Bayes) at forty: The ndependene assumpton n nformaton retreval. European onferene on mahne learnng. Sprnger Berln Hedelberg. Jang, J.Y., S.C. Tsa and S.J. Lee, FSKNN: Multlabel text ategorzaton based on fuzzy smlarty and k nearest neghbors. Expert Syst. Appl., 39: DOI: /j.eswa Wang, L.X., A Course n Fuzzy Systems and Control. 1st Edn., Prente Hall R, Upper Saddle Rver, ISBN-10: , pp: 424. MATLAB, Fuzzy log toolbox desgn and smulate fuzzy log systems. The MathWorks, In, MATLAB. Mannng, C.D., P. Raghavan and H. Shütze, Introduton to Informaton Retreval. 1st Edn., Cambrdge Unversty Press. ISBN-10:

9 Beatrz Wlges et al. / Journal of Computer Senes 2016, 12 (7): DOI: /jssp L, M., L. Lu and C.B. L, An approah to expert reommendaton based on fuzzy lngust method and fuzzy text lassfaton n knowledge management systems. Expert Syst. Appl. Int. J., 38: DOI: /j.eswa Peters, C.M.E.E. and C.H.A. Koster, Unertanty and term seleton n text ategorzaton. Int. J. Un. Fuzz. Knowl. Based Syst., 11: DOI: /S Qunlan, J.R., C4.5: Programs for Mahne Learnng. 1st Edn., Morgan Kaufmann, San Mateo, ISBN-10: , pp: 302. Qunlan, J.R., Induton of deson trees. Mahne Learn., 1: DOI: /A: RapdMner, Tan, S., An effetve refnement strategy for KNN text lassfer. Expert Syst. Appl. Int. J., 30: DOI: /j.eswa Wlges, B., G. Mateus, R. Bastos and M. Dantas, A ase-omparson study of automat doument lassfaton utlzng both seral and parallel approahes. J. Phys. Conf. Seres. Yagunuma, C.A., W.C. Magalhães Jr, M.T. Santos, H.A. Camargo and M. Reformat, Combnng fuzzy ontology reasonng and mamdan fuzzy nferene system wth HyFOM reasoner. Proeedngs of the Internatonal Conferene on Enterprse Informaton Systems, Jul. 4-7, Angers, Frane, pp: DOI: / _11 Yang, Y. and J. Pedersen, A omparatve study on feature seleton n text ategorzaton. Proeedngs of 14th Internatonal Conferene on Mahne Learnng, (CML 97), Morgan Kaufmann Publshers, San Franso, US pp: Yang, Y. and X. Lu, A re-examnaton of text ategorzaton methods. Proeedngs of the 22nd Annual Internatonal ACM SIGIR Conferene on Researh and Development n Informaton Retreval, Aug , Berkeley, CA, USA, pp: DOI: /

Performance Evaluation of TreeQ and LVQ Classifiers for Music Information Retrieval

Performance Evaluation of TreeQ and LVQ Classifiers for Music Information Retrieval Performane Evaluaton of TreeQ and LVQ Classfers for Mus Informaton Retreval Matna Charam, Ram Halloush, Sofa Tsekerdou Athens Informaton Tehnology (AIT) 0.8 km Markopoulo Ave. GR - 19002 Peana, Athens,

More information

Link Graph Analysis for Adult Images Classification

Link Graph Analysis for Adult Images Classification Lnk Graph Analyss for Adult Images Classfaton Evgeny Khartonov Insttute of Physs and Tehnology, Yandex LLC 90, 6 Lev Tolstoy st., khartonov@yandex-team.ru Anton Slesarev Insttute of Physs and Tehnology,

More information

Bottom-Up Fuzzy Partitioning in Fuzzy Decision Trees

Bottom-Up Fuzzy Partitioning in Fuzzy Decision Trees Bottom-Up Fuzzy arttonng n Fuzzy eson Trees Maej Fajfer ept. of Mathemats and Computer Sene Unversty of Mssour St. Lous St. Lous, Mssour 63121 maejf@me.pl Cezary Z. Janow ept. of Mathemats and Computer

More information

Multilabel Classification with Meta-level Features

Multilabel Classification with Meta-level Features Multlabel Classfaton wth Meta-level Features Sddharth Gopal Carnege Mellon Unversty Pttsburgh PA 523 sgopal@andrew.mu.edu Ymng Yang Carnege Mellon Unversty Pttsburgh PA 523 ymng@s.mu.edu ABSTRACT Effetve

More information

Research on Neural Network Model Based on Subtraction Clustering and Its Applications

Research on Neural Network Model Based on Subtraction Clustering and Its Applications Avalable onlne at www.senedret.om Physs Proeda 5 (01 ) 164 1647 01 Internatonal Conferene on Sold State Deves and Materals Sene Researh on Neural Networ Model Based on Subtraton Clusterng and Its Applatons

More information

Connectivity in Fuzzy Soft graph and its Complement

Connectivity in Fuzzy Soft graph and its Complement IOSR Journal of Mathemats (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765X. Volume 1 Issue 5 Ver. IV (Sep. - Ot.2016), PP 95-99 www.osrjournals.org Connetvty n Fuzzy Soft graph and ts Complement Shashkala

More information

Color Texture Classification using Modified Local Binary Patterns based on Intensity and Color Information

Color Texture Classification using Modified Local Binary Patterns based on Intensity and Color Information Color Texture Classfaton usng Modfed Loal Bnary Patterns based on Intensty and Color Informaton Shvashankar S. Department of Computer Sene Karnatak Unversty, Dharwad-580003 Karnataka,Inda shvashankars@kud.a.n

More information

Matrix-Matrix Multiplication Using Systolic Array Architecture in Bluespec

Matrix-Matrix Multiplication Using Systolic Array Architecture in Bluespec Matrx-Matrx Multplaton Usng Systol Array Arhteture n Bluespe Team SegFault Chatanya Peddawad (EEB096), Aman Goel (EEB087), heera B (EEB090) Ot. 25, 205 Theoretal Bakground. Matrx-Matrx Multplaton on Hardware

More information

TAR based shape features in unconstrained handwritten digit recognition

TAR based shape features in unconstrained handwritten digit recognition TAR based shape features n unonstraned handwrtten dgt reognton P. AHAMED AND YOUSEF AL-OHALI Department of Computer Sene Kng Saud Unversty P.O.B. 578, Ryadh 543 SAUDI ARABIA shamapervez@gmal.om, yousef@s.edu.sa

More information

Cluster ( Vehicle Example. Cluster analysis ( Terminology. Vehicle Clusters. Why cluster?

Cluster (  Vehicle Example. Cluster analysis (  Terminology. Vehicle Clusters. Why cluster? Why luster? referene funton R R Although R and R both somewhat orrelated wth the referene funton, they are unorrelated wth eah other Cluster (www.m-w.om) A number of smlar ndvduals that our together as

More information

Session 4.2. Switching planning. Switching/Routing planning

Session 4.2. Switching planning. Switching/Routing planning ITU Semnar Warsaw Poland 6-0 Otober 2003 Sesson 4.2 Swthng/Routng plannng Network Plannng Strategy for evolvng Network Arhtetures Sesson 4.2- Swthng plannng Loaton problem : Optmal plaement of exhanges

More information

A MPAA-Based Iterative Clustering Algorithm Augmented by Nearest Neighbors Search for Time-Series Data Streams

A MPAA-Based Iterative Clustering Algorithm Augmented by Nearest Neighbors Search for Time-Series Data Streams A MPAA-Based Iteratve Clusterng Algorthm Augmented by Nearest Neghbors Searh for Tme-Seres Data Streams Jessa Ln 1, Mha Vlahos 1, Eamonn Keogh 1, Dmtros Gunopulos 1, Janwe Lu 2, Shouan Yu 2, and Jan Le

More information

Pattern Classification: An Improvement Using Combination of VQ and PCA Based Techniques

Pattern Classification: An Improvement Using Combination of VQ and PCA Based Techniques Ameran Journal of Appled Senes (0): 445-455, 005 ISSN 546-939 005 Sene Publatons Pattern Classfaton: An Improvement Usng Combnaton of VQ and PCA Based Tehnques Alok Sharma, Kuldp K. Palwal and Godfrey

More information

Optimal shape and location of piezoelectric materials for topology optimization of flextensional actuators

Optimal shape and location of piezoelectric materials for topology optimization of flextensional actuators Optmal shape and loaton of pezoeletr materals for topology optmzaton of flextensonal atuators ng L 1 Xueme Xn 2 Noboru Kkuh 1 Kazuhro Satou 1 1 Department of Mehanal Engneerng, Unversty of Mhgan, Ann Arbor,

More information

Computing Cloud Cover Fraction in Satellite Images using Deep Extreme Learning Machine

Computing Cloud Cover Fraction in Satellite Images using Deep Extreme Learning Machine Computng Cloud Cover Fraton n Satellte Images usng Deep Extreme Learnng Mahne L-guo WENG, We-bn KONG, Mn XIA College of Informaton and Control, Nanjng Unversty of Informaton Sene & Tehnology, Nanjng Jangsu

More information

A Fast Way to Produce Optimal Fixed-Depth Decision Trees

A Fast Way to Produce Optimal Fixed-Depth Decision Trees A Fast Way to Produe Optmal Fxed-Depth Deson Trees Alreza Farhangfar, Russell Grener and Martn Znkevh Dept of Computng Sene Unversty of Alberta Edmonton, Alberta T6G 2E8 Canada {farhang, grener, maz}@s.ualberta.a

More information

International Journal of Pharma and Bio Sciences HYBRID CLUSTERING ALGORITHM USING POSSIBILISTIC ROUGH C-MEANS ABSTRACT

International Journal of Pharma and Bio Sciences HYBRID CLUSTERING ALGORITHM USING POSSIBILISTIC ROUGH C-MEANS ABSTRACT Int J Pharm Bo S 205 Ot; 6(4): (B) 799-80 Researh Artle Botehnology Internatonal Journal of Pharma and Bo Senes ISSN 0975-6299 HYBRID CLUSTERING ALGORITHM USING POSSIBILISTIC ROUGH C-MEANS *ANURADHA J,

More information

Learning the Kernel Parameters in Kernel Minimum Distance Classifier

Learning the Kernel Parameters in Kernel Minimum Distance Classifier Learnng the Kernel Parameters n Kernel Mnmum Dstance Classfer Daoqang Zhang 1,, Songcan Chen and Zh-Hua Zhou 1* 1 Natonal Laboratory for Novel Software Technology Nanjng Unversty, Nanjng 193, Chna Department

More information

Performance Analysis of Hybrid (supervised and unsupervised) method for multiclass data set

Performance Analysis of Hybrid (supervised and unsupervised) method for multiclass data set IOSR Journal of Computer Engneerng (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 16, Issue 4, Ver. III (Jul Aug. 2014), PP 93-99 www.osrjournals.org Performane Analyss of Hybrd (supervsed and

More information

Interval uncertain optimization of structures using Chebyshev meta-models

Interval uncertain optimization of structures using Chebyshev meta-models 0 th World Congress on Strutural and Multdsplnary Optmzaton May 9-24, 203, Orlando, Florda, USA Interval unertan optmzaton of strutures usng Chebyshev meta-models Jngla Wu, Zhen Luo, Nong Zhang (Tmes New

More information

Boosting Weighted Linear Discriminant Analysis

Boosting Weighted Linear Discriminant Analysis . Okada et al. / Internatonal Journal of Advaned Statsts and I&C for Eonoms and Lfe Senes Boostng Weghted Lnear Dsrmnant Analyss azunor Okada, Arturo Flores 2, Marus George Lnguraru 3 Computer Sene Department,

More information

Progressive scan conversion based on edge-dependent interpolation using fuzzy logic

Progressive scan conversion based on edge-dependent interpolation using fuzzy logic Progressve san onverson based on edge-dependent nterpolaton usng fuzzy log P. Brox brox@mse.nm.es I. Baturone lum@mse.nm.es Insttuto de Mroeletróna de Sevlla, Centro Naonal de Mroeletróna Avda. Rena Meredes

More information

Gabor-Filtering-Based Completed Local Binary Patterns for Land-Use Scene Classification

Gabor-Filtering-Based Completed Local Binary Patterns for Land-Use Scene Classification Gabor-Flterng-Based Completed Loal Bnary Patterns for Land-Use Sene Classfaton Chen Chen 1, Lbng Zhou 2,*, Janzhong Guo 1,2, We L 3, Hongjun Su 4, Fangda Guo 5 1 Department of Eletral Engneerng, Unversty

More information

Pixel-Based Texture Classification of Tissues in Computed Tomography

Pixel-Based Texture Classification of Tissues in Computed Tomography Pxel-Based Texture Classfaton of Tssues n Computed Tomography Ruhaneewan Susomboon, Danela Stan Rau, Jaob Furst Intellgent ultmeda Proessng Laboratory Shool of Computer Sene, Teleommunatons, and Informaton

More information

The Simulation of Electromagnetic Suspension System Based on the Finite Element Analysis

The Simulation of Electromagnetic Suspension System Based on the Finite Element Analysis 308 JOURNAL OF COMPUTERS, VOL. 8, NO., FEBRUARY 03 The Smulaton of Suspenson System Based on the Fnte Element Analyss Zhengfeng Mng Shool of Eletron & Mahanal Engneerng, Xdan Unversty, X an, Chna Emal:

More information

LOCAL BINARY PATTERNS AND ITS VARIANTS FOR FACE RECOGNITION

LOCAL BINARY PATTERNS AND ITS VARIANTS FOR FACE RECOGNITION IEEE-Internatonal Conferene on Reent Trends n Informaton Tehnology, ICRTIT 211 MIT, Anna Unversty, Chenna. June 3-5, 211 LOCAL BINARY PATTERNS AND ITS VARIANTS FOR FACE RECOGNITION K.Meena #1, Dr.A.Suruland

More information

Microprocessors and Microsystems

Microprocessors and Microsystems Mroproessors and Mrosystems 36 (2012) 96 109 Contents lsts avalable at SeneDret Mroproessors and Mrosystems journal homepage: www.elsever.om/loate/mpro Hardware aelerator arhteture for smultaneous short-read

More information

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval

Fuzzy C-Means Initialized by Fixed Threshold Clustering for Improving Image Retrieval Fuzzy -Means Intalzed by Fxed Threshold lusterng for Improvng Image Retreval NAWARA HANSIRI, SIRIPORN SUPRATID,HOM KIMPAN 3 Faculty of Informaton Technology Rangst Unversty Muang-Ake, Paholyotn Road, Patumtan,

More information

Adaptive Class Preserving Representation for Image Classification

Adaptive Class Preserving Representation for Image Classification Adaptve Class Preservng Representaton for Image Classfaton Jan-Xun M,, Qankun Fu,, Wesheng L, Chongqng Key Laboratory of Computatonal Intellgene, Chongqng Unversty of Posts and eleommunatons, Chongqng,

More information

Steganalysis of DCT-Embedding Based Adaptive Steganography and YASS

Steganalysis of DCT-Embedding Based Adaptive Steganography and YASS Steganalyss of DCT-Embeddng Based Adaptve Steganography and YASS Qngzhong Lu Department of Computer Sene Sam Houston State Unversty Huntsvlle, TX 77341, U.S.A. lu@shsu.edu ABSTRACT Reently well-desgned

More information

Avatar Face Recognition using Wavelet Transform and Hierarchical Multi-scale LBP

Avatar Face Recognition using Wavelet Transform and Hierarchical Multi-scale LBP 2011 10th Internatonal Conferene on Mahne Learnng and Applatons Avatar Fae Reognton usng Wavelet Transform and Herarhal Mult-sale LBP Abdallah A. Mohamed, Darryl D Souza, Naouel Bal and Roman V. Yampolsky

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Multi-Collaborative Filtering Algorithm for Accurate Push of Command Information System

Multi-Collaborative Filtering Algorithm for Accurate Push of Command Information System Mult-Collaboratve Flterng Algorthm for Aurate Push of Command Informaton System Cu Xao-long,, Du Bo, Su Guo-Png, Yu Yan Urumq Command College of CPAPF, Urumq 83000, Chna XnJang Tehnal Insttute of Physs

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

FUZZY SEGMENTATION IN IMAGE PROCESSING

FUZZY SEGMENTATION IN IMAGE PROCESSING FUZZY SEGMENTATION IN IMAGE PROESSING uevas J. Er,, Zaldívar N. Danel,, Roas Raúl Free Unverstät Berln, Insttut für Inforat Tausstr. 9, D-495 Berln, Gerany. Tel. 0049-030-8385485, Fax. 0049-030-8387509

More information

On the End-to-end Call Acceptance and the Possibility of Deterministic QoS Guarantees in Ad hoc Wireless Networks

On the End-to-end Call Acceptance and the Possibility of Deterministic QoS Guarantees in Ad hoc Wireless Networks On the End-to-end Call Aeptane and the Possblty of Determnst QoS Guarantees n Ad ho Wreless Networks S. Srram T. heemarjuna Reddy Dept. of Computer Sene Dept. of Computer Sene and Engneerng Unversty of

More information

Evaluation of Segmentation in Magnetic Resonance Images Using k-means and Fuzzy c-means Clustering Algorithms

Evaluation of Segmentation in Magnetic Resonance Images Using k-means and Fuzzy c-means Clustering Algorithms ELEKTROTEHIŠKI VESTIK 79(3): 129-134, 2011 EGLISH EDITIO Evaluaton of Segmentaton n Magnet Resonane Images Usng k-means and Fuzzy -Means Clusterng Algorthms Tomaž Fnkšt Unverza v Lublan, Fakulteta za stronštvo,

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data

Type-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES

More information

Time Synchronization in WSN: A survey Vikram Singh, Satyendra Sharma, Dr. T. P. Sharma NIT Hamirpur, India

Time Synchronization in WSN: A survey Vikram Singh, Satyendra Sharma, Dr. T. P. Sharma NIT Hamirpur, India Internatonal Journal of Enhaned Researh n Sene Tehnology & Engneerng, ISSN: 2319-7463 Vol. 2 Issue 5, May-2013, pp: (61-67), Avalable onlne at: www.erpublatons.om Tme Synhronzaton n WSN: A survey Vkram

More information

OSSM Ordered Sequence Set Mining for Maximal Length Frequent Sequences A Hybrid Bottom-Up-Down Approach

OSSM Ordered Sequence Set Mining for Maximal Length Frequent Sequences A Hybrid Bottom-Up-Down Approach Global Journal of Computer Sene and Tehnology Volume 12 Issue 7 Verson 1.0 Aprl 2012 Type: Double Blnd Peer Revewed Internatonal Researh Journal Publsher: Global Journals In. (USA) Onlne ISSN: 0975-4172

More information

Bit-level Arithmetic Optimization for Carry-Save Additions

Bit-level Arithmetic Optimization for Carry-Save Additions Bt-leel Arthmet Optmzaton for Carry-Sae s Ke-Yong Khoo, Zhan Yu and Alan N. Wllson, Jr. Integrated Cruts and Systems Laboratory Unersty of Calforna, Los Angeles, CA 995 khoo, zhanyu, wllson @sl.ula.edu

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

Language Understanding in the Wild: Combining Crowdsourcing and Machine Learning

Language Understanding in the Wild: Combining Crowdsourcing and Machine Learning Language Understandng n the Wld: Combnng Crowdsourng and Mahne Learnng Edwn Smpson Unversty of Oxford, UK edwn@robots.ox.a.uk Pushmeet Kohl Mrosoft Researh, Cambrdge, UK pkohl@mrosoft.om Matteo Venanz

More information

An Adaptive Filter Based on Wavelet Packet Decomposition in Motor Imagery Classification

An Adaptive Filter Based on Wavelet Packet Decomposition in Motor Imagery Classification An Adaptve Flter Based on Wavelet Paket Deomposton n Motor Imagery Classfaton J. Payat, R. Mt, T. Chusak, and N. Sugno Abstrat Bran-Computer Interfae (BCI) s a system that translates bran waves nto eletral

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

Topic 5: semantic analysis. 5.5 Types of Semantic Actions

Topic 5: semantic analysis. 5.5 Types of Semantic Actions Top 5: semant analyss 5.5 Types of Semant tons Semant analyss Other Semant tons Other Types of Semant tons revously, all semant atons were for alulatng attrbute values. In a real ompler, other types of

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc.

BioTechnology. An Indian Journal FULL PAPER. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 0974-74 Volume 0 Issue BoTechnology 04 An Indan Journal FULL PAPER BTAIJ 0() 04 [684-689] Revew on Chna s sports ndustry fnancng market based on market -orented

More information

Scalable Parametric Runtime Monitoring

Scalable Parametric Runtime Monitoring Salable Parametr Runtme Montorng Dongyun Jn Patrk O Nel Meredth Grgore Roşu Department of Computer Sene Unversty of Illnos at Urbana Champagn Urbana, IL, U.S.A. {djn3, pmeredt, grosu}@s.llnos.edu Abstrat

More information

Measurement and Calibration of High Accuracy Spherical Joints

Measurement and Calibration of High Accuracy Spherical Joints 1. Introduton easurement and Calbraton of Hgh Auray Spheral Jonts Ale Robertson, Adam Rzepnewsk, Alexander Sloum assahusetts Insttute of Tehnolog Cambrdge, A Hgh auray robot manpulators are requred for

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

arxiv: v3 [cs.cv] 31 Oct 2016

arxiv: v3 [cs.cv] 31 Oct 2016 Unversal Correspondene Network Chrstopher B. Choy Stanford Unversty hrshoy@a.stanford.edu JunYoung Gwak Stanford Unversty jgwak@a.stanford.edu Slvo Savarese Stanford Unversty sslvo@stanford.edu arxv:1606.03558v3

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

A Study on the uncertainty and sensitivity in numerical simulation of parametric roll

A Study on the uncertainty and sensitivity in numerical simulation of parametric roll Downloaded from orbt.dtu.dk on: Nov, 8 A Study on the unertanty and senstvty n numeral smulaton of parametr roll Cho, Ju Hyuk; Nelsen, Ulrk Dam; Jensen, Jørgen Junher Publshed n: Proeedngs of the th Internatonal

More information

A Semi-parametric Approach for Analyzing Longitudinal Measurements with Non-ignorable Missingness Using Regression Spline

A Semi-parametric Approach for Analyzing Longitudinal Measurements with Non-ignorable Missingness Using Regression Spline Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 93-9466 Vol., Issue (June 5), pp. 95 - Applatons and Appled Mathemats: An Internatonal Journal (AAM) A Sem-parametr Approah for Analyzng Longtudnal

More information

Query Clustering Using a Hybrid Query Similarity Measure

Query Clustering Using a Hybrid Query Similarity Measure Query clusterng usng a hybrd query smlarty measure Fu. L., Goh, D.H., & Foo, S. (2004). WSEAS Transacton on Computers, 3(3), 700-705. Query Clusterng Usng a Hybrd Query Smlarty Measure Ln Fu, Don Hoe-Lan

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

FULLY AUTOMATIC IMAGE-BASED REGISTRATION OF UNORGANIZED TLS DATA

FULLY AUTOMATIC IMAGE-BASED REGISTRATION OF UNORGANIZED TLS DATA FULLY AUTOMATIC IMAGE-BASED REGISTRATION OF UNORGANIZED TLS DATA Martn Wenmann, Bors Jutz Insttute of Photogrammetry and Remote Sensng, Karlsruhe Insttute of Tehnology (KIT) Kaserstr. 12, 76128 Karlsruhe,

More information

Semi-analytic Evaluation of Quality of Service Parameters in Multihop Networks

Semi-analytic Evaluation of Quality of Service Parameters in Multihop Networks U J.T. (4): -4 (pr. 8) Sem-analyt Evaluaton of Qualty of Serve arameters n Multhop etworks Dobr tanassov Batovsk Faulty of Sene and Tehnology, ssumpton Unversty, Bangkok, Thaland bstrat

More information

Clustering incomplete data using kernel-based fuzzy c-means algorithm

Clustering incomplete data using kernel-based fuzzy c-means algorithm Clusterng noplete data usng ernel-based fuzzy -eans algorth Dao-Qang Zhang *, Song-Can Chen Departent of Coputer Sene and Engneerng, Nanjng Unversty of Aeronauts and Astronauts, Nanjng, 210016, People

More information

A Symbolic Representation of Time Series, with Implications for Streaming Algorithms

A Symbolic Representation of Time Series, with Implications for Streaming Algorithms A Symbol Representaton of Tme Seres, th Implatons for Streamng Algorthms Jessa Ln Eamonn Keogh Stefano Lonard Bll Chu Unversty of Calforna - Rversde Computer Sene & Engneerng Department Rversde, CA 9252,

More information

A Novel Dynamic and Scalable Caching Algorithm of Proxy Server for Multimedia Objects

A Novel Dynamic and Scalable Caching Algorithm of Proxy Server for Multimedia Objects Journal of VLSI Sgnal Proessng 2007 * 2007 Sprnger Sene + Busness Meda, LLC. Manufatured n The Unted States. DOI: 10.1007/s11265-006-0024-7 A Novel Dynam and Salable Cahng Algorthm of Proxy Server for

More information

Machine Learning 9. week

Machine Learning 9. week Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

MULTIPLE OBJECT DETECTION AND TRACKING IN SONAR MOVIES USING AN IMPROVED TEMPORAL DIFFERENCING APPROACH AND TEXTURE ANALYSIS

MULTIPLE OBJECT DETECTION AND TRACKING IN SONAR MOVIES USING AN IMPROVED TEMPORAL DIFFERENCING APPROACH AND TEXTURE ANALYSIS U.P.B. S. Bull., Seres A, Vol. 74, Iss. 2, 2012 ISSN 1223-7027 MULTIPLE OBJECT DETECTION AND TRACKING IN SONAR MOVIES USING AN IMPROVED TEMPORAL DIFFERENCING APPROACH AND TEXTURE ANALYSIS Tudor BARBU 1

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

Clustering Data. Clustering Methods. The clustering problem: Given a set of objects, find groups of similar objects

Clustering Data. Clustering Methods. The clustering problem: Given a set of objects, find groups of similar objects Clusterng Data The lusterng problem: Gven a set of obets, fnd groups of smlar obets Cluster: a olleton of data obets Smlar to one another wthn the same luster Dssmlar to the obets n other lusters What

More information

Integrating Fuzzy c-means Clustering with PostgreSQL *

Integrating Fuzzy c-means Clustering with PostgreSQL * SQL alle pgfcm. Seton 0 brefly susses relate work. Seton 0 ontans onluson remarks an retons for future work. Integratng Fuzzy -Means Clusterng wth PostgreSQL * R. M. Mnakhmetov taven@gmal.om South Ural

More information

Path Following Control of a Spherical Robot Rolling on an Inclined Plane

Path Following Control of a Spherical Robot Rolling on an Inclined Plane Sensors & ransduers, Vol., Speal Issue, May 3, pp. 4-47 Sensors & ransduers 3 by IFSA http://www.sensorsportal.om Path Followng Control of a Spheral Robot Rollng on an Inlned Plane ao Yu, Hanxu Sun, Qngxuan

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

Multi-scale and Discriminative Part Detectors Based Features for Multi-label Image Classification

Multi-scale and Discriminative Part Detectors Based Features for Multi-label Image Classification Proeedngs of the wenty-seventh Internatonal Jont Conferene on Artfal Intellgene (IJCAI-8) Mult-sale and Dsrmnatve Part Detetors Based Features for Mult-lael Image Classfaton Gong Cheng, Deheng Gao, Yang

More information

A Robust Algorithm for Text Detection in Color Images

A Robust Algorithm for Text Detection in Color Images A Robust Algorthm for Tet Deteton n Color Images Yangng LIU Satosh GOTO Takesh IKENAGA Abstrat Tet deteton n olor mages has beome an atve researh area sne reent deades. In ths paper we present a novel

More information

REGISTRATION OF TERRESTRIAL LASER SCANNER DATA USING IMAGERY INTRODUCTION

REGISTRATION OF TERRESTRIAL LASER SCANNER DATA USING IMAGERY INTRODUCTION EGISTATION OF TEESTIAL LASE SCANNE DATA USING IMAGEY Khall Al-Manasr, Ph.D student Clve S. Fraser, Professor Department of Geomats Unversty of Melbourne Vtora 3010 Australa k.al-manasr@pgrad.unmelb.edu.au.fraser@unmelb.edu.au

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

Research on Categorization of Animation Effect Based on Data Mining

Research on Categorization of Animation Effect Based on Data Mining MATEC Web of Conferences 22, 0102 0 ( 2015) DOI: 10.1051/ matecconf/ 2015220102 0 C Owned by the authors, publshed by EDP Scences, 2015 Research on Categorzaton of Anmaton Effect Based on Data Mnng Na

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Professional competences training path for an e-commerce major, based on the ISM method

Professional competences training path for an e-commerce major, based on the ISM method World Transactons on Engneerng and Technology Educaton Vol.14, No.4, 2016 2016 WIETE Professonal competences tranng path for an e-commerce maor, based on the ISM method Ru Wang, Pn Peng, L-gang Lu & Lng

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Minimize Congestion for Random-Walks in Networks via Local Adaptive Congestion Control

Minimize Congestion for Random-Walks in Networks via Local Adaptive Congestion Control Journal of Communatons Vol. 11, No. 6, June 2016 Mnmze Congeston for Random-Walks n Networks va Loal Adaptve Congeston Control Yang Lu, Y Shen, and Le Dng College of Informaton Sene and Tehnology, Nanjng

More information

A Real-Time Detecting Algorithm for Tracking Community Structure of Dynamic Networks

A Real-Time Detecting Algorithm for Tracking Community Structure of Dynamic Networks A Real-Tme Detetng Algorthm for Trakng Communty Struture of Dynam Networks Jaxng Shang*, Lanhen Lu*, Feng Xe, Zhen Chen, Jaa Mao, Xueln Fang, Cheng Wu* Department of Automaton, Tsnghua Unversty, Beng,,

More information

RV-Monitor: Efficient Parametric Runtime Verification with Simultaneous Properties

RV-Monitor: Efficient Parametric Runtime Verification with Simultaneous Properties RV-Montor: Effent Parametr Runtme Verfaton wth Smultaneous Propertes Qngzhou Luo 1, Y Zhang 1, Choonghwan Lee 1, Dongyun Jn 2, Patrk O Nel Meredth 1, Traan Florn Şerbănuţă 3, and Grgore Roşu 1 1 Unversty

More information

DETECTING AND ANALYZING CORROSION SPOTS ON THE HULL OF LARGE MARINE VESSELS USING COLORED 3D LIDAR POINT CLOUDS

DETECTING AND ANALYZING CORROSION SPOTS ON THE HULL OF LARGE MARINE VESSELS USING COLORED 3D LIDAR POINT CLOUDS ISPRS Annals of the Photogrammetry, Remote Sensng and Spatal Informaton Senes, Volume III-3, 2016 XXIII ISPRS Congress, 12 19 July 2016, Prague, Czeh Republ DETECTING AND ANALYZING CORROSION SPOTS ON THE

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

A Flexible Solution for Modeling and Tracking Generic Dynamic 3D Environments*

A Flexible Solution for Modeling and Tracking Generic Dynamic 3D Environments* A Flexble Soluton for Modelng and Trang Gener Dynam 3D Envronments* Radu Danesu, Member, IEEE, and Sergu Nedevsh, Member, IEEE Abstrat The traff envronment s a dynam and omplex 3D sene, whh needs aurate

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

A GENETIC APPROACH FOR THE AUTOMATIC ADAPTATION OF SEGMENTATION PARAMETERS

A GENETIC APPROACH FOR THE AUTOMATIC ADAPTATION OF SEGMENTATION PARAMETERS A GENETIC APPROACH FOR THE AUTOMATIC ADAPTATION OF SEGMENTATION PARAMETERS R. Q. Fetosa a, *, G. A. O. P. Costa a, T. B. Cazes a, B. Fejo b a Dept. of Eletral Engneerng, b Dept of Informats, Cathol Unversty

More information