A Comparative Study for Outlier Detection Techniques in Data Mining

Size: px
Start display at page:

Download "A Comparative Study for Outlier Detection Techniques in Data Mining"

Transcription

1 A Comparatve Study for Outler Detecton Technques n Data Mnng Zurana Abu Bakar, Rosmayat Mohemad, Akbar Ahmad Department of Computer Scence Faculty of Scence and Technology Unversty College of Scence and Technology 230 Kuala Terengganu, Malaysa {zurana, rosmayat}@kustem.edu.my, Mustafa Mat Ders, Member IEEE Faculty of Informaton Technology and Multmeda College Unversty Technology Tun Hussen Onn Part Raja, Batu Pahat Johor, Malaysa mmustafa@kuttho.edu.my Abstract Exstng studes n data mnng mostly focus on fndng patterns n large datasets and further usng t for organzatonal decson makng. However, fndng such exceptons and outlers has not yet receved as much attenton n the data mnng feld as some other topcs have, such as assocaton rules, classfcaton and clusterng. Thus, ths paper descrbes the performance of control chart, lnear regresson, and Manhattan dstance technques for outler detecton n data mnng. Expermental studes show that outler detecton technque usng control chart s better than the technque modeled from lnear regresson because the number of outler data detected by control chart s smaller than lnear regresson. Further, expermental studes shows that Manhattan dstance technque outperformed compared wth the other technques when the threshold values ncreased. Keywords data mnng, clusterng, outler I. INTRODUCTION Data mnng s a process of extractng vald, prevously unknown, and ultmately comprehensble nformaton from large datasets and usng t for organzatonal decson makng [1]. However, there a lot of problems exst n mnng data n large datasets such as data redundancy, the value of attrbutes s not specfc, data s not complete and outler [2]. An outler s defned as data pont whch s very dfferent from the rest of the data based on some measure. Such a pont often contans useful nformaton on abnormal behavor of the system descrbed by data [3]. On the other hand, many data mnng algorthms n the lterature fnd outlers as a sdeproduct of clusterng algorthms. From the vewpont of a clusterng algorthm, outlers are objects not located n clusters of dataset, usually called nose [2]. Outler detecton problem s one of the very nterestng problems arsng recently n the data mnng research. Recently, a few studes have been conducted on outler detecton for large datasets [3]. Many data mnng algorthms try to mnmze the nfluence of outlers or elmnate them all together. However, ths could result n the loss of mportant hdden nformaton snce one person s nose could be another person s sgnal [4]. In other words, the outlers themselves may be of partcular nterest, such as n the case of fraud detecton, where outlers may ndcate fraudulent actvty [5]. Outler detecton or outler mnng s the process of dentfyng outlers n a set of data. The outler detecton technque fnds applcatons n credt card fraud, network robustness analyss, network ntruson detecton, fnancal applcatons and marketng [3]. Thus, outler detecton and analyss s an nterestng and mportant data mnng task. Ths paper dscussed about control chart, lnear regresson and Manhattan dstance technques for outler data detecton from data mnng perspectve. The man nherent dea s to compare those technques to determne whch technque s better based on the number of outler data detected and threshold values. There are many types of data n outler detecton analyss such as bnary varables, nomnal and ordnal. However, n ths outler detecton analyss, only numercal data wll be consdered. The rest of ths paper s organzed as follows. Secton 2 dscuss related work on outler data detecton technques. The framework and formulas (equatons) for control chart, lnear regresson, and Manhattan dstance technques are presented n Secton 3 and extensve performance evaluaton s reported n secton 4. Secton 5 concludes wth a summary of those outler data detecton technques. II. RELATED WORK Recently, a few studes have been conducted on outler data detecton for large datasets. Dstrbuton based methods was prevously conducted by the statstcs communty. In these technques, the data ponts are modeled usng a stochastc dstrbuton, and ponts are determned to be outlers dependng upon ther relatonshp wth ths model. However, wth ncreasng dmensonalty, t becomes ncreasngly dffcult and naccurate to estmate the multdmensonal dstrbutons of the data ponts [6]. Dstance based method was orgnally proposed by Knorr and Ang [4]. Further, Ramaswamy et al. [6], had extended dstance-based outler detecton algorthm: the top n ponts wth the maxmum Dk are consdered outlers, where Dk(p) denotes the dstance of the k-th nearest neghbor of p. They used a /06/$ IEEE CIS 2006

2 cluster algorthm to partton a dataset nto several groups. Prunng and batch processng on these groups could mprove effcency for outler detecton [7]. On the other hand, Devaton-based outler detecton does not use statstcal tests or dstance-based measures to dentfy exceptonal objects. Instead, t dentfes outlers by examnng the man charaterstcs of objects n a group. Objects that devate from ths descrpton are consdered outlers. Hence, n ths approach the term devatons s typcally used to refer to outlers [5]. Whlst, densty based was proposed by Breung et al. [2]. It reles on the local outler factor (LOF) of each pont, whch depends on the local densty of ts neghborhood. Clusterngbased outler detecton technques regarded small clusters as outlers [8] or dentfed outlers by removng clusters from the orgnal dataset [1]. Meanwhle Dangtong Yu et al. [1] proposed a new method whch apply sgnal-processng technques to solve mportant problems n data mnng. They ntroduced a novel devaton (or outler) detecton approach, termed FndOut, based on wavelet transform. The man dea n FndOut s to remove the clusters from the orgnal data and then dentfy the outlers. Although prevous research showed that such technques may not be effectve because of the nature of the clusterng, FndOut can successfully dentfy outlers from large datasets. Expermental results showed that the proposed approach s effcent and effectve on very large datasets [1]. In addton, Aggarwal and Yu [3] ntroduced a new technque for outler detecton whch s especally suted to very hgh dmensonal data sets. The method works by fndng lower dmensonal projectons whch are locally sparse, and cannot be dscovered easly by brute force technques because of the number of combnatons of possbltes. Ths technque for outler detecton has advantages over smple dstance based outlers whch cannot overcome the effects of the dmensonalty curse. They llustrated how to mplement the technque effectvely for hgh dmensonal applcatons by usng an evolutonary search technque. Ths mplementaton works almost as well as a brute-force mplementaton over the search space n terms of fndng projectons wth very negatve sparsty coeffcents, but at a much lower cost. The technques dscussed n ths paper extend the applcablty of outler detecton technques to hgh dmensonal problems; such cases are most valuable from the perspectve of data mnng applcatons [3]. Whle as Wllams et. al [9], proposed replcator neural networks (RNNs)for outler detecton. They compared RNN for outler detecton wth three other methods usng both publcly avalable statstcal datasets (generally small) and data mnng datasets (generally much larger and generally real data). The RNN method performed satsfactorly for both small and large datasets. It was of nterest that t performed well on the small datasets snce neural network methods often have dffculty wth such smaller datasets. Its performance appears to degrade wth datasets contanng radal outlers and so t s not recommended for ths type of dataset. RNN performed the best overall on the KDD ntruson dataset [9] Thus, from the several studes dscussed above, we found that research n outler detecton can lead to the dscovery of truly unexpected knowledge n areas such as electronc commerce exceptons, bankruptcy and credt card fraud. Such knowledge can lead to new drectons for future nvestment, marketng, and other purposes. III. RESEARCH METHODOLOGY Outler detecton approach can be categorzed nto three approaches whch there are the statstcal approach, the dstance-based approach and the devaton-based approach. In ths outler analyss, we examne statstcal approach because ths approach s approprate for one-dmensonal samples. Therefore, ths approach s applcable snce ths analyss s based on one-dmensonal data. Ths analyss appled control chart and lnear regresson technques for statstcal approach. Besdes that, we also examne dstance-based approach n order to counter the man lmtatons mposed by statstcal approach [9]. The Manhattan dstance technque was appled for dstance-based approach. A. Statstcal Approach The statstcal approach to outler detecton assumes a dstrbuton or probablty model for the gven data set and then dentfes outlers wth respect to the model usng a dscordancy test [5]. In partcular, an analyss for statstcal approach s based on the fve phases: 1) Data collecton: Ths analyss s based on our observaton of the ar polluton data taken n Kuala Lumpur on the August A set of ar polluton data tems conssts of fve major aspects that can cause the ar polluton,.e. {Carbon Monoxde (CO), Ozone (O 3 ), Partculate Matter (PM ), Ntrogen Doxde (NO 2 )and Sulfur Doxde (SO 2) }. The value of each tem s wth the unt of part per mllon (ppm) except PM s wth the unt of mcro-grams (µgm). The data were taken for every one-hour every day. We present the actual data as the average amount of each data tem per day. 2) Compute average value/compute Lnear Regresson equaton: At ths phase, average value was computed n order to gan the centre lne for the control chart technque. Otherwse, lnear regresson equaton also calculated to determne lnear regresson lne. 3) Compute upper and lower control lmts/compute upper and lower bound value: Upper control lmt (UCL) and lower control lmt (LCL) for control graph technque are based on the partcular formula (refer equaton (2) to (5) at B secton). Whle as, upper and lower bound for lnear regresson technque s based on 95 percent from lnear regresson equaton (lne). 4) Data Testng: At ths phase, actual data, centre lne, UCL and LCL are plotted on the control graph whle as actual data, lnear regresson lne, upper and lower bound are plotted lnear regresson graph. Outler data could be dentfed from those graphs. Data that are plotted out from upper and lower control lmts/bound are detected as outler data. 5) Analyss and comparson the output: The output from data testng wll be used n order to compare and analyss those technques. The purpose of these actvtes s to get the

3 best technque n detectng outler data based on statstcal approach. B. Control Chart Technque (CCT) In ths secton, we study control chart technque for outler data detecton. Usually, CCT s used to determne whether your process s operatng n statstcal control. The purpose of a control chart s to detect any unwanted changes n the process. These changes wll be sgnaled by abnormal (outler) ponts on the graph []. Bascally, control chart conssts of three basc components: 1) a centre lne, usually the mathematcal average of all the samples plotted. 2) upper and lower control lmts that defne the constrants of common cause varatons. 3) performance data plotted over tme. Frstly, calculate the average for data ponts to get a centerlne of a control chart. The formula s, where, X = mean/average value X = every data value (X X n) n = total number of data Secondly, calculate the upper control (UCL) and lower control lmt (LCL) by usng formula below, In a 3-sgma system, Z s equal to 3. The reason that 3-sgma control lmts balance the rsk of error s that, for normally dstrbuted data, data ponts wll fall nsde 3-sgma lmts 99.7% of the tme when a process s n control. Ths makes the wtch hunts nfrequent but stll makes t lkely that unusual causes of varaton wll be detected []. Fnally, data are plotted on the chart and data that are out from UCL and LCL and are detected as outler data. Fgure 1 shows an example of control chart that has one data outsde UCL. Ths data s known as outler data. (2) (3) (4) (5) (1) Fgure 1. An example of control chart C. Lnear Regresson Technque (LRT) There have been many statstcal concepts that are bass for data mnng technques such as pont estmaton, Bayes theorem and regresson. Nevertheless, for ths outler detecton analyss, LRT s beng used because t s approprate to evaluate the strength of a relatonshp between two varables. In general, regresson s the problem of estmatng a condtonal expected value. Whle as lnear refers to the assumpton of a lnear relatonshp between y (response varable) and x (predctor varable). Thus, n statstcs, lnear regresson s a method of estmatng that lnear relatonshp between the nput data and the output data [11]. The common formula for a lnear relatonshp used n ths model s [5], Y = α + βx where, the varance of Y s assumed to be constant, and α and β are regresson coeffcents specfyng the Y-ntercept and slope of the lne, respectvely. Gven s samples or data ponts of the form (x 1, y 1 ), (x 2, y 2 ), (x s, y s ), then α and β can be estmated usng ths method wth the followng equatons, β s ( = 1 = s x x)( y = 1 ( x x) α = y βx y) where, x s the average of x 1, x 2,, x s, and y s the average of y 1, y 2,, y s. The coeffcents α and β often provde good approxmatons to otherwse complcated regresson equatons. D. Dstance-based Approach One of the statstcal approach drawbacks s t requres knowledge about parameters of the data set, such as the data dstrbuton. However, n many cases, the data dstrbuton may not be known [5]. Therefore, a dstance-based approach was ntroduced to overcome the problem arse from statstcal approach. The crteron for outler detecton usng ths approach s based on two parameters, parameter (p) and dstance (d), whch may be gven n advance usng knowledge about the data, or whch may be changed durng the teratons 2 (6) (7) (8)

4 to select the most representatve outlers. In partcular, an analyss for dstance-based approach s based on nne phases below. 1) Data collecton: As dscussed n secton A. 2) Compute the dstances of each data (d 1 ): The dstance between data was computed to yeld dstances of each data. 3) Identfy maxmum dstance value of data (d 2 ): The maxmum dstance value was dentfed to determne a range for threshold dstance value (d 3 ). 4) Determne threshold dstance value (d 3 ): Ths value was determned based on maxmum dstance value (d 2 ). Threshold dstance value (d 3 ) should be smaller than maxmum dstance value (d 2 ). Otherwse, comparson process could not be done. 5) Compare between d 3 and d 1 (p): At ths phase, parameter value (p) could be determne by comparng between d 3 and d 1 where p equal to d 1 >= d 3. 6) Determne threshold value (t): Threshold value (t) has to be assgned to ndcates the research space. 7) Compare between t and p: At ths phase, threshold value wll compare wth the result at phase fve. 8) Data testng: At ths phase outler data could be dentfed. 9) Analyss and comparson the output: The output from data testng wll be used n order to compare and analyss ths technques. E. Manhattan Dstance Technque (MDT) Commonly, the dstances can be based on a sngle dmenson or multple dmensons. It s up to the researcher to select the rght method for hs/her specfc applcaton. For ths outler detecton analyss MDT s used because the data are sngle dmenson. The general formula for MDT s, j k h 1 d( t, t ) = = ( t t h jh ) (9) where: t = <t 1,,t k > and t j = <t j1,,t jk > are tuples n a database. IV. PERFORMANCE EVALUATION In ths secton, frstly, we compared the effcency of the lnear regresson and control chart technques (statstcal approach). The mplementaton of both algorthms s usng Matlab 6.5 and Mcrosoft Access as ts database. Through the performance evaluaton, we are gong to show that the control chart technque s better than lnear regresson due to the number of outler data detecton s smaller than lnear regresson technque. As menton n 3.3, ths outler analyss s based on ar polluton data. The example of ar polluton data s shows n Table I: TABLE I. Date CO O 3 AIR POLLUTION DATA PM NO 2 SO 2 1/8/ /8/ /8/ Based on both technques, outler data was determned f the data was out of the control lmts or boundares. In control chart technque, UCL and LCL were determned based on the formulas (equatons) dscussed n secton A. Whle as, upper and lower boundares n lnear regresson technques are based on 95 percent computaton from lner regresson equaton that has been dentfed. Number of Outler Data TABLE II. THE RESULT FOR CCT AND LRT Data Outler data for CCT Outler data for LRT CO O PM SO NO Result Testng: Comparson between Two Outler DetectonTecnques CO PM NO2 Ar Polluton Component Control Chart Technque Lnear Regresson Technque Fgure 2. Graph for outler data detecton usng CCT and LRT As llustrated n Table II and Fgure 2, outler data that have been detected by control chart were lower than lnear regresson technque. Ths mples that, the lower the number of outler data detected, the better the technque s. Ths s due to data plotted on control chart technque are more converged on the data average lne. Thus, there are more useful data that could be used for analyss and further could acqure an accurate result. Secondly, we analyss the MDT (dstance-based approach). The mplementaton of ths algorthm also usng Matlab 6.5 and Mcrosoft Access as ts database. In Manhattan dstance technque, the threshold values (tv) have to be assgned. Besdes that, outler data also depends on the threshold dstance values (d 3 ). The d 3 have to be smaller than maxmum dstance values (d 2 ) that exst between each of the data. Ths s to ensure that d 3 dd not out of range and the comparson process could be done. We can get the parameter value (p) by comparng d 3 and the dstances of each data (d 1 ). Further, we compare t wth

5 p to gan outler data. From equaton (9), we obtaned d 2, d 3, tv and the number of outler as n Table 3. Data TABLE III. Max. dstance value (d 2) THE RESULT FOR MDT Threshold dstance value (d 3) CO O PM SO NO Number of Outler Threshold value (tv) Number of Outler Result Testng: Manhattan Dsance Technque Threshold CO O3 PM SO2 NO2 Fgure 3. Graph for outler data detecton usng MDT Table III and Fgure 3 show that when the threshold values ncreases, the number of outler data detected decreased. Ths mples that, numbers of outlers are nversed wth threshold value. Ths s due to the space of the useful data n the cluster becomes bgger. TABLE IV. Data THE COMPARISON RESULT FOR THREE TECHNIQUES Outler data for MDT Outler data for CCT Outler data for LRT CO O PM SO NO Number of Outler Data Result Testng: Comparson between Three Technques CO O3 PM SO2 NO2 Ar polluton Component Manhattan Dstance Control Chart Lnear Regresson Fgure 4. Graph for outler data detecton usng three technques As llustrated n Table IV and Fgure 4, outler data that have been detected by Manhattan dstance were lower than control chart and lnear regresson technques. Ths s due the lower number of outler data detected, the better the technque. Ths mples that, dstance-based approach s more practcal and relable than statstcal approach n outler data detecton. V. CONCLUSON Ths paper presented the result of an expermental study of some common outler detecton technques. Frstly, we compare the two outler detecton technques n statstcal approach, lnear regresson and control chart technques. The expermental results ndcate that the control chart technque s better than that lner regresson technque for outler data detecton. Next, we analyze Manhattan dstance technque based on dstance-based approach. The expermental studes shows that Manhattan dstance technque outperformed the other technques (dstance-based and statstcal-based approaches) when the threshold values ncreased. REFERENCES [1] Yu, D., Shekholeslam, G. and Zang, A fnd out: fndng outlers n very large datasets, In Knowledge and Informaton Systems, 2002, pp [2] Breung, M.M., Kregel, H.P., and Ng, R.T., LOF: Identfyng denstybased local outlers., ACM Conference Proceedngs, 2000, pp [3] Aggarwal, C. C., Yu, S. P., An effectve and effcent algorthm for hgh-dmensonal outler detecton, The VLDB Journal, 2005, vol. 14, pp [4] Knorr, E.M., Ng, R. T., Tucakov, V., Dstance-based outlers: algorthms and applcatons, The VLDB Journal, 2000, vol. 8, pp [5] Han, J. and Kamber, M., Data Mnng Concepts and Technques, USA: Morgan Kaufmann, [6] S. Ramaswamy, R. Rastog, and S. Kyuseok, Effcent algorthms for mnng outlers from large data sets. In Proc. of the ACM SIGMOD Internatonal Conference on Management of Data, 2000, pp [7] Aggarwal, C. C., Yu, S. P., Outler detecton for hgh dmensonal data, SIGMOD 01, 2001, pp [8] M.F. Jang, S.s. Tseng, C. M. Su., Two-phase clusterng process for outler detecton. pattern recognton letters, 2001, vol. 22(6-7), pp

6 [9] G.Wllams, R. Baxter, H. He, S. Hawkns, L. Gu, A comparatve study of RNN for outler detecton n data mnng. Proceedngs of the 2nd IEEE Internatonal Conference on Data Mnng (ICDM02) Maebash Cty, Japan, 2002, pp [] SkyMark: Control Chart, at (accessed: 13 December 2005) [11] Wkpeda: Lnear Regresson, at (accessed: 13 December 2005)

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Outlier Detection Methodologies Overview

Outlier Detection Methodologies Overview Outler Detecton Methodologes Overvew Mohd. Noor Md. Sap Department of Computer and Informaton Systems Faculty of Computer Scence and Informaton Systems Unverst Teknolog Malaysa 81310 Skuda, Johor Bahru,

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task

Term Weighting Classification System Using the Chi-square Statistic for the Classification Subtask at NTCIR-6 Patent Retrieval Task Proceedngs of NTCIR-6 Workshop Meetng, May 15-18, 2007, Tokyo, Japan Term Weghtng Classfcaton System Usng the Ch-square Statstc for the Classfcaton Subtask at NTCIR-6 Patent Retreval Task Kotaro Hashmoto

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers

Content Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth

More information

The Research of Support Vector Machine in Agricultural Data Classification

The Research of Support Vector Machine in Agricultural Data Classification The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou

More information

Cluster Analysis of Electrical Behavior

Cluster Analysis of Electrical Behavior Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School

More information

A New Approach For the Ranking of Fuzzy Sets With Different Heights

A New Approach For the Ranking of Fuzzy Sets With Different Heights New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data

A Fast Content-Based Multimedia Retrieval Technique Using Compressed Data A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,

More information

A Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers

A Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers 62626262621 Journal of Uncertan Systems Vol.5, No.1, pp.62-71, 211 Onlne at: www.us.org.u A Smple and Effcent Goal Programmng Model for Computng of Fuzzy Lnear Regresson Parameters wth Consderng Outlers

More information

Concurrent Apriori Data Mining Algorithms

Concurrent Apriori Data Mining Algorithms Concurrent Apror Data Mnng Algorthms Vassl Halatchev Department of Electrcal Engneerng and Computer Scence York Unversty, Toronto October 8, 2015 Outlne Why t s mportant Introducton to Assocaton Rule Mnng

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

A Deflected Grid-based Algorithm for Clustering Analysis

A Deflected Grid-based Algorithm for Clustering Analysis A Deflected Grd-based Algorthm for Clusterng Analyss NANCY P. LIN, CHUNG-I CHANG, HAO-EN CHUEH, HUNG-JEN CHEN, WEI-HUA HAO Department of Computer Scence and Informaton Engneerng Tamkang Unversty 5 Yng-chuan

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching

A Fast Visual Tracking Algorithm Based on Circle Pixels Matching A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

SVM-based Learning for Multiple Model Estimation

SVM-based Learning for Multiple Model Estimation SVM-based Learnng for Multple Model Estmaton Vladmr Cherkassky and Yunqan Ma Department of Electrcal and Computer Engneerng Unversty of Mnnesota Mnneapols, MN 55455 {cherkass,myq}@ece.umn.edu Abstract:

More information

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration

Improvement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,

More information

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*

More information

A Binarization Algorithm specialized on Document Images and Photos

A Binarization Algorithm specialized on Document Images and Photos A Bnarzaton Algorthm specalzed on Document mages and Photos Ergna Kavalleratou Dept. of nformaton and Communcaton Systems Engneerng Unversty of the Aegean kavalleratou@aegean.gr Abstract n ths paper, a

More information

An Optimal Algorithm for Prufer Codes *

An Optimal Algorithm for Prufer Codes * J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,

More information

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems

A Unified Framework for Semantics and Feature Based Relevance Feedback in Image Retrieval Systems A Unfed Framework for Semantcs and Feature Based Relevance Feedback n Image Retreval Systems Ye Lu *, Chunhu Hu 2, Xngquan Zhu 3*, HongJang Zhang 2, Qang Yang * School of Computng Scence Smon Fraser Unversty

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

An Image Fusion Approach Based on Segmentation Region

An Image Fusion Approach Based on Segmentation Region Rong Wang, L-Qun Gao, Shu Yang, Yu-Hua Cha, and Yan-Chun Lu An Image Fuson Approach Based On Segmentaton Regon An Image Fuson Approach Based on Segmentaton Regon Rong Wang, L-Qun Gao, Shu Yang 3, Yu-Hua

More information

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance

Tsinghua University at TAC 2009: Summarizing Multi-documents by Information Distance Tsnghua Unversty at TAC 2009: Summarzng Mult-documents by Informaton Dstance Chong Long, Mnle Huang, Xaoyan Zhu State Key Laboratory of Intellgent Technology and Systems, Tsnghua Natonal Laboratory for

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1

Outline. Discriminative classifiers for image recognition. Where in the World? A nearest neighbor recognition example 4/14/2011. CS 376 Lecture 22 1 4/14/011 Outlne Dscrmnatve classfers for mage recognton Wednesday, Aprl 13 Krsten Grauman UT-Austn Last tme: wndow-based generc obect detecton basc ppelne face detecton wth boostng as case study Today:

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

The Research of Ellipse Parameter Fitting Algorithm of Ultrasonic Imaging Logging in the Casing Hole

The Research of Ellipse Parameter Fitting Algorithm of Ultrasonic Imaging Logging in the Casing Hole Appled Mathematcs, 04, 5, 37-3 Publshed Onlne May 04 n ScRes. http://www.scrp.org/journal/am http://dx.do.org/0.436/am.04.584 The Research of Ellpse Parameter Fttng Algorthm of Ultrasonc Imagng Loggng

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Detection of an Object by using Principal Component Analysis

Detection of an Object by using Principal Component Analysis Detecton of an Object by usng Prncpal Component Analyss 1. G. Nagaven, 2. Dr. T. Sreenvasulu Reddy 1. M.Tech, Department of EEE, SVUCE, Trupath, Inda. 2. Assoc. Professor, Department of ECE, SVUCE, Trupath,

More information

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION

MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION MULTISPECTRAL IMAGES CLASSIFICATION BASED ON KLT AND ATR AUTOMATIC TARGET RECOGNITION Paulo Quntlano 1 & Antono Santa-Rosa 1 Federal Polce Department, Brasla, Brazl. E-mals: quntlano.pqs@dpf.gov.br and

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz

Compiler Design. Spring Register Allocation. Sample Exercises and Solutions. Prof. Pedro C. Diniz Compler Desgn Sprng 2014 Regster Allocaton Sample Exercses and Solutons Prof. Pedro C. Dnz USC / Informaton Scences Insttute 4676 Admralty Way, Sute 1001 Marna del Rey, Calforna 90292 pedro@s.edu Regster

More information

Lecture 4: Principal components

Lecture 4: Principal components /3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness

More information

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters

Proper Choice of Data Used for the Estimation of Datum Transformation Parameters Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK

FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK FINDING IMPORTANT NODES IN SOCIAL NETWORKS BASED ON MODIFIED PAGERANK L-qng Qu, Yong-quan Lang 2, Jng-Chen 3, 2 College of Informaton Scence and Technology, Shandong Unversty of Scence and Technology,

More information

Clustering Algorithm of Similarity Segmentation based on Point Sorting

Clustering Algorithm of Similarity Segmentation based on Point Sorting Internatonal onference on Logstcs Engneerng, Management and omputer Scence (LEMS 2015) lusterng Algorthm of Smlarty Segmentaton based on Pont Sortng Hanbng L, Yan Wang*, Lan Huang, Mngda L, Yng Sun, Hanyuan

More information

Study of Data Stream Clustering Based on Bio-inspired Model

Study of Data Stream Clustering Based on Bio-inspired Model , pp.412-418 http://dx.do.org/10.14257/astl.2014.53.86 Study of Data Stream lusterng Based on Bo-nspred Model Yngme L, Mn L, Jngbo Shao, Gaoyang Wang ollege of omputer Scence and Informaton Engneerng,

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Understanding K-Means Non-hierarchical Clustering

Understanding K-Means Non-hierarchical Clustering SUNY Albany - Techncal Report 0- Understandng K-Means Non-herarchcal Clusterng Ian Davdson State Unversty of New York, 1400 Washngton Ave., Albany, 105. DAVIDSON@CS.ALBANY.EDU Abstract The K-means algorthm

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

User Authentication Based On Behavioral Mouse Dynamics Biometrics

User Authentication Based On Behavioral Mouse Dynamics Biometrics User Authentcaton Based On Behavoral Mouse Dynamcs Bometrcs Chee-Hyung Yoon Danel Donghyun Km Department of Computer Scence Department of Computer Scence Stanford Unversty Stanford Unversty Stanford, CA

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Identifying Top-k Most Influential Nodes by using the Topological Diffusion Models in the Complex Networks

Identifying Top-k Most Influential Nodes by using the Topological Diffusion Models in the Complex Networks (IJACSA) Internatonal Journal of Advanced Computer Scence and Applcatons, Vol. 8, No., 07 Identfyng Top-k Most Influental Nodes by usng the Topologcal Dffuson Models n the Complex Networks Maryam Padar,

More information

Machine Learning. Topic 6: Clustering

Machine Learning. Topic 6: Clustering Machne Learnng Topc 6: lusterng lusterng Groupng data nto (hopefully useful) sets. Thngs on the left Thngs on the rght Applcatons of lusterng Hypothess Generaton lusters mght suggest natural groups. Hypothess

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE

SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE SHAPE RECOGNITION METHOD BASED ON THE k-nearest NEIGHBOR RULE Dorna Purcaru Faculty of Automaton, Computers and Electroncs Unersty of Craoa 13 Al. I. Cuza Street, Craoa RO-1100 ROMANIA E-mal: dpurcaru@electroncs.uc.ro

More information

Meta-heuristics for Multidimensional Knapsack Problems

Meta-heuristics for Multidimensional Knapsack Problems 2012 4th Internatonal Conference on Computer Research and Development IPCSIT vol.39 (2012) (2012) IACSIT Press, Sngapore Meta-heurstcs for Multdmensonal Knapsack Problems Zhbao Man + Computer Scence Department,

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Outlier Detection based on Robust Parameter Estimates

Outlier Detection based on Robust Parameter Estimates Outler Detecton based on Robust Parameter Estmates Nor Azlda Aleng 1, Ny Ny Nang, Norzan Mohamed 3 and Kasyp Mokhtar 4 1,3 School of Informatcs and Appled Mathematcs, Unverst Malaysa Terengganu, 1030 Kuala

More information

Learning-Based Top-N Selection Query Evaluation over Relational Databases

Learning-Based Top-N Selection Query Evaluation over Relational Databases Learnng-Based Top-N Selecton Query Evaluaton over Relatonal Databases Lang Zhu *, Wey Meng ** * School of Mathematcs and Computer Scence, Hebe Unversty, Baodng, Hebe 071002, Chna, zhu@mal.hbu.edu.cn **

More information

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc.

An Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc. [Type text] [Type text] [Type text] ISSN : 97-735 Volume Issue 9 BoTechnology An Indan Journal FULL PAPER BTAIJ, (9), [333-3] Matlab mult-dmensonal model-based - 3 Chnese football assocaton super league

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Data Mining: Model Evaluation

Data Mining: Model Evaluation Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Reducing Frame Rate for Object Tracking

Reducing Frame Rate for Object Tracking Reducng Frame Rate for Object Trackng Pavel Korshunov 1 and We Tsang Oo 2 1 Natonal Unversty of Sngapore, Sngapore 11977, pavelkor@comp.nus.edu.sg 2 Natonal Unversty of Sngapore, Sngapore 11977, oowt@comp.nus.edu.sg

More information

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices

An Application of the Dulmage-Mendelsohn Decomposition to Sparse Null Space Bases of Full Row Rank Matrices Internatonal Mathematcal Forum, Vol 7, 2012, no 52, 2549-2554 An Applcaton of the Dulmage-Mendelsohn Decomposton to Sparse Null Space Bases of Full Row Rank Matrces Mostafa Khorramzadeh Department of Mathematcal

More information

A Semi-parametric Regression Model to Estimate Variability of NO 2

A Semi-parametric Regression Model to Estimate Variability of NO 2 Envronment and Polluton; Vol. 2, No. 1; 2013 ISSN 1927-0909 E-ISSN 1927-0917 Publshed by Canadan Center of Scence and Educaton A Sem-parametrc Regresson Model to Estmate Varablty of NO 2 Meczysław Szyszkowcz

More information

Support Vector Machines

Support Vector Machines Support Vector Machnes Decson surface s a hyperplane (lne n 2D) n feature space (smlar to the Perceptron) Arguably, the most mportant recent dscovery n machne learnng In a nutshell: map the data to a predetermned

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility

Evaluation of an Enhanced Scheme for High-level Nested Network Mobility IJCSNS Internatonal Journal of Computer Scence and Network Securty, VOL.15 No.10, October 2015 1 Evaluaton of an Enhanced Scheme for Hgh-level Nested Network Moblty Mohammed Babker Al Mohammed, Asha Hassan.

More information

Intra-Parametric Analysis of a Fuzzy MOLP

Intra-Parametric Analysis of a Fuzzy MOLP Intra-Parametrc Analyss of a Fuzzy MOLP a MIAO-LING WANG a Department of Industral Engneerng and Management a Mnghsn Insttute of Technology and Hsnchu Tawan, ROC b HSIAO-FAN WANG b Insttute of Industral

More information

Load Balancing for Hex-Cell Interconnection Network

Load Balancing for Hex-Cell Interconnection Network Int. J. Communcatons, Network and System Scences,,, - Publshed Onlne Aprl n ScRes. http://www.scrp.org/journal/jcns http://dx.do.org/./jcns.. Load Balancng for Hex-Cell Interconnecton Network Saher Manaseer,

More information

A Statistical Model Selection Strategy Applied to Neural Networks

A Statistical Model Selection Strategy Applied to Neural Networks A Statstcal Model Selecton Strategy Appled to Neural Networks Joaquín Pzarro Elsa Guerrero Pedro L. Galndo joaqun.pzarro@uca.es elsa.guerrero@uca.es pedro.galndo@uca.es Dpto Lenguajes y Sstemas Informátcos

More information

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques

Enhancement of Infrequent Purchased Product Recommendation Using Data Mining Techniques Enhancement of Infrequent Purchased Product Recommendaton Usng Data Mnng Technques Noraswalza Abdullah, Yue Xu, Shlomo Geva, and Mark Loo Dscplne of Computer Scence Faculty of Scence and Technology Queensland

More information

The Grouping Methods and Rank Estimator, Based on Ranked Set sampling, for the linear Error in Variable Models

The Grouping Methods and Rank Estimator, Based on Ranked Set sampling, for the linear Error in Variable Models P Internatonal Journal of Scentfc Engneerng and Appled Scence (IJSEAS) Volume-, Issue-7,Jul 06 The Groupng Methods and Rank Estmator, Based on Ranked Set samplng, for the lnear Error n Varable Models Ahmed

More information

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices

Steps for Computing the Dissimilarity, Entropy, Herfindahl-Hirschman and. Accessibility (Gravity with Competition) Indices Steps for Computng the Dssmlarty, Entropy, Herfndahl-Hrschman and Accessblty (Gravty wth Competton) Indces I. Dssmlarty Index Measurement: The followng formula can be used to measure the evenness between

More information

Edge Detection in Noisy Images Using the Support Vector Machines

Edge Detection in Noisy Images Using the Support Vector Machines Edge Detecton n Nosy Images Usng the Support Vector Machnes Hlaro Gómez-Moreno, Saturnno Maldonado-Bascón, Francsco López-Ferreras Sgnal Theory and Communcatons Department. Unversty of Alcalá Crta. Madrd-Barcelona

More information

UB at GeoCLEF Department of Geography Abstract

UB at GeoCLEF Department of Geography   Abstract UB at GeoCLEF 2006 Mguel E. Ruz (1), Stuart Shapro (2), June Abbas (1), Slva B. Southwck (1) and Davd Mark (3) State Unversty of New York at Buffalo (1) Department of Lbrary and Informaton Studes (2) Department

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Classifying Acoustic Transient Signals Using Artificial Intelligence

Classifying Acoustic Transient Signals Using Artificial Intelligence Classfyng Acoustc Transent Sgnals Usng Artfcal Intellgence Steve Sutton, Unversty of North Carolna At Wlmngton (suttons@charter.net) Greg Huff, Unversty of North Carolna At Wlmngton (jgh7476@uncwl.edu)

More information

Clustering algorithms and validity measures

Clustering algorithms and validity measures Clusterng algorthms and valdty measures M. Hald, Y. Batstas, M. Vazrganns Department of Informatcs Athens Unversty of Economcs & Busness Emal: {mhal, yanns, mvazrg}@aueb.gr Abstract Clusterng ams at dscoverng

More information

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables

More information

A NEW APPROACH FOR SUBWAY TUNNEL DEFORMATION MONITORING: HIGH-RESOLUTION TERRESTRIAL LASER SCANNING

A NEW APPROACH FOR SUBWAY TUNNEL DEFORMATION MONITORING: HIGH-RESOLUTION TERRESTRIAL LASER SCANNING A NEW APPROACH FOR SUBWAY TUNNEL DEFORMATION MONITORING: HIGH-RESOLUTION TERRESTRIAL LASER SCANNING L Jan a, Wan Youchuan a,, Gao Xanjun a a School of Remote Sensng and Informaton Engneerng, Wuhan Unversty,129

More information

Mathematics 256 a course in differential equations for engineering students

Mathematics 256 a course in differential equations for engineering students Mathematcs 56 a course n dfferental equatons for engneerng students Chapter 5. More effcent methods of numercal soluton Euler s method s qute neffcent. Because the error s essentally proportonal to the

More information

IMPROVING AND EXTENDING THE INFORMATION ON PRINCIPAL COMPONENT ANALYSIS FOR LOCAL NEIGHBORHOODS IN 3D POINT CLOUDS

IMPROVING AND EXTENDING THE INFORMATION ON PRINCIPAL COMPONENT ANALYSIS FOR LOCAL NEIGHBORHOODS IN 3D POINT CLOUDS IMPROVING AND EXTENDING THE INFORMATION ON PRINCIPAL COMPONENT ANALYSIS FOR LOCAL NEIGHBORHOODS IN 3D POINT CLOUDS Davd Belton Cooperatve Research Centre for Spatal Informaton (CRC-SI) The Insttute for

More information

Associative Based Classification Algorithm For Diabetes Disease Prediction

Associative Based Classification Algorithm For Diabetes Disease Prediction Internatonal Journal of Engneerng Trends and Technology (IJETT) Volume-41 Number-3 - November 016 Assocatve Based Classfcaton Algorthm For Dabetes Dsease Predcton 1 N. Gnana Deepka, Y.surekha, 3 G.Laltha

More information

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers

Investigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,

More information

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields

A mathematical programming approach to the analysis, design and scheduling of offshore oilfields 17 th European Symposum on Computer Aded Process Engneerng ESCAPE17 V. Plesu and P.S. Agach (Edtors) 2007 Elsever B.V. All rghts reserved. 1 A mathematcal programmng approach to the analyss, desgn and

More information

Solving two-person zero-sum game by Matlab

Solving two-person zero-sum game by Matlab Appled Mechancs and Materals Onlne: 2011-02-02 ISSN: 1662-7482, Vols. 50-51, pp 262-265 do:10.4028/www.scentfc.net/amm.50-51.262 2011 Trans Tech Publcatons, Swtzerland Solvng two-person zero-sum game by

More information

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches

Fuzzy Filtering Algorithms for Image Processing: Performance Evaluation of Various Approaches Proceedngs of the Internatonal Conference on Cognton and Recognton Fuzzy Flterng Algorthms for Image Processng: Performance Evaluaton of Varous Approaches Rajoo Pandey and Umesh Ghanekar Department of

More information

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB

SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB SURFACE PROFILE EVALUATION BY FRACTAL DIMENSION AND STATISTIC TOOLS USING MATLAB V. Hotař, A. Hotař Techncal Unversty of Lberec, Department of Glass Producng Machnes and Robotcs, Department of Materal

More information

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article Avalable onlne www.jocpr.com Journal of Chemcal and Pharmaceutcal Research, 2014, 6(6):2512-2520 Research Artcle ISSN : 0975-7384 CODEN(USA) : JCPRC5 Communty detecton model based on ncremental EM clusterng

More information

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis

Solitary and Traveling Wave Solutions to a Model. of Long Range Diffusion Involving Flux with. Stability Analysis Internatonal Mathematcal Forum, Vol. 6,, no. 7, 8 Soltary and Travelng Wave Solutons to a Model of Long Range ffuson Involvng Flux wth Stablty Analyss Manar A. Al-Qudah Math epartment, Rabgh Faculty of

More information

Simulation Based Analysis of FAST TCP using OMNET++

Simulation Based Analysis of FAST TCP using OMNET++ Smulaton Based Analyss of FAST TCP usng OMNET++ Umar ul Hassan 04030038@lums.edu.pk Md Term Report CS678 Topcs n Internet Research Sprng, 2006 Introducton Internet traffc s doublng roughly every 3 months

More information