Outlier Detection based on Robust Parameter Estimates
|
|
- Reynard Butler
- 5 years ago
- Views:
Transcription
1 Outler Detecton based on Robust Parameter Estmates Nor Azlda Aleng 1, Ny Ny Nang, Norzan Mohamed 3 and Kasyp Mokhtar 4 1,3 School of Informatcs and Appled Mathematcs, Unverst Malaysa Terengganu, 1030 Kuala Terengganu, Terengganu, Malaysa. Insttute for Communty (Health) Development (-CODE), Unverst Sultan Zanal Abdn, Gong Badak Campus, 1300 Kuala Terengganu, Malaysa. 4 School of Martme Busness and Management, Unverst Malaysa Terengganu, 1030 Kuala Terengganu, Terengganu, Malaysa. Orcd; Abstract Outlers can nfluence the analyss of data n varous dfferent ways. The outlers can lead to model msspecfcaton, ncorrect analyss results and can make all estmaton procedures meanngless. In regresson analyss, ordnary least square estmaton s most frequently used for estmaton of the parameters n the model. Unfortunately, ths estmator s senstve to outlers. Thus, n ths paper we proposed some statstcs for detecton of outlers based on robust estmaton, namely least trmmed squares (LTS). A smulaton study was performed to prove that the alternatve approach gves a better results than OLS estmaton to dentfy outlers. Keywords: Outlers, least trmmed squares (LTS) and robust regresson. INTRODUCTION The lnear regresson can be expressed n terms of matrces as y X ; where y s an N-dmensonal column vector, X s an N (K + 1) matrx and s an N- dmensonal column vector of error terms,.e. y1 y y 3 yn N 1 1 x 11 1 x 1 1 x x N1 N ( K 1) x1 K x K x 3K x NK 0 1 K ( K 1) 1 1 N N 1 In fttng multple lnear regesson model (1.1), the most wdely used ordnary least squares (OLS) to fnd the best estmates of. Unfortunately, n the presence of outlers, the OLS estmators are stll based. Outlers play an mportant role n regresson. An outlers (observatons) that s qute dfferent from most the other values or observatons n a data set. Observatons n a data set can be outlers n several dfferent ways. Accordng to Barnett and Lews (1994), an outler s an observaton that s nconsstent wth the rest of the data. Even one outler can effect the regresson model. There s an (1) evdence that the outlers can lead to model msspecfcaton, ncorrect analyss result and can make all estmaton procedures meanngless, (Rousseeuw and Leroy, 1987; Alma, 011; Zmmerman, 1994, 1995, 1998). Outlers n the response varable represent model falure. Outlers n the regressor varable values are extreme n X-space are called leverage ponts. Rousseeuw and Lorey (1987) defned that outlers n three types; 1) vertcal outlers, ) bad leverage pont, 3) good leverage pont. Vertcal outler s an observaton that has nfluence on the error term of y-dmenson (response) but s not nfluental n the space of x-dmenson (regressor). Good leverage pont s an observaton that are outlyng n the ndependent varables but s not a regresson outler. Ths pont does not affect the least square estmaton but t affects statstcal nference snce ths pont cut down the estmated standard errors. Good leverage ponts mprove the precson of the regresson coeffcents. Bad laverage pont s an observaton that s outlyng n ndependent varables and located far from the true regresson lne and reduce the precson of regresson coeffcents. Generally speakng, there are two technques for handlng the outlers. The frst s to use the some robust procedure whch resst ther nfluence n the statstcal analyss. The second s to remove outlers from the data set. Therefore, n ths study t s proposed some knd of robust procedure namely the least trmmed squares (LTS). Ths method s relable for dentfyng the regresson outlers both n smple and multvarates stuatons. The objectve of ths study was to detect the outlers and leverage ponts n the data set. The performance of the proposed method was dscussed extensvely by usng medcal data. MATERIALS AND METHODS Ths study focused on the blood pressure data. Systolc blood pressure (sbp) s a dependent varable and ndependent varables namely n Table 1. A total of 100 repondents were selected and dagnosed to have blood pressure problem based on WHO crtera. The explanaton 1349
2 of the varables s shown n Table 1 and the data were collected from Health Centre HUSM n Malaysa. Table 1: Explanaton of the Varables Code Varables Explanaton of the varables y SBP Systolc blood pressure x 1 AGE Age (year) x BMI Body mass ndex x TOTCHOLES Total cholesterol 3 (Mmol/L) x DIABETES Dabetes melltus; 0 = No, 4 1 = Yes x DBP Dastolc blood pressure 5 x HDL HDL cholesterol 6 x HEIGHT Heght (m) 7 x TRIG Trglycerdes 8 x WEIGHT Weght (kg) 9 As s well known, a large number of dagnostcs have been proposed to detect outlers. Practcally, the dagnostcs whch was based on the ordnary least squares estmates were not effcent and based when outlers exsted n the data. Thus, to remedy ths problem, least trmmed squres estmamtors (LTS) was proposed. Ths was an alternatve approach n dealng wth outlers n regresson analyss. The Least Trmmed Squares Estmators (LTS) A statstcal procedure s regarded as robust f t performs reasonably well even when the assumptons of the statstcal model are not true. If we assume our data follows standard lnear regresson, least squares estmates and test perform qute well, but they are not robust wth the presence of outler observaton(s) n the data set (Rousseeuw and Leroy, 1987). In ths case we proposed the popular robust technque s the called LTS estmator. LTS estmaton s a hgh breakdown pont. The breakdown pont s a measure for stablty of the estmator when the sample contans a large fracton of outlers (Hampel, 1975). LTS defned as: ( LTS) h r 1 argmn ( ) () where r(1)... r( n) are ordered squared resduals. Robust regresson s extremely useful n dentfyng outlers. LTS regresson s a relable data analytcal tool that may be used to dscover regresson outlers both n smple and multvarable condtons. In ths paper, three dagnosts are used to dentfy outlers whch are gven below. The robust dstance s defned as: T 1 RD( x ) [ x T( X )] C( X ) [ X T( X )] (3) where T( X ) and CX ( ) are the robust locatonand scatter matrx for thr multvarable. The Mahalanobs dstances s useful technque for detectng outlers s defned as 1 ( ). ( ) T MD x C x (4) Where C s the classcal sample covarance matrx. In classcal lnear regresson,the dagonal element of the hat matrx, T 1 h H X ( X X ) X T are used to dentfy leverage ponts. Rousseeuw and Van Zomeren (1990) defned the relatonshp between the h and MD h [(( MD ) / ( n 1)] [1/ n]. Rousseeuw and Lorey (1987) suggest usng h p / n and MD benchmarks for leverage and Mahalanobs dstances. The Cook s dstance s defned as, where p1;0.95 as 1 ˆ ˆ T CD ( p ) ( Y Y) ( Yˆ Yˆ ) (5) s estmator of the error varance, n r / n p. Cook suggests that CD be compared to a 1 central F dstrbuton wth p and n p degrees of freedom. Ths gves the cutoff values s very hgh. The conventonal cutoff value s 4 / n p. Generally, when the statstcs CD, h and MD are large, case may be an outler or nfluental case. Therefore the dagnostc s very mportant to dentfy the outlers and provdes resstant results n the presence of outlers. Below s the algorthm n SAS language for the multple lnear regresson and robust regresson. Ths algorthm usng SAS software whch s gven as follows: data BP; nput sbp age bm totcholes dabetes dbp hdl heght trg weght; datalnes; 13430
3
4 ; ods rtf fle='robdunc0.rtf' style=journal; ods graphcs on; /* frst we do multple lnear regresson */ proc reg data=bp; model sbp = age bm totcholes dabetes dbp hdl heght trg weght; /* then we do robust regresson, n ths case, LTS-estmaton */ proc robustreg data=bp method=lts; model sbp = age bm totcholes dabetes dbp hdl heght trg weght ; ods graphcs off; /* QQ plot and hstogram */ ods graphcs on; proc robustreg data=bp plots=(rdplot ddplot reshstogram resqqplot); model sbp = age bm totcholes dabetes dbp hdl heght trg weght; ods graphcs off; /* Cook s dstance plot*/ proc reg data=bp; plot (only label)=(rstudentbyleverage CooksD); ods graphcs off; /* then we do robust weghted regresson */ ods graphcs on; proc robustreg method=lts(h=33)fwls data=bp plots=all; model sbp = age bm totcholes dabetes dbp hdl heght trg weght /dagnostcs leverage; output out=robout r=resd sr=stdres; ods rtf close; RESULTS AND DISCUSSIONS In ths study, a set of real data whch s referred to blood pressure data s used to see how well the dagnostc statstcs wth robust estmator perform for the regresson model. The step s the blood pressure data was analyzed usng robust regresson method LTS wth dagnostcs tool such as Mahalanobs dstance, robust MCD dstance and standardzed robust resduals. Results of dagnostcs of outlers and leverage ponts are presented n Table. 1343
5 Obs Table : Robust dagnostcs based on least trmmed squares (LTS) Mahalanobs Dstance Robust MCD Dstance Leverage Standardzed Robust Resdual * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * * Outler The model of blood pressure s: yˆ age dbp e heght weght (6) predct a trend. The results of Table show that the exstng of outlers n the blood pressure data. 13 observatons are consdered as outlers and observaton 93 has hgh leverage. Although, the set data presence of outlers, results reman robust. and R squared valus s (85.3%), ths ndcated the greater the ablty of that model to 13433
6 Outler and Leverage Dagnostcs for sbp Standardzed Robust Resdual Observatons Outlers Leverage Pts Res Cutoff Lev Cutoff Robust MCD Dstance Outler Leverage Outler and Leverage Fgure 1: Regresson dagnostc plot for systolc blood pressure. Based on Fgure 1, gves evdence of the presence of outlyng observatons because the pots fall behnd the band. Observatons (1, 0, 1, 3, 6, 43, 44, 49, 55, 57, 96, 97 and 100) are dentfed as outlers. We can see that, 0 observatons are dentfed as laverage ponts and observatons 1, 6 and 96 dentfed as outlers and leverage ponts at the same tme. CONCLUSION Least trmmed squres estmamtors (LTS) s an alternatve approach n dealng wth outlers n regresson analyss. The value R gave strong correlaton and relaton between varables, so t shown that strong good ft model. Robust verson of the dagnostcs detect all outlers n the data n one step. The results of the smulaton study agree well wth the real data. REFERENCES [1] Alma, O. G. (011). Comparson of robust regresson methods n lnear regresson. Int. [] Journal Contemp. Math. Scences, 6(9), [3] Barnett, V., & Lews, T. (1994). Outlers n statstcal data. New York: John Wley and Sons. [4] Rousseeuw, P. J. and Leory, A. (1987). Robust Regresson and Outler Detecton. Wley Seres n [5] Probablty and Statstcs. [6] Zmmerman, D. W. (1994). A note on the nfluence of outlers on parametrc and nonparametrc [7] tests. Journal of General Psychology. 11(4): [8] Zmmerman, D. W. (1995). Increasng the power of nonparametrc tests by detectng and [9] downweghtng outlers. Journal of Expermental Educaton, 64(1), [10] Zmmerman, D. W. (1998). Invaldaton of parametrc and nonparametrc statstcal tests by [11] concurrent volaton of two assumptons. Journal of Expermental Educaton. 67(1):
Feature Reduction and Selection
Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components
More informationy and the total sum of
Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton
More informationCS 534: Computer Vision Model Fitting
CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust
More informationEstimating Regression Coefficients using Weighted Bootstrap with Probability
Norazan M R, Habshah Md, A H M R Imon Estmatng Regresson Coeffcents usng Weghted Bootstrap wth Probablty NORAZAN M R, HABSHAH MIDI AND A H M R IMON Faculty of Computer and Mathematcal Scences, Unversty
More informationEmpirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap
Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*
More informationMalaysian Journal of Applied Sciences
Unverst Sultan Zanal Abdn eissn 0127-9246 (Onlne) Malasan Journal of Appled Scences ORIGINAL ARTICLE A New Strateg of Handlng General Insurance Modellng Usng Appled Lnear Method *Wan Muhamad Amr W Ahmad
More informationX- Chart Using ANOM Approach
ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are
More informationA Simple and Efficient Goal Programming Model for Computing of Fuzzy Linear Regression Parameters with Considering Outliers
62626262621 Journal of Uncertan Systems Vol.5, No.1, pp.62-71, 211 Onlne at: www.us.org.u A Smple and Effcent Goal Programmng Model for Computng of Fuzzy Lnear Regresson Parameters wth Consderng Outlers
More informationNAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics
Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson
More informationWhy visualisation? IRDS: Visualization. Univariate data. Visualisations that we won t be interested in. Graphics provide little additional information
Why vsualsaton? IRDS: Vsualzaton Charles Sutton Unversty of Ednburgh Goal : Have a data set that I want to understand. Ths s called exploratory data analyss. Today s lecture. Goal II: Want to dsplay data
More informationSimulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010
Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement
More informationA Semi-parametric Regression Model to Estimate Variability of NO 2
Envronment and Polluton; Vol. 2, No. 1; 2013 ISSN 1927-0909 E-ISSN 1927-0917 Publshed by Canadan Center of Scence and Educaton A Sem-parametrc Regresson Model to Estmate Varablty of NO 2 Meczysław Szyszkowcz
More informationProper Choice of Data Used for the Estimation of Datum Transformation Parameters
Proper Choce of Data Used for the Estmaton of Datum Transformaton Parameters Hakan S. KUTOGLU, Turkey Key words: Coordnate systems; transformaton; estmaton, relablty. SUMMARY Advances n technologes and
More informationLecture 4: Principal components
/3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness
More informationCluster-Based Profile Monitoring in Phase I Analysis. Yajuan Chen. Doctor of Philosophy In Statistics
Cluster-Based Profle Montorng n Phase I Analyss Yajuan Chen Dssertaton submtted to the faculty of the Vrgna Polytechnc Insttute and State Unversty n partal fulfllment of the requrements for the degree
More informationSubspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;
Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features
More informationUnsupervised Learning and Clustering
Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned
More informationJMASM42: An Alternative Algorithm and Programming Implementation for Least Absolute Deviation Estimator of the Linear Regression Models (R)
Journal of Modern Appled Statstcal Methods Volume 15 Issue 2 Artcle 45 11-1-2016 JMASM42: An Alternatve Algorthm and Programmng Implementaton for Least Absolute Devaton Estmator of the Lnear Regresson
More informationThe Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding
Internatonal Journal of Operatons Research Internatonal Journal of Operatons Research Vol., No., 9 4 (005) The Man-hour Estmaton Models & Its Comparson of Interm Products Assembly for Shpbuldng Bn Lu and
More informationCluster Analysis of Electrical Behavior
Journal of Computer and Communcatons, 205, 3, 88-93 Publshed Onlne May 205 n ScRes. http://www.scrp.org/ournal/cc http://dx.do.org/0.4236/cc.205.350 Cluster Analyss of Electrcal Behavor Ln Lu Ln Lu, School
More informationAvailable online at ScienceDirect. Procedia Environmental Sciences 26 (2015 )
Avalable onlne at www.scencedrect.com ScenceDrect Proceda Envronmental Scences 26 (2015 ) 109 114 Spatal Statstcs 2015: Emergng Patterns Calbratng a Geographcally Weghted Regresson Model wth Parameter-Specfc
More informationProbability Base Classification Technique: A Preliminary Study for Two Groups
Mathematcal Theory and Modelng ISSN 4-5804 (Paper) ISSN 5-05 (Onlne) www.ste.org Probablty Base Classfcaton Technque: A Prelmnary Study for Two Groups Frday Znzendoff Okwonu,* Abdul Rahman Othman. Department
More informationOutlier Detection Methodologies Overview
Outler Detecton Methodologes Overvew Mohd. Noor Md. Sap Department of Computer and Informaton Systems Faculty of Computer Scence and Informaton Systems Unverst Teknolog Malaysa 81310 Skuda, Johor Bahru,
More information3D FCRM MODELING IN MILES PER GALLON OF CAR
3D FCRM Modelng n Mles Per Gallon of Car 3 4 3D FCRM MODELING IN MILES PER GALLON OF CAR Mohd Safullah Rusman Robah Adnan Efend N. Nasbo The new fuzzy c-regresson modelng (FcRM) are wdely used n order
More informationAPPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT
3. - 5. 5., Brno, Czech Republc, EU APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT Abstract Josef TOŠENOVSKÝ ) Lenka MONSPORTOVÁ ) Flp TOŠENOVSKÝ
More informationParameter estimation for incomplete bivariate longitudinal data in clinical trials
Parameter estmaton for ncomplete bvarate longtudnal data n clncal trals Naum M. Khutoryansky Novo Nordsk Pharmaceutcals, Inc., Prnceton, NJ ABSTRACT Bvarate models are useful when analyzng longtudnal data
More informationA Bootstrap Approach to Robust Regression
Internatonal Journal of Appled Scence and Technology Vol. No. 9; November A Bootstrap Approach to Robust Regresson Dr. Hamadu Dallah Department of Actuaral Scence and Insurance Unversty of Lagos Akoka,
More informationLearning Ensemble of Local PDM-based Regressions. Yen Le Computational Biomedicine Lab Advisor: Prof. Ioannis A. Kakadiaris
Learnng Ensemble of Local PDM-based Regressons Yen Le Computatonal Bomedcne Lab Advsor: Prof. Ioanns A. Kakadars 1 Problem statement Fttng a statstcal shape model (PDM) for mage segmentaton Callosum segmentaton
More informationContent Based Image Retrieval Using 2-D Discrete Wavelet with Texture Feature with Different Classifiers
IOSR Journal of Electroncs and Communcaton Engneerng (IOSR-JECE) e-issn: 78-834,p- ISSN: 78-8735.Volume 9, Issue, Ver. IV (Mar - Apr. 04), PP 0-07 Content Based Image Retreval Usng -D Dscrete Wavelet wth
More informationA New Approach For the Ranking of Fuzzy Sets With Different Heights
New pproach For the ankng of Fuzzy Sets Wth Dfferent Heghts Pushpnder Sngh School of Mathematcs Computer pplcatons Thapar Unversty, Patala-7 00 Inda pushpndersnl@gmalcom STCT ankng of fuzzy sets plays
More informationSVM-based Learning for Multiple Model Estimation
SVM-based Learnng for Multple Model Estmaton Vladmr Cherkassky and Yunqan Ma Department of Electrcal and Computer Engneerng Unversty of Mnnesota Mnneapols, MN 55455 {cherkass,myq}@ece.umn.edu Abstract:
More informationS1 Note. Basis functions.
S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type
More informationFEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur
FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents
More informationDETECTING ERRORS AND IMPUTING MISSING DATA FOR SINGLE LOOP SURVEILLANCE SYSTEMS
DETECTING ERRORS AND IMPUTING MISSING DATA FOR SINGLE LOOP SURVEILLANCE SYSTEMS Chao Chen Department of Electrcal Engneerng and Computer Scence Unversty of Calforna, Berkeley CA 94720 Tel: (510) 643-5894
More informationAdaptive Transfer Learning
Adaptve Transfer Learnng Bn Cao, Snno Jaln Pan, Yu Zhang, Dt-Yan Yeung, Qang Yang Hong Kong Unversty of Scence and Technology Clear Water Bay, Kowloon, Hong Kong {caobn,snnopan,zhangyu,dyyeung,qyang}@cse.ust.hk
More informationSome Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.
Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,
More informationDetecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems
16 Transportaton Research Record 1855 Paper No. 3-357 Detectng Errors and Imputng Mssng Data for Sngle-Loop Survellance Systems Chao Chen, Jamyoung Kwon, John Rce, Alexander Skabardons, and Pravn Varaya
More informationUnsupervised Learning
Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and
More informationUSING GRAPHING SKILLS
Name: BOLOGY: Date: _ Class: USNG GRAPHNG SKLLS NTRODUCTON: Recorded data can be plotted on a graph. A graph s a pctoral representaton of nformaton recorded n a data table. t s used to show a relatonshp
More informationOutline. Type of Machine Learning. Examples of Application. Unsupervised Learning
Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton
More informationSupport Vector Machines
/9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.
More informationLife Tables (Times) Summary. Sample StatFolio: lifetable times.sgp
Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables
More information7/12/2016. GROUP ANALYSIS Martin M. Monti UCLA Psychology AGGREGATING MULTIPLE SUBJECTS VARIANCE AT THE GROUP LEVEL
GROUP ANALYSIS Martn M. Mont UCLA Psychology NITP AGGREGATING MULTIPLE SUBJECTS When we conduct mult-subject analyss we are tryng to understand whether an effect s sgnfcant across a group of people. Whether
More informationA Comparative Study for Outlier Detection Techniques in Data Mining
A Comparatve Study for Outler Detecton Technques n Data Mnng Zurana Abu Bakar, Rosmayat Mohemad, Akbar Ahmad Department of Computer Scence Faculty of Scence and Technology Unversty College of Scence and
More informationSignature and Lexicon Pruning Techniques
Sgnature and Lexcon Prunng Technques Srnvas Palla, Hansheng Le, Venu Govndaraju Centre for Unfed Bometrcs and Sensors Unversty at Buffalo {spalla2, hle, govnd}@cedar.buffalo.edu Abstract Handwrtten word
More informationProgramming in Fortran 90 : 2017/2018
Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values
More informationImproved Methods for Lithography Model Calibration
Improved Methods for Lthography Model Calbraton Chrs Mack www.lthoguru.com, Austn, Texas Abstract Lthography models, ncludng rgorous frst prncple models and fast approxmate models used for OPC, requre
More informationTECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.
TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of
More informationSmoothing Spline ANOVA for variable screening
Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory
More informationWishing you all a Total Quality New Year!
Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma
More informationMachine Learning 9. week
Machne Learnng 9. week Mappng Concept Radal Bass Functons (RBF) RBF Networks 1 Mappng It s probably the best scenaro for the classfcaton of two dataset s to separate them lnearly. As you see n the below
More informationData Mining: Model Evaluation
Data Mnng: Model Evaluaton Aprl 16, 2013 1 Issues: Evaluatng Classfcaton Methods Accurac classfer accurac: predctng class label predctor accurac: guessng value of predcted attrbutes Speed tme to construct
More informationAnalysis of Malaysian Wind Direction Data Using ORIANA
Modern Appled Scence March, 29 Analyss of Malaysan Wnd Drecton Data Usng ORIANA St Fatmah Hassan (Correspondng author) Centre for Foundaton Studes n Scence Unversty of Malaya, 63 Kuala Lumpur, Malaysa
More informationAnonymisation of Public Use Data Sets
Anonymsaton of Publc Use Data Sets Methods for Reducng Dsclosure Rsk and the Analyss of Perturbed Data Harvey Goldsten Unversty of Brstol and Unversty College London and Natale Shlomo Unversty of Manchester
More informationSLAM Summer School 2006 Practical 2: SLAM using Monocular Vision
SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,
More informationAn Indian Journal FULL PAPER ABSTRACT KEYWORDS. Trade Science Inc.
[Type text] [Type text] [Type text] ISSN : 97-735 Volume Issue 9 BoTechnology An Indan Journal FULL PAPER BTAIJ, (9), [333-3] Matlab mult-dmensonal model-based - 3 Chnese football assocaton super league
More informationModular PCA Face Recognition Based on Weighted Average
odern Appled Scence odular PCA Face Recognton Based on Weghted Average Chengmao Han (Correspondng author) Department of athematcs, Lny Normal Unversty Lny 76005, Chna E-mal: hanchengmao@163.com Abstract
More informationA CLASS OF TRANSFORMED EFFICIENT RATIO ESTIMATORS OF FINITE POPULATION MEAN. Department of Statistics, Islamia College, Peshawar, Pakistan 2
Pa. J. Statst. 5 Vol. 3(4), 353-36 A CLASS OF TRANSFORMED EFFICIENT RATIO ESTIMATORS OF FINITE POPULATION MEAN Sajjad Ahmad Khan, Hameed Al, Sadaf Manzoor and Alamgr Department of Statstcs, Islama College,
More informationPrincipal Component Inversion
Prncpal Component Inverson Dr. A. Neumann, H. Krawczyk German Aerospace Centre DLR Remote Sensng Technology Insttute Marne Remote Sensng Prncpal Components - Propertes The Lnear Inverson Algorthm Optmsaton
More informationThe Grouping Methods and Rank Estimator, Based on Ranked Set sampling, for the linear Error in Variable Models
P Internatonal Journal of Scentfc Engneerng and Appled Scence (IJSEAS) Volume-, Issue-7,Jul 06 The Groupng Methods and Rank Estmator, Based on Ranked Set samplng, for the lnear Error n Varable Models Ahmed
More informationA Statistical Model Selection Strategy Applied to Neural Networks
A Statstcal Model Selecton Strategy Appled to Neural Networks Joaquín Pzarro Elsa Guerrero Pedro L. Galndo joaqun.pzarro@uca.es elsa.guerrero@uca.es pedro.galndo@uca.es Dpto Lenguajes y Sstemas Informátcos
More informationImprovement of Spatial Resolution Using BlockMatching Based Motion Estimation and Frame. Integration
Improvement of Spatal Resoluton Usng BlockMatchng Based Moton Estmaton and Frame Integraton Danya Suga and Takayuk Hamamoto Graduate School of Engneerng, Tokyo Unversty of Scence, 6-3-1, Nuku, Katsuska-ku,
More informationAn Optimal Algorithm for Prufer Codes *
J. Software Engneerng & Applcatons, 2009, 2: 111-115 do:10.4236/jsea.2009.22016 Publshed Onlne July 2009 (www.scrp.org/journal/jsea) An Optmal Algorthm for Prufer Codes * Xaodong Wang 1, 2, Le Wang 3,
More informationVanishing Hull. Jinhui Hu, Suya You, Ulrich Neumann University of Southern California {jinhuihu,suyay,
Vanshng Hull Jnhu Hu Suya You Ulrch Neumann Unversty of Southern Calforna {jnhuhusuyay uneumann}@graphcs.usc.edu Abstract Vanshng ponts are valuable n many vson tasks such as orentaton estmaton pose recovery
More informationType-2 Fuzzy Non-uniform Rational B-spline Model with Type-2 Fuzzy Data
Malaysan Journal of Mathematcal Scences 11(S) Aprl : 35 46 (2017) Specal Issue: The 2nd Internatonal Conference and Workshop on Mathematcal Analyss (ICWOMA 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES
More informationTN348: Openlab Module - Colocalization
TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages
More informationA Fast Visual Tracking Algorithm Based on Circle Pixels Matching
A Fast Vsual Trackng Algorthm Based on Crcle Pxels Matchng Zhqang Hou hou_zhq@sohu.com Chongzhao Han czhan@mal.xjtu.edu.cn Ln Zheng Abstract: A fast vsual trackng algorthm based on crcle pxels matchng
More informationFusion Performance Model for Distributed Tracking and Classification
Fuson Performance Model for Dstrbuted rackng and Classfcaton K.C. Chang and Yng Song Dept. of SEOR, School of I&E George Mason Unversty FAIRFAX, VA kchang@gmu.edu Martn Lggns Verdan Systems Dvson, Inc.
More informationFace Recognition University at Buffalo CSE666 Lecture Slides Resources:
Face Recognton Unversty at Buffalo CSE666 Lecture Sldes Resources: http://www.face-rec.org/algorthms/ Overvew of face recognton algorthms Correlaton - Pxel based correspondence between two face mages Structural
More informationUSING LINEAR REGRESSION FOR THE AUTOMATION OF SUPERVISED CLASSIFICATION IN MULTITEMPORAL IMAGES
USING LINEAR REGRESSION FOR THE AUTOMATION OF SUPERVISED CLASSIFICATION IN MULTITEMPORAL IMAGES 1 Fetosa, R.Q., 2 Merelles, M.S.P., 3 Blos, P. A. 1,3 Dept. of Electrcal Engneerng ; Catholc Unversty of
More informationA Fast Content-Based Multimedia Retrieval Technique Using Compressed Data
A Fast Content-Based Multmeda Retreval Technque Usng Compressed Data Borko Furht and Pornvt Saksobhavvat NSF Multmeda Laboratory Florda Atlantc Unversty, Boca Raton, Florda 3343 ABSTRACT In ths paper,
More information6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour
6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the
More informationC. Markert-Hahn, K. Schiffl, M. Strohmeier, Nonclinical Statistics Conference,
Roche Pharma Producton Penzberg Practcal Applcatons of Statstcal Process Control C. Markert-Hahn, K. Schffl, M. Strohmeer, Roche Dagnostcs GmbH, Penzberg Operatonal Excellence Statstcs Nonclncal Statstcs
More informationA Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning
A Post Randomzaton Framework for Prvacy-Preservng Bayesan Network Parameter Learnng JIANJIE MA K.SIVAKUMAR School Electrcal Engneerng and Computer Scence, Washngton State Unversty Pullman, WA. 9964-75
More informationRobust Statistics for Computer Vision: Model Fitting, Image Segmentation and Visual Motion Analysis
Robust Statstcs for Computer Vson: Model Fttng, Image Segmentaton and Vsual Moton Analyss by Hanz Wang B.S. (Schuan Unversty) 1996 M.S. (Schuan Unversty) 1999 A thess submtted for the degree of Doctor
More informationParallelism for Nested Loops with Non-uniform and Flow Dependences
Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr
More informationProblem Definitions and Evaluation Criteria for Computational Expensive Optimization
Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty
More informationEconometrics 2. Panel Data Methods. Advanced Panel Data Methods I
Panel Data Methods Econometrcs 2 Advanced Panel Data Methods I Last tme: Panel data concepts and the two-perod case (13.3-4) Unobserved effects model: Tme-nvarant and dosyncratc effects Omted varables
More informationThe Research of Support Vector Machine in Agricultural Data Classification
The Research of Support Vector Machne n Agrcultural Data Classfcaton Le Sh, Qguo Duan, Xnmng Ma, Me Weng College of Informaton and Management Scence, HeNan Agrcultural Unversty, Zhengzhou 45000 Chna Zhengzhou
More informationNUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS
ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana
More informationClassifier Selection Based on Data Complexity Measures *
Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.
More informationUsing Neural Networks and Support Vector Machines in Data Mining
Usng eural etworks and Support Vector Machnes n Data Mnng RICHARD A. WASIOWSKI Computer Scence Department Calforna State Unversty Domnguez Hlls Carson, CA 90747 USA Abstract: - Multvarate data analyss
More informationUSE OF ANALYTIC REDUNDANCY IN FAULT-TOLERANT SENSOR SYSTEMS
USE OF AAYIC REDUDACY I FAU-OERA SESOR SYSES J. lek and F. Kraus Automatc Control aboratory Swss Federal Insttute of echnology EH) Zürch CH-8092 Zürch, Swtzerland Abstract: he paper presents a systematc
More informationResearch on Categorization of Animation Effect Based on Data Mining
MATEC Web of Conferences 22, 0102 0 ( 2015) DOI: 10.1051/ matecconf/ 2015220102 0 C Owned by the authors, publshed by EDP Scences, 2015 Research on Categorzaton of Anmaton Effect Based on Data Mnng Na
More informationTHE THEORY OF REGIONALIZED VARIABLES
CHAPTER 4 THE THEORY OF REGIONALIZED VARIABLES 4.1 Introducton It s ponted out by Armstrong (1998 : 16) that Matheron (1963b), realzng the sgnfcance of the spatal aspect of geostatstcal data, coned the
More informationA Similarity-Based Prognostics Approach for Remaining Useful Life Estimation of Engineered Systems
2008 INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT A Smlarty-Based Prognostcs Approach for Remanng Useful Lfe Estmaton of Engneered Systems Tany Wang, Janbo Yu, Davd Segel, and Jay Lee
More informationImputation Methods for Longitudinal Data: A Comparative Study
Internatonal Journal of Statstcal Dstrbutons and Applcatons 017; 3(4): 7-80 http://www.scencepublshnggroup.com/j/sda do: 10.11648/j.sd.0170304.13 ISSN: 47-3487 (Prnt); ISSN: 47-3509 (Onlne) Imputaton Methods
More informationSynthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007
Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons
More informationDetermining the Optimal Bandwidth Based on Multi-criterion Fusion
Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn
More informationDesign of Structure Optimization with APDL
Desgn of Structure Optmzaton wth APDL Yanyun School of Cvl Engneerng and Archtecture, East Chna Jaotong Unversty Nanchang 330013 Chna Abstract In ths paper, the desgn process of structure optmzaton wth
More informationYun Bae KIM Jinsoo PARK. Dept. of Systems Management Engineering Sungkyunkwan University Cheon-cheon-dong 300, Jang-an-gu Suwon, KOREA
Proceedngs of the 008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. EW PPROCHES FOR IFERECE OF UOBSERVBLE QUEUES Yun Bae KIM Jnsoo PRK Dept. of Systems
More informationLecture #15 Lecture Notes
Lecture #15 Lecture Notes The ocean water column s very much a 3-D spatal entt and we need to represent that structure n an economcal way to deal wth t n calculatons. We wll dscuss one way to do so, emprcal
More informationAn Entropy-Based Approach to Integrated Information Needs Assessment
Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology
More informationAn Improved Image Segmentation Algorithm Based on the Otsu Method
3th ACIS Internatonal Conference on Software Engneerng, Artfcal Intellgence, Networkng arallel/dstrbuted Computng An Improved Image Segmentaton Algorthm Based on the Otsu Method Mengxng Huang, enjao Yu,
More informationCategories and Subject Descriptors B.7.2 [Integrated Circuits]: Design Aids Verification. General Terms Algorithms
3. Fndng Determnstc Soluton from Underdetermned Equaton: Large-Scale Performance Modelng by Least Angle Regresson Xn L ECE Department, Carnege Mellon Unversty Forbs Avenue, Pttsburgh, PA 3 xnl@ece.cmu.edu
More informationC2 Training: June 8 9, Combining effect sizes across studies. Create a set of independent effect sizes. Introduction to meta-analysis
C2 Tranng: June 8 9, 2010 Introducton to meta-analyss The Campbell Collaboraton www.campbellcollaboraton.org Combnng effect szes across studes Compute effect szes wthn each study Create a set of ndependent
More informationHigh Dimensional Data Clustering
Hgh Dmensonal Data Clusterng Charles Bouveyron 1,2, Stéphane Grard 1, and Cordela Schmd 2 1 LMC-IMAG, BP 53, Unversté Grenoble 1, 38041 Grenoble Cede 9, France charles.bouveyron@mag.fr, stephane.grard@mag.fr
More informationInvestigating the Performance of Naïve- Bayes Classifiers and K- Nearest Neighbor Classifiers
Journal of Convergence Informaton Technology Volume 5, Number 2, Aprl 2010 Investgatng the Performance of Naïve- Bayes Classfers and K- Nearest Neghbor Classfers Mohammed J. Islam *, Q. M. Jonathan Wu,
More informationComplex Numbers. Now we also saw that if a and b were both positive then ab = a b. For a second let s forget that restriction and do the following.
Complex Numbers The last topc n ths secton s not really related to most of what we ve done n ths chapter, although t s somewhat related to the radcals secton as we wll see. We also won t need the materal
More informationKey Variable Based Detection of Sensor Faults in a Power Plant Case
Key Varable Based Detecton of Sensor Faults n a Power Plant Case Rku-Pekka Nkula*, Vlle aukkanen**, Esko Juuso*, Kauko evskä* *Control Engneerng aboratory, P.O.Box 43, FIN-94 nversty of Oulu, Fnland (e-mal:
More information