EXST7034 Regression Techniques Geaghan Logistic regression Diagnostics Page 1

Size: px
Start display at page:

Download "EXST7034 Regression Techniques Geaghan Logistic regression Diagnostics Page 1"

Transcription

1 Logstc regresson Dagnostcs Page 1 1 **********************************************************; 2 *** Logstc Regresson - Dsease outbreak example ***; 3 *** NKNW table 14.3 (Appendx C3) ***; 4 *** Study of a dsease outbreak from a mosquto born ***; 5 *** dsease wthn two sectors of a cty. ***; 6 **********************************************************; 7 8 dm'log;clear;output;clear'; 9 optons nodate nocenter nonumber ps=512 ls=132 nolabel; 10 ODS HTML style=mnmal rs=none body='c:\geaghan\current\exst7034\fall2005\sas\dseaseoutbreak01.html' ; NOTE: Wrtng HTML Body fle: C:\Geaghan\Current\EXST7034\Fall2005\SAS\DseaseOutbreak01.html TITLE1 ''; 13 data Dsease; nfle cards mssover; 14 nput case Age Status1 Status2 sector Dsease; 15 * Status classes are upper (0, 0), Mddle (1, 0) and Lower (0, 1); 16 status = 'Upper '; 17 f status1 eq 1 then status = 'Mddle'; 18 f status2 eq 1 then status = 'Lower'; 19 label case = 'case number' 20 age = 'Patents age' 21 status = 'Socoeconomc status upper, mddle and lower' 22 dsease = 'Dsease present = 1'; 23 Cards; NOTE: The data set WORK.DISEASE has 98 observatons and 7 varables. NOTE: DATA statement used (Total process tme): 0.02 seconds 0.02 seconds 122 ; proc logstc data=dsease DESCENDING alpha=0.05; 125 TITLE2 'Logstc regresson on Dsease data (wth Status1 and Status2)'; 126 model Dsease = Age Status1 Status2 Sector; 127 run; NOTE: PROC LOGISTIC s modelng the probablty that Dsease=1. NOTE: Convergence crteron (GCONV=1E-8) satsfed. NOTE: There were 98 observatons read from the data set WORK.DISEASE. NOTE: The PROCEDURE LOGISTIC prnted page 1. NOTE: PROCEDURE LOGISTIC used (Total process tme): 0.25 seconds 0.04 seconds proc logstc data=dsease DESCENDING alpha=0.05; 130 class status; 131 TITLE2 'Logstc regresson on Dsease data (wth CLASS Status - default)'; 132 model Dsease = Age Status Sector; 133 run; NOTE: PROC LOGISTIC s modelng the probablty that Dsease=1. NOTE: Convergence crteron (GCONV=1E-8) satsfed. NOTE: There were 98 observatons read from the data set WORK.DISEASE. NOTE: The PROCEDURE LOGISTIC prnted page 2. NOTE: PROCEDURE LOGISTIC used (Total process tme): 0.19 seconds 0.03 seconds proc logstc data=dsease DESCENDING alpha=0.05; 136 class status / param = glm; 137 TITLE2 'Logstc regresson on Dsease data (wth CLASS Status - GLM)'; 138 model Dsease = Age Status Sector / lackft RSQ plots; 139 output out=next1 PREDICTED=yhat Lower=lcl95 Upper=ucl95 dfbetas=_all_ 140 resdev=resdev dfdev=dfdev; 141 run; NOTE: PROC LOGISTIC s modelng the probablty that Dsease=1. NOTE: Convergence crteron (GCONV=1E-8) satsfed. NOTE: There were 98 observatons read from the data set WORK.DISEASE. NOTE: The data set WORK.NEXT1 has 98 observatons and 19 varables. NOTE: The PROCEDURE LOGISTIC prnted pages 3-4.

2 Logstc regresson Dagnostcs Page 2 NOTE: PROCEDURE LOGISTIC used (Total process tme): 0.23 seconds 0.10 seconds The three PROC LOGISTIC analyses dffer n only one regard, and all results are the same except the parameter estmates. There are 3 soco-economc levels and many dfferent ways of settng up the dummy varables. Several are explored below. Make sure you know what you are estmatng. The frst verson uses the status dummy varables provded wth the data set where two varables status1 and status2 have the followng values for the 3 soco-economc levels: Upper (0, 0), Mddle (1, 0) and Lower (0, 1). I created a class varable STATUS wth the three levels coded as Lower, Mddle, Upper. In the second PROC LOGISTIC the STATUS varable was placed n the class statement. In the thrd PROC LOGISTIC, for whch a full analyss s provded, the opton / param = glm; was requested on the CLASS statement. Logstc regresson on Dsease data (wth Status1 and Status2) The LOGISTIC Procedure Analyss of Maxmum Lkelhood Estmates Standard Wald Parameter DF Estmate Error Ch-Square Pr > ChSq Intercept Age Status Status sector Odds Rato Estmates Pont 95% Wald Effect EstmateConfdence Lmts Age Status Status sector Logstc regresson on Dsease data (wth CLASS Status - default) The LOGISTIC Procedure Class Level Informaton Desgn Class Value Varables status Lower 1 0 Mddle 0 1 Upper -1-1 Note the codng of the dummy varables, contrastng Lower and Mddle to Upper.

3 Logstc regresson Dagnostcs Page 3 Analyss of Maxmum Lkelhood Estmates Standard Wald Parameter DF Estmate Error Ch-Square Pr > ChSq Intercept <.0001 Age status Lower status Mddle sector Odds Rato Estmates Pont 95% Wald Effect EstmateConfdence Lmts Age status Lower vs Upper status Mddle vs Upper sector Logstc regresson on Dsease data (wth CLASS Status - GLM) The LOGISTIC Procedure Model Informaton Data Set WORK.DISEASE Response Varable Dsease Number of Response Levels 2 Model bnary logt Optmzaton Technque Fsher's scorng Number of Observatons Read 98 Number of Observatons Used 98 Response Profle Ordered Total Value Dsease Frequency Probablty modeled s Dsease=1. Class Level Informaton Class Value Desgn Varables status Lower Mddle Upper Note the GLM-lke codng of the dummy varables. Model Convergence Status Convergence crteron (GCONV=1E-8) satsfed. Model Ft Statstcs Intercept Intercept and Crteron Only Covarates AIC SC Log L R-Square Max-rescaled R-Square

4 Logstc regresson Dagnostcs Page 4 Model ft statstcs 1) Akake Informaton Crteron AIC = 2log( L) + 2 p where Log(L) s the log lkelhood and p s the number of parameters 2) Schwarz Crteron SC = 2log( L) + plog( f j ) 3) 2log L 2 [ Y log ( ˆ ) (1 )log (1 ˆ e π + Y e π ) ] n 1 Ths s analogous to the SSE n regresson and s gven n SAS as the 2 Log L. Two models (full and reduced) can be compared by calculatng the dfference n 2 Log L for both models. Ths dfference follows a ch square dstrbuton wth a d.f. equal to the dfference n d.f. for the two models. 2 n 4) Generalzed R 2 L(0) 1, where L(0) s the ntercept only model. L( θ ) Snce ths value reaches ts maxmum of less than 1 for dscrete models an adjustment has 2 R been proposed. Ths s called the Max-rescaled Rsquare n SAS. 2 R j max Testng Global Null Hypothess: BETA=0 Test Ch-Square DF Pr > ChSq Lkelhood Rato Score Wald All three tests are jont tests of the regresson coeffcents (β ) aganst 0. Lkelhood rato test Ths test compares the -2 Log L values for the model wth an ntercept only model (see Model ft statstcs above). The dfference ( ) follows a ch square dstrbuton wth degrees of freedom equal to p 1, the number of parameters not countng the ntercept. Ths s analogous to the SSE n regresson and s gven n SAS as the -2 Log L. Any two models (full and reduced) can be compared by calculatng the dfference n -2 Log L for both models. Ths dfference follows a ch square dstrbuton wth a d.f. equal to the dfference n d.f. for the two models. Ths test s analogous to the General Lnear Hypothess test for lnear models. Score statstcs These are based on vectors of the partal dervatves of the log lkelhood (wrt the parameter vector) and the matrx of second partal dervatves. It asymptotcally has a ch square dstrbuton. Wald statstcs are based on large sample statstcs and assume asymptotc normalty. Under the usual large sample condtons the logstc regresson maxmum lkelhood estmators are approxmately normally dstrbuted, have no bas and have varances and covarances that are functons of the partal second dervatve (Hessan) of the log lkelhood functon.

5 Logstc regresson Dagnostcs Page 5 Type 3 Analyss of Effects Wald Effect DF Ch-Square Pr > ChSq Age status sector Analyss of Maxmum Lkelhood Estmates Standard Wald Parameter DF Estmate Error Ch-Square Pr > ChSq Intercept Age status Lower status Mddle status Upper sector Wald : used to test ndvdual parameter estmates and to place confdence ntervals. It s based on a large sample assumpton of asymptotc normalty. Ch-square Test β / Var( β ) = [ β / Stderr( β )] 2 2 ( ˆ β 1.96 ˆ ) ( ˆ 1.96 ˆ ) Confdence nterval ( ) S β S β β P β Odds Rato Estmates Pont 95% Wald Effect Estmate Confdence Lmts Age status Lower vs Upper status Mddle vs Upper sector Assocaton of Predcted Probabltes and Observed Responses Percent Concordant 77.5 Somers' D Percent Dscordant 22.1 Gamma Percent Ted 0.3 Tau-a Pars 2077 c e e = 0.95 Assocaton of Predcted Probabltes and Observed Responses Observatons wth dfferent responses are pared and compared. If the one wth the lower observed response has a lower predcted value t s sad to be concordant. Otherwse they are dscordant or ted. SAS reports concordant, dscordant, tes and the number of pars examned. A number of other statstcs are based on the same nformaton of the number concordant (n c ) and the number dscordant (n d ). Where t s the total number of pars wth dfferent responses and N s the sum of observaton frequences n the data then the followng statstcs can be derved. Note that tes are gven by t n c n d. The statstc c s equal to (n c + 0.5(t n c n d ))/t. Somers D s equal to (n c n d )/t. The Goodman-Kruskal Gamma s (n c n d )/(n c +n d ) and Kendall s Tau-a s (n c n d )/(0.5N(N 1)/

6 Logstc regresson Dagnostcs Page 6 Partton for the Hosmer and Lemeshow Test Dsease = 1 Dsease = 0 Group Total Observed Expected Observed Expected Hosmer and Lemeshow Goodness-of-Ft Test Ch-Square DF Pr > ChSq Goodness-of-ft Most Goodness-of-ft tests (Pearson and devance) requre replcaton n subpopulatons. Ths s often a problem wth contnuous varables n the model. The Hosmer-Lemeshow Goodness-of-Ft Test can be used for sparser data. Ths test s only avalable for bnary models. In ths approach the data are sorted on the bass of ther response probablty (default), and dvded nto approxmately 10 groups (mnmum = 3). See SAS help for detals on the groupng. Once n groups a Ch square statstc s calculated. For each group we have S as the observed number of successes n the group, n as the total number of observatons n the group and π as the mean predcted probablty n each group (from the model). For the g groups the Ch square statstc s then calculated as: g 2 2 ( S nπ ) χ = n π (1 π ) = 1 The usual nterpretatons apply to ths lack of ft test. Small values of P would ndcate an nadequate model. Devance The devance n logstc regresson can be calculated as n [ ˆ π ˆ π ] DEV ( X, X,..., X ) = 2 Y log ( ) + (1 Y )log (1 ) 0 1 p 1 e e 1 Partal devances can be calculated for reduced models and the dfference n the two models (wth n p and n p+dff d.f.) should follow a Ch square dstrbuton wth dff d.f. Ths test of partal devances s also called the lkelhood rato test. Devance resduals Resdual analyss s not as smple wth Logstc Regresson as t was wth Lnear Regresson. Snce the dependent varable s 0 or 1 they are not normally dstrbuted and n fact the dstrbuton s not known. As a result resdual analyss n Logstc Regresson s done on Devance resduals. These are calculated as: dev =± { 2[ Y log ( ˆ π ) + (1 Y )log (1 ˆ π )]} e e where the sgn s + f Y ˆ ˆ π and the sgn s - f Y < π. Model devance the SS of these resduals wll sum to the model devance.

7 Logstc regresson Dagnostcs Page 7 Pearson resduals Gven the resdual e ˆ = Y π, the predcted probabltes ˆ π = Y e and r = the number of events n a gven observaton wth n trals, then the Pearson resdual s gven as: r ˆ nπ n ˆ π (1 ˆ π ) Confdence nterval dsplacement dagnostcs SAS provdes two measures of confdence nterval dsplacement as nfluence dagnostcs, C for ndvdual observatons and CBAR for overall change n the parameter estmates when ndvdual observatons are removed. 143 proc sort data=next1 nodupkey; by Age status; run; NOTE: There were 98 observatons read from the data set WORK.NEXT1. NOTE: 25 observatons wth duplcate key values were deleted. NOTE: The data set WORK.NEXT1 has 73 observatons and 19 varables. NOTE: PROCEDURE SORT used (Total process tme): 0.01 seconds 0.02 seconds 144 proc prnt data=next1; 145 TITLE2 'Lstng of one kept value for each by group from Logstc Reg'; 146 run; NOTE: There were 73 observatons read from the data set WORK.NEXT1. NOTE: The PROCEDURE PRINT prnted page 5. NOTE: PROCEDURE PRINT used (Total process tme): 0.11 seconds 0.08 seconds optons ps=56 ls=111; 149 proc sort data=dsease; by Age; run; NOTE: There were 98 observatons read from the data set WORK.DISEASE. NOTE: The data set WORK.DISEASE has 98 observatons and 7 varables. NOTE: PROCEDURE SORT used (Total process tme): 0.01 seconds 0.02 seconds 150 proc sort data=next1; by Age; run; NOTE: Input data set s already sorted, no sortng done. NOTE: PROCEDURE SORT used (Total process tme): 0.00 seconds 0.01 seconds 151 proc means data=dsease noprnt; by Age; var Dsease; 152 output out=next3 n=n mean=mean var=var; run; NOTE: There were 98 observatons read from the data set WORK.DISEASE. NOTE: The data set WORK.NEXT3 has 50 observatons and 6 varables. NOTE: PROCEDURE MEANS used (Total process tme): 0.01 seconds 0.02 seconds data three; set next1 next3; run; NOTE: There were 73 observatons read from the data set WORK.NEXT1. NOTE: There were 50 observatons read from the data set WORK.NEXT3. NOTE: The data set WORK.THREE has 123 observatons and 24 varables. NOTE: DATA statement used (Total process tme): 0.02 seconds 0.03 seconds 155 proc plot data=three; plot yhat*age='x' mean*age='o' / overlay; 156 TITLE2 'Plot of observed means (o) and predcted values (p)'; 157 run; ods html close; NOTE: There were 123 observatons read from the data set WORK.THREE. NOTE: The PROCEDURE PLOT prnted page 6. NOTE: PROCEDURE PLOT used (Total process tme): 0.09 seconds 0.03 seconds NOTE: SAS Insttute Inc., SAS Campus Drve, Cary, NC USA NOTE: The SAS System used: 6.59 seconds 0.82 seconds Influence and resdual dagnostcs are plotted below. There are apparently no generally accepted crtera for evaluaton of these statstcs n Logstc regresson. Your text suggests subjectve vsual assessment of an approprate graphc

8 Logstc regresson Dagnostcs Page 8 Logstc regresson on Dsease data The LOGISTIC Procedure - RESCHI * + P e a * * * r * * * * s * * * * o 1 + * * + n * * * * ** R * * * ** e s d u * ** * ** * **** * * * * * ****** ** a ** *** * ** ** * ** ** * * * * l * * * * * * * * * * * * -1 + * * * * + * * * * * RESDEV D 2 + * * + e v ** * * * * * * a * * * * * n * * * c 1 + * ** + e ** * * * * R e s d u * * ** * **** a * * ** * * * * ** * * * * ** l ** *** * ** ** * * ** * * * * * * * * * ** -1 + * * * * + * * * * * * * * * * H H a t D a ** * * * + g * * * * * o * * * * * n * * * a * * * * * ** * l * * * + * * * * * * * * * * * * * * * * * ** * * * * ** + * ** **** * * ** * * * * * * * * ** **** * * ****** **

9 Logstc regresson Dagnostcs Page 9 Logstc regresson on Dsease data The LOGISTIC Procedure DFBETA I n * + t e r c e p t * + D * * * * * f * * * B e t * * * * * * * a * * * * * * * * + * * * ** * ** ** * **** * * *** ****** ** ** * * * * ** * * * * * * * ** * ** *** * * *** ** ** * DFBETA * + ** * * * * * * ** * A * * * ** * * g * * ** * * ** * * * ** * ** ** * ********* * e ** * * * * ** * * * * * * ** * * * + * * * * * * D * * * * ** * f * * * * B * * * * e * * * t * * + a DFBETA s * * * * * t a t u s * * * **** L * * * o ** *** ** ** * w * ** * * * *** * * * e * * ** * * * * * * + r *** * * ** ** * * * **** * * * ****** ** * * * D * * * * f * * * * * * B * * * * e t * + a

10 Logstc regresson Dagnostcs Page 10 Logstc regresson on Dsease data The LOGISTIC Procedure DFBETA ** * s t a t * ** * u * ** ** * s * * + M ** ** d * *** * * * ** * * * * d l ** * * ** * * * **** * * * * ****** **** + e * * ** * * * * * D * * * * f B * + e * * * * t a * * * * * * * * * * * DFBETA * + s e * ** * * * c * * * * * * + t * * * ** * * o ** *** * * * * * ** ** * * * r * * ** ** * * ** * ** * ** **** * * * ****** ** D f B e t * * * a * ** * + ** * * * * * * * * + ** * * C C o n f d e n c e I n t e r v * + a * * * l * * * D s p l * * ** * * * * a c * * * * * * * e * * * * * * * * m * * * * ** * ** * * * e ** * * * ** ** * ** ** * * * n * ** ** * ** * **** * * * * * ****** ** + t ---- C

11 Logstc regresson Dagnostcs Page 11 Logstc regresson on Dsease data The LOGISTIC Procedure --- C CBAR o n f d e n c e I n t e r v a l ** * * * D * * * s p l * * * a * ** * * * * c * * * * * * e * * * * * * * * m * * * * * ** * ** * * e ** * * ** * ** * ** ** * * * n * ** ** * ** * **** ** * * * * ****** ** + t --- C B - DIFDEV D e l t a 3 + * * + D e v a 2 + * * + n * * * * c * * * * e * * * * * * * * * * * 1 + * * + * * * * * * * * * * * * ** * * * * * * * * ** * *** * * * ** * * * * * * * * * * **** * * * * ****** ** DIFCHISQ * + D e l t a C h S q u 4 + * + a r e * * * * 2 + * * + * * * * * * * * * * * * * * * * * * * * * * ** * * ** **** * * ** * ** * **** ***** * ** ** * * * * 0 + * ** * ** * **** * * * * * ****** **

12 Logstc regresson Dagnostcs Page 12 Lstng of one kept value for each by group from Logstc Reg D D F D F B F D B E B F E T E B T A T E A _ A D T _ s _ F A s t s B _ D t a t E I F a t a T n B t u t A S S D _ t E u s u _ t t s s L r e T s M s s d a a e s t E l u e r A L U e c t t c e a V y c c s c _ o d p c f O a A u u t a t E h l l d e A w d p t d b s g s s o s u L a 9 9 e p g e l e o e s e e 1 2 r e s _ t 5 5 v t e r e r r v Lower Lower Mddle Upper Lower Upper Lower Lower Mddle Upper Lower Lower Mddle Upper Mddle Upper Lower Mddle Mddle Upper Lower Upper Upper Lower Upper Mddle Lower Mddle Upper Lower Lower Mddle Mddle Lower Upper

13 Logstc regresson Dagnostcs Page Upper Upper Mddle Lower Mddle Upper Upper Upper Mddle Upper Lower Mddle Upper Lower Mddle Upper Mddle Upper Mddle Mddle Upper Mddle Lower Lower Upper Upper Lower Upper Mddle Lower Upper Lower Upper Lower Upper Mddle Upper Upper

14 Logstc regresson Dagnostcs Page proc plot data=three; plot yhat*age='x' mean*age='o' / overlay; 156 TITLE2 'Plot of observed means (o) and predcted values (p)'; 157 run; Plot of observed means (o) and predcted values (p) Plot of yhat*age. Symbol used s 'x'. Plot of mean*age. Symbol used s 'o'. yhat o o o o o o o oo o oo o o x x x x x x x x x x x o x x x x xx x x o o oo o o x x x x x x xxx x x x x o o x x x x x x x x x x x x o o x x x x x x o x x xx x x x x x x x x xxx x o xxxxxxx ooooo oo o o ooo o o o o o o o oo oo Age NOTE: 123 obs had mssng values. Logstc regresson how far can you take t? Interactons, polynomals, analyss of covarance. Yes! Of course, the logstc s already a curve, so polynomals wll alter the expected shape, ncludng gvng a quadratc shape to the log-odds dependent varable. Dtto for response surfaces; both of these are feasble wthn the context of logstc regresson. Other optons that work much lke PROC GLM or MIXED are optons for a soluton, NOINT and even STB, a standardzed regresson coeffcent. How about selecton crtera (forward, backward, stepwse)? Yes, they are there n PROC LOGISTIC. T here s also a best subset selecton opton, SELECTION=SCORE. Selecton s based on the score ch-square value. Also avalable are the optons usually assocated wth SAS stepwse applcatons, ncludng the INCLUDE, SLENTRY, SLSTAY, START, STOP and BEST optons. Instead of plottng R 2, Adjusted R 2, Mallow s C p or SSE p or other ads to model selecton, the LOGISTIC equvalent would be AIC p, SBC p, 2 log lkelhood and score statstcs.

y and the total sum of

y and the total sum of Lnear regresson Testng for non-lnearty In analytcal chemstry, lnear regresson s commonly used n the constructon of calbraton functons requred for analytcal technques such as gas chromatography, atomc absorpton

More information

X- Chart Using ANOM Approach

X- Chart Using ANOM Approach ISSN 1684-8403 Journal of Statstcs Volume 17, 010, pp. 3-3 Abstract X- Chart Usng ANOM Approach Gullapall Chakravarth 1 and Chaluvad Venkateswara Rao Control lmts for ndvdual measurements (X) chart are

More information

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap

Empirical Distributions of Parameter Estimates. in Binary Logistic Regression Using Bootstrap Int. Journal of Math. Analyss, Vol. 8, 4, no. 5, 7-7 HIKARI Ltd, www.m-hkar.com http://dx.do.org/.988/jma.4.494 Emprcal Dstrbutons of Parameter Estmates n Bnary Logstc Regresson Usng Bootstrap Anwar Ftranto*

More information

Stat 5100 Handout #14.a SAS: Logistic Regression

Stat 5100 Handout #14.a SAS: Logistic Regression Stat 5100 Handout #14.a SAS: Logistic Regression Example: (Text Table 14.3) Individuals were randomly sampled within two sectors of a city, and checked for presence of disease (here, spread by mosquitoes).

More information

S1 Note. Basis functions.

S1 Note. Basis functions. S1 Note. Bass functons. Contents Types of bass functons...1 The Fourer bass...2 B-splne bass...3 Power and type I error rates wth dfferent numbers of bass functons...4 Table S1. Smulaton results of type

More information

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005

Exercises (Part 4) Introduction to R UCLA/CCPR. John Fox, February 2005 Exercses (Part 4) Introducton to R UCLA/CCPR John Fox, February 2005 1. A challengng problem: Iterated weghted least squares (IWLS) s a standard method of fttng generalzed lnear models to data. As descrbed

More information

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp

Life Tables (Times) Summary. Sample StatFolio: lifetable times.sgp Lfe Tables (Tmes) Summary... 1 Data Input... 2 Analyss Summary... 3 Survval Functon... 5 Log Survval Functon... 6 Cumulatve Hazard Functon... 7 Percentles... 7 Group Comparsons... 8 Summary The Lfe Tables

More information

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics

NAG Fortran Library Chapter Introduction. G10 Smoothing in Statistics Introducton G10 NAG Fortran Lbrary Chapter Introducton G10 Smoothng n Statstcs Contents 1 Scope of the Chapter... 2 2 Background to the Problems... 2 2.1 Smoothng Methods... 2 2.2 Smoothng Splnes and Regresson

More information

Lecture 4: Principal components

Lecture 4: Principal components /3/6 Lecture 4: Prncpal components 3..6 Multvarate lnear regresson MLR s optmal for the estmaton data...but poor for handlng collnear data Covarance matrx s not nvertble (large condton number) Robustness

More information

Unsupervised Learning and Clustering

Unsupervised Learning and Clustering Unsupervsed Learnng and Clusterng Why consder unlabeled samples?. Collectng and labelng large set of samples s costly Gettng recorded speech s free, labelng s tme consumng 2. Classfer could be desgned

More information

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007

Synthesizer 1.0. User s Guide. A Varying Coefficient Meta. nalytic Tool. Z. Krizan Employing Microsoft Excel 2007 Syntheszer 1.0 A Varyng Coeffcent Meta Meta-Analytc nalytc Tool Employng Mcrosoft Excel 007.38.17.5 User s Gude Z. Krzan 009 Table of Contents 1. Introducton and Acknowledgments 3. Operatonal Functons

More information

RStudio for Data Management,

RStudio for Data Management, K23166 2015/1/9 17:35 page 4 #4 Usng R and RStudo for Data Management, Statstcal Analyss and Graphcs Second Edton Ncholas J. Horton Department of Mathematcs and Statstcs Amherst College Massachusetts,

More information

C2 Training: June 8 9, Combining effect sizes across studies. Create a set of independent effect sizes. Introduction to meta-analysis

C2 Training: June 8 9, Combining effect sizes across studies. Create a set of independent effect sizes. Introduction to meta-analysis C2 Tranng: June 8 9, 2010 Introducton to meta-analyss The Campbell Collaboraton www.campbellcollaboraton.org Combnng effect szes across studes Compute effect szes wthn each study Create a set of ndependent

More information

A Statistical Model Selection Strategy Applied to Neural Networks

A Statistical Model Selection Strategy Applied to Neural Networks A Statstcal Model Selecton Strategy Appled to Neural Networks Joaquín Pzarro Elsa Guerrero Pedro L. Galndo joaqun.pzarro@uca.es elsa.guerrero@uca.es pedro.galndo@uca.es Dpto Lenguajes y Sstemas Informátcos

More information

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated.

Some Advanced SPC Tools 1. Cumulative Sum Control (Cusum) Chart For the data shown in Table 9-1, the x chart can be generated. Some Advanced SP Tools 1. umulatve Sum ontrol (usum) hart For the data shown n Table 9-1, the x chart can be generated. However, the shft taken place at sample #21 s not apparent. 92 For ths set samples,

More information

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization

Problem Definitions and Evaluation Criteria for Computational Expensive Optimization Problem efntons and Evaluaton Crtera for Computatonal Expensve Optmzaton B. Lu 1, Q. Chen and Q. Zhang 3, J. J. Lang 4, P. N. Suganthan, B. Y. Qu 6 1 epartment of Computng, Glyndwr Unversty, UK Faclty

More information

Programming in Fortran 90 : 2017/2018

Programming in Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Programmng n Fortran 90 : 2017/2018 Exercse 1 : Evaluaton of functon dependng on nput Wrte a program who evaluate the functon f (x,y) for any two user specfed values

More information

Random Variables and Probability Distributions

Random Variables and Probability Distributions Random Varables and Probablty Dstrbutons Some Prelmnary Informaton Scales on Measurement IE231 - Lecture Notes 5 Mar 14, 2017 Nomnal scale: These are categorcal values that has no relatonshp of order or

More information

Hermite Splines in Lie Groups as Products of Geodesics

Hermite Splines in Lie Groups as Products of Geodesics Hermte Splnes n Le Groups as Products of Geodescs Ethan Eade Updated May 28, 2017 1 Introducton 1.1 Goal Ths document defnes a curve n the Le group G parametrzed by tme and by structural parameters n the

More information

Unsupervised Learning

Unsupervised Learning Pattern Recognton Lecture 8 Outlne Introducton Unsupervsed Learnng Parametrc VS Non-Parametrc Approach Mxture of Denstes Maxmum-Lkelhood Estmates Clusterng Prof. Danel Yeung School of Computer Scence and

More information

Biostatistics 615/815

Biostatistics 615/815 The E-M Algorthm Bostatstcs 615/815 Lecture 17 Last Lecture: The Smplex Method General method for optmzaton Makes few assumptons about functon Crawls towards mnmum Some recommendatons Multple startng ponts

More information

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010

Simulation: Solving Dynamic Models ABE 5646 Week 11 Chapter 2, Spring 2010 Smulaton: Solvng Dynamc Models ABE 5646 Week Chapter 2, Sprng 200 Week Descrpton Readng Materal Mar 5- Mar 9 Evaluatng [Crop] Models Comparng a model wth data - Graphcal, errors - Measures of agreement

More information

Air Transport Demand. Ta-Hui Yang Associate Professor Department of Logistics Management National Kaohsiung First Univ. of Sci. & Tech.

Air Transport Demand. Ta-Hui Yang Associate Professor Department of Logistics Management National Kaohsiung First Univ. of Sci. & Tech. Ar Transport Demand Ta-Hu Yang Assocate Professor Department of Logstcs Management Natonal Kaohsung Frst Unv. of Sc. & Tech. 1 Ar Transport Demand Demand for ar transport between two ctes or two regons

More information

Support Vector Machines

Support Vector Machines /9/207 MIST.6060 Busness Intellgence and Data Mnng What are Support Vector Machnes? Support Vector Machnes Support Vector Machnes (SVMs) are supervsed learnng technques that analyze data and recognze patterns.

More information

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning

Outline. Type of Machine Learning. Examples of Application. Unsupervised Learning Outlne Artfcal Intellgence and ts applcatons Lecture 8 Unsupervsed Learnng Professor Danel Yeung danyeung@eee.org Dr. Patrck Chan patrckchan@eee.org South Chna Unversty of Technology, Chna Introducton

More information

Feature Reduction and Selection

Feature Reduction and Selection Feature Reducton and Selecton Dr. Shuang LIANG School of Software Engneerng TongJ Unversty Fall, 2012 Today s Topcs Introducton Problems of Dmensonalty Feature Reducton Statstc methods Prncpal Components

More information

CS 534: Computer Vision Model Fitting

CS 534: Computer Vision Model Fitting CS 534: Computer Vson Model Fttng Sprng 004 Ahmed Elgammal Dept of Computer Scence CS 534 Model Fttng - 1 Outlnes Model fttng s mportant Least-squares fttng Maxmum lkelhood estmaton MAP estmaton Robust

More information

GWR 3 Software for Geographically Weighted Regression

GWR 3 Software for Geographically Weighted Regression GWR 3 Software for Geographcally Weghted Regresson Martn Charlton Stewart Fotherngham Chrs Brunsdon Spatal Analyss Research Group Department of Geography Unversty of Newcastle upon Tyne Newcastle upon

More information

TN348: Openlab Module - Colocalization

TN348: Openlab Module - Colocalization TN348: Openlab Module - Colocalzaton Topc The Colocalzaton module provdes the faclty to vsualze and quantfy colocalzaton between pars of mages. The Colocalzaton wndow contans a prevew of the two mages

More information

Wishing you all a Total Quality New Year!

Wishing you all a Total Quality New Year! Total Qualty Management and Sx Sgma Post Graduate Program 214-15 Sesson 4 Vnay Kumar Kalakband Assstant Professor Operatons & Systems Area 1 Wshng you all a Total Qualty New Year! Hope you acheve Sx sgma

More information

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation

Solutions to Programming Assignment Five Interpolation and Numerical Differentiation College of Engneerng and Coputer Scence Mechancal Engneerng Departent Mechancal Engneerng 309 Nuercal Analyss of Engneerng Systes Sprng 04 Nuber: 537 Instructor: Larry Caretto Solutons to Prograng Assgnent

More information

Abstract. 2.0 Some Fundamental Concepts of AIC. 1.0 Introduction

Abstract. 2.0 Some Fundamental Concepts of AIC. 1.0 Introduction Model Selecton n Lnear Mxed Effects Models Usng SAS PROC MIXED Long Ngo, Ischema Research, San Francsco ; Unversty of Calforna, Berkeley, CA Rchard Brand, Unversty of Calforna, Berkeley, CA Abstract Although

More information

Smoothing Spline ANOVA for variable screening

Smoothing Spline ANOVA for variable screening Smoothng Splne ANOVA for varable screenng a useful tool for metamodels tranng and mult-objectve optmzaton L. Rcco, E. Rgon, A. Turco Outlne RSM Introducton Possble couplng Test case MOO MOO wth Game Theory

More information

An Entropy-Based Approach to Integrated Information Needs Assessment

An Entropy-Based Approach to Integrated Information Needs Assessment Dstrbuton Statement A: Approved for publc release; dstrbuton s unlmted. An Entropy-Based Approach to ntegrated nformaton Needs Assessment June 8, 2004 Wllam J. Farrell Lockheed Martn Advanced Technology

More information

Help for Time-Resolved Analysis TRI2 version 2.4 P Barber,

Help for Time-Resolved Analysis TRI2 version 2.4 P Barber, Help for Tme-Resolved Analyss TRI2 verson 2.4 P Barber, 22.01.10 Introducton Tme-resolved Analyss (TRA) becomes avalable under the processng menu once you have loaded and selected an mage that contans

More information

Multilevel Analysis with Informative Weights

Multilevel Analysis with Informative Weights Secton on Survey Research Methods JSM 2008 Multlevel Analyss wth Informatve Weghts Frank Jenkns Westat, 650 Research Blvd., Rockvlle, MD 20850 Abstract Multlevel modelng has become common n large scale

More information

Parameter estimation for incomplete bivariate longitudinal data in clinical trials

Parameter estimation for incomplete bivariate longitudinal data in clinical trials Parameter estmaton for ncomplete bvarate longtudnal data n clncal trals Naum M. Khutoryansky Novo Nordsk Pharmaceutcals, Inc., Prnceton, NJ ABSTRACT Bvarate models are useful when analyzng longtudnal data

More information

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009.

Assignment # 2. Farrukh Jabeen Algorithms 510 Assignment #2 Due Date: June 15, 2009. Farrukh Jabeen Algorthms 51 Assgnment #2 Due Date: June 15, 29. Assgnment # 2 Chapter 3 Dscrete Fourer Transforms Implement the FFT for the DFT. Descrbed n sectons 3.1 and 3.2. Delverables: 1. Concse descrpton

More information

Classifier Selection Based on Data Complexity Measures *

Classifier Selection Based on Data Complexity Measures * Classfer Selecton Based on Data Complexty Measures * Edth Hernández-Reyes, J.A. Carrasco-Ochoa, and J.Fco. Martínez-Trndad Natonal Insttute for Astrophyscs, Optcs and Electroncs, Lus Enrque Erro No.1 Sta.

More information

A Bootstrap Approach to Robust Regression

A Bootstrap Approach to Robust Regression Internatonal Journal of Appled Scence and Technology Vol. No. 9; November A Bootstrap Approach to Robust Regresson Dr. Hamadu Dallah Department of Actuaral Scence and Insurance Unversty of Lagos Akoka,

More information

ECONOMICS 452* -- Stata 11 Tutorial 6. Stata 11 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors

ECONOMICS 452* -- Stata 11 Tutorial 6. Stata 11 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors ECONOMICS * -- Stata 11 Tutoral Stata 11 Tutoral TOPIC: Representng Mult-Category Categorcal Varables wth Dummy Varable Regressors DATA: wage1_econ.dta (a Stata-format dataset) TASKS: Stata 11 Tutoral

More information

Improved Methods for Lithography Model Calibration

Improved Methods for Lithography Model Calibration Improved Methods for Lthography Model Calbraton Chrs Mack www.lthoguru.com, Austn, Texas Abstract Lthography models, ncludng rgorous frst prncple models and fast approxmate models used for OPC, requre

More information

Statistics and Data Analysis. Use of the ROC Curve and the Bootstrap in Comparing Weighted Logistic Regression Models

Statistics and Data Analysis. Use of the ROC Curve and the Bootstrap in Comparing Weighted Logistic Regression Models Paper 248-27 Use of the ROC Curve and the Bootstrap n Comparng Weghted Logstc Regresson Models Davd Izrael, Annabella A. Battagla, Davd C. Hoagln, and Mchael P. Battagla, Abt Assocates Inc., Cambrdge,

More information

Performance Evaluation of Information Retrieval Systems

Performance Evaluation of Information Retrieval Systems Why System Evaluaton? Performance Evaluaton of Informaton Retreval Systems Many sldes n ths secton are adapted from Prof. Joydeep Ghosh (UT ECE) who n turn adapted them from Prof. Dk Lee (Unv. of Scence

More information

Mixed Linear System Estimation and Identification

Mixed Linear System Estimation and Identification 48th IEEE Conference on Decson and Control, Shangha, Chna, December 2009 Mxed Lnear System Estmaton and Identfcaton A. Zymns S. Boyd D. Gornevsky Abstract We consder a mxed lnear system model, wth both

More information

The BGLR (Bayesian Generalized Linear Regression) R- Package. Gustavo de los Campos, Amit Pataki & Paulino Pérez. (August- 2013)

The BGLR (Bayesian Generalized Linear Regression) R- Package. Gustavo de los Campos, Amit Pataki & Paulino Pérez. (August- 2013) Bostatstcs Department Bayesan Generalzed Lnear Regresson (BGLR) The BGLR (Bayesan Generalzed Lnear Regresson) R- Package By Gustavo de los Campos, Amt Patak & Paulno Pérez (August- 03) (contact: gdeloscampos@gmal.com

More information

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT

APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT 3. - 5. 5., Brno, Czech Republc, EU APPLICATION OF MULTIVARIATE LOSS FUNCTION FOR ASSESSMENT OF THE QUALITY OF TECHNOLOGICAL PROCESS MANAGEMENT Abstract Josef TOŠENOVSKÝ ) Lenka MONSPORTOVÁ ) Flp TOŠENOVSKÝ

More information

Analysis of Continuous Beams in General

Analysis of Continuous Beams in General Analyss of Contnuous Beams n General Contnuous beams consdered here are prsmatc, rgdly connected to each beam segment and supported at varous ponts along the beam. onts are selected at ponts of support,

More information

Adaptive Regression in SAS/IML

Adaptive Regression in SAS/IML Adaptve Regresson n SAS/IML Davd Katz, Davd Katz Consultng, Ashland, Oregon ABSTRACT Adaptve Regresson algorthms allow the data to select the form of a model n addton to estmatng the parameters. Fredman

More information

Lecture 5: Probability Distributions. Random Variables

Lecture 5: Probability Distributions. Random Variables Lecture 5: Probablty Dstrbutons Random Varables Probablty Dstrbutons Dscrete Random Varables Contnuous Random Varables and ther Dstrbutons Dscrete Jont Dstrbutons Contnuous Jont Dstrbutons Independent

More information

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision

SLAM Summer School 2006 Practical 2: SLAM using Monocular Vision SLAM Summer School 2006 Practcal 2: SLAM usng Monocular Vson Javer Cvera, Unversty of Zaragoza Andrew J. Davson, Imperal College London J.M.M Montel, Unversty of Zaragoza. josemar@unzar.es, jcvera@unzar.es,

More information

5 The Primal-Dual Method

5 The Primal-Dual Method 5 The Prmal-Dual Method Orgnally desgned as a method for solvng lnear programs, where t reduces weghted optmzaton problems to smpler combnatoral ones, the prmal-dual method (PDM) has receved much attenton

More information

Modeling Local Uncertainty accounting for Uncertainty in the Data

Modeling Local Uncertainty accounting for Uncertainty in the Data Modelng Local Uncertanty accontng for Uncertanty n the Data Olena Babak and Clayton V Detsch Consder the problem of estmaton at an nsampled locaton sng srrondng samples The standard approach to ths problem

More information

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms

Course Introduction. Algorithm 8/31/2017. COSC 320 Advanced Data Structures and Algorithms. COSC 320 Advanced Data Structures and Algorithms Course Introducton Course Topcs Exams, abs, Proects A quc loo at a few algorthms 1 Advanced Data Structures and Algorthms Descrpton: We are gong to dscuss algorthm complexty analyss, algorthm desgn technques

More information

Phd Program in Transportation. Transport Demand Modeling. Session 7

Phd Program in Transportation. Transport Demand Modeling. Session 7 Phd Program n Transportaton Transport Demand Modelng Lus Martínez (based on the Lessons of Anabela Rbero TDM2010) Sesson 7 Generalzed Lnear Models Phd n Transportaton / Transport Demand Modellng 1/48 Outlne

More information

3D FCRM MODELING IN MILES PER GALLON OF CAR

3D FCRM MODELING IN MILES PER GALLON OF CAR 3D FCRM Modelng n Mles Per Gallon of Car 3 4 3D FCRM MODELING IN MILES PER GALLON OF CAR Mohd Safullah Rusman Robah Adnan Efend N. Nasbo The new fuzzy c-regresson modelng (FcRM) are wdely used n order

More information

Problem Set 3 Solutions

Problem Set 3 Solutions Introducton to Algorthms October 4, 2002 Massachusetts Insttute of Technology 6046J/18410J Professors Erk Demane and Shaf Goldwasser Handout 14 Problem Set 3 Solutons (Exercses were not to be turned n,

More information

Fusion Performance Model for Distributed Tracking and Classification

Fusion Performance Model for Distributed Tracking and Classification Fuson Performance Model for Dstrbuted rackng and Classfcaton K.C. Chang and Yng Song Dept. of SEOR, School of I&E George Mason Unversty FAIRFAX, VA kchang@gmu.edu Martn Lggns Verdan Systems Dvson, Inc.

More information

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points;

Subspace clustering. Clustering. Fundamental to all clustering techniques is the choice of distance measure between data points; Subspace clusterng Clusterng Fundamental to all clusterng technques s the choce of dstance measure between data ponts; D q ( ) ( ) 2 x x = x x, j k = 1 k jk Squared Eucldean dstance Assumpton: All features

More information

Estimating Regression Coefficients using Weighted Bootstrap with Probability

Estimating Regression Coefficients using Weighted Bootstrap with Probability Norazan M R, Habshah Md, A H M R Imon Estmatng Regresson Coeffcents usng Weghted Bootstrap wth Probablty NORAZAN M R, HABSHAH MIDI AND A H M R IMON Faculty of Computer and Mathematcal Scences, Unversty

More information

The Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding

The Man-hour Estimation Models & Its Comparison of Interim Products Assembly for Shipbuilding Internatonal Journal of Operatons Research Internatonal Journal of Operatons Research Vol., No., 9 4 (005) The Man-hour Estmaton Models & Its Comparson of Interm Products Assembly for Shpbuldng Bn Lu and

More information

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe

CSCI 104 Sorting Algorithms. Mark Redekopp David Kempe CSCI 104 Sortng Algorthms Mark Redekopp Davd Kempe Algorthm Effcency SORTING 2 Sortng If we have an unordered lst, sequental search becomes our only choce If we wll perform a lot of searches t may be benefcal

More information

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour

6.854 Advanced Algorithms Petar Maymounkov Problem Set 11 (November 23, 2005) With: Benjamin Rossman, Oren Weimann, and Pouya Kheradpour 6.854 Advanced Algorthms Petar Maymounkov Problem Set 11 (November 23, 2005) Wth: Benjamn Rossman, Oren Wemann, and Pouya Kheradpour Problem 1. We reduce vertex cover to MAX-SAT wth weghts, such that the

More information

CS1100 Introduction to Programming

CS1100 Introduction to Programming Factoral (n) Recursve Program fact(n) = n*fact(n-) CS00 Introducton to Programmng Recurson and Sortng Madhu Mutyam Department of Computer Scence and Engneerng Indan Insttute of Technology Madras nt fact

More information

Probability Base Classification Technique: A Preliminary Study for Two Groups

Probability Base Classification Technique: A Preliminary Study for Two Groups Mathematcal Theory and Modelng ISSN 4-5804 (Paper) ISSN 5-05 (Onlne) www.ste.org Probablty Base Classfcaton Technque: A Prelmnary Study for Two Groups Frday Znzendoff Okwonu,* Abdul Rahman Othman. Department

More information

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements

2x x l. Module 3: Element Properties Lecture 4: Lagrange and Serendipity Elements Module 3: Element Propertes Lecture : Lagrange and Serendpty Elements 5 In last lecture note, the nterpolaton functons are derved on the bass of assumed polynomal from Pascal s trangle for the fled varable.

More information

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS

NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS ARPN Journal of Engneerng and Appled Scences 006-017 Asan Research Publshng Network (ARPN). All rghts reserved. NUMERICAL SOLVING OPTIMAL CONTROL PROBLEMS BY THE METHOD OF VARIATIONS Igor Grgoryev, Svetlana

More information

Outlier Detection based on Robust Parameter Estimates

Outlier Detection based on Robust Parameter Estimates Outler Detecton based on Robust Parameter Estmates Nor Azlda Aleng 1, Ny Ny Nang, Norzan Mohamed 3 and Kasyp Mokhtar 4 1,3 School of Informatcs and Appled Mathematcs, Unverst Malaysa Terengganu, 1030 Kuala

More information

Positive Semi-definite Programming Localization in Wireless Sensor Networks

Positive Semi-definite Programming Localization in Wireless Sensor Networks Postve Sem-defnte Programmng Localzaton n Wreless Sensor etworks Shengdong Xe 1,, Jn Wang, Aqun Hu 1, Yunl Gu, Jang Xu, 1 School of Informaton Scence and Engneerng, Southeast Unversty, 10096, anjng Computer

More information

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning

A Post Randomization Framework for Privacy-Preserving Bayesian. Network Parameter Learning A Post Randomzaton Framework for Prvacy-Preservng Bayesan Network Parameter Learnng JIANJIE MA K.SIVAKUMAR School Electrcal Engneerng and Computer Scence, Washngton State Unversty Pullman, WA. 9964-75

More information

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z.

TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS. Muradaliyev A.Z. TECHNIQUE OF FORMATION HOMOGENEOUS SAMPLE SAME OBJECTS Muradalyev AZ Azerbajan Scentfc-Research and Desgn-Prospectng Insttute of Energetc AZ1012, Ave HZardab-94 E-mal:aydn_murad@yahoocom Importance of

More information

Statistical analysis on mean rainfall and mean temperature via functional data analysis technique

Statistical analysis on mean rainfall and mean temperature via functional data analysis technique Statstcal analyss on mean ranfall and mean temperature va functonal data analyss technque Jamaludn Suhala Ctaton: AIP Conference Proceedngs 1613, 368 (01); do: 10.1063/1.89361 Vew onlne: http://dx.do.org/10.1063/1.89361

More information

Machine Learning. K-means Algorithm

Machine Learning. K-means Algorithm Macne Learnng CS 6375 --- Sprng 2015 Gaussan Mture Model GMM pectaton Mamzaton M Acknowledgement: some sldes adopted from Crstoper Bsop Vncent Ng. 1 K-means Algortm Specal case of M Goal: represent a data

More information

Econometrics 2. Panel Data Methods. Advanced Panel Data Methods I

Econometrics 2. Panel Data Methods. Advanced Panel Data Methods I Panel Data Methods Econometrcs 2 Advanced Panel Data Methods I Last tme: Panel data concepts and the two-perod case (13.3-4) Unobserved effects model: Tme-nvarant and dosyncratc effects Omted varables

More information

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain

AMath 483/583 Lecture 21 May 13, Notes: Notes: Jacobi iteration. Notes: Jacobi with OpenMP coarse grain AMath 483/583 Lecture 21 May 13, 2011 Today: OpenMP and MPI versons of Jacob teraton Gauss-Sedel and SOR teratve methods Next week: More MPI Debuggng and totalvew GPU computng Read: Class notes and references

More information

Wavefront Reconstructor

Wavefront Reconstructor A Dstrbuted Smplex B-Splne Based Wavefront Reconstructor Coen de Vsser and Mchel Verhaegen 14-12-201212 2012 Delft Unversty of Technology Contents Introducton Wavefront reconstructon usng Smplex B-Splnes

More information

Adjustment methods for differential measurement errors in multimode surveys

Adjustment methods for differential measurement errors in multimode surveys Adjustment methods for dfferental measurement errors n multmode surveys Salah Merad UK Offce for Natonal Statstcs ESSnet MM DCSS, Fnal Meetng Wesbaden, Germany, 4-5 September 2014 Outlne Introducton Stablsng

More information

Parallelism for Nested Loops with Non-uniform and Flow Dependences

Parallelism for Nested Loops with Non-uniform and Flow Dependences Parallelsm for Nested Loops wth Non-unform and Flow Dependences Sam-Jn Jeong Dept. of Informaton & Communcaton Engneerng, Cheonan Unversty, 5, Anseo-dong, Cheonan, Chungnam, 330-80, Korea. seong@cheonan.ac.kr

More information

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS

A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Proceedngs of the Wnter Smulaton Conference M E Kuhl, N M Steger, F B Armstrong, and J A Jones, eds A MOVING MESH APPROACH FOR SIMULATION BUDGET ALLOCATION ON CONTINUOUS DOMAINS Mark W Brantley Chun-Hung

More information

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide

Lobachevsky State University of Nizhni Novgorod. Polyhedron. Quick Start Guide Lobachevsky State Unversty of Nzhn Novgorod Polyhedron Quck Start Gude Nzhn Novgorod 2016 Contents Specfcaton of Polyhedron software... 3 Theoretcal background... 4 1. Interface of Polyhedron... 6 1.1.

More information

Analysis of Malaysian Wind Direction Data Using ORIANA

Analysis of Malaysian Wind Direction Data Using ORIANA Modern Appled Scence March, 29 Analyss of Malaysan Wnd Drecton Data Usng ORIANA St Fatmah Hassan (Correspondng author) Centre for Foundaton Studes n Scence Unversty of Malaya, 63 Kuala Lumpur, Malaysa

More information

A Semi-parametric Regression Model to Estimate Variability of NO 2

A Semi-parametric Regression Model to Estimate Variability of NO 2 Envronment and Polluton; Vol. 2, No. 1; 2013 ISSN 1927-0909 E-ISSN 1927-0917 Publshed by Canadan Center of Scence and Educaton A Sem-parametrc Regresson Model to Estmate Varablty of NO 2 Meczysław Szyszkowcz

More information

Nonlinear Mixed Model Methods and Prediction Procedures Demonstrated on a Volume-Age Model

Nonlinear Mixed Model Methods and Prediction Procedures Demonstrated on a Volume-Age Model Nonlnear Mxed Model Methods and Predcton Procedures Demonstrated on a Volume-Age Model Prepared by: Shongmng Huang, Shawn X. Meng, Yuqng Yang August, 3 Techncal Report Pub. No.: T/6(3) ISBN No. 978--46-755-3

More information

Anonymisation of Public Use Data Sets

Anonymisation of Public Use Data Sets Anonymsaton of Publc Use Data Sets Methods for Reducng Dsclosure Rsk and the Analyss of Perturbed Data Harvey Goldsten Unversty of Brstol and Unversty College London and Natale Shlomo Unversty of Manchester

More information

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status

Implementation Naïve Bayes Algorithm for Student Classification Based on Graduation Status Internatonal Journal of Appled Busness and Informaton Systems ISSN: 2597-8993 Vol 1, No 2, September 2017, pp. 6-12 6 Implementaton Naïve Bayes Algorthm for Student Classfcaton Based on Graduaton Status

More information

Determining the Optimal Bandwidth Based on Multi-criterion Fusion

Determining the Optimal Bandwidth Based on Multi-criterion Fusion Proceedngs of 01 4th Internatonal Conference on Machne Learnng and Computng IPCSIT vol. 5 (01) (01) IACSIT Press, Sngapore Determnng the Optmal Bandwdth Based on Mult-crteron Fuson Ha-L Lang 1+, Xan-Mn

More information

Optimizing Document Scoring for Query Retrieval

Optimizing Document Scoring for Query Retrieval Optmzng Document Scorng for Query Retreval Brent Ellwen baellwe@cs.stanford.edu Abstract The goal of ths project was to automate the process of tunng a document query engne. Specfcally, I used machne learnng

More information

The official electronic file of this thesis or dissertation is maintained by the University Libraries on behalf of The Graduate School at Stony Brook

The official electronic file of this thesis or dissertation is maintained by the University Libraries on behalf of The Graduate School at Stony Brook Stony Brook Unversty The offcal electronc fle of ths thess or dssertaton s mantaned by the Unversty Lbrares on behalf of The Graduate School at Stony Brook Unversty. Alll Rgghht tss Reesseerrvveedd bbyy

More information

Machine Learning: Algorithms and Applications

Machine Learning: Algorithms and Applications 14/05/1 Machne Learnng: Algorthms and Applcatons Florano Zn Free Unversty of Bozen-Bolzano Faculty of Computer Scence Academc Year 011-01 Lecture 10: 14 May 01 Unsupervsed Learnng cont Sldes courtesy of

More information

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach

Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-Textual Document Images: A Novel Approach Angle Estmaton and Correcton of Hand Wrtten, Textual and Large areas of Non-Textual Document Images: A Novel Approach D.R.Ramesh Babu Pyush M Kumat Mahesh D Dhannawat PES Insttute of Technology Research

More information

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr)

Helsinki University Of Technology, Systems Analysis Laboratory Mat Independent research projects in applied mathematics (3 cr) Helsnk Unversty Of Technology, Systems Analyss Laboratory Mat-2.08 Independent research projects n appled mathematcs (3 cr) "! #$&% Antt Laukkanen 506 R ajlaukka@cc.hut.f 2 Introducton...3 2 Multattrbute

More information

Stata 12/13 Tutorial 4

Stata 12/13 Tutorial 4 Stata /3 Tutoral 4 TOPIC: Condtonal and Margnal Effects of Contnuous Explanatory Varables n Lnear Regresson Models DATA: auto.dta (a Stata-format dataset frst created n Stata /3 Tutoral ) TASKS: Stata

More information

The Codesign Challenge

The Codesign Challenge ECE 4530 Codesgn Challenge Fall 2007 Hardware/Software Codesgn The Codesgn Challenge Objectves In the codesgn challenge, your task s to accelerate a gven software reference mplementaton as fast as possble.

More information

FITTING A CHI -square CURVE TO AN OBSERVI:D FREQUENCY DISTRIBUTION By w. T Federer BU-14-M Jan. 17, 1951

FITTING A CHI -square CURVE TO AN OBSERVI:D FREQUENCY DISTRIBUTION By w. T Federer BU-14-M Jan. 17, 1951 FTTNG A CH -square CURVE TO AN OBSERV:D FREQUENCY DSTRBUTON By w. T Federer BU-4-M Jan. 7, 95 Textbooks n statstcs (for example, Johnson, Statstcal Methods n Research; Love, Applcaton of Statstcal Methods

More information

Cluster-Based Profile Monitoring in Phase I Analysis. Yajuan Chen. Doctor of Philosophy In Statistics

Cluster-Based Profile Monitoring in Phase I Analysis. Yajuan Chen. Doctor of Philosophy In Statistics Cluster-Based Profle Montorng n Phase I Analyss Yajuan Chen Dssertaton submtted to the faculty of the Vrgna Polytechnc Insttute and State Unversty n partal fulfllment of the requrements for the degree

More information

Very simple computational domains can be discretized using boundary-fitted structured meshes (also called grids)

Very simple computational domains can be discretized using boundary-fitted structured meshes (also called grids) Structured meshes Very smple computatonal domans can be dscretzed usng boundary-ftted structured meshes (also called grds) The grd lnes of a Cartesan mesh are parallel to one another Structured meshes

More information

EXTENDED BIC CRITERION FOR MODEL SELECTION

EXTENDED BIC CRITERION FOR MODEL SELECTION IDIAP RESEARCH REPORT EXTEDED BIC CRITERIO FOR ODEL SELECTIO Itshak Lapdot Andrew orrs IDIAP-RR-0-4 Dalle olle Insttute for Perceptual Artfcal Intellgence P.O.Box 59 artgny Valas Swtzerland phone +4 7

More information

CSE 326: Data Structures Quicksort Comparison Sorting Bound

CSE 326: Data Structures Quicksort Comparison Sorting Bound CSE 326: Data Structures Qucksort Comparson Sortng Bound Steve Setz Wnter 2009 Qucksort Qucksort uses a dvde and conquer strategy, but does not requre the O(N) extra space that MergeSort does. Here s the

More information

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur

FEATURE EXTRACTION. Dr. K.Vijayarekha. Associate Dean School of Electrical and Electronics Engineering SASTRA University, Thanjavur FEATURE EXTRACTION Dr. K.Vjayarekha Assocate Dean School of Electrcal and Electroncs Engneerng SASTRA Unversty, Thanjavur613 41 Jont Intatve of IITs and IISc Funded by MHRD Page 1 of 8 Table of Contents

More information

ECONOMICS 452* -- Stata 12 Tutorial 6. Stata 12 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors

ECONOMICS 452* -- Stata 12 Tutorial 6. Stata 12 Tutorial 6. TOPIC: Representing Multi-Category Categorical Variables with Dummy Variable Regressors ECONOMICS 45* -- Stata 1 Tutoral 6 Stata 1 Tutoral 6 TOPIC: Representng Mult-Category Categorcal Varables wth Dummy Varable Regressors DATA: wage1_econ45.dta (a Stata-format dataset) TASKS: Stata 1 Tutoral

More information