The ctest Package. January 3, 2000

Size: px
Start display at page:

Download "The ctest Package. January 3, 2000"

Transcription

1 R objects documented: The ctest Package January 3, 2000 bartlett.test binom.test cor.test fisher.test friedman.test kruskal.test ks.test mantelhaen.test mcnemar.test var.test wilco.test mood.test ansari.test shapiro.test bartlett.test Bartlett Test for Homogeneity of Variances bartlett.test bartlett.test performs Bartlett s test of the null that the variances in each of the groups (samples) are the same. bartlett.test(, g) g a numeric vector of data values, or a list of numeric data vectors. a vector or factor object giving the group for the corresponding elements of. Ignored if is a list. 1

2 2 binom.test If is a list, its elements are taken as the samples to be compared for homogeneity of variances, and hence have to be numeric data vectors. In this case, g is ignored, and one can simply use bartlett.test() to perform the test. If the samples are not yet contained in a list, use bartlett.test(list(,...)). Otherwise, must be a numeric data vector, and g must be a vector or factor object of the same length as giving the group for the corresponding elements of. A list of class "htest" containing the following components: parameter Bartlett s K-square test. the degrees of freedom of the approimate chi-square distribution of the test. the string "Bartlett test for homogeneity of variances". binom.test Eact Binomial Test binom.test binom.test performs an eact test of the null that the probability of success in a Bernoulli eperiment of length n is p, based on the number of successes observed. binom.test(, n, p = 0.5, = "two.sided") n p number of successes. number of trials. probability of success. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. the number of successes,. parameter the number of trials, n. null.value the probability of success under the null, p. a character string describing the hypothesis. the string "Eact binomial test".

3 cor.test 3 cor.test Test for Zero Correlation cor.test cor.test tests the null that and y are uncorrelated (independent). cor.test(, y, = "two.sided", = "pearson", eact = NULL), y numeric vectors of data values. and y must have the same length. eact indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. a string indicating which correlation coefficient is used for the test. Must be one of "pearson", "kendall", or "spearman". Only the first character is necessary. a logical indicating whether an eact p-value should be computed. If is "pearson", the test is based on Pearson s product moment correlation coefficient cor(, y) and follows a t distribution with length()-2 degrees of freedom. If is "kendall" or "spearman", Kendall s tau or Spearman s rho, respectively, are used to estimate the correlation. These tests should be used if the data do not necessarily come from a bivariate normal distribution. For Kendall s test, by default (if eact is not specified), an eact p-value is computed if both samples contain less than 50 finite values and there are no ties. Otherwise, the standardized estimate is used as the test, and is approimately normally distributed. For Spearman s test, p-values are computed using algorithm AS 89. parameter estimate the value of the test. the degrees of freedom of the test in the case that it follows a t distribution. the estimated correlation coefficient, with names attribute "cor", "tau", or "rho", correspoding to the employed. null.value the value of the correlation coefficient under the null hypothesis, hence 0. a character string describing the hypothesis. a string indicating how the correlation was estimated

4 4 fisher.test References D. J. Best & D. E. Roberts (1975), Algorithm AS 89: The Upper Tail Probabilities of Spearman s ρ. Applied Statistics, 24, fisher.test Fisher s Eact Test for Count Data fisher.test fisher.test performs Fisher s eact test for testing the null of independence of rows and columns in a contingency table with fied marginals. fisher.test(, y, = "two.sided", workspace = , hybrid = FALSE) y workspace hybrid either a two-dimensional contingency table in matri form, or a factor object. a factor object; ignored if is a matri. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. Only used in the 2 by 2 case. an integer specifying the size of the workspace used in the network algorithm. a logical indicating whether the eact probabilities (default) or a hybrid approimation thereof should be computed. In the hybrid case, asymptotic chi-square probabilies are only used provided that the Cochran conditions are satisfied. If is a matri, it is taken as a two-dimensional contingency table, and hence its entries should be nonnegative integers. Otherwise, both and y must be vectors of the same length. Incomplete cases are removed, the vectors are coerced into factor objects, and the contingency table is computed from these. In the one-sided 2 by 2 cases, p-values are obtained directly using the hypergeometric distribution. Otherwise, computations are based on a C version of the FORTRAN subroutine FEXACT which implements the network developed by Mehta and Patel (1986) and improved by Clarkson, Fan & Joe (1993). The FORTRAN code can be obtained from a character string describing the hypothesis. the string "Fisher s Eact Test for Count Data".

5 friedman.test 5 References Cyrus R. Mehta & Nitin R. Patel (1986). Algorithm 643. FEXACT: A Fortran subroutine for Fisher s eact test on unordered r*c contingency tables. ACM Transactions on Mathematical Software, 12, Douglas B. Clarkson, Yuan-an Fan & Harry Joe (1993). A Remark on Algorithm 643: FEXACT: An Algorithm for Performing Fisher s Eact Test in r c Contingency Tables. ACM Transactions on Mathematical Software, 19, friedman.test Friedman Rank Sum Test friedman.test Performs a Friedman rank sum test with unreplicated blocked data. friedman.test(y, groups, blocks) y groups blocks either a numeric vector of data values, or a data matri. a vector giving the group for the corresponding elements of y if this is a vector; ignored if y is a matri. If not a factor object, it is coerced to one. a vector giving the block for the corresponding elements of y if this is a vector; ignored if y is a matri. If not a factor object, it is coerced to one. friedman.test can be used for analyzing unreplicated complete block designs (i.e., there is eactly one observation in y for each combination of levels of groups and blocks) where the normality assumption may be violated. The null hypothesis is that apart from an effect of blocks, the location parameter of y is the same in each of the groups. If y is a matri, groups and blocks are obtained from the column and row indices, respectively. NA s are not allowed in groups or blocks; if y contains NA s, corresponding blocks are removed. parameter the value of Friedman s chi-square. the degrees of freedom of the approimate chi-square distribution of the test. the string "Friedman rank sum test".

6 6 kruskal.test kruskal.test Kruskal-Wallis Rank Sum Test kruskal.test Performs a Kruskal-Wallis rank sum test. kruskal.test(, g) g a numeric vector of data values, or a list of numeric data vectors. a vector or factor object giving the group for the corresponding elements of. Ignored if is a list. kruskal.test performs a Kruskal-Wallis rank sum test of the null that the location parameters of the distribution of are the same in each group (sample). The is that they differ in at least one. If is a list, its elements are taken as the samples to be compared, and hence have to be numeric data vectors. In this case, g is ignored, and one can simply use kruskal.test() to perform the test. If the samples are not yet contained in a list, use kruskal.test(list(,...)). Otherwise, must be a numeric data vector, and g must be a vector or factor object of the same length as giving the group for the corresponding elements of. parameter the Kruskal-Wallis rank sum. the degrees of freedom of the approimate chi-square distribution of the test. the string "Kruskal-Wallis rank sum test".

7 ks.test 7 ks.test Kolmogorov-Smirnov Tests ks.test Performs one or two sample Kolmogorov-Smirnov tests. ks.test(, y,..., = "two.sided") y a numeric vector of data values. either a numeric vector of data values, or a character string naming a distribution function.... parameters of the distribution specified by y. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. If y is numeric, a two sample test of the null that and y were drawn from the same distribution is performed. Alternatively, y can be a character string naming a distribution function. In this case, a one sample test of the null that the distribution function underlying is y with parameters specified by... is carried out. the value of the test. a character string describing the hypothesis. a character string indicating what type of test was performed. a character string giving the name(s) of the data. Eamples <- rnorm(50) y <- runif(30) # Do and y come from the same distribution? ks.test(, y) # Does come from a gamma distribution with shape 3 and scale 2? ks.test(, "pgamma", 3, 2)

8 8 mantelhaen.test mantelhaen.test Mantel-Haenszel Chi-Square Test for Count Data mantelhaen.test mantelhaen.test performs a Mantel-Haenszel chi-square test of the null that and y are conditionally independent in each stratum. mantelhaen.test(, y = NULL, z = NULL, correct = TRUE) y z correct either an array of dimension 2 by 2 by s, where s is the number of strata, or a dichotomous factor object. a dichotomous factor object; ignored if is an array. a factor object idenitifying to which stratum the corresponding elements in and y belong; ignored if is an array. a logical indicating whether to apply continuity correction when computing the test. If is an array, it must be of dimension 2 by 2 by s, and the entries should be nonnegative integers. NA s are not allowed. Otherwise,, y and z must have the same length. Triples containing NA s are removed. Both and y must be dichotomous (take eactly 2 values). parameter the Mantel-Haenszel chi-square. always 1, the degrees of freedom of the approimate chi-square distribution of the test. a string indicating the employed, and whether or not continuity correction was used.

9 mcnemar.test 9 mcnemar.test McNemar s Chi-square Test for Count Data mcnemar.test Performs McNemar s chi-square test for symmetry of rows and columns in a two-dimensional contingency table. mcnemar.test(, y = NULL, correct = TRUE) y correct either a two-dimensional contingency table in matri form, or a factor object. a factor object; ignored if is a matri. a logical indicating whether to apply continuity correction when computing the test. The null is that the probabilities of being classified into cells [i,j] and [j,i] are the same. If is a matri, it is taken as a two-dimensional contingency table, and hence its entries should be nonnegative integers. Otherwise, both and y must be vectors of the same length. Incomplete cases are removed, the vectors are coerced into factor objects, and the contingency table is computed from these. Continuity correction is only used in the 2-by-2 case if correct is TRUE. parameter the value of McNemar s. the degrees of freedom of the approimate chi-square distribution of the test. a character string indicating the type of test performed, and whether continuity correction was used. a character string giving the name(s) of the data.

10 10 var.test var.test F Test to Compare Two Variances var.test Performs an F test to compare the variances of two samples from normal populations. var.test(, y, ratio = 1, = "two.sided", conf.level = 0.95), y numeric vectors of data values. ratio the hypothesized ratio of the population variances of and y. conf.level indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. confidence level for the returned confidence interval. The null hypothesis is that the ratio of the variances of the populations from which and y were drawn is equal to ratio. the value of the F test. parameter the degrees of the freedom of the F distribtion of the test. conf.int a confidence interval for the ratio of the population variances. estimate the ratio of the sample variances of and y. null.value the ratio of population variances under the null. a character string describing the hypothesis. the string "F test to compare two variances".

11 wilco.test 11 wilco.test Wilcoon Rank Sum and Signed Rank Tests wilco.test Performs one and two sample Wilcoon tests on vectors of data. wilco.test(, y = NULL, = "two.sided", mu = 0, paired = FALSE, eact = NULL, correct = TRUE) y mu paired eact correct numeric vector of data values. an optional numeric vector of data values. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. a number specifying an optional location parameter. a logical indicating whether you want a paired test. a logical indicating whether an eact p-value should be computed. a logical indicating whether to apply continuity correction in the normal approimation for the p-value. If only is given, or if both and y are given and paired is TRUE, a Wilcoon signed rank test of the null that the median of (in the one sample case) or of -y (in the paired two sample case) equals mu is performed. Otherwise, if both and y are given and paired is FALSE, a Wilcoon rank sum test (equivalent to the Mann-Whitney test) is carried out. In this case, the null hypothesis is that the location of the distributions of and y differ by mu. By default (if eact is not specified), an eact p-value is computed if the samples contain less than 50 finite values and there are no ties. Otherwise, a normal approimation is used. parameter null.value the value of the test with a name describing it. the parameter(s) for the eact distribution of the test. Currently, only normal approimations are used. the p-value for the test. the location parameter mu. a character string describing the hypothesis. the type of test applied.

12 12 ansari.test mood.test Mood Two-Sample Test of Scale mood.test Performs Mood s two-sample test of scale. mood.test(, y, = "two.sided"), y numeric vectors of data values. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. The underlying model is that the two samples are drawn from f( l) and f(( l)/s)/s, respectively, where l is a common location parameter and s is a scale parameter. The null hypothesis is s = 1. There are more useful tests for this problem. the value of the test. a character string describing the hypothesis. the string "Mood two-sample test of scale". ansari.test Ansari-Bradley Test ansari.test Performs the Ansari-Bradley test for a difference in scale parameters. ansari.test(, y, = "two.sided", eact = NULL)

13 ansari.test 13 y eact numeric vector of data values. numeric vector of data values. indicates the hypothesis and must be one of "two.sided", "greater" or "less". You can specify just the initial letter. a logical indicating whether an eact p-value should be computed. Suppose that and y are independent samples from distributions with densities f((t m)/s)/s and f(t m), respectively, where m is an unknown nuisance parameter and s is the parameter of interest. The Ansari-Bradley test is used for testing the null that s equals 1, the two-sided being that s! = 1 (the distributions differ only in variance), and the one-sided s being s > 1 (the distribution underlying has a larger variance, "greater") or s < 1 ("less"). By default (if eact is not specified), an eact p-value is computed if both samples contain less than 50 finite values and there are no ties. Otherwise, a normal approimation is used. the value of the Ansari-Bradley test. a character string describing the hypothesis. the string "Ansari-Bradley test". References Myles Hollander & Douglas A. Wolfe (1973), Nonparametric al inference. New York: John Wiley & Sons. Eamples ## Hollander & Wolfe (1973, p. 86f): ## Serum iron determination using Hyland control sera ramsay <- c(111, 107, 100, 99, 102, 106, 109, 108, 104, 99, 101, 96, 97, 102, 107, 113, 116, 113, 110, 98) jung.parekh <- c(107, 108, 106, 98, 105, 103, 110, 105, 104, 100, 96, 108, 103, 104, 114, 114, 113, 108, 106, 99) ansari.test(ramsay, jung.parekh)

14 14 shapiro.test shapiro.test Shapiro-Wilk Normality Test shapiro.test Performs the Shapiro-Wilk test for normality. shapiro.test() a numeric vector of data values, the number of which must be between 3 and Missing values are allowed. the value of the Shapiro-Wilk. the p-value for the test. the string "Shapiro-Wilk normality test". a character string giving the name(s) of the data. References Patrick Royston (1982), An Etension of Shapiro and Wilk s W Test for Normality to Large Samples. Applied Statistics, 31, Patrick Royston (1982), Algorithm AS 181: The W Test for Normality. Applied Statistics, 31, Patrick Royston (1995), A Remark on Algorithm AS 181: The W Test for Normality. Applied Statistics, 44, See Also qqnorm for producing a normal quantile-quantile plot. Eamples shapiro.test(rnorm(100, mean = 5, sd = 3)) shapiro.test(runif(100, min = 2, ma = 4))

15 Inde Topic htest ansari.test, 12 bartlett.test, 1 binom.test, 2 cor.test, 2 fisher.test, 4 friedman.test, 5 kruskal.test, 6 ks.test, 7 mantelhaen.test, 8 mcnemar.test, 9 mood.test, 12 shapiro.test, 14 var.test, 10 wilco.test, 11 ansari.test, 12 bartlett.test, 1 binom.test, 2 cor.test, 2 fisher.test, 4 friedman.test, 5 kruskal.test, 6 ks.test, 7 mantelhaen.test, 8 mcnemar.test, 9 mood.test, 12 qqnorm, 14 shapiro.test, 14 var.test, 10 wilco.test, 11 15

Interval Estimation. The data set belongs to the MASS package, which has to be pre-loaded into the R workspace prior to use.

Interval Estimation. The data set belongs to the MASS package, which has to be pre-loaded into the R workspace prior to use. Interval Estimation It is a common requirement to efficiently estimate population parameters based on simple random sample data. In the R tutorials of this section, we demonstrate how to compute the estimates.

More information

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113

Index. Bar charts, 106 bartlett.test function, 159 Bottles dataset, 69 Box plots, 113 Index A Add-on packages information page, 186 187 Linux users, 191 Mac users, 189 mirror sites, 185 Windows users, 187 aggregate function, 62 Analysis of variance (ANOVA), 152 anova function, 152 as.data.frame

More information

FreeJSTAT for Windows. Manual

FreeJSTAT for Windows. Manual FreeJSTAT for Windows Manual (c) Copyright Masato Sato, 1998-2018 1 Table of Contents 1. Introduction 3 2. Functions List 6 3. Data Input / Output 7 4. Summary Statistics 8 5. t-test 9 6. ANOVA 10 7. Contingency

More information

Nuts and Bolts Research Methods Symposium

Nuts and Bolts Research Methods Symposium Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Topics to Discuss: Types of Variables Constructing a Variable Code Book Developing Excel Spreadsheets

More information

Product Catalog. AcaStat. Software

Product Catalog. AcaStat. Software Product Catalog AcaStat Software AcaStat AcaStat is an inexpensive and easy-to-use data analysis tool. Easily create data files or import data from spreadsheets or delimited text files. Run crosstabulations,

More information

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013

Organizing Your Data. Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Organizing Your Data Jenny Holcombe, PhD UT College of Medicine Nuts & Bolts Conference August 16, 3013 Learning Objectives Identify Different Types of Variables Appropriately Naming Variables Constructing

More information

Data Management - Summary statistics - Graphics Choose a dataset to work on, maybe use it already

Data Management - Summary statistics - Graphics Choose a dataset to work on, maybe use it already Exercises Day 1 Data Management - Summary statistics - Graphics Choose a dataset to work on, maybe use it already Day 2 Inference introduction R Data Management - Summary statistics - Graphics Days 3 and

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

Eksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert)

Eksamen ERN4110, 6/ VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) Eksamen ERN4110, 6/9-2018 VEDLEGG SPSS utskrifter til oppgavene (Av plasshensyn kan utskriftene være noe redigert) 1 Oppgave 1 Datafila I SPSS: Variabelnavn Beskrivelse Kjønn Kjønn (1=Kvinne, 2=Mann) Studieinteresse

More information

Why is Statistics important in Bioinformatics?

Why is Statistics important in Bioinformatics? Why is Statistics important in Bioinformatics? Random processes are inherent in evolution and in sampling (data collection). Errors are often unavoidable in the data collection process. Statistics helps

More information

Basic Statistical Methods

Basic Statistical Methods Basic Statistical Methods Lecture 9 Nicholas Christian BIOST 2094 Spring 2011 Outline 1. Summary Statistics 2. Comparing Means 3. Comparing Variances 4. Comparing Proportions 5. Testing Normality 6. Testing

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

STATS PAD USER MANUAL

STATS PAD USER MANUAL STATS PAD USER MANUAL For Version 2.0 Manual Version 2.0 1 Table of Contents Basic Navigation! 3 Settings! 7 Entering Data! 7 Sharing Data! 8 Managing Files! 10 Running Tests! 11 Interpreting Output! 11

More information

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi

SPSS: AN OVERVIEW. V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi SPSS: AN OVERVIEW V.K. Bhatia Indian Agricultural Statistics Research Institute, New Delhi-110012 The abbreviation SPSS stands for Statistical Package for the Social Sciences and is a comprehensive system

More information

for statistical analyses

for statistical analyses Using for statistical analyses Robert Bauer Warnemünde, 05/16/2012 Day 6 - Agenda: non-parametric alternatives to t-test and ANOVA (incl. post hoc tests) Wilcoxon Rank Sum/Mann-Whitney U-Test Kruskal-Wallis

More information

Package npcp. August 23, 2017

Package npcp. August 23, 2017 Type Package Package np August 23, 2017 Title Some Nonparametric CUSUM Tests for Change-Point Detection in Possibly Multivariate Observations Version 0.1-9 Date 2017-08-23 Author Ivan Kojadinovic Maintainer

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2012 http://ce.sharif.edu/courses/90-91/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Package CBCgrps. R topics documented: July 27, 2018

Package CBCgrps. R topics documented: July 27, 2018 Package CBCgrps July 27, 2018 Type Package Title Compare Baseline Characteristics Between Groups Version 2.3 Date 2018-07-27 Author Zhongheng Zhang, Sir Run-Run Shaw hospital, Zhejiang university school

More information

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics:

In this computer exercise we will work with the analysis of variance in R. We ll take a look at the following topics: UPPSALA UNIVERSITY Department of Mathematics Måns Thulin, thulin@math.uu.se Analysis of regression and variance Fall 2011 COMPUTER EXERCISE 2: One-way ANOVA In this computer exercise we will work with

More information

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel

Research Methods for Business and Management. Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel Research Methods for Business and Management Session 8a- Analyzing Quantitative Data- using SPSS 16 Andre Samuel A Simple Example- Gym Purpose of Questionnaire- to determine the participants involvement

More information

SPSS Modules Features

SPSS Modules Features SPSS Modules Features Core System Functionality (included in every license) Data access and management Data Prep features: Define Variable properties tool; copy data properties tool, Visual Bander, Identify

More information

PITMAN Randomization Tests Version 5.00S (c) "One of many STATOOLS(tm)..." by. Gerard E. Dallal 54 High Plain Road Andover, MA 01810

PITMAN Randomization Tests Version 5.00S (c) One of many STATOOLS(tm)... by. Gerard E. Dallal 54 High Plain Road Andover, MA 01810 PITMAN Randomization Tests Version 5.00S (c) 1985-1991 "One of many STATOOLS(tm)..." by Gerard E. Dallal 54 High Plain Road Andover, MA 01810 PITMAN performs one- and two-sample exact randomization tests.

More information

Fathom Dynamic Data TM Version 2 Specifications

Fathom Dynamic Data TM Version 2 Specifications Data Sources Fathom Dynamic Data TM Version 2 Specifications Use data from one of the many sample documents that come with Fathom. Enter your own data by typing into a case table. Paste data from other

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2013 http://ce.sharif.edu/courses/91-92/2/ce725-1/ Agenda Features and Patterns The Curse of Size and

More information

Statistical Methods for the Analysis of Repeated Measurements

Statistical Methods for the Analysis of Repeated Measurements Charles S. Davis Statistical Methods for the Analysis of Repeated Measurements With 20 Illustrations #j Springer Contents Preface List of Tables List of Figures v xv xxiii 1 Introduction 1 1.1 Repeated

More information

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research

Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Want to Do a Better Job? - Select Appropriate Statistical Analysis in Healthcare Research Liping Huang, Center for Home Care Policy and Research, Visiting Nurse Service of New York, NY, NY ABSTRACT The

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Pattern Recognition Features and Feature Selection Hamid R. Rabiee Jafar Muhammadi Spring 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Agenda Features and Patterns The Curse of Size and

More information

Pair-Wise Multiple Comparisons (Simulation)

Pair-Wise Multiple Comparisons (Simulation) Chapter 580 Pair-Wise Multiple Comparisons (Simulation) Introduction This procedure uses simulation analyze the power and significance level of three pair-wise multiple-comparison procedures: Tukey-Kramer,

More information

Introduction To R Workshop Selected R Commands (See Dalgaard, 2008, Appendix C)

Introduction To R Workshop Selected R Commands (See Dalgaard, 2008, Appendix C) ELEMENTARY Commands List Objects in Workspace Delete object Search Path ls(), or objects() rm() search() Variable Names Combinations of letters, digits and period. Don t start with number or period. Assignments

More information

Multiple Comparisons of Treatments vs. a Control (Simulation)

Multiple Comparisons of Treatments vs. a Control (Simulation) Chapter 585 Multiple Comparisons of Treatments vs. a Control (Simulation) Introduction This procedure uses simulation to analyze the power and significance level of two multiple-comparison procedures that

More information

arxiv: v1 [stat.co] 17 Jan 2015

arxiv: v1 [stat.co] 17 Jan 2015 JavaNPST: Nonparametric Statistical Tests in Java Joaquín Derrac 1, Salvador García 2, and Francisco Herrera 2 arxiv:1501.04222v1 [stat.co] 17 Jan 2015 1 Affectv: Affectv Limited, 33-34 Alfred Place, London,

More information

Cluster Randomization Create Cluster Means Dataset

Cluster Randomization Create Cluster Means Dataset Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters

More information

WolStat: A new statistics package for the behavioral and social sciences

WolStat: A new statistics package for the behavioral and social sciences Behavior Research Methods 2008, 40 (I), 94-101 doi: 10.3758/13RM40.1.94 WolStat: A new statistics package for the behavioral and social sciences ALLEN H. WOLACH Illinois Institute of Technology, Chicago,

More information

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242

Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Mean Tests & X 2 Parametric vs Nonparametric Errors Selection of a Statistical Test SW242 Creation & Description of a Data Set * 4 Levels of Measurement * Nominal, ordinal, interval, ratio * Variable Types

More information

Minitab 17 commands Prepared by Jeffrey S. Simonoff

Minitab 17 commands Prepared by Jeffrey S. Simonoff Minitab 17 commands Prepared by Jeffrey S. Simonoff Data entry and manipulation To enter data by hand, click on the Worksheet window, and enter the values in as you would in any spreadsheet. To then save

More information

JMP 10 Student Edition Quick Guide

JMP 10 Student Edition Quick Guide JMP 10 Student Edition Quick Guide Instructions presume an open data table, default preference settings and appropriately typed, user-specified variables of interest. RMC = Click Right Mouse Button Graphing

More information

Matlab for Engineers

Matlab for Engineers Matlab for Engineers Alistair Johnson 31st May 2012 Centre for Doctoral Training in Healthcare Innovation Institute of Biomedical Engineering Department of Engineering Science University of Oxford Supported

More information

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95

186 Statistics, Data Analysis and Modeling. Proceedings of MWSUG '95 A Statistical Analysis Macro Library in SAS Carl R. Haske, Ph.D., STATPROBE, nc., Ann Arbor, M Vivienne Ward, M.S., STATPROBE, nc., Ann Arbor, M ABSTRACT Statistical analysis plays a major role in pharmaceutical

More information

The Power and Sample Size Application

The Power and Sample Size Application Chapter 72 The Power and Sample Size Application Contents Overview: PSS Application.................................. 6148 SAS Power and Sample Size............................... 6148 Getting Started:

More information

LAMPIRAN. Tests of Normality. Kolmogorov-Smirnov a. Berat_Limfa KB KP P

LAMPIRAN. Tests of Normality. Kolmogorov-Smirnov a. Berat_Limfa KB KP P LAMPIRAN 1. Data Analisis Statistik 1.1 Berat Limpa U1 U2 U3 U4 U5 U6 Rata- SD Rata KB 0.53 0.17 0.18 0.2 0.18 0.13 0.23 0.15 KP 0.31 0.27 0.27 0.27 0.11 0.23 0.24 0.07 P1 0.23 0.21 0.12 0.2 0.24 0.23

More information

TABEL DISTRIBUSI DAN HUBUNGAN LENGKUNG RAHANG DAN INDEKS FASIAL N MIN MAX MEAN SD

TABEL DISTRIBUSI DAN HUBUNGAN LENGKUNG RAHANG DAN INDEKS FASIAL N MIN MAX MEAN SD TABEL DISTRIBUSI DAN HUBUNGAN LENGKUNG RAHANG DAN INDEKS FASIAL Lengkung Indeks fasial rahang Euryprosopic mesoprosopic leptoprosopic Total Sig. n % n % n % n % 0,000 Narrow 0 0 0 0 15 32,6 15 32,6 Normal

More information

Package crank. R topics documented: January 22, Version Title Completing ranks Date Author Jim Lemon

Package crank. R topics documented: January 22, Version Title Completing ranks Date Author Jim Lemon Version 1.0-1 Title Completing ranks Date 2010-01-15 Author , Package crank January 22, 2010 Maintainer Functions for completing and recalculating rankings. Depends

More information

SPSS: AN OVERVIEW. SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi

SPSS: AN OVERVIEW. SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi : AN OVERVIEW SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in 1. Introduction The abbreviation SPSS stands for Statistical Package for

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 5 Efficiency Issues

More information

Statistical Tests for Variable Discrimination

Statistical Tests for Variable Discrimination Statistical Tests for Variable Discrimination University of Trento - FBK 26 February, 2015 (UNITN-FBK) Statistical Tests for Variable Discrimination 26 February, 2015 1 / 31 General statistics Descriptional:

More information

Choosing the Right Procedure

Choosing the Right Procedure 3 CHAPTER 1 Choosing the Right Procedure Functional Categories of Base SAS Procedures 3 Report Writing 3 Statistics 3 Utilities 4 Report-Writing Procedures 4 Statistical Procedures 6 Available Statistical

More information

Excel 2010 with XLSTAT

Excel 2010 with XLSTAT Excel 2010 with XLSTAT J E N N I F E R LE W I S PR I E S T L E Y, PH.D. Introduction to Excel 2010 with XLSTAT The layout for Excel 2010 is slightly different from the layout for Excel 2007. However, with

More information

APPENDIX. Appendix 2. HE Staining Examination Result: Distribution of of BALB/c

APPENDIX. Appendix 2. HE Staining Examination Result: Distribution of of BALB/c APPENDIX Appendix 2. HE Staining Examination Result: Distribution of of BALB/c mice nucleus liver cells changes in percents between control group and intervention groups. Descriptives Groups Statistic

More information

Quantitative - One Population

Quantitative - One Population Quantitative - One Population The Quantitative One Population VISA procedures allow the user to perform descriptive and inferential procedures for problems involving one population with quantitative (interval)

More information

Package visualizationtools

Package visualizationtools Package visualizationtools April 12, 2011 Type Package Title Package contains a few functions to visualize statistical circumstances. Version 0.2 Date 2011-04-06 Author Thomas Roth Etienne Stockhausen

More information

Package PMCMR. R topics documented: January 18, Type Package

Package PMCMR. R topics documented: January 18, Type Package Type Package Package PMCMR January 18, 2018 Title Calculate Pairwise Multiple Comparisons of Mean Rank Sums Version 4.2 Date 2018-01-15 Author Thorsten Pohlert Maintainer Thorsten Pohlert

More information

Package tatest. July 18, 2018

Package tatest. July 18, 2018 Type Package Title Two-Group Ta-Test Version 1.0 Date 2018-07-10 Author Yuan-De Tan Package tatest July 18, 2018 Maintainer Yuan-De Tan The ta-test is a modified two-sample or two-group

More information

Base package The Base subscription includes the following features:

Base package The Base subscription includes the following features: IBM SPSS Statistics Subscription: Base, add-ons and features 1 2 3 Base package The Base subscription includes the following features: Data access and management Compare two data files for compatibility

More information

Chapter 12: Statistics

Chapter 12: Statistics Chapter 12: Statistics Once you have imported your data or created a geospatial model, you may wish to calculate some simple statistics, run some simple tests, or see some traditional plots. On the main

More information

The ICSNP Package. October 5, 2007

The ICSNP Package. October 5, 2007 The ICSNP Package October 5, 2007 Type Package Title Tools for Multivariate Nonparametrics Version 1.0-1 Date 2007-10-04 Author Klaus Nordhausen, Seija Sirkia, Hannu Oja, Dave Tyler Maintainer Klaus Nordhausen

More information

Supplementary text S6 Comparison studies on simulated data

Supplementary text S6 Comparison studies on simulated data Supplementary text S Comparison studies on simulated data Peter Langfelder, Rui Luo, Michael C. Oldham, and Steve Horvath Corresponding author: shorvath@mednet.ucla.edu Overview In this document we illustrate

More information

for Windows User guide Exeter Software Statistical software for biologists Version 3.3

for Windows User guide Exeter Software Statistical software for biologists Version 3.3 for Windows Statistical software for biologists Version 3.3 User guide F. James Rohlf Dennis E. Slice Department of Ecology and Evolution State University of New York Stony Brook, NY 11794 Exeter Software

More information

Multivariate Capability Analysis

Multivariate Capability Analysis Multivariate Capability Analysis Summary... 1 Data Input... 3 Analysis Summary... 4 Capability Plot... 5 Capability Indices... 6 Capability Ellipse... 7 Correlation Matrix... 8 Tests for Normality... 8

More information

One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test).

One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test). One way ANOVA when the data are not normally distributed (The Kruskal-Wallis test). Suppose you have a one way design, and want to do an ANOVA, but discover that your data are seriously not normal? Just

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Multivariate Normal Random Numbers

Multivariate Normal Random Numbers Multivariate Normal Random Numbers Revised: 10/11/2017 Summary... 1 Data Input... 3 Analysis Options... 4 Analysis Summary... 5 Matrix Plot... 6 Save Results... 8 Calculations... 9 Summary This procedure

More information

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea

Chapter 3. Bootstrap. 3.1 Introduction. 3.2 The general idea Chapter 3 Bootstrap 3.1 Introduction The estimation of parameters in probability distributions is a basic problem in statistics that one tends to encounter already during the very first course on the subject.

More information

Assumption 1: Groups of data represent random samples from their respective populations.

Assumption 1: Groups of data represent random samples from their respective populations. Tutorial 6: Comparing Two Groups Assumptions The following methods for comparing two groups are based on several assumptions. The type of test you use will vary based on whether these assumptions are met

More information

Package diggit. R topics documented: January 11, Version Date

Package diggit. R topics documented: January 11, Version Date Version 1.14.0 Date 2014-08-22 Package diggit January 11, 2019 Title Inference of Genetic Variants Driving Cellular Phenotypes Author Mariano J Alvarez Maintainer Mariano J Alvarez

More information

SPSS. (Statistical Packages for the Social Sciences)

SPSS. (Statistical Packages for the Social Sciences) Inger Persson SPSS (Statistical Packages for the Social Sciences) SHORT INSTRUCTIONS This presentation contains only relatively short instructions on how to perform basic statistical calculations in SPSS.

More information

Nonparametric Testing

Nonparametric Testing Nonparametric Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see

More information

Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida

Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida Land Cover Stratified Accuracy Assessment For Digital Elevation Model derived from Airborne LIDAR Dade County, Florida FINAL REPORT Submitted October 2004 Prepared by: Daniel Gann Geographic Information

More information

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS

Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS ABSTRACT Paper 1938-2018 Frequencies, Unequal Variance Weights, and Sampling Weights: Similarities and Differences in SAS Robert M. Lucas, Robert M. Lucas Consulting, Fort Collins, CO, USA There is confusion

More information

R Commands. Marc H. Mehlman. 8 March 2013

R Commands. Marc H. Mehlman. 8 March 2013 R Commands Marc H. Mehlman 8 March 2013 1 Inputing Data 1.1 By Hand > dat=c(1, 2, 3, 4, 5) > dat [1] 1 2 3 4 5 1.2 From an Ascii File The format of the file is one of columns - one variable per column.

More information

The binmto Package. August 25, 2007

The binmto Package. August 25, 2007 The binmto Package August 25, 2007 Type Package Title Asymptotic simultaneous confdence intervals for many-to-one comparisons of proportions Version 0.0-3 Date 2006-12-29 Author Maintainer

More information

An Introduction to the Bootstrap

An Introduction to the Bootstrap An Introduction to the Bootstrap Bradley Efron Department of Statistics Stanford University and Robert J. Tibshirani Department of Preventative Medicine and Biostatistics and Department of Statistics,

More information

Package binmto. February 19, 2015

Package binmto. February 19, 2015 Type Package Package binmto February 19, 2015 Title Asymptotic simultaneous confidence intervals for many-to-one comparisons of proportions Version 0.0-6 Date 2013-09-30 Author Maintainer

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

Acknowledgments. Acronyms

Acknowledgments. Acronyms Acknowledgments Preface Acronyms xi xiii xv 1 Basic Tools 1 1.1 Goals of inference 1 1.1.1 Population or process? 1 1.1.2 Probability samples 2 1.1.3 Sampling weights 3 1.1.4 Design effects. 5 1.2 An introduction

More information

Package quickreg. R topics documented:

Package quickreg. R topics documented: Package quickreg September 28, 2017 Title Build Regression Models Quickly and Display the Results Using 'ggplot2' Version 1.5.0 A set of functions to extract results from regression models and plot the

More information

Package irrna. April 11, 2018

Package irrna. April 11, 2018 Type Package Package irrna April 11, 2018 Title Coefficients of Interrater Reliability - Generalized for Randomly Incomplete Datasets Version 0.1.4 Date 2018-04-10 Author Markus Brueckl [aut, cre], Florian

More information

IBM SPSS Statistics Traditional License packages and features

IBM SPSS Statistics Traditional License packages and features IBM SPSS Statistics Traditional License packages and features 1 2 3 The includes the following features: Data access and management Compare two data files for compatibility Data prep features: Define Variable

More information

The Bolstad Package. July 9, 2007

The Bolstad Package. July 9, 2007 The Bolstad Package July 9, 2007 Version 0.2-12 Date 2007-09-07 Title Bolstad functions Author James Curran Maintainer James M. Curran A set of

More information

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions

MINITAB Release Comparison Chart Release 14, Release 13, and Student Versions Technical Support Free technical support Worksheet Size All registered users, including students Registered instructors Number of worksheets Limited only by system resources 5 5 Number of cells per worksheet

More information

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions)

THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533. Time: 50 minutes 40 Marks FRST Marks FRST 533 (extra questions) THE UNIVERSITY OF BRITISH COLUMBIA FORESTRY 430 and 533 MIDTERM EXAMINATION: October 14, 2005 Instructor: Val LeMay Time: 50 minutes 40 Marks FRST 430 50 Marks FRST 533 (extra questions) This examination

More information

Package infer. July 11, Type Package Title Tidy Statistical Inference Version 0.3.0

Package infer. July 11, Type Package Title Tidy Statistical Inference Version 0.3.0 Type Package Title Tidy Statistical Inference Version 0.3.0 Package infer July 11, 2018 The objective of this package is to perform inference using an epressive statistical grammar that coheres with the

More information

Table of Contents (As covered from textbook)

Table of Contents (As covered from textbook) Table of Contents (As covered from textbook) Ch 1 Data and Decisions Ch 2 Displaying and Describing Categorical Data Ch 3 Displaying and Describing Quantitative Data Ch 4 Correlation and Linear Regression

More information

file:///users/williams03/a/workshops/2015.march/final/intro_to_r.html

file:///users/williams03/a/workshops/2015.march/final/intro_to_r.html Intro to R R is a functional programming language, which means that most of what one does is apply functions to objects. We will begin with a brief introduction to R objects and how functions work, and

More information

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application

SAS/STAT 13.1 User s Guide. The Power and Sample Size Application SAS/STAT 13.1 User s Guide The Power and Sample Size Application This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete manual is as

More information

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables:

Regression Lab 1. The data set cholesterol.txt available on your thumb drive contains the following variables: Regression Lab The data set cholesterol.txt available on your thumb drive contains the following variables: Field Descriptions ID: Subject ID sex: Sex: 0 = male, = female age: Age in years chol: Serum

More information

The results section of a clinicaltrials.gov file is divided into discrete parts, each of which includes nested series of data entry screens.

The results section of a clinicaltrials.gov file is divided into discrete parts, each of which includes nested series of data entry screens. OVERVIEW The ClinicalTrials.gov Protocol Registration System (PRS) is a web-based tool developed for submitting clinical trials information to ClinicalTrials.gov. This document provides step-by-step instructions

More information

On the Analysis of Experimental Results in Evolutionary Computation

On the Analysis of Experimental Results in Evolutionary Computation On the Analysis of Experimental Results in Evolutionary Computation Stjepan Picek Faculty of Electrical Engineering and Computing Unska 3, Zagreb, Croatia Email:stjepan@computer.org Marin Golub Faculty

More information

STATISTICS (STAT) 200 Level Courses. 300 Level Courses. Statistics (STAT) 1

STATISTICS (STAT) 200 Level Courses. 300 Level Courses. Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) 200 Level Courses STAT 250: Introductory Statistics I. 3 credits. Elementary introduction to statistics. Topics include descriptive statistics, probability, and estimation

More information

Package CUFF. May 28, 2018

Package CUFF. May 28, 2018 Note -*- Encoding: utf-8 -*- Type Package Title Charles's Utility Function using Formula Version 1.5 Date 2018-05-27 Author Package CUFF May 28, 2018 Maintainer Depends R (>= 3.2.2)

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

The quantchem Package

The quantchem Package The quantchem Package February 26, 2006 Type Package Title Quantitative chemical analysis: calibration and evaluation of results Version 0.11-2 Date 2005-09-18 Depends R (>= 2.0), MASS, outliers Author

More information

The meboot Package. January 26, 2007

The meboot Package. January 26, 2007 The meboot Package January 26, 2007 Version 0.0.1 Date 2007/01/26 Title Maimum Entropy Bootstrap for Time Series Author Hrishikesh D. Vinod and Javier López-de-Lacalle Maintainer Javier

More information

BESTFIT, DISTRIBUTION FITTING SOFTWARE BY PALISADE CORPORATION

BESTFIT, DISTRIBUTION FITTING SOFTWARE BY PALISADE CORPORATION Proceedings of the 1996 Winter Simulation Conference ed. J. M. Charnes, D. J. Morrice, D. T. Brunner, and J. J. S\vain BESTFIT, DISTRIBUTION FITTING SOFTWARE BY PALISADE CORPORATION Linda lankauskas Sam

More information

Package fgpt. February 19, 2015

Package fgpt. February 19, 2015 Type Package Title Floating Grid Permutation Technique Version 2.3 Date 2015-02-19 Author Reinder Radersma & Ben Sheldon Package fgpt February 19, 2015 Maintainer Reinder Radersma

More information

Package diagis. January 25, 2018

Package diagis. January 25, 2018 Type Package Package diagis January 25, 2018 Title Diagnostic Plot and Multivariate Summary Statistics of Weighted Samples from Importance Sampling Version 0.1.3-1 Date 2018-01-25 Author Jouni Helske Maintainer

More information

Brief Guide on Using SPSS 10.0

Brief Guide on Using SPSS 10.0 Brief Guide on Using SPSS 10.0 (Use student data, 22 cases, studentp.dat in Dr. Chang s Data Directory Page) (Page address: http://www.cis.ysu.edu/~chang/stat/) I. Processing File and Data To open a new

More information

An Algorithm to Compute Exact Power of an Unordered RxC Contingency Table

An Algorithm to Compute Exact Power of an Unordered RxC Contingency Table NESUG 27 An Algorithm to Compute Eact Power of an Unordered RC Contingency Table Vivek Pradhan, Cytel Inc., Cambridge, MA Stian Lydersen, Department of Cancer Research and Molecular Medicine, Norwegian

More information

Statistical matching: conditional. independence assumption and auxiliary information

Statistical matching: conditional. independence assumption and auxiliary information Statistical matching: conditional Training Course Record Linkage and Statistical Matching Mauro Scanu Istat scanu [at] istat.it independence assumption and auxiliary information Outline The conditional

More information

Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests

Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Biostat Methods STAT 5820/6910 Handout #4: Chi-square, Fisher s, and McNemar s Tests Example 1: 152 patients were randomly assigned to 4 dose groups in a clinical study. During the course of the study,

More information