Constrained Optimal Sample Allocation in Multilevel Randomized Experiments Using PowerUpR

Size: px
Start display at page:

Download "Constrained Optimal Sample Allocation in Multilevel Randomized Experiments Using PowerUpR"

Transcription

1 Constrained Optimal Sample Allocation in Multilevel Randomized Experiments Using PowerUpR Metin Bulus & Nianbo Dong University of Missouri March 3, 2017 Contents Introduction PowerUpR Package COSA Functions in PowerUpR Installation and Usage Constraints in COSA Example use of COSA Functions in PowerUpR Statistical Model Parameter Values Scenario 1: Constrain Power Scenario 2: Constrain MDES and J Scenario 3: Constrain Cost and J Scenario 4: Constrain Cost, n and J Plots [i] Plots [ii] Plots [iii] Plots [iv] Online GUI for PowerUpR 95% CI for MDES and Percentiles Summary Future Directions Thank you! References Note. To access an HTML version of this document please use the following link 1

2 Introduction There are several commercial and publicly available tools to assist designing multilevel randomized experiments such as CRT-Power (Borenstein, Hedges, & Rothstein, 2012), PowerUp! (Dong & Maynard, 2013), and Optimal Design (Raudenbush et al., 2011). Hedges and Borenstein (2014) has introduced the idea of constrained optimal sample allocations (COSA) considering limitations with the sample sizes in CRT-power. However, there isn't any statistical tool to assist researchers in this endeavor in R environment (R Core Team, 2017), which is free and increasingly used. 2

3 PowerUpR Package So we translated PowerUp! into R and added COSA functions. CRAN page includes all the information about the package, and is available at 3

4 COSA Functions in PowerUpR COSA functions begin with optimal, followed by a period, and a design name. There are 13 types of designs where COSA is applicable: bira2r1, bira2f1, bira2c1, cra2r2, bira3r1, bcra3r2, bcra3f2, cra3r3, bira4r1, bcra4r2, bcra4r3, bcra4f3, and cra4r4. The first three or four letters of the design stands for the type of assignment, for blocked/multisite individual random assignment bira, for cluster random assignment cra, and for blocked/multisite cluster random assignment bcra. It is followed by a number indicating number of levels. A single letter followed by a number indicates whether a block is considered to be r, random; f, fixed; orc, constant and the level of randomization. e.g., optimal.cra3r3 4

5 Installation and Usage 1. Download and install R from 2. Download and install RStudio from (optional) Open R or RStudio, then type the following code to install the package install.packages("powerupr") Load the package into the current session library(powerupr) Get help for a specific function?optimal.cra3r3 Copy and paste the code from Examples section in help file, and modify it to your own case (Important: Note default values in Usage section!) optimal.cra3r3(cn=5, cj=50, ck=200, constrain="cost", cost=100000, rho3=.06, rho2=.17) 5

6 Constraints in COSA PowerUpR package provides functions to deal with COSA problems under the following conditions under budgetary constraints given marginal costs per unit while minimizing sampling variance of the treatment effect, under power constraints given marginal costs per unit while minimizing total cost, under MDES constraints given marginal costs per unit while minimizing total cost, and under sample size constraints for one or more levels along with any of the options above. 6

7 Example use of COSA Functions in PowerUpR Institue of Education Science (IES) reported that evidence on effectiveness of Saxon Math curriclum on secondary school student's algebra subject is lacking and further research is needed (U.S. Department of Education, 2016). To evaluate the effectiveness Saxon Math curriculum, assume a three-level cluster randomized trial is considered where schools are randomly assigned to treatment (Saxon Math curriclum) and a control condition. In the grant proposal number of schools, classrooms, and students, and the cost associated with data collection from this sample needs to be justified. We will use PowerUpR package (Bulus & Dong, 2017) an implementation of PowerUp! (Dong & Maynard, 2013) software in R environment (R Core Team, 2017). 7

8 Statistical Model Due to nesting of students within classrooms and nesting of classrooms within schools, the treatment effect (δ) can be estimated via the following 3-level hierarchical linear model (HLM, Raudenbush & Bryk, 2002) where, Y: Student algebra posttest score. X: Student algebra pretest score. W: Classroom mean algebra pretest score. V: School mean algebra pretest score. T: Treatment status. 8

9 Parameter Values Consider the following hypothetical values for design parameters Proportion of variance in algebra posttest at the school level (τ3 2 ):.15 Proportion of variance in algebra posttest at the classroom level (τ2 2 ):.05 Proportion of variance in algebra posttest at the student level (σ 2 ):.65 Then, intraclass correlation coefficients are ρ2 =.05/( )=0.059 ρ3 =.15/( )=0.176 Proportion of variance in algebra posttest explained by school mean algebra pretest: (R3 2 ):.45 Proportion of variance in algebra posttest explained by classroom mean algebra pretest: (R2 2 ):.50 Proportion of variance in algebra posttest explained by student algebra pretest: (R1 2 ):.55 9

10 Scenario 1: Constrain Power Further assumptions Alpha level (α): 0.05 Statistical power (1 β):.80 Minimum detectable effect size (MDES):.232 Marginal cost per student (cn): 10 Marginal cost per classroom (cj): 200 Marginal cost per school (ck): 500 Given design parameters and further assumptions, how many schools, classrooms and students should be recruited to achieve 80% power while minimizing the total cost? library(powerupr) design1 <- optimal.cra3r3(cn=10, cj=200, ck=500, constrain="power", power=.80, mdes=.232, rho2=.06, rho3=.18, g3=1, R12=.55, R22=.50, R32=.45, P=.40) print(design1$round.optim) ## n J K cost mdes power ## [1,] print(design1$integer.optim) ## n J K cost mdes power ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,]

11 Scenario 2: Constrain MDES and J Further assumptions Alpha level (α): 0.05 Statistical power (1 β):.80 Minimum detectable effect size (MDES):.232 Average number of classrooms per school (J): 2 Marginal cost per student (cn): 10 Marginal cost per classroom (cj): 200 Marginal cost per school (ck): 500 Given design parameters and further assumptions, how many schools, and students should be recruited to achieve an MDES of.232 while minimizing the total cost? design2 <- optimal.cra3r3(cn=10, cj=200, ck=500, constrain="mdes", mdes=.232, J=2, power=.80, rho2=.06, rho3=.18, g3=1, R12=.55, R22=.50, R32=.45, P=.40) print(design2$round.optim) ## n J K cost mdes power ## [1,] print(design2$integer.optim) ## n J K cost mdes power ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,]

12 Scenario 3: Constrain Cost and J Further assumptions Alpha level (α): 0.05 Statistical power (1 β):.80 Minimum detectable effect size (MDES) Average number of classrooms per school (J): 2 Marginal cost per student (cn): 10 Marginal cost per classroom (cj): 200 Marginal cost per school (ck): 500 Budget: 130, 000 Given design parameters and further assumptions, how many schools, and students should be recruited considering a budget of $130,000 while minimizing the sampling variance of the treatment effect? design3 <- optimal.cra3r3(cn=10, cj=200, ck=500, constrain="cost", cost=130000, J=2, power=.80, mdes=.20, rho2=.06, rho3=.18, g3=1, R12=.55, R22=.50, R32=.45, P=.40) print(design3$round.optim) ## n J K cost mdes power ## [1,] print(design3$integer.optim) ## n J K cost mdes power ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,]

13 Scenario 4: Constrain Cost, n and J Further assumptions Alpha level (α): 0.05 Statistical power (1 β):.80 Minimum detectable effect size (MDES):.20 Average number of students per classroom (n): 20 Average number of classrooms per school (J): 2 Marginal cost per student (cn): 10 Marginal cost per classroom (cj): 200 Marginal cost per school (ck): 500 Budget: 130, 000 Given design parameters and further assumptions, how many schools should be recruited considering a budget of $130,000 while minimizing the sampling variance of the treatment effect? design4 <- optimal.cra3r3(cn=10, cj=200, ck=500, constrain="cost", cost=130000, n=20, J=2, power=.80, mdes=.20, rho2=.06, rho3=.18, g3=1, R12=.55, R22=.50, R32=.45, P=.40) print(design4$round.optim) ## n J K cost mdes power ## [1,] print(design4$integer.optim) ## n J K cost mdes power ## [1,] ## [2,] ## [3,] ## [4,] ## [5,] ## [6,] ## [7,] ## [8,] ## [9,] ## [10,]

14 Plots [i] ## MDES - K ## Red point indicates the design location plot(design4, pars=c("mdes","k")) 14

15 Plots [ii] ## Power - K ## Red point indicates the design location plot(design4, pars=c("power","k")) 15

16 Plots [iii] ## Power - MDES - K ## Red point indicates the design location plot(design4, pars=c("power","mdes", "K")) 16

17 Plots [iv] ## Power - MDES - K ## Red point indicates the design location plot(design4, pars=c("power","mdes", "K"), type="c") 17

18 Online GUI for PowerUpR Online version is accesible through: Replicate scenarios 1-4 in PowerUpR online version 18

19 95% CI for MDES and Percentiles ## confidence intervals for MDES design5 <- optimal.to.mdes(design4) print(design5$mdes) ## mdes 95% LCL 95% UCL ## [1,] ## percentile equivalance mdes.to.pctl(design5) ## pctl 95% LCL 95% UCL ## [1,] "58%" "52%" "63%" 19

20 Summary A three-level hierarchical linear model is proposed to estimate treatment effect, where schools are randomly assigned to treatment and control conditions with 40% probability. We assume that proportion of variance in algebra posttest explained by school mean algebra pretest is 45%, proportion of variance in algebra posttest explained by classroom mean algebra pretest is 50%, proportion of variance in algebra posttest explained by student algebra pretest is 55%. Furthermore we assume intraclass correlation coefficient at the school level is 17.6%, and intraclass correlation coefficient at the classroom level is 5.9%. Having $10 marginal cost per student, $200 marginal cost per classroom, $500 marginal cost per school, based on PowerUpR package (Bulus & Dong, 2017), with a budget of $ we can afford recruting 100 schools in total, each with two classrooms and 40 students on average. For a two-tailed hypothesis test for treatment effect, and Type I error rate of 5%, such sample is estimated to detect an effect size as small as % CI[.060,.344] with statistical power of 80%, which is equaivalent to increasing an average student's algebra posttest score from 50th percentile to 58th percentile. 20

21 Future Directions Moderator and mediator effects (in ~ v0.2.3) Binary outcomes for main, moderator and mediator effects (in ~ v1.2.3) 21

22 Thank you! Thank you for your participation and comments! 22

23 References Bulus, M., & Dong, N. (2017). PowerUpR: Power Analysis Tools for Multilevel Randomized Experiments. R package version Borenstein, M., Hedges, L. V., & Rothstein, H. (2012). CRT Power. Teaneck, NJ: Biostat.[Software] Dong, N., & Maynard, R. A. (2013). PowerUp!: A Tool for Calculating Minimum Detectable Effect Sizes and Minimum Required Sample Sizes for Experimental and Quasi-Experimental Design Studies, Journal of Research on Educational Effectiveness, 6(1), Hedges, L. V., & Borenstein, M. (2014). Conditional Optimal Design in Three- and Four-Level Experiments. Journal of Educational and Behavioral Statistics, 39(4), R Core Team (2017). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL Raudenbush, S. W., Spybrook, J., Congdon, R., Liu, X. F., Martinez, A., & Bloom, H. (2011). Optimal design software for multi-level and longitudinal research (Version 3.01) [Software]. U.S. Department of Education, Institute of Education Sciences, What Works Clearinghouse. (2016, May). Secondary Mathematics intervention report: Saxon Math. Retrieved from 23

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions:

Online Supplementary Appendix for. Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Online Supplementary Appendix for Dziak, Nahum-Shani and Collins (2012), Multilevel Factorial Experiments for Developing Behavioral Interventions: Power, Sample Size, and Resource Considerations 1 Appendix

More information

The linear mixed model: modeling hierarchical and longitudinal data

The linear mixed model: modeling hierarchical and longitudinal data The linear mixed model: modeling hierarchical and longitudinal data Analysis of Experimental Data AED The linear mixed model: modeling hierarchical and longitudinal data 1 of 44 Contents 1 Modeling Hierarchical

More information

Design and Analysis Methods for Cluster Randomized Trials with Pair-Matching on Baseline Outcome: Reduction of Treatment Effect Variance

Design and Analysis Methods for Cluster Randomized Trials with Pair-Matching on Baseline Outcome: Reduction of Treatment Effect Variance Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2006 Design and Analysis Methods for Cluster Randomized Trials with Pair-Matching on Baseline Outcome: Reduction

More information

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown

Z-TEST / Z-STATISTIC: used to test hypotheses about. µ when the population standard deviation is unknown Z-TEST / Z-STATISTIC: used to test hypotheses about µ when the population standard deviation is known and population distribution is normal or sample size is large T-TEST / T-STATISTIC: used to test hypotheses

More information

An Introduction to Growth Curve Analysis using Structural Equation Modeling

An Introduction to Growth Curve Analysis using Structural Equation Modeling An Introduction to Growth Curve Analysis using Structural Equation Modeling James Jaccard New York University 1 Overview Will introduce the basics of growth curve analysis (GCA) and the fundamental questions

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Generalized least squares (GLS) estimates of the level-2 coefficients,

Generalized least squares (GLS) estimates of the level-2 coefficients, Contents 1 Conceptual and Statistical Background for Two-Level Models...7 1.1 The general two-level model... 7 1.1.1 Level-1 model... 8 1.1.2 Level-2 model... 8 1.2 Parameter estimation... 9 1.3 Empirical

More information

STATISTICS (STAT) Statistics (STAT) 1

STATISTICS (STAT) Statistics (STAT) 1 Statistics (STAT) 1 STATISTICS (STAT) STAT 2013 Elementary Statistics (A) Prerequisites: MATH 1483 or MATH 1513, each with a grade of "C" or better; or an acceptable placement score (see placement.okstate.edu).

More information

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display

Learner Expectations UNIT 1: GRAPICAL AND NUMERIC REPRESENTATIONS OF DATA. Sept. Fathom Lab: Distributions and Best Methods of Display CURRICULUM MAP TEMPLATE Priority Standards = Approximately 70% Supporting Standards = Approximately 20% Additional Standards = Approximately 10% HONORS PROBABILITY AND STATISTICS Essential Questions &

More information

= = P. IE 434 Homework 2 Process Capability. Kate Gilland 10/2/13. Figure 1: Capability Analysis

= = P. IE 434 Homework 2 Process Capability. Kate Gilland 10/2/13. Figure 1: Capability Analysis Kate Gilland 10/2/13 IE 434 Homework 2 Process Capability 1. Figure 1: Capability Analysis σ = R = 4.642857 = 1.996069 P d 2 2.326 p = 1.80 C p = 2.17 These results are according to Method 2 in Minitab.

More information

Introduction to Mixed Models: Multivariate Regression

Introduction to Mixed Models: Multivariate Regression Introduction to Mixed Models: Multivariate Regression EPSY 905: Multivariate Analysis Spring 2016 Lecture #9 March 30, 2016 EPSY 905: Multivariate Regression via Path Analysis Today s Lecture Multivariate

More information

Confidence Intervals: Estimators

Confidence Intervals: Estimators Confidence Intervals: Estimators Point Estimate: a specific value at estimates a parameter e.g., best estimator of e population mean ( ) is a sample mean problem is at ere is no way to determine how close

More information

Using HLM for Presenting Meta Analysis Results. R, C, Gardner Department of Psychology

Using HLM for Presenting Meta Analysis Results. R, C, Gardner Department of Psychology Data_Analysis.calm: dacmeta Using HLM for Presenting Meta Analysis Results R, C, Gardner Department of Psychology The primary purpose of meta analysis is to summarize the effect size results from a number

More information

Clustering human body shapes using k-means algorithm

Clustering human body shapes using k-means algorithm Clustering human body shapes using k-means algorithm Guillermo Vinué Visús PhD Student Faculty of Mathematics University of Valencia, Spain Jointly with Guillermo Ayala Gallego, Juan Domingo Esteve, Esther

More information

Power of the power command in Stata 13

Power of the power command in Stata 13 Power of the power command in Stata 13 Yulia Marchenko Director of Biostatistics StataCorp LP 2013 UK Stata Users Group meeting Yulia Marchenko (StataCorp) September 13, 2013 1 / 27 Outline Outline Basic

More information

Package MOST. November 9, 2017

Package MOST. November 9, 2017 Type Package Title Multiphase Optimization Strategy Version 0.1.0 Depends R (>= 2.15.0) Copyright The Pennsylvania State University Package MOST November 9, 2017 Description Provides functions similar

More information

Multilevel generalized linear modeling

Multilevel generalized linear modeling Multilevel generalized linear modeling 1. Introduction Many popular statistical methods are based on mathematical models that assume data follow a normal distribution. The most obvious among these are

More information

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York Clustering Robert M. Haralick Computer Science, Graduate Center City University of New York Outline K-means 1 K-means 2 3 4 5 Clustering K-means The purpose of clustering is to determine the similarity

More information

2016 Stat-Ease, Inc. & CAMO Software

2016 Stat-Ease, Inc. & CAMO Software Multivariate Analysis and Design of Experiments in practice using The Unscrambler X Frank Westad CAMO Software fw@camo.com Pat Whitcomb Stat-Ease pat@statease.com Agenda Goal: Part 1: Part 2: Show how

More information

Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals

Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals BIOSTATS 540 Fall 017 8. SUPPLEMENT Normal, T, Chi Square, F and Sums of Normals Page 1 of Unit 8 SUPPLEMENT Normal, T, Chi Square, F, and Sums of Normals Topic 1. Normal Distribution.. a. Definition..

More information

Enduring Understandings: Some basic math skills are required to be reviewed in preparation for the course.

Enduring Understandings: Some basic math skills are required to be reviewed in preparation for the course. Curriculum Map for Functions, Statistics and Trigonometry September 5 Days Targeted NJ Core Curriculum Content Standards: N-Q.1, N-Q.2, N-Q.3, A-CED.1, A-REI.1, A-REI.3 Enduring Understandings: Some basic

More information

Table Of Contents. Table Of Contents

Table Of Contents. Table Of Contents Statistics Table Of Contents Table Of Contents Basic Statistics... 7 Basic Statistics Overview... 7 Descriptive Statistics Available for Display or Storage... 8 Display Descriptive Statistics... 9 Store

More information

One Factor Experiments

One Factor Experiments One Factor Experiments 20-1 Overview Computation of Effects Estimating Experimental Errors Allocation of Variation ANOVA Table and F-Test Visual Diagnostic Tests Confidence Intervals For Effects Unequal

More information

Title. Description. Menu. Remarks and examples. stata.com. stata.com. PSS Control Panel

Title. Description. Menu. Remarks and examples. stata.com. stata.com. PSS Control Panel Title stata.com GUI Graphical user interface for power and sample-size analysis Description Menu Remarks and examples Also see Description This entry describes the graphical user interface (GUI) for the

More information

Further Simulation Results on Resampling Confidence Intervals for Empirical Variograms

Further Simulation Results on Resampling Confidence Intervals for Empirical Variograms University of Wollongong Research Online Centre for Statistical & Survey Methodology Working Paper Series Faculty of Engineering and Information Sciences 2010 Further Simulation Results on Resampling Confidence

More information

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA

CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Examples: Mixture Modeling With Cross-Sectional Data CHAPTER 7 EXAMPLES: MIXTURE MODELING WITH CROSS- SECTIONAL DATA Mixture modeling refers to modeling with categorical latent variables that represent

More information

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition

Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Bluman & Mayer, Elementary Statistics, A Step by Step Approach, Canadian Edition Online Learning Centre Technology Step-by-Step - Minitab Minitab is a statistical software application originally created

More information

Pair-Wise Multiple Comparisons (Simulation)

Pair-Wise Multiple Comparisons (Simulation) Chapter 580 Pair-Wise Multiple Comparisons (Simulation) Introduction This procedure uses simulation analyze the power and significance level of three pair-wise multiple-comparison procedures: Tukey-Kramer,

More information

Technical Appendix B

Technical Appendix B Technical Appendix B School Effectiveness Models and Analyses Overview Pierre Foy and Laura M. O Dwyer Many factors lead to variation in student achievement. Through data analysis we seek out those factors

More information

An Experiment in Visual Clustering Using Star Glyph Displays

An Experiment in Visual Clustering Using Star Glyph Displays An Experiment in Visual Clustering Using Star Glyph Displays by Hanna Kazhamiaka A Research Paper presented to the University of Waterloo in partial fulfillment of the requirements for the degree of Master

More information

Cluster Randomization Create Cluster Means Dataset

Cluster Randomization Create Cluster Means Dataset Chapter 270 Cluster Randomization Create Cluster Means Dataset Introduction A cluster randomization trial occurs when whole groups or clusters of individuals are treated together. Examples of such clusters

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 8: Interval Estimation Population Mean: Known Population Mean: Unknown Margin of Error and the Interval

More information

6th Grade Math. Lindsay Law - Curriculum Facilitator (ext. 2085)

6th Grade Math. Lindsay Law - Curriculum Facilitator (ext. 2085) 6th Grade Math Purpose Students will become flexible thinkers and complex problem solvers by applying essential mathematical ideas and concepts through a rigorous, focused, and relevant curriculum. Philosophy

More information

IQC monitoring in laboratory networks

IQC monitoring in laboratory networks IQC for Networked Analysers Background and instructions for use IQC monitoring in laboratory networks Modern Laboratories continue to produce large quantities of internal quality control data (IQC) despite

More information

Introduction to Hierarchical Linear Model. Hsueh-Sheng Wu CFDR Workshop Series January 30, 2017

Introduction to Hierarchical Linear Model. Hsueh-Sheng Wu CFDR Workshop Series January 30, 2017 Introduction to Hierarchical Linear Model Hsueh-Sheng Wu CFDR Workshop Series January 30, 2017 1 Outline What is Hierarchical Linear Model? Why do nested data create analytic problems? Graphic presentation

More information

Software for your own computer: R, RStudio, LaTeX, PsychoPy

Software for your own computer: R, RStudio, LaTeX, PsychoPy Software for your own computer: R, RStudio, LaTeX, PsychoPy You do not need your own computer for this class. There are, however, four software packages that you might want to install on your own computer,

More information

Nature Methods: doi: /nmeth Supplementary Figure 1

Nature Methods: doi: /nmeth Supplementary Figure 1 Supplementary Figure 1 Performance analysis under four different clustering scenarios. Performance analysis under four different clustering scenarios. i) Standard Conditions, ii) a sparse data set with

More information

Robust Linear Regression (Passing- Bablok Median-Slope)

Robust Linear Regression (Passing- Bablok Median-Slope) Chapter 314 Robust Linear Regression (Passing- Bablok Median-Slope) Introduction This procedure performs robust linear regression estimation using the Passing-Bablok (1988) median-slope algorithm. Their

More information

Regression Analysis and Linear Regression Models

Regression Analysis and Linear Regression Models Regression Analysis and Linear Regression Models University of Trento - FBK 2 March, 2015 (UNITN-FBK) Regression Analysis and Linear Regression Models 2 March, 2015 1 / 33 Relationship between numerical

More information

Week 4: Simple Linear Regression III

Week 4: Simple Linear Regression III Week 4: Simple Linear Regression III Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Goodness of

More information

CATEGORICAL DATA GENERATOR

CATEGORICAL DATA GENERATOR CATEGORICAL DATA GENERATOR Jana Cibulková Hana Řezanková Abstract Generated data are often required for quality evaluation of a newly proposed statistical method and its results. The number of datasets

More information

MIXED_RELIABILITY: A SAS Macro for Estimating Lambda and Assessing the Trustworthiness of Random Effects in Multilevel Models

MIXED_RELIABILITY: A SAS Macro for Estimating Lambda and Assessing the Trustworthiness of Random Effects in Multilevel Models SESUG 2015 Paper SD189 MIXED_RELIABILITY: A SAS Macro for Estimating Lambda and Assessing the Trustworthiness of Random Effects in Multilevel Models Jason A. Schoeneberger ICF International Bethany A.

More information

(Not That) Advanced Hierarchical Models

(Not That) Advanced Hierarchical Models (Not That) Advanced Hierarchical Models Ben Goodrich StanCon: January 10, 2018 Ben Goodrich Advanced Hierarchical Models StanCon 1 / 13 Obligatory Disclosure Ben is an employee of Columbia University,

More information

Chapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots.

Chapter 3: Data Description Calculate Mean, Median, Mode, Range, Variation, Standard Deviation, Quartiles, standard scores; construct Boxplots. MINITAB Guide PREFACE Preface This guide is used as part of the Elementary Statistics class (Course Number 227) offered at Los Angeles Mission College. It is structured to follow the contents of the textbook

More information

Cpk: What is its Capability? By: Rick Haynes, Master Black Belt Smarter Solutions, Inc.

Cpk: What is its Capability? By: Rick Haynes, Master Black Belt Smarter Solutions, Inc. C: What is its Capability? By: Rick Haynes, Master Black Belt Smarter Solutions, Inc. C is one of many capability metrics that are available. When capability metrics are used, organizations typically provide

More information

A HYBRID METHOD FOR SIMULATION FACTOR SCREENING. Hua Shen Hong Wan

A HYBRID METHOD FOR SIMULATION FACTOR SCREENING. Hua Shen Hong Wan Proceedings of the 2006 Winter Simulation Conference L. F. Perrone, F. P. Wieland, J. Liu, B. G. Lawson, D. M. Nicol, and R. M. Fujimoto, eds. A HYBRID METHOD FOR SIMULATION FACTOR SCREENING Hua Shen Hong

More information

Software for your own computer: R, RStudio, LaTeX, PsychoPy

Software for your own computer: R, RStudio, LaTeX, PsychoPy Software for your own computer: R, RStudio, LaTeX, PsychoPy There are four software packages that you might want to install on your own computer. They will allow you to work on the various class exercises

More information

Chapter 2: The Normal Distributions

Chapter 2: The Normal Distributions Chapter 2: The Normal Distributions Measures of Relative Standing & Density Curves Z-scores (Measures of Relative Standing) Suppose there is one spot left in the University of Michigan class of 2014 and

More information

PEARSON PRE ALGEBRA PDF

PEARSON PRE ALGEBRA PDF PEARSON PRE ALGEBRA PDF ==> Download: PEARSON PRE ALGEBRA PDF PEARSON PRE ALGEBRA PDF - Are you searching for Pearson Pre Algebra Books? Now, you will be happy that at this time Pearson Pre Algebra PDF

More information

Software for your own computer: R, RStudio, LaTeX, PsychoPy

Software for your own computer: R, RStudio, LaTeX, PsychoPy Software for your own computer: R, RStudio, LaTeX, PsychoPy You do not need your own computer for this class. There are, however, four software packages that you might want to install on your own computer,

More information

Slides 11: Verification and Validation Models

Slides 11: Verification and Validation Models Slides 11: Verification and Validation Models Purpose and Overview The goal of the validation process is: To produce a model that represents true behaviour closely enough for decision making purposes.

More information

Distributed similarity search algorithm in distributed heterogeneous multimedia databases

Distributed similarity search algorithm in distributed heterogeneous multimedia databases Information Processing Letters 75 (2000) 35 42 Distributed similarity search algorithm in distributed heterogeneous multimedia databases Ju-Hong Lee a,1, Deok-Hwan Kim a,2, Seok-Lyong Lee a,3, Chin-Wan

More information

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false.

ST512. Fall Quarter, Exam 1. Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. ST512 Fall Quarter, 2005 Exam 1 Name: Directions: Answer questions as directed. Please show work. For true/false questions, circle either true or false. 1. (42 points) A random sample of n = 30 NBA basketball

More information

Motivating Example. Missing Data Theory. An Introduction to Multiple Imputation and its Application. Background

Motivating Example. Missing Data Theory. An Introduction to Multiple Imputation and its Application. Background An Introduction to Multiple Imputation and its Application Craig K. Enders University of California - Los Angeles Department of Psychology cenders@psych.ucla.edu Background Work supported by Institute

More information

Ronald H. Heck 1 EDEP 606 (F2015): Multivariate Methods rev. November 16, 2015 The University of Hawai i at Mānoa

Ronald H. Heck 1 EDEP 606 (F2015): Multivariate Methods rev. November 16, 2015 The University of Hawai i at Mānoa Ronald H. Heck 1 In this handout, we will address a number of issues regarding missing data. It is often the case that the weakest point of a study is the quality of the data that can be brought to bear

More information

Interval Estimation. The data set belongs to the MASS package, which has to be pre-loaded into the R workspace prior to use.

Interval Estimation. The data set belongs to the MASS package, which has to be pre-loaded into the R workspace prior to use. Interval Estimation It is a common requirement to efficiently estimate population parameters based on simple random sample data. In the R tutorials of this section, we demonstrate how to compute the estimates.

More information

Mixed Effects Models. Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC.

Mixed Effects Models. Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC. Mixed Effects Models Biljana Jonoska Stojkova Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC March 6, 2018 Resources for statistical assistance Department of Statistics

More information

DU PhD in Physical Education

DU PhD in Physical Education DU PhD in Physical Education Topic:- DU_J18_PHD_PE 1) Classifying a person on the basis of his attribute is an example of [Question ID = 52236] 1. Discrete variable [Option ID = 88938] 2. Random Variable

More information

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes.

Resources for statistical assistance. Quantitative covariates and regression analysis. Methods for predicting continuous outcomes. Resources for statistical assistance Quantitative covariates and regression analysis Carolyn Taylor Applied Statistics and Data Science Group (ASDa) Department of Statistics, UBC January 24, 2017 Department

More information

Hands-On Data Science with R Pulling it Together into a Package

Hands-On Data Science with R Pulling it Together into a Package Hands-On Data Science with R Pulling it Together into a Package Graham.Williams@togaware.com 1st June 2016 Visit http://handsondatascience.com/ for more Chapters. In this module we review the package.skeleton()

More information

Calibrate your model!

Calibrate your model! Calibrate your model! Jonathan Rougier Department of Mathematics University of Bristol mailto:j.c.rougier@bristol.ac.uk BRISK/Cabot/CREDIBLE Summer School: Uncertainty and risk in natural hazards 7-11

More information

An Approach To ANOM Chart. Muhammad Riaz

An Approach To ANOM Chart. Muhammad Riaz An Approach To ANOM Chart Muhammad Riaz Department of tatistics, Quaid-i-Azam University, Islamabad, Pakistan E-mail: riaz76qau@yahoo.com Abstract The study proposes a scheme for the structure of Analysis

More information

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D.

Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Minitab Study Card J ENNIFER L EWIS P RIESTLEY, PH.D. Introduction to Minitab The interface for Minitab is very user-friendly, with a spreadsheet orientation. When you first launch Minitab, you will see

More information

Introductory Applied Statistics: A Variable Approach TI Manual

Introductory Applied Statistics: A Variable Approach TI Manual Introductory Applied Statistics: A Variable Approach TI Manual John Gabrosek and Paul Stephenson Department of Statistics Grand Valley State University Allendale, MI USA Version 1.1 August 2014 2 Copyright

More information

Versatile sample-size calculation using simulation

Versatile sample-size calculation using simulation The Stata Journal (2013) 13, Number 1, pp. 21 38 Versatile sample-size calculation using simulation Richard Hooper Centre for Primary Care and Public Health Queen Mary, University of London London, UK

More information

CMP 2 Grade Mathematics Curriculum Guides

CMP 2 Grade Mathematics Curriculum Guides CMP 2 Grade 7 2007 2008 Mathematics Curriculum Guides Wisconsin Mathematics Standard MPS Learning Target Wisconsin Assessment Descriptors for Mathematics Curriculum Throughout The Year A. Mathematical

More information

Data Handling. Moving from A to A* Calculate the numbers to be surveyed for a stratified sample (A)

Data Handling. Moving from A to A* Calculate the numbers to be surveyed for a stratified sample (A) Moving from A to A* A* median, quartiles and interquartile range from a histogram (A*) Draw histograms from frequency tables with unequal class intervals (A) Calculate the numbers to be surveyed for a

More information

Introduction to Mixed-Effects Models for Hierarchical and Longitudinal Data

Introduction to Mixed-Effects Models for Hierarchical and Longitudinal Data John Fox Lecture Notes Introduction to Mixed-Effects Models for Hierarchical and Longitudinal Data Copyright 2014 by John Fox Introduction to Mixed-Effects Models for Hierarchical and Longitudinal Data

More information

1 RefresheR. Figure 1.1: Soy ice cream flavor preferences

1 RefresheR. Figure 1.1: Soy ice cream flavor preferences 1 RefresheR Figure 1.1: Soy ice cream flavor preferences 2 The Shape of Data Figure 2.1: Frequency distribution of number of carburetors in mtcars dataset Figure 2.2: Daily temperature measurements from

More information

2.2.2.Relational Database concept

2.2.2.Relational Database concept Foreign key:- is a field (or collection of fields) in one table that uniquely identifies a row of another table. In simpler words, the foreign key is defined in a second table, but it refers to the primary

More information

Linear Modeling Elementary Education 4

Linear Modeling Elementary Education 4 Linear Modeling Elementary Education 4 Mathematical modeling is simply the act of building a model (usually in the form of graphs) which provides a picture of a numerical situation. Linear Modeling is

More information

Topology of the Erasmus student mobility network

Topology of the Erasmus student mobility network Topology of the Erasmus student mobility network Aranka Derzsi, Noemi Derzsy, Erna Káptalan and Zoltán Néda The Erasmus student mobility network Aim: study the network s topology (structure) and its characteristics

More information

Quick Start Instructions on using MyOMLab with the JAWS Screen Reader*

Quick Start Instructions on using MyOMLab with the JAWS Screen Reader* Quick Start Instructions on using MyOMLab with the JAWS Screen Reader* To work on assignments in MyOMLab with JAWS, follow these steps to turn on the Automatically Detect for Accessibility Setting: 1.

More information

Differentiation of Cognitive Abilities across the Lifespan. Online Supplement. Elliot M. Tucker-Drob

Differentiation of Cognitive Abilities across the Lifespan. Online Supplement. Elliot M. Tucker-Drob 1 Differentiation of Cognitive Abilities across the Lifespan Online Supplement Elliot M. Tucker-Drob This online supplement reports the results of an alternative set of analyses performed on a single sample

More information

Engineering Graphics Concept Inventory

Engineering Graphics Concept Inventory Engineering Graphics Concept Inventory Sheryl Sorby Mary Sadowski Survey https://tinyurl.com/egci-2018 Mobile device - landscape 2 Concept Inventory Workshop Creating a Concept Inventory Brainstorming

More information

Instruction: Download and Install R and RStudio

Instruction: Download and Install R and RStudio 1 Instruction: Download and Install R and RStudio We will use a free statistical package R, and a free version of RStudio. Please refer to the following two steps to download both R and RStudio on your

More information

CHAPTER 2 Modeling Distributions of Data

CHAPTER 2 Modeling Distributions of Data CHAPTER 2 Modeling Distributions of Data 2.2 Density Curves and Normal Distributions The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Density Curves

More information

Math 6 Long Range Plans Bill Willis. Strand: NUMBER Develop number sense. Textbook: Math Makes Sense 6

Math 6 Long Range Plans Bill Willis. Strand: NUMBER Develop number sense. Textbook: Math Makes Sense 6 Math 6 Long Range Plans 2012-2013 Bill Willis Rationale: Based upon the mathematics program of studies, our learning environment will value and respect the diversity of students experiences and ways of

More information

Introduction to R and RStudio IDE

Introduction to R and RStudio IDE Introduction to R and RStudio IDE Wan Nor Arifin Unit of Biostatistics and Research Methodology, Universiti Sains Malaysia. email: wnarifin@usm.my December 19, 2018 Wan Nor Arifin (USM) Introduction to

More information

Package ClusterBootstrap

Package ClusterBootstrap Package ClusterBootstrap June 26, 2018 Title Analyze Clustered Data with Generalized Linear Models using the Cluster Bootstrap Date 2018-06-26 Version 1.0.0 Provides functionality for the analysis of clustered

More information

adjacent angles Two angles in a plane which share a common vertex and a common side, but do not overlap. Angles 1 and 2 are adjacent angles.

adjacent angles Two angles in a plane which share a common vertex and a common side, but do not overlap. Angles 1 and 2 are adjacent angles. Angle 1 Angle 2 Angles 1 and 2 are adjacent angles. Two angles in a plane which share a common vertex and a common side, but do not overlap. adjacent angles 2 5 8 11 This arithmetic sequence has a constant

More information

Condence Intervals about a Single Parameter:

Condence Intervals about a Single Parameter: Chapter 9 Condence Intervals about a Single Parameter: 9.1 About a Population Mean, known Denition 9.1.1 A point estimate of a parameter is the value of a statistic that estimates the value of the parameter.

More information

Blending of Probability and Convenience Samples:

Blending of Probability and Convenience Samples: Blending of Probability and Convenience Samples: Applications to a Survey of Military Caregivers Michael Robbins RAND Corporation Collaborators: Bonnie Ghosh-Dastidar, Rajeev Ramchand September 25, 2017

More information

Handbook of Statistical Modeling for the Social and Behavioral Sciences

Handbook of Statistical Modeling for the Social and Behavioral Sciences Handbook of Statistical Modeling for the Social and Behavioral Sciences Edited by Gerhard Arminger Bergische Universität Wuppertal Wuppertal, Germany Clifford С. Clogg Late of Pennsylvania State University

More information

A Quick Introduction to R

A Quick Introduction to R Math 4501 Fall 2012 A Quick Introduction to R The point of these few pages is to give you a quick introduction to the possible uses of the free software R in statistical analysis. I will only expect you

More information

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem.

STAT 2607 REVIEW PROBLEMS Word problems must be answered in words of the problem. STAT 2607 REVIEW PROBLEMS 1 REMINDER: On the final exam 1. Word problems must be answered in words of the problem. 2. "Test" means that you must carry out a formal hypothesis testing procedure with H0,

More information

Lab 5 - Risk Analysis, Robustness, and Power

Lab 5 - Risk Analysis, Robustness, and Power Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors

More information

SAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples

SAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples Paper SD-03 SAS Macros CORR_P and TANGO: Interval Estimation for the Difference Between Correlated Proportions in Dependent Samples Patricia Rodríguez de Gil, Jeanine Romano Thanh Pham, Diep Nguyen, Jeffrey

More information

Introduction to R Programming

Introduction to R Programming Course Overview Over the past few years, R has been steadily gaining popularity with business analysts, statisticians and data scientists as a tool of choice for conducting statistical analysis of data

More information

Macros and ODS. SAS Programming November 6, / 89

Macros and ODS. SAS Programming November 6, / 89 Macros and ODS The first part of these slides overlaps with last week a fair bit, but it doesn t hurt to review as this code might be a little harder to follow. SAS Programming November 6, 2014 1 / 89

More information

Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No.

Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No. Computation for the Introduction to MCMC Chapter of Handbook of Markov Chain Monte Carlo By Charles J. Geyer Technical Report No. 679 School of Statistics University of Minnesota July 29, 2010 Abstract

More information

rpms: An R Package for Modeling Survey Data with Regression Trees

rpms: An R Package for Modeling Survey Data with Regression Trees rpms: An R Package for Modeling Survey Data with Regression Trees Daniell Toth U.S. Bureau of Labor Statistics Abstract In this article, we introduce the R package, rpms (Recursive Partitioning for Modeling

More information

Eight units must be completed and passed to be awarded the Diploma.

Eight units must be completed and passed to be awarded the Diploma. Diploma of Computing Course Outline Campus Intake CRICOS Course Duration Teaching Methods Assessment Course Structure Units Melbourne Burwood Campus / Jakarta Campus, Indonesia March, June, October 022638B

More information

Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming

Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Michael Rosenblum Johns Hopkins Bloomberg School of Public Health Joint work with Xingyuan (Ethan) Fang and Han Liu Princeton

More information

Long Term Maths Plan 2013/2014 Year 6

Long Term Maths Plan 2013/2014 Year 6 Long Term Maths Plan 2013/2014 Year 6 Knowledge, skills and understanding 1. During the key stage, pupils should be taught the Knowledge, skills and understanding through: a. activities that extend their

More information

Applied Regression Modeling: A Business Approach

Applied Regression Modeling: A Business Approach i Applied Regression Modeling: A Business Approach Computer software help: SAS SAS (originally Statistical Analysis Software ) is a commercial statistical software package based on a powerful programming

More information

Airborne discrete return LiDAR data was collected on September 3-4, 2007 by

Airborne discrete return LiDAR data was collected on September 3-4, 2007 by SUPPLEMENTAL MATERIAL 2 LiDAR Specifications Airborne discrete return LiDAR data was collected on September 3-4, 2007 by Watershed Sciences, Inc. (Corvallis, Oregon USA). LiDAR was collected approximately

More information

Scientific Calculators

Scientific Calculators Science Studio is delighted to endorse a wide range of Texas Instruments technology including calculators, handheld devices, computer software and wireless systems. For further details and pricing, please

More information

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users BIOSTATS 640 Spring 2018 Review of Introductory Biostatistics STATA solutions Page 1 of 13 Key Comments begin with an * Commands are in bold black I edited the output so that it appears here in blue Unit

More information

Mathematics / Mathématiques

Mathematics / Mathématiques Mathematics / Mathématiques Test Description The Grade 9 Mathematics Achievement Test consists of two parts: Part A contains 20 numerical-response questions and was designed to be completed in 20 minutes.

More information