Sample Size calculations for Stepped Wedge Trials

Similar documents
POWER AND SAMPLE SIZE DETERMINATION FOR STEPPED WEDGE CLUSTER RANDOMIZED TRIALS

Package swcrtdesign. February 12, 2018

INLA: an introduction

GAMs semi-parametric GLMs. Simon Wood Mathematical Sciences, University of Bath, U.K.

Creating an R Package

Randolph E. Bucklin Catarina Sismeiro The Anderson School at UCLA. Overview. Introduction/Clickstream Research Conceptual Framework

Smoothing and Forecasting Mortality Rates with P-splines. Iain Currie. Data and problem. Plan of talk

An imputation approach for analyzing mixed-mode surveys

Clustering and The Expectation-Maximization Algorithm

10-701/15-781, Fall 2006, Final

Problem 1 (20 pt) Answer the following questions, and provide an explanation for each question.

STOCHASTIC USER EQUILIBRIUM TRAFFIC ASSIGNMENT WITH MULTIPLE USER CLASSES AND ELASTIC DEMAND

The linear mixed model: modeling hierarchical and longitudinal data

A Deterministic Global Optimization Method for Variational Inference

Introduction to Mixed Models: Multivariate Regression

Modelling Personalized Screening: a Step Forward on Risk Assessment Methods

Rolling Markov Chain Monte Carlo

CALIBRATION BETWEEN DEPTH AND COLOR SENSORS FOR COMMODITY DEPTH CAMERAS. Cha Zhang and Zhengyou Zhang

Patient Safety Initiatives Implementation (WP5): Presentation of Main Outcomes

INLA: Integrated Nested Laplace Approximations

Types of missingness and common strategies

DATA ANALYSIS USING HIERARCHICAL GENERALIZED LINEAR MODELS WITH R

Recursion. COMS W1007 Introduction to Computer Science. Christopher Conway 26 June 2003

Creating summary tables using the sumtable command

Learning a Manifold as an Atlas Supplementary Material

Package samplingdatacrt

Constrained Optimal Sample Allocation in Multilevel Randomized Experiments Using PowerUpR

Non-Linearity of Scorecard Log-Odds

Template text for search methods: New reviews

Hidden Markov Models in the context of genetic analysis

The impact of patient, intervention, comparison, outcome (PICO) as a search strategy tool on literature search quality: a systematic review

Real-world bandit applications: Bridging the gap between theory and practice

Searching the Cochrane Library

Applied Longitudinal Analysis, 2nd Edition

Over-contribution in discretionary databases

Industrialising Small Area Estimation at the Australian Bureau of Statistics

r v i e w o f s o m e r e c e n t d e v e l o p m

Design and Analysis Methods for Cluster Randomized Trials with Pair-Matching on Baseline Outcome: Reduction of Treatment Effect Variance

ipads and IPs: Maximizing the Use of Mobile Technology within Infection Prevention and Healthcare Epidemiology September 26 th, 2012

Link Mining & Entity Resolution. Lise Getoor University of Maryland, College Park

Missing Data and Imputation

Rolling Markov Chain Monte Carlo

Detecting changes in streaming data using adaptive estimation

Expectation-Maximization. Nuno Vasconcelos ECE Department, UCSD

Graphical Models & HMMs

for Images A Bayesian Deformation Model

MODEL SELECTION AND MODEL AVERAGING IN THE PRESENCE OF MISSING VALUES

A step by step Guide to Searching the Cochrane database

Context-sensitive Classification Forests for Segmentation of Brain Tumor Tissues

Non-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines

Asreml-R: an R package for mixed models using residual maximum likelihood

3ie evidence gap maps platform Automatic importer guide

Statistical Analysis of the 3WAY Block Cipher

Distribution of the Number of Encryptions in Revocation Schemes for Stateless Receivers

Non-Parametric and Semi-Parametric Methods for Longitudinal Data

Global modelling of air pollution using multiple data sources

Mixture Models and the EM Algorithm

STATA 13 INTRODUCTION

STIR. Software for Tomographic Image Reconstruction. Kris Thielemans

POSTGRADUATE DIPLOMA IN RESEARCH METHODS

Recent advances in Metamodel of Optimal Prognosis. Lectures. Thomas Most & Johannes Will

Statistical Modeling of Neuroimaging Data: Targeting Activation, Task-Related Connectivity, and Prediction

Model-based Recursive Partitioning for Subgroup Analyses

Analyzing Longitudinal Data Using Regression Splines

SVM in Analysis of Cross-Sectional Epidemiological Data Dmitriy Fradkin. April 4, 2005 Dmitriy Fradkin, Rutgers University Page 1

Numerical-Asymptotic Approximation at High Frequency

Statistical modelling with missing data using multiple imputation. Session 2: Multiple Imputation

Linear Regression QuadraJc Regression Year

Facilitating Data Discovery through a Semantic Data Catalog

1 : Introduction to GM and Directed GMs: Bayesian Networks. 3 Multivariate Distributions and Graphical Models

Motivating Example. Missing Data Theory. An Introduction to Multiple Imputation and its Application. Background

Remember to SHOW ALL STEPS. You must be able to solve analytically. Answers are shown after each problem under A, B, C, or D.

REGRESSION ANALYSIS : LINEAR BY MAUAJAMA FIRDAUS & TULIKA SAHA

Bayes Factor Single Arm Binary User s Guide (Version 1.0)

Module 4. Non-linear machine learning econometrics: Support Vector Machine

A Dendrogram. Bioinformatics (Lec 17)

A Neuro Probabilistic Language Model Bengio et. al. 2003

Performance of Sequential Imputation Method in Multilevel Applications

MATH3016: OPTIMIZATION

R-Square Coeff Var Root MSE y Mean

Chapter 7: Numerical Prediction

Analysis of Imputation Methods for Missing Data. in AR(1) Longitudinal Dataset

Supplementary methods

Missing Data and Imputation

HANDLING MISSING DATA

Multidimensional Item Response Theory (MIRT) University of Kansas Item Response Theory Stats Camp 07

(Not That) Advanced Hierarchical Models

Clustering: Classic Methods and Modern Views

Missing Data. Where did it go?

Bayesian Analysis of Differential Gene Expression

Edge-exchangeable graphs and sparsity

60 2 Convex sets. {x a T x b} {x ã T x b}

Spring 2011 CSE Qualifying Exam Core Subjects. February 26, 2011

Multiple Imputation with Mplus

UK COSMOS PARTICIPANT NEWSLETTER FEBRUARY 2014

7.2. The Standard Normal Distribution

Modelling Proportions and Count Data

for Fast Clustering of Data on the Sphere

Lecture 3: Geometric and Signal 3D Processing (and some Visualization)

Visualization and text mining of patent and non-patent data

Transcription:

Sample Size calculations for Stepped Wedge Trials Gianluca Baio University College London Department of Statistical Science gianluca@stats.ucl.ac.uk (Joint work with Rumana Omar, Andrew Copas, Emma Beard, James Hargreaves and Gareth Ambler) Stepped Wedge Trials Symposium London School of Hygiene & Tropical Medicine London, 22 September 2015 Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 1 / 10

Objectives of the project/paper Work supported by a NIHR Research Methods Opportunity Funding Scheme Grant (RMOFS-2013-03-02) Critically investigate the conditions under which applying a stepped wedge design can result in potential gains in terms of Efficiency Statistical power Financial/ethical implications Produce a toolbox to perform power calculations Simulation-based approach Extension to more general models Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 2 / 10

Objectives of the project/paper Work supported by a NIHR Research Methods Opportunity Funding Scheme Grant (RMOFS-2013-03-02) Critically investigate the conditions under which applying a stepped wedge design can result in potential gains in terms of Efficiency Statistical power Financial/ethical implications Produce a toolbox to perform power calculations Simulation-based approach Extension to more general models Have lots of fun working in the Special Issue Crew! Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 2 / 10

Modelling & sample size calculations Analytical formulæ Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 3 / 10

Modelling & sample size calculations Analytical formulæ Hussey & Hughes (2007) + Reprise : Hughes et al (2015) Specifically for cross-sectional data. Defines cluster- and time-specific average outcome as µ ij = µ + α i + β j + X ijθ Can compute ( ) θ Power = Φ z α/2 V (θ) where V (θ) = f(x, I, J, σ 2 e, σ 2 α) Can use asymptotic normality, eg for binary or count outcomes Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 3 / 10

Modelling & sample size calculations Analytical formulæ Hussey & Hughes (2007) + Reprise : Hughes et al (2015) Specifically for cross-sectional data. Defines cluster- and time-specific average outcome as µ ij = µ + α i + β j + X ijθ Can compute ( ) θ Power = Φ z α/2 V (θ) where V (θ) = f(x, I, J, σ 2 e, σ 2 α) Can use asymptotic normality, eg for binary or count outcomes Design Effect (DE) Wortman et al (2013) + Hemming & Taljaard (2015) Compute inflation factor to account for induce correlation and re-scale sample size for a parallel RCT Based on Hussey and Hughes (2007) cross-sectional data Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 3 / 10

Modelling & sample size calculations Analytical formulæ Hussey & Hughes (2007) + Reprise : Hughes et al (2015) Specifically for cross-sectional data. Defines cluster- and time-specific average outcome as µ ij = µ + α i + β j + X ijθ Can compute ( ) θ Power = Φ z α/2 V (θ) where V (θ) = f(x, I, J, σ 2 e, σ 2 α) Can use asymptotic normality, eg for binary or count outcomes Design Effect (DE) Wortman et al (2013) + Hemming & Taljaard (2015) Compute inflation factor to account for induce correlation and re-scale sample size for a parallel RCT Based on Hussey and Hughes (2007) cross-sectional data Some generalisations (Hemming et al 2014) Multiple layers of clustering + incomplete SWT Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 3 / 10

Modelling & sample size calculations Simulation-based calculations Baio et al (2015) Can directly model different types of outcomes (eg binary or counts) The linear predictor is just defined using a suitable transformation g( ) Can extend model to account for specific features of the SWT Repeated measurements (eg closed-cohort) add extra random effect v ik Normal(0, σ 2 v) Specify time trends (eg quadratic or polynomial) Include cluster-specific intervention effects X ij(θ + u i) with u i Normal(0, σ 2 u) Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 4 / 10

Modelling & sample size calculations Simulation-based calculations Baio et al (2015) Can directly model different types of outcomes (eg binary or counts) The linear predictor is just defined using a suitable transformation g( ) Can extend model to account for specific features of the SWT Repeated measurements (eg closed-cohort) add extra random effect v ik Normal(0, σ 2 v) Specify time trends (eg quadratic or polynomial) Include cluster-specific intervention effects X ij(θ + u i) with u i Normal(0, σ 2 u) Helps alignment of design and analysis model This is one of the issues identified by the literature review More flexibility at design stage to match complexity of data generating process as well as analysis model (mixed effects, GEE, etc) Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 4 / 10

Simulation-based vs analytical calculations ICC Analytical power based on HH Simulation-based calculations K = 20, J = 6 K = 20, J = 6 Continuous outcome a 0 9 9 0.1 13 13 0.2 14 13 0.3 14 14 0.4 14 14 0.5 14 14 Binary outcome b 0 11 15 0.1 17 16 0.2 18 17 0.3 18 18 0.4 18 18 0.5 18 18 Count outcome c 0 8 8 0.1 13 12 0.2 13 12 0.3 13 12 0.4 13 11 0.5 13 11 a Intervention effect = 0.3785; σe = 1.55. b Baseline outcome probability = 0.26; OR = 0.56. c Baseline outcome rate = 1.5; RR = 0.8. Notation: K = number of subjects per cluster; J = total number of time points, including one baseline. The cells in the table are the estimated number of clusters as a function of the ICC and outcome type, to obtain 80% power Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 5 / 10

Cross-sectional vs closed-cohort data Effect size & ICC Continuous outcome Cross-sectional Closed-cohort I = 25 clusters, each with K = 20 subjects; J = 6 time points ( measurements) including one baseline Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 6 / 10

Cross-sectional vs closed-cohort data Number of steps Binary outcome Cross-sectional Closed-cohort I = 24 clusters, each with K = 20 subjects; individual-level ICC= 0.0016 for closed-cohort Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 7 / 10

R package SWSamp Will allow the user to run simulations for a set of basic models Cross-sectional + closed-cohort data Continuous (normal), binary and count outcome Provide template for custom data-generating models Include Bayesian alternative (based on INLA) Comparable computational time to REML Can use default priors but can also customise Explore issues with open-cohorts & time-to-event outcomes Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 8 / 10

R package SWSamp Will allow the user to run simulations for a set of basic models Cross-sectional + closed-cohort data Continuous (normal), binary and count outcome Provide template for custom data-generating models Include Bayesian alternative (based on INLA) Comparable computational time to REML Can use default priors but can also customise Explore issues with open-cohorts & time-to-event outcomes Can use the name Samp... Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 8 / 10

(Only a very few!) References Baio G, Copas A, Ambler G, Hargreaves J, Beard E, Omar R. (2015) Sample size calculations for a stepped wedge trial. Trials. 16:354. doi: 10.1186/s13063-015-0840-9 Hemming K, Lilford R, Girling A. (2014) Stepped-wedge cluster randomised controlled trials: a generic framework including parallel and multiple-level design. Statistics in Medicine. 34(2):181 196. doi: 10.1002/sim.6325 Hemming K, Taljaard M. (2015) Relative efficiencies of stepped wedge and cluster randomized trials were easily compared using a unified approach. Journal of Clinical Epidemiology. doi: 10.1016/j.jclinepi.2015.08.015. Hussey M and Hughes J. (2007). Design and analysis of stepped wedge cluster randomized trials. Contemporary Clinical Trials. 28:182 191 Hughes J, Granston T and Heagerty, P. (2015). Current issues in the design and analysis of stepped wedge trials. Contemporary Clinical Trials. doi: 10.1016/j.cct.2015.07.006 Woertman W, de Hoopa E, Moerbeek M, Zuidemac S, Gerritsen D and Teerenstra S. (2013). Stepped wedge designs could reduce the required sample size in cluster randomized trials. Journal of Clinical Epidemiology. 66:752 758 Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 9 / 10

Thank you! Gianluca Baio ( UCL) Sample size calculations for SWTs LSHTM Symposium, 22 Sep 2015 10 / 10