Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming
|
|
- Grant Powell
- 6 years ago
- Views:
Transcription
1 Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Michael Rosenblum Johns Hopkins Bloomberg School of Public Health Joint work with Xingyuan (Ethan) Fang and Han Liu Princeton University Working paper giving full results available here: October, 05
2 Problem Motivation Goal: Testing Treatment Effects in Two Subpopulations and the Overall Population in Adaptive Enrichment Designs Example: Treating resistant HIV. Recent HIV drugs (maraviroc, raltegravir) have shown stronger benefit in those with lower phenotypic sensitivity to background therapy. We assume two, predefined, subpopulations that partition the overall population.
3 Multiple Testing Problem: Null Hypotheses Definition Define three treatment effects of interest: : Mean Treatment Effect for Subpopulation (i.e., difference between population mean of the primary outcome under treatment and under control) : Mean Treatment Effect for Subpopulation C = p + ( p ) : Mean Treatment Effect for Combined Population
4 Multiple Testing Problem: Null Hypotheses Definition Define three treatment effects of interest: : Mean Treatment Effect for Subpopulation (i.e., difference between population mean of the primary outcome under treatment and under control) : Mean Treatment Effect for Subpopulation C = p + ( p ) : Mean Treatment Effect for Combined Population Goal: construct adaptive enrichment design D and multiple testing procedure M for: H 0 : 0, H 0 : 0, H 0C : p + ( p ) 0, that strongly controls familywise Type I error rate, and is optimal in sense defined below.
5 General Two-Stage Adaptive Enrichment Design Stage Decision Stage Hyp.Test Recruit Both Popula)ons Subpopula)on Subpopula)on D... Subpopula)on : Subpopula)on : Subpopula)on : Subpopula)on : (D=) n (D=) n (D=) n (D=) n M K Subpopula)on : Subpopula)on : (D=K ) n (D=K ) n
6 Example Two-Stage Adaptive Enrichment Design Stage Decision Stage Stop trial Hyp.Test Recruit Both Popula7ons Subpop Subpop D Recruit Both Pop. Subpop Subpop Recruit Only Subpop. T > T? Subpopula7on M 4 Recruit Only Subpop. Subpopula7on
7 Example Two-Stage Adaptive Enrichment Design n = total sample size if both subpopulations enrolled in stage. Stage Decision Stage Stop trial Hyp.Test Recruit Both Popula7ons Subpop Subpop } n D 4 } Recruit Both Pop. Subpop n Subpop Recruit Only Subpop. T > T? Subpopula7on } n 4 M } Recruit Only Subpop. n Subpopula7on 4
8 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C.
9 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage.
10 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage. Decision rule D is map from Z () = (Z (), Z () ) to possible decisions D. Multiple testing procedure M is map from Z (F ) = (Z (F ), Z (F ) ) and decision D to set of null hypotheses rejected (if any).
11 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage. Decision rule D is map from Z () = (Z (), Z () ) to possible decisions D. Multiple testing procedure M is map from Z (F ) = (Z (F ), Z (F ) ) and decision D to set of null hypotheses rejected (if any). Power at alternative, to reject H 0 is Pr (, )[M(Z (F ), D(Z () )) rejects H 0 ]. User specifies: (i) loss function L(D, M;, ), e.g., total sample size; and (ii) distribution Λ on alternatives (, ).
12 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )).
13 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )). Constrained Bayes Opt. Problem: Find pair (D, M) minimizing: E (, )[L(D, M;, )]dλ(, ), under familywise Type I error constraints: sup Pr (, )[M rejects any true null hypothesis] α, (, ) R
14 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )). Constrained Bayes Opt. Problem: Find pair (D, M) minimizing: E (, )[L(D, M;, )]dλ(, ), under familywise Type I error constraints: sup Pr (, )[M rejects any true null hypothesis] α, (, ) R and power constraints: Pr ( min,0)[m rejects H 0 ] β. Pr (0, min )[M rejects H 0 ] β. Pr ( min, min )[M rejects H 0C ] β C.
15 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).)
16 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).) Our solution: Discretize decision region R into small rectangles R; for any r R, enforce decision rule D makes same decision for any (Z (), Z () ) r. For each decision d {,..., K}, discretize rejection regions R into small rectangles R d ; for any r R d, enforce that if D = d, multiple testing procedure M rejects same set of hypotheses for any (Z (F ), Z (F ) ) r. Discretize Type I error constraints into fine grid on boundaries of null spaces.
17 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).) Our solution: Discretize decision region R into small rectangles R; for any r R, enforce decision rule D makes same decision for any (Z (), Z () ) r. For each decision d {,..., K}, discretize rejection regions R into small rectangles R d ; for any r R d, enforce that if D = d, multiple testing procedure M rejects same set of hypotheses for any (Z (F ), Z (F ) ) r. Discretize Type I error constraints into fine grid on boundaries of null spaces. Discretized opt. problem is not convex. However, we construct reparametrization that is sparse, linear program: max c T x s.t. Ax b. x We apply advanced optimization methods to solve this.
18 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ).
19 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ). Sample size n is min s.t. standard design satisfies each power constraint at β j = 0.64 for j {,, C}. Set each β j = 0.8. Stage Decision Stage Stop trial Hyp.Test M(Z (F),) Recruit Both Popula7ons Subpop Subpop } n D(Z () ) 4 Recruit Both Pop. Subpop n Subpop Recruit Only Subpop. Subpopula7on T > T? } } } n 4 M(Z (F),) M(Z (F),) Recruit Only Subpop. n Subpopula7on 4 M(Z (F),4)
20 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ). Sample size n is min s.t. standard design satisfies each power constraint at β j = 0.64 for j {,, C}. Set each β j = 0.8.? T T > Subpopula)on Stage Decision Stage Hyp.Test Subpopula)on Subpopula)on Subpopula)on Recruit Both Popula)ons Recruit Only Subpop. Recruit Only Subpop. Recruit Both Pop. Subpopula)on Subpopula)on Stop trial D 4 M Z Z 0 0
21 Approximately Optimal Solution Decision Rule for Stage Enrollment: Z Z 0 0 Rejection Regions under Each Decision: (in terms of Z (F ), Z (F ) ) D = "" D = "" D = "" D = ""
22 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0 H0, H0C H0, H0C H0
23 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0 H0, H0C H0, H0C H0
24 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0
25 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0
26 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0
27 Power Comparison Compare to adaptive enrichment design using p-value combination approach (Bauer and Köhne, 994), with Dunnett intersection test and inverse-normal combination function. Early stopping is incorporated using O Brien-Fleming boundaries for each intersection null hypothesis. Decision rule for stage : if combined population statistic (Z () + Z () )/ > t c, enroll both subpop. else, enroll from each subpopulation s for which Z s () > t. Consider β = β = β = β C. For each power threshold β, we optimized over t, t c to minimize expected sample size under the power constraints. n = total sample size if both enrolled stage. Table: Minimum of ESS dλ, as power constraint β varied. Required Power β: 70% 74% 78% 8% Comparator 0.97n.0n infeasible infeasible Optimal 0.79n 0.84n 0.9n.0n
28 References Rosenblum, M., Fang, X., and Liu, H., Optimal, Two Stage, Adaptive Enrichment Designs for Randomized Trials Using Sparse Linear Programming (04). Johns Hopkins University, Dept. of Biostatistics Working Papers. Working Paper 7. Rosenblum, M., Liu, H., and Yen, E.-H. (04), Optimal Tests of Treatment Effects for the Overall Population and Two Subpopulations in Randomized Trials, using Sparse Linear Programming, Journal of American Statistical Association, Theory and Methods Section, Volume 09. Issue Rosenblum, M. (In Press), Adaptive Randomized Trial Designs that Cannot be Dominated by Any Standard Design at the Same Total Sample Size. Biometrika.
29 Collaborators Collaborators Ethan X. Fang Han Liu Princeton University
30 Thank you! Acknowledgement: This research was supported by the U.S. Food and Drug Administration (HHSF0400C) and the Patient- Centered Outcomes Research Institute (ME ). This work is solely the responsibility of the authors and does not represent the views of these agencies.
Optimal designs for comparing curves
Optimal designs for comparing curves Holger Dette, Ruhr-Universität Bochum Maria Konstantinou, Ruhr-Universität Bochum Kirsten Schorning, Ruhr-Universität Bochum FP7 HEALTH 2013-602552 Outline 1 Motivation
More informationBayes Factor Single Arm Binary User s Guide (Version 1.0)
Bayes Factor Single Arm Binary User s Guide (Version 1.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402, USA November
More informationFrequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS
MWSUG 2016 - Paper PH06 Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS Kechen Zhao, University of Southern
More informationTables for Statistical Significance with Multiple Comparisons
Tables for Statistical Significance with Multiple Comparisons J. Scott Armstrong The Wharton School, University of Pennsylvania Philadelphia, Pa. 19104 Reprinted from Armstrong, J. Scott, Long-Range Forecasting.
More informationEquivalence of dose-response curves
Equivalence of dose-response curves Holger Dette, Ruhr-Universität Bochum Kathrin Möllenhoff, Ruhr-Universität Bochum Stanislav Volgushev, University of Toronto Frank Bretz, Novartis Basel FP7 HEALTH 2013-602552
More informationLecture 14. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
geometric Lecture 14 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 19, 2007 geometric 1 2 3 4 geometric 5 6 7 geometric 1 Review about logs
More informationThe fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R
Journal of Machine Learning Research (2013) Submitted ; Published The fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R Haotian Pang Han Liu Robert Vanderbei Princeton
More informationAutomatic Selection of Compiler Options Using Non-parametric Inferential Statistics
Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Masayo Haneda Peter M.W. Knijnenburg Harry A.G. Wijshoff LIACS, Leiden University Motivation An optimal compiler optimization
More informationTTEDesigner User s Manual
TTEDesigner User s Manual John D. Cook Department of Biostatistics, Box 447 The University of Texas, M. D. Anderson Cancer Center 1515 Holcombe Blvd., Houston, Texas 77030, USA cook@mdanderson.org September
More informationBayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis
Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Xavier Le Faucheur a, Brani Vidakovic b and Allen Tannenbaum a a School of Electrical and Computer Engineering, b Department of Biomedical
More informationAn R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation
An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation Xingguo Li Tuo Zhao Xiaoming Yuan Han Liu Abstract This paper describes an R package named flare, which implements
More informationAn R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation
An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation Xingguo Li Tuo Zhao Xiaoming Yuan Han Liu Abstract This paper describes an R package named flare, which implements
More informationgsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual
gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual Keaven M. Anderson, Dan (Jennifer) Sun, Zhongxin (John) Zhang Merck Research Laboratories December 18, 2008 Abstract
More informationThe flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R
Journal of Machine Learning Research 6 (205) 553-557 Submitted /2; Revised 3/4; Published 3/5 The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R Xingguo Li Department
More informationTwo-Stage Designs for Phase II Clinical Trials. by Chung-Li Tsai Advisor Kam-Fai Wong
Two-Stage Designs for Phase II Clinical Trials by Chung-Li Tsai Advisor Kam-Fai Wong Institute of Statistics National University of Kaohsiung Kaohsiung, Taiwan 811, R.O.C. July 2006 Contents List of Figures
More informationGene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients
1 Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1,2 Keyue Ding, Ph.D. Nov. 8, 2014 1 NCIC Clinical Trials Group, Kingston, Ontario, Canada 2 Dept. Public
More informationVideo shot segmentation using late fusion technique
Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,
More informationIntroductory Concepts for Voxel-Based Statistical Analysis
Introductory Concepts for Voxel-Based Statistical Analysis John Kornak University of California, San Francisco Department of Radiology and Biomedical Imaging Department of Epidemiology and Biostatistics
More informationMATH : EXAM 3 INFO/LOGISTICS/ADVICE
MATH 3342-004: EXAM 3 INFO/LOGISTICS/ADVICE INFO: WHEN: Friday (04/22) at 10:00am DURATION: 50 mins PROBLEM COUNT: Appropriate for a 50-min exam BONUS COUNT: At least one TOPICS CANDIDATE FOR THE EXAM:
More informationStatistics. Taskmistress or Temptress?
Statistics Taskmistress or Temptress? Naomi Altman Penn State University Dec 2013 Altman (Penn State) Reproducible Research December 9, 2013 1 / 24 Recent Headlines Altman (Penn State) Reproducible Research
More informationECLT 5810 Data Preprocessing. Prof. Wai Lam
ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate
More informationFast or furious? - User analysis of SF Express Inc
CS 229 PROJECT, DEC. 2017 1 Fast or furious? - User analysis of SF Express Inc Gege Wen@gegewen, Yiyuan Zhang@yiyuan12, Kezhen Zhao@zkz I. MOTIVATION The motivation of this project is to predict the likelihood
More informationModel-based Recursive Partitioning for Subgroup Analyses
EBPI Epidemiology, Biostatistics and Prevention Institute Model-based Recursive Partitioning for Subgroup Analyses Torsten Hothorn; joint work with Heidi Seibold and Achim Zeileis 2014-12-03 Subgroup analyses
More information9.2 Types of Errors in Hypothesis testing
9.2 Types of Errors in Hypothesis testing 1 Mistakes we could make As I mentioned, when we take a sample we won t be 100% sure of something because we do not take a census (we only look at information
More informationIntroduction to Stochastic Combinatorial Optimization
Introduction to Stochastic Combinatorial Optimization Stefanie Kosuch PostDok at TCSLab www.kosuch.eu/stefanie/ Guest Lecture at the CUGS PhD course Heuristic Algorithms for Combinatorial Optimization
More informationSpatial biosurveillance
Spatial biosurveillance Authors of Slides Andrew Moore Carnegie Mellon awm@cs.cmu.edu Daniel Neill Carnegie Mellon d.neill@cs.cmu.edu Slides and Software and Papers at: http://www.autonlab.org awm@cs.cmu.edu
More informationLeveraging multiple endpoints in small clinical trials
Leveraging multiple endpoints in small clinical trials Robin Ristl Section for Medical Statistics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna EMA FP7
More informationPackage sp23design. R topics documented: February 20, Type Package
Type Package Package sp23design February 20, 2015 Title Design and Simulation of seamless Phase II-III Clinical Trials Version 0.9 Date 2012-01-01 Author Balasubramanian Narasimhan [aut, cre], Mei-Chiung
More informationEquivalence Tests for Two Means in a 2x2 Cross-Over Design using Differences
Chapter 520 Equivalence Tests for Two Means in a 2x2 Cross-Over Design using Differences Introduction This procedure calculates power and sample size of statistical tests of equivalence of the means of
More informationWeek 4: Simple Linear Regression II
Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties
More informationInteractive Treatment Planning in Cancer Radiotherapy
Interactive Treatment Planning in Cancer Radiotherapy Mohammad Shakourifar Giulio Trigila Pooyan Shirvani Ghomi Abraham Abebe Sarah Couzens Laura Noreña Wenling Shang June 29, 212 1 Introduction Intensity
More informationProject Paper Introduction
Project Paper Introduction Tracey Marsh Group Health Research Institute University of Washington, Department of Biostatistics Paper Estimating and Projecting Trends in HIV/AIDS Generalized Epidemics Using
More informationLecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem
Computational Learning Theory Fall Semester, 2012/13 Lecture 10: SVM Lecturer: Yishay Mansour Scribe: Gitit Kehat, Yogev Vaknin and Ezra Levin 1 10.1 Lecture Overview In this lecture we present in detail
More informationR-Square Coeff Var Root MSE y Mean
STAT:50 Applied Statistics II Exam - Practice 00 possible points. Consider a -factor study where each of the factors has 3 levels. The factors are Diet (,,3) and Drug (A,B,C) and there are n = 3 observations
More informationAdvanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs
Advanced Operations Research Techniques IE316 Quiz 1 Review Dr. Ted Ralphs IE316 Quiz 1 Review 1 Reading for The Quiz Material covered in detail in lecture. 1.1, 1.4, 2.1-2.6, 3.1-3.3, 3.5 Background material
More informationBootstrapping Methods
Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods
More informationManaging Uncertainty in Data Streams. Aleka Seliniotaki Project Presentation HY561 Heraklion, 22/05/2013
Managing Uncertainty in Data Streams Aleka Seliniotaki Project Presentation HY561 Heraklion, 22/05/2013 Introduction Uncertain Data Streams T V Data: incomplete, imprecise, misleading Results: unknown
More informationLinear Programming Duality and Algorithms
COMPSCI 330: Design and Analysis of Algorithms 4/5/2016 and 4/7/2016 Linear Programming Duality and Algorithms Lecturer: Debmalya Panigrahi Scribe: Tianqi Song 1 Overview In this lecture, we will cover
More informationAnalysis of variance - ANOVA
Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try
More informationFast Calculation of Calendar Time-, Ageand Duration Dependent Time at Risk in the Lexis Space
Fast Calculation of Calendar Time-, Ageand Duration Dependent Time at Risk in the Lexis Space arxiv:1204.0798v1 [stat.me] 3 Apr 2012 Ralph Brinks Institute for Biometry and Epidemiology German Diabetes
More informationCamera Motion Identification in the Rough Indexing Paradigm
Camera Motion Identification in the Rough Indexing aradigm etra KRÄMER and Jenny BENOIS-INEAU LaBRI University Bordeaux I, France {petra.kraemer,jenny.benois}@labri.fr Camera Motion Identification in the
More informationParameterization of triangular meshes
Parameterization of triangular meshes Michael S. Floater November 10, 2009 Triangular meshes are often used to represent surfaces, at least initially, one reason being that meshes are relatively easy to
More informationIterative CT Reconstruction Using Curvelet-Based Regularization
Iterative CT Reconstruction Using Curvelet-Based Regularization Haibo Wu 1,2, Andreas Maier 1, Joachim Hornegger 1,2 1 Pattern Recognition Lab (LME), Department of Computer Science, 2 Graduate School in
More information2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk
In-Memory Performance Durability of Disk Meeting the Challenges of Fast Data in Healthcare with In-Memory Technologies Akmal Chaudhri Technology Evangelist GridGain Agenda Introduction Fast Data in Healthcare
More informationLecture 26: Missing data
Lecture 26: Missing data Reading: ESL 9.6 STATS 202: Data mining and analysis December 1, 2017 1 / 10 Missing data is everywhere Survey data: nonresponse. 2 / 10 Missing data is everywhere Survey data:
More informationAn Approach Finding the Right Tolerance Level for Clinical Data Acceptance
Paper P024 An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Karen Walker, Walker Consulting LLC, Chandler Arizona ABSTRACT Highly broadcasted zero tolerance initiatives for database
More informationThis study is brought to you courtesy of.
This study is brought to you courtesy of www.google.com/think/insights Health Consumer Study The Role of Digital in Patients Healthcare Actions & Decisions Google/OTX U.S., December 2009 Background Demonstrate
More informationData Mining: Exploring Data. Lecture Notes for Chapter 3
Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis
More informationAcurian on. The Role of Technology in Patient Recruitment
Acurian on The Role of Technology in Patient Recruitment Wearables smartphones social networks the list of new technological tools available to patients and healthcare providers goes on and on. Many clinical
More informationIntroduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization. Wolfram Burgard
Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization Wolfram Burgard 1 Motivation Recall: Discrete filter Discretize the continuous state space High memory complexity
More informationTitle. Description. Menu. Remarks and examples. stata.com. stata.com. PSS Control Panel
Title stata.com GUI Graphical user interface for power and sample-size analysis Description Menu Remarks and examples Also see Description This entry describes the graphical user interface (GUI) for the
More informationPackage PIPS. February 19, 2015
Type Package Title Predicted Interval Plots Package PIPS February 19, 2015 Author Daniel G. Muenz, Ray Griner , Huichao Chen, Lijuan Deng, Sachiko Miyahara, and Scott R. Evans
More informationSubgroup identification for dose-finding trials via modelbased recursive partitioning
Subgroup identification for dose-finding trials via modelbased recursive partitioning Marius Thomas, Björn Bornkamp, Heidi Seibold & Torsten Hothorn ISCB 2017, Vigo July 10, 2017 Motivation Characterizing
More informationMetrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates?
Model Evaluation Metrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates? Methods for Model Comparison How to
More informationEVALUATION OF THE NORMAL APPROXIMATION FOR THE PAIRED TWO SAMPLE PROBLEM WITH MISSING DATA. Shang-Lin Yang. B.S., National Taiwan University, 1996
EVALUATION OF THE NORMAL APPROXIMATION FOR THE PAIRED TWO SAMPLE PROBLEM WITH MISSING DATA By Shang-Lin Yang B.S., National Taiwan University, 1996 M.S., University of Pittsburgh, 2005 Submitted to the
More informationPreprocessing Short Lecture Notes cse352. Professor Anita Wasilewska
Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept
More informationPooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit
Pooling Clinical Data: Key points and Pitfalls October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Introduction Are there any pre-defined rules to pool clinical data? Are there any pre-defined
More informationAdvances in FDR for fmri -p.1
Advances in FDR for fmri Ruth Heller Department of Statistics, University of Pennsylvania Joint work with: Yoav Benjamini, Nava Rubin, Damian Stanley, Daniel Yekutieli, Yulia Golland, Rafael Malach Advances
More informationData Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski
Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...
More informationStudy and Implementation of CHAMELEON algorithm for Gene Clustering
[1] Study and Implementation of CHAMELEON algorithm for Gene Clustering 1. Motivation Saurav Sahay The vast amount of gathered genomic data from Microarray and other experiments makes it extremely difficult
More informationCS 664 Slides #11 Image Segmentation. Prof. Dan Huttenlocher Fall 2003
CS 664 Slides #11 Image Segmentation Prof. Dan Huttenlocher Fall 2003 Image Segmentation Find regions of image that are coherent Dual of edge detection Regions vs. boundaries Related to clustering problems
More information- with application to cluster and significance analysis
Selection via - with application to cluster and significance of gene expression Rebecka Jörnsten Department of Statistics, Rutgers University rebecka@stat.rutgers.edu, http://www.stat.rutgers.edu/ rebecka
More informationMultiple Comparisons of Treatments vs. a Control (Simulation)
Chapter 585 Multiple Comparisons of Treatments vs. a Control (Simulation) Introduction This procedure uses simulation to analyze the power and significance level of two multiple-comparison procedures that
More informationChapter 6: Linear Model Selection and Regularization
Chapter 6: Linear Model Selection and Regularization As p (the number of predictors) comes close to or exceeds n (the sample size) standard linear regression is faced with problems. The variance of the
More informationUse of Extreme Value Statistics in Modeling Biometric Systems
Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision
More informationOn the Global Solution of Linear Programs with Linear Complementarity Constraints
On the Global Solution of Linear Programs with Linear Complementarity Constraints J. E. Mitchell 1 J. Hu 1 J.-S. Pang 2 K. P. Bennett 1 G. Kunapuli 1 1 Department of Mathematical Sciences RPI, Troy, NY
More informationOptimal Techniques and Sequences for Population Modeling of Bioavailability
Optimal Techniques and Sequences for Population Modeling of Bioavailability Roger W. Jelliffe, M.D Professor of Medicine Laboratory of Applied Pharmacokinetics, USC Keck School of Medicine www.usc.edu/hsc/lab_apk/,
More informationON SOME METHODS OF CONSTRUCTION OF BLOCK DESIGNS
ON SOME METHODS OF CONSTRUCTION OF BLOCK DESIGNS NURNABI MEHERUL ALAM M.Sc. (Agricultural Statistics), Roll No. I.A.S.R.I, Library Avenue, New Delhi- Chairperson: Dr. P.K. Batra Abstract: Block designs
More informationCS6375: Machine Learning Gautam Kunapuli. Mid-Term Review
Gautam Kunapuli Machine Learning Data is identically and independently distributed Goal is to learn a function that maps to Data is generated using an unknown function Learn a hypothesis that minimizes
More informationBayesian Inference for Sample Surveys
Bayesian Inference for Sample Surveys Trivellore Raghunathan (Raghu) Director, Survey Research Center Professor of Biostatistics University of Michigan Distinctive features of survey inference 1. Primary
More informationA GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM
A GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM Jayawant Mandrekar, Daniel J. Sargent, Paul J. Novotny, Jeff A. Sloan Mayo Clinic, Rochester, MN 55905 ABSTRACT A general
More informationGeneral Factorial Models
In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 34 It is possible to have many factors in a factorial experiment. In DDD we saw an example of a 3-factor study with ball size, height, and surface
More informationGeneral Factorial Models
In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 1 1 / 31 It is possible to have many factors in a factorial experiment. We saw some three-way factorials earlier in the DDD book (HW 1 with 3 factors:
More informationPackage FHtest. November 8, 2017
Type Package Package FHtest November 8, 2017 Title Tests for Right and Interval-Censored Survival Data Based on the Fleming-Harrington Class Version 1.4 Date 2017-11-8 Author Ramon Oller, Klaus Langohr
More information14.2 Do Two Distributions Have the Same Means or Variances?
14.2 Do Two Distributions Have the Same Means or Variances? 615 that this is wasteful, since it yields much more information than just the median (e.g., the upper and lower quartile points, the deciles,
More informationHealthInfoNet CLINICAL PORTAL USER REFERENCE GUIDE. Revised: Page 1 of 24
HealthInfoNet CLINICAL PORTAL USER REFERENCE GUIDE Revised: 6.3.2015 Page 1 of 24 HealthInfoNet User Reference Guide INSIDE: Accessing HealthInfoNet (HIN) 3-5 Clinical Portal 6-11 Notifications and Worklists
More informationScientific Computing: An Introductory Survey
Scientific Computing: An Introductory Survey Chapter 13 Random Numbers and Stochastic Simulation Prof. Michael T. Heath Department of Computer Science University of Illinois at Urbana-Champaign Copyright
More informationSmoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data
Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current
More informationA Generic Separation Algorithm and Its Application to the Vehicle Routing Problem
A Generic Separation Algorithm and Its Application to the Vehicle Routing Problem Presented by: Ted Ralphs Joint work with: Leo Kopman Les Trotter Bill Pulleyblank 1 Outline of Talk Introduction Description
More informationPhysician Engagement in Quality and Safety. Improvement Associates Ltd.
Physician Engagement in Quality and Safety What does physician engagement in quality and safety mean to you? Who do you want involved and how do you want them involved? What would the ideal world look
More informationTreatment Planning Optimization for VMAT, Tomotherapy and Cyberknife
Treatment Planning Optimization for VMAT, Tomotherapy and Cyberknife Kerem Akartunalı Department of Management Science Strathclyde Business School Joint work with: Vicky Mak-Hau and Thu Tran 14 July 2015
More informationVHIMS QUICK REFERENCE GUIDE TO INCIDENT MANAGEMENT
Introduction This reference guide is aimed at Risk/Quality Managers and/or OHS Managers who are responsible for ensuring Incidents have been followed up by the respective Line Managers, and to post incidents
More informationSequential Screening: A Bayesian Dynamic Programming Analysis
Sequential Screening: A Bayesian Dynamic Programming Analysis Peter I. Frazier (Cornell), Bruno Jedynak (JHU), Li Chen (JHU) Operations Research & Information Engineering, Cornell University Applied Mathematics,
More informationCorrectly Compute Complex Samples Statistics
SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample
More informationChapter 9. Classification and Clustering
Chapter 9 Classification and Clustering Classification and Clustering Classification and clustering are classical pattern recognition and machine learning problems Classification, also referred to as categorization
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation
More informationA Geometric Analysis of Subspace Clustering with Outliers
A Geometric Analysis of Subspace Clustering with Outliers Mahdi Soltanolkotabi and Emmanuel Candés Stanford University Fundamental Tool in Data Mining : PCA Fundamental Tool in Data Mining : PCA Subspace
More informationUTS Library s Guide to Finding Evidence-Based Practice Resources
UTS Library s Guide to Finding Evidence-Based Practice Resources UTS: Library UTS Library s Health librarians have made this step-by-step guide to finding evidence-based practice resources using the PICO
More informationMicroscopic Traffic Simulation
Microscopic Traffic Simulation Lecture Notes in Transportation Systems Engineering Prof. Tom V. Mathew Contents Overview 2 Traffic Simulation Models 2 2. Need for simulation.................................
More informationAuthenticating Nuclear Warheads With High Confidence
Authenticating Nuclear Warheads With High Confidence Moritz Kütt,, Sebastien Philippe, Boaz Barak, Alexander Glaser, and Robert J. Goldston, Princeton University Technische Universität Darmstadt, Germany
More informationINF 4300 Classification III Anne Solberg The agenda today:
INF 4300 Classification III Anne Solberg 28.10.15 The agenda today: More on estimating classifier accuracy Curse of dimensionality and simple feature selection knn-classification K-means clustering 28.10.15
More informationICTR UW Institute of Clinical and Translational Research. i2b2 User Guide. Version 1.0 Updated 9/11/2017
ICTR UW Institute of Clinical and Translational Research i2b2 User Guide Version 1.0 Updated 9/11/2017 Table of Contents Background/Search Criteria... 2 Accessing i2b2... 3 Navigating the Workbench...
More informationPAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer. Stata Basics for PAM 4280/ECON 3710
PAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer Stata Basics for PAM 4280/ECON 3710 I Introduction Stata is one of the most commonly used
More informationA Sequential, Multi-Complexity Topology Optimization Process for Aeroelastic Wing Structure Design
A Sequential, Multi-Complexity Topology Optimization Process for Aeroelastic Wing Structure Design Bill Crossley, crossley@purdue.edu Significant content from graduate student Mark Guiles and input from
More informationTowards a practical simplex method for second order cone programming
Towards a practical simplex method for second order cone programming Kartik Krishnan Department of Computing and Software McMaster University Joint work with Gábor Pataki (UNC), Neha Gupta (IIT Delhi),
More informationChapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models
Chapter 6 Microscopic Traffic Simulation 6. Overview The complexity of traffic stream behaviour and the difficulties in performing experiments with real world traffic make computer simulation an important
More informationImage Restoration and Background Separation Using Sparse Representation Framework
Image Restoration and Background Separation Using Sparse Representation Framework Liu, Shikun Abstract In this paper, we introduce patch-based PCA denoising and k-svd dictionary learning method for the
More informationModelling Proportions and Count Data
Modelling Proportions and Count Data Rick White May 4, 2016 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:
More informationNotes taken by Mea Wang. February 11, 2005
CSC2411 - Linear Programming and Combinatorial Optimization Lecture 5: Smoothed Analysis, Randomized Combinatorial Algorithms, and Linear Programming Duality Notes taken by Mea Wang February 11, 2005 Summary:
More informationTopological Mapping. Discrete Bayes Filter
Topological Mapping Discrete Bayes Filter Vision Based Localization Given a image(s) acquired by moving camera determine the robot s location and pose? Towards localization without odometry What can be
More information