Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming

Size: px
Start display at page:

Download "Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming"

Transcription

1 Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Michael Rosenblum Johns Hopkins Bloomberg School of Public Health Joint work with Xingyuan (Ethan) Fang and Han Liu Princeton University Working paper giving full results available here: October, 05

2 Problem Motivation Goal: Testing Treatment Effects in Two Subpopulations and the Overall Population in Adaptive Enrichment Designs Example: Treating resistant HIV. Recent HIV drugs (maraviroc, raltegravir) have shown stronger benefit in those with lower phenotypic sensitivity to background therapy. We assume two, predefined, subpopulations that partition the overall population.

3 Multiple Testing Problem: Null Hypotheses Definition Define three treatment effects of interest: : Mean Treatment Effect for Subpopulation (i.e., difference between population mean of the primary outcome under treatment and under control) : Mean Treatment Effect for Subpopulation C = p + ( p ) : Mean Treatment Effect for Combined Population

4 Multiple Testing Problem: Null Hypotheses Definition Define three treatment effects of interest: : Mean Treatment Effect for Subpopulation (i.e., difference between population mean of the primary outcome under treatment and under control) : Mean Treatment Effect for Subpopulation C = p + ( p ) : Mean Treatment Effect for Combined Population Goal: construct adaptive enrichment design D and multiple testing procedure M for: H 0 : 0, H 0 : 0, H 0C : p + ( p ) 0, that strongly controls familywise Type I error rate, and is optimal in sense defined below.

5 General Two-Stage Adaptive Enrichment Design Stage Decision Stage Hyp.Test Recruit Both Popula)ons Subpopula)on Subpopula)on D... Subpopula)on : Subpopula)on : Subpopula)on : Subpopula)on : (D=) n (D=) n (D=) n (D=) n M K Subpopula)on : Subpopula)on : (D=K ) n (D=K ) n

6 Example Two-Stage Adaptive Enrichment Design Stage Decision Stage Stop trial Hyp.Test Recruit Both Popula7ons Subpop Subpop D Recruit Both Pop. Subpop Subpop Recruit Only Subpop. T > T? Subpopula7on M 4 Recruit Only Subpop. Subpopula7on

7 Example Two-Stage Adaptive Enrichment Design n = total sample size if both subpopulations enrolled in stage. Stage Decision Stage Stop trial Hyp.Test Recruit Both Popula7ons Subpop Subpop } n D 4 } Recruit Both Pop. Subpop n Subpop Recruit Only Subpop. T > T? Subpopula7on } n 4 M } Recruit Only Subpop. n Subpopula7on 4

8 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C.

9 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage.

10 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage. Decision rule D is map from Z () = (Z (), Z () ) to possible decisions D. Multiple testing procedure M is map from Z (F ) = (Z (F ), Z (F ) ) and decision D to set of null hypotheses rejected (if any).

11 Two-Stage Adaptive Enrichment Design Assume known variances and normally distributed outcomes; subpopulation cumulative sample sizes and z-statistics and are then sufficient statistics for,, C. Z s () : z-statistic for subpopulation s at end of stage ; Z s (F ) : Final, cumulative z-statistic for subpop. s at end of stage. Decision rule D is map from Z () = (Z (), Z () ) to possible decisions D. Multiple testing procedure M is map from Z (F ) = (Z (F ), Z (F ) ) and decision D to set of null hypotheses rejected (if any). Power at alternative, to reject H 0 is Pr (, )[M(Z (F ), D(Z () )) rejects H 0 ]. User specifies: (i) loss function L(D, M;, ), e.g., total sample size; and (ii) distribution Λ on alternatives (, ).

12 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )).

13 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )). Constrained Bayes Opt. Problem: Find pair (D, M) minimizing: E (, )[L(D, M;, )]dλ(, ), under familywise Type I error constraints: sup Pr (, )[M rejects any true null hypothesis] α, (, ) R

14 Constrained Bayes Optimization Problem Problem inputs: p ; set of possible stage decisions; σ, σ ; clinically meaningful min. treatment effect min ; loss function L; distribution Λ on alternatives (, ); α, β, β, β C. Recall D = D(Z () ) and M = M(Z (F ), D(Z () )). Constrained Bayes Opt. Problem: Find pair (D, M) minimizing: E (, )[L(D, M;, )]dλ(, ), under familywise Type I error constraints: sup Pr (, )[M rejects any true null hypothesis] α, (, ) R and power constraints: Pr ( min,0)[m rejects H 0 ] β. Pr (0, min )[M rejects H 0 ] β. Pr ( min, min )[M rejects H 0C ] β C.

15 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).)

16 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).) Our solution: Discretize decision region R into small rectangles R; for any r R, enforce decision rule D makes same decision for any (Z (), Z () ) r. For each decision d {,..., K}, discretize rejection regions R into small rectangles R d ; for any r R d, enforce that if D = d, multiple testing procedure M rejects same set of hypotheses for any (Z (F ), Z (F ) ) r. Discretize Type I error constraints into fine grid on boundaries of null spaces.

17 Our Method to Solve Optimization Problem Computational challenge: continuum of Type I error constraints. (It s not enough to satisfy only at global null (, ) = (0, 0).) Our solution: Discretize decision region R into small rectangles R; for any r R, enforce decision rule D makes same decision for any (Z (), Z () ) r. For each decision d {,..., K}, discretize rejection regions R into small rectangles R d ; for any r R d, enforce that if D = d, multiple testing procedure M rejects same set of hypotheses for any (Z (F ), Z (F ) ) r. Discretize Type I error constraints into fine grid on boundaries of null spaces. Discretized opt. problem is not convex. However, we construct reparametrization that is sparse, linear program: max c T x s.t. Ax b. x We apply advanced optimization methods to solve this.

18 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ).

19 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ). Sample size n is min s.t. standard design satisfies each power constraint at β j = 0.64 for j {,, C}. Set each β j = 0.8. Stage Decision Stage Stop trial Hyp.Test M(Z (F),) Recruit Both Popula7ons Subpop Subpop } n D(Z () ) 4 Recruit Both Pop. Subpop n Subpop Recruit Only Subpop. Subpopula7on T > T? } } } n 4 M(Z (F),) M(Z (F),) Recruit Only Subpop. n Subpopula7on 4 M(Z (F),4)

20 Example p = /, α = 0.05, σ = σ. L =total sample size. Prior Λ equally weighted pt. masses at (, ) equal to (0, 0), ( min, 0), (0, min ), ( min, min ). Sample size n is min s.t. standard design satisfies each power constraint at β j = 0.64 for j {,, C}. Set each β j = 0.8.? T T > Subpopula)on Stage Decision Stage Hyp.Test Subpopula)on Subpopula)on Subpopula)on Recruit Both Popula)ons Recruit Only Subpop. Recruit Only Subpop. Recruit Both Pop. Subpopula)on Subpopula)on Stop trial D 4 M Z Z 0 0

21 Approximately Optimal Solution Decision Rule for Stage Enrollment: Z Z 0 0 Rejection Regions under Each Decision: (in terms of Z (F ), Z (F ) ) D = "" D = "" D = "" D = ""

22 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0 H0, H0C H0, H0C H0

23 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0 H0, H0C H0, H0C H0

24 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0

25 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0

26 Approximately Optimal Solution Decision Rule to Enroll Stage : Z Z 0 0 Rejection Regions for Decision: D = "" Z F Z F 0 0 H0, H0C H0

27 Power Comparison Compare to adaptive enrichment design using p-value combination approach (Bauer and Köhne, 994), with Dunnett intersection test and inverse-normal combination function. Early stopping is incorporated using O Brien-Fleming boundaries for each intersection null hypothesis. Decision rule for stage : if combined population statistic (Z () + Z () )/ > t c, enroll both subpop. else, enroll from each subpopulation s for which Z s () > t. Consider β = β = β = β C. For each power threshold β, we optimized over t, t c to minimize expected sample size under the power constraints. n = total sample size if both enrolled stage. Table: Minimum of ESS dλ, as power constraint β varied. Required Power β: 70% 74% 78% 8% Comparator 0.97n.0n infeasible infeasible Optimal 0.79n 0.84n 0.9n.0n

28 References Rosenblum, M., Fang, X., and Liu, H., Optimal, Two Stage, Adaptive Enrichment Designs for Randomized Trials Using Sparse Linear Programming (04). Johns Hopkins University, Dept. of Biostatistics Working Papers. Working Paper 7. Rosenblum, M., Liu, H., and Yen, E.-H. (04), Optimal Tests of Treatment Effects for the Overall Population and Two Subpopulations in Randomized Trials, using Sparse Linear Programming, Journal of American Statistical Association, Theory and Methods Section, Volume 09. Issue Rosenblum, M. (In Press), Adaptive Randomized Trial Designs that Cannot be Dominated by Any Standard Design at the Same Total Sample Size. Biometrika.

29 Collaborators Collaborators Ethan X. Fang Han Liu Princeton University

30 Thank you! Acknowledgement: This research was supported by the U.S. Food and Drug Administration (HHSF0400C) and the Patient- Centered Outcomes Research Institute (ME ). This work is solely the responsibility of the authors and does not represent the views of these agencies.

Optimal designs for comparing curves

Optimal designs for comparing curves Optimal designs for comparing curves Holger Dette, Ruhr-Universität Bochum Maria Konstantinou, Ruhr-Universität Bochum Kirsten Schorning, Ruhr-Universität Bochum FP7 HEALTH 2013-602552 Outline 1 Motivation

More information

Bayes Factor Single Arm Binary User s Guide (Version 1.0)

Bayes Factor Single Arm Binary User s Guide (Version 1.0) Bayes Factor Single Arm Binary User s Guide (Version 1.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402, USA November

More information

Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS

Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS MWSUG 2016 - Paper PH06 Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS Kechen Zhao, University of Southern

More information

Tables for Statistical Significance with Multiple Comparisons

Tables for Statistical Significance with Multiple Comparisons Tables for Statistical Significance with Multiple Comparisons J. Scott Armstrong The Wharton School, University of Pennsylvania Philadelphia, Pa. 19104 Reprinted from Armstrong, J. Scott, Long-Range Forecasting.

More information

Equivalence of dose-response curves

Equivalence of dose-response curves Equivalence of dose-response curves Holger Dette, Ruhr-Universität Bochum Kathrin Möllenhoff, Ruhr-Universität Bochum Stanislav Volgushev, University of Toronto Frank Bretz, Novartis Basel FP7 HEALTH 2013-602552

More information

Lecture 14. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 14. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. geometric Lecture 14 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 19, 2007 geometric 1 2 3 4 geometric 5 6 7 geometric 1 Review about logs

More information

The fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R

The fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R Journal of Machine Learning Research (2013) Submitted ; Published The fastclime Package for Linear Programming and Large-Scale Precision Matrix Estimation in R Haotian Pang Han Liu Robert Vanderbei Princeton

More information

Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics

Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Automatic Selection of Compiler Options Using Non-parametric Inferential Statistics Masayo Haneda Peter M.W. Knijnenburg Harry A.G. Wijshoff LIACS, Leiden University Motivation An optimal compiler optimization

More information

TTEDesigner User s Manual

TTEDesigner User s Manual TTEDesigner User s Manual John D. Cook Department of Biostatistics, Box 447 The University of Texas, M. D. Anderson Cancer Center 1515 Holcombe Blvd., Houston, Texas 77030, USA cook@mdanderson.org September

More information

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis

Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Bayesian Spherical Wavelet Shrinkage: Applications to Shape Analysis Xavier Le Faucheur a, Brani Vidakovic b and Allen Tannenbaum a a School of Electrical and Computer Engineering, b Department of Biomedical

More information

An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation

An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation Xingguo Li Tuo Zhao Xiaoming Yuan Han Liu Abstract This paper describes an R package named flare, which implements

More information

An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation

An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation Xingguo Li Tuo Zhao Xiaoming Yuan Han Liu Abstract This paper describes an R package named flare, which implements

More information

gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual

gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual Keaven M. Anderson, Dan (Jennifer) Sun, Zhongxin (John) Zhang Merck Research Laboratories December 18, 2008 Abstract

More information

The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R

The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R Journal of Machine Learning Research 6 (205) 553-557 Submitted /2; Revised 3/4; Published 3/5 The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R Xingguo Li Department

More information

Two-Stage Designs for Phase II Clinical Trials. by Chung-Li Tsai Advisor Kam-Fai Wong

Two-Stage Designs for Phase II Clinical Trials. by Chung-Li Tsai Advisor Kam-Fai Wong Two-Stage Designs for Phase II Clinical Trials by Chung-Li Tsai Advisor Kam-Fai Wong Institute of Statistics National University of Kaohsiung Kaohsiung, Taiwan 811, R.O.C. July 2006 Contents List of Figures

More information

Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients

Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1 Gene signature selection to predict survival benefits from adjuvant chemotherapy in NSCLC patients 1,2 Keyue Ding, Ph.D. Nov. 8, 2014 1 NCIC Clinical Trials Group, Kingston, Ontario, Canada 2 Dept. Public

More information

Video shot segmentation using late fusion technique

Video shot segmentation using late fusion technique Video shot segmentation using late fusion technique by C. Krishna Mohan, N. Dhananjaya, B.Yegnanarayana in Proc. Seventh International Conference on Machine Learning and Applications, 2008, San Diego,

More information

Introductory Concepts for Voxel-Based Statistical Analysis

Introductory Concepts for Voxel-Based Statistical Analysis Introductory Concepts for Voxel-Based Statistical Analysis John Kornak University of California, San Francisco Department of Radiology and Biomedical Imaging Department of Epidemiology and Biostatistics

More information

MATH : EXAM 3 INFO/LOGISTICS/ADVICE

MATH : EXAM 3 INFO/LOGISTICS/ADVICE MATH 3342-004: EXAM 3 INFO/LOGISTICS/ADVICE INFO: WHEN: Friday (04/22) at 10:00am DURATION: 50 mins PROBLEM COUNT: Appropriate for a 50-min exam BONUS COUNT: At least one TOPICS CANDIDATE FOR THE EXAM:

More information

Statistics. Taskmistress or Temptress?

Statistics. Taskmistress or Temptress? Statistics Taskmistress or Temptress? Naomi Altman Penn State University Dec 2013 Altman (Penn State) Reproducible Research December 9, 2013 1 / 24 Recent Headlines Altman (Penn State) Reproducible Research

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

Fast or furious? - User analysis of SF Express Inc

Fast or furious? - User analysis of SF Express Inc CS 229 PROJECT, DEC. 2017 1 Fast or furious? - User analysis of SF Express Inc Gege Wen@gegewen, Yiyuan Zhang@yiyuan12, Kezhen Zhao@zkz I. MOTIVATION The motivation of this project is to predict the likelihood

More information

Model-based Recursive Partitioning for Subgroup Analyses

Model-based Recursive Partitioning for Subgroup Analyses EBPI Epidemiology, Biostatistics and Prevention Institute Model-based Recursive Partitioning for Subgroup Analyses Torsten Hothorn; joint work with Heidi Seibold and Achim Zeileis 2014-12-03 Subgroup analyses

More information

9.2 Types of Errors in Hypothesis testing

9.2 Types of Errors in Hypothesis testing 9.2 Types of Errors in Hypothesis testing 1 Mistakes we could make As I mentioned, when we take a sample we won t be 100% sure of something because we do not take a census (we only look at information

More information

Introduction to Stochastic Combinatorial Optimization

Introduction to Stochastic Combinatorial Optimization Introduction to Stochastic Combinatorial Optimization Stefanie Kosuch PostDok at TCSLab www.kosuch.eu/stefanie/ Guest Lecture at the CUGS PhD course Heuristic Algorithms for Combinatorial Optimization

More information

Spatial biosurveillance

Spatial biosurveillance Spatial biosurveillance Authors of Slides Andrew Moore Carnegie Mellon awm@cs.cmu.edu Daniel Neill Carnegie Mellon d.neill@cs.cmu.edu Slides and Software and Papers at: http://www.autonlab.org awm@cs.cmu.edu

More information

Leveraging multiple endpoints in small clinical trials

Leveraging multiple endpoints in small clinical trials Leveraging multiple endpoints in small clinical trials Robin Ristl Section for Medical Statistics, Center for Medical Statistics, Informatics and Intelligent Systems, Medical University of Vienna EMA FP7

More information

Package sp23design. R topics documented: February 20, Type Package

Package sp23design. R topics documented: February 20, Type Package Type Package Package sp23design February 20, 2015 Title Design and Simulation of seamless Phase II-III Clinical Trials Version 0.9 Date 2012-01-01 Author Balasubramanian Narasimhan [aut, cre], Mei-Chiung

More information

Equivalence Tests for Two Means in a 2x2 Cross-Over Design using Differences

Equivalence Tests for Two Means in a 2x2 Cross-Over Design using Differences Chapter 520 Equivalence Tests for Two Means in a 2x2 Cross-Over Design using Differences Introduction This procedure calculates power and sample size of statistical tests of equivalence of the means of

More information

Week 4: Simple Linear Regression II

Week 4: Simple Linear Regression II Week 4: Simple Linear Regression II Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline Algebraic properties

More information

Interactive Treatment Planning in Cancer Radiotherapy

Interactive Treatment Planning in Cancer Radiotherapy Interactive Treatment Planning in Cancer Radiotherapy Mohammad Shakourifar Giulio Trigila Pooyan Shirvani Ghomi Abraham Abebe Sarah Couzens Laura Noreña Wenling Shang June 29, 212 1 Introduction Intensity

More information

Project Paper Introduction

Project Paper Introduction Project Paper Introduction Tracey Marsh Group Health Research Institute University of Washington, Department of Biostatistics Paper Estimating and Projecting Trends in HIV/AIDS Generalized Epidemics Using

More information

Lecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem

Lecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem Computational Learning Theory Fall Semester, 2012/13 Lecture 10: SVM Lecturer: Yishay Mansour Scribe: Gitit Kehat, Yogev Vaknin and Ezra Levin 1 10.1 Lecture Overview In this lecture we present in detail

More information

R-Square Coeff Var Root MSE y Mean

R-Square Coeff Var Root MSE y Mean STAT:50 Applied Statistics II Exam - Practice 00 possible points. Consider a -factor study where each of the factors has 3 levels. The factors are Diet (,,3) and Drug (A,B,C) and there are n = 3 observations

More information

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs Advanced Operations Research Techniques IE316 Quiz 1 Review Dr. Ted Ralphs IE316 Quiz 1 Review 1 Reading for The Quiz Material covered in detail in lecture. 1.1, 1.4, 2.1-2.6, 3.1-3.3, 3.5 Background material

More information

Bootstrapping Methods

Bootstrapping Methods Bootstrapping Methods example of a Monte Carlo method these are one Monte Carlo statistical method some Bayesian statistical methods are Monte Carlo we can also simulate models using Monte Carlo methods

More information

Managing Uncertainty in Data Streams. Aleka Seliniotaki Project Presentation HY561 Heraklion, 22/05/2013

Managing Uncertainty in Data Streams. Aleka Seliniotaki Project Presentation HY561 Heraklion, 22/05/2013 Managing Uncertainty in Data Streams Aleka Seliniotaki Project Presentation HY561 Heraklion, 22/05/2013 Introduction Uncertain Data Streams T V Data: incomplete, imprecise, misleading Results: unknown

More information

Linear Programming Duality and Algorithms

Linear Programming Duality and Algorithms COMPSCI 330: Design and Analysis of Algorithms 4/5/2016 and 4/7/2016 Linear Programming Duality and Algorithms Lecturer: Debmalya Panigrahi Scribe: Tianqi Song 1 Overview In this lecture, we will cover

More information

Analysis of variance - ANOVA

Analysis of variance - ANOVA Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try

More information

Fast Calculation of Calendar Time-, Ageand Duration Dependent Time at Risk in the Lexis Space

Fast Calculation of Calendar Time-, Ageand Duration Dependent Time at Risk in the Lexis Space Fast Calculation of Calendar Time-, Ageand Duration Dependent Time at Risk in the Lexis Space arxiv:1204.0798v1 [stat.me] 3 Apr 2012 Ralph Brinks Institute for Biometry and Epidemiology German Diabetes

More information

Camera Motion Identification in the Rough Indexing Paradigm

Camera Motion Identification in the Rough Indexing Paradigm Camera Motion Identification in the Rough Indexing aradigm etra KRÄMER and Jenny BENOIS-INEAU LaBRI University Bordeaux I, France {petra.kraemer,jenny.benois}@labri.fr Camera Motion Identification in the

More information

Parameterization of triangular meshes

Parameterization of triangular meshes Parameterization of triangular meshes Michael S. Floater November 10, 2009 Triangular meshes are often used to represent surfaces, at least initially, one reason being that meshes are relatively easy to

More information

Iterative CT Reconstruction Using Curvelet-Based Regularization

Iterative CT Reconstruction Using Curvelet-Based Regularization Iterative CT Reconstruction Using Curvelet-Based Regularization Haibo Wu 1,2, Andreas Maier 1, Joachim Hornegger 1,2 1 Pattern Recognition Lab (LME), Department of Computer Science, 2 Graduate School in

More information

2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk

2017 GridGain Systems, Inc. In-Memory Performance Durability of Disk In-Memory Performance Durability of Disk Meeting the Challenges of Fast Data in Healthcare with In-Memory Technologies Akmal Chaudhri Technology Evangelist GridGain Agenda Introduction Fast Data in Healthcare

More information

Lecture 26: Missing data

Lecture 26: Missing data Lecture 26: Missing data Reading: ESL 9.6 STATS 202: Data mining and analysis December 1, 2017 1 / 10 Missing data is everywhere Survey data: nonresponse. 2 / 10 Missing data is everywhere Survey data:

More information

An Approach Finding the Right Tolerance Level for Clinical Data Acceptance

An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Paper P024 An Approach Finding the Right Tolerance Level for Clinical Data Acceptance Karen Walker, Walker Consulting LLC, Chandler Arizona ABSTRACT Highly broadcasted zero tolerance initiatives for database

More information

This study is brought to you courtesy of.

This study is brought to you courtesy of. This study is brought to you courtesy of www.google.com/think/insights Health Consumer Study The Role of Digital in Patients Healthcare Actions & Decisions Google/OTX U.S., December 2009 Background Demonstrate

More information

Data Mining: Exploring Data. Lecture Notes for Chapter 3

Data Mining: Exploring Data. Lecture Notes for Chapter 3 Data Mining: Exploring Data Lecture Notes for Chapter 3 Slides by Tan, Steinbach, Kumar adapted by Michael Hahsler Look for accompanying R code on the course web site. Topics Exploratory Data Analysis

More information

Acurian on. The Role of Technology in Patient Recruitment

Acurian on. The Role of Technology in Patient Recruitment Acurian on The Role of Technology in Patient Recruitment Wearables smartphones social networks the list of new technological tools available to patients and healthcare providers goes on and on. Many clinical

More information

Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization. Wolfram Burgard

Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization. Wolfram Burgard Introduction to Mobile Robotics Bayes Filter Particle Filter and Monte Carlo Localization Wolfram Burgard 1 Motivation Recall: Discrete filter Discretize the continuous state space High memory complexity

More information

Title. Description. Menu. Remarks and examples. stata.com. stata.com. PSS Control Panel

Title. Description. Menu. Remarks and examples. stata.com. stata.com. PSS Control Panel Title stata.com GUI Graphical user interface for power and sample-size analysis Description Menu Remarks and examples Also see Description This entry describes the graphical user interface (GUI) for the

More information

Package PIPS. February 19, 2015

Package PIPS. February 19, 2015 Type Package Title Predicted Interval Plots Package PIPS February 19, 2015 Author Daniel G. Muenz, Ray Griner , Huichao Chen, Lijuan Deng, Sachiko Miyahara, and Scott R. Evans

More information

Subgroup identification for dose-finding trials via modelbased recursive partitioning

Subgroup identification for dose-finding trials via modelbased recursive partitioning Subgroup identification for dose-finding trials via modelbased recursive partitioning Marius Thomas, Björn Bornkamp, Heidi Seibold & Torsten Hothorn ISCB 2017, Vigo July 10, 2017 Motivation Characterizing

More information

Metrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates?

Metrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates? Model Evaluation Metrics for Performance Evaluation How to evaluate the performance of a model? Methods for Performance Evaluation How to obtain reliable estimates? Methods for Model Comparison How to

More information

EVALUATION OF THE NORMAL APPROXIMATION FOR THE PAIRED TWO SAMPLE PROBLEM WITH MISSING DATA. Shang-Lin Yang. B.S., National Taiwan University, 1996

EVALUATION OF THE NORMAL APPROXIMATION FOR THE PAIRED TWO SAMPLE PROBLEM WITH MISSING DATA. Shang-Lin Yang. B.S., National Taiwan University, 1996 EVALUATION OF THE NORMAL APPROXIMATION FOR THE PAIRED TWO SAMPLE PROBLEM WITH MISSING DATA By Shang-Lin Yang B.S., National Taiwan University, 1996 M.S., University of Pittsburgh, 2005 Submitted to the

More information

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska

Preprocessing Short Lecture Notes cse352. Professor Anita Wasilewska Preprocessing Short Lecture Notes cse352 Professor Anita Wasilewska Data Preprocessing Why preprocess the data? Data cleaning Data integration and transformation Data reduction Discretization and concept

More information

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit

Pooling Clinical Data: Key points and Pitfalls. October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Pooling Clinical Data: Key points and Pitfalls October 16, 2012 Phuse 2012 conference, Budapest Florence Buchheit Introduction Are there any pre-defined rules to pool clinical data? Are there any pre-defined

More information

Advances in FDR for fmri -p.1

Advances in FDR for fmri -p.1 Advances in FDR for fmri Ruth Heller Department of Statistics, University of Pennsylvania Joint work with: Yoav Benjamini, Nava Rubin, Damian Stanley, Daniel Yekutieli, Yulia Golland, Rafael Malach Advances

More information

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski Data Analysis and Solver Plugins for KSpread USER S MANUAL Tomasz Maliszewski tmaliszewski@wp.pl Table of Content CHAPTER 1: INTRODUCTION... 3 1.1. ABOUT DATA ANALYSIS PLUGIN... 3 1.3. ABOUT SOLVER PLUGIN...

More information

Study and Implementation of CHAMELEON algorithm for Gene Clustering

Study and Implementation of CHAMELEON algorithm for Gene Clustering [1] Study and Implementation of CHAMELEON algorithm for Gene Clustering 1. Motivation Saurav Sahay The vast amount of gathered genomic data from Microarray and other experiments makes it extremely difficult

More information

CS 664 Slides #11 Image Segmentation. Prof. Dan Huttenlocher Fall 2003

CS 664 Slides #11 Image Segmentation. Prof. Dan Huttenlocher Fall 2003 CS 664 Slides #11 Image Segmentation Prof. Dan Huttenlocher Fall 2003 Image Segmentation Find regions of image that are coherent Dual of edge detection Regions vs. boundaries Related to clustering problems

More information

- with application to cluster and significance analysis

- with application to cluster and significance analysis Selection via - with application to cluster and significance of gene expression Rebecka Jörnsten Department of Statistics, Rutgers University rebecka@stat.rutgers.edu, http://www.stat.rutgers.edu/ rebecka

More information

Multiple Comparisons of Treatments vs. a Control (Simulation)

Multiple Comparisons of Treatments vs. a Control (Simulation) Chapter 585 Multiple Comparisons of Treatments vs. a Control (Simulation) Introduction This procedure uses simulation to analyze the power and significance level of two multiple-comparison procedures that

More information

Chapter 6: Linear Model Selection and Regularization

Chapter 6: Linear Model Selection and Regularization Chapter 6: Linear Model Selection and Regularization As p (the number of predictors) comes close to or exceeds n (the sample size) standard linear regression is faced with problems. The variance of the

More information

Use of Extreme Value Statistics in Modeling Biometric Systems

Use of Extreme Value Statistics in Modeling Biometric Systems Use of Extreme Value Statistics in Modeling Biometric Systems Similarity Scores Two types of matching: Genuine sample Imposter sample Matching scores Enrolled sample 0.95 0.32 Probability Density Decision

More information

On the Global Solution of Linear Programs with Linear Complementarity Constraints

On the Global Solution of Linear Programs with Linear Complementarity Constraints On the Global Solution of Linear Programs with Linear Complementarity Constraints J. E. Mitchell 1 J. Hu 1 J.-S. Pang 2 K. P. Bennett 1 G. Kunapuli 1 1 Department of Mathematical Sciences RPI, Troy, NY

More information

Optimal Techniques and Sequences for Population Modeling of Bioavailability

Optimal Techniques and Sequences for Population Modeling of Bioavailability Optimal Techniques and Sequences for Population Modeling of Bioavailability Roger W. Jelliffe, M.D Professor of Medicine Laboratory of Applied Pharmacokinetics, USC Keck School of Medicine www.usc.edu/hsc/lab_apk/,

More information

ON SOME METHODS OF CONSTRUCTION OF BLOCK DESIGNS

ON SOME METHODS OF CONSTRUCTION OF BLOCK DESIGNS ON SOME METHODS OF CONSTRUCTION OF BLOCK DESIGNS NURNABI MEHERUL ALAM M.Sc. (Agricultural Statistics), Roll No. I.A.S.R.I, Library Avenue, New Delhi- Chairperson: Dr. P.K. Batra Abstract: Block designs

More information

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review Gautam Kunapuli Machine Learning Data is identically and independently distributed Goal is to learn a function that maps to Data is generated using an unknown function Learn a hypothesis that minimizes

More information

Bayesian Inference for Sample Surveys

Bayesian Inference for Sample Surveys Bayesian Inference for Sample Surveys Trivellore Raghunathan (Raghu) Director, Survey Research Center Professor of Biostatistics University of Michigan Distinctive features of survey inference 1. Primary

More information

A GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM

A GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM A GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM Jayawant Mandrekar, Daniel J. Sargent, Paul J. Novotny, Jeff A. Sloan Mayo Clinic, Rochester, MN 55905 ABSTRACT A general

More information

General Factorial Models

General Factorial Models In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 34 It is possible to have many factors in a factorial experiment. In DDD we saw an example of a 3-factor study with ball size, height, and surface

More information

General Factorial Models

General Factorial Models In Chapter 8 in Oehlert STAT:5201 Week 9 - Lecture 1 1 / 31 It is possible to have many factors in a factorial experiment. We saw some three-way factorials earlier in the DDD book (HW 1 with 3 factors:

More information

Package FHtest. November 8, 2017

Package FHtest. November 8, 2017 Type Package Package FHtest November 8, 2017 Title Tests for Right and Interval-Censored Survival Data Based on the Fleming-Harrington Class Version 1.4 Date 2017-11-8 Author Ramon Oller, Klaus Langohr

More information

14.2 Do Two Distributions Have the Same Means or Variances?

14.2 Do Two Distributions Have the Same Means or Variances? 14.2 Do Two Distributions Have the Same Means or Variances? 615 that this is wasteful, since it yields much more information than just the median (e.g., the upper and lower quartile points, the deciles,

More information

HealthInfoNet CLINICAL PORTAL USER REFERENCE GUIDE. Revised: Page 1 of 24

HealthInfoNet CLINICAL PORTAL USER REFERENCE GUIDE. Revised: Page 1 of 24 HealthInfoNet CLINICAL PORTAL USER REFERENCE GUIDE Revised: 6.3.2015 Page 1 of 24 HealthInfoNet User Reference Guide INSIDE: Accessing HealthInfoNet (HIN) 3-5 Clinical Portal 6-11 Notifications and Worklists

More information

Scientific Computing: An Introductory Survey

Scientific Computing: An Introductory Survey Scientific Computing: An Introductory Survey Chapter 13 Random Numbers and Stochastic Simulation Prof. Michael T. Heath Department of Computer Science University of Illinois at Urbana-Champaign Copyright

More information

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data

Smoothing Dissimilarities for Cluster Analysis: Binary Data and Functional Data Smoothing Dissimilarities for Cluster Analysis: Binary Data and unctional Data David B. University of South Carolina Department of Statistics Joint work with Zhimin Chen University of South Carolina Current

More information

A Generic Separation Algorithm and Its Application to the Vehicle Routing Problem

A Generic Separation Algorithm and Its Application to the Vehicle Routing Problem A Generic Separation Algorithm and Its Application to the Vehicle Routing Problem Presented by: Ted Ralphs Joint work with: Leo Kopman Les Trotter Bill Pulleyblank 1 Outline of Talk Introduction Description

More information

Physician Engagement in Quality and Safety. Improvement Associates Ltd.

Physician Engagement in Quality and Safety. Improvement Associates Ltd. Physician Engagement in Quality and Safety What does physician engagement in quality and safety mean to you? Who do you want involved and how do you want them involved? What would the ideal world look

More information

Treatment Planning Optimization for VMAT, Tomotherapy and Cyberknife

Treatment Planning Optimization for VMAT, Tomotherapy and Cyberknife Treatment Planning Optimization for VMAT, Tomotherapy and Cyberknife Kerem Akartunalı Department of Management Science Strathclyde Business School Joint work with: Vicky Mak-Hau and Thu Tran 14 July 2015

More information

VHIMS QUICK REFERENCE GUIDE TO INCIDENT MANAGEMENT

VHIMS QUICK REFERENCE GUIDE TO INCIDENT MANAGEMENT Introduction This reference guide is aimed at Risk/Quality Managers and/or OHS Managers who are responsible for ensuring Incidents have been followed up by the respective Line Managers, and to post incidents

More information

Sequential Screening: A Bayesian Dynamic Programming Analysis

Sequential Screening: A Bayesian Dynamic Programming Analysis Sequential Screening: A Bayesian Dynamic Programming Analysis Peter I. Frazier (Cornell), Bruno Jedynak (JHU), Li Chen (JHU) Operations Research & Information Engineering, Cornell University Applied Mathematics,

More information

Correctly Compute Complex Samples Statistics

Correctly Compute Complex Samples Statistics SPSS Complex Samples 15.0 Specifications Correctly Compute Complex Samples Statistics When you conduct sample surveys, use a statistics package dedicated to producing correct estimates for complex sample

More information

Chapter 9. Classification and Clustering

Chapter 9. Classification and Clustering Chapter 9 Classification and Clustering Classification and Clustering Classification and clustering are classical pattern recognition and machine learning problems Classification, also referred to as categorization

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation

More information

A Geometric Analysis of Subspace Clustering with Outliers

A Geometric Analysis of Subspace Clustering with Outliers A Geometric Analysis of Subspace Clustering with Outliers Mahdi Soltanolkotabi and Emmanuel Candés Stanford University Fundamental Tool in Data Mining : PCA Fundamental Tool in Data Mining : PCA Subspace

More information

UTS Library s Guide to Finding Evidence-Based Practice Resources

UTS Library s Guide to Finding Evidence-Based Practice Resources UTS Library s Guide to Finding Evidence-Based Practice Resources UTS: Library UTS Library s Health librarians have made this step-by-step guide to finding evidence-based practice resources using the PICO

More information

Microscopic Traffic Simulation

Microscopic Traffic Simulation Microscopic Traffic Simulation Lecture Notes in Transportation Systems Engineering Prof. Tom V. Mathew Contents Overview 2 Traffic Simulation Models 2 2. Need for simulation.................................

More information

Authenticating Nuclear Warheads With High Confidence

Authenticating Nuclear Warheads With High Confidence Authenticating Nuclear Warheads With High Confidence Moritz Kütt,, Sebastien Philippe, Boaz Barak, Alexander Glaser, and Robert J. Goldston, Princeton University Technische Universität Darmstadt, Germany

More information

INF 4300 Classification III Anne Solberg The agenda today:

INF 4300 Classification III Anne Solberg The agenda today: INF 4300 Classification III Anne Solberg 28.10.15 The agenda today: More on estimating classifier accuracy Curse of dimensionality and simple feature selection knn-classification K-means clustering 28.10.15

More information

ICTR UW Institute of Clinical and Translational Research. i2b2 User Guide. Version 1.0 Updated 9/11/2017

ICTR UW Institute of Clinical and Translational Research. i2b2 User Guide. Version 1.0 Updated 9/11/2017 ICTR UW Institute of Clinical and Translational Research i2b2 User Guide Version 1.0 Updated 9/11/2017 Table of Contents Background/Search Criteria... 2 Accessing i2b2... 3 Navigating the Workbench...

More information

PAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer. Stata Basics for PAM 4280/ECON 3710

PAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer. Stata Basics for PAM 4280/ECON 3710 PAM 4280/ECON 3710: The Economics of Risky Health Behaviors Fall 2015 Professor John Cawley TA Christine Coyer Stata Basics for PAM 4280/ECON 3710 I Introduction Stata is one of the most commonly used

More information

A Sequential, Multi-Complexity Topology Optimization Process for Aeroelastic Wing Structure Design

A Sequential, Multi-Complexity Topology Optimization Process for Aeroelastic Wing Structure Design A Sequential, Multi-Complexity Topology Optimization Process for Aeroelastic Wing Structure Design Bill Crossley, crossley@purdue.edu Significant content from graduate student Mark Guiles and input from

More information

Towards a practical simplex method for second order cone programming

Towards a practical simplex method for second order cone programming Towards a practical simplex method for second order cone programming Kartik Krishnan Department of Computing and Software McMaster University Joint work with Gábor Pataki (UNC), Neha Gupta (IIT Delhi),

More information

Chapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models

Chapter 16. Microscopic Traffic Simulation Overview Traffic Simulation Models Chapter 6 Microscopic Traffic Simulation 6. Overview The complexity of traffic stream behaviour and the difficulties in performing experiments with real world traffic make computer simulation an important

More information

Image Restoration and Background Separation Using Sparse Representation Framework

Image Restoration and Background Separation Using Sparse Representation Framework Image Restoration and Background Separation Using Sparse Representation Framework Liu, Shikun Abstract In this paper, we introduce patch-based PCA denoising and k-svd dictionary learning method for the

More information

Modelling Proportions and Count Data

Modelling Proportions and Count Data Modelling Proportions and Count Data Rick White May 4, 2016 Outline Analysis of Count Data Binary Data Analysis Categorical Data Analysis Generalized Linear Models Questions Types of Data Continuous data:

More information

Notes taken by Mea Wang. February 11, 2005

Notes taken by Mea Wang. February 11, 2005 CSC2411 - Linear Programming and Combinatorial Optimization Lecture 5: Smoothed Analysis, Randomized Combinatorial Algorithms, and Linear Programming Duality Notes taken by Mea Wang February 11, 2005 Summary:

More information

Topological Mapping. Discrete Bayes Filter

Topological Mapping. Discrete Bayes Filter Topological Mapping Discrete Bayes Filter Vision Based Localization Given a image(s) acquired by moving camera determine the robot s location and pose? Towards localization without odometry What can be

More information