Bayes Factor Single Arm Binary User s Guide (Version 1.0)

Similar documents
TTEDesigner User s Manual

Exact Calculation of Beta Inequalities

Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS

gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual

Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming

Binary Diagnostic Tests Clustered Samples

Authenticating Nuclear Warheads With High Confidence

Statistical Analysis of the 3WAY Block Cipher

Statistical Analysis of KASUMI Block Cipher

A GENERAL GIBBS SAMPLING ALGORITHM FOR ANALYZING LINEAR MODELS USING THE SAS SYSTEM

Package ThreeArmedTrials

A STATISTICAL INVESTIGATION INTO NONINFERIORITY TESTING FOR TWO BINOMIAL PROPORTIONS NICHOLAS BLOEDOW. B.S. University of Wisconsin-Oshkosh, 2011

Beyond the Assumption of Constant Hazard Rate in Estimating Incidence Rate on Current Status Data with Applications to Phase IV Cancer Trial

Statistical Analysis of the SHA-1 and SHA-2 Hash Functions

COPYRIGHTED MATERIAL. ExpDesign Studio 1.1 INTRODUCTION

Machine Learning A WS15/16 1sst KU Version: January 11, b) [1 P] For the probability distribution P (A, B, C, D) with the factorization

Project Report: "Bayesian Spam Filter"

R topics documented: 2 genbernoullidata

9.2 Types of Errors in Hypothesis testing

Testing the Randomness of Cryptographic Function Mappings

Package PTE. October 10, 2017

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

Package mmeta. R topics documented: March 28, 2017

Bayesian Inference for Sample Surveys

Manual for the R gaga package

Note Set 4: Finite Mixture Models and the EM Algorithm

Notation Index. Probability notation. (there exists) (such that) Fn-4 B n (Bell numbers) CL-27 s t (equivalence relation) GT-5.

Professor Stephen Sekula

CHAPTER 7. Copyright Cengage Learning. All rights reserved.

MS&E 226: Small Data

Grade 6 Integers. Answer the questions. Choose correct answer(s) from the given choices. For more such worksheets visit

SPERNER S LEMMA, BROUWER S FIXED-POINT THEOREM, AND THE SUBDIVISION OF SQUARES INTO TRIANGLES

Lecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem

The Optimal Discovery Procedure: A New Approach to Simultaneous Significance Testing

Package acebayes. R topics documented: November 21, Type Package

JeffTrial Subject Registration Clinical Coordinator Training. Kimmel Cancer Center JeffTrials version 13.0 Ver. 1.2

Machine Learning A W 1sst KU. b) [1 P] For the probability distribution P (A, B, C, D) with the factorization

Combining Biometric Scores in Identification Systems

Algebraic Expressions

Optimal designs for comparing curves

Design and Analysis Methods for Cluster Randomized Trials with Pair-Matching on Baseline Outcome: Reduction of Treatment Effect Variance

Package sp23design. R topics documented: February 20, Type Package

Math 7 Glossary Terms

Package NNTbiomarker

Estimating survival from Gray s flexible model. Outline. I. Introduction. I. Introduction. I. Introduction

Supervised vs unsupervised clustering

Regularization and model selection

An Implementation of a 5-term GFSR Random Number Generator for Parallel Computations

FLOATING POINT FORMATS

Work Instruction Study Setup

Package DSBayes. February 19, 2015

This online supplement includes four parts: (1) an introduction to SAB models; (2) an

Cardinality of Sets. Washington University Math Circle 10/30/2016

CSE548, AMS542: Analysis of Algorithms, Fall 2012 Date: October 16. In-Class Midterm. ( 11:35 AM 12:50 PM : 75 Minutes )

Project Paper Introduction

Image Segmentation Using Iterated Graph Cuts Based on Multi-scale Smoothing

Monte Carlo for Spatial Models

Client Dependent GMM-SVM Models for Speaker Verification

The exam is closed book, closed notes except your one-page (two-sided) cheat sheet.

Monte Carlo Methods and Statistical Computing: My Personal E

Generative and discriminative classification techniques

Research Participant Registration System (RPRS) Study Creation Guide

Bayesian Methods. David Rosenberg. April 11, New York University. David Rosenberg (New York University) DS-GA 1003 April 11, / 19

Math 340 Fall 2014, Victor Matveev. Binary system, round-off errors, loss of significance, and double precision accuracy.

Effective probabilistic stopping rules for randomized metaheuristics: GRASP implementations

CS6375: Machine Learning Gautam Kunapuli. Mid-Term Review

OPTIMIZATION OF RELIABILITY VERIFICATION TEST STRATEGIES

Odd-Numbered Answers to Exercise Set 1.1: Numbers

Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information

Multiple imputation using chained equations: Issues and guidance for practice

MoDisc User s Manual. Ranjan Srivastava

Package bayesdp. July 10, 2018

Unit 3: Multiplication and Division Reference Guide pages x 7 = 392 factors: 56, 7 product 392

PARAMETRIC MODEL SELECTION TECHNIQUES

How to Do Word Problems. Study of Integers

6.034 Quiz 2, Spring 2005

Manual for the R gaga package

2.1 Basics of Functions and Their Graphs

STAT 113: Lab 9. Colin Reimer Dawson. Last revised November 10, 2015

CGBoost: Conjugate Gradient in Function Space

8-1 Simple Trigonometric Equations. Objective: To solve simple Trigonometric Equations and apply them

Markov Chain Monte Carlo (part 1)

ALLIANCE CLINICAL RESEARCH PROFESSIONAL INFORMATION SESSION CTSU WEBSITE AND OPEN UPDATES

Properties. Comparing and Ordering Rational Numbers Using a Number Line

Physics 736. Experimental Methods in Nuclear-, Particle-, and Astrophysics. - Statistics and Error Analysis -

Multiple Comparisons of Treatments vs. a Control (Simulation)

Homework /681: Artificial Intelligence (Fall 2017) Out: October 18, 2017 Due: October 29, 2017 at 11:59PM

Math 231E, Lecture 34. Polar Coordinates and Polar Parametric Equations

In a two-way contingency table, the null hypothesis of quasi-independence. (QI) usually arises for two main reasons: 1) some cells involve structural

BIOS: 4120 Lab 11 Answers April 3-4, 2018

STATA Note 5. One sample binomial data Confidence interval for proportion Unpaired binomial data: 2 x 2 tables Paired binomial data

CCSSM Curriculum Analysis Project Tool 1 Interpreting Functions in Grades 9-12

An Efficient Model Selection for Gaussian Mixture Model in a Bayesian Framework

Package ssa. July 24, 2016

PIANOS requirements specifications

Analysis of Algorithms

Algebra 1. Standard 11 Operations of Expressions. Categories Combining Expressions Multiply Expressions Multiple Operations Function Knowledge

Probabilistic Learning Classification using Naïve Bayes

Polar Coordinates. Chapter 10: Parametric Equations and Polar coordinates, Section 10.3: Polar coordinates 27 / 45

1 Machine Learning System Design

Transcription:

Bayes Factor Single Arm Binary User s Guide (Version 1.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402, USA November 26, 2012

Contents 1 Method description 2 1.1 Method overview......................... 2 1.2 Bayesian hypothesis test..................... 2 1.3 Sampling density......................... 3 1.4 imom prior............................ 3 1.5 An example............................ 3 1.6 Stopping rules........................... 4 2 Running the software 4 2.1 Executing from the command line............... 4 2.2 Input parameters......................... 5 2.3 Output file............................ 6 3 References 7 4 Appendix 8 1

1 Method description 1.1 Method overview The BayesFactorBinary software implements a Bayesian hypothesis testbased method for trials with single arm binary patient outcomes as described in [1]. The software performs simulations to test a single arm binary endpoint based on a null hypothesis (H 0 ) such as: H 0 : The response rate using the treatment is 20%, versus an alternative hypothesis (H 1 ) such as: H 1 : The response rate using the treatment is 30%. Please note this is testing against a point null hypothesis. The Bayes Factor method described below is used to determine the odds in favor of the alternative hypothesis as each patient is treated and results observed during trial progression. 1.2 Bayesian hypothesis test In a Bayesian hypothesis test, the posterior odds in favor of the alternative hypothesis is a multiplication of the prior odds in favor of the alternative hypothesis and the Bayes factor. Usually, the prior odds in favor of the alternative hypothesis is set to be 1. P r(h 1 x) P r(h 0 x) = m 1(x) = Bayes Factor m 0 (x) where x represents the data, and m 1 (x) and m 0 (x) denote the marginal densities of the data defined under the alternative and null hypotheses, respectively. Bayes factor is the ratio of m 1 (x) and m 0 (x). As described in [1], the marginal density under the alternative hypothesis m 1 (x) is defined to be m 1 (x) = f(x θ)π 1 (θ)dθ Θ which is a weighted average of the sample density (f(x θ)) on the parameter of interest θ with respect to the prior density (π 1 (θ)) on θ. 2

1.3 Sampling density In the binary case, the number of m responses (successes) out of n treated subjects follows a Binomial distribution. Here θ is the probability of responses. ( ) n f(x θ) = θ m (1 θ) (n m) m 1.4 imom prior For the alternative hypothesis, the proposed class of nonlocal prior densities is used. This class of densities is called inverse moment (imom) densities. It takes the form (Equation (10) on page 149 in [2]) kτ ν/2 [ π I (θ; θ 0, k, ν, τ) = (θ θ0 ) 2] ν+1 2 exp{ [ (θ θ 0) 2 ] k } Γ(ν/2k) τ where k = 1 and ν = 2 as recommended. It s worthy pointing out that the parameter τ is calculated based on the point null hypothesis on θ, and the alternative prior mode at θ. Specifically, based on the Equation (5) on page 220 in [1], where the mode of the density function was deducted, we could solve for τ ( ) 2k 1/k τ = (ˆθ θ 0 ) 2 (1) ν + 1 where ˆθ is the mode of the prior density under H 1 on θ, and θ 0 is the null hypothesis on θ. 1.5 An example As an illustration of the method we use an example on page 221 in [1], the null and alternative hypotheses are defined as: H 0 : θ = 0.2 H 1 : π 1 (θ) π I (θ; θ 0 = 0.2, k = 1, ν = 2, τ = 0.015)I (0.2,1) (θ) where under the alternative hypothesis, the imom density has a mode at 0.2. Based on the point null of 0.2 for response rate and the mode of 0.3 3

for response rate under the alternative hypothesis, given k = 1 and ν = 2, we could calculated that τ = 0.015 based on formula (1). 1.6 Stopping rules The trial stops for inferiority if The trial stops for superiority if P r(h 1 Data) < C I P r(h 1 Data) > C S where C I is the cutoff for inferiority stopping, and C S is the cutoff for superiority stopping. Usually it is recommended that C I is a small number between 0 and 0.5, such as 0.1; C S is a number between 0.5 and 1.0, such as 0.9. C I and C S are tuned to get good operating characteristics. 2 Running the software 2.1 Executing from the command line BayesFactorBinary is a command line application. It runs on Microsoft Windows 7 using the.net framework and on Mac OS X and Linux using the Mono [3] framework. The software reads its input data from a user-prepared input file and writes its output (in HTML format) to an output file. A sample input file called InputFile BayesFactorBinary.txt is supplied along with the program. You may use this file as an example, and edit it to suit your specific trial. You may run the program from a command line like this: BayesFactorBinary.exe inputfile.txt outputfile.html where inputfile.txt and outputfile.html are file names supplied by you. If you do not provide a name for the output file, the default name will be OutputFile BayesFactorBinary.html. 4

2.2 Input parameters Here is a typical input file. Note that the program will ignore blank lines and all characters from the # symbol to the end of a line. (# is the comment character). ## Design Parameters ## 12345 # random number generator seed for simulation 50 # max number of patients 0.2 # null hypothesis of response rate 0.3 # alternative hypothesis of response rate 0.10 # stop for inferiority if Pr(H1 Data) is less than this value 0.90 # stop for superiority if Pr(H1 Data) is greater than this value 1000 # number of repetitions for the simulation Yes # Need stopping boundaries? (Could be "No", or "Yes") 10 # minimum number of patients to enroll before applying the stopping rules 5 # cohort size for monitoring the trial ################################################# # Scenario 1 0.2 # true response rate # Scenario 2 0.25 # true response rate # Scenario 3 0.3 # true response rate # Scenario 4 0.35 # true response rate # Scenario 5 0.4 # true response rate ###################### # END Random number generator seed: Any non-negative integer up to 4 billion. Maximum number of patients: Total number of patients enrolled if no early stopping rule applies, a non-negative integer up to 400. 5

Null hypothesis of response rate: Response rate under the null hypothesis, a fraction between 0 and 1. Alternative hypothesis of response rate: Response rate under the alternative hypothesis, must be greater than that under the null hypothesis and less than 1. Inferiority boundary: The trial will stop for inferiority if the posterior probability of the alternative hypothesis is less than this boundary value. This is typically a small positive value between 0 and 0.5, such as 0.10. If this boundary value is set to be 0, the trial would never stop due to inferiority. Superiority boundary: The trial will stop for superiority if the posterior probability of the alternative hypothesis is greater than this boundary value. This is typically a positive value between 0.5 and 1.0, such as 0.90. If this boundary value is set to be 1, the trial would never stop due to superiority. Number of repetitions for the simulation: Number of repetitions for the simulation. It must be positive and up to 100,000. Please note the limit (100,000) is configurable, meaning that you could change this limit in the BayesFactorBinary.exe.config file (included in the package). Need stopping boundaries? It could be No or Yes. This determines whether the program prints a table of stopping boundaries into the output file. Minimum number of patients to enroll before applying the stopping rules: The trial will run at least this number of patients before applying the stopping rules. If this number is greater than the cohort size, it must be a multiple of the cohort size. Cohort size for monitoring the trial: The trial will enroll patients with a cohort of this many patients at each enrollment, and consequently, the stopping rules will be applied for a complete cohort of patients. The maximum number of patients must be a multiple of the cohort size. Scenarios: A scenario is a true response rate. 2.3 Output file Please see the Appendix for the output html file. This file is in HTML format and may be viewed using your favorite web browser (Safari, Internet Explorer, etc). 6

We recommend saving the entire output of the BayesFactorBinary program and not just the operating characteristics. This assures that you have all the parameters that went into the design, including the version number of the software. 3 References [1] Johnson VE and Cook JD. Bayesian design of single-arm phase II clinical trials with continuous monitoring. Clinical Trials. 2009 Jun; 6(3): 217-26. [2] Johnson VE and Rossell D. On the use of non-local prior densities in Bayesian hypothesis tests. Journal of the Royal Statistical Society B. 72 : 143-170, 2010. [3] The Mono project is open source software that allows Microsoft.NET programs to run on Linux. More information is available at http://mono- Project.org 7

4 Appendix Output Report BayesFactorBinary (PID = 702) version 1.0.0 Bayes Factor One-arm Binary Outcome 11/14/2012 3:20:36 PM Random seed for simulation: 12345 Input file name: inputfile_bayesfactorbinary.txt Output file name: Output_BayesFactorBinary.html Design input parameters: Parameter Value Maximum number of patients 50 Null hypothesis of response rate 0.2 Alternative hypothesis of response rate 0.3 Cutoff to stop for inferiority (i.e., fail to reject H0) if Pr(H1 Data) < 0.1 Cutoff to stop for superiority (i.e., reject H0) if Pr(H1 Data) > 0.9 Number of repetitions for the simulation 1000 Do you need to print out the stopping boundaries Yes Minimum number of patients to enroll before applying the stopping rules 10 Cohort size for monitoring the trial 5 Note: H1 = alternative hypothesis ; H0 = null hypothesis. Simulation Results Sce- True response Pr(Stopping Pr(Stopping Average # patients treated nario rate for superiority) for inferiority) (10%, 25%,50%, 75%, 90%) 1 0.20 0.037 0.520 40.00 (20, 25, 45, 50, 50 ) 2 0.25 0.168 0.216 42.94 (25, 40, 50, 50, 50 ) 3 0.30 0.378 0.085 40.83 (20, 30, 50, 50, 50 ) 4 0.35 0.657 0.021 35.40 (15, 20, 35, 50, 50 ) 5 0.40 0.858 0.002 29.26 (10, 20, 30, 45, 50 ) 8

Stopping Boundaries # Patients Stop the trial Continue the trial Stop the trial (in complete for Inferiority if there are this for Superiority cohorts of 5) if there are many Successes if there are (inclusive) this many (inclusive) this many Successes Successes (inclusive) (inclusive) 10 Never stop 0-5 6-10 for inferiority with this many patients 15 0 1-7 8-15 20 0-1 2-8 9-20 25 0-3 4-10 11-25 30 0-4 5-11 12-30 35 0-5 6-12 13-35 40 0-6 7-14 15-40 45 0-8 9-15 16-45 50 0-9 10-16 (The trial always 17-50 stops at 50 patients, which is the maximum For example, when the current number of patients is 25, if we see the number of successes is between 0 and 3, we should stop the trial for inferiority; if the number of successes is between 4 and 10, we continue the trial; if the number of successes is between 11 and 25, we stop the trial for superiority. Please note that when the number of patients is 50, which is the maximum number of patients the trial allows, the trial always stops. However we nonetheless gave the boundaries for stopping for inferiority or superiority, so that the corresponding personnel may make appropriate final conclusion about the trial. That is, whether the final results show that the trial is or is not in favor of the treatment, or inconclusive. 9