Two-Stage Designs for Phase II Clinical Trials. by Chung-Li Tsai Advisor Kam-Fai Wong

Size: px
Start display at page:

Download "Two-Stage Designs for Phase II Clinical Trials. by Chung-Li Tsai Advisor Kam-Fai Wong"

Transcription

1 Two-Stage Designs for Phase II Clinical Trials by Chung-Li Tsai Advisor Kam-Fai Wong Institute of Statistics National University of Kaohsiung Kaohsiung, Taiwan 811, R.O.C. July 2006

2 Contents List of Figures ii List of Tables iii iv v 1 Introduction 1 2 Single-Stage Designs 4 3 Two-Stage Designs Optimal Two-Stage Designs Minimax Two-Stage Designs Two-Stage Designs of minimizing {(EN(p 0 ) + EN(p 1 ))/2} Two-Stage Designs of minimizing the maximum of EN(p 0 ) and EN(p 1 ) Compare with Above Criteria Control PET(p 0 ) Algorithm 19 5 Discussion 23 References 26 A Appendix: Single-Stage Designs 28 B Appendix: Two-Stage Designs with p 1 p 0 = C Appendix: Two-Stage Designs with p 1 p 0 = i

3 List of Figures 1 p EN(p) and p PET(p) Collocate four above designs with EN(p) ii

4 List of Tables Table 1: Three cases are belonged to S 1 (r 1, n 1 p 0 = 0.10, p 1 = 0.30, α = 0.10, β = 0.10) Table 2: Parts of Table A.1 with (p 0, p 1 ) = (0.05, 0.25) and (α, β) = (0.10, 0.10) 5 Table.3: Parts of Table B.2 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) 9 Table.4: Parts of Table B.1 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.05, 0.20) 10 Table.5: Parts of Table B.2 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.05, 0.20) 10 Table.6: Parts of Table B.2 and B.3 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) Table.7: Part of Table B.2 and B.4 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) Table.8: Parts of Table B.1, B.2, B.3 and B.4 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.10, 0.10) Table.9: Parts of Table B.1 and B.5 with (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10), (minimax) Table.10: Parts of Table B.2 and B.6 with (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10), (optimal) Table.11: Parts of Table B.3 and B.7 with (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10), (min{(en(p 0 ) + EN(p 1 ))/2} Table.12: Parts of Table B.4 and B.8 with (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10), (min max{en(p 0 ), EN(p 1 )}) iii

5 ÏÞÞ$ð'Œ ¼0>0? ` œ ŠÝ (p 0, p 1, α, β) D3 /) S = {(r 1, n 1, r 2, n 2 ) r 1 n 1, r 2 n 2, n 1 N, (r 1, r 2, n 2 ) N {0}}, ÍS âxb 'ÝlIClIIý0^ Þ$ð'Œ N ÍÞ$ð'ŒKE TëÍÙŒ 5½ÎTøÍÍóEN(p)@ 3Ï $ðý^ PET(p) CÀø Íón = n 1 + n 2 Ãyî ëíùœ!ýãj ÿõ8etýt Þ$ð' Œ Simon31989OèŒÝÞãJoptimalõ minimax OptimalÞ$ð'ŒÎÃyEN(p 0)ÿ Õ minimaxþ$ð'œjî ÊnõEN(p 0 ) Íp 0 Î Í ÝW ^ Í Z3 ÍðŒÝãJì ÍXETÝt Þ$ð'Œ M"DÍ8þF n"þ Û@ ÏÞ@ Þ$ð'Œ Š iv

6 Two-Stage Designs for Phase II Clinical Trials Advisor: Dr. Kam-Fai Wong Institute of Statistics National University of Kaohsiung Student: Chung-Li Tsai Institute of Statistics National University of Kaohsiung ABSTRACT Two-stage designs play an important role in the study of new drugs especially toxicity is the endpoint of interest. Given (p 0, p 1, α, β), there exists a set S = {(r 1, n 1, r 2, n 2 ) r 1 n 1, r 2 n 2, n 1 N, (r 1, r 2, n 2 ) N {0}} including all the probable two-stage designs which are satisfied with the two probabilities of error (α, β). Every two-stage designs lead to give an expected sample size EN(p), a probability of early termination PET(p), and a total sample size n = n 1 + n 2. A suggested design can be searched from S once a criterion is given based on EN(p), PET(p), n. Optimal and minimax two-stage designs have been discussed in the literature (Simon, 1989). The optimal designs are based on EN(p 0 ), and the minimax can be considered for n and EN(p 0 ) where p 0 is a given success probability. We find different designs under distinct criteria and these designs give us dissimilar advantage. Several designs under distinct criteria are tabulated for our interested parameters. Keywords: clinical trials, phase II trials, two-stage design v

7 1 Introduction In drug development, clinical trials are often classified clearly as phase I, II, III, and IV. Four types of clinical trial designs are commonly described as follows : Phase I Trials (Clinical Pharmacology and Toxicity) The primary goal of phase I trials is often used to determine maximum tolerated dose (MTD) of the treatment and to define the toxicities of the treatment. And the results supply direction for determining appropriate dose for use in phase II. Phase II Trials (Initial Clinical Investigation for Treatment Effect) The main purpose of phase II trials is a type of design with safety and efficacy estimation. Phase II clinical trials can be divided into two parts IIA and IIB. The studies designed to evaluate dosing are referred to as phase IIA studies, and studies designed to determine the effectiveness of the drug are called phase IIB. Phase III Trials (Full-scale Evaluation of Treatment) The primary objectives of phase III trials are gathered the additional information about effectiveness and safety needed to evaluate the overall benefit-risk relationship of the drug and provided an adequate basis for physician labeling. Phase IV Trials (Postmarketing Surveillance) The purpose for conducting phase IV trials is to elucidate further the incidence of adverse reactions and determine the effect of a drug on morbidity of mortality. In practice, phase IV are usually considered useful market-oriented comparison studies against competitor products. In Neyman(1950, pp ), the accepting and rejecting a statistical hypothesis are very convenient and are well established. It is important, however, to keep their exact meaning in mind and to discard various additional implications which may be suggested by intuition. Thus, to accept a hypothesis H means only to take an action A rather than 1

8 B. This does not mean that we necessarily believe that the hypothesis H is true. Also, if the application of the rule of inductive behavior rejects H, this means only that the rule prescribes action B and does not mean that we believe the H is false. In many studies especially in the phase II clinical trial, researchers may interest in deciding to have further investigation or to terminate the study based on the probability of success. In most of the cases, they wish to warrant further investigation if the probability of success is large enough or to end the study if the probability of success is too small. Usually two success probabilities p 0 and p 1 (p 0 < p 1 ) are specified so that and max P (to have further investigation p) α p p 0 max P (to terminate the sduty p) β, p p 1 where α, β are two given values corresponding to the probabilities of making inaccurate decisions. This can be done by considering the simplest hypothesis H 0 : p = p 0 versus H A : p = p 1 with rejection region R = {the number of success among n patients > c} which satisfy P (R p 0 ) α and P (R p 1 ) 1 β. Therefore, rejecting H 0 implies to have further investigation. Accepting H 0 implies to have early termination. It is noteworthy that we do not well define what is an inaccurate decisions if the probability of success is between p 0 and p 1 due to the limit of our knowledge. In fact, to reject H 0 means we have more than (1 α) 100% confidence that the probability of success is larger than p 0. To accept H 0 means we have more than (1 β) 100% confidence that the probability of success is smaller than p 1. In this thesis, when we say to accept H 0 means to decide early termination. When we say to reject H 0 means to have further investigation. Later, we review the single-stage designs in section 2. In section 3, we study the two-stage designs and discuss different 2

9 designs under distinct criteria. Then an algorithm of searching the best two-stage designs under different criteria is introduced in section 4. Finally, we give a discussion and final remarks in section 5. 3

10 2 Single-Stage Designs Suppose there are n 1 individuals in the study. Let X i = I(the ith individual is success), n 1 i = 1,..., n 1, and T 1 = X i. Clearly, T 1 has a binomial distribution B(n 1, p). Specifically, P (T 1 = x) = ( ) n1 p x (1 p) n1 x, x for x = 0, 1, 2,..., n 1. Let the rejection region R = {T 1 > r 1 }, then P (R p) = Given (p 0, p 1, α, β), there exists a set n 1 i=r 1 +1 ( n1 i ) p i (1 p) n 1 i. S 1 (r 1, n 1 p 0, p 1, α, β) = {(r 1, n 1 ) r 1 n 1, n 1 N, r 1 N {0}} such that for all (r 1, n 1 ) S 1 (r 1, n 1 p 0, p 1, α, β) P (R p 0 ) α and P (R p 1 ) 1 β. The optimal single-stage design is defined as (r 1, n 1 ) S 1 (r 1, n 1 p 0, p 1, α, β), which has the smallest value of n 1. Table 1: Three cases are belonged to S 1 (r 1, n 1 p 0 = 0.10, p 1 = 0.30, α = 0.10, β = 0.10) p 0 p 1 α β r 1 n 1 α β For example, (4, 25), (5, 29) and (6, 33) are belonged to S 1 (r 1, n 1 p 0 = 0.10, p 1 = 0.30, α = β = 0.10) since the probabilities of making inaccurate decisions, α and β, are smaller than α = β = 0.10 (see Table 1). As we can see, (4, 25) has the smallest total sample size n 1 among them. In fact, (4, 25) has the smallest value of n 1 among (r 1, n 1 ) S 1 (r 1, n 1 p 0 = 0.10, p 1 = 0.30). Therefore, given p 0 = 0.10, p 1 = 0.30 and α = β = 0.10, 4

11 the optimal single-stage design suggests to recruit 25 individuals in the study. That is, there are more than 4 individuals give positive responses, then reject H 0. Otherwise, accept H 0. To simplify the notation, we define (r s, n s ) = arg min n 1 S 1 (r 1, n 1 p 0, p 1, α, β) which has the smallest total sample size n 1 for all (r 1, n 1 ) S 1 (r 1, n 1 p 0, p 1, α, β). We give the result of the optimal single-stage designs with p 1 p 0 = 0.20 and p 1 p 0 = 0.15 in Table A.1 and A.2. For each (p 0, p 1 ), the optimal single-stage designs with (α, β) = (0.10, 0.10), (0.05, 0.20), (0.05, 0.10), and (0.05, 0.05), respectively, are given in four rows. The results which are tabulated include the critical point of decision r 1 and the total sample size n 1. Two exact probabilities of error (α, β ) are also on them. For example, the first row in Table A.1 means a single-stage design with p 0 = 0.05, p 1 = 0.25 and α = β = Table 2: Parts of Table A.1 with (p 0, p 1 ) = (0.05, 0.25) and (α, β) = (0.10, 0.10) p 0 p 1 α β r 1 n 1 α β Table A.1 The optimal single-stage design contains 20 individuals. If two or fewer positive responses are found, the study is suggested to terminate. Conversely, if more than two positive responses are seen in the trial, we conclude to go for further study. In this case, the two exact probabilities of error are α = and β = 0.091, respectively. 5

12 3 Two-Stage Designs Sometime, researchers prefer to recruit fewer individuals at the beginning of the trial especially toxicity is the endpoint of interest. If there are too fewer positive responses obviously, the clinical trial is terminated and conclude to accept H 0. If there are many positive responses, the clinical trial is also terminated but conclude to reject H 0. However, if the number of positive responses are fallen in a range, researchers will re-size individuals until decision of either accept or reject H 0 can be made. This leads to a two-stage design. Denote the number of individuals are studied in the first and the second stage by n 1 and n 2, respectively. Let r 1 and r 2 be the critical values needed to declare decision. Specifically, if r 1 or fewer positive responses are observed among the n 1 individuals in the first stage, we stop the trial and accept H 0. If more than r 1 + r 2 successes among the n 1 individuals in the first stage, we also stop the trial but reject H 0. Otherwise accrual continues to a total of n = n 1 + n 2 individuals. We give a general procedure of two-stage designs as follows: The first stage, n 1 if X i r 1, stop in the first stage and accept H 0 ; n 1 if X i > r 1 + r 2, stop in the first stage and reject H 0 ; n 1 if r 1 < X i r 1 + r 2, go to the second stage, The second stage, if if n 1 +n 2 n 1 +n 2 X i r 1 + r 2, accept H 0 ; X i > r 1 + r 2, reject H 0, That is, the rejection region is n 1 n 1 R = { X i > r 1 + r 2 or r 1 < X i r 1 + r 2 6 and n 1 +n 2 X i > r 1 + r 2 }.

13 Hence, the probability of reject H 0 with success probability p is P (R p) = 1 [B(r 1 ; p, n 1 ) + min(n 1,r) x=r 1 +1 b(x; p, n 1 )B(r x; p, n 2 )], (1) where b and B are the probability mass function and the cumulative density function of the binomial distribution, respectively, and r = r 1 + r 2. Similarly, given (p 0, p 1, α, β), there exists a set S 2 (r 1, n 1, r 2, n 2 p 0, p 1, α, β) = {(r 1, n 1, r 2, n 2 ) r 1 n 1, r 2 n 2, n 1 N, (r 1, r 2, n 2 ) N {0}} including all the probable two-stage designs which are satisfied with the two probabilities of error (α, β). As we can see, the total sample size of a two-stage design is either n 1 or n due to the number of positive responses in the first stage. Therefore, the expected total sample size n is related to the true probability of success. Let EN(p) be the expected sample size when response rate is equal to p. Then, EN(p) = n 1 + (1 PET(p))n 2 = n 1 (PET(p)) + n(1 PET(p)), where PET(p) is the probability of early termination after the first stage when response rate is p. At the end of the first stage, we will terminate the trial early and accept H 0 if r 1 or fewer positive responses are observed. We have enough evidence for making a decision to reject H 0 and the trial is unnecessary into the second stage when n 1 X i > r. As a result, PET(p) is given by B(r 1 ; p, n 1 ) + 1 B(r; p, n 1 ), if n 1 > r PET(p) = B(r 1 ; p, n 1 ), otherwise. Simon (1989) did not allow early termination for a high response rate and the probability of early termination was defined as PET(p) = B(r 1, p, n 1 ). 7

14 In other words, early acceptance of the drug is not permitted. Simon (1989) noted that when the drug has substantial activity ( p 1 ), there is often interesting in studying additional patients in order to estimate the proportion, extent and durability of response. However, it may not sufficiently improve efficiency of estimation for all scenarios, especially when size of n 2 is small related to n 1. For example, based on likelihood function L(p x) = ( ) n1 p x (1 p) n1 x = ˆp = x = x. x n 1 If we let p ˆp 0.05 under 95% confidence level, we need about n 1 = 385. In this thesis, we allow to have early acceptance of the drug. Later, we will review optimal and minimax two-stage designs which are given by Simon (1989) in section 3.1 and 3.2, respectively. In section 3.3 and 3.4, we consider two criteria such as minimize {(EN(p 0 ) + EN(p 1 ))/2} and minimize the maximum of EN(p 0 ) and EN(p 1 ). We compare these four designs in section 3.5. Finally, we control PET(p 0 ) 0.8 in section

15 3.1 Optimal Two-Stage Designs The optimal two-stage design is the two-stage design that the expected sample size is minimized when the response rate is p 0. It can be expressed as (r 1o, n 1o, r o, n o ) = arg min S 2 (r 1, n 1, r, n p 0, p 1, α, β) EN(p 0 ) which has the smallest expected sample size among (r 1, n 1, r, n) belong to S 2 (r 1, n 1, r, n p 0, p 1, α, β). For example, consider the case of (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10). Table.3: Parts of Table B.2 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Table /12 5/ B.2 The first stage includes 12 individuals. If one or fewer than one positive responses are observed, the trial is suggested to stop in the first stage and accept H 0. If more than one positive responses are observed, the trial is also suggested to stop in the first stage but conclude to reject H 0. Otherwise accrual continues to a total of 35 individuals. When response rate p = p 0, the expected sample size EN(p 0 ) = is really the smallest and smaller than that of the optimal single-stage design which has sample size n 1 = 25. However, if the true response rate p = p 1, the expected sample size EN(p 1 ) = which is larger than that of the optimal single-stage design. In fact, the true response rate is unknown. 9

16 3.2 Minimax Two-Stage Designs The minimax two-stage design is defined to minimize the total sample size n first and then minimize the expected sample size EN(p 0 ) later. Specifically, we represent it as (r 1m, n 1m, r m, n m ) = arg min min S 2 (r 1, n 1, r, n p 0, p 1, α, β). EN(p 0 ) n For example, consider the case of (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.05, 0.20). Table.4: Parts of Table B.1 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.05, 0.20) p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Table /18 10/ B.1 When response rate p = p 0, the expected sample size EN(p 0 ) = is smaller than the total sample size n 1 = 35 of the optimal single-stage design. Furthermore, the total sample size n = 33 is also smaller than that of the optimal single-stage design. In some cases, the minimax design may be more attractive than that with the minimum expected sample size. Consider the same case with optimal two-stage design as Table.4. Table.5: Parts of Table B.2 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.05, 0.20) p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Table /13 12/ B.2 The optimal design has an expected sample size under H 0 of and a total sample size of 43 (see Table.5). The minimax two-stage design has an expected sample size of and a total sample size of 33 (see Table.4). If the accrual rate is only ten patients per year, it could take 1 year longer to complete the optimal design than the minimax design. 10

17 3.3 Two-Stage Designs of minimizing {(EN(p 0 ) + EN(p 1 ))/2} In section 3.1, we always consider acting as the situation that the response probability is p 0. In fact, we do not know what the true response rate p is. We may have p 0 with 50% and p 1 with 50%. Here, we make an adjustment to consider minimizing {(EN(p 0 ) + EN(p 1 ))/2}. It can be expressed as (r 1a, n 1a, r a, n a ) = arg min {(EN(p 0 )+EN(p 1 ))/2} S 2(r 1, n 1, r, n p 0, p 1, α, β). This design takes advantage of EN(p 1 ) than optimal two-stage design. For example, consider the case of (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10). Table.6: Parts of Table B.2 and B.3 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Table /12 5/ B.2 1/16 4/ B.3 If the true response probability is p 1, the expected sample size of the optimal twostage design EN(p 1 ) = is bigger than the total sample of the optimal single-stage design which has the sample size n 1 = 25. However, the two stage design that minimize {(EN(p 0 ) + EN(p 1 ))/2} will have EN(p 0 ) = and EN(p 1 ) = which are both smaller than n 1 = 25. Although the expected sample size EN(p 0 ) for this design (EN(p 0 ) = 20.21) is larger than that of the optimal two-stage design (EN(p 0 ) = 19.83), it is only increasing by 1.9%. If we only control EN(p 0 ) but ignore EN(p 1 ), the twostage designs which we found may be give a large value of EN(p 1 ). Sometime, we can find a more robust design by small adjustment. Here, we only consider minimizing {(EN(p 0 ) + EN(p 1 ))/2}. In fact, it can be done by considering the prior distribution of the success probability p. 11

18 3.4 Two-Stage Designs of minimizing the maximum of EN(p 0 ) and EN(p 1 ) In some situation, we are interested in controlling the risk for different success probability simultaneously. This leads to the design that minimize the maximum value of EN(p) for some p, which form the formal minimax design we are familiar. In section 3.2, the minimax two-stage design given by Simon, in fact, is minimin. In this section, we consider minimizing the maximum of EN(p 0 ) and EN(p 1 ). It can be expressed as (r 1mm, n 1mm, r mm, n mm ) = arg min max{en(p 0 ),EN(p 1 )} S 2(r 1, n 1, r, n p 0, p 1, α, β). For example, consider the case of (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10). Table.7: Part of Table B.2 and B.4 with (p 0, p 1 ) = (0.10, 0.30) and (α, β) = (0.10, 0.10) p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Table /12 5/ B. 2 2/18 4/ B.4 If the true response probability is p 1, the expected sample size of the optimal twostage design EN(p 1 ) = is bigger than the total sample of the optimal single-stage design which has the sample size n 1 = 25. However, the two-stage design that minimize the maximum two values of EN(p 0 ) and EN(p 1 ) will have EN(p 0 ) = and EN(p 1 ) = which are both smaller than n 1 = 25. Although the expected sample size EN(p 0 ) for this design (EN(p 0 ) = 19.90) is larger than that of the optimal two-stage design (EN(p 0 ) = 19.83), it is only increasing by 0.4%. The expected sample size of the criterion minimizing the maximum two values of EN(p 0 ) and EN(p 1 ) is surely smaller than the total sample size of the optimal single-stage design. 12

19 3.5 Compare with Above Criteria In this section, we compare these four designs by an example as follows. Consider the case of (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.10, 0.10) Table.8: Parts of Table B.1, B.2, B.3 and B.4 with (p 0, p 1 ) = (0.20, 0.40) and (α, β) = (0.10, 0.10) r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) Criteria 3/19 10/ minimax 3/17 10/ optimal 6/25 10/ minimize {(EN(p 0 ) + EN(p 1 ))/2} 6/27 10/ minimize max{en(p 0 ), EN(p 1 )} Figure 1: p EN(p) and p PET(p) Four criteria (p EN(p)) Simon (p EN(p)) minimax optimal min[(en(p 0 )+EN(p 1 ))/2] min max[en(p 0 ),EN(p 1 )] Four criteria (p PET(p)) Simon (p PET(p)) We can see the curve p EN(p), p [0, 1], that the criterion minimizing the maximum of EN(p 0 ) and EN(p 1 ) has smooth but the optimal has quake. The minimax and the criterion minimizing the maximum of EN(p 0 ) and EN(p 1 ) have same design under PET(p) 13

20 which given by Simon. The optimal and the criterion minimizing {(EN(p 0 ) + EN(p 1 ))/2} also have same design under PET(p) which given by Simon. In these cases, we find some patterns as follows (see Figure 1). Minimax versus another three designs If p (0.35, 0.75), the EN(p) of optimal two-stage design is always bigger than minimax. If p (0.20, 0.60), the EN(p) of the criterion minimizing {(EN(p 0 )+EN(p 1 ))/2} is always smaller than minimax. If p (0.25, 0.65), the EN(p) of the criterion minimizing the maximum of EN(p 0 ) and EN(p 1 ) is always smaller than minimax. Optimal versus two designs which given by criteria minimizing {(EN(p 0 )+EN(p 1 ))/2} and minimizing the maximum of EN(p 0 ) and EN(p 1 ) If p (0.30, 0.65), the EN(p) of optimal two-stage design is always bigger than the criterion minimizing{(en(p 0 ) + EN(p 1 ))/2}. If p (0.30, 0.65), the EN(p) of optimal two-stage design is always bigger than the criterion minimizing the maximum of EN(p 0 ) and EN(p 1 ). Minimize {(EN(p 0 ) + EN(p 1 ))/2} versus minimize the maximum of EN(p 0 ) and EN(p 1 ) If p (0.30, 0.65), the EN(p) of the criterion minimizing {(EN(p 0 )+EN(p 1 ))/2} is always bigger than the criterion minimizing the maximum two values of EN(p 0 ) and EN(p 1 ). If p (0.35, 0.75), the EN(p) of optimal two-stage design is the biggest. If p 0.30 or p 0.75, the EN(p) of optimal two-stage design is the smallest. 14

21 We also find that under Simon s condition, these four curves are always non-decreasing (see Figure 2). In fact, the true response rate is unknown. The optimal two-stage design is not defined obviously. We find different designs under distinct criteria and these designs can give us dissimilar advantage. 15

22 3.6 Control PET(p 0 ) 0.80 The ethical imperative for early termination occurs when the drug has low activity especially toxicity is the endpoint of interest. If we only control minimizing the expected sample size EN(p 0 ), the trial also has high probability into the second stage under the drugs with toxicity. Therefore, we think that we consider the PET(p 0 ) 0.80 first and minimize the expected sample size EN(p 0 ) later. There exists a set S 2(r 1, n 1, r 2, n 2 p 0, p 1, α, β, PET(p 0 ) 0.80) S 2 (r 1, n 1, r 2, n 2 p 0, p 1, α, β) = {(r 1, n 1, r 2, n 2 ) r 1 n 1, r 2 n 2, n 1 N, (r 1, r 2, n 2 ) N {0}} Then, it can be expressed as min S EN(p 0 2(r 1, n 1, r 2, n 2 p 0, p 1, α, β, PET(p 0 ) 0.80) ) We collocate four above designs with EN(p 0 ) by four cases. Case 1: Consider the case of (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10) Table.9: Parts of Table B.1 and B.5 with (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10), (minimax) r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) minimax 7/28 15/ PET(p 0 ) /37 15/ PET(p 0 ) 0.80 Case 2: Consider the case of (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10) Table.10: Parts of Table B.2 and B.6 with (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10), (optimal) r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) optimal 6/11 26/ PET(p 0 ) /27 24/ PET(p 0 )

23 Case 3: Consider the case of (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10) Table.11: Parts of Table B.3 and B.7 with (p 0, p 1 ) = (0.60, 0.80) and (α, β) = (0.10, 0.10), (min{(en(p 0 ) + EN(p 1 ))/2}) r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) min{(en(p 0 ) + EN(p 1 ))/2} 9/16 25/ PET(p 0 ) /27 24/ PET(p 0 ) 0.80 Case 4: Consider the case of (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10) Table.12: Parts of Table B.4 and B.8 with (p 0, p 1 ) = (0.30, 0.50) and (α, β) = (0.10, 0.10), (min max{en(p 0 ), EN(p 1 )}) r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) min max{en(p 0 ), EN(p 1 )} 8/30 15/ PET(p 0 ) /34 16/ PET(p 0 ) 0.80 Figure 2: Collocate four above designs with EN(p) minimax optimal min[(en(p 0 )+EN(p 1 ))/2] 30 PET(p 0 ) 0.00 PET(p 0 ) min max[en(p 0 ),EN(p 1 )] From Figure 3, we find these curves which are always smooth under controlling PET(p 0 ) And under this condition, it brings a advantage which reduces the risks 17

24 into the second stage if the drugs with toxicity. If the true response rate is p 0, in case 1, the first row in Table.9, the trial is early terminated in the first stage with probability The second row in Table.9, the trial is early terminated in the first stage with probability If the true response rate is p 1, in case 1, the first row in Table.9, the trial is early terminated in the first stage with probability The second row in Table.9, the trial is early terminated in the first stage with probability Whether the true response rate is p 0 or p 1 under PET(p 0 ) 0.80, the trial is early terminated in the first stage with probability If the drugs with toxicity, the condition is very useful. 18

25 4 Algorithm Although S 2 (r 1, n 1, r, n p 0, p 1, α, β) has infinite probable combination, we can make an order to the elements in S 2 (r 1, n 1, r, n p 0, p 1, α, β). Here, the elements in S 2 (r 1, n 1, r, n p 0, p 1, α, β) are rank by n 1 followed with n, r 1 and r. It is trivial that EN(p) n 1 for all p [0, 1]. Therefore, only finite many elements in S 2 (r 1, n 1, r, n p 0, p 1, α, β) need to be checked. In this section, we give an algorithm to the criteria of minimizing EN(p 0 ). The idea can be extended to all the other criteria. Consider the formula (1) f p (n, r, n 1, r 1 ) = B(r 1 ; p, n 1 ) + min(n 1,r) x=r 1 +1 some properties we have found are as follows: b(x; p, n 1 )B(r x; p, n 2 ), Property 1. f p (n, r, n 1, r 1 ) f p (n, r, 0, 0). Proof : f p (n, r, 0, 0) = P ( = = n x i r) r n 1 P ( x i = j, j=0 r 1 j=0 r 1 j=0 n 1 P ( x i = j, n x i r) n x i r) + n 1 P ( x i = j) + n 1 = P ( x i r 1 ) + = f p (n, r, n 1, r 1 ) min(r,n 1 ) j=r 1 +1 min(r,n 1 ) j=r 1 +1 min(r,n 1 ) j=r 1 +1 n 1 P ( x i = j, n 1 P ( x i = j, n 1 P ( x i = j, n x i r) n x i r) n x i r) Property 2. f p (n + 1, r, n 1, r 1 ) f p (n, r, n 1, r 1 ). n 1 Proof : f p (n + 1, r, n 1, r 1 ) = P ( x i r 1 ) + n 1 P ( x i r 1 ) + = f p (n, r, n 1, r 1 ) min(r,n 1 ) j=r 1 +1 min(r,n 1 ) j=r 1 +1 n 1 n+1 P ( x i = j, x i r) n 1 P ( x i = j, n x i r) 19

26 Property 3. f p (n, r + 1, n 1, r 1 ) f p (n, r, n 1, r 1 ). n 1 Proof : f p (n, r + 1, n 1, r 1 ) = P ( x i r 1 ) + n 1 = P ( x i r 1 ) + min(r,n 1 ) + j=r 1 +1 min(r,n 1 ) j=r 1 +1 min(r,n 1 ) j=r 1 +1 n 1 P ( x i = j, = f p (n, r, n 1, r 1 ) + f p (n, r, n 1, r 1 ) min(r,n 1 ) j=r 1 +1 n 1 P ( x i = j, n 1 P ( x i = j, n x i = r + 1) n 1 P ( x i = j, n x i r + 1) n x i r) n x i = r + 1) Property 1 tells us that if f p1 (n, r, 0, 0) = β + ɛ, we are impossible to find any twostage design (r 1, n 1, r, n) satisfy f p1 (n, r, n 1, r 1 ) β. If f p0 (n, r 1, 0, 0) < 1 α, then any two-stage design (r1, n 1, r, n ) satisfy f p0 (n, r, n 1, r1) 1 α and f p1 (n, r, n 1, r1) β will imply n > n (Property 3). Furthermore, given (r 1, n 1 ), we can adjust (r, n) so that f p0 (n, r, n 1, r 1 ) 1 α and f p1 (n, r, n 1, r 1 ) β (Properties 2 and 3). We give an algorithm which uses by above properties in the following: Step 1: Given (p 0, p 1, α, β). Step 2: Find n s, r s satisfy f p0 (n s, r s, 0, 0) 1 α ɛ 1 and f p1 (n s, r s, 0, 0) β + ɛ 2, where ɛ 1, ɛ 2 > 0. Step 3: Let CN = n s, Cr = r s, Cn 1 = 0, Cr 1 = 0 and CEN =. Step 4: Let n 1 = 1, r 1 = 0, n = n s, r = r s and calculate PET(p 0 ). Step 5: If n 1 + (n s n 1 )[1 PET(p 0 )] CEN, we stop searching and output the recent optimal. Else, reset n = n s + 1, r = r and go to Step 6. Step 6: If f p0 (n, r, n 1, r 1 ) 1 α, go to Step 8. Else, reset r = r + 1 and go to Step 7. Step 7: If r n 1, go to Step 6. Else, reset n = n + 1, r = r and go to Step 6. 20

27 Step 8: If f p1 (n, r, n 1, r 1 ) β, calculate EN(p 0 ) based on n, r, n 1, r 1 and go to Step 11. Else, reset n = n + 1 and go to Step 9. Step 9: If n n max, where n max CEN n 1 [ 1 PET(p 0 ) + n 1], go to Step 6. Else, reset r 1 = r and go to Step 10. Step 10: If r 1 = n 1, reset n 1 = n 1 + 1, r 1 = 0 and go to Step 3. Else, go to Step 3. Step 11: If EN(p 0 ) CEN, record (n, r, n 1, r 1 ), reset r 1 = r and go to Step 10. Else, reset r 1 = r and go to Step

28 Start Find n, r s s satisfy f ( ) p ns, rs,0,0 1 α 1 0 ε and p ( ns, rs,0,0) β + 2 f ε ε 0 1 ε 1, 2 > CEN n + C,Cn = n,cr = r,cn = 0,Cr1 = s s s 1 = 0 Let n 1 = 1,r1 = 0,n = n,r = Calcuate PET p ( ) s r s 0 n = ns + 1, r = r No Yes EN( p 0 ) CEN Output * f p ( nr, n, ) 1 α, r Yes f p ( n, r, n ) β, r No n = n + 1 No Yes Yes n n max r = r + 1 ( ) Calculate EN p Based on n, r, n 0, r 1 1 No 1 = r1 + r 1 Yes r n 1 No No EN( p 0 ) CEN r 1 = n 1 No n = n + 1 r = r Yes Record n, r, n, r 1 1 Yes n 1 = n 1 + 1, r 1 = 0 22

29 5 Discussion One aspect of the Simon design rests with the concept that if the response rate is high, it might be in the best interest of the individuals not to close the study for significance after the first stage. This strategy seems to put more individuals on effective treatments, perhaps at the expense of doing fewer studies and/or delaying the release of promising results to the public. In fact, the true response rate p we do not know. If we only consider the true response rate p 0, it is improper. Simon optimal two-stage designs have a defect which the expected sample size EN(p 1 ) may be bigger than the total sample size of the optimal single-stage design. Therefore, we do an adjustment that we consider minimizing {(EN(p 0 )+EN(p 1 ))/2}. This consideration can really reduce the expected sample size EN(p 1 ). However, the expected sample size EN(p 1 ) is not always smaller than the total sample size of the optimal single-stage design. Then, we consider another criterion minimizing the maximumof EN(p 0 ) and EN(p 1 ) that the expected sample size EN(p) is surely smaller than the total sample size of the optimal single-stage design. Simon minimax designs are minimin in fact. We consider minimizing the maximum of EN(p 0 ) and EN(p 1 ) which is the minimax we are familiar. Under controlling PET(p 0 ) 0.80, we can prevent individuals from drugs with toxicity into the second stage. If the study has to take long time, choosing two-stage designs could be wasted double time than optimal single-stage design. It is not smart. But if the drugs with toxicity, it is reasonable to use two-stage designs. When to use optimal single-stage or two-stage designs is not fixed. This is because of the proper designs being considered the case. How to estimate the response probability p in two-stage designs is important. Suppose the number of patients n 1 and n 2 are studied in the first and second stage design, respectively. Let Y 1 be the number of successes with response probability p in the first stage. Not only Y 1 or fewer than r 1 but also n 1 > r are observed, the study is termi- 23

30 nated. Otherwise, the study is allowed to continue. The number of successes Y 2 with the same response probability p is observed in the second stage. We find the possible values of Y 1 which are truncated by r 1 and all of these range from r to min(n 1, r). The probability mass function of Y 1 is mentioned the truncated binomial distribution with response probability p. So, the probability mass function of Y 1 is ( ) n1 p y 1 (1 p) n 1 y 1 P (Y 1 = y 1 ) = min(n 1,r) k=r 1 +1 y 1 ( n1 k ) p k (1 p) n 1 k While Y 2 is a binomial distribution with same response probability p, and probability (2) mass function of Y 2 is P (Y 2 = y 2 ) = ( n2 y 2 ) p y 2 (1 p) n 2 y 2, (3) where y 2 = 0,..., n 2. Based on formula (2) and (3), we obtain the joint probability mass function of Y 1 and Y 2 is P (Y 1 = y 1, Y 2 = y 2 ) = P (Y 1 = y 1 )P (Y 2 = y 2 ) ( )( ) n1 n2 p y 1+y 2 (1 p) n 1+n 2 y 1 y 2 = y 1 y 2 min(n 1,r) k=r 1 +1 And the likelihood function can be expressed as ( n1 L(p) = P (Y 1 = y 1 )P (Y 2 = y 2 ), k ) p k (1 p) n 1 k. (4) then by (4), the above expression L(p) = min(n 1,r) k=r ) p k (1 p) n 1 k k ( n1 ( )( n1 n2 y 1 y 2 ) p y 1+y 2 (1 p) n 1+n 2 y 1 y 2. (5) It is hard to differentiate by (5), so we can easily differentiate the log likelihood function log L(p) = log min(n 1,r) k=r 1 +1 ( n1 log(1 p) + log k ) p k (1 p) n 1 k + (y 1 + y 2 ) log P + (n 1 + n 2 y 1 y 2 ) ( n1 y 1 ) + log ( n1 y 1 24 )

31 The first derivative of log L(p) with respective to p is min(n 1,r) ( ) n1 kp k 1 (1 p) n1 k k log L(p) k=r 1 +1 = p + y 1 + y 2 p min(n 1,r) k=r 1 +1 n 1 + n 2 y 1 y 2 1 p ( n1 k min(n 1,r) k=r 1 +1 ) p k (n 1 k)(1 p) n 1 k 1 k ( n1 ) p k (1 p) n 1 k Let (6) be equal to zero and solve this equation, but the solution does have close form. Hence, Newton-Raphson s algorithm can be considered to obtain numerical estimate of p. By the Newton-Raphson s iteration for p, we can get a numerical solution and it is the maximum likelihood estimator. (6) 25

32 References [1] Chang MN, Therneau TM, Wieand HS, Cha SS (1987): Designs for Group Sequential Phase II Clinical Trials. Biometrics, 43, [2] Colton T, McPherson K (1976): Two-Stage Plans Compared with Fixed-Sample- Size and Wald SPRT Plans. Journal of The American Statistical Association, 71, [3] Fleming TR (1982): One-Sample Multiple Testing Procedure for Phase II Clinical Trials. Biometrics, 38, [4] Herson J (1979): Predictive Probability Early Termination Plans for Phase II Clinical Trials. Biometrics, 35, [5] Jennison C (1987): Efficient Group Sequential Tests with Unpredictable Group Sizes. Biometrika, 74, [6] Jung SH, Lee Taiyeong, Kim KM and George SL (2004): Admissible Two-Stage Designs for Phase II Cancer Clinical Trials. Statistics In Medicine, 23, [7] Neyman J(1950): A First Course in Probability and Statistics. Henry Holt & Co., New York. [8] O Brien PC, Fleming TR (1979): A Multiple Testing Procedure for Clinical Trials. Biometrics, 35, [9] Schultz JR, Nichol FR, Elfring GL, Weed SD (1973): Multiple-Stage Procedures for Drug Screening. Biometics, 29, [10] Shuster J (2002): Optimal Two-Stage Designs for Single Arm Phase II Cancer Trials. Journal of Biopharmaceutical Statistics, 12,

33 [11] Simon R (1989): Optimal Two-Stage Designs for Phase II Clinical Trials. Controlled Clinical Trials, 10,

34 A Appendix: Single-Stage Designs Table A.1 Single-Stage Designs for p 1 p 0 = Table A.2 Single-Stage Designs for p 1 p 0 =

35 Table A.1 Single-Stage Designs for p 1 p 0 = 0.20 p 0 p 1 α β r 1 n 1 α β p 0 p 1 α β r 1 n 1 α β For each value of (p 0,p 1 ), the single-stage designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). For each design, (α, β ) denote the two exact probabilities of error (α,β). Table A.2 Single-Stage Designs for p 1 p 0 = 0.15 p 0 p 1 α β r 1 n 1 α β p 0 p 1 α β r 1 n 1 α β For each value of (p 0,p 1 ), the single-stage designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). For each design, (α, β ) denote the two exact probabilities of error (α,β). 29

36 B Appendix: Two-Stage Designs with p 1 p 0 = 0.20 Table B.1 Minimax Designs for p 1 p 0 = Table B.2 Optimal Designs for p 1 p 0 = Table B.3 Minimize {(EN(p 0 ) + EN(p 1 ))/2} with p 1 p 0 = Table B.4 Minimize the maximum of EN(p 0 ) and EN(p 1 ) with p 1 p 0 = Table B.5 Minimax Designs for p 1 p 0 = 0.20, PET(p 0 ) 0.8. Table B.6 Optimal Designs for p 1 p 0 = 0.20, PET(p 0 ) 0.8. Table B.7 Minimize {(EN(p 0 ) + EN(p 1 ))/2} with p 1 p 0 = 0.20, PET(p 0 ) 0.8. Table B.8 Minimize the maximum of EN(p 0 ) and EN(p 1 ) with p 1 p 0 = 0.20, PET(p 0 )

37 Table B.1 Minimax Designs for p 1 p 0 = 0.20 p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) /13 2/ /12 2/ /15 3/ /26 4/ /16 4/ /15 5/ /22 6/ /25 7/ /19 10/ /18 10/ /24 13/ /24 16/ /28 15/ /19 16/ /24 21/ /36 26/ /28 20/ /34 20/ /29 27/ /44 33/ /23 23/ /23 23/ /27 32/ /37 38/ /27 24/ /13 25/ /26 32/ /28 39/ /16 20/ /23 21/ /18 26/ /18 33/ For each value of (p 0,p 1 ), the designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). The expected sample size EN(p 0 ) and EN(p 1 ) and probability of terminating the trial at the end of the first stage PET(p 0 ) and PET(p 1 ) if the response probability is p 0 and p 1, respectively. For each design, (α, β ) denote the two exact probabilities of error (α,β). 31

38 Table B.2 Optimal Designs for p 1 p 0 = 0.20 p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) /9 2/ /9 2/ /9 3/ /18 4/ /12 5/ /10 5/ /18 6/ /20 8/ /17 10/ /13 12/ /19 15/ /28 17/ /22 17/ /15 18/ /24 24/ /34 27/ /18 22/ /16 23/ /25 32/ /29 38/ /21 26/ /15 26/ /24 36/ /27 45/ /11 26/ /11 30/ /19 37/ /28 46/ /9 22/ /6 22/ /15 29/ /18 35/ For each value of (p 0,p 1 ), the designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). The expected sample size EN(p 0 ) and EN(p 1 ) and probability of terminating the trial at the end of the first stage PET(p 0 ) and PET(p 1 ) if the response probability is p 0 and p 1, respectively. For each design, (α, β ) denote the two exact probabilities of error (α,β). 32

39 Table B.3 Minimize {(EN(p 0 ) + EN(p 1 ))/2} with p 1 p 0 = 0.20 p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) /11 2/ /9 2/ /15 3/ /19 4/ /16 4/ /15 5/ /19 6/ /25 7/ /25 10/ /18 10/ /30 13/ /39 16/ /30 15/ /19 16/ /24 21/ /39 26/ /33 20/ /17 21/ /29 27/ /39 35/ /17 24/ /16 24/ /27 32/ /37 38/ /16 25/ /13 25/ /14 33/ /28 39/ /9 22/ /6 22/ /18 26/ /18 35/ For each value of (p 0,p 1 ), the designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). The expected sample size EN(p 0 ) and EN(p 1 ) and probability of terminating the trial at the end of the first stage PET(p 0 ) and PET(p 1 ) if the response probability is p 0 and p 1, respectively. For each design, (α, β ) denote the two exact probabilities of error (α,β). 33

40 Table B.4 Minimize the maximum of EN(p 0 ) and EN(p 1 ) with p 1 p 0 = 0.20 p 0 p 1 α β r 1 /n 1 r/n EN(p 0 ) EN(p 1 ) α β PET(p 0 ) PET(p 1 ) /11 2/ /9 2/ /17 3/ /21 4/ /18 4/ /18 5/ /22 6/ /29 7/ /27 10/ /26 10/ /35 13/ /43 16/ /30 15/ /31 16/ /43 21/ /54 26/ /35 20/ /34 20/ /46 27/ /57 33/ /34 23/ /33 23/ /46 32/ /56 38/ /31 24/ /13 25/ /41 32/ /50 39/ /23 20/ /23 21/ /18 26/ /38 33/ For each value of (p 0,p 1 ), the designs are given for four sets of the two probabilities of error (α,β). The first, second, third and four rows correspond to error probabilities limits (0.10, 0.10), (0.05, 0.20), (0.05, 0.10) and (0.05, 0.05). The expected sample size EN(p 0 ) and EN(p 1 ) and probability of terminating the trial at the end of the first stage PET(p 0 ) and PET(p 1 ) if the response probability is p 0 and p 1, respectively. For each design, (α, β ) denote the two exact probabilities of error (α,β). 34

Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS

Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS MWSUG 2016 - Paper PH06 Frequentist and Bayesian Interim Analysis in Clinical Trials: Group Sequential Testing and Posterior Predictive Probability Monitoring Using SAS Kechen Zhao, University of Southern

More information

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR Detecting Missing and Spurious Edges in Large, Dense Networks Using Parallel Computing Samuel Coolidge, sam.r.coolidge@gmail.com Dan Simon, des480@nyu.edu Dennis Shasha, shasha@cims.nyu.edu Technical Report

More information

TTEDesigner User s Manual

TTEDesigner User s Manual TTEDesigner User s Manual John D. Cook Department of Biostatistics, Box 447 The University of Texas, M. D. Anderson Cancer Center 1515 Holcombe Blvd., Houston, Texas 77030, USA cook@mdanderson.org September

More information

Pre-control and Some Simple Alternatives

Pre-control and Some Simple Alternatives Pre-control and Some Simple Alternatives Stefan H. Steiner Dept. of Statistics and Actuarial Sciences University of Waterloo Waterloo, N2L 3G1 Canada Pre-control, also called Stoplight control, is a quality

More information

PASS Sample Size Software. Randomization Lists

PASS Sample Size Software. Randomization Lists Chapter 880 Introduction This module is used to create a randomization list for assigning subjects to one of up to eight groups or treatments. Six randomization algorithms are available. Four of the algorithms

More information

Module 7. Independent sets, coverings. and matchings. Contents

Module 7. Independent sets, coverings. and matchings. Contents Module 7 Independent sets, coverings Contents and matchings 7.1 Introduction.......................... 152 7.2 Independent sets and coverings: basic equations..... 152 7.3 Matchings in bipartite graphs................

More information

Slides 11: Verification and Validation Models

Slides 11: Verification and Validation Models Slides 11: Verification and Validation Models Purpose and Overview The goal of the validation process is: To produce a model that represents true behaviour closely enough for decision making purposes.

More information

Classification and Regression Trees

Classification and Regression Trees Classification and Regression Trees David S. Rosenberg New York University April 3, 2018 David S. Rosenberg (New York University) DS-GA 1003 / CSCI-GA 2567 April 3, 2018 1 / 51 Contents 1 Trees 2 Regression

More information

A noninformative Bayesian approach to small area estimation

A noninformative Bayesian approach to small area estimation A noninformative Bayesian approach to small area estimation Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 glen@stat.umn.edu September 2001 Revised May 2002 Research supported

More information

SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION

SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION CHAPTER 5 SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION Alessandro Artale UniBZ - http://www.inf.unibz.it/ artale/ SECTION 5.5 Application: Correctness of Algorithms Copyright Cengage Learning. All

More information

Table : IEEE Single Format ± a a 2 a 3 :::a 8 b b 2 b 3 :::b 23 If exponent bitstring a :::a 8 is Then numerical value represented is ( ) 2 = (

Table : IEEE Single Format ± a a 2 a 3 :::a 8 b b 2 b 3 :::b 23 If exponent bitstring a :::a 8 is Then numerical value represented is ( ) 2 = ( Floating Point Numbers in Java by Michael L. Overton Virtually all modern computers follow the IEEE 2 floating point standard in their representation of floating point numbers. The Java programming language

More information

COURSE: NUMERICAL ANALYSIS. LESSON: Methods for Solving Non-Linear Equations

COURSE: NUMERICAL ANALYSIS. LESSON: Methods for Solving Non-Linear Equations COURSE: NUMERICAL ANALYSIS LESSON: Methods for Solving Non-Linear Equations Lesson Developer: RAJNI ARORA COLLEGE/DEPARTMENT: Department of Mathematics, University of Delhi Page No. 1 Contents 1. LEARNING

More information

Lecture 2 September 3

Lecture 2 September 3 EE 381V: Large Scale Optimization Fall 2012 Lecture 2 September 3 Lecturer: Caramanis & Sanghavi Scribe: Hongbo Si, Qiaoyang Ye 2.1 Overview of the last Lecture The focus of the last lecture was to give

More information

Solutions to In-Class Problems Week 4, Fri

Solutions to In-Class Problems Week 4, Fri Massachusetts Institute of Technology 6.042J/18.062J, Fall 02: Mathematics for Computer Science Professor Albert Meyer and Dr. Radhika Nagpal Solutions to In-Class Problems Week 4, Fri Definition: The

More information

Lab 5 - Risk Analysis, Robustness, and Power

Lab 5 - Risk Analysis, Robustness, and Power Type equation here.biology 458 Biometry Lab 5 - Risk Analysis, Robustness, and Power I. Risk Analysis The process of statistical hypothesis testing involves estimating the probability of making errors

More information

COPYRIGHTED MATERIAL. ExpDesign Studio 1.1 INTRODUCTION

COPYRIGHTED MATERIAL. ExpDesign Studio 1.1 INTRODUCTION 1 ExpDesign Studio 1.1 INTRODUCTION ExpDesign Studio (ExpDesign) is an integrated environment for designing experiments or clinical trials. It is a powerful and user - friendly statistical software product

More information

Regularization and model selection

Regularization and model selection CS229 Lecture notes Andrew Ng Part VI Regularization and model selection Suppose we are trying select among several different models for a learning problem. For instance, we might be using a polynomial

More information

Approximability Results for the p-center Problem

Approximability Results for the p-center Problem Approximability Results for the p-center Problem Stefan Buettcher Course Project Algorithm Design and Analysis Prof. Timothy Chan University of Waterloo, Spring 2004 The p-center

More information

Ramsey s Theorem on Graphs

Ramsey s Theorem on Graphs Ramsey s Theorem on Graphs 1 Introduction Exposition by William Gasarch Imagine that you have 6 people at a party. We assume that, for every pair of them, either THEY KNOW EACH OTHER or NEITHER OF THEM

More information

SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION

SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION CHAPTER 5 SEQUENCES, MATHEMATICAL INDUCTION, AND RECURSION Copyright Cengage Learning. All rights reserved. SECTION 5.5 Application: Correctness of Algorithms Copyright Cengage Learning. All rights reserved.

More information

1 Definition of Reduction

1 Definition of Reduction 1 Definition of Reduction Problem A is reducible, or more technically Turing reducible, to problem B, denoted A B if there a main program M to solve problem A that lacks only a procedure to solve problem

More information

!"# $ # # $ $ % $ &% $ '"# $ ()&*&)+(( )+(( )

!# $ # # $ $ % $ &% $ '# $ ()&*&)+(( )+(( ) !"# # # % &% '"# ) !#, ' "# " "# -. / # 0 0 0 0 0 "0 "# " # 1 #! " " 0 0 0 0 0 0 2# 0 # # 3 ' 4 56 7-56 87 9# 5 6 7 6 & 0 " : 9 ; 4 " #! 0 - '% # % "# " "# " < 4 "! % " % 4 % % 9# 4 56 87 = 4 > 0 " %!#

More information

Chapter 1. Math review. 1.1 Some sets

Chapter 1. Math review. 1.1 Some sets Chapter 1 Math review This book assumes that you understood precalculus when you took it. So you used to know how to do things like factoring polynomials, solving high school geometry problems, using trigonometric

More information

Categorical Data in a Designed Experiment Part 2: Sizing with a Binary Response

Categorical Data in a Designed Experiment Part 2: Sizing with a Binary Response Categorical Data in a Designed Experiment Part 2: Sizing with a Binary Response Authored by: Francisco Ortiz, PhD Version 2: 19 July 2018 Revised 18 October 2018 The goal of the STAT COE is to assist in

More information

Verifying Safety Property of Lustre Programs: Temporal Induction

Verifying Safety Property of Lustre Programs: Temporal Induction 22c181: Formal Methods in Software Engineering The University of Iowa Spring 2008 Verifying Safety Property of Lustre Programs: Temporal Induction Copyright 2008 Cesare Tinelli. These notes are copyrighted

More information

CHAPTER 2. Graphs. 1. Introduction to Graphs and Graph Isomorphism

CHAPTER 2. Graphs. 1. Introduction to Graphs and Graph Isomorphism CHAPTER 2 Graphs 1. Introduction to Graphs and Graph Isomorphism 1.1. The Graph Menagerie. Definition 1.1.1. A simple graph G = (V, E) consists of a set V of vertices and a set E of edges, represented

More information

Dealing with Categorical Data Types in a Designed Experiment

Dealing with Categorical Data Types in a Designed Experiment Dealing with Categorical Data Types in a Designed Experiment Part II: Sizing a Designed Experiment When Using a Binary Response Best Practice Authored by: Francisco Ortiz, PhD STAT T&E COE The goal of

More information

Modeling and Performance Analysis with Discrete-Event Simulation

Modeling and Performance Analysis with Discrete-Event Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation Chapter 10 Verification and Validation of Simulation Models Contents Model-Building, Verification, and Validation Verification

More information

Introduction to Graph Theory

Introduction to Graph Theory Introduction to Graph Theory George Voutsadakis 1 1 Mathematics and Computer Science Lake Superior State University LSSU Math 351 George Voutsadakis (LSSU) Introduction to Graph Theory August 2018 1 /

More information

Math 5593 Linear Programming Lecture Notes

Math 5593 Linear Programming Lecture Notes Math 5593 Linear Programming Lecture Notes Unit II: Theory & Foundations (Convex Analysis) University of Colorado Denver, Fall 2013 Topics 1 Convex Sets 1 1.1 Basic Properties (Luenberger-Ye Appendix B.1).........................

More information

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data

Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Incorporating Known Pathways into Gene Clustering Algorithms for Genetic Expression Data Ryan Atallah, John Ryan, David Aeschlimann December 14, 2013 Abstract In this project, we study the problem of classifying

More information

A SHARKOVSKY THEOREM FOR VERTEX MAPS ON TREES

A SHARKOVSKY THEOREM FOR VERTEX MAPS ON TREES A SHARKOVSKY THEOREM FOR VERTEX MAPS ON TREES CHRIS BERNHARDT Abstract. Let T be a tree with n vertices. Let f : T T be continuous and suppose that the n vertices form a periodic orbit under f. We show:

More information

Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming

Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Optimal Two-Stage Adaptive Enrichment Designs, using Sparse Linear Programming Michael Rosenblum Johns Hopkins Bloomberg School of Public Health Joint work with Xingyuan (Ethan) Fang and Han Liu Princeton

More information

Greedy Algorithms 1 {K(S) K(S) C} For large values of d, brute force search is not feasible because there are 2 d {1,..., d}.

Greedy Algorithms 1 {K(S) K(S) C} For large values of d, brute force search is not feasible because there are 2 d {1,..., d}. Greedy Algorithms 1 Simple Knapsack Problem Greedy Algorithms form an important class of algorithmic techniques. We illustrate the idea by applying it to a simplified version of the Knapsack Problem. Informally,

More information

Optimal designs for comparing curves

Optimal designs for comparing curves Optimal designs for comparing curves Holger Dette, Ruhr-Universität Bochum Maria Konstantinou, Ruhr-Universität Bochum Kirsten Schorning, Ruhr-Universität Bochum FP7 HEALTH 2013-602552 Outline 1 Motivation

More information

Roundoff Errors and Computer Arithmetic

Roundoff Errors and Computer Arithmetic Jim Lambers Math 105A Summer Session I 2003-04 Lecture 2 Notes These notes correspond to Section 1.2 in the text. Roundoff Errors and Computer Arithmetic In computing the solution to any mathematical problem,

More information

Lecture 19. Lecturer: Aleksander Mądry Scribes: Chidambaram Annamalai and Carsten Moldenhauer

Lecture 19. Lecturer: Aleksander Mądry Scribes: Chidambaram Annamalai and Carsten Moldenhauer CS-621 Theory Gems November 21, 2012 Lecture 19 Lecturer: Aleksander Mądry Scribes: Chidambaram Annamalai and Carsten Moldenhauer 1 Introduction We continue our exploration of streaming algorithms. First,

More information

Line Graphs and Circulants

Line Graphs and Circulants Line Graphs and Circulants Jason Brown and Richard Hoshino Department of Mathematics and Statistics Dalhousie University Halifax, Nova Scotia, Canada B3H 3J5 Abstract The line graph of G, denoted L(G),

More information

Lecture : Topological Space

Lecture : Topological Space Example of Lecture : Dr. Department of Mathematics Lovely Professional University Punjab, India October 18, 2014 Outline Example of 1 2 3 Example of 4 5 6 Example of I Topological spaces and continuous

More information

Indenting with Style

Indenting with Style ABSTRACT Indenting with Style Bill Coar, Axio Research, Seattle, WA Within the pharmaceutical industry, many SAS programmers rely heavily on Proc Report. While it is used extensively for summary tables

More information

Lecture 25: Bezier Subdivision. And he took unto him all these, and divided them in the midst, and laid each piece one against another: Genesis 15:10

Lecture 25: Bezier Subdivision. And he took unto him all these, and divided them in the midst, and laid each piece one against another: Genesis 15:10 Lecture 25: Bezier Subdivision And he took unto him all these, and divided them in the midst, and laid each piece one against another: Genesis 15:10 1. Divide and Conquer If we are going to build useful

More information

Handout 9: Imperative Programs and State

Handout 9: Imperative Programs and State 06-02552 Princ. of Progr. Languages (and Extended ) The University of Birmingham Spring Semester 2016-17 School of Computer Science c Uday Reddy2016-17 Handout 9: Imperative Programs and State Imperative

More information

LP-Modelling. dr.ir. C.A.J. Hurkens Technische Universiteit Eindhoven. January 30, 2008

LP-Modelling. dr.ir. C.A.J. Hurkens Technische Universiteit Eindhoven. January 30, 2008 LP-Modelling dr.ir. C.A.J. Hurkens Technische Universiteit Eindhoven January 30, 2008 1 Linear and Integer Programming After a brief check with the backgrounds of the participants it seems that the following

More information

CS321 Introduction To Numerical Methods

CS321 Introduction To Numerical Methods CS3 Introduction To Numerical Methods Fuhua (Frank) Cheng Department of Computer Science University of Kentucky Lexington KY 456-46 - - Table of Contents Errors and Number Representations 3 Error Types

More information

Lecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem

Lecture 10: SVM Lecture Overview Support Vector Machines The binary classification problem Computational Learning Theory Fall Semester, 2012/13 Lecture 10: SVM Lecturer: Yishay Mansour Scribe: Gitit Kehat, Yogev Vaknin and Ezra Levin 1 10.1 Lecture Overview In this lecture we present in detail

More information

Verifying a Border Array in Linear Time

Verifying a Border Array in Linear Time Verifying a Border Array in Linear Time František Franěk Weilin Lu P. J. Ryan W. F. Smyth Yu Sun Lu Yang Algorithms Research Group Department of Computing & Software McMaster University Hamilton, Ontario

More information

1 Variations of the Traveling Salesman Problem

1 Variations of the Traveling Salesman Problem Stanford University CS26: Optimization Handout 3 Luca Trevisan January, 20 Lecture 3 In which we prove the equivalence of three versions of the Traveling Salesman Problem, we provide a 2-approximate algorithm,

More information

The Largest Pure Partial Planes of Order 6 Have Size 25

The Largest Pure Partial Planes of Order 6 Have Size 25 The Largest Pure Partial Planes of Order 6 Have Size 5 Yibo Gao arxiv:1611.00084v1 [math.co] 31 Oct 016 Abstract In this paper, we prove that the largest pure partial plane of order 6 has size 5. At the

More information

Online Graph Exploration

Online Graph Exploration Distributed Computing Online Graph Exploration Semester thesis Simon Hungerbühler simonhu@ethz.ch Distributed Computing Group Computer Engineering and Networks Laboratory ETH Zürich Supervisors: Sebastian

More information

Automatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic

Automatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic Automatic Machinery Fault Detection and Diagnosis Using Fuzzy Logic Chris K. Mechefske Department of Mechanical and Materials Engineering The University of Western Ontario London, Ontario, Canada N6A5B9

More information

Zambia Medicines Regulatory Authority APPLICATION FOR MARKETING AUTHORISATION OF A MEDICINE FOR HUMAN USE

Zambia Medicines Regulatory Authority APPLICATION FOR MARKETING AUTHORISATION OF A MEDICINE FOR HUMAN USE Zambia Medicines Regulatory Authority APPLICATION FOR MARKETING AUTHORISATION OF A MEDICINE FOR HUMAN USE COMMON TECHNICAL DOCUMENT FORMAT ZAMBIA Module 1 CTD-Modules 2-5 Version 03, April 2015 ZAMBIA

More information

CS103 Spring 2018 Mathematical Vocabulary

CS103 Spring 2018 Mathematical Vocabulary CS103 Spring 2018 Mathematical Vocabulary You keep using that word. I do not think it means what you think it means. - Inigo Montoya, from The Princess Bride Consider the humble while loop in most programming

More information

Sketchable Histograms of Oriented Gradients for Object Detection

Sketchable Histograms of Oriented Gradients for Object Detection Sketchable Histograms of Oriented Gradients for Object Detection No Author Given No Institute Given Abstract. In this paper we investigate a new representation approach for visual object recognition. The

More information

Flexibility and Robustness of Hierarchical Fuzzy Signature Structures with Perturbed Input Data

Flexibility and Robustness of Hierarchical Fuzzy Signature Structures with Perturbed Input Data Flexibility and Robustness of Hierarchical Fuzzy Signature Structures with Perturbed Input Data B. Sumudu U. Mendis Department of Computer Science The Australian National University Canberra, ACT 0200,

More information

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems

RSM Split-Plot Designs & Diagnostics Solve Real-World Problems RSM Split-Plot Designs & Diagnostics Solve Real-World Problems Shari Kraber Pat Whitcomb Martin Bezener Stat-Ease, Inc. Stat-Ease, Inc. Stat-Ease, Inc. 221 E. Hennepin Ave. 221 E. Hennepin Ave. 221 E.

More information

Authenticating Nuclear Warheads With High Confidence

Authenticating Nuclear Warheads With High Confidence Authenticating Nuclear Warheads With High Confidence Moritz Kütt,, Sebastien Philippe, Boaz Barak, Alexander Glaser, and Robert J. Goldston, Princeton University Technische Universität Darmstadt, Germany

More information

4. Simplicial Complexes and Simplicial Homology

4. Simplicial Complexes and Simplicial Homology MATH41071/MATH61071 Algebraic topology Autumn Semester 2017 2018 4. Simplicial Complexes and Simplicial Homology Geometric simplicial complexes 4.1 Definition. A finite subset { v 0, v 1,..., v r } R n

More information

5 Matchings in Bipartite Graphs and Their Applications

5 Matchings in Bipartite Graphs and Their Applications 5 Matchings in Bipartite Graphs and Their Applications 5.1 Matchings Definition 5.1 A matching M in a graph G is a set of edges of G, none of which is a loop, such that no two edges in M have a common

More information

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Approximation algorithms Date: 11/27/18

/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Approximation algorithms Date: 11/27/18 601.433/633 Introduction to Algorithms Lecturer: Michael Dinitz Topic: Approximation algorithms Date: 11/27/18 22.1 Introduction We spent the last two lectures proving that for certain problems, we can

More information

nquery Sample Size & Power Calculation Software Validation Guidelines

nquery Sample Size & Power Calculation Software Validation Guidelines nquery Sample Size & Power Calculation Software Validation Guidelines Every nquery sample size table, distribution function table, standard deviation table, and tablespecific side table has been tested

More information

Semantics via Syntax. f (4) = if define f (x) =2 x + 55.

Semantics via Syntax. f (4) = if define f (x) =2 x + 55. 1 Semantics via Syntax The specification of a programming language starts with its syntax. As every programmer knows, the syntax of a language comes in the shape of a variant of a BNF (Backus-Naur Form)

More information

Comparing and Combining Dichotomous and Polytomous Items with SPRT Procedure in Computerized Classification Testing

Comparing and Combining Dichotomous and Polytomous Items with SPRT Procedure in Computerized Classification Testing Comparing and Combining Dichotomous and Polytomous Items with SPRT Procedure in Computerized Classification Testing C. Allen Lau Harcourt Brace Educational Measurement Tianyou Wang ACT Paper presented

More information

4.3 The Normal Distribution

4.3 The Normal Distribution 4.3 The Normal Distribution Objectives. Definition of normal distribution. Standard normal distribution. Specialties of the graph of the standard normal distribution. Percentiles of the standard normal

More information

CS264: Homework #4. Due by midnight on Wednesday, October 22, 2014

CS264: Homework #4. Due by midnight on Wednesday, October 22, 2014 CS264: Homework #4 Due by midnight on Wednesday, October 22, 2014 Instructions: (1) Form a group of 1-3 students. You should turn in only one write-up for your entire group. (2) Turn in your solutions

More information

A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY

A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY Paper # DM08 A TEXT MINER ANALYSIS TO COMPARE INTERNET AND MEDLINE INFORMATION ABOUT ALLERGY MEDICATIONS Chakib Battioui, University of Louisville, Louisville, KY ABSTRACT Recently, the internet has become

More information

Byzantine Consensus in Directed Graphs

Byzantine Consensus in Directed Graphs Byzantine Consensus in Directed Graphs Lewis Tseng 1,3, and Nitin Vaidya 2,3 1 Department of Computer Science, 2 Department of Electrical and Computer Engineering, and 3 Coordinated Science Laboratory

More information

COVERING SPACES AND SUBGROUPS OF THE FREE GROUP

COVERING SPACES AND SUBGROUPS OF THE FREE GROUP COVERING SPACES AND SUBGROUPS OF THE FREE GROUP SAMANTHA NIEVEEN AND ALLISON SMITH Adviser: Dennis Garity Oregon State University Abstract. In this paper we will use the known link between covering spaces

More information

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY

A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY A GRAPH FROM THE VIEWPOINT OF ALGEBRAIC TOPOLOGY KARL L. STRATOS Abstract. The conventional method of describing a graph as a pair (V, E), where V and E repectively denote the sets of vertices and edges,

More information

Interactive Treatment Planning in Cancer Radiotherapy

Interactive Treatment Planning in Cancer Radiotherapy Interactive Treatment Planning in Cancer Radiotherapy Mohammad Shakourifar Giulio Trigila Pooyan Shirvani Ghomi Abraham Abebe Sarah Couzens Laura Noreña Wenling Shang June 29, 212 1 Introduction Intensity

More information

Lecture 6: Graph Properties

Lecture 6: Graph Properties Lecture 6: Graph Properties Rajat Mittal IIT Kanpur In this section, we will look at some of the combinatorial properties of graphs. Initially we will discuss independent sets. The bulk of the content

More information

On Covering a Graph Optimally with Induced Subgraphs

On Covering a Graph Optimally with Induced Subgraphs On Covering a Graph Optimally with Induced Subgraphs Shripad Thite April 1, 006 Abstract We consider the problem of covering a graph with a given number of induced subgraphs so that the maximum number

More information

demand point given tracks

demand point given tracks 2 Introduction Establishing stops (or stations) within a transportation network is fundamental for offering public transportation service, since stops are an important part of the PTN. But it is not clear

More information

P Is Not Equal to NP. ScholarlyCommons. University of Pennsylvania. Jon Freeman University of Pennsylvania. October 1989

P Is Not Equal to NP. ScholarlyCommons. University of Pennsylvania. Jon Freeman University of Pennsylvania. October 1989 University of Pennsylvania ScholarlyCommons Technical Reports (CIS) Department of Computer & Information Science October 1989 P Is Not Equal to NP Jon Freeman University of Pennsylvania Follow this and

More information

Recursive Functions. 6.1 Primitive Recursive Functions

Recursive Functions. 6.1 Primitive Recursive Functions 6 Recursive Functions The intuitive notion of an effectively computable function is the notion of a function for which there are definite, explicit rules, following which one could in principle compute

More information

Climbing the Kaggle Leaderboard by Exploiting the Log-Loss Oracle

Climbing the Kaggle Leaderboard by Exploiting the Log-Loss Oracle Climbing the Kaggle Leaderboard by Exploiting the Log-Loss Oracle Jacob Whitehill jrwhitehill@wpi.edu Worcester Polytechnic Institute https://arxiv.org/abs/1707.01825 Machine learning competitions Data-mining

More information

A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems

A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics

More information

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of

More information

Let denote the number of partitions of with at most parts each less than or equal to. By comparing the definitions of and it is clear that ( ) ( )

Let denote the number of partitions of with at most parts each less than or equal to. By comparing the definitions of and it is clear that ( ) ( ) Calculating exact values of without using recurrence relations This note describes an algorithm for calculating exact values of, the number of partitions of into distinct positive integers each less than

More information

1. Suppose you are given a magic black box that somehow answers the following decision problem in polynomial time:

1. Suppose you are given a magic black box that somehow answers the following decision problem in polynomial time: 1. Suppose you are given a magic black box that somehow answers the following decision problem in polynomial time: Input: A CNF formula ϕ with n variables x 1, x 2,..., x n. Output: True if there is an

More information

humor... May 3, / 56

humor... May 3, / 56 humor... May 3, 2017 1 / 56 Power As discussed previously, power is the probability of rejecting the null hypothesis when the null is false. Power depends on the effect size (how far from the truth the

More information

3760 bcrm Defining and Running Simulations

3760 bcrm Defining and Running Simulations bcrm Defining and Running Simulations Tessella Project Number 3760 Tessella Support Services plc e-mail: post@tessella.com Head Office: Abingdon 3 Vineyard Chambers, Abingdon, Oxfordshire OX14 3PX, England

More information

Variable Selection 6.783, Biomedical Decision Support

Variable Selection 6.783, Biomedical Decision Support 6.783, Biomedical Decision Support (lrosasco@mit.edu) Department of Brain and Cognitive Science- MIT November 2, 2009 About this class Why selecting variables Approaches to variable selection Sparsity-based

More information

Analysis of variance - ANOVA

Analysis of variance - ANOVA Analysis of variance - ANOVA Based on a book by Julian J. Faraway University of Iceland (UI) Estimation 1 / 50 Anova In ANOVAs all predictors are categorical/qualitative. The original thinking was to try

More information

HUMAN CENTERED DESIGN

HUMAN CENTERED DESIGN HUMAN CENTERED DESIGN Ed Israelski PhD CHFP, Convener of Usability and Ergonomics Working Groups IEC/ISO Retired Director Human Factors, AbbVie Past Co-Chair HFE Committee, AAMI HFE Consultant Roundtable

More information

Computing intersections in a set of line segments: the Bentley-Ottmann algorithm

Computing intersections in a set of line segments: the Bentley-Ottmann algorithm Computing intersections in a set of line segments: the Bentley-Ottmann algorithm Michiel Smid October 14, 2003 1 Introduction In these notes, we introduce a powerful technique for solving geometric problems.

More information

Trees. 3. (Minimally Connected) G is connected and deleting any of its edges gives rise to a disconnected graph.

Trees. 3. (Minimally Connected) G is connected and deleting any of its edges gives rise to a disconnected graph. Trees 1 Introduction Trees are very special kind of (undirected) graphs. Formally speaking, a tree is a connected graph that is acyclic. 1 This definition has some drawbacks: given a graph it is not trivial

More information

gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual

gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual gsdesign: An R Package for Designing Group Sequential Clinical Trials Version 2.0 Manual Keaven M. Anderson, Dan (Jennifer) Sun, Zhongxin (John) Zhang Merck Research Laboratories December 18, 2008 Abstract

More information

Planar graphs. Math Prof. Kindred - Lecture 16 Page 1

Planar graphs. Math Prof. Kindred - Lecture 16 Page 1 Planar graphs Typically a drawing of a graph is simply a notational shorthand or a more visual way to capture the structure of the graph. Now we focus on the drawings themselves. Definition A drawing of

More information

Framework for Design of Dynamic Programming Algorithms

Framework for Design of Dynamic Programming Algorithms CSE 441T/541T Advanced Algorithms September 22, 2010 Framework for Design of Dynamic Programming Algorithms Dynamic programming algorithms for combinatorial optimization generalize the strategy we studied

More information

NP-Hardness. We start by defining types of problem, and then move on to defining the polynomial-time reductions.

NP-Hardness. We start by defining types of problem, and then move on to defining the polynomial-time reductions. CS 787: Advanced Algorithms NP-Hardness Instructor: Dieter van Melkebeek We review the concept of polynomial-time reductions, define various classes of problems including NP-complete, and show that 3-SAT

More information

Graph Connectivity G G G

Graph Connectivity G G G Graph Connectivity 1 Introduction We have seen that trees are minimally connected graphs, i.e., deleting any edge of the tree gives us a disconnected graph. What makes trees so susceptible to edge deletions?

More information

CUE-10: Moderation Page 1. Comparative Usability Evaluation 10. Moderation. Observing usability test moderators

CUE-10: Moderation Page 1. Comparative Usability Evaluation 10. Moderation. Observing usability test moderators CUE-10: Moderation Page 1 Comparative Usability Evaluation 10 Moderation Observing usability test moderators Workshop: Boston, MA, USA, Wednesday 9 May 2018 CUE-10: Moderation Page 2 Call For Participation

More information

INF 4300 Classification III Anne Solberg The agenda today:

INF 4300 Classification III Anne Solberg The agenda today: INF 4300 Classification III Anne Solberg 28.10.15 The agenda today: More on estimating classifier accuracy Curse of dimensionality and simple feature selection knn-classification K-means clustering 28.10.15

More information

Greedy Algorithms 1. For large values of d, brute force search is not feasible because there are 2 d

Greedy Algorithms 1. For large values of d, brute force search is not feasible because there are 2 d Greedy Algorithms 1 Simple Knapsack Problem Greedy Algorithms form an important class of algorithmic techniques. We illustrate the idea by applying it to a simplified version of the Knapsack Problem. Informally,

More information

Heuristic (Informed) Search

Heuristic (Informed) Search Heuristic (Informed) Search (Where we try to choose smartly) R&N: Chap., Sect..1 3 1 Search Algorithm #2 SEARCH#2 1. INSERT(initial-node,Open-List) 2. Repeat: a. If empty(open-list) then return failure

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Incompatibility Dimensions and Integration of Atomic Commit Protocols

Incompatibility Dimensions and Integration of Atomic Commit Protocols The International Arab Journal of Information Technology, Vol. 5, No. 4, October 2008 381 Incompatibility Dimensions and Integration of Atomic Commit Protocols Yousef Al-Houmaily Department of Computer

More information

Preemptive Scheduling of Equal-Length Jobs in Polynomial Time

Preemptive Scheduling of Equal-Length Jobs in Polynomial Time Preemptive Scheduling of Equal-Length Jobs in Polynomial Time George B. Mertzios and Walter Unger Abstract. We study the preemptive scheduling problem of a set of n jobs with release times and equal processing

More information

Pair-Wise Multiple Comparisons (Simulation)

Pair-Wise Multiple Comparisons (Simulation) Chapter 580 Pair-Wise Multiple Comparisons (Simulation) Introduction This procedure uses simulation analyze the power and significance level of three pair-wise multiple-comparison procedures: Tukey-Kramer,

More information

On Rainbow Cycles in Edge Colored Complete Graphs. S. Akbari, O. Etesami, H. Mahini, M. Mahmoody. Abstract

On Rainbow Cycles in Edge Colored Complete Graphs. S. Akbari, O. Etesami, H. Mahini, M. Mahmoody. Abstract On Rainbow Cycles in Edge Colored Complete Graphs S. Akbari, O. Etesami, H. Mahini, M. Mahmoody Abstract In this paper we consider optimal edge colored complete graphs. We show that in any optimal edge

More information