A Unified Decision Framework for Phase I Dose-Finding Designs
Abstract
The purpose of a phase I dose-finding clinical trial is to investigate the toxicity profiles of various doses for a new drug and identify the maximum tolerated dose. Over the past three decades, various dose-finding designs have been proposed and discussed, including conventional model-based designs, new model-based designs using toxicity probability intervals, and rule-based designs. We present a simple decision framework that can generate several popular designs as special cases. We show that these designs share common elements under the framework, such as the same likelihood function, the use of loss functions, and the nature of the optimal decisions as Bayes rules. They differ mostly in the choice of the prior distributions. We present theoretical results on the decision framework and its link to specific and popular designs like mTPI, BOIN, and CRM. These results provide useful insights into the designs and their underlying assumptions, and convey information to help practitioners select an appropriate design.
1 Introduction
We construct a Bayesian decision theoretic framework for dose finding trials and show how several popular designs can be derived as special cases. Understanding many designs as special cases of one common general construction helps investigators to put a rapidly increasing number of seemingly competing alternative designs into perspective and to appreciate the relative strengths and limitations of different algorithms. A phase I clinical trial is the first stage of in-human investigation of a new drug or therapy. Phase I dose-finding designs aim to identify the maximum tolerate dose (MTD) and to provide dose recommendation for later phase trials. In the vast majority of phase I trials, a set of ascending candidate doses is tested for toxicity and the dose toxicity probability is assumed to be monotonically increasing with the dose level. Typically, the MTD is defined as the highest dose with a dose limiting toxicity (DLT) probability closest to, or not higher than a target toxicity probability . Usually ranges from and . In addition, some designs include the notion of an equivalence interval (EI) to allow for variations in the definition of the MTD. For example, one may choose to set and EI . This means that the target DLT probability of the MTD is 0.3, but doses with DLT probabilities between 0.25 and 0.35 can also be considered as the MTD. In other words, the EI allows investigators to consider doses with toxicity probabilities within the EI interval as appropriate MTD candidates.
A variety of statistical designs for phase I dose-finding trials has been discussed in the literature. A design consecutively assigns patients to recommended dose levels based on the observed DLT outcomes from previously enrolled patients. Existing designs can broadly be divided into two categories, rule-based designs and model-based designs. Among model-based designs, some use simple models and are sometimes called “model-assisted” designs. See Figure 1 for an illustration. We provide a brief introduction of the designs in Figure 1 next.

The 3+3 design [16] is rule-based and consecutively assigns patients to the current dose or the adjacent higher or lower doses based on observed DLT outcomes. For example, if the current dose at which patients are assigned is , then 3+3 assigns the next patient cohort to doses , , or itself. This is called the “up-and-down” rule. Based on the same up-and-down rule, a smarter rule-based design, i3+3, is proposed in Liu et al. [12] by accounting for higher sampling variability when the sample size at a dose is small. The i3+3 design maintains simplicity of rule-based designs, and exhibits operating characteristics comparable to more complex model-based/assisted designs.
The continual reassessment method (CRM) [15], as the first model-based design in the literature, is based on an inference model with a parsimoniously parameterized dose-response curve. During the trial, CRM continuously updates the estimated dose-response curve based on the observed DLT data throughout the trial. CRM has motivated subsequent important work on model-based designs, including the Bayesian logistic regression method (BLRM) in Neuenschwander et al. [14] and the escalation with over-dose control (EWOC) in Tighiouart and Rogatko [18], among many others, all leveraging parametric dose-response models for statistical inference.
Recently, a class of designs, collectively known as “interval-based designs” take advantage of the notion of an EI to simplify statistical modeling and decision making for phase I trials. Notable examples include the toxicity probability interval (TPI) design [9] and its two modifications, mTPI [10], mTPI-2 [5] (equivalently, Keyboard [19]), the cumulative cohort (CCD) design [7], and the Bayesian optimal interval (BOIN) design [13]. These designs use simple models such as the beta/binomial hierarchical model and assume independence across dose toxicity probabilities, without attempting to explicitly model a dose-response curve. While the independence model assumption is apparently not true because dose toxicity is assumed to be monotonically increasing, it does not affect the operating characteristics of the interval-based designs due to various reasons like the safety restrictions in practice (e.g., no skipping in dose escalation). As a result, interval-based designs show robust performance based on simple up-and-down rules, restricting dosing decisions to be no more than one dose-level change from the current dose used in the trial. In other words, a simple independent beta/binomial model coupled with a simple up-and-down rule leads to desirable simulation performance that justifies the application of these designs. Importantly, the interval-based designs often generate a decision table that greatly simplifies the trial conduct and allow investigators to easily execute the dosing decisions provided in the table.
In the past three decades, many designs (CRM, mTPI, mTPI-2, BOIN, etc.) have been successfully applied to real-world trials. It is natural to wonder which design or designs are suitable for a particular trial. Recent reviews [20] and [6] provide some assessment of these designs, mainly from the perspective of simulation performance. Occasionally, conflicting conclusions might arise from different reviews based on the criteria used for design evaluation, or from different scenarios considered in the comparison. While simulation results can provide important information on the numerical performance of the designs, we argue that a theoretical investigation would complement the simulation results. In this article, we show that using the same optimal decision rule under the proposed decision framework, one can generate several published designs as special cases. In other words, we show a theoretical connection across different designs. These designs include mTPI, mTPI-2, BOIN, and CCD. In addition, based on the proposed decision framework, we develop a new version of the CRM design, called Int-CRM, that is founded on the same model assumption with the original CRM design but a different decision rule. We show that Int-CRM achieves comparable simulation performance as the original CRM design and other interval-based designs. The general decision framework provides insight into the similarities and differences across various designs and may assist investigators to select the right design for their specific needs.
The remainder of the paper is structured as follows. In Section 2, we introduce the unified decision framework and its main components. Section 3 shows how known designs fit into this framework, including the mTPI, mTPI-2, BOIN, CCD, and Int-CRM designs. In Section 4, we conduct simulation studies to assess the operating characteristics of the designs using the i3+3 and CRM designs as benchmarks. Finally, we conclude and end the paper with a discussion in Section 5.
2 Decision Framework
2.1 Overview
We cast the problem of dose finding as an optimization in a decision problem. In particular we focus on the myopic decision problem of selecting the dose for the next patient (cohort). It is myopic because the problem does not address the global decision of stopping the trial and dose selection; instead, the problem only considers the local optimal decision of finding the next dose for future patients. The main components of the decision framework have been briefly illustrated in Guo et al. [5] for the mTPI and mTPI-2 designs. A decision problem is characterized by an action space for decisions , a probability model for all unknown quantities, and a loss function [1]. Table 1 shows a summary of these components.
Component | Notation | Notes |
Probability model | A hierarchical model with parameters and . | |
Action | Up-and-down dosing decisions, including | |
(de-escalate), (stay), and (escalate). | ||
Loss function | The loss for taking action | |
where is the true parameter. | ||
Optimal rule | Bayes’ rule that chooses the action | |
with the minimal posterior expected loss. |
2.2 General Framework
In dose-finding trials with binary DLT endpoints, the parameter of interest is a set of toxicity probabilities, at dose levels , , where is the number of dose levels, and is the toxicity probability at dose . Let denote the number of patients who experience DLTs out of patients treated at dose , and let . For all methods in the upcoming discussion the sampling model in Table 1 is a binomial distribution with parameter , i.e.,
implying a likelihood function,
For model-based designs with a dose-response curve, toxicity probabilities are modeled as a function of the dose levels . For example, a version of the CRM assumes , and a single parameter (the values are fixed and are known as the “skeleton”). The BLRM design uses with parameters .
The proposed decision framework uses a concept of probability intervals. The parameter of interest is , and the parameter space of is . Consider a set of intervals within , denoted as , which form a partition of the parameter space . That is, and . The true value of belongs to one and only one of the intervals. For example, is a partition, and if , . We introduce a latent indicator (or, for short, just ) with if , and define a hierarchical model prior and . For example, and a truncated beta distribution. Here, is an indicator function. That is, are conditionally independent with pdf
where is a p.d.f.
We consider a special partition where , , and . Therefore, and we use notations , , and to associate the intervals with corresponding up-and-down dose-finding decisions S, E, and D, respectively. We summarize the proposed decision framework below.
Likelihood
(1) |
where is the toxicity probability for dose , .
Prior
We assume are a priori independent and
For example, .
Partition
, where , and , and , , and .
Actions
The actions are the three up-and-down decisions for dose-finding, i.e.,
where denotes the action space. Here , , denote the dosing decisions “Escalation”, “Stay”, and “De-escalation,” respectively. In particular, if the last patient was assigned dose , then , , or means treating future patients at dose , , or , respectively.
Loss
We proceed with a myopic perspective, focusing on the decision for the respective next patient (cohort), and therefore specify a loss function for the next dose assignment only.
We use a 0-1 loss function,
(4) |
In words, when the action corresponds to an interval which contains the true parameter, the loss takes the value 0; otherwise, the loss equals 1. The loss function is stated in Table 2.
a = | 1 | 1 | 0 |
a = | 1 | 0 | 1 |
a = | 0 | 1 | 1 |
In other words, the loss function defines a 0-1 estimation loss for , i.e., the interval that contains .
Two more comments about the loss function and the setup of the decision problem. First, in general a loss (or, equivalently, utility) function could also be an argument of the outcome . This is relevant, for example, if instead of inference loss we focus on the patients’ preferences. However, the intention of this discussion is only to highlight common structure in existing dose finding methods, for which we only need this restricted inference loss. Another important limitation is the myopic nature of the setup. We consider the dose allocation for each patient (or patient cohort) in isolation, ignoring that dose allocation now might help later decisions. That is, we ignore the sequential nature of the problem. Again, for the upcoming exposition of common underlying structure for the considered dose finding methods we will only refer to this myopic decision problem.
Bayes’ rule
The optimal decision rule for dose is the Bayes’ rule, defined as
(5) |
which minimizes the posterior expected loss. Here, is the posterior distribution of .
In general, under a 0-1 estimation loss for a discrete parameter the Bayes rule is simply the posterior mode. The following result states this in the context of our problem. The Bayes’ rule is equivalent to the result of finding the interval with the maximal posterior probability.
Proposition 1
Denote , where , , . Suppose dose is the current dose. Let denote the event . Let . Assume . The Bayes’ rule under the 0-1 loss in Equation (4) is given by
See Appendix C for a proof.
3 Design Examples
We show how various designs fit as special cases into this framework. That is, we provide examples of the decision framework that give rise to well-known designs including mTPI, BOIN, CCD, mTPI-2, and a new version of CRM, called the Int-CRM design.
3.1 Interval-based designs
We first introduce the connection between the decision framework and the interval-based designs, mTPI, mTPI-2, BOIN and CCD. These designs share some common components under the framework, but also include some elements specific to each design.
Common components
Individual components
Prior , the specific partition and the definition of the action set .
All four interval-based designs use the binomial sampling model (1). And the designs share the same discrete uniform prior , the 0-1 loss function and the use of Bayes’ rule to select a decision. They divide the parameter space of into different intervals and use different priors. See Table 3 as a summary. We discuss details for each design next.
mTPI | mTPI-2 | BOIN/CCD | |
Actions | |||
Intervals | |||
Priors | |||
111See Theorem 1 for details. | |||
1See Theorem 1 for details. |
The mTPI design
For the mTPI design, given the equivalence interval EI, the parameter space is naturally partitioned into three intervals that correspond to the actions , as shown in Table 3. The mTPI decision is equivalent to the Bayes’ rule under the decision framework. See Corollary 1 below for a formal mathematical description.
Corollary 1.
The mTPI decision in Ji et al. [10] is given by
where UPM stands for “unit probability mass” and ; here is the length of , and is calculated based on . Let , , and . Then , the Bayes rule under
where denotes the density function of distribution and is a normalizing constant.
See Appendix C for a proof.
The mTPI-2 design
Ockham’s razor is a principle in statistical inference calling for an explanation of the facts to be no more complicated than necessary [17, 8]. In the context of model selection, the Ockham’s razor prefers parsimonious models that describe the data equally well as more complex models. In the proposed decision framework, , is equivalent to models , and choosing the value of is equivalent to a model selection problem. Bayesian model selection chooses the model with the largest posterior probability (compare Proposition 1), i.e., , and models are automatically penalized for their complexity. In other words, Bayes’ rule implements Ockham’s razor if we define model complexity as . Therefore, when the three models are , , , the “simplest” model is , since and are typically small probabilities ().
Guo et al. [5] explain mTPI-2 as aiming to blunt Ockham’s razor by redefining a (finer) partition where and . Probability intervals and have the same length as . Let The selected model under the mTPI-2 design can then be shown to be Bayes’ rule , under an action set . Corollary 2 next summarizes the results.
Corollary 2.
Under mTPI-2, , . Assume the prior on is conditionally independent and given by
Then Bayes’ rule is
See Appendix C for a proof. Corollary 2 establishes the Bayes’ rule as an action in . To see the connection to the dose-finding decision in mTPI-2, we refer to the next result.
Corollary 3.
Let
Then
In other words, if , then is the interval with the largest UPM, for . And if is below, equal to, or above the EI, the decision is , , or , respectively. This is the same as the up-and-down rule in the mTPI-2 design in Guo et al. [5]. Proof of Corollary 3 is immediate and omitted.
The BOIN design
The BOIN design [13] uses a decision rule
where , and
(10) |
In particular, , and are pre-specified values. Here, and play a similar rule as and in the mTPI and mTPI-2 designs, which defines the boundaries of an initial equivalence interval elicited from clinicians. We show that the decision rule is also a Bayes’ rule under the proposed decision framework next.
Theorem 1.
See Appendix C for a proof. By Theorem 1 the BOIN design takes the form of the Bayes’ rule under the same decision framework using the 0-1 loss. BOIN uses a point-mass prior for on three values, , , , while mTPI/mTPI-2 using truncated beta priors instead. Next, we show that the BOIN design is almost the same as the CCD design. This is easiest seen under the perspective of the proposed decision framework. The difference between the two designs are the locations of the point-mass priors.
The CCD design
The CCD design [7] compares with and , and uses the following up-and-down rule,
Corollary 4 shows that the decision of the CCD design is the same Bayes’ rule in the same framework as BOIN but with a different prior distribution.
Corollary 4.
See Appendix C for a proof.
Corollary 4 shows that BOIN and CCD are identical designs with the only difference being that BOIN uses a point-mass prior , whereas CCD uses .
The Int-CRM design
Using the same decision framework, we propose a variation of the CRM design, called Int-CRM. We assume the same parametric dose-response model as in the CRM design [15], with the probability of toxicity monotonically increasing with dose. Let denote the dose for the th patient, , and the binary indicator of DLT. The dose-response curve is assumed to be the power model as in the CRM,
where (“skeleton”) are a priori pre-specified dose toxicity probabilities. Other sensible dose-response models, such as a logit model, may be considered as well. The toxicity rates are dependent across doses through the dose-response curve and the inference is based on the parameter . The likelihood function is given by
where is the number of patients in the trial.
Following Cheung and Chappell [3], we define an interval for that is wide enough to allow for a wide range of dose-response curves. For example, set and so that and , which correspond to response curves constantly equal to 1.0 and 0.0, respectively, and allows choices in-between these extremes. Using and , we define sub-intervals for as the set of values that imply having toxicity probability closest to ,
k=1, …, T. As shown in Cheung and Chappell [3], is an interval, denoted as , , , where is implicitly defined as the solution of
Given the “skeleton” , we can obtain the numerical result of the interval boundaries ’s by solving the equation above. See Appendix B for details. Each interval consists of a set of values where dose ’s toxicity probability is the closest to among all the doses. We use these intervals ’s in our framework for Int-CRM. We propose hierarchical priors
and
where is the density function of the normal distribution .
The action space of the Int-CRM design is , corresponding to the dose for treating the next patient. Following the proposed decision framework, we use the 0-1 loss function and the Bayes’ rule that minimizes the posterior expected loss for the Int-CRM decision.
Theorem 2.
Under the 0-1 loss, i.e.,
the Int-CRM decision is the Bayes’ rule
The proof is immediate by the definition of Bayes’ rule. Below is the proposed Int-CRM dose-finding algorithm.
The Int-CRM Algorithm: Dose Finding Rules: After each cohort of patients completes the DLT follow-up period, the dose to be assigned is the , the Bayes’ rule, unless the following safety rules apply. Safety Rules: Four additional rules are applied for safety. Rule 1: Dose Exclusion: If the current dose is considered excessively toxic, i.e., (see below about evaluating this probability), where the threshold is close to 1, say 0.95, the current and all higher doses will be excluded in the remainder of the trial to avoid assigning any patients to those doses. Rule 2: Early Stopping: If the current dose is the lowest dose (first dose) and is considered excessively toxic, i.e., , where the threshold is close to 1, say 0.95, stop the trial early and declare no MTD. To evaluate in Rules 1 and 2 we use a distribution with . Rule 3: No-Skipping Escalation: If the dose-finding rule recommends escalation, such escalation shall not increase the dose by more than one level. Dose-escalation cannot increase by more than one level. That is, suppose the current dose is . If the next recommended dose is such that , escalate to dose instead. Rule 4: Coherence: No escalation is permitted if the empirical rate of DLT for the most recent cohort is higher than , according to the coherence principle [2]. Trial Termination: The trial proceeds unless any of the following stopping criteria is met: • If the pre-specified maximum total sample size is reached. • Rule 2 above. MTD Selection: Once all the enrolled patients complete the DLT observation and the trial is not stopped early, the last dose level is selected as the MTD.
4 Simulation Studies
4.1 Simulation Settings
We set up simulation studies to evaluate the operating characteristics of the different designs that we have shown to be special cases of the proposed general framework, including the mTPI, mTPI-2, BOIN, CCD and the Int-CRM designs. We show how the common underlying decision framework leads to very similar performances of the designs under consideration. We also compare to the i3+3 design and the original CRM design as benchmarks.
Fixed Scenarios
We use a total of 15 scenarios, with a set of or doses. Assume the target toxicity probability ( is the notation in BOIN), and maximum sample size of 30. For all designs we apply the same safety rules as in the mTPI, mTPI-2, and Int-CRM designs. See Appendix A for details. For interval-based designs we use EI , . For the Int-CRM and CRM design, the skeleton is generated using the approach proposed in Lee and Cheung [11], which selects the skeleton based on indifference intervals for the MTD. Also, we set the half width of the indifference intervals, . We apply the coherence principle [2], avoiding immediate escalation after toxic outcomes.
For the BOIN design, we set , . This is equivalent to setting , and . By Theorem 1, these values for and make the BOIN decision identical to the CCD design, leading to same operating characteristics of the two designs.
Random Scenarios
We generate additional 1,000 random scenarios to further evaluate the designs. Scenarios are generated based on the pseudo-uniform algorithm in Clertant et al. [4]. Figure 2 plots the first 20 scenarios. Other settings of the designs are the same as the fixed scenarios, such as and EI, for the BOIN design, and for the Int-CRM and CRM designs.

4.2 Simulation Results
We evaluate the performance of the phase I designs using several metrics, based on their ability to identify the MTD and the safety in dose selection and patient allocation. Table 4 summarizes the means and standard deviations of key performance metrics for the simulation with 1,000 scenarios. All designs show remarkable similarity with the largest mean difference across designs only about 0.02. This highlights the underlying connection of these designs and echoes our findings based on the unified decision framework that can generate most designs as special cases. Figures 3, 4 and 5 in Appendix D present the simulation results of the 15 fixed scenarios.
In general, the five designs tested in the simulation studies exhibit remarkably similar performances. Specifically, they show comparable probabilities (across repeated simulation) of allocating patients to the true MTD, and similar risk of allocating patients to overly toxic doses. The BOIN/CCD and Int-CRM designs yield slightly higher PCS (probability of correct selection of MTD) in some cases, such as scenarios 1 and 4 in Figure 4. However, they also report a higher risk in selecting doses beyond the true MTD. For example, in scenarios 2, 3, and 5 in Figure 4 in Appendix D, the probabilities of over-dosing selection under BOIN, CCD and Int-CRM are higher compared to the other designs.
Metrics | mTPI | mTPI-2 | BOIN/CCD | Int-CRM | CRM | i3+3 |
Correct Sel. of MTD | ||||||
Sel. over MTD | ||||||
Pat. at MTD | ||||||
Pat. over MTD | ||||||
Tox. | ||||||
None Sel. |
5 Discussion
We have developed a general decision framework for phase I dose-finding designs. We have shown that interval-based designs, like mTPI, mTPI-2, BOIN, CCD, and the model-based design Int-CRM fit into this unified framework.
All designs use the same 0-1 loss function, and all interval designs assume a binomial likelihood function. The prior construction for some designs involves the notion of candidate models. Candidate models are specified assuming different toxic profiles for the doses. Given the model, the mTPI, mTPI-2 and Int-CRM assume continuous prior distributions, using beta or normal distributions truncated to the restricted parameter space implied by the given model. The BOIN and CCD designs use a different approach with a discrete prior on , supported at three distinct values. Choosing those atoms is challenging and may be difficult to interpret. However, as shown in many simulations conducted and published in literature, the BOIN and CCD designs perform very well in Phase I trials with relatively small sample size.
Additionally, different loss functions can be considered in the proposed framework penalize undesirable actions and outcomes. For examples, the loss for mistakenly making an escalation decision may be larger than for a wrong de-escalation. However, such loses usually lead to more complex and less interpretable decision rules.
It is demonstrated that the designs in this paper perform similarly with comparable reliability and safety. The i3+3 rule-based design is not a part of this framework, but also generates similar operating characteristics, comparable with the other designs. The i3+3 design shares a practically important feature with interval based designs. One can pre-tabulate decision tables, which is a critical feature for the implementation in actual trials. Clinicians can choose a desirable design for phase I clinical trial based on their preference, including the model-based design CRM, the interval designs, mTPI, mTPI-2, BOIN and CCD, and the rule-based design i3+3.
References
- Berger [2013] Berger, J. O. (2013). Statistical decision theory and Bayesian analysis. Springer Science & Business Media.
- Cheung [2011] Cheung, Y. K. (2011). Dose finding by the continual reassessment method. CRC Press.
- Cheung and Chappell [2002] Cheung, Y. K. and Chappell, R. (2002). A simple technique to evaluate model sensitivity in the continual reassessment method. Biometrics 58, 671–674.
- Clertant et al. [2017] Clertant, M., O’Quigley, J., et al. (2017). Semiparametric dose finding methods. Journal of the Royal Statistical Society Series B 79, 1487–1508.
- Guo et al. [2017] Guo, W., Wang, S.-J., Yang, S., Lynn, H., and Ji, Y. (2017). A Bayesian interval dose-finding design addressing Ockham’s razor: mTPI-2. Contemporary clinical trials 58, 23–33.
- Horton et al. [2017] Horton, B. J., Wages, N. A., and Conaway, M. R. (2017). Performance of toxicity probability interval based designs in contrast to the continual reassessment method. Statistics in medicine 36, 291–300.
- Ivanova et al. [2007] Ivanova, A., Flournoy, N., and Chung, Y. (2007). Cumulative cohort design for dose-finding. Journal of Statistical Planning and Inference 137, 2316–2327.
- Jefferys and Berger [1992] Jefferys, W. H. and Berger, J. O. (1992). Ockham’s razor and Bayesian analysis. American Scientist 80, 64–72.
- Ji et al. [2007] Ji, Y., Li, Y., and Nebiyou Bekele, B. (2007). Dose-finding in phase I clinical trials based on toxicity probability intervals. Clinical trials 4, 235–244.
- Ji et al. [2010] Ji, Y., Liu, P., Li, Y., and Nebiyou Bekele, B. (2010). A modified toxicity probability interval method for dose-finding trials. Clinical trials 7, 653–663.
- Lee and Cheung [2011] Lee, S. M. and Cheung, Y. K. (2011). Calibration of prior variance in the Bayesian continual reassessment method. Statistics in medicine 30, 2081–2089.
- Liu et al. [2020] Liu, M., Wang, S.-J., and Ji, Y. (2020). The i3+ 3 design for phase I clinical trials. Journal of Biopharmaceutical Statistics 30, 294–304.
- Liu and Yuan [2015] Liu, S. and Yuan, Y. (2015). Bayesian optimal interval designs for phase I clinical trials. Journal of the Royal Statistical Society: Series C: Applied Statistics pages 507–523.
- Neuenschwander et al. [2008] Neuenschwander, B., Branson, M., and Gsponer, T. (2008). Critical aspects of the Bayesian approach to phase I cancer trials. Statistics in Medicine 27, 2420–2439.
- O’Quigley et al. [1990] O’Quigley, J., Pepe, M., and Fisher, L. (1990). Continual reassessment method: a practical design for phase 1 clinical trials in cancer. Biometrics pages 33–48.
- Storer [1989] Storer, B. E. (1989). Design and analysis of phase I clinical trials. Biometrics pages 925–937.
- Thorburn [1918] Thorburn, W. M. (1918). The myth of Occam’s razor. Mind 27, 345–353.
- Tighiouart and Rogatko [2010] Tighiouart, M. and Rogatko, A. (2010). Dose finding with escalation with overdose control (EWOC) in cancer clinical trials. Statistical Science 25, 217–226.
- Yan et al. [2017] Yan, F., Mandrekar, S. J., and Yuan, Y. (2017). Keyboard: a novel Bayesian toxicity probability interval design for phase I clinical trials. Clinical Cancer Research 23, 3994–4003.
- Yuan et al. [2016] Yuan, Y., Hess, K. R., Hilsenbeck, S. G., and Gilbert, M. R. (2016). Bayesian optimal interval design: a simple and well-performing design for phase I oncology trials. Clinical Cancer Research 22, 4291–4301.
Appendix A
Safety rules: Following the mTPI and mTPI-2 designs [10, 5], two safety rules are added, as ethical constraints to avoid excessive toxicity, to all the designs in the simulation study when needed.
- Rule 1: Dose Exclusion.
-
If the current dose is considered excessively toxic, i.e., , where the threshold is close to 1, say 0.95, then the current and all higher doses are excluded and never used again for the remainder of the trial.
- Rule 2: Early Stopping.
-
If the current dose is the lowest dose (first dose) and is considered excessively toxic according to Rule 1, then stop the trial for safety.
In safety Rules 1 and 2, is a function of the cumulative beta distribution , and is used by default.
Appendix B
Intervals in Int-CRM design: The intervals in the Int-CRM design are,
which have the form , , . The boundary of the intervals satisfies the equation
Under the model of Int-CRM,
Therefore, given the skeleton and , we can obtain the numerical solution of the interval boundary by searching over a sequence of .
Appendix C
Proof of Proposition 1
Proof:
By definition,
which proves the first equation in Proposition 1. In addition, we have
(14) |
Since
equation 14 becomes
The penultimate equation is true since
Proof of Corollary 1
Proof:
Recall the action space . Based on Equation (2.2) in Proposition 1, the Bayes’ rule is
Proof of Corollary 2
Proof:
Based on Equation (2.2), the Bayes’ rule is equal to
Proof of Corollary 4
Proof:
Proof of Theorem 1
Proof:
According to Proposition 1, under the proposed decision framework, the Bayes’ rule is
(17) | |||||
Let then equation 17 becomes
When , and is monotonically decreasing with . When ,
Note that the term . Then, if ; and if . Therefore, the function first increases and then decreases with . It only has one mode which is a maximum. In summary, either monotonically decreases or first increases and decreases with .
When ,
And because , we have Therefore, Hence,
and by taking exponentiation on both sides, we have
which leads to
i.e., Since either monotonically decreases or first increases and then decreases with , and because and , we have Therefore, when ,
When ,
Since ,
and
which leads to
i.e., Again, either monotonically decreases or first increases then decreases with . In either cases, we have Therefore, when ,
When , we have
and
i.e., and Therefore, when ,
Appendix D
Operating characteristics for the 15 fixed scenarios in the simulation study. The yellow bar highlights the true MTD. Prob of Select at/over MTD refers to the probability (over repeat simulation) of selecting the true MTD and a dose above the true MTD, respectively. Prob of Pat. at/over MTD refers to the relative frequency of patients assigned at or above the true MTD. Prob of Toxicity refers to the frequency of patients experienced DLT in all simulated trials. Prob of no selection refers to the probability of failing to recommend any dose.