This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Unified Decision Framework for Phase I Dose-Finding Designs

Yunshan Duan Department of Statistics and Data Science, University of Texas at Austin Shijie Yuan Laiya Consulting, Inc., Shanghai, China Yuan Ji Department of Public Health Sciences, University of Chicago Corresponding author: Yuan Ji, [email protected] Peter Mueller Department of Statistics and Data Science, University of Texas at Austin
Abstract

The purpose of a phase I dose-finding clinical trial is to investigate the toxicity profiles of various doses for a new drug and identify the maximum tolerated dose. Over the past three decades, various dose-finding designs have been proposed and discussed, including conventional model-based designs, new model-based designs using toxicity probability intervals, and rule-based designs. We present a simple decision framework that can generate several popular designs as special cases. We show that these designs share common elements under the framework, such as the same likelihood function, the use of loss functions, and the nature of the optimal decisions as Bayes rules. They differ mostly in the choice of the prior distributions. We present theoretical results on the decision framework and its link to specific and popular designs like mTPI, BOIN, and CRM. These results provide useful insights into the designs and their underlying assumptions, and convey information to help practitioners select an appropriate design.

1 Introduction

We construct a Bayesian decision theoretic framework for dose finding trials and show how several popular designs can be derived as special cases. Understanding many designs as special cases of one common general construction helps investigators to put a rapidly increasing number of seemingly competing alternative designs into perspective and to appreciate the relative strengths and limitations of different algorithms. A phase I clinical trial is the first stage of in-human investigation of a new drug or therapy. Phase I dose-finding designs aim to identify the maximum tolerate dose (MTD) and to provide dose recommendation for later phase trials. In the vast majority of phase I trials, a set of ascending candidate doses is tested for toxicity and the dose toxicity probability is assumed to be monotonically increasing with the dose level. Typically, the MTD is defined as the highest dose with a dose limiting toxicity (DLT) probability closest to, or not higher than a target toxicity probability pTp_{T}. Usually pTp_{T} ranges from 0.170.17 and 0.30.3. In addition, some designs include the notion of an equivalence interval (EI) to allow for variations in the definition of the MTD. For example, one may choose to set pT=0.3p_{T}=0.3 and EI =(pTϵ1,pT+ϵ2)=(0.25,0.35)=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2})=(0.25,0.35). This means that the target DLT probability of the MTD is 0.3, but doses with DLT probabilities between 0.25 and 0.35 can also be considered as the MTD. In other words, the EI allows investigators to consider doses with toxicity probabilities within the EI interval as appropriate MTD candidates.

A variety of statistical designs for phase I dose-finding trials has been discussed in the literature. A design consecutively assigns patients to recommended dose levels based on the observed DLT outcomes from previously enrolled patients. Existing designs can broadly be divided into two categories, rule-based designs and model-based designs. Among model-based designs, some use simple models and are sometimes called “model-assisted” designs. See Figure 1 for an illustration. We provide a brief introduction of the designs in Figure 1 next.

Refer to caption
Figure 1: Illustration of some Phase I designs. Dotted lines connect designs across different categories, and solid lines connect designs within the same category. Arrows represents the sequence of original publication dates.

The 3+3 design [16] is rule-based and consecutively assigns patients to the current dose or the adjacent higher or lower doses based on observed DLT outcomes. For example, if the current dose at which patients are assigned is dd, then 3+3 assigns the next patient cohort to doses (d+1)(d+1), (d1)(d-1), or dd itself. This is called the “up-and-down” rule. Based on the same up-and-down rule, a smarter rule-based design, i3+3, is proposed in Liu et al. [12] by accounting for higher sampling variability when the sample size at a dose is small. The i3+3 design maintains simplicity of rule-based designs, and exhibits operating characteristics comparable to more complex model-based/assisted designs.

The continual reassessment method (CRM) [15], as the first model-based design in the literature, is based on an inference model with a parsimoniously parameterized dose-response curve. During the trial, CRM continuously updates the estimated dose-response curve based on the observed DLT data throughout the trial. CRM has motivated subsequent important work on model-based designs, including the Bayesian logistic regression method (BLRM) in Neuenschwander et al. [14] and the escalation with over-dose control (EWOC) in Tighiouart and Rogatko [18], among many others, all leveraging parametric dose-response models for statistical inference.

Recently, a class of designs, collectively known as “interval-based designs” take advantage of the notion of an EI to simplify statistical modeling and decision making for phase I trials. Notable examples include the toxicity probability interval (TPI) design [9] and its two modifications, mTPI [10], mTPI-2 [5] (equivalently, Keyboard [19]), the cumulative cohort (CCD) design [7], and the Bayesian optimal interval (BOIN) design [13]. These designs use simple models such as the beta/binomial hierarchical model and assume independence across dose toxicity probabilities, without attempting to explicitly model a dose-response curve. While the independence model assumption is apparently not true because dose toxicity is assumed to be monotonically increasing, it does not affect the operating characteristics of the interval-based designs due to various reasons like the safety restrictions in practice (e.g., no skipping in dose escalation). As a result, interval-based designs show robust performance based on simple up-and-down rules, restricting dosing decisions to be no more than one dose-level change from the current dose used in the trial. In other words, a simple independent beta/binomial model coupled with a simple up-and-down rule leads to desirable simulation performance that justifies the application of these designs. Importantly, the interval-based designs often generate a decision table that greatly simplifies the trial conduct and allow investigators to easily execute the dosing decisions provided in the table.

In the past three decades, many designs (CRM, mTPI, mTPI-2, BOIN, etc.) have been successfully applied to real-world trials. It is natural to wonder which design or designs are suitable for a particular trial. Recent reviews [20] and [6] provide some assessment of these designs, mainly from the perspective of simulation performance. Occasionally, conflicting conclusions might arise from different reviews based on the criteria used for design evaluation, or from different scenarios considered in the comparison. While simulation results can provide important information on the numerical performance of the designs, we argue that a theoretical investigation would complement the simulation results. In this article, we show that using the same optimal decision rule under the proposed decision framework, one can generate several published designs as special cases. In other words, we show a theoretical connection across different designs. These designs include mTPI, mTPI-2, BOIN, and CCD. In addition, based on the proposed decision framework, we develop a new version of the CRM design, called Int-CRM, that is founded on the same model assumption with the original CRM design but a different decision rule. We show that Int-CRM achieves comparable simulation performance as the original CRM design and other interval-based designs. The general decision framework provides insight into the similarities and differences across various designs and may assist investigators to select the right design for their specific needs.

The remainder of the paper is structured as follows. In Section 2, we introduce the unified decision framework and its main components. Section 3 shows how known designs fit into this framework, including the mTPI, mTPI-2, BOIN, CCD, and Int-CRM designs. In Section 4, we conduct simulation studies to assess the operating characteristics of the designs using the i3+3 and CRM designs as benchmarks. Finally, we conclude and end the paper with a discussion in Section 5.

2 Decision Framework

2.1 Overview

We cast the problem of dose finding as an optimization in a decision problem. In particular we focus on the myopic decision problem of selecting the dose for the next patient (cohort). It is myopic because the problem does not address the global decision of stopping the trial and dose selection; instead, the problem only considers the local optimal decision of finding the next dose for future patients. The main components of the decision framework have been briefly illustrated in Guo et al. [5] for the mTPI and mTPI-2 designs. A decision problem is characterized by an action space for decisions aa, a probability model for all unknown quantities, and a loss function [1]. Table 1 shows a summary of these components.

Table 1: Components of the proposed decision framework.
Component Notation Notes
Probability model f(𝒚𝜽)π(𝜽m)π(m)f(\bm{y}\mid\bm{\theta})\pi(\bm{\theta}\mid m)\pi(m) A hierarchical model with parameters 𝜽\bm{\theta} and mm.
Action aa Up-and-down dosing decisions, including
DD (de-escalate), SS (stay), and EE (escalate).
Loss function (a,𝜽)\ell(a,\bm{\theta}) The loss for taking action aa
where θ\theta is the true parameter.
Optimal rule =argmina(a,𝜽)p(𝜽𝒚)𝑑𝜽\mathcal{R}=\arg\min_{a}\int\ell(a,\bm{\theta})p(\bm{\theta}\mid\bm{y})d\bm{\theta} Bayes’ rule that chooses the action
with the minimal posterior expected loss.

2.2 General Framework

In dose-finding trials with binary DLT endpoints, the parameter of interest is a set of toxicity probabilities, 𝜽=(p1,,pT)\bm{\theta}=(p_{1},...,p_{T}) at dose levels xdx_{d}, d=1,,Td=1,\ldots,T, where TT is the number of dose levels, and pdp_{d} is the toxicity probability at dose dd. Let ydy_{d} denote the number of patients who experience DLTs out of ndn_{d} patients treated at dose dd, and let 𝒚=(y1,,yT)\bm{y}=(y_{1},...,y_{T}). For all methods in the upcoming discussion the sampling model f(𝒚𝜽)f(\bm{y}\mid\bm{\theta}) in Table 1 is a binomial distribution with parameter pdp_{d}, i.e.,

ydpdBin(nd,pd),d=1,,T,y_{d}\mid p_{d}\sim\mbox{Bin}(n_{d},p_{d}),\;\;\;d=1,...,T,

implying a likelihood function,

f(𝒚𝜽)d=1Tpdyd(1pd)ndyd.f(\bm{y}\mid\bm{\theta})\propto\prod_{d=1}^{T}p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}.

For model-based designs with a dose-response curve, toxicity probabilities are modeled as a function of the dose levels xdx_{d}. For example, a version of the CRM assumes pd=qdexp(α)p_{d}=q_{d}^{exp(\alpha)}, and a single parameter θ=α\theta=\alpha (the values qdq_{d} are fixed and are known as the “skeleton”). The BLRM design uses pd=logit1(α+βxd)p_{d}=logit^{-1}(\alpha+\beta x_{d}) with parameters 𝜽=(α,β)\bm{\theta}=(\alpha,\beta).

The proposed decision framework uses a concept of probability intervals. The parameter of interest is pdp_{d}, and the parameter space of pdp_{d} is I=[0,1]I=[0,1]. Consider a set of intervals within II, denoted as Ω={Ik,k=1,,K}\Omega=\{I_{k},\;k=1,...,K\}, which form a partition of the parameter space II. That is, k=1KIk=I\bigcup_{k=1}^{K}I_{k}=I and IkIk=,kkI_{k}\cap I_{k^{\prime}}=\emptyset,\;\;k\neq k^{\prime}. The true value of pdp_{d} belongs to one and only one of the intervals. For example, Ω={I1=[0,0.5],I2=(0.5,1]}\Omega=\{I_{1}=[0,0.5],I_{2}=(0.5,1]\} is a partition, and if pd=0.3p_{d}=0.3, pdI1p_{d}\in I_{1}. We introduce a latent indicator mdm_{d} (or, for short, just mm) with md=km_{d}=k if pdIkp_{d}\in I_{k}, and define a hierarchical model prior π(m)\pi(m) and π(pdm)\pi(p_{d}\mid m). For example, π(m=k)=1K,k=1,,K,\pi(m=k)=\frac{1}{K},\;\;\;k=1,...,K, and π(𝜽m=k)d=1TBe(α,β)δ(pdIk),\pi(\bm{\theta}\mid m=k)\propto\prod_{d=1}^{T}\mbox{Be}(\alpha,\beta)\delta(p_{d}\in I_{k}), a truncated beta distribution. Here, δ()\delta(\cdot) is an indicator function. That is, pdp_{d} are conditionally independent with pdf

p(pdm=k)=beta(pd;α,β)δ(pdIk)Ikbeta(pd;α,β)𝑑pd,p(p_{d}\mid m=k)=\frac{beta(p_{d};\alpha,\beta)\delta(p_{d}\in I_{k})}{\int_{I_{k}}beta(p_{d};\alpha,\beta)dp_{d}},

where beta(pd;α,β)=Γ(α+β)Γ(α)Γ(β)pdα1(1pd)β1,α>0,β>0beta(p_{d};\alpha,\beta)=\frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}p_{d}^{\alpha-1}(1-p_{d})^{\beta-1},\alpha>0,\beta>0 is a Be(α,β)\mbox{Be}(\alpha,\beta) p.d.f.

We consider a special partition Ω={I1,I2,I3}\Omega=\{I_{1},I_{2},I_{3}\} where I2=IS=EI=(pTϵ1,pT+ϵ2)I_{2}=I_{S}=EI=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), I1=IE=[0,pTϵ1]I_{1}=I_{E}=[0,p_{T}-\epsilon_{1}], and I3=ID=[pT+ϵ2,1]I_{3}=I_{D}=[p_{T}+\epsilon_{2},1]. Therefore, K=3K=3 and we use notations ISI_{S}, IEI_{E}, and IDI_{D} to associate the intervals with corresponding up-and-down dose-finding decisions S, E, and D, respectively. We summarize the proposed decision framework below.

Likelihood

f(𝒚𝜽)d=1Dpdyd(1pd)ndyd,f(\bm{y}\mid\bm{\theta})\propto\prod_{d=1}^{D}p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}, (1)

where pdp_{d} is the toxicity probability for dose dd, d=1,,Dd=1,\dots,D.

Prior

We assume pdp_{d} are a priori independent and

π(pdm=k)\displaystyle\pi(p_{d}\mid m=k) \displaystyle\propto g(pd)δ(pdIk),k=1,,K,\displaystyle g(p_{d})\delta(p_{d}\in I_{k}),\;\;\;k=1,...,K,
π(m=k)\displaystyle\pi(m=k) =\displaystyle= 1K,k=1,,K.\displaystyle\frac{1}{K},\;\;\;k=1,...,K.

For example, g(pd)=beta(pd;α,β)g(p_{d})=beta(p_{d};\alpha,\beta).

Partition

Ω={IE(I1),IS(I2),ID(I3)}\Omega=\{I_{E}(I_{1}),\;I_{S}(I_{2}),\;I_{D}(I_{3})\}, where IE=[0,pTϵ1],IS=(pTϵ1,pT+ϵ2)I_{E}=[0,p_{T}-\epsilon_{1}],I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), and ID=[pT+ϵ2,1]I_{D}=[p_{T}+\epsilon_{2},1], and I1=IEI_{1}=I_{E}, I2=ISI_{2}=I_{S}, and I3=IDI_{3}=I_{D}.

Actions

The actions are the three up-and-down decisions for dose-finding, i.e.,

a𝒜={E,S,D},a\in\mathcal{A}=\{E,S,D\},

where 𝒜\mathcal{A} denotes the action space. Here EE, SS, DD denote the dosing decisions “Escalation”, “Stay”, and “De-escalation,” respectively. In particular, if the last patient was assigned dose dd, then EE, SS, or DD means treating future patients at dose (d+1)(d+1), dd, or (d1)(d-1), respectively.

Loss

We proceed with a myopic perspective, focusing on the decision for the respective next patient (cohort), and therefore specify a loss function for the next dose assignment aa only.

We use a 0-1 loss function,

(a,pd)={1,pdIa0,pdIa,a𝒜={E,S,D}.\displaystyle\ell(a,p_{d})=\left\{\begin{array}[]{ll}1,&p_{d}\notin I_{a}\\ 0,&p_{d}\in I_{a}\end{array}\right.,\;\;\;\;a\in\mathcal{A}=\{E,S,D\}. (4)

In words, when the action corresponds to an interval which contains the true parameter, the loss takes the value 0; otherwise, the loss equals 1. The loss function (a,pd)\ell(a,p_{d}) is stated in Table 2.

Table 2: The 0-1 loss function (a,θ)\ell(a,\theta).
pdp_{d}\in
(a,pd)\ell(a,p_{d}) [0,pTϵ1][0,p_{T}-\epsilon_{1}] (pTϵ1,pT+ϵ2)(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}) [pT+ϵ2,1][p_{T}+\epsilon_{2},1]
a = DD 1 1 0
a = SS 1 0 1
a = EE 0 1 1

In other words, the loss function (a,pd)\ell(a,p_{d}) defines a 0-1 estimation loss for mm, i.e., the interval that contains pdp_{d}.

Two more comments about the loss function and the setup of the decision problem. First, in general a loss (or, equivalently, utility) function could also be an argument of the outcome ydy_{d}. This is relevant, for example, if instead of inference loss we focus on the patients’ preferences. However, the intention of this discussion is only to highlight common structure in existing dose finding methods, for which we only need this restricted inference loss. Another important limitation is the myopic nature of the setup. We consider the dose allocation for each patient (or patient cohort) in isolation, ignoring that dose allocation now might help later decisions. That is, we ignore the sequential nature of the problem. Again, for the upcoming exposition of common underlying structure for the considered dose finding methods we will only refer to this myopic decision problem.

Bayes’ rule

The optimal decision rule for dose dd is the Bayes’ rule, defined as

d=argmina𝒜(a,pd)p(pd𝒚)𝑑pd,\displaystyle\mathcal{R}_{d}=\arg\min_{a\in\mathcal{A}}\int\ell(a,p_{d})p(p_{d}\mid\bm{y})dp_{d}, (5)

which minimizes the posterior expected loss. Here, p(pd𝒚)p(p_{d}\mid\bm{y}) is the posterior distribution of pdp_{d}.

In general, under a 0-1 estimation loss for a discrete parameter the Bayes rule is simply the posterior mode. The following result states this in the context of our problem. The Bayes’ rule is equivalent to the result of finding the interval with the maximal posterior probability.

Proposition 1

Denote Ω={I1,I2,I3}={IE,IS,ID}\Omega=\{I_{1},I_{2},I_{3}\}=\{I_{E},I_{S},I_{D}\}, where I1=IEI_{1}=I_{E}, I2=ISI_{2}=I_{S}, I3=IDI_{3}=I_{D}. Suppose dose dd is the current dose. Let {m=k}\{m=k\} denote the event {pdIk},k{1,2,3}\{p_{d}\in I_{k}\},\;k\in\{1,2,3\}. Let 𝒜={E,S,D}\mathcal{A}=\{E,S,D\}. Assume π(m=k)=13,k{1,2,3}\pi(m=k)=\frac{1}{3},\;k\in\{1,2,3\}. The Bayes’ rule under the 0-1 loss in Equation (4) is given by

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxa𝒜Pr(pdIa𝒚)\displaystyle\arg\max_{a\in\mathcal{A}}Pr(p_{d}\in I_{a}\mid\bm{y})
=\displaystyle= argmaxk{1,2,3}Pr(m=k𝒚)\displaystyle\arg\max_{k\in\{1,2,3\}}Pr(m=k\mid\bm{y})

See Appendix C for a proof.

3 Design Examples

We show how various designs fit as special cases into this framework. That is, we provide examples of the decision framework that give rise to well-known designs including mTPI, BOIN, CCD, mTPI-2, and a new version of CRM, called the Int-CRM design.

3.1 Interval-based designs

We first introduce the connection between the decision framework and the interval-based designs, mTPI, mTPI-2, BOIN and CCD. These designs share some common components under the framework, but also include some elements specific to each design.

Common components

Likelihood (1), prior π(m)\pi(m), loss function (4), and the nature of the defined dose allocation as Bayes’ rule.

Individual components

Prior π(pdm)\pi(p_{d}\mid m), the specific partition Ω={Ik,k=1,,K},\Omega=\{I_{k},\;k=1,...,K\}, and the definition of the action set 𝒜\mathcal{A}.

All four interval-based designs use the binomial sampling model (1). And the designs share the same discrete uniform prior π(m)\pi(m), the 0-1 loss function and the use of Bayes’ rule to select a decision. They divide the [0,1][0,1] parameter space of pdp_{d} into different intervals and use different priors. See Table 3 as a summary. We discuss details for each design next.

Table 3: Individual components of the proposed decision framework for some interval-based designs.
mTPI mTPI-2 BOIN/CCD
Actions 𝒜={E,S,D}\mathcal{A}=\{E,S,D\} 𝒜={1,,K}\mathcal{A}=\{1,...,K\} 𝒜={E,S,D}\mathcal{A}=\{E,S,D\}
IE=[0,pTϵ1]I_{E}=[0,p_{T}-\epsilon_{1}] IE=IE,1IE,K1I_{E}=I_{E,1}\cup\cdots\cup I_{E,K_{1}} IE=[0,ϕE]I_{E}=[0,\phi_{E}]
Intervals IS=(pTϵ1,pT+ϵ2)I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}) IS=(pTϵ1,pT+ϵ2)I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}) IS=(ϕE,ϕD)I_{S}=(\phi_{E},\phi_{D})
ID=[pT+ϵ2,1]I_{D}=[p_{T}+\epsilon_{2},1] ID=ID,1ID,K2I_{D}=I_{D,1}\cup\cdots\cup I_{D,K_{2}} ID=[ϕD,1]I_{D}=[\phi_{D},1]
Priors π(pdm=k)\pi(p_{d}\mid m=k)\propto π(pdm=k)\pi(p_{d}\mid m=k)\propto π(pdm=k)=\pi(p_{d}\mid m=k)=
Be(1,1)δ(pdIk)Be(1,1)\delta(p_{d}\in I_{k}) Be(1,1)δ(pdIk)Be(1,1)\delta(p_{d}\in I_{k}) δ(pd=ϕk)\delta(p_{d}=\phi_{k})111See Theorem 1 for details.
1See Theorem 1 for details.

The mTPI design

For the mTPI design, given the equivalence interval EI=(pTϵ1,pT+ϵ2)=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), the [0,1][0,1] parameter space is naturally partitioned into three intervals Ω={I1=IE=[0,pTϵ1],I2=IS=(pTϵ1,pT+ϵ2),I3=ID=[pT+ϵ2,1]}\Omega=\{I_{1}=I_{E}=[0,p_{T}-\epsilon_{1}],I_{2}=I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}),I_{3}=I_{D}=[p_{T}+\epsilon_{2},1]\} that correspond to the actions 𝒜={E,S,D}\mathcal{A}=\{E,S,D\}, as shown in Table 3. The mTPI decision is equivalent to the Bayes’ rule d\mathcal{R}_{d} under the decision framework. See Corollary 1 below for a formal mathematical description.


Corollary 1.

The mTPI decision in Ji et al. [10] is given by

RmTPI=argmaxa{E,S,D}UPM(Ia),R_{mTPI}=\arg\max_{a\in\{E,S,D\}}\mbox{UPM}(I_{a}),

where UPM stands for “unit probability mass” and UPM(Ia)=Pr(pdIa)/Ia\mbox{UPM}(I_{a})=Pr^{*}(p_{d}\in I_{a})/||I_{a}||; here Ia||I_{a}|| is the length of IaI_{a}, and Pr(pdIa)= B(yd+1,ndyd+1)pdyd(1pd)ndydδ(pdIa)𝑑pdPr^{*}(p_{d}\in I_{a})= \int B(y_{d}+1,n_{d}-y_{d}+1)\cdot p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\cdot\delta(p_{d}\in I_{a})dp_{d} is calculated based on pdBe(yd+1,ndyd+1)p_{d}\sim\mbox{Be}(y_{d}+1,n_{d}-y_{d}+1). Let I1=IE=[0,pTϵ1]I_{1}=I_{E}=[0,p_{T}-\epsilon_{1}], I2=IS=(pTϵ1,pT+ϵ2)I_{2}=I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), and I3=ID=[pT+ϵ2,1]I_{3}=I_{D}=[p_{T}+\epsilon_{2},1]. Then RmTPI=dR_{mTPI}=\mathcal{R}_{d}, the Bayes rule under

π(pdm=k)=Ckbeta(pd;1,1)δ(pdIk),\pi(p_{d}\mid m=k)=C_{k}\cdot beta(p_{d};1,1)\delta(p_{d}\in I_{k}),

where beta(;1,1)beta(\cdot;1,1) denotes the density function of Be(1,1)\mbox{Be}(1,1) distribution and Ck=1Ikbeta(p;1,1)𝑑pC_{k}=\frac{1}{\int_{I_{k}}beta(p;1,1)dp} is a normalizing constant.

See Appendix C for a proof.


The mTPI-2 design

Ockham’s razor is a principle in statistical inference calling for an explanation of the facts to be no more complicated than necessary [17, 8]. In the context of model selection, the Ockham’s razor prefers parsimonious models that describe the data equally well as more complex models. In the proposed decision framework, {m=k},k=1,,K\{m=k\},\;k=1,...,K, is equivalent to KK models {Mk:m=k}\{M_{k}:m=k\}, and choosing the value of mm is equivalent to a model selection problem. Bayesian model selection chooses the model with the largest posterior probability (compare Proposition 1), i.e., Pr(m=ky)Pr(m=k\mid y), and models are automatically penalized for their complexity. In other words, Bayes’ rule d=argmaxk{1,2,3}Pr(m=ky)\mathcal{R}_{d}=\arg\max_{k\in\{1,2,3\}}Pr(m=k\mid y) implements Ockham’s razor if we define model complexity as Ik||I_{k}||. Therefore, when the three models are I1=IE=[0,pTϵ1]I_{1}=I_{E}=[0,p_{T}-\epsilon_{1}], I2=IS=(pTϵ1,pT+ϵ2)I_{2}=I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), I3=ID=[pT+ϵ2]I_{3}=I_{D}=[p_{T}+\epsilon_{2}], the “simplest” model is I2=ISI_{2}=I_{S}, since ϵ1\epsilon_{1} and ϵ2\epsilon_{2} are typically small probabilities (0.05\leq 0.05).

Guo et al. [5] explain mTPI-2 as aiming to blunt Ockham’s razor by redefining a (finer) partition Ω={IE,IS=EI,ID},\Omega^{*}=\{I_{E}^{*},I_{S}=EI,I_{D}^{*}\}, where IE={IE,1,,IE,K1}I_{E}^{*}=\{I_{E,1},...,I_{E,K_{1}}\} and ID={ID,1,,ID,K2}I_{D}^{*}=\{I_{D,1},...,I_{D,K_{2}}\}. Probability intervals {IE,k}k=2K1\{I_{E,k}\}_{k=2}^{K_{1}} and {ID,k}k=2K2\{I_{D,k}\}_{k=2}^{K_{2}} have the same length as IS=EII_{S}=EI. Let K=K1+K2+1.K=K_{1}+K_{2}+1. The selected model mm under the mTPI-2 design can then be shown to be Bayes’ rule d\mathcal{R}_{d}, under an action set 𝒜m={1,,K}\mathcal{A}_{m}=\{1,...,K\}. Corollary 2 next summarizes the results.

Corollary 2.

Under mTPI-2, Ω={IE={IE,1,,IE,K1},IS=(pTϵ1,pT+ϵ2),ID={ID,1,,ID,K2}}\Omega^{*}=\{I_{E}^{*}=\{I_{E,1},\cdots,I_{E,K_{1}}\},I_{S}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}),I_{D}^{*}=\{I_{D,1},\cdots,I_{D,K_{2}}\}\}, 𝒜m={1,,K}\mathcal{A}_{m}=\{1,...,K\}. Assume the prior on pdp_{d} is conditionally independent and given by

π(pdm=k)=Ckbeta(pd;1,1)δ(pdIk).\pi(p_{d}\mid m=k)=C_{k}\cdot beta(p_{d};1,1)\delta(p_{d}\in I_{k}).

Then Bayes’ rule is

d=argmaxm𝒜mPr(m=ky)=argmaxm𝒜mUPM(Im).\mathcal{R}_{d}=\arg\max_{m\in\mathcal{A}_{m}}Pr(m=k\mid y)=\arg\max_{m\in\mathcal{A}_{m}}\mbox{UPM}(I_{m}).

See Appendix C for a proof. Corollary 2 establishes the Bayes’ rule d\mathcal{R}_{d} as an action in 𝒜m={1,,K}\mathcal{A}_{m}=\{1,...,K\}. To see the connection to the dose-finding decision in mTPI-2, we refer to the next result.


Corollary 3.

Let

mTPI-2={E,argmaxImUPM(Im)(0,pTϵ1),S,argmaxImUPM(Im)=(pTϵ1,pT+ϵ2),D,argmaxImUPM(Im)(pT+ϵ2,1).\displaystyle\mathcal{R}_{\mbox{mTPI-2}}=\left\{\begin{array}[]{ll}E,&\arg\max_{I_{m}}\mbox{UPM}(I_{m})\subset(0,p_{T}-\epsilon_{1}),\\ S,&\arg\max_{I_{m}}\mbox{UPM}(I_{m})=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}),\\ D,&\arg\max_{I_{m}}\mbox{UPM}(I_{m})\subset(p_{T}+\epsilon_{2},1).\end{array}\right.

Then

mTPI-2={E,Id(0,pTϵ1),S,Id=(pTϵ1,pT+ϵ2),D,Id(pT+ϵ2,1).\displaystyle{\mathcal{R}_{\mbox{mTPI-2}}}=\left\{\begin{array}[]{ll}E,&I_{\mathcal{R}_{d}}\subset(0,p_{T}-\epsilon_{1}),\\ S,&I_{\mathcal{R}_{d}}=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}),\\ D,&I_{\mathcal{R}_{d}}\subset(p_{T}+\epsilon_{2},1).\end{array}\right.

In other words, if d=m\mathcal{R}_{d}=m^{*}, then ImI_{m^{*}} is the interval with the largest UPM, for m𝒜mm^{*}\in\mathcal{A}_{m}. And if IMI_{M^{*}} is below, equal to, or above the EI=(pTϵ1,pT+ϵ2)=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), the decision is EE, SS, or DD, respectively. This is the same as the up-and-down rule in the mTPI-2 design in Guo et al. [5]. Proof of Corollary 3 is immediate and omitted.


The BOIN design

The BOIN design [13] uses a decision rule

RBOIN={E,p^dλ1,S,λ1<p^d<λ2,D,p^dλ2,\displaystyle R_{BOIN}=\left\{\begin{array}[]{ll}E,&\hat{p}_{d}\leq\lambda_{1},\\ S,&\lambda_{1}<\hat{p}_{d}<\lambda_{2},\\ D,&\hat{p}_{d}\geq\lambda_{2},\end{array}\right.

where p^d=ydnd\hat{p}_{d}=\frac{y_{d}}{n_{d}}, λ1=ξ(ϕE;ϕS),λ2=ξ(ϕD;ϕS),\lambda_{1}=\xi(\phi_{E};\phi_{S}),\lambda_{2}=\xi(\phi_{D};\phi_{S}), and

ξ(ϕi;ϕj)=log(1ϕi1ϕj)log(ϕj(1ϕi)ϕi(1ϕj)).\xi(\phi_{i};\phi_{j})=\frac{log\left(\frac{1-\phi_{i}}{1-\phi_{j}}\right)}{log\left(\frac{\phi_{j}(1-\phi_{i})}{\phi_{i}(1-\phi_{j})}\right)}. (10)

In particular, ϕS=pT\phi_{S}=p_{T}, ϕE(<pT)\phi_{E}(<p_{T}) and ϕD(>pT)\phi_{D}(>p_{T}) are pre-specified values. Here, ϕE\phi_{E} and ϕD\phi_{D} play a similar rule as (pTϵ1)(p_{T}-\epsilon_{1}) and (pT+ϵ2)(p_{T}+\epsilon_{2}) in the mTPI and mTPI-2 designs, which defines the boundaries of an initial equivalence interval elicited from clinicians. We show that the decision rule RBOINR_{BOIN} is also a Bayes’ rule under the proposed decision framework next.


Theorem 1.

Assume ydpdBin(nd,pd)y_{d}\mid p_{d}\sim\mbox{Bin}(n_{d},p_{d}), π(pdm=k)=δ(pd=ϕk)\pi(p_{d}\mid m=k)=\delta(p_{d}=\phi_{k}), for k=1,2,3k=1,2,3, and ϕ1=ϕE\phi_{1}=\phi_{E}, ϕ2=ϕS=pT\phi_{2}=\phi_{S}=p_{T}, and ϕ3=ϕD\phi_{3}=\phi_{D}. Under the 0-1 loss (a,pd)\ell(a,p_{d}) in Equation (4), the Bayes’ rule is equivalent to RBOINR_{BOIN}, i.e.,

d=RBOIN={E,p^dλ1,S,λ1<p^d<λ2,D,p^dλ2,\displaystyle\mathcal{R}_{d}=R_{BOIN}=\left\{\begin{array}[]{ll}E,&\hat{p}_{d}\leq\lambda_{1},\\ S,&\lambda_{1}<\hat{p}_{d}<\lambda_{2},\\ D,&\hat{p}_{d}\geq\lambda_{2},\end{array}\right.

where p^d=ydnd\hat{p}_{d}=\frac{y_{d}}{n_{d}}, λ1=ξ(ϕE;ϕS),λ2=ξ(ϕD;ϕS),\lambda_{1}=\xi(\phi_{E};\phi_{S}),\lambda_{2}=\xi(\phi_{D};\phi_{S}), and ξ\xi is defined as in Equation (10)

See Appendix C for a proof. By Theorem 1 the BOIN design takes the form of the Bayes’ rule under the same decision framework using the 0-1 loss. BOIN uses a point-mass prior for pdp_{d} on three values, ϕE\phi_{E}, ϕS\phi_{S}, ϕD\phi_{D}, while mTPI/mTPI-2 using truncated beta priors instead. Next, we show that the BOIN design is almost the same as the CCD design. This is easiest seen under the perspective of the proposed decision framework. The difference between the two designs are the locations of the point-mass priors.


The CCD design

The CCD design [7] compares p^d\hat{p}_{d} with (pTϵ1)(p_{T}-\epsilon_{1}) and (pT+ϵ2)(p_{T}+\epsilon_{2}), and uses the following up-and-down rule,

CCD\displaystyle\mathcal{R}_{\mbox{CCD}} =\displaystyle= {Ep^dpTϵ1SpTϵ1<p^d<pT+ϵ2Dp^dpT+ϵ2\displaystyle\left\{\begin{array}[]{ll}E&\hat{p}_{d}\leq p_{T}-\epsilon_{1}\\ S&p_{T}-\epsilon_{1}<\hat{p}_{d}<p_{T}+\epsilon_{2}\\ D&\hat{p}_{d}\geq p_{T}+\epsilon_{2}\end{array}\right.

Corollary 4 shows that the decision of the CCD design is the same Bayes’ rule in the same framework as BOIN but with a different prior distribution.

Corollary 4.

The CCD decision CCD=d\mathcal{R}_{\mbox{CCD}}=\mathcal{R}_{d} with

π(pdm=k)=δ(pd=ϕk),k=E,S,D,\pi(p_{d}\mid m=k)=\delta(p_{d}=\phi^{\prime}_{k}),\;\;\;k=E,S,D,

where ϕE=ξ1(pTϵ1)\phi^{\prime}_{E}=\xi^{-1}(p_{T}-\epsilon_{1}), ϕD=ξ1(pT+ϵ2)\phi^{\prime}_{D}=\xi^{-1}(p_{T}+\epsilon_{2}), ϕS=ϕS=pT\phi^{\prime}_{S}=\phi_{S}=p_{T}, and ξ(ϕ)ξ(ϕ,pT)\xi(\phi)\equiv\xi(\phi,p_{T}) in Equation (10).

See Appendix C for a proof.

Corollary 4 shows that BOIN and CCD are identical designs with the only difference being that BOIN uses a point-mass prior π(pdm=k)=δ(pd=ϕk)\pi(p_{d}\mid m=k)=\delta(p_{d}=\phi_{k}), whereas CCD uses π(pdm=k)=δ(pd=ϕk)\pi(p_{d}\mid m=k)=\delta(p_{d}=\phi^{\prime}_{k}).


The Int-CRM design

Using the same decision framework, we propose a variation of the CRM design, called Int-CRM. We assume the same parametric dose-response model as in the CRM design [15], with the probability of toxicity monotonically increasing with dose. Let did_{i} denote the dose for the iith patient, di{1,,T}d_{i}\in\{1,...,T\}, and YiY_{i} the binary indicator of DLT. The dose-response curve is assumed to be the power model as in the CRM,

F(d,θ)=qdexp(θ),F(d,\theta)=q_{d}^{exp(\theta)},

where (q1,..,qT)(q_{1},..,q_{T}) (“skeleton”) are a priori pre-specified dose toxicity probabilities. Other sensible dose-response models, such as a logit model, may be considered as well. The toxicity rates are dependent across doses through the dose-response curve and the inference is based on the parameter θ\theta. The likelihood function is given by

f(𝒚θ)i=1nF(di,θ)yi{1F(di,θ)}1yi,f(\bm{y}\mid\theta)\propto\prod_{i=1}^{n}F(d_{i},\theta)^{y_{i}}\{1-F(d_{i},\theta)\}^{1-y_{i}},

where nn is the number of patients in the trial.

Following Cheung and Chappell [3], we define an interval [A1,AT+1][A_{1},A_{T+1}] for θ\theta that is wide enough to allow for a wide range of dose-response curves. For example, set A1A_{1} and AT+1A_{T+1} so that q1exp(A1)>1105q_{1}^{exp(A_{1})}>1-10^{-5} and qTexp(AT+1)<105q_{T}^{exp(A_{T+1})}<10^{-5}, which correspond to response curves constantly equal to 1.0 and 0.0, respectively, and θ[A1,AT+1]\theta\in[A_{1},A_{T+1}] allows choices in-between these extremes. Using A1A_{1} and AT+1A_{T+1}, we define sub-intervals for θ\theta as the set of values that imply dkd_{k} having toxicity probability closest to pTp_{T},

Ik={θ[A1,AT+1]:|F(k,θ)pT|<|F(d,θ)pT|,dk},I_{k}=\{\theta\in[A_{1},A_{T+1}]:|F(k,\theta)-p_{T}|<|F(d,\theta)-p_{T}|,\forall d\neq k\},

k=1, …, T. As shown in Cheung and Chappell [3], IkI_{k} is an interval, denoted as I1=[ψ1=A1,ψ2)I_{1}=[\psi_{1}=A_{1},\psi_{2}), Ik=[ψk,ψk+1),k=2,,(T1)I_{k}=[\psi_{k},\psi_{k+1}),\;k=2,...,(T-1), IT=[ψT,ψT+1=AT+1]I_{T}=[\psi_{T},\psi_{T+1}=A_{T+1}], where ψk\psi_{k} is implicitly defined as the solution of

F(k1,ψk)+F(k,ψk)=2pT,k=2,,T.F(k-1,\psi_{k})+F(k,\psi_{k})=2p_{T},\;\;\;k=2,...,T.

Given the “skeleton” (q1,,qT)(q_{1},...,q_{T}), we can obtain the numerical result of the interval boundaries ϕk\phi_{k}’s by solving the equation above. See Appendix B for details. Each interval consists of a set of θ\theta values where dose kk’s toxicity probability is the closest to pTp_{T} among all the doses. We use these intervals IkI_{k}’s in our framework for Int-CRM. We propose hierarchical priors

π(m=k)=1T,k=1,,T\pi(m=k)=\frac{1}{T},\;\;\;k=1,...,T

and

π(θm=k)=ϕ(θ)δ(θIk)Ikϕ(θ)𝑑θ,\pi(\theta\mid m=k)=\frac{\phi(\theta)\delta(\theta\in I_{k})}{\int_{I_{k}}\phi(\theta)d\theta},

where ϕ(θ)\phi(\theta) is the density function of the normal distribution N(0,σ2)N(0,\sigma^{2}).

The action space of the Int-CRM design is 𝒜={1,,T}\mathcal{A}=\{1,...,T\}, corresponding to the dose for treating the next patient. Following the proposed decision framework, we use the 0-1 loss function and the Bayes’ rule that minimizes the posterior expected loss for the Int-CRM decision.

Theorem 2.

Under the 0-1 loss, i.e.,

(a,θ)={1,θIa0,θIa,a𝒜={1,,T},\displaystyle\ell(a,\theta)=\left\{\begin{array}[]{ll}1,&\theta\notin I_{a}\\ 0,&\theta\in I_{a}\end{array}\right.,\;\;\;\;a\in\mathcal{A}=\{1,...,T\},

the Int-CRM decision is the Bayes’ rule

Int-CRM\displaystyle\mathcal{R}_{\mbox{Int-CRM}} =\displaystyle= argmaxk𝒜Pr(m=ky)\displaystyle\arg\max_{k\in\mathcal{A}}Pr(m=k\mid y)
=\displaystyle= argmaxk𝒜p(yθ)π(θm=k)𝑑θ\displaystyle\arg\max_{k\in\mathcal{A}}\int p(y\mid\theta)\pi(\theta\mid m=k)d\theta
=\displaystyle= argmaxk𝒜i=1nF(di,θ)yi{1F(di,θ)1yi}ϕ(θ)δ(θIk)ϕ(θ)δ(θIk)𝑑θdθ.\displaystyle\arg\max_{k\in\mathcal{A}}\int\prod_{i=1}^{n}F(d_{i},\theta)^{y_{i}}\{1-F(d_{i},\theta)^{1-y_{i}}\}\frac{\phi(\theta)\delta(\theta\in I_{k})}{\int\phi(\theta)\delta(\theta\in I_{k})d\theta}d\theta.

The proof is immediate by the definition of Bayes’ rule. Below is the proposed Int-CRM dose-finding algorithm.

The Int-CRM Algorithm: Dose Finding Rules: After each cohort of patients completes the DLT follow-up period, the dose to be assigned is the Int-CRM\mathcal{R}_{\mbox{Int-CRM}}, the Bayes’ rule, unless the following safety rules apply. Safety Rules: Four additional rules are applied for safety. Rule 1: Dose Exclusion: If the current dose is considered excessively toxic, i.e., Prob{pd>pTdata}>ξProb\{p_{d}>p_{T}\mid data\}>\xi (see below about evaluating this probability), where the threshold ξ\xi is close to 1, say 0.95, the current and all higher doses will be excluded in the remainder of the trial to avoid assigning any patients to those doses. Rule 2: Early Stopping: If the current dose is the lowest dose (first dose) and is considered excessively toxic, i.e., p{p1>pTdata}>ξp\{p_{1}>p_{T}\mid data\}>\xi, where the threshold ξ\xi is close to 1, say 0.95, stop the trial early and declare no MTD. To evaluate p{pd>pTdata}p\{p_{d}>p_{T}\mid data\} in Rules 1 and 2 we use a Be(α0+yd,β0+ndyd)\mbox{Be}(\alpha_{0}+y_{d},\beta_{0}+n_{d}-y_{d}) distribution with α0=β0=1\alpha_{0}=\beta_{0}=1. Rule 3: No-Skipping Escalation: If the dose-finding rule recommends escalation, such escalation shall not increase the dose by more than one level. Dose-escalation cannot increase by more than one level. That is, suppose the current dose is dd. If the next recommended dose Int-CRM\mathcal{R}_{\mbox{Int-CRM}} is such that (Int-CRMd)>1(\mathcal{R}_{\mbox{Int-CRM}}-d)>1, escalate to dose (d+1)(d+1) instead. Rule 4: Coherence: No escalation is permitted if the empirical rate of DLT for the most recent cohort is higher than pTp_{T}, according to the coherence principle [2]. Trial Termination: The trial proceeds unless any of the following stopping criteria is met: If the pre-specified maximum total sample size nn is reached. Rule 2 above. MTD Selection: Once all the enrolled patients complete the DLT observation and the trial is not stopped early, the last dose level Int-CRM\mathcal{R}_{\mbox{Int-CRM}} is selected as the MTD.

4 Simulation Studies

4.1 Simulation Settings

We set up simulation studies to evaluate the operating characteristics of the different designs that we have shown to be special cases of the proposed general framework, including the mTPI, mTPI-2, BOIN, CCD and the Int-CRM designs. We show how the common underlying decision framework leads to very similar performances of the designs under consideration. We also compare to the i3+3 design and the original CRM design as benchmarks.

Fixed Scenarios

We use a total of 15 scenarios, with a set of T=4,5,T=4,5, or 66 doses. Assume the target toxicity probability pT=ϕS=0.3p_{T}=\phi_{S}=0.3 (ϕS\phi_{S} is the notation in BOIN), and maximum sample size of 30. For all designs we apply the same safety rules as in the mTPI, mTPI-2, and Int-CRM designs. See Appendix A for details. For interval-based designs we use EI =(pTϵ1,pT+ϵ2)=(p_{T}-\epsilon_{1},p_{T}+\epsilon_{2}), ϵ1=ϵ2=0.05\epsilon_{1}=\epsilon_{2}=0.05. For the Int-CRM and CRM design, the skeleton qdq_{d} is generated using the approach proposed in Lee and Cheung [11], which selects the skeleton based on indifference intervals for the MTD. Also, we set the half width of the indifference intervals, δ=0.05\delta=0.05. We apply the coherence principle [2], avoiding immediate escalation after toxic outcomes.

For the BOIN design, we set λ1=pTϵ1\lambda_{1}=p_{T}-\epsilon_{1}, λ2=pT+ϵ2\lambda_{2}=p_{T}+\epsilon_{2}. This is equivalent to setting ϕE=ξ1(pTϵ1)\phi_{E}=\xi^{-1}(p_{T}-\epsilon_{1}), and ϕD=ξ1(pT+ϵ2)\phi_{D}=\xi^{-1}(p_{T}+\epsilon_{2}). By Theorem 1, these values for λ1\lambda_{1} and λ2\lambda_{2} make the BOIN decision identical to the CCD design, leading to same operating characteristics of the two designs.

Random Scenarios

We generate additional 1,000 random scenarios to further evaluate the designs. Scenarios are generated based on the pseudo-uniform algorithm in Clertant et al. [4]. Figure 2 plots the first 20 scenarios. Other settings of the designs are the same as the fixed scenarios, such as pTp_{T} and EI, λ1,λ2\lambda_{1},\lambda_{2} for the BOIN design, and δ\delta for the Int-CRM and CRM designs.

Refer to caption
Figure 2: Illustration of the first 20 random scenarios of true toxicity probabilities in the simulation.

4.2 Simulation Results

We evaluate the performance of the phase I designs using several metrics, based on their ability to identify the MTD and the safety in dose selection and patient allocation. Table 4 summarizes the means and standard deviations of key performance metrics for the simulation with 1,000 scenarios. All designs show remarkable similarity with the largest mean difference across designs only about 0.02. This highlights the underlying connection of these designs and echoes our findings based on the unified decision framework that can generate most designs as special cases. Figures 3, 4 and 5 in Appendix D present the simulation results of the 15 fixed scenarios.

In general, the five designs tested in the simulation studies exhibit remarkably similar performances. Specifically, they show comparable probabilities (across repeated simulation) of allocating patients to the true MTD, and similar risk of allocating patients to overly toxic doses. The BOIN/CCD and Int-CRM designs yield slightly higher PCS (probability of correct selection of MTD) in some cases, such as scenarios 1 and 4 in Figure 4. However, they also report a higher risk in selecting doses beyond the true MTD. For example, in scenarios 2, 3, and 5 in Figure 4 in Appendix D, the probabilities of over-dosing selection under BOIN, CCD and Int-CRM are higher compared to the other designs.

Table 4: Simulation results of 1,000 random scenarios. Entries are the 𝒎𝒆𝒂𝒏sd\bm{mean}_{sd} (across the 1,000 scenarios) proportion of simulated trials for each design and metric.
Metrics mTPI mTPI-2 BOIN/CCD Int-CRM CRM i3+3
Correct Sel. of MTD 0.60(0.15)0.60_{(0.15)} 0.62(0.14)0.62_{(0.14)} 0.62(0.14)0.62_{(0.14)} 0.62(0.14)0.62_{(0.14)} 0.62(0.14)0.62_{(0.14)} 0.62(0.14)0.62_{(0.14)}
Sel. over MTD 0.10(0.11)0.10_{(0.11)} 0.11(0.11)0.11_{(0.11)} 0.12(0.12)0.12_{(0.12)} 0.12(0.12)0.12_{(0.12)} 0.11(0.11)0.11_{(0.11)} 0.11(0.11)0.11_{(0.11)}
Pat. at MTD 0.50(0.22)0.50_{(0.22)} 0.50(0.21)0.50_{(0.21)} 0.50(0.20)0.50_{(0.20)} 0.51(0.20)0.51_{(0.20)} 0.51(0.21)0.51_{(0.21)} 0.50(0.21)0.50_{(0.21)}
Pat. over MTD 0.11(0.10)0.11_{(0.10)} 0.11(0.10)0.11_{(0.10)} 0.12(0.11)0.12_{(0.11)} 0.12(0.11)0.12_{(0.11)} 0.12(0.10)0.12_{(0.10)} 0.11(0.10)0.11_{(0.10)}
Tox. 0.26(0.05)0.26_{(0.05)} 0.26(0.05)0.26_{(0.05)} 0.26(0.05)0.26_{(0.05)} 0.26(0.05)0.26_{(0.05)} 0.26(0.05)0.26_{(0.05)} 0.26(0.05)0.26_{(0.05)}
None Sel. 0.04(0.07)0.04_{(0.07)} 0.04(0.07)0.04_{(0.07)} 0.04(0.07)0.04_{(0.07)} 0.04(0.07)0.04_{(0.07)} 0.04(0.07)0.04_{(0.07)} 0.04(0.07)0.04_{(0.07)}

5 Discussion

We have developed a general decision framework for phase I dose-finding designs. We have shown that interval-based designs, like mTPI, mTPI-2, BOIN, CCD, and the model-based design Int-CRM fit into this unified framework.

All designs use the same 0-1 loss function, and all interval designs assume a binomial likelihood function. The prior construction for some designs involves the notion of candidate models. Candidate models are specified assuming different toxic profiles for the doses. Given the model, the mTPI, mTPI-2 and Int-CRM assume continuous prior distributions, using beta or normal distributions truncated to the restricted parameter space implied by the given model. The BOIN and CCD designs use a different approach with a discrete prior on pdp_{d}, supported at three distinct values. Choosing those atoms is challenging and may be difficult to interpret. However, as shown in many simulations conducted and published in literature, the BOIN and CCD designs perform very well in Phase I trials with relatively small sample size.

Additionally, different loss functions can be considered in the proposed framework penalize undesirable actions and outcomes. For examples, the loss for mistakenly making an escalation decision may be larger than for a wrong de-escalation. However, such loses usually lead to more complex and less interpretable decision rules.

It is demonstrated that the designs in this paper perform similarly with comparable reliability and safety. The i3+3 rule-based design is not a part of this framework, but also generates similar operating characteristics, comparable with the other designs. The i3+3 design shares a practically important feature with interval based designs. One can pre-tabulate decision tables, which is a critical feature for the implementation in actual trials. Clinicians can choose a desirable design for phase I clinical trial based on their preference, including the model-based design CRM, the interval designs, mTPI, mTPI-2, BOIN and CCD, and the rule-based design i3+3.

References

  • Berger [2013] Berger, J. O. (2013). Statistical decision theory and Bayesian analysis. Springer Science & Business Media.
  • Cheung [2011] Cheung, Y. K. (2011). Dose finding by the continual reassessment method. CRC Press.
  • Cheung and Chappell [2002] Cheung, Y. K. and Chappell, R. (2002). A simple technique to evaluate model sensitivity in the continual reassessment method. Biometrics 58, 671–674.
  • Clertant et al. [2017] Clertant, M., O’Quigley, J., et al. (2017). Semiparametric dose finding methods. Journal of the Royal Statistical Society Series B 79, 1487–1508.
  • Guo et al. [2017] Guo, W., Wang, S.-J., Yang, S., Lynn, H., and Ji, Y. (2017). A Bayesian interval dose-finding design addressing Ockham’s razor: mTPI-2. Contemporary clinical trials 58, 23–33.
  • Horton et al. [2017] Horton, B. J., Wages, N. A., and Conaway, M. R. (2017). Performance of toxicity probability interval based designs in contrast to the continual reassessment method. Statistics in medicine 36, 291–300.
  • Ivanova et al. [2007] Ivanova, A., Flournoy, N., and Chung, Y. (2007). Cumulative cohort design for dose-finding. Journal of Statistical Planning and Inference 137, 2316–2327.
  • Jefferys and Berger [1992] Jefferys, W. H. and Berger, J. O. (1992). Ockham’s razor and Bayesian analysis. American Scientist 80, 64–72.
  • Ji et al. [2007] Ji, Y., Li, Y., and Nebiyou Bekele, B. (2007). Dose-finding in phase I clinical trials based on toxicity probability intervals. Clinical trials 4, 235–244.
  • Ji et al. [2010] Ji, Y., Liu, P., Li, Y., and Nebiyou Bekele, B. (2010). A modified toxicity probability interval method for dose-finding trials. Clinical trials 7, 653–663.
  • Lee and Cheung [2011] Lee, S. M. and Cheung, Y. K. (2011). Calibration of prior variance in the Bayesian continual reassessment method. Statistics in medicine 30, 2081–2089.
  • Liu et al. [2020] Liu, M., Wang, S.-J., and Ji, Y. (2020). The i3+ 3 design for phase I clinical trials. Journal of Biopharmaceutical Statistics 30, 294–304.
  • Liu and Yuan [2015] Liu, S. and Yuan, Y. (2015). Bayesian optimal interval designs for phase I clinical trials. Journal of the Royal Statistical Society: Series C: Applied Statistics pages 507–523.
  • Neuenschwander et al. [2008] Neuenschwander, B., Branson, M., and Gsponer, T. (2008). Critical aspects of the Bayesian approach to phase I cancer trials. Statistics in Medicine 27, 2420–2439.
  • O’Quigley et al. [1990] O’Quigley, J., Pepe, M., and Fisher, L. (1990). Continual reassessment method: a practical design for phase 1 clinical trials in cancer. Biometrics pages 33–48.
  • Storer [1989] Storer, B. E. (1989). Design and analysis of phase I clinical trials. Biometrics pages 925–937.
  • Thorburn [1918] Thorburn, W. M. (1918). The myth of Occam’s razor. Mind 27, 345–353.
  • Tighiouart and Rogatko [2010] Tighiouart, M. and Rogatko, A. (2010). Dose finding with escalation with overdose control (EWOC) in cancer clinical trials. Statistical Science 25, 217–226.
  • Yan et al. [2017] Yan, F., Mandrekar, S. J., and Yuan, Y. (2017). Keyboard: a novel Bayesian toxicity probability interval design for phase I clinical trials. Clinical Cancer Research 23, 3994–4003.
  • Yuan et al. [2016] Yuan, Y., Hess, K. R., Hilsenbeck, S. G., and Gilbert, M. R. (2016). Bayesian optimal interval design: a simple and well-performing design for phase I oncology trials. Clinical Cancer Research 22, 4291–4301.

Appendix A

Safety rules: Following the mTPI and mTPI-2 designs [10, 5], two safety rules are added, as ethical constraints to avoid excessive toxicity, to all the designs in the simulation study when needed.

Rule 1: Dose Exclusion.

If the current dose is considered excessively toxic, i.e., p{pd>pTData}>ξp\{p_{d}>p_{T}\mid Data\}>\xi, where the threshold ξ\xi is close to 1, say 0.95, then the current and all higher doses are excluded and never used again for the remainder of the trial.

Rule 2: Early Stopping.

If the current dose is the lowest dose (first dose) and is considered excessively toxic according to Rule 1, then stop the trial for safety.

In safety Rules 1 and 2, Prob{pd>pTData}Prob\{p_{d}>p_{T}\mid Data\} is a function of the cumulative beta distribution Be(α0+yd,β0+ndyd)\mbox{Be}(\alpha_{0}+y_{d},\beta_{0}+n_{d}-y_{d}), and α0=β0=1\alpha_{0}=\beta_{0}=1 is used by default.

Appendix B

Intervals in Int-CRM design: The intervals in the Int-CRM design are,

Ik={θ[A1,AT+1]:|F(k,θ)pT|<|F(d,θ)pT|,dk},k=1,,T,I_{k}=\{\theta\in[A_{1},A_{T+1}]:|F(k,\theta)-p_{T}|<|F(d,\theta)-p_{T}|,\forall d\neq k\},\;\;\;k=1,\dots,T,

which have the form I1=[ψ1=A1,ψ2)I_{1}=[\psi_{1}=A_{1},\psi_{2}), Ik=[ψk,ψk+1),k=2,,(T1)I_{k}=[\psi_{k},\psi_{k+1}),\;k=2,...,(T-1), IT=[ψT,ψT+1=AT+1]I_{T}=[\psi_{T},\psi_{T+1}=A_{T+1}]. The boundary of the intervals ψk\psi_{k} satisfies the equation

F(k1,ψk)+F(k,ψk)=2pT,k=2,,T.F(k-1,\psi_{k})+F(k,\psi_{k})=2p_{T},\;\;\;k=2,...,T.

Under the model of Int-CRM,

qk1exp(ϕk)+qkexp(ϕk)=2pT.q_{k-1}^{exp(\phi_{k})}+q_{k}^{exp(\phi_{k})}=2p_{T}.

Therefore, given the skeleton (q1,,qT)(q_{1},...,q_{T}) and pTp_{T}, we can obtain the numerical solution of the interval boundary by searching over a sequence of ϕk[A1,AT+1]\phi_{k}\in[A_{1},A_{T+1}].

Appendix C

Proof of Proposition 1

Proof:

By definition,

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmina𝒜(a,pd)p(pd𝒚)𝑑pd=argmina𝒜δ(pdIa)p(pd𝒚)𝑑pd\displaystyle\arg\min_{a\in\mathcal{A}}\int\ell(a,p_{d})p(p_{d}\mid\bm{y})dp_{d}=\arg\min_{a\in\mathcal{A}}\int\delta(p_{d}\notin I_{a})p(p_{d}\mid\bm{y})dp_{d}
=\displaystyle= argmina𝒜{1δ(pdIa)}p(pd𝒚)𝑑pd=argmaxa𝒜δ(pdIa)p(pd𝒚)𝑑pd\displaystyle\arg\min_{a\in\mathcal{A}}\int\{1-\delta(p_{d}\in I_{a})\}p(p_{d}\mid\bm{y})dp_{d}=\arg\max_{a\in\mathcal{A}}\int\delta(p_{d}\in I_{a})p(p_{d}\mid\bm{y})dp_{d}
=\displaystyle= argmaxa𝒜Pr(pdIa𝒚),\displaystyle\arg\max_{a\in\mathcal{A}}Pr(p_{d}\in I_{a}\mid\bm{y}),

which proves the first equation in Proposition 1. In addition, we have

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxa𝒜k=13δ(pdIa)p(𝒚pd)π(pdm=k)π(m=k)dpd\displaystyle\arg\max_{a\in\mathcal{A}}\int\sum_{k=1}^{3}\delta(p_{d}\in I_{a})p(\bm{y}\mid p_{d})\pi(p_{d}\mid m=k)\pi(m=k)dp_{d} (14)

Since

𝒜={E,S,D},I1=IE,I2=IS,I3=ID,\mathcal{A}=\{E,S,D\},I_{1}=I_{E},I_{2}=I_{S},I_{3}=I_{D},

equation 14 becomes

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxa𝒜k=13δ(pdIa)p(𝒚pd)g(pd)δ(pdIk)π(m=k)dpd\displaystyle\arg\max_{a\in\mathcal{A}}\int\sum_{k=1}^{3}\delta(p_{d}\in I_{a})p(\bm{y}\mid p_{d})g(p_{d})\delta(p_{d}\in I_{k})\pi(m=k)dp_{d}
=\displaystyle= argmaxk{1,2,3}p(𝒚pd)g(pd)δ(pdIk)π(m=k)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}\int p(\bm{y}\mid p_{d})g(p_{d})\delta(p_{d}\in I_{k})\pi(m=k)dp_{d}
=\displaystyle= argmaxk{1,2,3}p(𝒚pd)π(pdm=k)π(m=k)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}\int p(\bm{y}\mid p_{d})\pi(p_{d}\mid m=k)\pi(m=k)dp_{d}
=\displaystyle= argmaxk{1,2,3}p(𝒚pd)π(pdm=k)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}\int p(\bm{y}\mid p_{d})\pi(p_{d}\mid m=k)dp_{d}
=\displaystyle= argmaxk{1,2,3}Pr(m=k𝒚)\displaystyle\arg\max_{k\in\{1,2,3\}}Pr(m=k\mid\bm{y})

The penultimate equation is true since π(m=k)=13,k=1,2,3.\pi(m=k)=\frac{1}{3},\;k=1,2,3.

\square

Proof of Corollary 1

Proof:

Recall the action space 𝒜={E,S,D}\mathcal{A}=\{E,S,D\}. Based on Equation (2.2) in Proposition 1, the Bayes’ rule is

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxk{1,2,3}Pr(m=ky)\displaystyle\arg\max_{k\in\{1,2,3\}}Pr(m=k\mid y)
=\displaystyle= argmaxk{1,2,3}p(ypd)π(pdm=k)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}\int p(y\mid p_{d})\pi(p_{d}\mid m=k)dp_{d}
=\displaystyle= argmaxa𝒜pdyd(1pd)ndydCkbeta(pd;1,1)δ(pdIa)𝑑pd\displaystyle\arg\max_{a\in\mathcal{A}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\cdot C_{k}beta(p_{d};1,1)\delta(p_{d}\in I_{a})dp_{d}
=\displaystyle= argmaxa𝒜pdyd(1pd)ndydbeta(pd;1,1)δ(pdIa)Iabeta(p;1,1)𝑑p𝑑pd\displaystyle\arg\max_{a\in\mathcal{A}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\frac{beta(p_{d};1,1)\delta(p_{d}\in I_{a})}{\int_{I_{a}}beta(p;1,1)dp}dp_{d}
=\displaystyle= argmaxa𝒜pdyd(1pd)ndydδ(pdIa)Ia𝑑pd=argmaxa𝒜UPM(Ia)=RmTPI\displaystyle\arg\max_{a\in\mathcal{A}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\frac{\delta(p_{d}\in I_{a})}{||I_{a}||}dp_{d}=\arg\max_{a\in\mathcal{A}}\mbox{UPM}(I_{a})=R_{mTPI}

\square

Proof of Corollary 2

Proof:

Based on Equation (2.2), the Bayes’ rule is equal to

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxm𝒜mPr(m=ky)\displaystyle\arg\max_{m\in\mathcal{A}_{m}}Pr(m=k\mid y)
=\displaystyle= argmaxm{1,,K}p(ypd)π(pdm=k)𝑑pd\displaystyle\arg\max_{m\in\{1,...,K\}}\int p(y\mid p_{d})\pi(p_{d}\mid m=k)dp_{d}
=\displaystyle= argmaxm{1,,K}pdyd(1pd)ndydCkbeta(pd;1,1)δ(pdIm)𝑑pd\displaystyle\arg\max_{m\in\{1,...,K\}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\cdot C_{k}beta(p_{d};1,1)\delta(p_{d}\in I_{m})dp_{d}
=\displaystyle= argmaxm{1,,K}pdyd(1pd)ndydbeta(pd;1,1)δ(pdIm)Imbeta(p;1,1)𝑑p𝑑pd\displaystyle\arg\max_{m\in\{1,...,K\}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\frac{beta(p_{d};1,1)\delta(p_{d}\in I_{m})}{\int_{I_{m}}beta(p;1,1)dp}dp_{d}
=\displaystyle= argmaxm{1,,K}pdyd(1pd)ndydδ(pdIm)Im𝑑pd\displaystyle\arg\max_{m\in\{1,...,K\}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\frac{\delta(p_{d}\in I_{m})}{||I_{m}||}dp_{d}
=\displaystyle= argmaxm𝒜mUPM(Im).\displaystyle\arg\max_{m\in\mathcal{A}_{m}}\mbox{UPM}(I_{m}).

\square

Proof of Corollary 4

Proof:

Based on Equation (2.2) and the proof of Theorem 1, the Bayes’ rule is

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxa{E,S,D}ϕayd(1ϕa)ndyd,{E,p^dλ1,S,λ1<p^d<λ2,D,p^dλ2,\displaystyle\arg\max_{a\in\{E,S,D\}}{\phi^{\prime}_{a}}^{y_{d}}(1-{\phi^{\prime}_{a}})^{n_{d}-y_{d}},\left\{\begin{array}[]{ll}E,&\hat{p}_{d}\leq\lambda_{1},\\ S,&\lambda_{1}<\hat{p}_{d}<\lambda_{2},\\ D,&\hat{p}_{d}\geq\lambda_{2},\end{array}\right.

with λ1=ξ(ϕE)=ξ(ξ1(pTϵ1))=pTϵ1\lambda_{1}=\xi(\phi_{E})=\xi(\xi^{-1}(p_{T}-\epsilon_{1}))=p_{T}-\epsilon_{1} and λ2=ξ(ϕD)=ξ(ξ1(pT+ϵ2))=pT+ϵ2\lambda_{2}=\xi(\phi_{D})=\xi(\xi^{-1}(p_{T}+\epsilon_{2}))=p_{T}+\epsilon_{2}. Therefore,

d\displaystyle\mathcal{R}_{d} =\displaystyle= {Ep^dpTϵ1SpTϵ1<p^d<pT+ϵ2Dp^dpT+ϵ2=CCD.\displaystyle\left\{\begin{array}[]{ll}E&\hat{p}_{d}\leq p_{T}-\epsilon_{1}\\ S&p_{T}-\epsilon_{1}<\hat{p}_{d}<p_{T}+\epsilon_{2}\\ D&\hat{p}_{d}\leq p_{T}+\epsilon_{2}\end{array}\right.\hskip 28.45274pt=\mathcal{R}_{\mbox{CCD}}.

\square

Proof of Theorem 1

Proof:

According to Proposition 1, under the proposed decision framework, the Bayes’ rule is

d\displaystyle\mathcal{R}_{d} =\displaystyle= argmaxk{1,2,3}Pr(m=ky)=argmaxk{1,2,3}p(ypd)π(pdm=k)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}Pr(m=k\mid y)=\arg\max_{k\in\{1,2,3\}}\int p(y\mid p_{d})\pi(p_{d}\mid m=k)dp_{d} (17)
=\displaystyle= argmaxk{1,2,3}pdyd(1pd)ndydδ(pd=ϕk)𝑑pd\displaystyle\arg\max_{k\in\{1,2,3\}}\int p_{d}^{y_{d}}(1-p_{d})^{n_{d}-y_{d}}\delta{(p_{d}=\phi_{k})}dp_{d}
=\displaystyle= argmaxk{1,2,3}ϕkyd(1ϕk)ndyd=argmaxa{E,S,D}ϕayd(1ϕa)ndyd.\displaystyle\arg\max_{k\in\{1,2,3\}}{\phi_{k}}^{y_{d}}(1-{\phi_{k}})^{n_{d}-y_{d}}=\mbox{arg}\max_{a\in\{E,S,D\}}{\phi_{a}}^{y_{d}}{(1-\phi_{a})}^{n_{d}-y_{d}}.

Let g(p)=pyd(1p)ndyd,g(p)=p^{y_{d}}(1-p)^{n_{d}-y_{d}}, then equation 17 becomes

d=argmaxa{E,S,D}g(ϕa).\mathcal{R}_{d}=\mbox{arg}\max_{a\in\{E,S,D\}}g(\phi_{a}).

When yd=0y_{d}=0, g(p)=(1p)nd,g(p)=(1-p)^{n_{d}}, and g(p)g(p) is monotonically decreasing with pp. When yd>0y_{d}>0,

dg(p)dp=(ydndp){ydp(yd1)(1p)ndyd1}.\frac{dg(p)}{dp}=(y_{d}-n_{d}p)\left\{y_{d}p^{(y_{d}-1)}(1-p)^{n_{d}-y_{d}-1}\right\}.

Note that the term {ydpyd1(1p)ndyd1}>0\left\{y_{d}p^{y_{d}-1}(1-p)^{n_{d}-y_{d}-1}\right\}>0. Then, dg(p)dp>0\frac{dg(p)}{dp}>0 if p(0,ydnd)p\in(0,\frac{y_{d}}{n_{d}}); and dg(p)dp<0\frac{dg(p)}{dp}<0 if p(ydnd,1)p\in(\frac{y_{d}}{n_{d}},1). Therefore, the function g(p)g(p) first increases and then decreases with pp. It only has one mode which is a maximum. In summary, g(p)g(p) either monotonically decreases or first increases and decreases with pp.

1)1) When p^dλ1\hat{p}_{d}\leq\lambda_{1},

ydndλ1=log(1ϕE1ϕS)log(ϕS(1ϕE)ϕE(1ϕS))=log(1ϕE)log(1ϕS)log(ϕS1ϕS)log(ϕE1ϕE).\frac{y_{d}}{n_{d}}\leq\lambda_{1}=\frac{log\left(\frac{1-\phi_{E}}{1-\phi_{S}}\right)}{log\left(\frac{\phi_{S}(1-\phi_{E})}{\phi_{E}(1-\phi_{S})}\right)}=\frac{log(1-\phi_{E})-log(1-\phi_{S})}{log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{E}}{1-\phi_{E}})}.

And because 0<ϕE<ϕS<ϕD<10<\phi_{E}<\phi_{S}<\phi_{D}<1, we have log(ϕE1ϕE)<log(ϕS1ϕS)<log(ϕD1ϕD).log(\frac{\phi_{E}}{1-\phi_{E}})<log(\frac{\phi_{S}}{1-\phi_{S}})<log(\frac{\phi_{D}}{1-\phi_{D}}). Therefore, log(ϕS1ϕS)log(ϕE1ϕE)>0.log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{E}}{1-\phi_{E}})>0. Hence,

yd{log(ϕS1ϕS)log(ϕE1ϕE)}nd{log(1ϕE)log(1ϕS)},y_{d}\left\{log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{E}}{1-\phi_{E}})\right\}\leq n_{d}\left\{log(1-\phi_{E})-log(1-\phi_{S})\right\},

and by taking exponentiation on both sides, we have

(ϕS1ϕS)yd(1ϕS)nd(ϕE1ϕE)yd(1ϕE)nd,(\frac{\phi_{S}}{1-\phi_{S}})^{y_{d}}(1-\phi_{S})^{n_{d}}\leq(\frac{\phi_{E}}{1-\phi_{E}})^{y_{d}}(1-\phi_{E})^{n_{d}},

which leads to

ϕSyd(1ϕS)ndydϕEyd(1ϕE)ndyd,\phi_{S}^{y_{d}}(1-\phi_{S})^{n_{d}-y_{d}}\leq\phi_{E}^{y_{d}}(1-\phi_{E})^{n_{d}-y_{d}},

i.e., g(ϕS)g(ϕE).g(\phi_{S})\leq g(\phi_{E}). Since g(p)g(p) either monotonically decreases or first increases and then decreases with pp, and because ϕE<ϕS<ϕD\phi_{E}<\phi_{S}<\phi_{D} and g(ϕE)g(ϕS)g(\phi_{E})\geq g(\phi_{S}), we have g(ϕE)g(ϕS)>g(ϕD).g(\phi_{E})\geq g(\phi_{S})>g(\phi_{D}). Therefore, when p^dλ1\hat{p}_{d}\leq\lambda_{1}, d=argmaxa{E,S,D}g(ϕa)=E.\mathcal{R}_{d}=\mbox{arg}\max_{a\in\{E,S,D\}}g(\phi_{a})=E.

2)2) When p^dλ2\hat{p}_{d}\geq\lambda_{2},

ydndλ2=log(1ϕD1ϕS)log(ϕS(1ϕD)ϕD(1ϕS))=log(1ϕD)log(1ϕS)log(ϕS1ϕS)log(ϕD1ϕD).\frac{y_{d}}{n_{d}}\geq\lambda_{2}=\frac{log\left(\frac{1-\phi_{D}}{1-\phi_{S}}\right)}{log\left(\frac{\phi_{S}(1-\phi_{D})}{\phi_{D}(1-\phi_{S})}\right)}=\frac{log(1-\phi_{D})-log(1-\phi_{S})}{log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{D}}{1-\phi_{D}})}.

Since log(ϕS1ϕS)log(ϕD1ϕD)<0log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{D}}{1-\phi_{D}})<0,

yd{log(ϕS1ϕS)log(ϕD1ϕD)}nd{log(1ϕD)log(1ϕS)},y_{d}\left\{log(\frac{\phi_{S}}{1-\phi_{S}})-log(\frac{\phi_{D}}{1-\phi_{D}})\right\}\leq n_{d}\left\{log(1-\phi_{D})-log(1-\phi_{S})\right\},

and

(ϕS1ϕS)yd(1ϕS)nd(ϕD1ϕD)yd(1ϕD)nd,(\frac{\phi_{S}}{1-\phi_{S}})^{y_{d}}(1-\phi_{S})^{n_{d}}\leq(\frac{\phi_{D}}{1-\phi_{D}})^{y_{d}}(1-\phi_{D})^{n_{d}},

which leads to

ϕSyd(1ϕS)ndydϕDyd(1ϕD)ndyd,\phi_{S}^{y_{d}}(1-\phi_{S})^{n_{d}-y_{d}}\leq\phi_{D}^{y_{d}}(1-\phi_{D})^{n_{d}-y_{d}},

i.e., g(ϕS)g(ϕD).g(\phi_{S})\leq g(\phi_{D}). Again, either g(p)g(p) monotonically decreases or first increases then decreases with pp. In either cases, we have g(ϕE)<g(ϕS)g(ϕD).g(\phi_{E})<g(\phi_{S})\leq g(\phi_{D}). Therefore, when p^dλ2\hat{p}_{d}\geq\lambda_{2}, d=argmaxa{E,S,D}g(ϕa)=D.\mathcal{R}_{d}=\mbox{arg}\max_{a\in\{E,S,D\}}g(\phi_{a})=D.

3)3) When λ1<p^d<λ2\lambda_{1}<\hat{p}_{d}<\lambda_{2}, we have

ϕSyd(1ϕS)ndyd>ϕEyd(1ϕE)ndyd,\phi_{S}^{y_{d}}(1-\phi_{S})^{n_{d}-y_{d}}>\phi_{E}^{y_{d}}(1-\phi_{E})^{n_{d}-y_{d}},

and

ϕSyd(1ϕS)ndyd>ϕDyd(1ϕD)ndyd,\phi_{S}^{y_{d}}(1-\phi_{S})^{n_{d}-y_{d}}>\phi_{D}^{y_{d}}(1-\phi_{D})^{n_{d}-y_{d}},

i.e., g(ϕS)>g(ϕE)g(\phi_{S})>g(\phi_{E}) and g(ϕS)>g(ϕD).g(\phi_{S})>g(\phi_{D}). Therefore, when λ1<p^d<λ2\lambda_{1}<\hat{p}_{d}<\lambda_{2}, d=argmaxa{E,S,D}g(ϕa)=S.\mathcal{R}_{d}=\mbox{arg}\max_{a\in\{E,S,D\}}g(\phi_{a})=S.

\square

Appendix D

Operating characteristics for the 15 fixed scenarios in the simulation study. The yellow bar highlights the true MTD. Prob of Select at/over MTD refers to the probability (over repeat simulation) of selecting the true MTD and a dose above the true MTD, respectively. Prob of Pat. at/over MTD refers to the relative frequency of patients assigned at or above the true MTD. Prob of Toxicity refers to the frequency of patients experienced DLT in all simulated trials. Prob of no selection refers to the probability of failing to recommend any dose.

Figure 3: Simulation results of the 5 fixed scenarios with 4 doses levels.

Refer to caption

Refer to caption

Refer to caption

[Uncaptioned image]

[Uncaptioned image]

Figure 4: Simulation results of the 5 fixed scenarios with 4 doses levels.

Refer to caption

Refer to caption

Refer to caption

[Uncaptioned image]

[Uncaptioned image]

Figure 5: Simulation results of the 5 fixed scenarios with 4 doses levels.

Refer to caption

Refer to caption

Refer to caption

[Uncaptioned image]

[Uncaptioned image]