This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\renewtheoremstyle

plain

\nolinenumbers

A Risk-Ratio-Based Marginal Sensitivity Model for Causal Effects in Observational Studies

MD ABDUL BASIT    MAHBUB AHM LATIF [email protected] [email protected] Institute of Statistical Research and Training,
Univeristy of Dhaka, Bangladesh
      ABDUS S. WAHED [email protected] Department of Biostatistics and Computational Biology,
University of Rochester, USA
Abstract

In observational studies, the identification of causal estimands depends on the no unmeasured confounding (NUC) assumption. As this assumption is not testable from observed data, sensitivity analysis plays an important role in observational studies to investigate the impact of unmeasured confounding on the causal conclusions. In this paper, we proposed a risk-ratio-based sensitivity analysis framework by introducing a modified marginal sensitivity model for observational studies with binary treatments. We further extended the proposed framework to the multivalued treatment setting. We then showed how the point estimate intervals and the corresponding percentile bootstrap confidence intervals can be constructed efficiently under the proposed framework. Simulation results suggested that the proposed framework of sensitivity analysis performs well in the presence of adequate overlap among the treatment groups. Lastly, we demonstrated our proposed sensitivity analysis framework by estimating the causal effect of maternal education on female fertility in Bangladesh.

keywords:
Causal Inference, Inverse Probability Weighting, Percentile Bootstrap, Sensitivity Analysis.

\arabicsection Introduction

Randomized controlled trials (RCTs) are considered to be the gold standard for estimating the causal effect of a treatment on an outcome of interest. However, researchers are often limited to investigating causal relationships using only observational data when conducting randomized experiments is not feasible. In the absence of randomized treatment assignments, the estimation of causal effects in observational studies is usually based on a set of identifiability assumptions. One of the most important such assumptions is the strong ignorability or the no unmeasured confounding (NUC) assumption, which implies that we can observe all relevant confounders in an observational study. Since the NUC assumption is essentially unverifiable using observed data, it is necessary to perform sensitivity analyses that assess the robustness of the causal conclusions obtained from observational studies in the presence of unmeasured confounding.

Sensitivity analysis is recognized as an essential step in the process of causal inference from observational studies. It refers to the approaches that investigate how causal conclusions are impacted in the presence of unmeasured confounding in observational studies. Bind & Rubin (2021), in their guidelines for the statistical reporting of observational studies, listed sensitivity analysis as one of the five major steps that need to be carried out before reporting causal conclusions elicited from observational studies. Many sponsors and regulatory bodies also nowadays require researchers to conduct some form of sensitivity analysis while drawing causal inferences from observational studies (for example, see standard MD-4 of the Patient-Centered Outcome Research Institute (PCORI) methodology standards for handling missing data at https://tinyurl.com/kcbkvfwv).

Cornfield et al. (1959) conducted the first formal sensitivity analysis to investigate the impact of unmeasured confounding on the causal relationship between smoking and lung cancer. However, this initial sensitivity analysis framework was applicable to only binary outcomes, and it did not account for the sampling variation. Rosenbaum and colleagues overcame these limitations and greatly expanded the theory and methods for sensitivity analysis in a series of pioneering works (Rosenbaum, 1987; Gastwirth et al., 1998; Rosenbaum et al., 2002). Recently, sensitivity analysis has gained a lot of research interest including Bonvini et al. (2022); Dorn et al. (2021); Carnegie et al. (2016); VanderWeele & Ding (2017); Zhao et al. (2019); Kallus et al. (2019); Yadlowsky et al. (2022), among others. Most of these proposed sensitivity analysis frameworks deal with only binary treatments.

Zhao et al. (2019) recently proposed a sensitivity analysis framework for smooth estimators of causal effects, such as the inverse probability weighting (IPW) estimators. Their proposed marginal sensitivity model is a natural modification of Rosenbaum’s sensitivity model (Rosenbaum, 2002) for matched observational studies, and it quantifies the magnitude of unmeasured confounding by the odds ratio between the conditional probability of being treated given the measured confounders (observed propensity score) and conditional probability of being treated given both the measured and unmeasured confounders (true propensity score). It is well known that risk ratios are more consistent with the general intuition than odds ratios, and hence, are easier to interpret. Therefore, it is desirable to develop sensitivity analysis frameworks that interpret the sensitivity analysis results using risk ratios instead of odds ratios.

In this paper, we first modified the odds-ratio-based sensitivity analysis framework of Zhao et al. (2019) to a risk-ratio-based framework. In particular, we proposed a modified marginal sensitivity model that measures the violation of the NUC assumption using the risk ratio between the true propensity score and the observed propensity score (see Section \arabicsection.\arabicsubsection for the exact definition). The proposed modified marginal sensitivity model introduces a new implicit sensitivity parameter that restricts the true propensity scores within its valid range for a given sensitivity model. We then extended the proposed modified marginal sensitivity model to the multivalued treatment setting based on the work of Basit et al. (2023). We showed that the estimation of causal effects for binary and multivalued treatments can be performed efficiently under the proposed sensitivity analysis framework.

The rest of the paper is structured as follows: Section \arabicsection describes necessary notations and briefly reviews the existing odds-ratio-based marginal sensitivity model; Section \arabicsection introduces the proposed modified marginal sensitivity model for the binary treatment setting; Section \arabicsection extends the proposed sensitivity model to multivalued treatment settings; Section \arabicsection reports results from simulation studies; Section \arabicsection demonstrates sensitivity analysis under the proposed framework using a real data application; Section \arabicsection provides some concluding remarks.

\arabicsection The Sensitivity Analysis Framework

\arabicsection.\arabicsubsection The Potential Outcome Framework

Consider an observational study with a binary treatment AA (1, if treated, and 0, if control), a vector of measured confounders 𝑿𝒳d\boldsymbol{X}\in\mathscr{X}\subset\mathbb{R}^{d}, and the observed binary outcome Y, where (A,𝑿,Y)F0(A,\boldsymbol{X},Y)\sim F_{0}. We observe (A1,𝑿1,Y1)(A_{1},\boldsymbol{X}_{1},Y_{1}), (A2,𝑿2,Y2),,(An,𝑿n,Yn)(A_{2},\boldsymbol{X}_{2},Y_{2}),\ldots,(A_{n},\boldsymbol{X}_{n},Y_{n}), which denote data points observed from nn i.i.d. units from the true data generating distribution F0F_{0}. Moreover, let Yi(0)Y_{i}(0) and Yi(1)Y_{i}(1) denote the potential outcomes corresponding to treatment levels 0 and 11, respectively, for each unit i[n]i\in[n]. Under the stable unit treatment value assumption (SUTVA) (Rubin, 1978), the observed outcomes can be defined in terms of potential outcomes as Yi=Yi(Ai)=AiYi(1)+(1Ai)Yi(0)Y_{i}=Y_{i}(A_{i})=A_{i}Y_{i}(1)+(1-A_{i})Y_{i}(0). In order to estimate causal effects, we make the following identifiability assumptions:

{assumption}

[strong ignorability or NUC] There is no unmeasured confounding (NUC), i.e., (Y(0),Y(1))A|𝑿.(Y(0),Y(1))\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}A\,|\,\boldsymbol{X}. In other words, the set of observed covariates, 𝑿\boldsymbol{X} includes all common causes of AA and YY.

{assumption}

[positivity or overlap] Each unit has a non-zero probability of receiving the treatment. That is, 0<0(A=1|X=x)<1,𝒙𝑿0<\mathbb{P}_{0}(A=1\,|\,X=x)<1,\thickspace\forall\thickspace\boldsymbol{x}\in\boldsymbol{X}.

In observational studies with a binary treatment, one of the most commonly used causal estimands of interest is the average treatment effect (ATE), Δ:=𝔼0[Y(1)]𝔼0[Y(0)]\Delta:=\mathbb{E}_{0}[Y(1)]-\mathbb{E}_{0}[Y(0)], where 𝔼0\mathbb{E}_{0} indicates that the expectation is taken over the true data generating distribution F0F_{0}.

If we define the observed propensity score as e0(𝑿)=0(A=1|𝑿)e_{0}(\boldsymbol{X})=\mathbb{P}_{0}(A=1\,|\,\boldsymbol{X}), then under Assumptions \arabicsection.\arabicsubsection and \arabicsection.\arabicsubsection, a consistent estimator of ATE Δ\Delta based on inverse probability weighting (IPW) is defined as

Δ^IPW=1ni=1n[AiYie^(𝑿i)(1Ai)Yi1e^(𝑿i)],\displaystyle\hat{\Delta}_{\mathrm{IPW}}=\frac{1}{n}\sum_{i=1}^{n}\left[\frac{A_{i}Y_{i}}{\hat{e}\left(\boldsymbol{X}_{i}\right)}-\frac{\left(1-A_{i}\right)Y_{i}}{1-\hat{e}\left(\boldsymbol{X}_{i}\right)}\right],

where e^(𝑿)\hat{e}(\boldsymbol{X}) is a sample estimate of e0(𝑿)e_{0}(\boldsymbol{X}). It is well known that the IPW estimates become unstable when the estimated propensity scores are close to 0 or 11 (Kang et al., 2007). Therefore, the stabilized IPW (SIPW) estimator of ATE, obtained by multiplying the inverse probability weights by the probability of receiving the actual treatment, is frequently used in practice.

\arabicsection.\arabicsubsection The Odds-Ratio-Based Marginal Sensitivity Model

Zhao et al. (2019) developed a sensitivity analysis framework that can be used for smooth estimators of causal effects, e.g., the inverse probability weighting (IPW) estimators of ATE in observational studies with a binary treatment. Let us consider an unmeasured confounder UU that sums up all unmeasured confounding present in an observational study. Robins (2002) suggested that the variable UU can be considered as any of the potential outcomes while defining the conditional probabilities of receiving treatments, i.e., propensity scores. So, let us denote the unobserved or true propensity score (conditional on both observed and unobserved confounders) by ea,0(𝒙,y)=0{A=a|𝑿=𝒙,Y(a)=y}e_{a,0}(\boldsymbol{x},y)=\mathbb{P}_{0}\{A=a\,|\,\boldsymbol{X}=\boldsymbol{x},Y(a)=y\} for a{0,1}a\in\{0,1\} and the observed propensity score (conditional on observed confounders only) by ea,0(𝒙)=0{A=a|𝑿=𝒙}e_{a,0}(\boldsymbol{x})=\mathbb{P}_{0}\{A=a\,|\,\boldsymbol{X}=\boldsymbol{x}\}. Then, the marginal sensitivity model assumes that

1ΛOR{ea(𝒙,y),ea,𝜷0(𝒙)}Λ,𝒙𝒳,y,a{0,1},\displaystyle\frac{1}{\Lambda}\leqslant\operatorname{OR}\big{\{}e_{a}(\boldsymbol{x},y),e_{a,\boldsymbol{\beta}_{0}}(\boldsymbol{x})\big{\}}\leqslant\Lambda,\thickspace\forall\thickspace\boldsymbol{x}\in\mathscr{X},y\in\mathbb{R},a\in\{0,1\}, (\arabicequation)

where OR(p1,p2)=[p1/(1p1)]/[p2/(1p2)]\operatorname{OR}(p_{1},p_{2})=[p_{1}/(1-p_{1})]/[p_{2}/(1-p_{2})] is the odds ratio, ea,𝜷0(𝒙)e_{a,\boldsymbol{\beta}_{0}}(\boldsymbol{x}) is a parametric for ea,0(𝒙)e_{a,0}(\boldsymbol{x}) for a{0,1}a\in\{0,1\} using observed confounders, and Λ1\Lambda\geqslant 1 is a fixed sensitivity parameter that quantifies the magnitude of unmeasured confounding by the odds ratio of the true propensity score ea(𝒙,y)e_{a}(\boldsymbol{x},y) and the observed propensity score eβa,0(𝒙)e_{\beta_{a,0}}(\boldsymbol{x}) for a{0,1}a\in\{0,1\}. The marginal sensitivity model can also be defined nonparametrically by replacing ea,𝜷0(𝒙)e_{a,\boldsymbol{\beta}_{0}}(\boldsymbol{x}) with a nonparametric model ea,0(𝒙)e_{a,0}(\boldsymbol{x}) in the sensitivity model defined in Expression (\arabicequation).

Under the marginal sensitivity models, Zhao et al. (2019) used a generalized mini-max inequality and percentile bootstrap to convert the estimation problem of causal effects under any given sensitivity model to a linear programming problem (LPP), which can be solved very efficiently.

\arabicsection The proposed risk-ratio-based Framework of Sensitivity Analysis

\arabicsection.\arabicsubsection The Modified Marginal Sensitivity Model

For observational studies with binary treatments, the marginal sensitivity model (\arabicequation) presented in Section \arabicsection measures the violation of the NUC assumption in terms of the odds ratio between the observed and unobserved propensity score e0(𝒙)e_{0}(\boldsymbol{x}) and e0(𝒙,y)e_{0}(\boldsymbol{x},y). Using the assumptions and notations defined in Section \arabicsection, a natural modification of the marginal sensitivity model (\arabicequation) that quantifies the magnitude of the violation of the NUC assumption by the risk ratio between ea,0(𝒙)e_{a,0}(\boldsymbol{x}) and ea,0(𝒙,y)e_{a,0}(\boldsymbol{x},y) could be defined as ea,0(𝒙,y)β0(Γ)e_{a,0}(\boldsymbol{x},y)\in\mathcal{E}_{\beta_{0}}(\Gamma), where

β0(Γ)={ea(𝒙,y):1ΓRR{ea,β0(𝒙),ea(𝒙,y)}Γ,𝒙𝒳,y,a{0,1}},\displaystyle\mathcal{E}_{\beta_{0}}(\Gamma)=\Big{\{}e_{a}(\boldsymbol{x},y):\frac{1}{\Gamma}\leq\operatorname{RR}\big{\{}e_{a,\beta_{0}}(\boldsymbol{x}),e_{a}(\boldsymbol{x},y)\big{\}}\leq\Gamma,\forall\boldsymbol{x}\in\mathscr{X},y\in\mathbb{R},a\in\{0,1\}\Big{\}}, (\arabicequation)

where Γ1\Gamma\geqslant 1 is the sensitivity parameter and RR(p1,p2)=p1/p2\mathrm{RR}\left(p_{1},p_{2}\right)=p_{1}/p_{2} is the risk ratio.

Next, let us define

ka,β0(𝒙)=log(ea,β0(𝒙)),ka,0(𝒙,y)=log(ea,0(𝒙,y)), and ka(𝒙,y)=log(ea(𝒙,y)),\displaystyle k_{a,\beta_{0}}(\boldsymbol{x})=\log\big{(}e_{a,\beta_{0}}(\boldsymbol{x})\big{)},\;\;\;k_{a,0}(\boldsymbol{x},y)=\log\big{(}e_{a,0}(\boldsymbol{x},y)\big{)},\;\;\;\text{ and }\;\;\;k_{a}(\boldsymbol{x},y)=\log\big{(}e_{a}(\boldsymbol{x},y)\big{)},

and let la,β0(𝒙,y)=ka,β0(𝒙)ka,0(𝒙,y)l_{a,\beta_{0}}(\boldsymbol{x},y)=k_{a,\beta_{0}}(\boldsymbol{x})-k_{a,0}(\boldsymbol{x},y) be the log-scale difference of the observed and the unobserved propensity scores. Similarly, for a postulated sensitivity model ea(𝒙,y)e_{a}(\boldsymbol{x},y), we define la(𝒙,y)=ka,β0(𝒙)ka(𝒙,y)l_{a}(\boldsymbol{x},y)=k_{a,\beta_{0}}(\boldsymbol{x})-k_{a}(\boldsymbol{x},y). However, under the sensitivity model (\arabicequation), the unobserved propensity score ea(𝒙,y)e_{a}(\boldsymbol{x},y) does not lie within (0,1)(0,1). Because, note that when la(𝒙,y)l_{a}(\boldsymbol{x},y)\rightarrow\infty, then log(ea(𝒙,y))\log\big{(}e_{a}(\boldsymbol{x},y)\big{)}\rightarrow-\infty, and hence, ea(𝒙,y)0e_{a}(\boldsymbol{x},y)\rightarrow 0. But, as la(𝒙,y)l_{a}(\boldsymbol{x},y)\rightarrow-\infty, then log(ea(𝒙,y))\log\big{(}e_{a}(\boldsymbol{x},y)\big{)}\rightarrow\infty, and hence, ea(𝒙,y)e_{a}(\boldsymbol{x},y)\rightarrow\infty. That is, under the sensitivity model (\arabicequation), the unobserved or true propensity score e(𝒙,y)(0,)e(\boldsymbol{x},y)\in(0,\infty).

To circumvent this problem, we introduce an additional sensitivity parameter in the sensitivity model (\arabicequation) that restricts ea(𝒙,y)e_{a}(\boldsymbol{x},y) to be in its valid range (0,1)(0,1) as la(x,y)l_{a}(x,y)\rightarrow-\infty.

Definition \arabicsection.\arabictheorem (modified marginal sensitivity model).

Fix a parameter Γ01\Gamma_{0}\geqslant 1. Moreover, define Γ1=max{ea,β0(𝐱),Γ01}\Gamma_{1}=\operatorname{max}\big{\{}e_{a,\beta_{0}}(\boldsymbol{x}),\Gamma_{0}^{-1}\big{\}}. In observational studies with binary treatments, the modified marginal sensitivity model assumes that ea,0(𝐱,y)β0(Γ0,Γ1)e_{a,0}(\boldsymbol{x},y)\in\mathcal{E}_{\beta_{0}}(\Gamma_{0},\Gamma_{1}), where

β0(Γ0,Γ1)={ea(𝒙,y)\displaystyle\mathcal{E}_{\beta_{0}}(\Gamma_{0},\Gamma_{1})=\Big{\{}e_{a}(\boldsymbol{x},y) :Γ1RR{ea,β0(𝒙),ea(𝒙,y)}Γ0,\displaystyle:\Gamma_{1}\leq\operatorname{RR}\{e_{a,\beta_{0}}(\boldsymbol{x}),e_{a}(\boldsymbol{x},y)\}\leq\Gamma_{0},
𝒙𝒳,y,a{0,1}},\displaystyle\;\;\;\;\;\;\;\;\forall\,\boldsymbol{x}\in\mathscr{X},y\in\mathbb{R},a\in\{0,1\}\Big{\}}, (\arabicequation)

and RR(p1,p2)=p1/p2\mathrm{RR}\left(p_{1},p_{2}\right)=p_{1}/p_{2} is the risk ratio.

Remark \arabicsection.\arabictheorem.

From the Definition \arabicsection.\arabictheorem, we can see that the sensitivity parameter Γ1\Gamma_{1} implicitly depends on the observed propensity score ea,β0(𝐱)e_{a,\beta_{0}}(\boldsymbol{x}) and the sensitivity parameter Γ0\Gamma_{0}. Therefore, we do not need to explicitly specify the value of Γ1\Gamma_{1} while conducting a sensitivity analysis. Consequently, the introduction of the additional sensitivity parameter Γ1\Gamma_{1} in the modified marginal sensitivity model does not necessarily increase the complexity of conducting the sensitivity analysis. The main purpose of the parameter Γ1\Gamma_{1} is to restrict the unobserved propensity score ea(𝐱,y)e_{a}(\boldsymbol{x},y) within its valid range (0,1)(0,1) under a specific sensitivity model.

Now, Under the modified marginal sensitivity model (\arabicequation), it is easy to observe that

γ1la(𝒙,y)γ0,-\gamma_{1}\leq l_{a}(\boldsymbol{x},y)\leq\gamma_{0},

where γ0=log(Γ0)\gamma_{0}=\log(\Gamma_{0}) and γ1=log(Γ1)=max{log(e0(𝒙)),γ0}\gamma_{1}=\log(\Gamma_{1})=\operatorname{max}\big{\{}\log(e_{0}(\boldsymbol{x})),-\gamma_{0}\big{\}}. Therefore, it can be shown that the sensitivity model (\arabicequation) is similar to assuming laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}\left(\gamma_{0},\gamma_{1}\right), where

β0(γ0,γ1)={la:𝒳× and γ1laγ0,a{0,1}}.\mathcal{L}_{\beta_{0}}\left(\gamma_{0},\gamma_{1}\right)=\left\{l_{a}:\mathscr{X}\times\mathbb{R}\rightarrow\mathbb{R}\text{ and }\gamma_{1}\leqslant l_{a}\leqslant\gamma_{0},a\in\{0,1\}\right\}.

For the remainder of the paper, we consider la(𝒙,y)l_{a}(\boldsymbol{x},y) as sensitivity models for fixed sensitivity parameters γ00\gamma_{0}\geqslant 0.

Remark \arabicsection.\arabictheorem.

We can also define the modified marginal sensitivity model (\arabicequation) nonparametrically by replacing ea,β0(𝐱)e_{a,\beta_{0}}(\boldsymbol{x}), and consequently, ka,β0(𝐱)k_{a,\beta_{0}}(\boldsymbol{x}) with corresponding nonparametric model ea(𝐱)e_{a}(\boldsymbol{x}) and ka(𝐱)k_{a}(\boldsymbol{x}), respectively. The statistical methods used in the proposed sensitivity analysis framework can be applied regardless of the choice of parametric or non-parametric model for ea(𝐱)e_{a}(\boldsymbol{x}), as long as the model is smooth enough for a valid bootstrap.

\arabicsection.\arabicsubsection Estimation of ATE under Modified Marginal Sensitivity models

In order to estimate the ATE under the modified marginal sensitivity model, under a specific sensitivity model laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}(\gamma_{0},\gamma_{1}) for a{0,1}a\in\{0,1\}, let us define the shifted propensity scores as

ea(la)(𝒙,y)\displaystyle e^{(l_{a})}_{a}(\boldsymbol{x},y) =[exp{la(𝒙,y)ka,β0(𝒙)}]1,\displaystyle=\bigg{[}\exp\Big{\{}l_{a}(\boldsymbol{x},y)-k_{a,\beta_{0}}(\boldsymbol{x})\Big{\}}\bigg{]}^{-1},

and the shifted estimand of ATE as

Δ(l0,l1)=\displaystyle\Delta^{(l_{0},l_{1})}= {𝔼0[Ae1(l1)(𝑿,Y)]1𝔼0[AYe1(l1)(𝑿,Y)]}\displaystyle\Bigg{\{}\mathbb{E}_{0}\Bigg{[}\frac{A}{e^{(l_{1})}_{1}(\boldsymbol{X},Y)}\Bigg{]}^{-1}\mathbb{E}_{0}\Bigg{[}\frac{AY}{e^{(l_{1})}_{1}(\boldsymbol{X},Y)}\Bigg{]}\Bigg{\}}
{𝔼0[1Ae0(l0)(𝑿,Y)]1𝔼0[(1A)Ye0(l0)(𝑿,Y)]}.\displaystyle-\Bigg{\{}\mathbb{E}_{0}\Bigg{[}\frac{1-A}{e^{(l_{0})}_{0}(\boldsymbol{X},Y)}\Bigg{]}^{-1}\mathbb{E}_{0}\Bigg{[}\frac{(1-A)Y}{e^{(l_{0})}_{0}(\boldsymbol{X},Y)}\Bigg{]}\Bigg{\}}. (\arabicequation)

Note that, for any laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}(\gamma_{0},\gamma_{1}), we have

ea(la)(𝒙,y)\displaystyle e^{(l_{a})}_{a}(\boldsymbol{x},y) =[exp(ka,β0(𝒙)ka(𝒙,y)ka,β0(𝒙))]1\displaystyle=\big{[}\exp\big{(}k_{a,\beta_{0}}(\boldsymbol{x})-k_{a}(\boldsymbol{x},y)-k_{a,\beta_{0}}(\boldsymbol{x})\big{)}\big{]}^{-1}
=[exp(log(ea(𝒙,y)))]1\displaystyle=\big{[}\exp\big{(}-\log(e_{a}(\boldsymbol{x},y))\big{)}\big{]}^{-1}
=[1ea(𝒙,y)]1=ea(𝒙,y).\displaystyle=\Bigg{[}\frac{1}{e_{a}(\boldsymbol{x},y)}\Bigg{]}^{-1}=e_{a}(\boldsymbol{x},y). (\arabicequation)

That is, under any given sensitivity model lal_{a}, our defined shifted propensity score is equivalent to the true propensity score ea(𝒙,y)e_{a}(\boldsymbol{x},y). We can estimate these shifted propensity scores with

e^a(la)(𝒙,y)\displaystyle\hat{e}^{(l_{a})}_{a}(\boldsymbol{x},y) =[exp{la(𝒙,y)k^a(𝒙)}]1,\displaystyle=\bigg{[}\exp\Big{\{}l_{a}(\boldsymbol{x},y)-\hat{k}_{a}(\boldsymbol{x})\Big{\}}\bigg{]}^{-1},

where k^a(𝒙)=k^a,𝜷^(𝒙)=log(e^a,β(𝒙))\hat{k}_{a}(\boldsymbol{x})=\hat{k}_{a,\hat{\boldsymbol{\beta}}}(\boldsymbol{x})=\log\big{(}\hat{e}_{a,\beta}(\boldsymbol{x})\big{)} and e^a,β(𝒙)\hat{e}_{a,\beta}(\boldsymbol{x}) is a parametric estimate of ea,β(𝒙)e_{a,\beta}(\boldsymbol{x}) for a{0,1}a\in\{0,1\}.

Consequently, we can define the stabilized IPW (SIPW) estimate of Δ(l0,l1)\Delta^{(l_{0},l_{1})} as

Δ^(l0,l1)=\displaystyle\hat{\Delta}^{(l_{0},l_{1})}= {[1ni=1nAie^1(l1)(𝑿i,Yi)]11ni=1nAiYie^1(l1)(𝑿i,Yi)}\displaystyle\Bigg{\{}\Bigg{[}\frac{1}{n}\sum_{i=1}^{n}\frac{A_{i}}{\hat{e}^{(l_{1})}_{1}\left(\boldsymbol{X}_{i},Y_{i}\right)}\Bigg{]}^{-1}\cdot\frac{1}{n}\sum_{i=1}^{n}\frac{A_{i}Y_{i}}{\hat{e}^{(l_{1})}_{1}\left(\boldsymbol{X}_{i},Y_{i}\right)}\Bigg{\}}
{[1ni=1n1Aie^0(l0)(𝑿i,Yi)]11ni=1n(1Ai)Yie^0(l0)(𝑿i,Yi)}\displaystyle-\Bigg{\{}\Bigg{[}\frac{1}{n}\sum_{i=1}^{n}\frac{1-A_{i}}{\hat{e}^{(l_{0})}_{0}\left(\boldsymbol{X}_{i},Y_{i}\right)}\Bigg{]}^{-1}\cdot\frac{1}{n}\sum_{i=1}^{n}\frac{(1-A_{i})Y_{i}}{\hat{e}^{(l_{0})}_{0}\left(\boldsymbol{X}_{i},Y_{i}\right)}\Bigg{\}}
=\displaystyle= i=1nAiYie^1(l1)(𝑿i,Yi)i=1nAie^1(l1)(𝑿i,Yi)i=1n(1Ai)Yie^0(l0)(𝑿i,Yi)i=1n(1Ai)e^0(l0)(𝑿i,Yi)\displaystyle\frac{\sum_{i=1}^{n}A_{i}Y_{i}\>\hat{e}^{(l_{1})}_{1}\left(\boldsymbol{X}_{i},Y_{i}\right)}{\sum_{i=1}^{n}A_{i}\>\hat{e}^{(l_{1})}_{1}\left(\boldsymbol{X}_{i},Y_{i}\right)}-\frac{\sum_{i=1}^{n}(1-A_{i})Y_{i}\>\hat{e}^{(l_{0})}_{0}\left(\boldsymbol{X}_{i},Y_{i}\right)}{\sum_{i=1}^{n}(1-A_{i})\>\hat{e}^{(l_{0})}_{0}\left(\boldsymbol{X}_{i},Y_{i}\right)}
=\displaystyle= i=1nAiYi[exp{l1(𝑿i,Yi)k^1(𝑿i)}]i=1nAi[exp{l1(𝑿i,Yi)k^1(𝑿i)}]\displaystyle\frac{\sum_{i=1}^{n}A_{i}Y_{i}\big{[}\exp\{l_{1}(\boldsymbol{X}_{i},Y_{i})-\hat{k}_{1}(\boldsymbol{X}_{i})\}\big{]}}{\sum_{i=1}^{n}A_{i}\big{[}\exp\{l_{1}(\boldsymbol{X}_{i},Y_{i})-\hat{k}_{1}(\boldsymbol{X}_{i})\}\big{]}}
i=in(1Ai)Yi[exp{l0(𝑿i,Yi)k^0(𝑿i)}]i=1n(1Ai)[exp{l0(𝑿i,Yi)k^0(𝑿i)}].\displaystyle-\frac{\sum_{i=i}^{n}(1-A_{i})Y_{i}\big{[}\exp\{l_{0}(\boldsymbol{X}_{i},Y_{i})-\hat{k}_{0}(\boldsymbol{X}_{i})\}\big{]}}{\sum_{i=1}^{n}(1-A_{i})\big{[}\exp\{l_{0}(\boldsymbol{X}_{i},Y_{i})-\hat{k}_{0}(\boldsymbol{X}_{i})\}\big{]}}. (\arabicequation)

Zhao et al. (2019) showed that estimation problems such as the one given in Equation (\arabicequation) can be transformed to a linear fractional programming (LFP) problem using Charnes-Cooper transformation (Charnes & Cooper, 1962). Before defining the LFP for Equation (\arabicequation), let us simplify the notations by assuming that the first mnm\leqslant n units are in the treatment group (A=1)(A=1) and the rest of the observations are in the control group (A=0)(A=0), and that the outcomes are sorted in decreasing order among the first mm units and the other nmn-m units. Finally, we can convert Equation (\arabicequation) to the following LFP

min or max i=1nYi[ziexp{k^1(𝑿i)}]i=1m[ziexp{k^1(𝑿i)}]i=m+1nYi[ziexp{k^0(𝑿i)}]i=m+1n[ziexp{k^0(𝑿i)}]\displaystyle\;\;\;\;\frac{\sum_{i=1}^{n}Y_{i}\big{[}z_{i}\exp\{-\hat{k}_{1}(\boldsymbol{X}_{i})\}\big{]}}{\sum_{i=1}^{m}\big{[}z_{i}\exp\{-\hat{k}_{1}(\boldsymbol{X}_{i})\}\big{]}}-\frac{\sum_{i=m+1}^{n}Y_{i}\big{[}z_{i}\exp\{-\hat{k}_{0}(\boldsymbol{X}_{i})\}\big{]}}{\sum_{i=m+1}^{n}\big{[}z_{i}\exp\{-\hat{k}_{0}(\boldsymbol{X}_{i})\}\big{]}} (\arabicequation)
subject to Γ1iziΓ0, for 1in,\displaystyle\;\;\;\;\Gamma_{1i}\leqslant z_{i}\leqslant\Gamma_{0},\;\;\;\;\;\;\text{ for }1\leqslant i\leqslant n,

where zi=exp{l1(𝑿i,Yi)}z_{i}=\exp\big{\{}l_{1}\left(\boldsymbol{X}_{i},Y_{i}\right)\big{\}}, for 1im,1\leqslant i\leqslant m, and zi=exp{l0(𝑿i,Yi)},z_{i}=\exp\big{\{}l_{0}\left(\boldsymbol{X}_{i},Y_{i}\right)\big{\}}, for m+1inm+1\leqslant i\leqslant n. The LFP defined in Equation (\arabicequation) can further be transformed to a linear programming problem (LPP), which can be solved efficiently. The solution of the LPP (\arabicequation) yields a partially identified point estimate interval of ATE under a specific sensitivity model laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}\left(\gamma_{0},\gamma_{1}\right) and a{0,1}a\in\{0,1\}. We can also obtain a 100(1α)%100(1-\alpha)\% asymptotic confidence interval for ATE under a postulated sensitivity model lal_{a} using a percentile bootstrap approach (Zhao et al., 2019).

\arabicsection Extension to Observational Studies with Multivalued Treatments

In this section, we extend our proposed sensitivity analysis framework to the multivalued treatment setting. Suppose we observe i.i.d. (Ai,𝑿i,Yi)i=1n(A_{i},\boldsymbol{X}_{i},Y_{i})_{i=1}^{n} from an observational study with J>2J>2 treatment levels, where Ai𝒜={1,2,,J}A_{i}\in\mathcal{A}=\{1,2,\ldots,J\} with the corresponding set of potential outcomes 𝒴i={Yi(1),Yi(2),,Yi(J)}\mathcal{Y}_{i}=\{Y_{i}(1),Y_{i}(2),\ldots,Y_{i}(J)\}, and 𝑿i𝒳d\boldsymbol{X}_{i}\in\mathscr{X}\subset\mathbb{R}^{d} is a vector of observed confounders for each subject i[n]i\in[n]. Let us also define a treatment indicator Di(a)D_{i}(a) (11 if Ai=aA_{i}=a, 0 otherwise) for a𝒜a\in\mathcal{A}.

The identifiability assumptions for observational studies with multivalued treatments are almost equivalent to those in the binary treatment setting. We assume the overlap assumption that implies 0(A=a|𝑿=𝒙)>0\mathbb{P}_{0}(A=a|\boldsymbol{X}=\boldsymbol{x})>0 for all a𝒜a\in\mathcal{A}. However, instead of the strong ignorability assumption, we assume weak ignorability (Imbens, 2000).

{assumption}

[weak ignorability] D(a)Y(a)|𝑿,a𝒜.D(a)\mathchoice{\mathrel{\hbox to0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to0.0pt{$\textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to0.0pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Y(a)\thinspace|\thinspace\boldsymbol{X},\thickspace\forall\thinspace a\in\mathcal{A}.

Imbens (2000) extended the concept of propensity scores from binary treatments to multivalued treatments introducing the generalized propensity score (GPS), which is defined as ra,0(𝒙)=0(A=a|𝑿=𝒙)r_{a,0}(\boldsymbol{x})=\mathbb{P}_{0}(A=a|\boldsymbol{X}=\boldsymbol{x}) for a𝒜a\in\mathcal{A}. As in Section \arabicsection, let us further denote the unobserved or true propensity score as ra,0(𝒙,y)=0(A=a|𝑿=𝒙,Y(a)=y)r_{a,0}(\boldsymbol{x},y)=\mathbb{P}_{0}(A=a|\boldsymbol{X}=\boldsymbol{x},Y(a)=y).

Based on these assumptions and notations, Basit et al. (2023) recently proposed a sensitivity analysis framework for the multivalued treatment setting extending the framework of Zhao et al. (2019) for binary treatments. Using similar ideas, we propose the following risk-ratio-based modified marginal sensitive model for observational studies with multivalued treatments.

Definition \arabicsection.\arabictheorem (modified marginal sensitivity model for multivalued treatments).

For fixed Γ01\Gamma_{0}\geqslant 1 and Γ1=max{ra,β0(𝐱),Γ01}\Gamma_{1}=\operatorname{max}\big{\{}r_{a,\beta_{0}}(\boldsymbol{x}),\Gamma_{0}^{-1}\big{\}}, the modified marginal sensitivity model for multivalued treatments assumes that

Γ1RR{ra,β0(𝒙),ra(𝒙,y)}Γ0,𝒙𝒳,a𝒜,y,\displaystyle\Gamma_{1}\leqslant\mathrm{RR}\big{\{}r_{a,\beta_{0}}(\boldsymbol{x}),r_{a}(\boldsymbol{x},y)\big{\}}\leqslant\Gamma_{0},\thickspace\forall\thickspace\boldsymbol{x}\in\mathscr{X},a\in\mathcal{A},y\in\mathbb{R}, (\arabicequation)

where Γ0\Gamma_{0} and Γ1\Gamma_{1} are sensitivity parameters and RR(p1,p2)=p1/p2\mathrm{RR}\left(p_{1},p_{2}\right)=p_{1}/p_{2} is the risk ratio.

Next, as in section \arabicsection.\arabicsubsection, let us define

ka,β0(𝒙)=log(ra,β0(𝒙)) and ka(𝒙,y)=log(ra(𝒙,y)),\displaystyle k_{a,\beta_{0}}(\boldsymbol{x})=\log\big{(}r_{a,\beta_{0}}(\boldsymbol{x})\big{)}\;\;\;\text{ and }\;\;\;k_{a}(\boldsymbol{x},y)=\log\big{(}r_{a}(\boldsymbol{x},y)\big{)},

and let la(𝒙,y)=ka,β0(𝒙)la(𝒙,y)l_{a}(\boldsymbol{x},y)=k_{a,\beta_{0}}(\boldsymbol{x})-l_{a}(\boldsymbol{x},y) for all a𝒜a\in\mathcal{A} be the log-scale difference between the observed and the unobserved GPSs under any specified sensitivity model ra(𝒙,y)r_{a}(\boldsymbol{x},y). Then, it can be shown that the sensitivity model (\arabicequation) is equivalent to assuming laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}(\gamma_{0},\gamma_{1}), where

β0(γ0,γ1)\displaystyle\mathcal{L}_{\beta_{0}}(\gamma_{0},\gamma_{1}) ={la:𝒳× and γ1laγ0,a𝒜},\displaystyle=\big{\{}l_{a}:\mathscr{X}\times\mathbb{R}\rightarrow\mathbb{R}\text{ and }\gamma_{1}\leqslant l_{a}\leqslant\gamma_{0},\;a\in\mathcal{A}\big{\}},

γ0=log(Γ0)\gamma_{0}=\log(\Gamma_{0}), and γ1=log(Γ1)=max(log(ra,β0(𝒙)),γ0)\gamma_{1}=\log(\Gamma_{1})=\operatorname{max}\big{(}\log(r_{a,\beta_{0}}(\boldsymbol{x})),-\gamma_{0}\big{)} are the sensitivity parameters.

In order to define estimands of causal effects in the multivalued treatment setting, we use a general class of additive causal estimands proposed by Basit et al. (2023). This class of estimands is based on inverse probability weighting and is defined as

τ(𝒄)=a=1Jcam(a)=a=1Jca𝔼0[YD(a)ra(𝑿)],\tau(\boldsymbol{c})=\sum_{a=1}^{J}c_{a}m(a)=\sum_{a=1}^{J}c_{a}\mathbb{E}_{0}\Bigg{[}\frac{Y\thinspace D(a)}{r_{a}(\boldsymbol{X})}\Bigg{]}, (\arabicequation)

where 𝒄=(c1,,cJ)\boldsymbol{c}=(c_{1},\ldots,c_{J})^{\prime} is a vector of contrasts, m(a)=𝔼0[Y(a)]m(a)=\mathbb{E}_{0}[Y(a)] is the average potential outcome for a𝒜a\in\mathcal{A}. These estimands encompass many commonly used causal estimands of interest for multivalued treatments, such as the pairwise average treatment effects (PATEs).

Based on the identifiability assumptions defined for multivalued treatments, we define the shifted estimand of the average potential outcome m(a)m(a) under a specified sensitivity model lal_{a} as

m(ha)(a)=𝔼0[YD(a)ra(𝑿)]=𝔼0[YD(a)ra(𝑿,Y)]=𝔼0[YD(a)r(ha)(𝑿,Y)],\displaystyle m^{(h_{a})}(a)=\mathbb{E}_{0}\Bigg{[}\frac{YD(a)}{r_{a}(\boldsymbol{X})}\Bigg{]}=\mathbb{E}_{0}\Bigg{[}\frac{YD(a)}{r_{a}(\boldsymbol{X},Y)}\Bigg{]}=\mathbb{E}_{0}\Bigg{[}\frac{YD(a)}{r^{(h_{a})}(\boldsymbol{X},Y)}\Bigg{]}, (\arabicequation)

where r(la)(𝒙,y)=[exp(la(𝒙,y)ka,β0(𝒙))]1r^{(l_{a})}(\boldsymbol{x},y)=\big{[}\exp\big{(}l_{a}(\boldsymbol{x},y)-k_{a,\beta_{0}}(\boldsymbol{x})\big{)}\big{]}^{-1} is the shifted GPS. Consequently, under the modified marginal sensitivity model (\arabicequation), the shifted causal estimand becomes

τ(la)(𝒄)=a=1Jcam(la)(a).\displaystyle\tau^{(l_{a})}(\boldsymbol{c})=\sum_{a=1}^{J}c_{a}m^{(l_{a})}(a). (\arabicequation)

Similar to the estimation of the shifted ATE defined in Equation (\arabicequation) for binary treatments, we can show that the estimation of the causal estimand can be converted to a linear programming problem (LPP) that allows us to efficiently estimate the partially identified point estimate intervals of τ(𝒄)\tau(\boldsymbol{c}) for any laβ0(γ0,γ1)l_{a}\in\mathcal{L}_{\beta_{0}}(\gamma_{0},\gamma_{1}). The percentile bootstrap approach of Zhao et al. (2019) is also applicable for the computation of 100(1α)%100(1-\alpha)\% asymptotic confidence intervals for τ(𝒄)\tau(\boldsymbol{c}) (Basit et al., 2023).

\arabicsection Simulation Study

We conducted simulated studies to investigate the performance of the proposed sensitivity analysis framework in the multivalued treatment setting. Our data generating mechanism and simulation settings are equivalent to Basit et al. (2023). We simulate three covariates as Xi1Bernoulli(0.5)X_{i1}\sim\operatorname{Bernoulli}(0.5), Xi2U(1,1)X_{i2}\sim\operatorname{U}(-1,1), and Xi3N(0,0.5)X_{i3}\sim\operatorname{N}(0,0.5). For each i[n]i\in[n], the covariate vector then becomes 𝑿i=(1,Xi1,Xi2,Xi3)T\boldsymbol{X}_{i}=(1,X_{i1},X_{i2},X_{i3})^{T}. We simulated the treatment assignment mechanism using the following multinomial distribution

(Di(1),Di(2),Di(3))|𝑿iMultinom(r1(𝑿i),r2(𝑿i),r3(𝑿i)),\big{(}D_{i}(1),D_{i}(2),D_{i}(3)\big{)}\;\big{|}\;\boldsymbol{X}_{i}\sim\operatorname{Multinom}\big{(}r_{1}(\boldsymbol{X}_{i}),r_{2}(\boldsymbol{X}_{i}),r_{3}(\boldsymbol{X}_{i})\big{)},

where Di(a)D_{i}(a) is the treatment indicator and

ra(𝑿i,Yi)=ra(𝑿i)=exp(𝑿iTβa)a=13exp(𝑿iTβa)r_{a}(\boldsymbol{X}_{i},Y_{i})=r_{a}(\boldsymbol{X}_{i})=\frac{\exp\big{(}\boldsymbol{X}_{i}^{T}\beta_{a}\big{)}}{\sum_{a^{\prime}=1}^{3}\exp\big{(}\boldsymbol{X}_{i}^{T}\beta_{a^{\prime}}\big{)}}

denotes the complete or unobserved GPSs for a{1,2,3}a\in\{1,2,3\} with β1=(0,0,0,0)T\beta_{1}=(0,0,0,0)^{T}, β2=k2×(0,1,1,1)T\beta_{2}=k_{2}\times(0,1,1,1)^{T}, and β3=k3×(0,1,1,1)T\beta_{3}=k_{3}\times(0,1,1,-1)^{T}. In order to assess the influence of the degree of overlap on the point estimates and confidence intervals of our proposed frameworks, we considered two simulation scenarios. In the first scenario, we set (k2,k3)=(0.1,0.1)(k_{2},k_{3})=(0.1,-0.1) to simulate a scenario with adequate overlap in the covariates, and in the second scenario, we set (k2,k3)=(3,3)(k_{2},k_{3})=(3,3) to induce lack of overlap. The potential outcomes are generated from the following multinomial distribution

(Yi(1),Yi(2),Yi(3))|𝑿iMultinom(pY1(𝑿i),pY2(𝑿i),pY3(𝑿i)),\big{(}Y_{i}(1),Y_{i}(2),Y_{i}(3)\big{)}\;\big{|}\;\boldsymbol{X}_{i}\sim\operatorname{Multinom}\big{(}p_{Y_{1}}(\boldsymbol{X}_{i}),p_{Y_{2}}(\boldsymbol{X}_{i}),p_{Y_{3}}(\boldsymbol{X}_{i})\big{)},

where Yi(a)Y_{i}(a) is the potential outcome for treatment level aa and

pYa(𝑿i)=(Y(a)=1|𝑿i)=exp(𝑿iTδa)a=13exp(𝑿iTδa)p_{Y_{a}}(\boldsymbol{X}_{i})=\mathbb{P}(Y(a)=1\big{|}\boldsymbol{X}_{i})=\frac{\exp\big{(}\boldsymbol{X}_{i}^{T}\delta_{a}\big{)}}{\sum_{a^{\prime}=1}^{3}\exp\big{(}\boldsymbol{X}_{i}^{T}\delta_{a^{\prime}}\big{)}}

with δ1=(1,1,1,1)T\delta_{1}=(1,1,1,1)^{T}, δ2=(1,1,1,1)T\delta_{2}=(1,1,-1,1)^{T}, and δ3=(1,1,1,1)T\delta_{3}=(1,1,1,-1)^{T}. The observed outcome was then obtained as Yi=a=1JDi(a)Yi(a)Y_{i}=\sum_{a=1}^{J}D_{i}(a)Y_{i}(a) for each subject i[n]i\in[n]. We simulated 10001000 datasets with sample size n=750n=750 for each scenario and estimate the interval of point estimates and confidence intervals for the pairwise ATEs using our proposed sensitivity analysis framework. The pairwise ATEs are denoted by τi,j\tau_{i,j}, for i,j{1,2,3}{i,j}\in\{1,2,3\}. We considered six values of the sensitivity parameters γ0\gamma_{0}, namely, γ0={0,0.1,0.2,0.5,1,2}\gamma_{0}=\{0,0.1,0.2,0.5,1,2\}. The true partially identified intervals were obtained under each simulation scenario using large scale numerical approximations. Since the treatment under consideration is multivalued with three treatment levels, the observed generalized propensity scores (GPSs) are modeled using the multinomial logit regression model.

Table \arabictable: Simulation results for the sensitivity analysis of pairwise ATEs under the RR framework in an observational study with three treatment levels
% Bias
Scenario ATE γ0\gamma_{0} Γ0\Gamma_{0} Lower Upper Non- coverage Partially identified interval Point estimate interval Confidence interval
0.0 1.00 2.792.79 2.792.79 0.103 (0.050,0.050)(-0.050,-0.050) (0.048,0.048)(-0.048,-0.048) (0.115,0.017)(-0.115,0.017)
0.1 1.11 1.761.76 2.882.88 0.099 (0.137,0.038)(-0.137,0.038) (0.136,0.040)(-0.136,0.040) (0.201,0.106)(-0.201,0.106)
0.2 1.22 2.642.64 1.721.72 0.108 (0.223,0.126)(-0.223,0.126) (0.223,0.127)(-0.223,0.127) (0.285,0.193)(-0.285,0.193)
0.5 1.65 4.794.79 0.760.76 0.116 (0.460,0.375)(-0.460,0.375) (0.459,0.376)(-0.459,0.376) (0.512,0.436)(-0.512,0.436)
1.0 2.72 11.1411.14 6.18-6.18 0.123 (0.744,0.694)(-0.744,0.694) (0.741,0.693)(-0.741,0.693) (0.771,0.729)(-0.771,0.729)
τ1,2\tau_{1,2} 2.0 7.39 5.035.03 7.64-7.64 0.095 (0.900,0.882)(-0.900,0.882) (0.900,0.882)(-0.900,0.882) (0.915,0.900)(-0.915,0.900)
0.0 1.00 1.491.49 1.491.49 0.108 (0.034,0.034)(-0.034,-0.034) (0.035,0.035)(-0.035,-0.035) (0.102,0.032)(-0.102,0.032)
0.1 1.11 2.562.56 1.421.42 0.107 (0.121,0.053)(-0.121,0.053) (0.121,0.053)(-0.121,0.053) (0.186,0.119)(-0.186,0.119)
0.2 1.22 4.834.83 2.182.18 0.109 (0.207,0.139)(-0.207,0.139) (0.207,0.140)(-0.207,0.140) (0.271,0.205)(-0.271,0.205)
0.5 1.65 6.456.45 1.08-1.08 0.113 (0.444,0.385)(-0.444,0.385) (0.443,0.385)(-0.443,0.385) (0.499,0.444)(-0.499,0.444)
1.0 2.72 9.699.69 5.88-5.88 0.117 (0.735,0.699)(-0.735,0.699) (0.733,0.697)(-0.733,0.697) (0.766,0.734)(-0.766,0.734)
τ1,3\tau_{1,3} 2.0 7.39 10.1910.19 1.84-1.84 0.102 (0.903,0.885)(-0.903,0.885) (0.903,0.885)(-0.903,0.885) (0.918,0.904)(-0.918,0.904)
0.0 1.00 1.221.22 1.221.22 0.097 (0.015,0.015)(0.015,0.015) (0.016,0.016)(0.016,0.016) (0.052,0.084)(-0.052,0.084)
0.1 1.11 0.780.78 1.32-1.32 0.098 (0.075,0.106)(-0.075,0.106) (0.074,0.106)(-0.074,0.106) (0.142,0.173)(-0.142,0.173)
0.2 1.22 2.952.95 0.14-0.14 0.105 (0.165,0.194)(-0.165,0.194) (0.164,0.195)(-0.164,0.195) (0.230,0.260)(-0.230,0.260)
0.5 1.65 2.882.88 2.97-2.97 0.111 (0.414,0.440)(-0.414,0.440) (0.413,0.440)(-0.413,0.440) (0.471,0.495)(-0.471,0.495)
1.0 2.72 8.498.49 5.93-5.93 0.121 (0.722,0.735)(-0.722,0.735) (0.719,0.733)(-0.719,0.733) (0.753,0.763)(-0.753,0.763)
I τ2,3\tau_{2,3} 2.0 7.39 4.674.67 1.94-1.94 0.085 (0.897,0.898)(-0.897,0.898) (0.897,0.899)(-0.897,0.899) (0.913,0.914)(-0.913,0.914)
0.0 1.00 13.40-13.40 13.40-13.40 0.188 (0.047,0.047)(-0.047,-0.047) (0.084,0.084)(-0.084,-0.084) (0.261,0.097)(-0.261,0.097)
0.1 1.11 10.14-10.14 16.80-16.80 0.197 (0.133,0.040)(-0.133,0.040) (0.166,0.000)(-0.166,0.000) (0.332,0.184)(-0.332,0.184)
0.2 1.22 7.29-7.29 20.24-20.24 0.204 (0.214,0.124)(-0.214,0.124) (0.242,0.080)(-0.242,0.080) (0.397,0.263)(-0.397,0.263)
0.5 1.65 0.970.97 29.69-29.69 0.239 (0.427,0.352)(-0.427,0.352) (0.442,0.300)(-0.442,0.300) (0.561,0.475)(-0.561,0.475)
1.0 2.72 9.999.99 42.90-42.90 0.287 (0.673,0.638)(-0.673,0.638) (0.676,0.591)(-0.676,0.591) (0.747,0.713)(-0.747,0.713)
τ1,2\tau_{1,2} 2.0 7.39 16.5216.52 54.85-54.85 0.393 (0.891,0.893)(-0.891,0.893) (0.888,0.872)(-0.888,0.872) (0.912,0.911)(-0.912,0.911)
0.0 1.00 14.93-14.93 14.93-14.93 0.205 (0.031,0.031)(-0.031,-0.031) (0.072,0.072)(-0.072,-0.072) (0.244,0.102)(-0.244,0.102)
0.1 1.11 12.24-12.24 17.45-17.45 0.207 (0.117,0.052)(-0.117,0.052) (0.155,0.007)(-0.155,0.007) (0.314,0.186)(-0.314,0.186)
0.2 1.22 8.85-8.85 20.76-20.76 0.219 (0.201,0.132)(-0.201,0.132) (0.234,0.083)(-0.234,0.083) (0.379,0.265)(-0.379,0.265)
0.5 1.65 2.15-2.15 29.86-29.86 0.240 (0.422,0.349)(-0.422,0.349) (0.443,0.295)(-0.443,0.295) (0.556,0.463)(-0.556,0.463)
1.0 2.72 7.207.20 42.13-42.13 0.269 (0.683,0.626)(-0.683,0.626) (0.689,0.578)(-0.689,0.578) (0.755,0.700)(-0.755,0.700)
τ1,3\tau_{1,3} 2.0 7.39 14.2514.25 54.48-54.48 0.371 (0.903,0.885)(-0.903,0.885) (0.902,0.863)(-0.902,0.863) (0.921,0.902)(-0.921,0.902)
0.0 1.00 1.32-1.32 1.32-1.32 0.128 (0.015,0.015)(0.015,0.015) (0.014,0.014)(0.014,0.014) (0.108,0.134)(-0.108,0.134)
0.1 1.11 1.31-1.31 2.93-2.93 0.122 (0.072,0.100)(-0.072,0.100) (0.074,0.098)(-0.074,0.098) (0.195,0.218)(-0.195,0.218)
0.2 1.22 0.490.49 3.65-3.65 0.131 (0.156,0.177)(-0.156,0.177) (0.158,0.176)(-0.158,0.176) (0.274,0.290)(-0.274,0.290)
0.5 1.65 4.484.48 8.21-8.21 0.136 (0.378,0.379)(-0.378,0.379) (0.380,0.375)(-0.380,0.375) (0.476,0.472)(-0.476,0.472)
1.0 2.72 10.4710.47 14.50-14.50 0.171 (0.643,0.622)(-0.643,0.622) (0.642,0.616)(-0.642,0.616) (0.702,0.679)(-0.702,0.679)
II τ2,3\tau_{2,3} 2.0 7.39 18.6318.63 20.99-20.99 0.225 (0.882,0.862)(-0.882,0.862) (0.879,0.857)(-0.879,0.857) (0.899,0.881)(-0.899,0.881)
  • The first four columns represent the simulation settings: Scenario and ATE are the simulation scenarios and pairwise ATEs of interest, respectively; λ\lambda is the sensitivity parameter and Λ=exp(λ)\Lambda=exp(\lambda). The next five columns are the results: respectively percentage average bias (in SD units) in the lower and upper bound of the point estimate interval; the non-coverage rate of the confidence intervals under the proposed framework (the desired non-coverage rate is 10%10\%); the partially identified point estimate intervals of the pairwise ATEs; the median interval for the point estimates; the median confidence interval.

We simulate 10001000 datasets for each scenario and estimate the interval of point estimates and construct 90%90\% confidence intervals for the pairwise ATEs under the proposed sensitivity analysis for multivalued treatment settings. We report the percentage average bias (in SD units) in the lower and upper bounds of the point estimate interval, the non-coverage rate of the confidence interval, the median interval of point estimates, and the median confidence intervals calculated from the 10001000 simulated datasets.

The simulation results under the proposed framework of sensitivity analysis are presented in Table \arabictable. When there is adequate overlap (Scenario-I), we observe that the average bias in the SIPW point estimators of the pairwise ATEs lies within 10%10\% of its standard deviation in almost all cases under under the proposed framework. The percentile bootstrap confidence intervals also satisfy the nominal 90%90\% coverage rate in the presence of adequate overlap. However, we observe that the performance of the SIPW point estimators and the percentile bootstrap confidence intervals deteriorates when there is a lack of overlap (Scenario-II). These findings are similar to the ones observed by Zhao et al. (2019) and Basit et al. (2023), and therefore, we recommend to interpret the sensitivity analysis results under the proposed framework with caution when there is a lack of overlap in the covariate distribution among different treatment groups.

\arabicsection Real Data Application

In this section, we applied our proposed sensitivity analysis framework for multivalued treatments to estimate the causal effect of maternal education on the fertility rate in Bangladesh. The dataset used in this analysis was obtained from the latest round of the Bangladesh Multiple Indicator Cluster Survey (MICS) 2019 (BBS & UNICEF, 2019). We considered the number of children ever born to women in the reproductive age (154915-49 years) as a measure of fertility, which is the outcome of interest. The treatment variable was the maternal education level that has four levels- ”pre-primary or none”, ”primary”, ”secondary”, and ”higher secondary or beyond”. The following variables were considered as the observed confounders, which were assumed to be associated with both treatment and outcome: place of residence (”urban” or ”rural”), district, religion (”muslim” or ”non-muslim”), household’s wealth index, women’s age at marriage, and education level of the household head (Tomal et al., 2022). After discarding missing values from the outcome, treatment, and the observed confounders, we obtained complete data on 53,48153,481 women aged 154915-49 years to conduct the analysis.

Using the proposed sensitivity analysis framework, we conduct the sensitivity analysis for the three pairwise ATEs between ”pre-primary or none”, ”primary”, and ”secondary” vs. ”higher secondary or beyond” level of maternal education, denoted by τ1,4\tau_{1,4}, τ2,4\tau_{2,4}, and τ3,4\tau_{3,4}, respectively. We considered Γ0=exp(γ0)={1,1.25,1.5,1.75,2.00,2.25,2.50,2.75}\Gamma_{0}=\exp(\gamma_{0})=\{1,1.25,1.5,1.75,2.00,2.25,2.50,2.75\} to assess the sensitivity of the causal effect estimates to the presence of varying magnitudes of unmeasured confounding. As discussed in Remark \arabicsection.\arabictheorem, we did not need to specify the values of the sensitivity parameter Γ1\Gamma_{1} to conduct the sensitivity analysis. Since the treatment variable is ordinal, we fitted an ordinal logistic regression model, namely the continuation ratio regression model, to estimate the generalized propensity scores (GPSs) using the chosen observed confounders. The estimated GPSs ranged from 0.010.01 to 0.980.98 with a mean of 0.250.25 across the four treatment groups. In light of our findings in the simulation studies in Section \arabicsection, we recommend to interpret the results obtained from the SIPW estimators with caution as some of the estimated GPSs are close to zero.

Table \arabictable: SIPW point estimate intervals and 90% percentile bootstrap confidence intervals for the pairwise ATEs under the RR framework for different values of the sensitivity parameter Γ0\Gamma_{0}. The pariwise ATEs of pre-primary or no education, primary education, and secondary education vs. higher seondary or beyond education are denoted by τ1,4\tau_{1,4}, τ2,4\tau_{2,4}, and τ3,4\tau_{3,4}, respectively
Estimand Γ0\Gamma_{0} γ0\gamma_{0} Point estimate interval 90% confidence Interval
1 0.00 (1.97,1.97)(1.97,1.97) (1.92,2.02)(1.92,2.02)
1.25 0.22 (1.59,2.46)(1.59,2.46) (1.52,2.53)(1.52,2.53)
1.5 0.41 (1.22,2.83)(1.22,2.83) (1.14,2.90)(1.14,2.90)
1.75 0.56 (0.91,3.14)(0.91,3.14) (0.83,3.21)(0.83,3.21)
2 0.69 (0.66,3.39)(0.66,3.39) (0.59,3.46)(0.59,3.46)
2.25 0.81 (0.46,3.61)(0.46,3.61) (0.38,3.68)(0.38,3.68)
2.5 0.92 (0.29,3.82)(0.29,3.82) (0.22,3.90)(0.22,3.90)
τ1,4\tau_{1,4} 2.75 1.01 (0.13,4.01)(0.13,4.01) (0.03,4.09)(0.03,4.09)
1 0.00 (1.46,1.46)(1.46,1.46) (1.43,1.48)(1.43,1.48)
1.25 0.22 (1.15,1.95)(1.15,1.95) (1.10,1.99)(1.10,1.99)
1.5 0.41 (0.82,2.27)(0.82,2.27) (0.77,2.32)(0.77,2.32)
1.75 0.56 (0.56,2.57)(0.56,2.57) (0.50,2.63)(0.50,2.63)
2 0.69 (0.30,2.83)(0.30,2.83) (0.24,2.88)(0.24,2.88)
2.25 0.81 (0.07,3.06)(0.07,3.06) (0.02,3.11)(0.02,3.11)
2.5 0.92 (0.12,3.24)(-0.12,3.24) (0.17,3.28)(-0.17,3.28)
τ2,4\tau_{2,4} 2.75 1.01 (0.28,3.39)(-0.28,3.39) (0.33,3.44)(-0.33,3.44)
1 0.00 (0.75,0.75)(0.75,0.75) (0.72,0.80)(0.72,0.80)
1.25 0.22 (0.45,1.17)(0.45,1.17) (0.39,1.23)(0.39,1.23)
1.5 0.41 (0.13,1.49)(0.13,1.49) (0.08,1.54)(0.08,1.54)
1.75 0.56 (0.13,1.75)(-0.13,1.75) (0.18,1.80)(-0.18,1.80)
2 0.69 (0.33,1.97)(-0.33,1.97) (0.38,2.01)(-0.38,2.01)
2.25 0.81 (0.49,2.13)(-0.49,2.13) (0.54,2.17)(-0.54,2.17)
2.5 0.92 (0.62,2.27)(-0.62,2.27) (0.67,2.31)(-0.67,2.31)
τ3,4\tau_{3,4} 2.75 1.01 (0.73,2.38)(-0.73,2.38) (0.78,2.41)(-0.78,2.41)
Refer to caption
Figure \arabicfigure: Graphical representation of the sensitivity analysis results for the three pairwise ATEs under the RR framework. The solid error bars are the range of point estimates and the dashed error bars (together with the solid bars) are the confidence intervals. The circles/triangles/squares are the mid-points of the solid bars.

For each value of the sensitivity parameter Γ0\Gamma_{0}, we estimated the partially identified point estimate intervals and obtained 100(1α)%100(1-\alpha)\% confidence intervals using percentile bootstrap with B=1000B=1000. Table \arabictable and Figure \arabicfigure represent the sensitivity analysis results under the risk-ratio-based framework. We can observe that in the case of the pairwise ATE between ”pre-primary or none” and ”higher secondary or beyond” level of education (τ1,4\tau_{1,4}), the confidence intervals do not contain the null value of zero, suggesting a significant non-zero causal effect, for at least Γ0=2.75\Gamma_{0}=2.75. That is, the estimated pairwise ATE τ1,4\tau_{1,4} is significantly different from zero even in the presence of unmeasured confounders that cause the true (unobserved) GPSs to be 2.75 times higher than the observed GPSs. Similarly, the ATE between the ”primary” and ”higher secondary or beyond” level of education (τ2,4\tau_{2,4}) is statistically significant for at least Γ0=2.25\Gamma_{0}=2.25. However, the ATE between ”secondary” and ”higher secondary or beyond” education (τ3,4\tau_{3,4}) is insignificant for values of Γ0\Gamma_{0} as small as about 1.51.5.

Based on the conducted sensitivity analysis under our proposed framework, we can imply that the estimated pairwise ATE between ”pre-primary or none” vs ”higher secondary or beyond” education (τ1,4\tau_{1,4}) and ”primary” vs ”higher secondary or beyond” education (τ2,4\tau_{2,4}) are less sensitive to the presence of unmeasured confounding in that very strong unmeasured confounders are needed to invalid the causal estimates of τ1,4\tau_{1,4} and τ2,4\tau_{2,4}. This can be perceived as a substantial evidence of significant causal effect of maternal education on the fertility of women in Bangladesh.

\arabicsection Conclusion

Sensitivity analysis is crucial for the accurate interpretation of causal conclusions drawn from observational studies. In this paper, we proposed a risk-ratio-based modified marginal sensitivity model by extending the odds-ratio-based sensitivity model of Zhao et al. (2019). We further extended our proposed sensitivity model to observational studies with multivalued treatments. As the sensitivity parameter in our proposed framework quantifies the degree of unmeasured confounding using risk ratios of the observed and true (generalized) propensity scores, it is easier and more intuitive to interpret the sensitivity analysis results under the proposed framework compared to that under odds-ratio-based frameworks. We illustrated that estimation of the inverse probability weighting (IPW) causal effect estimators under our proposed framework can be computed efficiently using a percentile bootstrap approach. We also conducted simulation studies that suggest that our proposed sensitivity analysis framework performs well when there is an adequate overlap among the treatment groups. However, the performance deteriorates due to the instability of the inverse probability weights when there is a lack of overlap. Finally, we demonstrated sensitivity analysis under the proposed framework using an empirical study, where we have estimated the causal effect of maternal education on the female fertility in Bangladesh.

There are a number of further potential research directions. We are currently working on incorporating other smooth causal effect estimators, such as the doubly-robust augmented IPW (AIPW) estimators (Robins et al., 1994) and generalized overlap weighting (GOW) estimators (Li et al., 2019) into our proposed framework. Furthermore, we intend to work on calibrating the the sensitivity parameter in our framework to the observed confounders. Zhang & Small (2020) have recently worked on such calibration of the sensitivity model of Gastwirth et al. (1998) in matched observational studies.

References

  • Basit et al. (2023) Basit, M. A., Latif, M. A. & Wahed, A. S. (2023). Sensitivity analysis for causal effects in observational studies with multivalued treatments. arXiv preprint arXiv:2308.15986 .
  • BBS & UNICEF (2019) BBS & UNICEF (2019). Bangladesh multiple indicator cluster survey 2019, progotir pathey.
  • Bind & Rubin (2021) Bind, M.-A. C. & Rubin, D. (2021). The importance of having a conceptual stage when reporting non-randomized studies. Biostatistics & Epidemiology 5, 9–18.
  • Bonvini et al. (2022) Bonvini, M., Kennedy, E., Ventura, V. & Wasserman, L. (2022). Sensitivity analysis for marginal structural models. arXiv preprint arXiv:2210.04681 .
  • Carnegie et al. (2016) Carnegie, N. B., Harada, M. & Hill, J. L. (2016). Assessing sensitivity to unmeasured confounding using a simulated potential confounder. Journal of Research on Educational Effectiveness 9, 395–420.
  • Charnes & Cooper (1962) Charnes, A. & Cooper, W. W. (1962). Programming with linear fractional functionals. Naval Research Logistics Quarterly 9, 181–186.
  • Cornfield et al. (1959) Cornfield, J., Haenszel, W., Hammond, E. C., Lilienfeld, A. M., Shimkin, M. B. & Wynder, E. L. (1959). Smoking and lung cancer: recent evidence and a discussion of some questions. Journal of the National Cancer Institute 22, 173–203.
  • Dorn et al. (2021) Dorn, J., Guo, K. & Kallus, N. (2021). Doubly-valid/doubly-sharp sensitivity analysis for causal inference with unmeasured confounding. arXiv preprint arXiv:2112.11449 .
  • Gastwirth et al. (1998) Gastwirth, J. L., Krieger, A. M. & Rosenbaum, P. R. (1998). Dual and simultaneous sensitivity analysis for matched pairs. Biometrika 85, 907–920.
  • Imbens (2000) Imbens, G. W. (2000). The role of the propensity score in estimating dose-response functions. Biometrika 87, 706–710.
  • Kallus et al. (2019) Kallus, N., Mao, X. & Zhou, A. (2019). Interval estimation of individual-level causal effects under unobserved confounding. In The 22nd international conference on artificial intelligence and statistics. PMLR.
  • Kang et al. (2007) Kang, J. D., Schafer, J. L. et al. (2007). Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science 22, 523–539.
  • Li et al. (2019) Li, F. et al. (2019). Propensity score weighting for causal inference with multiple treatments. The Annals of Applied Statistics 13, 2389–2415.
  • Robins (2002) Robins, J. M. (2002). Comment on ‘Covariance adjustment in randomized experiments and observational studies’. Statistical Science 17, 309–321.
  • Robins et al. (1994) Robins, J. M., Rotnitzky, A. & Zhao, L. P. (1994). Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association 89, 846–866.
  • Rosenbaum (1987) Rosenbaum, P. R. (1987). Sensitivity analysis for certain permutation inferences in matched observational studies. Biometrika 74, 13–26.
  • Rosenbaum (2002) Rosenbaum, P. R. (2002). Observational studies. Springer.
  • Rosenbaum et al. (2002) Rosenbaum, P. R. et al. (2002). Covariance adjustment in randomized experiments and observational studies. Statistical Science 17, 286–327.
  • Rubin (1978) Rubin, D. B. (1978). Bayesian inference for causal effects: The role of randomization. The Annals of Statistics 6, 34–58.
  • Tomal et al. (2022) Tomal, J. H., Khan, J. R. & Wahed, A. S. (2022). Weighted bayesian poisson regression for the number of children ever born per woman in bangladesh. Journal of Statistical Theory and Applications 21, 79–105.
  • VanderWeele & Ding (2017) VanderWeele, T. J. & Ding, P. (2017). Sensitivity analysis in observational research: introducing the e-value. Annals of internal medicine 167, 268–274.
  • Yadlowsky et al. (2022) Yadlowsky, S., Namkoong, H., Basu, S., Duchi, J. & Tian, L. (2022). Bounds on the conditional and average treatment effect with unobserved confounding factors. The Annals of Statistics 50, 2587–2615.
  • Zhang & Small (2020) Zhang, B. & Small, D. S. (2020). A calibrated sensitivity analysis for matched observational studies with application to the effect of second-hand smoke exposure on blood lead levels in children. Journal of the Royal Statistical Society Series C: Applied Statistics 69, 1285–1305.
  • Zhao et al. (2019) Zhao, Q., Small, D. S. & Bhattacharya, B. B. (2019). Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 81, 735–761.
\printhistory