Sensitivity Analysis of Causal Treatment Effect Estimation for Clustered Observational Data with Unmeasured Confounding
Abstract
Identifying causal treatment (or exposure) effects in observational studies requires the data to satisfy the unconfoundedness assumption which is not testable using the observed data. With sensitivity analysis, one can determine how the conclusions might change if assumptions are violated to a certain degree. In this paper, we propose a new technique for sensitivity analysis applicable to clusters observational data with a normally distributed or binary outcome. The proposed methods aim to assess the robustness of estimated treatment effects in a single study as well as in multiple studies, i.e., meta-analysis, against unmeasured confounders. Simulations with various underlying scenarios were conducted to assess the performance of our methods. Unlike other existing sensitivity analysis methods, our methods have no restrictive assumptions on the number of unmeasured confounders or on the relationship between measured and unmeasured confounders, and do not exclude possible interactions between measured confounders and the treatment. Our methods are easy to implement using standard statistical software packages. Sensitivity analysis; Unmeasured confounding; Bias factor; Clustered observational data; Conditional average treatment effect; Mixed model
1 Introduction
When attempting to estimate the causal effect of an experimental treatment, a risk exposure, or a new policy, researchers often prefer the approach of randomized experimental design. Through randomization, patients who are assigned to different treatment arms are comparable in baseline factors thus potential biases can be avoided. However, when it is not feasible to conduct randomized experiments because of ethical, financial, or other concerns, observational studies are brought into play to estimate the casual treatment effect. To identify the average treatment effects (ATE) and the conditional average treatment effects (CATE) from observational data, further assumptions must be made. The no unmeasured confounder assumption, which is also referred to as the exchangeability assumption, is one of the assumptions needed to identify ATE and CATE from observational data which asserts that the confounding mechanism is determined by observed covariates only. One must address the concern of bias caused by the violation of the exchangeability assumption. The bias introduced by unmeasured treatment-outcome confounding will threaten the usefulness of the statistical inferences that rely on this assumption. For example, Willett (1990) claimed dietary carotenoids reduce the risk of lung cancer using observational data, while a randomized clinical trial conducted later (The Alpha-Tocopherol Beta Carotene Cancer Prevention Study Group, 1994) showed that the group that received carotene has a higher incidence of lung cancer than those that did not receive carotene. The contradictory conclusions are most likely because that the observational study was unable to adjust for unmeasured confounders. For example, people taking dietary carotenoids tend to be healthier due to social and behavioral factors that were beneficial to them. However, adjusting for measured confounders only was not enough to adjust for the difference of health condition between groups taking and not taking carotenoids. Similar contradictory conclusions have been seen in observational and randomized studies analyzing the relationship between antioxidant vitamins and cardiovascular disease (Stampfer et al., 1993, Gaetano and Collaborative Group of the Primary Prevention Project, 2001, Lawlor et al., 2004), which could also be results of violation of the exchangeability assumption.
To deal with unmeasured confounding, researchers propose to use instrumental variable analysis with an exogenous variable, i.e., an instrument that is not directly related to the outcome yet can affect the choice of treatments. This method uses an approximated randomization to form comparison groups of similar characteristics including both measured and unmeasured confounders.
The difference-in-differences (DID) approach can be used for estimating ATE in the presence of unobserved confounding when pre- and post-treatment outcome measurements are available. This approach requires that the association between the unobserved confounder and the outcome is equal in the two treatment arms and that it is a constant over time (Sofer et al., 2016).
Another way to account for unmeasured confounder is through the use of a proxy variable. Tchetgen Tchetgen et al. (2020) introduced negative controls which are auxiliary variables not causally associated with the treatment or outcome, and a generalization of negative controls, namely the proximal causal learning framework, that offers an opportunity to learn causal effects in settings which the assumption of no unmeasured confounding fails.
Among all methods dealing with unmeasured confounders in observational studies, the most common approach is using sensitivity analysis to assess the robustness of causal effect estimation and inference against unmeasured confounders, i.e., how much the estimation will be affected by unmeasured confounders. Sensitivity analysis methods for various models and data structures have been proposed in Lin et al. (1998), Rosenbaum and Rubin (1983), Imbens (2003), Marcus (1997), and Robins et al. (2000). These methods are applicable under additional restrictions, for examples, only for a single binary type of unmeasured confounder (Rosenbaum and Rubin, 1983); or assuming conditional independence that requires the measured and unmeasured confounders to be independent conditional on the treatment (Lin et al., 1998). VanderWeele and Ding (2017) and Ding and VanderWeele (2017) proposed a sensitivity method to estimate the E-value for binary outcome, which is the minimum risk ratio a unmeasured confounder must have with both treatment and outcome to explain away the confounded treatment effect. Mathur and VanderWeele (2019b) extended the idea by developing a sensitivity analysis method that is applicable to multiple studies (meta-analysis). To estimate the ATE on a continuous outcome, VanderWeele and Ding (2017) approximated the difference in the mean continuous outcomes between two treatment arms to a risk ratio then applied the E-value on this approximated risk ratio under additional assumptions.
In this study, we propose a general sensitivity analysis technique for normally distributed and binary outcomes without making distributional assumptions for the measured and unmeasured confounders. We successfully incorporated data from clustered observational designs (e.g., hierarchical or longitudinal) without additional restrictions on the number of unmeasured confounders or the interaction effects between treatment and measured confounders. We also extended the idea and developed a method for multiple studies (meta-analysis).
The rest of the paper is organized as follows. Section 2 introduces the concept of our proposed methods, including cases for single and multiple studies. Section 3 shows the interpretation of our methods. Section 4 provides the assessment of our proposed methods under several simulated scenarios. We conclude our findings and provide final comments in Section 5.
2 Methods
2.1 Single Study - Continuous Outcome
Consider a binary treatment variable with value 1 for the treated and for the control, and a continuous outcome variable . Denote X as a set of measured confounders, and U as a set of unmeasured confounders. Without loss of generality, we assume one measured confounder and one unmeasured confounder. Let (or ) be the counterfactual or potential outcome, which is the outcome that would have been observed if individual had been given treatment . Denote as the observed outcome that corresponds to the treatment value actually experienced.
Individual causal effects are defined as the difference between potential outcomes. Because only one of those outcomes is observed for each individual, individual effects cannot be identified. The average treatment effects (ATE) and conditional average treatment effects (CATE) were proposed and can be estimated from the observed data when additional assumptions hold. The causal ATE of on is the difference between the expectation of the potential outcomes in the population while the conditional causal ATE is the ATE in any subpopulation.
To estimate the ATE or CATE, the following causal assumptions are necessary. The stable unit treatment value assumption (SUTVA) indicates that treatment assignment of one subject does not affect the outcome of another subject. The consistency assumption requires that the potential outcome under treatment , , is equal to the observed outcome if treatment is actually received, i.e., The positivity assumption states that treatment assignment is not deterministic for every set of values of and , i.e., The exchangeability assumption requires that given and , treatment assignment is independent of the potential outcomes, i.e., for all .
Under these assumptions, ATE and CATE can be estimated from the data collected from an observational study:
In clustered observational study, a common approach to estimate is to use a generalized mixed effects model or a generalized linear model with GEE, where all potential confounders are included. We will introduce our sensitivity analysis method based on generalized mixed effects model, where random effects will be used to account for within-cluster correlation. Let and be the indicators for the level-1 (e.g., subjects nested in clusters) and the level-2 unit (e.g., clusters), respectively. Consider a continuous outcome with the marginal mean model:
and the corresponding mixed model:
(1) |
where and are unknown regression parameters; and are the random intercepts and the level-1 error terms, respectively, where and with unknown variances and .
Our goal is to estimate the CATE, denoted as , with the form
Since is unmeasured, we can only fit the following confounded model:
(2) |
where are unknown regression parameters; and are random intercepts and level-1 error terms, respectively, where and with unknown variances and . There is potential violation of assuming a constant variance of random effect and error term, but our methods are robust against this potential violation based on the simulation results.
Denote confounded treatment effect as:
We refer and as the true and confounded coefficients, respectively. The element in cannot be estimated directly using the observed data while can. Therefore, our goal is to establish the relationship between and to infer .
Let be the cumulative distribution function of given and . The conditional expectation of given and based on model (LABEL:1) becomes:
(3) |
where and .
The conditional expectation of based on the confounded model described in equation (2) has the form:
(4) |
Comparing equations (3) and (4), we have the following relationship:
(5) |
where , , and . Note that is the CATE conditional on and is the confounded treatment effect conditional on . Equation (5) specifies the relationship between causal CATE and confounded treatment effect. Define as the bias factor, which is a function of the effect of unmeasured confounders and the difference of the mean unmeasured confounders between the treated and control groups given . Let , and be the sensitivity parameters.
2.2 Sensitivity Analysis for Single Study with Continuous Outcome
Denote the estimated confounded treatment effect as (95% CI, to ), where and are the lower and upper bounds of its 95% confidence interval. Following equation (5), the estimated conditional average treatment effects is (95% CI, to ). We say an unmeasured confounder can explain away the estimated confounded treatment effect if the unmeasured confounder is able to shift the confidence interval of the estimated confounded treatment effect to include the null value (no treatment effect). If confidence interval of estimated confounded treatment effect includes the null value, then no confounding is needed to explain away the results or move confidence interval to include the null value.
Once the sensitivity parameters are specified, we can calculate bias factor and the estimated CATE. However, in practice, we do not know the strengths and distributions of unmeasured confounders, thus, the sensitivity parameters are usually unknown. What we can do is to calculate how the estimate is affected under different values of the bias factor. Among all different bias factors that are able to explain away the confounded treatment effect, minimal bias factor is defined as the smallest value of bias factors that is able to explain away the confounded treatment effect, meaning any bias factor larger than the minimal bias factor can also explain away the confounded treatment effect. The robustness of the estimated confounded treatment effect can be measured by the minimal bias factor. A small value of the minimal bias factor indicates the estimated treatment effect is not robust to unmeasured confounders.
Let and be apparently positive effect and apparently negative effect, respectively. In apparently positive effect case, the confounded treatment effect will be explained away when the lower bound of confidence interval of estimated confounded treatment effect is smaller than 0, i.e., . It means any bias factor, , such that , can explain away the confounded treatment effect. Denote as the minimal bias factor that can explain away the confounded treatment effect. Thus, when , the minimal bias factor that can explain away the confounded treatment effect is , i.e., .
In the apparently negative case, define , thus, estimated CATE can be expressed as (95% CI, to ). The confounded treatment effect will be explained away when . It means any bias factor, , such that , can explain away the confounded treatment effect. Denote as the minimal bias factor that can explain away the confounded treatment effect. Thus, when , the minimal bias factor that can explain away the confounded treatment effect is , i.e., .
As shown above, bias factor is a function of the effects of unmeasured confounders, and the difference of mean of unmeasured confounders between treated and control groups given , such that and . It means that, when , a set of , , and , such that , can explain away estimated confounded treatment effect. When , a set of , , and , such that , can explain away estimated confounded treatment effect. We recommend to provide a table or contour plot which present different values of , , , and bias factor to give researchers a sense of what are combinations of sensitivity parameters that can explain away the confounded treatment effect and therefore a sense of how sensitive the confounded treatment effect is to unmeasured confounders.
2.3 Single Study - Binary Outcome
For a binary outcome, risk difference (RD) and relative risk (RR) can be used to explore causal treatment effect. In this article, RR will be used to explore the causal relationship. With same notations as in the case of continuous outcome, except that denotes a binary variable now, when same assumptions hold:
When the event is rare in both treatment and control group, it can be shown that odds ratio is approximately equal to the relative risk. A logistic regression model is widely used to estimate odds ratio. Let the response variable be related to , , and through following generalized linear mixed model:
(6) |
where and are unknown regression parameters, and .
When is small, it can be shown (proof is provided in Appendix A)
Denote CATE on the log-RR scale as . Thus, when the event is rare and is small, has the form:
Since is unmeasured, we can only fit the following confounded model:
(7) |
where and are unknown regression parameters. Similarly, when is small, it can be shown that for the confounded model:
(8) |
Denote confounded treatment effect on the log-RR scale as which has the form:
Following the confounded model described in equation (8), we have:
(10) |
When , , and are small, comparing equations (9) and (10), we have the following relationship:
(11) |
where , , and .
When is normally distributed such that , i.e., the expectation of is dependent on both and , then if , , and are small, we have (proof is provided in Appendix C):
(12) |
When , , , and are small, comparing equations (10) and (12), we have the following relationship:
(13) |
where , , and .
Intraclass correlation coefficient (ICC) is the proportion of total variance of the outcome that is attributable to the between-subject variation, i.e., how much variation of the outcome can be explained by the random effects. For a logistic model, It means when , ICC, and confounded ICC are small, equations (11) and (13) hold.
With equations (11) and (13), we can calculate the minimal bias factor that can explain away the confounded treatment effect by moving the confidence interval to include the null value. Similarly, if a small minimal bias factor is able to explain away the confounded treatment effect, the confounded treatment effect is not robust to unmeasured confounders.
2.4 Multiple Studies
When multiple observational studies is involved where only study-level estimates not individual-level data can be merged, meta-analysis is the most commonly used analysis technique to estimate the population covariate effects which combines estimates from all studies and takes study heterogeneous into consideration. In this section, we introduce our proposed method to assess how sensitive the estimated population covariate effects from meta-analysis are to the unmeasured confounders.
Suppose we have studies which are random samples from a population of interest. For study , let be the continuous response variable, which is related to treatment assignment , measured confounders , and unmeasured confounder through the following random intercepts model:
where and are the indicators for the level-1 (e.g.,subjects in clusters) and the level-2 unit (e.g., clusters), respectively; and ’s and are the regression parameters adjusting for both measured and unmeasured confounders. Similarly, when causal assumptions hold, the model can be used to estimate the causal ATE. Denote as the CATE for study given , such that .
Since is unmeasured, we can only fit the following confounded model for each of the studies:
where ’s are confounded regression parameters only adjusting for measured confounders. Denote as the confounded treatment effect for study given , such that .
In a standard setup of parametric random effects meta-analysis (Borenstein et al., 2007), it assumes that each of the studies measures a potentially unique confounded effect size , such that , for . Let be the point estimate of the treatment effect and be the known within-study variance for study . In random effects meta-analysis setup, is determined by the true confounded effect size plus the known within-study variance , while is determined by the grand mean, and the between-study variance, . More generally, for any observed effect ,
where and . Fixed effects meta-analysis can be viewed as a special case where .
The relationship between the CATE and confounded treatment effects in equation (5) still hold for each of the studies as we illustrated in the single study case,
where and , and . It means that we allow the bias factor to vary across studies. Assume that is independent of . is independent of but not because studies with a larger bias factor may observe a larger confounded treatment effect. Thus, the distribution of the CATE can be written as , where
2.5 Sensitivity Analysis for Multiple Studies
Although we use continuous outcome as an example to demonstrate the method in multiple studies, similar derivation can be easily extended to the binary outcome case by replacing the error distribution and the link function in the generalized linear mixed model for binary outcomes. Given equations (11) and (13) hold for each of the studies, we can similarly derive the distribution of CATE under the case of binary outcomes.
Following the newly designed metrics for meta-analyses of heterogeneous effects in Mathur and VanderWeele (2019a) and Mathur and VanderWeele (2019b), we define that there is a treatment effect if the probability of observing a CATE larger than a scientifically meaningful effect size , is larger than a threshold . This means that a bias factor will explain away the confounded treatment effect in meta-analysis if the bias factor can reduce the probability of observing a conditional average treatment effects larger than to less than . Note, for continuous outcome, is on the difference of the continuous outcomes scale, while for binary outcome, is on the log-RR scale.
Denote and as the estimators of and . Let and be the apparently positive effect and apparently negative effect, respectively, in meta-analysis.
For the apparently positive effect, we define the probability of observing a CATE larger than as where . can be expressed using , , , and :
(14) |
where is the cumulative function of standard normal distribution. and such that will explain away the estimated confounded treatment effect in meta-analysis. When , a constant bias factor among studies, i.e., , will lead to an upper bound of , i.e.,
It means a constant bias factor, such that where
(15) |
and any other non-constant bias factors with mean larger or equal to can explain away the confounded treatment effect in meta-analysis. Thus, can be viewed as the minimal common constant bias factor that reduce the probability of observing a CATE larger than a scientifically meaningful effect size , to a threshold .
For the apparently negative case, define ; therefore and where and . We define the probability of observing a CATE larger than as . can be expressed using , , , and :
Similarly, when , a constant bias factor among studies will lead to an upper bound of . Let be the minimal common constant bias factor that can reduce the probability of observing a conditional average treatment effects larger than a scientifically meaningful effect size , to a threshold such that
(16) |
Consider fixed , , and therefore a fixed bias factor across studies, meaning the unmeasured confounder has the same distribution and effect across all studies. In the apparently positive effect case, this means that if each of the studies suffers from unmeasured confounder with the same strength such that , the confounded treatment effect will be explained away. In the apparently negative effect case, it means that if in each of the studies, the confounded treatment effect will be explained away.
Consider variant , , and therefore a variant bias factor across studies. In apparently positive effect case, any random bias factor with mean equal or larger than can explain away the confounded treatment effect. In apparently negative effect case, any random bias factor with mean equal or larger than can explain away the confounded treatment effect.
Similar to the case of the single study, the robustness of the estimated confounded treatment effect can be measured by the minimal common constant bias factor. A small minimal common constant bias factor indicates that the estimated confounded treatment effect is not robust to unmeasured confounders.
Table 1 summarizes how to calculate the minimum (common constant) bias factor for a single study and for multiple studies (meta-analysis) under different scenarios. Once or is calculated, one can determine the minimum effect of unmeasured confounders on response and difference between conditional mean of unmeasured confounders, that are sufficient to explain away the confounded treatment effect.
3 Interpretations
3.1 Single study
We take a prospective clustered study, which focuses on identifying potential causal effect of breastfeeding on children’s intelligence quotient (IQ), as an example to explain how to apply our method to assess the robustness of results from an observational study. Luby et al. (2016) examined relationship between breastfeeding and children’s IQ. After adjusting for children’s age, sex, history of depression, caregiver’s highest level of education, and familial income-to-needs ratio, the authors found that the IQ of breastfed child was (95% CI: to ) higher than the IQ of non-breastfed child. The investigators did not control for maternal smoking during pregnancy while maternal smoking may be a confounder as it could associate with less breastfeeding as well as lower offspring IQ. We are worried this result is confounded by maternal smoking status, so we conduct a sensitivity analysis to assess how robust the estimate is to the unmeasured confounding: maternal smoking status.
As the study indicated an apparently positive effect of breastfeeding on children’s IQ, the minimal bias factor, , that is able to explain away the confounded effect equals the lower bound of confidence interval, i.e., , which means that any bias factor larger than can also explain away the confounded effect. The minimum bias factor can be decomposed into the product of two factors: the effect of maternal smoking on children’s IQ adjusting for other covariates and the difference between and . Our methods allow investigators to make statements of the following form: the confounded effect of breastfeeding on children’s IQ could be explained away if the difference between the prevalence of maternal smoking in breastfed children and non-breastfed children is larger than and the effect of smoking on IQ is greater than 3. There are also other decompositions that could explain away the confounded effect of breastfeeding on children’s IQ. A contour plot depicted in Figure 1 can be used to visualize the relationship between bias factors, difference between and (i.e., ), and the effect of unmeasured confounder . Any combination of and in the contour plot that corresponds to a bias factor equal or larger than are able to explain away the confounded effect. Using both the information of the minimal bias factor and different combinations of sensitivity parameters that are able to explain away the confounded treatment effect, investigators can get a sense of how sensitive the conclusions are to potential unmeasured confounders.
3.2 Multiple studies
Applications of our methods for meta-analysis and single study are similar. Investigators can calculate the minimal common constant bias factor based on equations (15) or (16) by plugging in , , , and . If each of the studies suffers from an unmeasured confounder with the same distribution and effect on outcome that lead to a bias factor equal or greater than the minimal common constant bias factor, the confounded treatment effect from the meta-analysis can be explained away. Further, any non-fixed unmeasured confounder across studies with expectation equal or larger than minimal common constant bias factor can also explain away the confounded effect. Similar to single study, investigators can use our method to assess whether a set of sensitivity parameters can explain away the confounded effect and get a sense of how sensitive the conclusions are to potential unmeasured confounders.
We take a meta-analysis, which focuses on identifying potential causal effect of body mass index (BMI) in midlife on dementia, as an example to explain how to apply our method to assess the robustness of results from meta-analysis. Albanese et al. (2017) examined relationship between BMI in midlife and dementia by combining results, on relative risk (RR) scale, from 19 longitudinal studies on 589,649 participants (2,040 incident dementia cases) followed up for up to 42 years. The authors found that midlife (age 35 to 65 years) obesity (BMI 30) was associated with larger risk of dementia in late life (RR, 1.33; 95% CI, 1.08 to 1.63). The study reported ( is on log RR scale), while the study level summary measures reported from Albanese et al. (2017) can be used to obtain . We estimated via methods illustrated in Borenstein et al. (2007). Note that the choice of and should be adapted based on the context and specific research questions, which means that different analysis can have difference choice on and . In this example, let us consider a RR of 1.2 as a scientifically meaningful effect size, meaning , and consider having at least 40% chance to observe an effect size above for results to be of scientific interest, meaning .
As the meta-analysis indicated an apparently positive effect of BMI on dementia, we have the minimal common constant bias factor, which can reduce the probability of observing a conditional average treatment effects larger than a scientifically meaningful effect size by a threshold , such that
It means that any constant bias factor across studies such that and any other non-constant bias factors across studies with mean larger or equal to can explain away the confounded treatment effect in meta-analysis. The minimum common constant bias factor can be decomposed into the product of two factors: the effect of unmeasured confounder on dementia and the difference between the mean of unmeasured confounder in different BMI groups. Our methods allow investigators to make statement of the following form: the confounded effect of BMI on dementia could be explained away if an unmeasured confounder exists such that its mean difference between BMI groups is larger than 0.17 and its effect on dementia is greater than 1 in each of the studies. We can similarly refer to the contour plot in Figure 1 to visualize combinations of sensitivity parameters that corresponds to a bias factor equal or larger than 0.17 to explain away the confounded effect. Using both the information of the minimal common constant bias factor and different combination of sensitivity parameters that are able to explain away the confounded treatment effect, investigators can get a sense of how sensitive the conclusions are to potential unmeasured confounders.
4 Simulation studies
We conducted simulation studies to assess the performance of our proposed methods; specifically, we examined the bias and the precision of the estimator in equation (5) for a single study setting with continuous outcome, the estimator in equation (13) for a single study setting with binary outcome and continuous unmeasured confounder, and estimator in equation (14) for the meta-analysis settings.
4.1 Single Study - Continuous Outcome
First, we assessed the performance of the estimator of , . We assumed that , , and are dependent, which is a frequently encountered condition in observational studies.
Let the number of subjects be and the number of repeated measures be per subject. We simulated a normally distributed unmeasured confounder , one measured binary confounder if and if , and treatment assignment if and if . We simulated random intercept and the level-1 residuals from the distributions and , respectively. We proceeded to simulate the response variable using the underlying true model described in (LABEL:1) setting and while varying the values of , , and . can be calculated after we estimated the confounded coefficient and in (2) using R package lme4 (Bates et al., 2015). We simulated 1,000 such studies and inferred the estimated conditional average treatment effects described in equation (5) under and . We repeated the simulations with different values of , , , , , and .
To measure the performance of our methods in estimating conditional causal treatment effect, we assessed the bias, standard error (SE) and coverage probability (CP) of 95% confidence intervals of . Results in Table 2 show that the estimated conditional average treatment effects, , is unbiased and the standard error is relatively small. The coverage of 95% confidence interval is nominal. The accuracy of the point estimate appears to be independent of .
4.2 Single Study - Binary Outcome
Then, we assessed the performance of the estimator of in equation (13) for a single study setting with binary outcome and continuous unmeasured confounder. Assuming 200 subjects and the 4 repeated measures per subject, , , and were simulated similarly as for continuous outcome. We simulated random intercept and the level-1 residuals from the distributions and , respectively. We proceeded to simulate the response variable using the underlying true model described in (6) setting , , , and while varying the value of .
As we have shown, equation (13) holds when , , true ICC, and confounded ICC are small, thus, in order to fully understand the relationship between bias of , , , and true ICC, we will estimate bias of under a set of different , , and true ICC. Similarly, can be calculated after we estimated the confounded coefficient and . We simulated 1,000 such studies. To measure the performance of our methods in estimating conditional causal treatment effect, we assessed the bias, standard error (SE) and coverage probability (CP) of 95% confidence intervals of . Results in Table 3 show that when , , and true ICC is in range of simulated scenarios, our proposed estimators is unbiased and precise. There is no monotone relationship between accuracy of and , , or true ICC, as the approximation that allows equation (13) to hold also depends on the confounded ICC which is related with all parameters.
4.3 Multiple Studies
For multiple studies, we assessed the performance of the estimator of , . Let be the number of studies, be the subjects in each study and be the number of repeated measurements for each subject. Let , , and . For each study, we simulate to allow for between study variation in sample size. We simulated and from a bivariate normally distribution with , , , , and , and simulated . Settings used in single study with continuous outcome were used to simulate , , , , , and in each of the studies. Then we estimated the effect sizes in each of the studies and then fit the random effects meta-analysis using R package meta (Schwarzer et al., 2015). We repeated the process 1,000 times.
Results in Table 4 show that our proposed estimator is unbiased and precise, as the SE are relatively small. The coverage of 95% confidence interval is nominal. The proposed estimator performed well for all simulated conditions across various settings on the number of studies, subjects, and repeated measurements per subject.
5 Discussion
In this study, we propose new sensitivity analysis methods to assess robustness of the treatment effect estimated from observational data against violation of the assumption of no unmeasured confounding. We developed methods for clustered outcomes (e.g., repeated measures per subject, patients nested within a clinic) of a single study and of multiple studies (meta-analysis) separately. Our methods enable researchers to estimate the minimal effect of the unmeasured confounder sufficient to explain away the confounded treatment effect, when applied after data analysis using the standard mixed effects modeling approach. If only an unmeasured confounder with strong effect can explain away the confounded effect, it means the observational study is robust to unmeasured confounding. The major advantage of our methods is to avoid unrealistic assumptions, in addition to efficient computation and uncomplicated implementation.
Unlike previously developed methods, our methods can be easily extended to deal with multiple unmeasured confounders. Taking continuous outcome case as an example, equations (LABEL:1) to (5) can be modified by including all unmeasured confounders, , and the bias factor, , can be derived similarly such that , where is the effect of on outcome and for , if we assume that unmeasured confounders are independent with each other. Our methods can be easily extended to deal with multiple treatments. To find the relationship between the CATE and confounded treatment effects in a study involving treatments, we need to obtain for . The bias factor for each treatment effect will have the same form as that for two treatments.
For multilevel data, for example, patients are nested within hospitals so unmeasured confounders could be at the patient or at the hospital level. In our derivations, we considered that the source of unmeasured confounders is at the patient level, so we incorporated in the model. To assess the impact of unmeasured confounders at the hospital level, we may replace with in the model and the same subsequent derivation follows.
The minimal bias factor that can explain away the confounded result in a meta-analysis involves parameters and representing a scientifically meaningful effect size and the probability of observing a study that have an effect size larger than , respectively. The choice of and should be adapted based on the context and specific research questions. For parameter , our recommendation is to choose any value of less than 0.5 in order to keep equations (15) and (16) valid. Selection of should base on the research goals and the associated background knowledge. For example, if the goal of a cohort study is to test whether there is a treatment effect in a new physical therapy, the value of could be set to the treatment effect of the current protocol physical therapy.
While the focus of our proposed methods is on moving the confidence interval to include the null, we care not only about the statistical uncertainty of the estimate but also about the point estimate itself in practice. For this reason, researchers can also derive the bias factor that is able to move the point estimate to the null or even to reverse direction of the confounded effect. It is also possible to derive the minimal bias factor that can move a near null confounded treatment effect to a clinically meaningful level if it is the interest. No matter what target is set, a similar sensitivity analysis can be conducted once the (minimal) bias factor is calculated.
In practice, there could be violation against the assumption that the random effect and error term have constant variance in the confounded model, i.e., violation against the assumption that the variance of random effect and error term are independent of measured confounders and treatment. Given different relationship between unmeasured confounder, measured confounders, treatment and outcome, there could be different degree of violation. In our simulated scenarios, the data is generated in the way that this assumption is violated. The accurate and precise results of our simulation indicates that our methods are robust against the violation of the assumption that the random effect and error term have constant variance in the confounded model.
Depending on the context, the same value of minimal bias factor could carry different interpretations. We suggest that for all observational studies whose goal is to investigate causal relationships, to report the minimal bias factor needed to explain away both the point estimate and confidence interval of the observed effect, and then interpret minimal bias factor and assess the robustness of the study to unmeasured confounding in its own context.
Reference
References
- Albanese et al. (2017) Albanese, E., Launer, L. J., Egger, M., Prince, M. J., Giannakopoulos, P., Wolters, F. J. and Egan, K. (2017). Body mass index in midlife and dementia: Systematic review and meta-regression analysis of 589,649 men and women followed in longitudinal studies. Alzheimers Dement (Amst) 20(8), 165–178.
- Bates et al. (2015) Bates, D, Mächler, M, Bolker, B and Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software 67(1), 1–48.
- Borenstein et al. (2007) Borenstein, Michael, Hedges, Larry V., Higgins, Julian P.T. and Rothstein, Hannah R. (2007). Introduction to Meta-Analysis, 1th edition. United Kingdom: Wiley.
- Ding and VanderWeele (2017) Ding, Peng and VanderWeele, Tyler. (2017). Sensitivity analysis without assumptions. Epidemiology 27, 368–377.
- Gaetano and Collaborative Group of the Primary Prevention Project (2001) Gaetano, G. D. and Collaborative Group of the Primary Prevention Project. (2001). Low-dose aspirin and vitamin e in people at cardiovascular risk: a randomised trial in general practice. Lancet 357, 89–95.
- Imbens (2003) Imbens, G. W. (2003). Sensitivity to exogeneity assumptions in program evaluation. American Economic Review 93 (2), 126–132.
- Lawlor et al. (2004) Lawlor, D. A., Davey, S. G., Kundu, D., Bruckdorfer, K. R. and Ebrahim, S. (2004). Those confounded vitamins: what can we learn from the differences between observational versus randomised trial evidence? Lancet 363, 1724–1727.
- Lin et al. (1998) Lin, D. Y., Psaty, B. M. and Kronmal, R. A. (1998). Assessing the sensitivity of regression results to unmeasured confounders in observational studies. Biometrics 3, 948–963.
- Luby et al. (2016) Luby, J., Belden, A., Whalen, D., Harms, M. and Barch, D. (2016). Breastfeeding and childhood iq - the mediating role of gray matter volume. Journal of the American Academy of Child and Adolescent Psychiatry 55, 67–75.
- Marcus (1997) Marcus, S. M. (1997). Using omitted variable bias to assess uncertainty in the estimation of an aids education treatment effect. Journal of Educational and Behavioral Statistics 22, 193–201.
- Mathur and VanderWeele (2019a) Mathur, M. B. and VanderWeele, T. J. (2019a). New metrics for meta-analyses of heterogeneous effects. Statistics in Medicine 38(1), 1336–1342.
- Mathur and VanderWeele (2019b) Mathur, Maya B. and VanderWeele, Tyler J. (2019b). Sensitivity analysis for unmeasured confounding in meta-analyses. Journal of the American Statistical Association 115, 163–172.
- Robins et al. (2000) Robins, J. M., Rotnitzky, A. and Scharfstein, D. O. (2000). Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. Statistical models in epidemiology, the environment, and clinical trials, 1–94.
- Rosenbaum and Rubin (1983) Rosenbaum, PR and Rubin, DB. (1983). Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. Journal of the Royal Statistical Society 45.2, 212–218.
- Schwarzer et al. (2015) Schwarzer, G., Carpenter, J. R. and Rucker, G. (2015). Meta-Analysis with R, 1st edition. Springer.
- Sofer et al. (2016) Sofer, T., Richardson, D. B., Colicino, E., Schwartz, J. and Tchetgen Tchetgen, E. J. (2016). On negative outcome control of unobserved confounding as a generalization of difference-in-differences. Statistical Science 31(3), 348–361.
- Stampfer et al. (1993) Stampfer, M. J., Hennekens, C. H., Manson, J. E., Colditz, G. A., Rosner, B. and Willett, W. C. (1993). itamin e consumption and the risk of coronary disease in women. Lancet 328, 1444–1449.
- Tchetgen Tchetgen et al. (2020) Tchetgen Tchetgen, EJ, Ying, A, Cui, Y, Shi, X and Miao, W. (2020). An introduction to proximal causal learning. arXiv.
- The Alpha-Tocopherol Beta Carotene Cancer Prevention Study Group (1994) The Alpha-Tocopherol Beta Carotene Cancer Prevention Study Group. (1994). The effect of vitamin e and beta carotene on the incidence of lung cancer and other cancers in male smokers. New England Journal of Medicine 330, 1029–1035.
- VanderWeele and Ding (2017) VanderWeele, Tyler and Ding, Peng. (2017). Sensitivity analysis in observational research: Introducing the e-value. Annals of internal medicine 167.4, 268–274.
- Willett (1990) Willett, W. C. (1990). Vitamin a and lung cancer. Nutrition Reviews 48(5), 201–211.
Appendix A Binary Outcome
We establish a result that for binary outcome when is small, we have
Denote , where . Thus, where . If , it means
Thus, when
Therefore, when
Appendix B Binary Outcome and Binary Unmeasured Confounder
We establish a result that for binary outcome with presence of binary unmeasured confounder in the study, when both and are small, we have
Denote , where . Thus, .
(17) |
where .
(18) |
Appendix C Binary Outcome and Continuous Unmeasured Confounder
We establish a result that for binary outcome with presence of continuous unmeasured confounder in the study, when both , , and are small, we have
Denote , as we assume that , thus , where and . We have,
When is small,

Single study | Multiple studies (meta-analysis) | |
---|---|---|
Apparently positive treatment effect | ||
Apparently negative treatment effect |
: minimum (common constant) bias factor in study with apparently positive treatment effect; : minimum (common constant) bias factor in study with apparently negative treatment effect; : lower bound of confidence interval; : upper bound of confidence interval; : estimated grand mean of confounded treatment effect in meta analysis; : estimated between study variance of confounded treatment effect in meta analysis; : a scientifically meaningful effect size; : threshold of probability of observing a conditional average treatment effects larger than a scientifically meaningful effect size for results to be of scientific interest
Bias | SE | CP | Bias | SE | CP | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
0.018 | 0.281 | 0.948 | 0.009 | 0.285 | 0.956 | |||||||
0.03 | 0.157 | 0.952 | 0.017 | 0.214 | 0.951 | |||||||
0.006 | 0.109 | 0.951 | 0.002 | 0.115 | 0.948 | |||||||
0.004 | 0.203 | 0.94 | 0.002 | 0.199 | 0.96 | |||||||
0.01 | 0.195 | 0.952 | 0.009 | 0.212 | 0.954 | |||||||
0.001 | 0.159 | 0.954 | 0.005 | 0.210 | 0.959 | |||||||
0.002 | 0.155 | 0.969 | 0.020 | 0.199 | 0.967 |
: size of level-1 unit (e.g., clusters); : size of level-2 unit (e.g., subjects nested in clusters); : regression parameters; : effect of unmeasured confounder on outcome; : variance of unmeasured confounder
ICC | Bias | SE | CP | Bias | SE | CP | |||
---|---|---|---|---|---|---|---|---|---|
-0.5 | 0.15 | 0.25 | 0.001 | 0.987 | 0.983 | 0.025 | 0.289 | 0.960 | |
-0.5 | 0.15 | 1 | 0.047 | 0.707 | 0.981 | 0.031 | 0.278 | 0.957 | |
-0.5 | 0.15 | 2.25 | 0.034 | 0.670 | 0.968 | 0.048 | 0.376 | 0.959 | |
-0.5 | 0.25 | 0.25 | 0.028 | 0.637 | 0.974 | 0.019 | 0.301 | 0.952 | |
-0.5 | 0.25 | 1 | 0.039 | 0.659 | 0.972 | 0.034 | 0.292 | 0.953 | |
-0.5 | 0.25 | 2.25 | 0.033 | 0.588 | 0.967 | 0.033 | 0.287 | 0.967 | |
-0.5 | 0.35 | 0.25 | 0.045 | 0.586 | 0.963 | 0.008 | 0.321 | 0.956 | |
-0.5 | 0.35 | 1 | 0.032 | 0.575 | 0.968 | 0.005 | 0.304 | 0.954 | |
-0.5 | 0.35 | 2.25 | 0.079 | 0.579 | 0.954 | 0.056 | 0.307 | 0.951 | |
0.5 | 0.25 | 0.25 | 0.012 | 0.649 | 0.969 | 0.004 | 0.302 | 0.953 | |
0.5 | 0.25 | 1 | 0.047 | 0.645 | 0.971 | 0.034 | 0.289 | 0.954 | |
0.5 | 0.25 | 2.25 | 0.057 | 0.602 | 0.968 | 0.038 | 0.275 | 0.958 |
: effect of unmeasured confounder on outcome; ICC: intraclass correlation coefficient; : variance of unmeasured confounder
X=0 | X=1 | ||||||||
---|---|---|---|---|---|---|---|---|---|
Bias | SE | CP | Bias | SE | CP | ||||
15 | 100 | 3 | 0.048 | 0.125 | 0.956 | 0.001 | 0.126 | 0.962 | |
15 | 100 | 5 | 0.094 | 0.124 | 0.937 | 0.006 | 0.126 | 0.973 | |
15 | 200 | 3 | 0.079 | 0.124 | 0.940 | 0.025 | 0.125 | 0.96 | |
15 | 200 | 5 | 0.05 | 0.125 | 0.964 | 0.017 | 0.124 | 0.966 | |
30 | 100 | 3 | 0.044 | 0.089 | 0.962 | 0.008 | 0.090 | 0.972 | |
30 | 100 | 5 | 0.056 | 0.089 | 0.941 | 0.031 | 0.090 | 0.972 | |
30 | 200 | 3 | 0.046 | 0.088 | 0.936 | 0.015 | 0.090 | 0.98 | |
30 | 200 | 5 | 0.009 | 0.090 | 0.979 | 0.030 | 0.091 | 0.969 | |
100 | 100 | 3 | 0.012 | 0.049 | 0.985 | 0.009 | 0.048 | 0.985 | |
100 | 100 | 5 | 0.002 | 0.048 | 0.987 | 0.017 | 0.049 | 0.981 | |
100 | 200 | 3 | 0.004 | 0.049 | 0.972 | 0.009 | 0.049 | 0.989 | |
100 | 200 | 5 | 0.003 | 0.048 | 0.989 | 0.005 | 0.048 | 0.989 |
: size of studies included in meta analysis; : average size of level-1 unit (e.g., clusters); : size of level-2 unit (e.g., subjects nested in clusters);